Abstract
Sentiment Analysis (SA) using machine learning approaches is a fascinating field of research to unravel the opinion expressed by online users. It deals with the classification problem of opinions expressed on a product or service in review texts as positive or negative or neutral. Classification beyond binary classes poses even greater challenges using machine learning. In this paper two sets of approaches, first set of approach on traditional machine learning and second one on deep learning technique was and evaluated and analyzed for multiclass polarity classification at fine-grain level on cross-domain review dataset. The analysis of performance of traditional machine learning approaches that comprise ensemble models and incorporate semantic and diverse statistical features, shows that ensemble models are better in their category. With the proposed sentiment-document model coupled with the robustness of the proposed ensemble models, we were able to investigate and establish that the document polarity could serve as the latent mutual information that could leverage the model performance of predicting sentiments at sentence and document levels. Furthermore, the evaluation of deep learning models infers that they are more viable option for multi-class polarity classification of cross-domain dataset. They outperformed the traditional approaches based on feature extraction and feature engineering and achieved better accuracy and F1 scores.
Similar content being viewed by others
Data Availability
Authors declare that all the data being used in the design and production cum layout of the manuscript is declared in the manuscript.
Notes
References
Aggarwal CC, Zhai C (2012) Mining text data (Springer Science Business Media, 2012)
Al-Moslmi T, Omar N, Abdullah S, Albared M (2017) Approaches to cross-domain sentiment analysis: a systematic literature review. IEEE access 5:16173
Aue A, Gamon M (2005) .. In: Proceedings of recent advances in natural language processing (RANLP), vol 1 (Citeseer, 2005), pp 2–1
Baccianella S, Esuli A, Sebastiani F (2010) .. In: LREC, vol 10, pp 2200–2204
Benamara F, Chardon B, Mathieu Y, Popescu V, Asher N (2012) .. In: Proceedings of the workshop on extra-propositional aspects of meaning in computational linguistics (Association for computational linguistics, 2010), pp 10–18
Benamara F, Chardon B, Mathieu YY, Popescu V et al (2011). In: IJCNLP, pp 1180–1188
Boiy E, Moens MF (2009) A machine learning approach to sentiment analysis in multilingual web texts. Inf Retrieval 12(5):526
Church KW, Hanks P (1990) Word association norms, mutual information, and lexicography. Comput Linguist 16(1):22
De Marneffe MC, MacCartney B, Manning CD et al (2006) .. In: Proceedings of LREC, vol 6 (Genoa Italy, 2006), vol 6, pp 449–454
Ding X, Liu B, Yu PS (2008) .. In: Proceedings of the 2008 international conference on web search and data mining (ACM, 2008), pp 231–240
Esuli A, Sebastiani F (2006) .. In: EACL, vol 6, p 2006
Hoang M, Bihorac OA, Rouces J (2019) .. In: NEAL proceedings of the 22nd nordic conference on computional linguistics (NoDaLiDa), September 30-October 2, Turku, Finland (Linköping University Electronic Press, 2019), vol 167, pp 187–196
Hung C, Chen SJ (2016) Word sense disambiguation based sentiment lexicons for sentiment classification. Knowl-Based Syst 110:224
Indurkhya N, Damerau FJ (2010) Handbook of natural language processing, vol 2 (CRC Press, 2010)
Ji-won J, Jee-Eun K (2017) Sentiment analysis utilizing modal expressions. J British American Studies 39:179
Kaji N, Kitsuregawa M (2007) .. In: EMNLP-CoNLL (2007), pp 1075–1083
Koppel M, Schler J (2006) The importance of neutral examples for learning sentiment. Comput Intell 22(2):100
Kotzias D, Denil M, De Freitas N, Smyth P (2015) .. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining (ACM, 2015), pp 597–606
Ku LW, Lee CY, Chen HH (2009) Identification of opinion holders. Comput Linguist Chinese Language Process:383
Li X, Bing L, Zhang W, Lam W (2019) Exploiting bert for end-to-end aspect-based sentiment analysis, arXiv:1910.00883
Li T, Sindhwani V, Ding C, Zhang Y (2009) .. In: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pp 716–717
Liu B (2007) Web data mining: exploring hyperlinks, contents and usage data (Springer Science & Business Media
Liu B (2012) Sentiment analysis and opinion mining. Synthesis Lectures Human Language Technol 5(1):1
Liu Y, Yu X, Chen Z, Liu B (2013) Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing (ACM, 2013), pp 39–44
Liu Y, Yu X, Liu B, Chen Z (2014) .. In: Proceedings of the 15th international conference on computational linguistics and intelligent text processing-vol 8404 (Springer New York, Inc., 2014), pp 1–16
Martineau J, Finin T (2009) Delta tfidf: an improved feature space for sentiment analysis. Icwsm 9:106
McDonald R, Hannan K, Neylon T, Wells M, Reynar J (2007) .. In: Proceedings of the 45th annual meeting of the association of computational linguistics, pp 432–439
Mihalcea R (2011) .. In: Encyclopedia of machine learning (Springer, 2011), pp 1027–1030
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) .. In: Advances in neural information processing systems, pp 3111–3119
Narayanan R, Liu B, Choudhary A (2009) .. In: Proceedings of the 2009 conference on empirical methods in natural language processing: vol 1. (Association for computational linguistics, 2009), pp 180–189
Paltoglou G, Thelwall M (2010) .. In: Proceedings of the 48th annual meeting of the association for computational linguistics (Association for computational linguistics, 2010), pp 1386–1395
Pang B, Lee L (2004) .. In: Proceedings of the 42nd annual meeting on association for computational linguistics (Association for computational linguistics, 2004), p 271
Pang B, Lee L (2004) A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-04), pages 271–278, Barcelona, Spain
Pang B, Lee L, et al. (2008) Opinion mining and sentiment analysis. Found Trends®;, Inf Retrieval 2(1–2):1
Paramesha K, Gururaj H, Jena OP (2021) Applications of machine learning in biomedical text processing and food industry. Mach Learn Healthcare Appl:151–167
Paramesha K, Ravishankar KC (2016) Analysis of opinionated text for opinion mining. Mach Learn Appl: Int J (MLAIJ) 3(2):65
Polpinij J, Ghose AK (2008) .. In: Proceedings of the 2008 IEEE/WIC/ACM international conference on web intelligence and intelligent agent technology-vol 01 (IEEE computer society, 2008), pp 518–524
Poria S, Cambria E, Gelbukh A (2016) Aspect extraction for opinion mining with a deep convolutional neural network. Knowl-Based Syst 108:42
Severyn A, Moschitti A (2015) .. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp 959–962
Stevenson M, Wilks Y (2003) Word sense disambiguation. The Oxford Handbook of Comp. Linguistics pp 249–265
Sumanth C, Inkpen D (2015) .. In: Proceedings of the 6th workshop on computational approaches to subjectivity, sentiment and social media analysis, pp 115–121
Sun C, Huang L, Qiu X (2019) Utilizing bert for aspect-based sentiment analysis via constructing auxiliary sentence, arXiv:1903.09588
Täckström O, McDonald R (2011) .. In: Advances in information retrieval (Springer, 2011), pp 368–374
Turney PD (2002) .. In: Proceedings of the 40th annual meeting on association for computational linguistics (Association for computational linguistics, 2002), pp 417–424
Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decision Support Syst 57:77
Whitelaw C, Garg N, Argamon S (2005) .. In: Proceedings of the 14th ACM international conference on Information and knowledge management (ACM, 2005), pp 625–631
Wiebe J, Mihalcea R (2006) .. In: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the association for computational linguistics (Association for computational linguistics, 2006), pp 1065–1072
Wiegand M, Balahur A, Roth B, Klakow D, Montoyo A (2010) .. In: Proceedings of the workshop on negation and speculation in natural language processing (Association for computational linguistics, 2010), pp 60–68
Wilson T, Wiebe J, Hoffmann P (2009) Recognizing contextual polarity: an exploration of features for phrase-level sentiment analysis. Comput Linguist 35(3):399
Xu H, Liu B, Shu L, Yu PS (2019) Bert post-training for review reading comprehension and aspect-based sentiment analysis, arXiv:1904.02232
Yadav A, Vishwakarma DK (2020) Sentiment analysis using deep learning architectures: a review. Artif Intell Rev 53(6):4335
Yarowsky D (2010) .. In: Handbook of natural language processing, 2nd edn. (Chapman and Hall/CRC, 2010), pp 315–338
Zhang L, Wang S, Liu B (2018) Deep learning for sentiment analysis: a survey. Wiley Interdisciplinary Rev: Data Mining Knowl Discover 8(4):e1253
Zhang W, Yoshida T, Tang X (2011) A comparative study of tf* idf, lsi and multi-words for text classification. Expert Syst Appl 38(3):2758
Funding
The authors received no specific funding for this study.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no conflicts of interest to report regarding the present study.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Paramesha, K., Gururaj, H.L., Nayyar, A. et al. Sentiment analysis on cross-domain textual data using classical and deep learning approaches. Multimed Tools Appl 82, 30759–30782 (2023). https://doi.org/10.1007/s11042-023-14427-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14427-9