Skip to main content
Log in

Sentiment analysis on cross-domain textual data using classical and deep learning approaches

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Sentiment Analysis (SA) using machine learning approaches is a fascinating field of research to unravel the opinion expressed by online users. It deals with the classification problem of opinions expressed on a product or service in review texts as positive or negative or neutral. Classification beyond binary classes poses even greater challenges using machine learning. In this paper two sets of approaches, first set of approach on traditional machine learning and second one on deep learning technique was and evaluated and analyzed for multiclass polarity classification at fine-grain level on cross-domain review dataset. The analysis of performance of traditional machine learning approaches that comprise ensemble models and incorporate semantic and diverse statistical features, shows that ensemble models are better in their category. With the proposed sentiment-document model coupled with the robustness of the proposed ensemble models, we were able to investigate and establish that the document polarity could serve as the latent mutual information that could leverage the model performance of predicting sentiments at sentence and document levels. Furthermore, the evaluation of deep learning models infers that they are more viable option for multi-class polarity classification of cross-domain dataset. They outperformed the traditional approaches based on feature extraction and feature engineering and achieved better accuracy and F1 scores.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Data Availability

Authors declare that all the data being used in the design and production cum layout of the manuscript is declared in the manuscript.

Notes

  1. https://huggingface.co/transformers/pretrained_models.html

  2. http://reviews.imdb.com/Reviews

  3. http://www.cs.cornell.edu/people/pabo/movie-review-data/

  4. https://archive.ics.uci.edu/ml/datasets/Sentiment+Labelled+Sentences

  5. https://nlp.stanford.edu/software/lex-parser.shtml

  6. http://www.idilia.com

  7. http://sentiwordnet.isti.cnr.it/

  8. http://www.lexicoder.com/download.html

  9. http://www.wjh.harvard.edu/~inquirer/spreadsheet_guide.htm

  10. http://sentic.net/downloads/

  11. https://www.cs.uic.edu/~liub/FBS/sentimentanalysis.html#lexicon

  12. http://mpqa.cs.pitt.edu/#subj_lexicon

  13. https://hlt-nlp.fbk.eu/technologies/sentiwords

  14. http://provalisresearch.com/products/contentanalysissoftware/wordstatdictionary/sentimentdictionaries/

  15. https://allennlp.org/elmo

  16. https://nlp.fast.ai/classification/2018/05/15/introducing-ulmfit.html

  17. https://openai.com/blog/better-language-models/

  18. https://nlp.stanford.edu/projects/glove/

  19. https://fasttext.cc/docs/en/english-vectors.html

References

  1. Aggarwal CC, Zhai C (2012) Mining text data (Springer Science Business Media, 2012)

  2. Al-Moslmi T, Omar N, Abdullah S, Albared M (2017) Approaches to cross-domain sentiment analysis: a systematic literature review. IEEE access 5:16173

    Article  Google Scholar 

  3. Aue A, Gamon M (2005) .. In: Proceedings of recent advances in natural language processing (RANLP), vol 1 (Citeseer, 2005), pp 2–1

  4. Baccianella S, Esuli A, Sebastiani F (2010) .. In: LREC, vol 10, pp 2200–2204

  5. Benamara F, Chardon B, Mathieu Y, Popescu V, Asher N (2012) .. In: Proceedings of the workshop on extra-propositional aspects of meaning in computational linguistics (Association for computational linguistics, 2010), pp 10–18

  6. Benamara F, Chardon B, Mathieu YY, Popescu V et al (2011). In: IJCNLP, pp 1180–1188

  7. Boiy E, Moens MF (2009) A machine learning approach to sentiment analysis in multilingual web texts. Inf Retrieval 12(5):526

    Article  Google Scholar 

  8. Church KW, Hanks P (1990) Word association norms, mutual information, and lexicography. Comput Linguist 16(1):22

    Google Scholar 

  9. De Marneffe MC, MacCartney B, Manning CD et al (2006) .. In: Proceedings of LREC, vol 6 (Genoa Italy, 2006), vol 6, pp 449–454

  10. Ding X, Liu B, Yu PS (2008) .. In: Proceedings of the 2008 international conference on web search and data mining (ACM, 2008), pp 231–240

  11. Esuli A, Sebastiani F (2006) .. In: EACL, vol 6, p 2006

  12. Hoang M, Bihorac OA, Rouces J (2019) .. In: NEAL proceedings of the 22nd nordic conference on computional linguistics (NoDaLiDa), September 30-October 2, Turku, Finland (Linköping University Electronic Press, 2019), vol 167, pp 187–196

  13. Hung C, Chen SJ (2016) Word sense disambiguation based sentiment lexicons for sentiment classification. Knowl-Based Syst 110:224

    Article  Google Scholar 

  14. Indurkhya N, Damerau FJ (2010) Handbook of natural language processing, vol 2 (CRC Press, 2010)

  15. Ji-won J, Jee-Eun K (2017) Sentiment analysis utilizing modal expressions. J British American Studies 39:179

    Google Scholar 

  16. Kaji N, Kitsuregawa M (2007) .. In: EMNLP-CoNLL (2007), pp 1075–1083

  17. Koppel M, Schler J (2006) The importance of neutral examples for learning sentiment. Comput Intell 22(2):100

    Article  MathSciNet  Google Scholar 

  18. Kotzias D, Denil M, De Freitas N, Smyth P (2015) .. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining (ACM, 2015), pp 597–606

  19. Ku LW, Lee CY, Chen HH (2009) Identification of opinion holders. Comput Linguist Chinese Language Process:383

  20. Li X, Bing L, Zhang W, Lam W (2019) Exploiting bert for end-to-end aspect-based sentiment analysis, arXiv:1910.00883

  21. Li T, Sindhwani V, Ding C, Zhang Y (2009) .. In: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pp 716–717

  22. Liu B (2007) Web data mining: exploring hyperlinks, contents and usage data (Springer Science & Business Media

  23. Liu B (2012) Sentiment analysis and opinion mining. Synthesis Lectures Human Language Technol 5(1):1

    Article  MathSciNet  Google Scholar 

  24. Liu Y, Yu X, Chen Z, Liu B (2013) Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing (ACM, 2013), pp 39–44

  25. Liu Y, Yu X, Liu B, Chen Z (2014) .. In: Proceedings of the 15th international conference on computational linguistics and intelligent text processing-vol 8404 (Springer New York, Inc., 2014), pp 1–16

  26. Martineau J, Finin T (2009) Delta tfidf: an improved feature space for sentiment analysis. Icwsm 9:106

    Google Scholar 

  27. McDonald R, Hannan K, Neylon T, Wells M, Reynar J (2007) .. In: Proceedings of the 45th annual meeting of the association of computational linguistics, pp 432–439

  28. Mihalcea R (2011) .. In: Encyclopedia of machine learning (Springer, 2011), pp 1027–1030

  29. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) .. In: Advances in neural information processing systems, pp 3111–3119

  30. Narayanan R, Liu B, Choudhary A (2009) .. In: Proceedings of the 2009 conference on empirical methods in natural language processing: vol 1. (Association for computational linguistics, 2009), pp 180–189

  31. Paltoglou G, Thelwall M (2010) .. In: Proceedings of the 48th annual meeting of the association for computational linguistics (Association for computational linguistics, 2010), pp 1386–1395

  32. Pang B, Lee L (2004) .. In: Proceedings of the 42nd annual meeting on association for computational linguistics (Association for computational linguistics, 2004), p 271

  33. Pang B, Lee L (2004) A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-04), pages 271–278, Barcelona, Spain

  34. Pang B, Lee L, et al. (2008) Opinion mining and sentiment analysis. Found Trends®;, Inf Retrieval 2(1–2):1

    Article  Google Scholar 

  35. Paramesha K, Gururaj H, Jena OP (2021) Applications of machine learning in biomedical text processing and food industry. Mach Learn Healthcare Appl:151–167

  36. Paramesha K, Ravishankar KC (2016) Analysis of opinionated text for opinion mining. Mach Learn Appl: Int J (MLAIJ) 3(2):65

    Google Scholar 

  37. Polpinij J, Ghose AK (2008) .. In: Proceedings of the 2008 IEEE/WIC/ACM international conference on web intelligence and intelligent agent technology-vol 01 (IEEE computer society, 2008), pp 518–524

  38. Poria S, Cambria E, Gelbukh A (2016) Aspect extraction for opinion mining with a deep convolutional neural network. Knowl-Based Syst 108:42

    Article  Google Scholar 

  39. Severyn A, Moschitti A (2015) .. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp 959–962

  40. Stevenson M, Wilks Y (2003) Word sense disambiguation. The Oxford Handbook of Comp. Linguistics pp 249–265

  41. Sumanth C, Inkpen D (2015) .. In: Proceedings of the 6th workshop on computational approaches to subjectivity, sentiment and social media analysis, pp 115–121

  42. Sun C, Huang L, Qiu X (2019) Utilizing bert for aspect-based sentiment analysis via constructing auxiliary sentence, arXiv:1903.09588

  43. Täckström O, McDonald R (2011) .. In: Advances in information retrieval (Springer, 2011), pp 368–374

  44. Turney PD (2002) .. In: Proceedings of the 40th annual meeting on association for computational linguistics (Association for computational linguistics, 2002), pp 417–424

  45. Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decision Support Syst 57:77

    Article  Google Scholar 

  46. Whitelaw C, Garg N, Argamon S (2005) .. In: Proceedings of the 14th ACM international conference on Information and knowledge management (ACM, 2005), pp 625–631

  47. Wiebe J, Mihalcea R (2006) .. In: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the association for computational linguistics (Association for computational linguistics, 2006), pp 1065–1072

  48. Wiegand M, Balahur A, Roth B, Klakow D, Montoyo A (2010) .. In: Proceedings of the workshop on negation and speculation in natural language processing (Association for computational linguistics, 2010), pp 60–68

  49. Wilson T, Wiebe J, Hoffmann P (2009) Recognizing contextual polarity: an exploration of features for phrase-level sentiment analysis. Comput Linguist 35(3):399

    Article  Google Scholar 

  50. Xu H, Liu B, Shu L, Yu PS (2019) Bert post-training for review reading comprehension and aspect-based sentiment analysis, arXiv:1904.02232

  51. Yadav A, Vishwakarma DK (2020) Sentiment analysis using deep learning architectures: a review. Artif Intell Rev 53(6):4335

    Article  Google Scholar 

  52. Yarowsky D (2010) .. In: Handbook of natural language processing, 2nd edn. (Chapman and Hall/CRC, 2010), pp 315–338

  53. Zhang L, Wang S, Liu B (2018) Deep learning for sentiment analysis: a survey. Wiley Interdisciplinary Rev: Data Mining Knowl Discover 8(4):e1253

    Google Scholar 

  54. Zhang W, Yoshida T, Tang X (2011) A comparative study of tf* idf, lsi and multi-words for text classification. Expert Syst Appl 38(3):2758

    Article  Google Scholar 

Download references

Funding

The authors received no specific funding for this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anand Nayyar.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflicts of interest to report regarding the present study.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Paramesha, K., Gururaj, H.L., Nayyar, A. et al. Sentiment analysis on cross-domain textual data using classical and deep learning approaches. Multimed Tools Appl 82, 30759–30782 (2023). https://doi.org/10.1007/s11042-023-14427-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-14427-9

Keywords

Navigation