Skip to main content
Log in

Automatically Constructing a Fine-Grained Sentiment Lexicon for Sentiment Analysis

  • Published:
Cognitive Computation Aims and scope Submit manuscript

Abstract

Sentiment analysis is an important research area in natural language processing (NLP), and the performance of sentiment analysis models is largely influenced by the quality of sentiment lexicons. Existing sentiment lexicons contain only the sentiment information of words. In this paper, we propose an approach for automatically constructing a fine-grained sentiment lexicon that contains both emotion information and sentiment information to solve the problem that the emotion and sentiment of texts cannot be jointly analyzed. We design an emotion-sentiment transfer method and construct a fine-grained sentiment seed lexicon, and we then expand the sentiment seed lexicon by applying the graph dissemination method to the synonym set. Subsequently, we propose a multi-information fusion method based on neural network to expand the sentiment lexicon based on a corpus. Finally, we generate the Fine-Grained Sentiment Lexicon (FGSL), which contains 40,554 words. FGSL achieves F1 values of 61.97%, 69.58%, and 66.99% on three emotion datasets and 88.19%, 89.31%, and 86.88% on three sentiment datasets. Experimental results on multiple public benchmark datasets illustrate that FGSL achieves significantly better performance in both emotion analysis and sentiment analysis tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. http://nlp.stanford.edu/software/tagger.shtml

  2. http://nlp.cs.swarthmore.edu/semeval/tasks/task14/data.shtml

  3. http://knoesis.org/?q=projects/emotion

  4. http://saimacs.github.io

  5. http://www.cs.cornell.edu/people/pabo/movie-review-data

  6. http://ai.stanford.edu/ amaas//data/sentiment/

  7. https://www.cs.york.ac.uk/semeval-2013/

  8. http://www.csie.ntu.edu.tw/ cjlin/liblinear/

References

  1. Cambria E, Kumar A, Al-Ayyoub M, Howard N. Guest Editorial: explainable artificial intelligence for sentiment analysis. Elsevier; 2021.

  2. Liang B, Su H, Gui L, Cambria E, Xu R. Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks. Knowl-Based Syst. 2022;235:107643.

  3. Mohammad SM, Turney PD. Crowdsourcing a word-emotion association lexicon. Comput Intell. 2013;29(3):436–65.

    Article  MathSciNet  Google Scholar 

  4. Mohammad SM. Word affect intensities. arXiv preprint arXiv:1704.08798. 2017.

  5. Wilson T, Wiebe J, Hoffmann P. Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing. 2005. p. 347–54.

  6. Stone PJ, Dunphy DC, Smith MS. The general inquirer: a computer approach to content analysis. 1966.

  7. Hu M, Liu B. Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2004. p. 168–77.

  8. Bravo-Marquez F, Khanchandani A, Pfahringer B. Incremental word vectors for time-evolving sentiment lexicon induction. Cogn Comput. 2022;14(1):425–41.

    Article  Google Scholar 

  9. Sharma SS, Dutta G. Sentidraw: using star ratings of reviews to develop domain specific sentiment lexicon for polarity determination. Inf Process Manag. 2021;58(1):102412.

  10. Huang M, Xie H, Rao Y, Feng J, Wang FL. Sentiment strength detection with a context-dependent lexicon-based convolutional neural network. Inform Sci. 2020;520:389–99.

    Article  Google Scholar 

  11. Viegas F, Alvim MS, Canuto S, Rosa T, Gonçalves MA, Rocha L. Exploiting semantic relationships for unsupervised expansion of sentiment lexicons. Inf Syst. 2020;94:101606.

  12. Hutto C, Gilbert E. Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 8. 2014.

  13. De Bruyne L, Atanasova P, Augenstein I. Joint emotion label space modeling for affect lexica. Comput Speech Lang. 2022;71:101257.

  14. Bandhakavi A, Wiratunga N, Massie S. Emotion-aware polarity lexicons for twitter sentiment analysis. Expert Syst. 2021;38(7):12332.

  15. Yin F, Wang Y, Liu J, Lin L. The construction of sentiment lexicon based on context-dependent part-of-speech chunks for semantic disambiguation. IEEE Access. 2020;8:63359–67.

    Article  Google Scholar 

  16. Du M, Li X, Luo L. A training-optimization-based method for constructing domain-specific sentiment lexicon. Complexity. 2021;2021.

  17. Ekman P. An argument for basic emotions. Cognit Emot. 1992;6(3–4):169–200.

    Article  Google Scholar 

  18. James W. The emotions. 1922.

  19. Kilgarriff A. Wordnet: an electronic lexical database. JSTOR; 2000.

  20. Mohammad SM, Kiritchenko S. Using hashtags to capture fine emotion categories from tweets. Comput Intell. 2015;31(2):301–26.

    Article  MathSciNet  Google Scholar 

  21. Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781. 2013.

  22. Pennington J, Socher R, Manning CD. Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2014. p. 1532–43.

  23. Cilibrasi RL, Vitanyi PM. The Google similarity distance. IEEE Trans Knowl Data Eng. 2007;19(3):370–83.

    Article  Google Scholar 

  24. Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R. Indexing by latent semantic analysis. J Am Soc Inf Sci. 1990;41(6):391–407.

    Article  Google Scholar 

  25. Strapparava C, Mihalcea R. Semeval-2007 task 14: affective text. In: Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). 2007. p. 70–4.

  26. Wang W, Chen L, Thirunarayan K, Sheth AP. Harnessing twitter “big data” for automatic emotion identification. In: 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing. IEEE; 2012. p. 587–92.

  27. Bandhakavi A, Wiratunga N, Massie S, Deepak P. Emotion-corpus guided lexicons for sentiment analysis on twitter. In: International Conference on Innovative Techniques and Applications of Artificial Intelligence. Springer; 2016. p. 71–85.

  28. Aman S, Szpakowicz S. Identifying expressions of emotion in text. In: International Conference on Text, Speech and Dialogue. Springer; 2007. p. 196–205.

  29. Pang B, Lee L. Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. arXiv preprint cs/0506075. 2005.

  30. Potts C. On the negativity of negation. In: Semantics and Linguistic Theory, vol. 20. 2010. p. 636–59.

  31. Nakov P, Kozareva Z, Ritter A, Rosenthal S, Stoyanov V, Wilson T. Semeval-2013 task 2: sentiment analysis in twitter. 2013.

  32. Staiano J, Guerini M. Depechemood: a lexicon for emotion analysis from crowd-annotated news. arXiv preprint arXiv:1405.1605. 2014.

  33. Badaro G, Jundi H, Hajj H, El-Hajj W. Emowordnet: automatic expansion of emotion lexicon using English wordnet. In: Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics. 2018. p. 86–93.

  34. Wang L, Xia R. Sentiment lexicon construction with representation learning based on hierarchical sentiment supervision. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2017. p. 502–10.

  35. Tang D, Wei F, Qin B, Zhou M, Liu T. Building large-scale twitter-specific sentiment lexicon: a representation learning approach. In: Proceedings of Coling 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 2014. p. 172–82.

  36. Vo DT, Zhang Y. Don’t count, predict! an automatic approach to learning sentiment lexicons for short text. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 2. 2016. p. 219–24.

  37. Mohammad SM, Kiritchenko S, Zhu X. NRC-Canada: building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:1308.6242. 2013.

  38. Suttles J, Ide N. Distant supervision for emotion classification with discrete binary values. In: International Conference on Intelligent Text Processing and Computational Linguistics. Springer; 2013. p. 121–36.

  39. Thelwall M, Buckley K, Paltoglou G. Sentiment strength detection for the social web. J Am Soc Inf Sci Technol. 2012;63(1):163–73.

    Article  Google Scholar 

Download references

Funding

This work is supported by the National Natural Science Foundation of China (No. 62066009), and the Key Research and Development Project of Guilin (No. 2020010308).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guimin Huang.

Ethics declarations

Ethics Approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed Consent

Informed consent was not required as no humans or animals were involved.

Conflict of Interest

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Huang, G., Li, M. et al. Automatically Constructing a Fine-Grained Sentiment Lexicon for Sentiment Analysis. Cogn Comput 15, 254–271 (2023). https://doi.org/10.1007/s12559-022-10043-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12559-022-10043-1

Keywords

Navigation