Language-Independent Sentiment Analysis with Surrounding Context Extension
Expressing attitudes and opinions towards various entities (i.e. products, companies, people and events) has become pervasive with the recent proliferation of social media. Monitoring of what customers think is a key task for marketing research and opinion surveys, while measuring customers’ preferences or media monitoring have become a fundamental part of corporate activities. Most experiments on automated sentiment analysis focus on major languages (English, but also Chinese); minor or morphologically rich languages are addressed rather sparsely. Moreover, to improve the performance of machine-learning based classifiers, the models are often complemented with language-dependent components (i.e. sentiment lexicons). Such combined approaches provide a high level of accuracy but are limited to a single language or a single thematic domain.
This paper aims to contribute to this field and introduces an experiment utilizing a language– and domain– independent model for sentiment analysis. The model has been previously tested on multiple corpora, providing a trade-off between generality and the classification performance of the model. In this paper, we suggest a further extension of the model utilizing the surrounding context of the classified documents.
KeywordsSentiment analysis Cross-domain Cross-language Document surrounding context
- 3.Balahur, A.: Sentiment analysis in social media texts. In: WASSA 2013, p. 120 (2013)Google Scholar
- 5.Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86 (2002)Google Scholar
- 6.Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: Proceedings of the International Conference on Language Resources and Evaluation, LREC, pp. 1320-1326 (2010)Google Scholar
- 7.Tsarfaty, R., Seddah, D., Goldberg, Y., Kuebler, S., Candito, M., Foster, J., Versley, Y., Rehbein, I., Tounsi, L.: Statistical parsing of morphologically rich languages (SPMRL): what, how and whither. In: Proceedings of the First Workshop on Statistical Parsing of Morphologically-Rich Languages, NAACL HLT 2010, pp. 1–12. Association for Computational Linguistics (2010)Google Scholar
- 8.Kincl, T., Novák, M., Přibil, J.: Getting inside the minds of the customers: automated sentiment analysis. In: European Conference on Management Leadership and Governance ECMLG 2013, pp. 122–129. Alpen-Adria Universität Klagenfurt, Austria (2013)Google Scholar
- 10.Brychcín, T., Habernal, I.: Unsupervised improving of sentiment analysis using global target context. In: International Conference Recent Advances in Natural Language Processing (RANLP 2013) (2013)Google Scholar
- 13.Maas, A.L., Daly, R.E., Pham, P.T., Huang, D., Ng, A.Y., Potts, C.: Learning word vectors for sentiment analysis. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 142–150. Association for Computational Linguistics (2011)Google Scholar