A Framework for Sentiment Analysis in Turkish: Application to Polarity Detection of Movie Reviews in Turkish
In this work, we present a framework for unsupervised sentiment analysis in Turkish text documents. As part of our framework, we customize the SentiStrength sentiment analysis library by translating its lexicon to Turkish. We apply our framework to the problem of classifying the polarity of movie reviews. For performance evaluation, we use a large corpus of Turkish movie reviews obtained from a popular Turkish social media site. Although our framework is unsupervised, it is demonstrated to achieve a fairly good classification accuracy, approaching the performance of supervised polarity classification techniques.
- 2.Baccianella, A.E.S., Sebastiani, F.: Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings 7th Conference International, Language Resources and Evaluation (2010)Google Scholar
- 3.Erogul, U.: Sentiment analysis in turkish. Master’s thesis, Middle East Technical University (2009)Google Scholar
- 4.Ghorbel, H., Jacot, D.: Sentiment analysis of french movie reviews. In: Pallotta, V., Soro, A., Vargiu, E. (eds.) Advances in Distributed Agent-Based Retrieval Tools. Studies in Computational Intelligence, vol. 361, pp. 97–108. Springer, Berlin (2011)Google Scholar
- 7.Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? sentiment classification using machine learning techniques. In: Proceedings of ACL-02 Conference Empirical Methods in Natural Language Processing, pp. 79–86 (2002)Google Scholar
- 9.Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424 (2002)Google Scholar
- 10.Yi, J., Nasukawa, T., Bunescu, R., Niblack, W.: Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques. In: Proceedings of 3rd IEEE International Conference Data Mining, pp. 427–434 (2003)Google Scholar