SO-CAL Based Method for Chinese Sentiment Analysis
Lexicon-based classifiers cannot effectively calculate or classify the polarity and strength of sentiment words that are missing from the lexicon. To address this problem, the EM-SO algorithm, based on the expectation maximization (EM) model, is proposed for constructing and updating the sentiment lexicon. Negation and intensification components are designed on top of the semantic orientation calculator (SO-CAL) to capture the combined effects of appraisal words and their modifiers. Experiments on review sets show that the EM-SO algorithm and the designed components outperform SO-CAL in calculating the polarity and strength of sentiment words.
Keywords: Semantic orientation, Sentiment analysis, Negation, Intensification
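To make the idea of combining appraisal words with their modifiers concrete, the following is a minimal sketch in the general style of SO-CAL, not the components designed in this work. It assumes that intensifiers act as percentage modifiers on a word's semantic orientation and that negation shifts the value toward the opposite polarity by a fixed amount. The toy lexicon, the modifier weights, the shift value of 4, and the function name `score_tokens` are all illustrative assumptions, and the input is assumed to be already word-segmented.

```python
# Toy resources: every entry and weight below is an illustrative assumption,
# not a value taken from the paper or from a published lexicon.
LEXICON = {"好": 3.0, "差": -3.0, "喜欢": 2.0}          # appraisal words
INTENSIFIERS = {"很": 0.25, "非常": 0.5, "有点": -0.5}   # percentage modifiers
NEGATORS = {"不", "没有"}
NEGATION_SHIFT = 4.0  # assumed fixed shift toward the opposite polarity


def score_tokens(tokens):
    """Sum the modified semantic orientation of each appraisal word."""
    total = 0.0
    for i, tok in enumerate(tokens):
        if tok not in LEXICON:
            continue
        so = LEXICON[tok]
        factor = 1.0
        negated = False
        # Walk backwards over the modifiers directly preceding the word.
        j = i - 1
        while j >= 0 and (tokens[j] in INTENSIFIERS or tokens[j] in NEGATORS):
            if tokens[j] in INTENSIFIERS:
                factor *= 1.0 + INTENSIFIERS[tokens[j]]
            else:
                negated = not negated
            j -= 1
        # Apply intensification first, then the negation shift.
        so *= factor
        if negated:
            so = so - NEGATION_SHIFT if so > 0 else so + NEGATION_SHIFT
        total += so
    return total


print(score_tokens(["很", "好"]))        # 3.75
print(score_tokens(["不", "好"]))        # -1.0
print(score_tokens(["不", "很", "好"]))  # -0.25
```

Shifting rather than flipping the sign means that "不 很 好" ("not very good") comes out mildly negative instead of strongly negative, which is the usual motivation for shift-style negation.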