Construction of Enhanced Sentiment Sensitive Thesaurus for Cross Domain Sentiment Classification Using Wiktionary

  • P. Sanju
  • T. T. Mirnalinee
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 259)


Sentiment classification is classification of reviews into positive or negative depends on the sentiment words expressed in reviews. Automatic sentiment classification is necessary in various applications such as market analysis, opinion mining, contextual advertisement and opinion summarization. However, sentiments are expressed differently in different domain and annotating label for every domain of interest is expensive and time consuming. In cross domain sentiment classification, a sentiment classifier trained in source domain is applied to classify reviews of target domain, always produce low performance due to the occurrence of features mismatch between source domain and target domain. The proposed method develops solution to feature mismatch problem in cross domain sentiment classification by creating enhanced sentiment sensitive thesaurus using wiktionary. The enhanced sentiment sensitive thesaurus aligns different words in expressing the same sentiment not only from different domains of reviews and from wiktionary to increase the classification performance in target domain. In this paper, the proposed method describes the method of construction of enhanced sentiment sensitive thesaurus which will be useful for cross domain sentiment classification.


Cross domain sentiment classification Enhanced sentiment sensitive thesaurus Domain adaptation Data mining 


  1. 1.
    Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008)CrossRefGoogle Scholar
  2. 2.
    Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceeding of the 10th ACM SIG KDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177. New York, USA (2004)Google Scholar
  3. 3.
    Lu, Y., Zhai, C., Sundaresan, N.: Rated aspect summarization of short comments. In: The Proceedings of the 18th Conference on World Wide Web, pp. 131–140. ACM, New York (2009)Google Scholar
  4. 4.
    Fan, T.-K., Chang, C.H.: Sentiment-oriented contextual advertising. Knowl. Inf. Syst. 23(3), 321–344 (2010)CrossRefGoogle Scholar
  5. 5.
    Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? sentiment classification using machine learning techniques. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 79–86. Philadelphia, July 2002Google Scholar
  6. 6.
    Turney, P.D.: Humbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In: ACL‘02 Proceedings of the 40th Annual Meeting on association for Computational Linguistics, pp. 417–424 (2002)Google Scholar
  7. 7.
    Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: ACL 1997, pp. 174–181 (1997)Google Scholar
  8. 8.
    Joachims, T.: Text categorization with support vector machines: Learning with many relevant features. In: Proceedings 10th European Conference on Machine Learning (ECML), pp. 137–142. Springer Verlag, Berlin (1998)Google Scholar
  9. 9.
    Pan, S.J., Ni, X., Sun, J.T., Yan, Q., Chen, Z.: Cross-domain Sentiment classification via spectral feature alignment. In: WWW 2010, Raleigh, North Carolina, USA.ACM 978-1-60558-799-8/10/04, 26–30 April 2010Google Scholar
  10. 10.
    Bollegala, D., Weir, D., carroll, J.: Cross domain sentiment classification using a sentiment sensitive thesaurus. IEEE Trans. Knowl. Data Eng. 25(8) (2013)Google Scholar
  11. 11.
    Zesch, T., Muller, C., Gurevych, I.: Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary (2008)Google Scholar
  12. 12.
    Yu, H., Hatzivassiloglou, V.: Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences. In: EMNLP 2003, pp. 129–136 (2003)Google Scholar
  13. 13.
    Pantel, P., Ravichandran, D.: Automatically labeling semantic classes. In: NAACL-HLT‘04, pp. 321–328 (2004)Google Scholar
  14. 14.
    Breck, E., Choi, Y., Cardie, C.: Identifying expressions of opinion in context. In: IJCAI 2007 (2007)Google Scholar
  15. 15.
    Turney, P.: Similarity of semantic relations. Comput. Linguist. 32(3), 379–416 (2006)CrossRefMATHGoogle Scholar
  16. 16.
    Kennedy, A., Inkpen, D.: Sentiment classification of movie reviews using contextual valence shifters. Comput. Intell. 22, 110–125 (2006)CrossRefMathSciNetGoogle Scholar
  17. 17.
    Lin, C., He, Y., Everson, R., Ruger, S.: Weakly supervised joint sentiment- topic detection from text, J. Latex class files, 24 (2011)Google Scholar
  18. 18.
    Esuli, A., Sebastiani, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: LREC 2006. pp. 417–422 (2006)Google Scholar
  19. 19.
    Liu, C.L., Hsaio, W.H., Lee, C.H., Lu, G.C., jou, E.: Movie Rating and review summarization in mobile environment. In: IEEE 2012, vol. 42, pp. 397–407 (2012)Google Scholar
  20. 20.
    Khan, A., Baharudin, B.: Sentiment classification by sentence level semantic orientation using sentiwordnet from online reviews and Blogs. Int. J. Comput. Sci. Emerging Tech. 2(4) (2011)Google Scholar
  21. 21.
    Chesley, P., Bruce Vincent, B., Xu, L., Srihari, R.K.: Using verbs and adjectives to automatically classify blog sentiment. In: American Association for Artificial intelligence (2005)Google Scholar
  22. 22.
    Hung, C., Lin, H.: Using objective words in sentiwordnet to improve the sentiment classification word of mouth. In: IEEE Intelligent Systems (2013)Google Scholar
  23. 23.
    Wiebe, J.: Learning subjective adjectives from corpora. In: American Association for AI (2001)Google Scholar
  24. 24.
    Hatzivassiloglou, V., MckeownK.R.: Predicting the Semantic Orientation of Adjectives .pp. 176–182 (1995)Google Scholar
  25. 25.
    Muller, C., Gurevych, I.: Using Wikipedia and Wiktionary in Domain-Specific Information Retrieval (2008)Google Scholar
  26. 26.
    Blitzer, J., McDonald, R., Pereira, F.: Domain adaptation with structural correspondence learning. In: EMNLP 2006 (2006)Google Scholar
  27. 27.
    Yan, L., Zhang, Y.: News sentiment Analysis based on cross domain sentiment word lists and content classifiers. ADMA 2012.LNAI 7713, pp. 577–588. Springer-verlag, Berlin (2012)Google Scholar
  28. 28.
    Briscoe, T., Carroll, J., Watson, R.: The second release of the RASP system. In: COLING/ACL 2006 Interactive Presentation Sessions (2006)Google Scholar
  29. 29.
    Weeds, J., Weir, D.: Co-occurrence retrieval: a flexible framework for lexical distributional similarity. Comput. Linguist. 31(4), 439–475 (2006)CrossRefGoogle Scholar
  30. 30.
    Harris, Z.: Distributional structure. Word 10, 146–162 (1954)Google Scholar
  31. 31.
    Turney, P.: Similarity of semantic relations. Comput. Linguist. 32(3), 379–416 (2006)CrossRefMATHGoogle Scholar
  32. 32.
    Lin, D.: Automatic retrieval and clustering of similar words. In: ACL, pp. 768–774 (1998)Google Scholar
  33. 33.
    Hu, K., Zhang, Y., Hu, X.: A multi-domain adaptation for sentiment classification algorithm based on class distribution. In: IEEE International Conference on Granular Computing (2012)Google Scholar
  34. 34.
    Remus, R.: Domain adaptation using domain similarity- and domain complexity-based instance selection for cross-domain sentiment analysis. In: IEEE 12th International Conference on Data Mining Workshops (2012)Google Scholar

Copyright information

© Springer India 2014

Authors and Affiliations

  1. 1.University College of EngineeringAnna UniversityVillupuramIndia
  2. 2.SSN College of EngineeringAnna UniversityChennaiIndia

Personalised recommendations