Knowledge and Information Systems

, Volume 53, Issue 3, pp 579–636 | Cite as

A valences-totaling model for English sentiment classification

  • Vo Ngoc Phu
  • Vo Thi Ngoc Chau
  • Nguyen Duy Dat
  • Vo Thi Ngoc Tran
  • Tuan A. Nguyen
Survey Paper

Abstract

Sentiment classification plays an important role in everyday life, in political activities, activities of commodity production and commercial activities. Finding a time-effective and highly accurate solution to the classification of emotions is challenging. Today, there are many models (or methods) to classify the sentiment of documents. Sentiment classification has been studied for many years and is used widely in many different fields. We propose a new model, which is called the valences-totaling model (VTM), by using cosine measure (CM) to classify the sentiment of English documents. VTM is a new model for English sentiment classification. In this study, CM is a measure of similarity between two words and is used to calculate the valence (and polarity) of English semantic lexicons. We prove that CM is able to identify the sentiment valence and the sentiment polarity of the English sentiment lexicons online in combination with the Google search engine with AND operator and OR operator. VTM uses many English semantic lexicons. These English sentiment lexicons are calculated online and are based on the Internet. We present a full range of English sentences; thus, the emotion expressed in the English text is classified with more precision. Our new model is not dependent on a special domain and training data set—it is a domain-independent classifier. We test our new model on the Internet data in English. The calculated valence (and polarity) of English semantic words in this model is based on many documents on millions of English Web sites and English social networks.

Keywords

English document semantic classification Cosine measure Valences-totaling model 

Notes

Compliance with ethical standards

Conflict of interest

The author declares that there is no conflict of interest.

References

  1. 1.
    Large Movie Review Dataset (2016). http://ai.stanford.edu/~amaas/data/sentiment/
  2. 2.
    Efron M (2004) Cultural orientation: classifying subjective documents by cociation sic analysis. In: Proceedings of the AAAI fall symposium on style and meaning in language, art, music, and design, pp 41–48Google Scholar
  3. 3.
    Yuen RWM, Chan TYW, Lai Tom BY, Kwong OY, T’sou Benjamin KY (2004) Morpheme-based derivation of bipolar semantic orientation of Chinese words. In: Proceedings of the 20th international conference on computational linguistics. Stroudsburg, PA, USAGoogle Scholar
  4. 4.
    Chen L-S, Chiu H-J (2009) Developing a neural network based index for sentiment classification. In: Proceedings of the international multiconference of engineers and computer scientists. Hong Kong, MarchGoogle Scholar
  5. 5.
    Wang G, Araki K (2007) Modifying SO-PMI for Japanese weblog opinion mining by using a balancing factor and detecting neutral expressions. In: Proceedings of NAACL HLT 2007, Companion Volume, pp 189–192Google Scholar
  6. 6.
    Taboada M, Anthony C, Voll K (2006) Methods for creating semantic orientation dictionaries. In: Proceedings of fifth international conference on language resources and evaluation (LREC 2006). Genoa, Italy, pp 427–432Google Scholar
  7. 7.
    Cimiano P, Wenderoth J (2007) Automatic acquisition of ranked qualia structures from the web. In: Proceedings of the 45th annual meeting of the association of computational linguistics. Prague, Czech Republic, pp 888–895Google Scholar
  8. 8.
    Lu G, Huang P, He L, Cu C, Li X (2010) A new semantic similarity measuring method based on web search engines. J WSEAS Trans Comput 9(1):1–10Google Scholar
  9. 9.
    Voll K, Taboada M (2007) Not all words are created equal: extracting semantic orientation as a function of adjective relevance. In: Proceedings of the 20th Australian joint conference on artificial intelligence. Gold Coast, Australia, pp 337–346Google Scholar
  10. 10.
    Kundi FM, Khan A, Asghar MZ, Ahamd S (2015) Context-aware spelling corrector for sentiment analysis. MAGNT Res Rep 2(6):1–11Google Scholar
  11. 11.
    Mao H, Gao P, Wang Y, Bollen J (2014) Automatic construction of financial semantic orientation lexicon from large-scale Chinese news corpus. 7th financial risks international forumGoogle Scholar
  12. 12.
  13. 13.
    Lin D (1998) Automatic retrieval and clustering of similar words. In: Proceedings of the 17th international conference on computational linguistics, vol 2. Stroudsburg, PA, USA, pp 768–774Google Scholar
  14. 14.
    Turney PD, Littman ML (2002) unsupervised learning of semantic orientation from a hundred-billion-word corpus. Technical report NRC technical report ERB-1094. Institute for Information Technology, National Research Council CanadaGoogle Scholar
  15. 15.
    Lin WH, Wu YL, Yu LC (2012) Online computation of mutual information and word context entropy. Int J Future Comput Commun 1(2):167CrossRefGoogle Scholar
  16. 16.
    Omar N, Albared M, Al-Shabi AQ, Al-Moslmi T (2013) Ensemble of classification algorithms for subjectivity and sentiment analysis of Arabic customers’ reviews. Int J Adv Comput Technol (IJACT) 5:77Google Scholar
  17. 17.
    English Grammar of British Council (2015). https://learnenglish.britishcouncil.org/en/english-grammar
  18. 18.
    English Grammar of Wikipedia (2015). https://en.wikipedia.org/wiki/English_grammar
  19. 19.
    English Grammar of Cambridge (2015). http://www.cambridge.org/us/cambridgeenglish/
  20. 20.
  21. 21.
    Turney PD, Littman ML (2003) Measuring praise and criticism: inference of semantic orientation from association. ACM Trans Inf Syst (TOIS) 21(4):315–346CrossRefGoogle Scholar
  22. 22.
    Turney P (2002) Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of 40th ACL, pp 417–424Google Scholar
  23. 23.
    Saloot MA, Idris N, Mahmud R, Ja’afar S, Thorleuchter D, Gani A (2016) Hadith data mining and classification: a comparative analysis. Artif Intell Rev 1–16. doi: 10.1007/s10462-016-9458-x. Print ISSN 0269-2821
  24. 24.
    Ofoghi B, Mann M, Verspoor K (2016) Towards early discovery of salient health threats: a social media emotion classification technique. Pacific symposium on biocomputing, Hawaii, USGoogle Scholar
  25. 25.
    Zarra T, Chiheb R, Faizi R, El Afia A (2016) Using textual similarity and sentiment analysis in discussions forums to enhance learning. Int J Softw Eng Appl 10(1):191–200Google Scholar
  26. 26.
    Korayem M, Aljadda K, Crandall D (2016) Sentiment/subjectivity analysis survey for languages other than English. Soc Netw Anal Min 6:75. doi: 10.1007/s13278-016-0381-6
  27. 27.
    Pappas N, Popescu-Belisa A (2016) Adaptive sentiment-aware one-class collaborative filtering. Expert Syst Appl 43:23–41CrossRefGoogle Scholar
  28. 28.
    Fast E, Chen B, Bernstein M (2016) Empath: understanding topic signals in large-scale text. In: ACM conference on human factors in computing systemsGoogle Scholar
  29. 29.
    Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B (2014) Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceeding of the 52th annual meeting of the association for computational linguistics (ACL 2014)Google Scholar
  30. 30.
    Oswin Rahadiyan H, Gloria Virginia, Antonius Rachmat C (2016) Sentiment classification of film reviews using IB1. In: The 7th international conference on intelligent systems, modelling and simulation. doi: 10.1109/ISMS.2016.38
  31. 31.
    Manek AS, Shenoy PD, Mohan MC, Venugopal KR (2016) Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 1–20. doi: 10.1007/s11280-015-0381-x. Print ISSN1386-145X
  32. 32.
    Agarwal B, Mittal N (2016) Machine learning approach for sentiment analysis. Promin Feature Extr Sentim Anal 21–45. doi: 10.1007/978-3-319-25343-5_3. Print ISBN 978-3-319-25341-1
  33. 33.
    Agarwal B, Mittal N (2016) Semantic orientation-based approach for sentiment analysis. Promin Feature Extr Sentim Anal 77–88. doi: 10.1007/978-3-319-25343-5_6. Print ISBN 978-3-319-25341-1
  34. 34.
    Canuto S, Gonçalves MA, Benevenuto F (2016) Exploiting new sentiment-based meta-level features for effective sentiment analysis. In: Proceedings of the ninth ACM international conference on web search and data mining (WSDM ’16). New York, USA, pp 53–62Google Scholar
  35. 35.
    Ahmed S, Danti A (2016) Effective sentimental analysis and opinion mining of web reviews using rule based classifiers. Comput Intell Data Mining; 1:171–179. doi: 10.1007/978-81-322-2734-2_18. ISBN 978-81-322-2732-8
  36. 36.
    Phu VN, Tuoi PT (2014) Sentiment classification using enhanced contextual valence shifters. In: International conference on Asian language processing (IALP), pp 224–229Google Scholar
  37. 37.
    Tran VTN, Phu VN, Tuoi PT (2014) Learning more chi square feature selection to improve the fastest and most accurate sentiment classification. In: The third Asian conference on information systems (ACIS 2014)Google Scholar
  38. 38.
    Cambria E, Schuller B, Xia Y, White B (2016) New avenues in knowledge bases for natural language processing. Knowl-Based Syst 108(C):1–4CrossRefGoogle Scholar
  39. 39.
    Erik C (2016) Affective computing and sentiment analysis. IEEE Intell Syst 31(2):102–107Google Scholar

Copyright information

© Springer-Verlag London 2017

Authors and Affiliations

  1. 1.Institute of Research and DevelopmentDuy Tan UniversityDa NangVietnam
  2. 2.Computer Science and Engineering (CSE)Ho Chi Minh City University of Technology (HCMUT), Vietnam National UniversityHo Chi Minh CityVietnam
  3. 3.Faculty of Information TechnologyLy Tu Trong Technical CollegeHo Chi Minh CityVietnam
  4. 4.School of Industrial Management (SIM)Ho Chi Minh City University of Technology (HCMUT), Vietnam National UniversityHo Chi Minh CityVietnam
  5. 5.Faculty of Computer Networks and CommunicationsUniversity of Information Technology (UIT), Vietnam National University of Hochiminh CityHo Chi Minh CityVietnam

Personalised recommendations