A valences-totaling model for English sentiment classification
Abstract
Sentiment classification plays an important role in everyday life, in political activities, activities of commodity production and commercial activities. Finding a time-effective and highly accurate solution to the classification of emotions is challenging. Today, there are many models (or methods) to classify the sentiment of documents. Sentiment classification has been studied for many years and is used widely in many different fields. We propose a new model, which is called the valences-totaling model (VTM), by using cosine measure (CM) to classify the sentiment of English documents. VTM is a new model for English sentiment classification. In this study, CM is a measure of similarity between two words and is used to calculate the valence (and polarity) of English semantic lexicons. We prove that CM is able to identify the sentiment valence and the sentiment polarity of the English sentiment lexicons online in combination with the Google search engine with AND operator and OR operator. VTM uses many English semantic lexicons. These English sentiment lexicons are calculated online and are based on the Internet. We present a full range of English sentences; thus, the emotion expressed in the English text is classified with more precision. Our new model is not dependent on a special domain and training data set—it is a domain-independent classifier. We test our new model on the Internet data in English. The calculated valence (and polarity) of English semantic words in this model is based on many documents on millions of English Web sites and English social networks.
Keywords
English document semantic classification Cosine measure Valences-totaling modelNotes
Compliance with ethical standards
Conflict of interest
The author declares that there is no conflict of interest.
References
- 1.Large Movie Review Dataset (2016). http://ai.stanford.edu/~amaas/data/sentiment/
- 2.Efron M (2004) Cultural orientation: classifying subjective documents by cociation sic analysis. In: Proceedings of the AAAI fall symposium on style and meaning in language, art, music, and design, pp 41–48Google Scholar
- 3.Yuen RWM, Chan TYW, Lai Tom BY, Kwong OY, T’sou Benjamin KY (2004) Morpheme-based derivation of bipolar semantic orientation of Chinese words. In: Proceedings of the 20th international conference on computational linguistics. Stroudsburg, PA, USAGoogle Scholar
- 4.Chen L-S, Chiu H-J (2009) Developing a neural network based index for sentiment classification. In: Proceedings of the international multiconference of engineers and computer scientists. Hong Kong, MarchGoogle Scholar
- 5.Wang G, Araki K (2007) Modifying SO-PMI for Japanese weblog opinion mining by using a balancing factor and detecting neutral expressions. In: Proceedings of NAACL HLT 2007, Companion Volume, pp 189–192Google Scholar
- 6.Taboada M, Anthony C, Voll K (2006) Methods for creating semantic orientation dictionaries. In: Proceedings of fifth international conference on language resources and evaluation (LREC 2006). Genoa, Italy, pp 427–432Google Scholar
- 7.Cimiano P, Wenderoth J (2007) Automatic acquisition of ranked qualia structures from the web. In: Proceedings of the 45th annual meeting of the association of computational linguistics. Prague, Czech Republic, pp 888–895Google Scholar
- 8.Lu G, Huang P, He L, Cu C, Li X (2010) A new semantic similarity measuring method based on web search engines. J WSEAS Trans Comput 9(1):1–10Google Scholar
- 9.Voll K, Taboada M (2007) Not all words are created equal: extracting semantic orientation as a function of adjective relevance. In: Proceedings of the 20th Australian joint conference on artificial intelligence. Gold Coast, Australia, pp 337–346Google Scholar
- 10.Kundi FM, Khan A, Asghar MZ, Ahamd S (2015) Context-aware spelling corrector for sentiment analysis. MAGNT Res Rep 2(6):1–11Google Scholar
- 11.Mao H, Gao P, Wang Y, Bollen J (2014) Automatic construction of financial semantic orientation lexicon from large-scale Chinese news corpus. 7th financial risks international forumGoogle Scholar
- 12.Wikipedia (2016). https://en.wikipedia.org/wiki/
- 13.Lin D (1998) Automatic retrieval and clustering of similar words. In: Proceedings of the 17th international conference on computational linguistics, vol 2. Stroudsburg, PA, USA, pp 768–774Google Scholar
- 14.Turney PD, Littman ML (2002) unsupervised learning of semantic orientation from a hundred-billion-word corpus. Technical report NRC technical report ERB-1094. Institute for Information Technology, National Research Council CanadaGoogle Scholar
- 15.Lin WH, Wu YL, Yu LC (2012) Online computation of mutual information and word context entropy. Int J Future Comput Commun 1(2):167CrossRefGoogle Scholar
- 16.Omar N, Albared M, Al-Shabi AQ, Al-Moslmi T (2013) Ensemble of classification algorithms for subjectivity and sentiment analysis of Arabic customers’ reviews. Int J Adv Comput Technol (IJACT) 5:77Google Scholar
- 17.English Grammar of British Council (2015). https://learnenglish.britishcouncil.org/en/english-grammar
- 18.English Grammar of Wikipedia (2015). https://en.wikipedia.org/wiki/English_grammar
- 19.English Grammar of Cambridge (2015). http://www.cambridge.org/us/cambridgeenglish/
- 20.English Grammar of Oxford (2015). http://www.oxfordonlineenglish.com/free-english-grammar-lessons
- 21.Turney PD, Littman ML (2003) Measuring praise and criticism: inference of semantic orientation from association. ACM Trans Inf Syst (TOIS) 21(4):315–346CrossRefGoogle Scholar
- 22.Turney P (2002) Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of 40th ACL, pp 417–424Google Scholar
- 23.Saloot MA, Idris N, Mahmud R, Ja’afar S, Thorleuchter D, Gani A (2016) Hadith data mining and classification: a comparative analysis. Artif Intell Rev 1–16. doi: 10.1007/s10462-016-9458-x. Print ISSN 0269-2821
- 24.Ofoghi B, Mann M, Verspoor K (2016) Towards early discovery of salient health threats: a social media emotion classification technique. Pacific symposium on biocomputing, Hawaii, USGoogle Scholar
- 25.Zarra T, Chiheb R, Faizi R, El Afia A (2016) Using textual similarity and sentiment analysis in discussions forums to enhance learning. Int J Softw Eng Appl 10(1):191–200Google Scholar
- 26.Korayem M, Aljadda K, Crandall D (2016) Sentiment/subjectivity analysis survey for languages other than English. Soc Netw Anal Min 6:75. doi: 10.1007/s13278-016-0381-6
- 27.Pappas N, Popescu-Belisa A (2016) Adaptive sentiment-aware one-class collaborative filtering. Expert Syst Appl 43:23–41CrossRefGoogle Scholar
- 28.Fast E, Chen B, Bernstein M (2016) Empath: understanding topic signals in large-scale text. In: ACM conference on human factors in computing systemsGoogle Scholar
- 29.Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B (2014) Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceeding of the 52th annual meeting of the association for computational linguistics (ACL 2014)Google Scholar
- 30.Oswin Rahadiyan H, Gloria Virginia, Antonius Rachmat C (2016) Sentiment classification of film reviews using IB1. In: The 7th international conference on intelligent systems, modelling and simulation. doi: 10.1109/ISMS.2016.38
- 31.Manek AS, Shenoy PD, Mohan MC, Venugopal KR (2016) Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 1–20. doi: 10.1007/s11280-015-0381-x. Print ISSN1386-145X
- 32.Agarwal B, Mittal N (2016) Machine learning approach for sentiment analysis. Promin Feature Extr Sentim Anal 21–45. doi: 10.1007/978-3-319-25343-5_3. Print ISBN 978-3-319-25341-1
- 33.Agarwal B, Mittal N (2016) Semantic orientation-based approach for sentiment analysis. Promin Feature Extr Sentim Anal 77–88. doi: 10.1007/978-3-319-25343-5_6. Print ISBN 978-3-319-25341-1
- 34.Canuto S, Gonçalves MA, Benevenuto F (2016) Exploiting new sentiment-based meta-level features for effective sentiment analysis. In: Proceedings of the ninth ACM international conference on web search and data mining (WSDM ’16). New York, USA, pp 53–62Google Scholar
- 35.Ahmed S, Danti A (2016) Effective sentimental analysis and opinion mining of web reviews using rule based classifiers. Comput Intell Data Mining; 1:171–179. doi: 10.1007/978-81-322-2734-2_18. ISBN 978-81-322-2732-8
- 36.Phu VN, Tuoi PT (2014) Sentiment classification using enhanced contextual valence shifters. In: International conference on Asian language processing (IALP), pp 224–229Google Scholar
- 37.Tran VTN, Phu VN, Tuoi PT (2014) Learning more chi square feature selection to improve the fastest and most accurate sentiment classification. In: The third Asian conference on information systems (ACIS 2014)Google Scholar
- 38.Cambria E, Schuller B, Xia Y, White B (2016) New avenues in knowledge bases for natural language processing. Knowl-Based Syst 108(C):1–4CrossRefGoogle Scholar
- 39.Erik C (2016) Affective computing and sentiment analysis. IEEE Intell Syst 31(2):102–107Google Scholar