Sentiment Classification of Short Texts

Kaur, Jaspinder; Dara, Rozita; Matsakis, Pascal

doi:10.1007/978-3-319-92058-0_73

Jaspinder Kaur¹⁷,
Rozita Dara¹⁷ &
Pascal Matsakis¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10868))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

3086 Accesses

Abstract

Over the few years, Sentiment analysis has been the heart of social media research due to the huge volume of opinionated data available on the web and its pervasive real life and commercial applications. Sentiment classification of shorter texts such as movie reviews is challenging due to lack of contextual information which often leads to interesting and unexpected results. Historically, this problem has been addressed using machine learning algorithms that usually learn from rule-based approaches or manually defined sparse features. In the recent years, Deep Neural Networks have gained a lot of attention in sentiment analysis due to their ability to effectively capture subtle semantic information from the input. These methods are capable of building dense continuous feature vectors, which is difficult to model in conventional models such as bag-of-words. In this paper, we conduct experiments and compare several machine learning algorithms Support Vector Machine, Naïve Bayes, Random Forest, and a Deep Learning Algorithm. We selected Convolution Neural Network (CNN) trained on top of various pre-trained word vectors for movie review classification. We validate above models on IMDB movie review dataset, experimental results demonstrate that the task of sentiment analysis can benefit more from the CNN rather than the machine learning techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Pang, B., et al.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86 (2002)
Google Scholar
Turney, P.D.: Thumbs up or thumbs down? semantic orientation applied to unsupervised  classification of reviews. In: Proceedings of the Association for Computational Linguistics (ACL), pp. 417–424 (2002)
Google Scholar
Turney, P.D., Littman, M.L.: Measuring praise and criticism: inference of semantic  orientation from association. ACM Trans. Inf. Syst. TOIS 21(4), 315–346 (2003)
Google Scholar
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)
Google Scholar
Mudinas, A., et al.: Combining lexicon and learning based approaches for concept-level sentiment analysis. In: Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining, Article 5, pp. 1–8. ACM, New York (2012)
Google Scholar
Joshi, A., et al.: C-feel-it: a sentiment analyzer for micro blogs. In: Proceedings of ACL: Systems Demonstrations, HLT, vol. 11, pp. 127–132 2011
Google Scholar
Zhai, Z., et al.: Clustering product features for opinion mining. In: WSDM 2011, 9–12 February 2011, Hong Kong, China (2011)
Google Scholar
Yang, F., Wang, H.Z., Mi, H., Cai, W.W.: Using random forest for reliable classification and cost-sensitive learning for medical diagnosis. BMC Bioinform. 10(1), S22 (2009)
Article Google Scholar
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5(4), 1093–1113 (2014)
Article Google Scholar
Socher, R., Perelygin, A., Wu, J., Chuang, J., Manning, C.D., Ng, A., Potts, C.: Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1631–1642 (2013)
Google Scholar
Large Movie Review Dataset, http://ai.stanford.edu/~amaas/data/sentiment/. Accessed 26 Feb 2018
Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 375–384. ACM, November 2009
Google Scholar
Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)
Article Google Scholar
Johnson, R., Zhang, T.: Effective use of word order for text categorization with convolutional neural networks. arXiv preprint arXiv:1412.1058 (2014)
Joachims, T.: Making large-scale SVM learning practical. In: Schölkopf, B., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning, pp. 44–56. MIT Press (1999)
Google Scholar
Support Vector Machine, http://svmlight.joachims.org/. Accessed 26 Feb 2018
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436 (2015)
Article Google Scholar
Breiman, L.: Random forests. Machine learning 45(1), 5–32 (2001)
Article Google Scholar
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends® Inf. Retr. 2(1–2), 1–135 (2008)
Article Google Scholar
Abbasi, A., Chen, H., Salem, A.: Sentiment analysis in multiple languages: Feature selection for opinion classification in web forums. ACM Transactions on Information Systems (TOIS) 26(3), 12 (2008)
Article Google Scholar
Bengio, Y.: Deep learning of representations: looking forward. In: Dediu, A.-H., Martín-Vide, C., Mitkov, R., Truthe, B. (eds.) SLSP 2013. LNCS (LNAI), vol. 7978, pp. 1–37. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39593-2_1
Chapter Google Scholar
Kim, Y.: Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014)
Mu, Y., et al.: Event-related theta and alpha oscillations mediate empathy for pain. Brain Res. 1234, 128–136 (2008)
Article Google Scholar
Kalchbrenner, N., Grefenstette, E., Blunsom, P.: A convolutional neural network for modelling sentences. arXiv preprint arXiv:1404.2188 (2014)
Zhang, Y., Wallace, B.: A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. arXiv preprint arXiv:1510.03820 (2015)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. Proc. ICML 30(1), 3 (2013)
Google Scholar
Kalchbrenner, N., Grefenstette, E., Blunsom, P.: A convolutional neural network for modelling sentences. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol. 1 Long Papers, Baltimore, Mary, pp. 655–665 (2014)
Google Scholar
Kotsiantis, S.B., Zaharakis, I., Pintelas, P.: Supervised machine learning: A review of classification techniques. Emer. Artif. Intell. Appl. Comput. Eng. 160, 3–24 (2007)
Google Scholar
Yu, L., Liu, H.: Efficient feature selection via analysis of relevance and redundancy. JMLR 5(Oct), 1205–1224 (2004)
Google Scholar
Xu, B., Guo, X., Ye, Y., Cheng, J.: An improved random forest classifier for text categorization. JCP 7(12), 2913–2920 (2012)
Google Scholar
Google. https://code.google.com/archive/p/word2vec/. Accessed 26 Feb 2018
Gokulakrishnan, B., Priyanthan, P., Ragavan, T., Prasath, N., Perera, A.: Opinion mining and sentiment analysis on a twitter data stream. In: 2012 International Conference on Advances in ICT for Emerging Regions (ICTer), pp. 182–188. IEEE, December 2012
Google Scholar
Adankon, M.M., Cheriet, M.: Support vector machine. In: Encyclopedia of Biometrics, pp. 1303–1308. Springer, Boston (2009)
Google Scholar
Rish, I.: An empirical study of the naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, vol. 3(22), pp. 41–46. IBM, August 2001
Google Scholar

Download references

Author information

Authors and Affiliations

University of Guelph, Guelph, Canada
Jaspinder Kaur, Rozita Dara & Pascal Matsakis

Authors

Jaspinder Kaur
View author publications
You can also search for this author in PubMed Google Scholar
Rozita Dara
View author publications
You can also search for this author in PubMed Google Scholar
Pascal Matsakis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Jaspinder Kaur , Rozita Dara or Pascal Matsakis .

Editor information

Editors and Affiliations

University of Regina, Regina, SK, Canada
Malek Mouhoub
University of Regina, Regina, SK, Canada
Samira Sadaoui
Concordia University, Montreal, QC, Canada
Otmane Ait Mohamed
Texas State University, San Marcos, TX, USA
Moonis Ali

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kaur, J., Dara, R., Matsakis, P. (2018). Sentiment Classification of Short Texts. In: Mouhoub, M., Sadaoui, S., Ait Mohamed, O., Ali, M. (eds) Recent Trends and Future Technology in Applied Intelligence. IEA/AIE 2018. Lecture Notes in Computer Science(), vol 10868. Springer, Cham. https://doi.org/10.1007/978-3-319-92058-0_73

Download citation

DOI: https://doi.org/10.1007/978-3-319-92058-0_73
Published: 30 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92057-3
Online ISBN: 978-3-319-92058-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics