Skip to main content

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 167))

Abstract

This paper proposes use of ensemble voting learning classification approach for sentiment classification of movie review dataset collected from IMDB. Sentiment analysis is a technique used to extract opinions from text including reviews, social messaging, etc. Generally, the opinion is classified into positive and negative polarity. The approach has been used in variety of domains including financial, educational, and other areas. Over the years many researchers have worked on sentiment classification to predict the opinion of text using several machine learning algorithms. The work carried out in this paper proposes the use of ensemble voting learning approach in which Naïve Bayes (NB), K-nearest neighbor (KNN), random forest (RF), and decision tree (DT). The results are compared with individual classification algorithm and voting approach. We have also used normalized and denormalized data to compare the accuracy results. The results show that using voting approach decreases root mean square error and increases precision of the classifier. Maximum accuracy of 80.13% is obtained with voting and normalized data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Abbreviations

AUC:

Area Under the Curve

CNN:

Convolutional-Neural-Network

DL:

Deep-Learning

DT:

Decision-Tree

KNN:

K-Nearest-Neighbor

LSTM:

Long-Short-Term-Memory

NB:

Naïve-Bayes

NLP:

Natural Language Processing

RF:

Random-Forest

SVM:

Support Vector Machine

TF-IDF:

Term Frequency Inverse Document Frequency

References

  1. Giatsoglou M, Vozalis MG, Diamantaras K, Vakali A, Sarigiannidis G, Chatzisavvas KC (2017) Sentiment analysis leveraging emotions and word embedding’s. Expert Syst Appl 69:214–224

    Google Scholar 

  2. Kumar HM, Harish BS, Darshan HK (2019) Sentiment analysis on IMDb movie reviews using hybrid feature extraction method. Int J Interact Multimedia Artif Intell 5(5)

    Google Scholar 

  3. Hussein DMEDM (2018) A survey on sentiment analysis challenges. J King Saud Univ Eng Sci 30(4):330–338

    Google Scholar 

  4. Maas AL, Daly RE, Pham PT, Huang D, Ng AY, Potts C (2011) Learning word vectors for sentiment analysis. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, vol 1. Association for Computational Linguistics, pp 142–150

    Google Scholar 

  5. Tripathy A, Agrawal A, Rath SK (2016) Classification of sentiment reviews using n-gram machine learning approach. Expert Syst Appl 57:117–126

    Article  Google Scholar 

  6. Guan Y (2018) IMDB review mining and movie recommendation. In: Conference on statistical learning and data science/nonparametric statistics

    Google Scholar 

  7. Sahu TP, Ahuja S (2016, Jan) Sentiment analysis of movie reviews: a study on feature selection and classification algorithms. In: 2016 International conference on microelectronics, computing and communications (MicroCom). IEEE, pp 1–6

    Google Scholar 

  8. Shaukat Z, Zulfiqar AA, Xiao C, Azeem M, Mahmood T (2020) Sentiment analysis on IMDB using lexicon and neural networks. SN Appl Sci 2(2):1–10

    Article  Google Scholar 

  9. Yenter A, Verma A (2017) Deep CNN-LSTM with combined kernels from multiple branches for IMDB review sentiment analysis. In: IEEE 8th annual ubiquitous computing, electronics and mobile communication conference (UEMCON). IEEE, pp 540–546

    Google Scholar 

  10. Sivakumar S, Rajalakshmi R (2019) Comparative evaluation of various feature weighting methods on movie reviews. In: Computational intelligence in data mining. Springer, Singapore, pp 721–730

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vanita Jain .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jain, A., Jain, V. (2021). Voting Ensemble Classifier for Sentiment Analysis. In: Abraham, A., Castillo, O., Virmani, D. (eds) Proceedings of 3rd International Conference on Computing Informatics and Networks. Lecture Notes in Networks and Systems, vol 167. Springer, Singapore. https://doi.org/10.1007/978-981-15-9712-1_22

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-9712-1_22

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-9711-4

  • Online ISBN: 978-981-15-9712-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics