Abstract
With the recent emergence of Web-based applications and use of social networking sites, number of people are eager in expressing their views and opinions online. The sentimental analysis also referred to as opinion mining aims at processing user reviews (about products, movies, services, books, places, etc.). These reviews are often unstructured and need processing to evolve into the productive knowledge. Majority of the sentiment analysis works on the classification of opinion polarity with the use of simple classifiers. Handling diverse data distribution is one of the major issues that simple classifiers suffer. To cope up with the issue in this paper, we utilized the ensemble learners on the polarity prediction of the movie reviews. The proposed work processes the review data through some elementary steps that are conducted for the feature extraction in sentiment analysis. In addition to the feature extraction, we further perform the feature selection for the sake of dimensionality reduction. However, in contrast to the conventional simple learner, we applied the ensemble learner in the proposed model and evaluated its performance. To compare the ensemble model competence, we conducted the experiment on both individual as well as ensemble learner (random forest, AdaBoost, extra trees) and computed classification measures on both the model. IMDB dataset is used, and the polarity of a review, i.e., whether it is positive or negative, is predicted. With an extensive experimentation, it is found that results of ensemble classifiers are outperforming than individual learner in the classification of sentiment polarity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fernández-gavilanes M, Álvarez-lópez T, Juncal-martínez J, Costa-montenegro E, González-castaño FJ (2016) Unsupervised method for sentiment analysis in online texts, vol 58, pp 57–75
Medhat, W., Hassan, A., Korashy H (2014) Sentiment analysis algorithms and applications: a survey. Ain Shams Eng 5(4):1093–1113
Parkhe V (2014) Aspect based sentiment analysis of movie reviews
Singh VK, Piryani R, Uddin A (2013) Sentiment analysis of movie reviews, pp 712–717
Pang B, Lee L, Vaithyanathan S (2002) Thumbs up: sentiment classification using machine learning techniques. Proc Conf Empir Methods Nat Lang Process, 79–86
Salvetti F, Lewis S, Reichenbach C (2004) Automatic opinion polarity classification of movie. Color Res Linguist 17(1):2
Mullen T, Collier N (2004) Sentiment analysis using support vector machines with diverse information sources. Conf Empir Methods Nat Lang Process, 412–418
Matsumoto S, Takamura H, Okumura M (2005) Sentiment classification using word sub-sequences and dependency sub-trees. In: Proceedings of 9th Pacific-Asia conference advances in knowledge discovery and data mining, vol 059, pp 301–311
Liu SM, Chen J-H (2015) A multi-label classification based approach for sentiment classification. Expert Syst Appl 42(3):1083–1093
Lin Y, Lei H, Wu J, Li X (2015) An empirical study on sentiment classification of Chinese review using word embedding. In: 29th Pacific Asia conference on language information and computation, pp 258–266
https://docs.oracle.com/database/121/DMCON/feature_extr.htm#DMCON268
Pechenizkiy M, Puuronen S, Tsymbal A (2001) Feature extraction for classification in the data mining process PCA-based feature extraction feature extraction for a classifier and dynamic integration of classifiers. Int J 10:271–278
Tripathy A, Agrawal A, Rath SK (2016) Classification of sentiment reviews using n-gram machine learning approach. Expert Syst Appl 57:117–126
Pedregosa F (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830
McCallum A, Nigam K (1998) A comparison of event models for Naive Bayes text classification. AAAI/ICML-98 work learning for text categorization, pp 41–48
Kibriya AM (2004) Multinomial Naive Bayes for text categorization revisited. Adv Artif Intell, 488–499
Mason L, Baxter J, Bartlett P, Frean M (1999) Boosting algorithms as gradient descent. Nips, 512–518
Fradkin D, Muchnik I (2006) Support vector machines for classification. Discret Methods Epidemiol 70:13–20
http://sebastianraschka.com/Articles/2014_naive_bayes_1.html
https://books.google.co.in/books?id=48u5BQAAQBAJ&pg=PA369&lpg=PA369&dq=Stochastic+Gradient+Descent
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Singh, P. (2018). Sentiment Analysis Using Tuned Ensemble Machine Learning Approach. In: Kolhe, M., Trivedi, M., Tiwari, S., Singh, V. (eds) Advances in Data and Information Sciences. Lecture Notes in Networks and Systems, vol 38. Springer, Singapore. https://doi.org/10.1007/978-981-10-8360-0_27
Download citation
DOI: https://doi.org/10.1007/978-981-10-8360-0_27
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8359-4
Online ISBN: 978-981-10-8360-0
eBook Packages: EngineeringEngineering (R0)