Abstract
Sentiment analysis is one of the popular techniques gaining attention in recent times. Nowadays, people gain information on reviews of users regarding public transportation, movies, hotel reservation, etc., by utilizing the resources available, as they meet their needs. Hence, sentiment classification is an essential process employed to determine the positive and negative responses. This paper presents an approach for sentiment classification of train reviews using MapReduce model with the proposed Kernel Optimized-Support Vector Machine (KO-SVM) classifier. The MapReduce framework handles big data using a mapper, which performs feature extraction and reducer that classifies the review based on KO-SVM classification. The feature extraction process utilizes features that are classification-specific and SentiWordNet-based. KO-SVM adopts SVM for the classification, where the exponential kernel is replaced by an optimized kernel, finding the weights using a novel optimizer, self-adaptive lion algorithm. In a comparative analysis, the performance of KO-SVM classifier is compared with SentiWordNet, Naive Bayes, neural network, and LSVM, using the evaluation metrics, specificity, sensitivity, and accuracy, with train review and movie review database. The proposed KO-SVM classifier could attain maximum sensitivity of 93.46% and 91.249%, specificity of 74.485% and 70.018%; and accuracy of 84.341% and 79.611%, respectively, for train review and movie review databases.
Similar content being viewed by others
References
Lin Y, Wan H, Jiang R, Wu Z and Jia X 2015 Inferring the travel purposes of passenger groups for better understanding of passengers. IEEE Trans. Intell. Transp. Syst. 16(1): 235–243
Hurk E, Kroon L, Maróti G and Vervest P 2015 Deduction of passengers’ route choices from smart card data. IEEE Trans. Intell. Transp. Syst. 16(1): 430–440
Ali F, Kim EK and Kim Y G 2015 Fuzzy ontology-based opinion mining and information extraction: a proposal to automate the hotel reservation system. Appl. Intell. 42(3): 481–500
Ali D, Kwak P, Khan S M R, Islam K H, Kim and Kwak K S 2017 Fuzzy ontology-based sentiment analysis of transportation and city feature reviews for safe traveling. Transp. Res. Part C: Emerg. Technol. 77: 33–48
Havasi C, Cambria E, Schuller B, Liu B and Wang H 2013 Knowledge- based approaches to concept-level sentiment analysis. IEEE Intell. Syst. 28(2): 0012–14
Manning C D and Schütze H 1999 Foundations of statistical natural language processing, Cambridge, MA: MIT Press
Pang B, Lee L and Vaithyanathan S 2002 Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the EMNLP, pp. 79–86
Tang D, Qin B, Wei F, Dong L, Liu T and Zhou M 2015 A joint segmentation and classification framework for sentence level sentiment classification. IEEE/ACM Trans. Audio, Speech Lang. Process. 23(11): 1750–1761
Quan C and Ren F 2014 Unsupervised product feature extraction for feature-oriented opinion determination. Inf. Sci. 272: 16–28
Catal C and Nangir M 2017 A sentiment classification model based on multiple classifiers. Appl. Soft Comput. 50: 135–141
Xia R, Zong C and Li S 2011 Ensemble of feature sets and classification algorithms for sentiment classification. Inf. Sci. 181(6): 1138–1152
Liu B 2012 Sentiment analysis and opinion mining. San Rafael: Morgan & Claypool
Phu V N, Dat N D, Tran V T N and Chau T A, Nguyen 2016 fuzzy C-means for English sentiment classification in a distributed system. Appl. Intell. pp. 1–22
Yang Y and Pedersen J O 1997 A comparative study on feature selection in text categorization. In: Proceedings of the ICML’97, pp. 412–420
Li J, Fong S, Zhuang Y and Khoury R 2016 Hierarchical classification in text mining for sentiment analysis of online news. Soft Comput. 20(9): 3411–3420
Taboada M, Brooke J, Tofiloski M, Voll K and Stede M 2011 Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2): 267–307
Zhao J, Dong L, Wu J and Xu K 2012 Moodlens: An emoticon-based sentiment analysis system for chinese tweets. In: Proceedings of the SIGKDD
Maas A L, Daly R E, Pham P T, Huang D, Ng AY and Potts C 2011 Learning word vectors for sentiment analysis. In: Proceedings of the ACL
Hung C and Lin H 2013 Using objective words in SentiWordNet to improve word-of-mouth sentiment classification. IEEE Intell. Syst. 28(2): 47–54
Salehan M and Kim D J 2016 Predicting the performance of online consumer reviews: A sentiment mining approach to big data analytics. Decis. Support Syst. 81; 30–40
Chen T, Xu R, He Y and Wang X 2017 Improving sentiment analysis via sentence type classification using BiLSTM-CRF and CNN. Expert Syst. Appl. 72: 221–230
Saif, Hassan, Fernández, Miriam, He, Yulan and Alani, Harith 2014 On stopwords, filtering and data sparsity for sentiment analysis of Twitter. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation, pp. 810–817
Esuli A and Sebastiani F 2006 SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC’06), 417-422
Ohana B and Tierney B 2009 Sentiment classification of reviews using SentiWordNet. In: 9th IT&T Conference, Dublin Institute of Technology, Dublin, Ireland
Boser B E, Guyon I M and Vladimir N Vapnik 1992 A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on COMPUTATIONAL Learning Theory COLT ‘92, pp. 144–152
Rajakumar B R 2014 Lion algorithm for standard and large scale bilinear system identification: A global optimization based on Lion’s social behavior. In: 2014 IEEE Congress on Evolutionary Computation (CEC), Beijing, pp. 2116–2123
http://www.mouthshut.com/product-reviews/Rajdhani-Express-reviews-925004322-page-3
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Thakur, R.K., Deshpande, M.V. Kernel Optimized-Support Vector Machine and Mapreduce framework for sentiment classification of train reviews. Sādhanā 44, 6 (2019). https://doi.org/10.1007/s12046-018-0980-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12046-018-0980-1