Skip to main content
Log in

Kernel Optimized-Support Vector Machine and Mapreduce framework for sentiment classification of train reviews

  • Published:
Sādhanā Aims and scope Submit manuscript

Abstract

Sentiment analysis is one of the popular techniques gaining attention in recent times. Nowadays, people gain information on reviews of users regarding public transportation, movies, hotel reservation, etc., by utilizing the resources available, as they meet their needs. Hence, sentiment classification is an essential process employed to determine the positive and negative responses. This paper presents an approach for sentiment classification of train reviews using MapReduce model with the proposed Kernel Optimized-Support Vector Machine (KO-SVM) classifier. The MapReduce framework handles big data using a mapper, which performs feature extraction and reducer that classifies the review based on KO-SVM classification. The feature extraction process utilizes features that are classification-specific and SentiWordNet-based. KO-SVM adopts SVM for the classification, where the exponential kernel is replaced by an optimized kernel, finding the weights using a novel optimizer, self-adaptive lion algorithm. In a comparative analysis, the performance of KO-SVM classifier is compared with SentiWordNet, Naive Bayes, neural network, and LSVM, using the evaluation metrics, specificity, sensitivity, and accuracy, with train review and movie review database. The proposed KO-SVM classifier could attain maximum sensitivity of 93.46% and 91.249%, specificity of 74.485% and 70.018%; and accuracy of 84.341% and 79.611%, respectively, for train review and movie review databases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8

Similar content being viewed by others

References

  1. Lin Y, Wan H, Jiang R, Wu Z and Jia X 2015 Inferring the travel purposes of passenger groups for better understanding of passengers. IEEE Trans. Intell. Transp. Syst. 16(1): 235–243

    Article  Google Scholar 

  2. Hurk E, Kroon L, Maróti G and Vervest P 2015 Deduction of passengers’ route choices from smart card data. IEEE Trans. Intell. Transp. Syst. 16(1): 430–440

    Article  Google Scholar 

  3. Ali F, Kim EK and Kim Y G 2015 Fuzzy ontology-based opinion mining and information extraction: a proposal to automate the hotel reservation system. Appl. Intell. 42(3): 481–500

    Article  Google Scholar 

  4. Ali D, Kwak P, Khan S M R, Islam K H, Kim and Kwak K S 2017 Fuzzy ontology-based sentiment analysis of transportation and city feature reviews for safe traveling. Transp. Res. Part C: Emerg. Technol. 77: 33–48

    Article  Google Scholar 

  5. Havasi C, Cambria E, Schuller B, Liu B and Wang H 2013 Knowledge- based approaches to concept-level sentiment analysis. IEEE Intell. Syst. 28(2): 0012–14

    Article  Google Scholar 

  6. Manning C D and Schütze H 1999 Foundations of statistical natural language processing, Cambridge, MA: MIT Press

    MATH  Google Scholar 

  7. Pang B, Lee L and Vaithyanathan S 2002 Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the EMNLP, pp. 79–86

  8. Tang D, Qin B, Wei F, Dong L, Liu T and Zhou M 2015 A joint segmentation and classification framework for sentence level sentiment classification. IEEE/ACM Trans. Audio, Speech Lang. Process. 23(11): 1750–1761

    Article  Google Scholar 

  9. Quan C and Ren F 2014 Unsupervised product feature extraction for feature-oriented opinion determination. Inf. Sci. 272: 16–28

    Article  Google Scholar 

  10. Catal C and Nangir M 2017 A sentiment classification model based on multiple classifiers. Appl. Soft Comput. 50: 135–141

    Article  Google Scholar 

  11. Xia R, Zong C and Li S 2011 Ensemble of feature sets and classification algorithms for sentiment classification. Inf. Sci. 181(6): 1138–1152

    Article  Google Scholar 

  12. Liu B 2012 Sentiment analysis and opinion mining. San Rafael: Morgan & Claypool

  13. Phu V N, Dat N D, Tran V T N and Chau T A, Nguyen 2016 fuzzy C-means for English sentiment classification in a distributed system. Appl. Intell. pp. 1–22

  14. Yang Y and Pedersen J O 1997 A comparative study on feature selection in text categorization. In: Proceedings of the ICML’97, pp. 412–420

  15. Li J, Fong S, Zhuang Y and Khoury R 2016 Hierarchical classification in text mining for sentiment analysis of online news. Soft Comput. 20(9): 3411–3420

    Article  Google Scholar 

  16. Taboada M, Brooke J, Tofiloski M, Voll K and Stede M 2011 Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2): 267–307

    Article  Google Scholar 

  17. Zhao J, Dong L, Wu J and Xu K 2012 Moodlens: An emoticon-based sentiment analysis system for chinese tweets. In: Proceedings of the SIGKDD

  18. Maas A L, Daly R E, Pham P T, Huang D, Ng AY and Potts C 2011 Learning word vectors for sentiment analysis. In: Proceedings of the ACL

  19. Hung C and Lin H 2013 Using objective words in SentiWordNet to improve word-of-mouth sentiment classification. IEEE Intell. Syst. 28(2): 47–54

    Article  Google Scholar 

  20. Salehan M and Kim D J 2016 Predicting the performance of online consumer reviews: A sentiment mining approach to big data analytics. Decis. Support Syst. 81; 30–40

    Article  Google Scholar 

  21. Chen T, Xu R, He Y and Wang X 2017 Improving sentiment analysis via sentence type classification using BiLSTM-CRF and CNN. Expert Syst. Appl. 72: 221–230

    Article  Google Scholar 

  22. Saif, Hassan, Fernández, Miriam, He, Yulan and Alani, Harith 2014 On stopwords, filtering and data sparsity for sentiment analysis of Twitter. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation, pp. 810–817

  23. Esuli A and Sebastiani F 2006 SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC’06), 417-422

  24. Ohana B and Tierney B 2009 Sentiment classification of reviews using SentiWordNet. In: 9th IT&T Conference, Dublin Institute of Technology, Dublin, Ireland

  25. Boser B E, Guyon I M and Vladimir N Vapnik 1992 A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on COMPUTATIONAL Learning Theory COLT ‘92, pp. 144–152

  26. Rajakumar B R 2014 Lion algorithm for standard and large scale bilinear system identification: A global optimization based on Lion’s social behavior. In: 2014 IEEE Congress on Evolutionary Computation (CEC), Beijing, pp. 2116–2123

  27. http://www.mouthshut.com/product-reviews/Rajdhani-Express-reviews-925004322-page-3

  28. http://www.cs.cornell.edu/people/pabo/movie-review-data

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rashmi K Thakur.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Thakur, R.K., Deshpande, M.V. Kernel Optimized-Support Vector Machine and Mapreduce framework for sentiment classification of train reviews. Sādhanā 44, 6 (2019). https://doi.org/10.1007/s12046-018-0980-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12046-018-0980-1

Keywords

Navigation