Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier

World Wide Web

Abstract

With the rapid development of the World Wide Web, electronic word-of-mouth interaction has turned consumers into active participants. The large number of reviews that consumers post on the Web gives prospective consumers valuable information for making decisions and helps businesses predict success and sustainability. In this paper, a Gini Index based feature selection method with a Support Vector Machine (SVM) classifier is proposed for sentiment classification of a large movie review data set. The results show that the Gini Index method gives better classification performance, with a lower error rate and higher accuracy.
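The abstract does not give the scoring formula, so the following is only a minimal sketch of Gini Index feature selection under stated assumptions. It uses one common text-classification form, the purity score G(t) = Σ_c P(c|t)², estimated from document frequencies. The function names (`gini_scores`, `select_top_terms`, `vectorize`), the `min_df` cutoff, and the toy reviews are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def gini_scores(docs, labels):
    """Score each term by G(t) = sum over classes of P(c|t)^2.

    Assumption: P(c|t) is the share of documents containing t that
    belong to class c. The paper's exact formulation may differ.
    """
    term_class_df = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        for term in set(tokens):  # document frequency, not term frequency
            term_class_df[term][label] += 1

    scores = {}
    for term, per_class in term_class_df.items():
        total = sum(per_class.values())
        scores[term] = sum((n / total) ** 2 for n in per_class.values())
    return scores


def select_top_terms(docs, labels, k, min_df=2):
    """Return the k purest terms that occur in at least min_df documents."""
    doc_freq = Counter(term for tokens in docs for term in set(tokens))
    scores = gini_scores(docs, labels)
    candidates = [t for t in scores if doc_freq[t] >= min_df]
    # Highest score first; ties broken alphabetically for determinism.
    candidates.sort(key=lambda t: (-scores[t], t))
    return candidates[:k]


def vectorize(tokens, vocabulary):
    """Binary bag-of-words vector restricted to the selected terms."""
    present = set(tokens)
    return [1 if term in present else 0 for term in vocabulary]


# Hypothetical toy reviews, already tokenized.
docs = [
    ["great", "acting", "movie"],
    ["great", "plot", "movie"],
    ["boring", "plot", "movie"],
    ["boring", "acting", "movie"],
]
labels = ["pos", "pos", "neg", "neg"]

scores = gini_scores(docs, labels)
vocabulary = select_top_terms(docs, labels, k=2)
features = [vectorize(tokens, vocabulary) for tokens in docs]
```

Under this score, a term that occurs in only one class reaches the maximum of 1.0, while a term spread evenly over two classes scores 0.5. The resulting `features` rows could then be passed to any SVM implementation, for example scikit-learn's `LinearSVC`, as the classification step.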

Author information

Corresponding author

Correspondence to Asha S Manek.

About this article

Cite this article

Manek, A.S., Shenoy, P.D., Mohan, M.C. et al. Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 20, 135–154 (2017). https://doi.org/10.1007/s11280-015-0381-x

  • DOI: https://doi.org/10.1007/s11280-015-0381-x
