Sentiment Analysis of Movie Reviews Using R
In this chapter, the reader is presented with a step-by-step lexicon-based sentiment analysis using the R open-source software. Using 1,000 movie reviews with sentiment classification labels, the example analysis performs sentiment analysis to assess the predictive accuracy of built-in lexicons in R. Then, a custom stop list is used and accuracy is reevaluated.
KeywordsSentiment analysis Opinion mining Online consumer reviews (OCR) R RStudio Open-source
- Hu, M., & Liu, B. (2004, August). Mining and summarizing customer reviews. In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 168–177). ACM.Google Scholar
- Nielsen, F. Å. (2011). A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. arXiv preprint arXiv:1103.2903.Google Scholar
- R Development Core Team. (2008). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. ISBN 3-900051-07-0, URL http://www.R-project.org
- For more about R software, see R Development Core Team (2008) and visit https://www.r-project.org/