Effective Approaches for Classification and Rating of Users Reviews
Abstract
Organizations provide platforms where users can express their opinions about products in the form of reviews. Spam reviews are irrelevant reviews that mislead consumers. In this paper, we discuss semantic and machine learning approaches for classifying reviews of trending products and for assigning ratings to the reviews. We applied both approaches to datasets for five different products, collecting both spam and non-spam reviews for training and testing. Using the semantic approach, we obtained an average accuracy across all five products of 82.2% for classification and 84.4% for review rating. Using machine learning, we obtained an accuracy of 82.2% for classification and 89.4% for review rating. We found that both approaches perform well for classifying reviews. For rating reviews, however, the machine learning approach performed marginally better than the semantic approach.
Keywords
Machine learning, NLP, Data mining