Review Spam Detection Using Semi-supervised Technique

Narayan, Rohit; Rout, Jitendra Kumar; Jena, Sanjay Kumar

doi:10.1007/978-981-10-3376-6_31

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 719)

Abstract

With the growing popularity of e-commerce sites, spammers now target these sites with review spam, in addition to other forms of spam such as email spam and web spam. Fake reviews written by fraudsters prevent customers and organizations from reaching accurate conclusions about products. Hence, they must be detected and eliminated so that potential customers are not deceived. In this paper, we use a semi-supervised learning technique to detect review spam. The proposed work is based on the PU-learning algorithm, which learns from a small number of positive examples and an unlabeled data set. The maximum accuracy we achieved is 78.12%, with an F-score of 76.67, using only 80 positive examples as the training set.
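
The abstract does not describe the algorithm's internals. As a rough illustration of the general PU-learning idea, the following is a minimal sketch, assuming an iterative scheme in which every unlabeled review is first treated as negative, a classifier is trained, and unlabeled reviews that the classifier marks as positive are dropped until the negative set stops shrinking. The naive Bayes classifier, the function names (train_nb, predict_nb, pu_learn), and the toy reviews are all assumptions made for illustration, not the authors' implementation or data.

```python
import math
from collections import Counter

def train_nb(pos_docs, neg_docs):
    """Train a two-class multinomial naive Bayes model (hypothetical helper)."""
    counts = {1: Counter(), 0: Counter()}
    for doc in pos_docs:
        counts[1].update(doc.lower().split())
    for doc in neg_docs:
        counts[0].update(doc.lower().split())
    vocab = set(counts[1]) | set(counts[0])
    total_docs = len(pos_docs) + len(neg_docs)
    priors = {1: len(pos_docs) / total_docs, 0: len(neg_docs) / total_docs}
    return counts, vocab, priors

def predict_nb(model, doc):
    """Return 1 (spam) or 0 (not spam) using add-one smoothed log scores."""
    counts, vocab, priors = model
    scores = {}
    for label in (0, 1):
        total = sum(counts[label].values())
        score = math.log(priors[label])
        for word in doc.lower().split():
            if word in vocab:  # words never seen in training are ignored
                score += math.log((counts[label][word] + 1) / (total + len(vocab)))
        scores[label] = score
    return 1 if scores[1] > scores[0] else 0

def pu_learn(positives, unlabeled, max_iter=10):
    """Iteratively shrink the unlabeled set down to likely negatives.

    Assumed scheme: start with all unlabeled documents as negatives, then
    repeatedly drop those the current model classifies as positive.
    """
    negatives = list(unlabeled)
    model = train_nb(positives, negatives)
    for _ in range(max_iter):
        kept = [d for d in negatives if predict_nb(model, d) == 0]
        # Stop if nothing was dropped, or if everything would be dropped.
        if not kept or len(kept) == len(negatives):
            break
        negatives = kept
        model = train_nb(positives, negatives)
    return model, negatives

# Toy data, invented purely for illustration.
positives = [
    "amazing best hotel ever must stay",
    "best amazing experience ever must visit",
]
unlabeled = [
    "amazing best stay ever must book",     # spam-like, but unlabeled
    "room was clean staff was polite",
    "breakfast was cold room was small",
    "staff was polite breakfast was fine",
]

model, reliable_negatives = pu_learn(positives, unlabeled)
print(len(reliable_negatives), "reviews kept as likely negatives")
print(predict_nb(model, "amazing best hotel must stay"))
```

In this sketch the only labels supplied are the positive (spam) examples; the negative class is inferred from the unlabeled pool, which is the property that lets PU-learning work with very few labeled reviews.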

Author information

Authors and Affiliations

Department of Computer Science & Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India
Rohit Narayan, Jitendra Kumar Rout & Sanjay Kumar Jena

Corresponding author

Correspondence to Rohit Narayan.

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, National Institute of Technology, Rourkela, Odisha, India
Pankaj Kumar Sa
Department of Computer Science and Engineering, National Institute of Technology, Rourkela, Odisha, India
Manmath Narayan Sahoo
School of Mechatronic Engineering, Universiti Malaysia Perlis (UniMAP), Arau, Perlis, Malaysia
M. Murugappan
The University of Exeter, Exeter, Devon, United Kingdom
Yulei Wu
Department of Computer Science and Engineering, National Institute of Technology, Rourkela, Odisha, India
Banshidhar Majhi

About this paper

Cite this paper

Narayan, R., Rout, J.K., Jena, S.K. (2018). Review Spam Detection Using Semi-supervised Technique. In: Sa, P., Sahoo, M., Murugappan, M., Wu, Y., Majhi, B. (eds) Progress in Intelligent Computing Techniques: Theory, Practice, and Applications. Advances in Intelligent Systems and Computing, vol 719. Springer, Singapore. https://doi.org/10.1007/978-981-10-3376-6_31

DOI: https://doi.org/10.1007/978-981-10-3376-6_31
Published: 05 August 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3375-9
Online ISBN: 978-981-10-3376-6
eBook Packages: Engineering, Engineering (R0)
