Skip to main content
Log in

SUH-AIFRD: A self-training-based hybrid approach for individual fake reviewer detection

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Fake reviews and reviewers pose a significant challenge for online review platforms as they can mislead consumers and harm businesses’ reputations. Traditional methods for detecting fake reviews rely on supervised learning techniques, which require a large amount of labelled data and are not scalable for large datasets. However, detecting fake reviews is challenging due to the uncertainty surrounding reviewer behaviour. Fake reviewers may exhibit similar behaviour patterns to legitimate reviewers, making it difficult to distinguish between them. To address this challenge, this research proposed a novel model, SUH-AIFRD, for detecting fake reviewers in online review platforms. The model uses a hybrid approach of S3SVM, SSDT, and USKMW methods to identify content, behavioural, activity, and relationship features indicative of fake reviews. Behavioural features are utilized to address the problem of reviewer behavioural uncertainty. The SUH-AIFRD model achieved high accuracy of 94.99%, with impressive precision, recall, and F1 score at 94.74%, 95.37%, and 95.05%, respectively. The proposed model has significant implications for businesses, consumers, and online review platforms, providing a more effective way to combat the problem of fake reviews.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Algorithm 1
Fig. 3
Algorithm 2
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Availability of supporting data

The data and analyses generated during this study are fully available in the published articles [29].

References

  1. Mewada A, Dewang RK (2022) A comprehensive survey of various methods in opinion spam detection. Multimedia Tools and Applications 82(9):13199–239

    Article  Google Scholar 

  2. Mewada A, Dewang RK (2022) Research on false review detection methods: A state-of-the-art review. Journal of King Saud University-Computer and Information Sciences 34(9):7530–7546

    Article  Google Scholar 

  3. Jain PK, Pamula R, Srivastava G (2021) A systematic literature review on machine learning applications for consumer sentiment analysis using online reviews. Computer science review 41:100413

    Article  Google Scholar 

  4. Maurya SK, Singh D, Maurya AK (2023) Deceptive opinion spam detection approaches: a literature survey. Appl Intell 53(2):2189–2234

    Article  Google Scholar 

  5. Kaur R, Singh S, Kumar H (2018) Rise of spam and compromised accounts in online social networks: A state-of-the-art review of different combating approaches. J Netw Comput Appl 112:53–88

    Article  Google Scholar 

  6. Jain PK, Pamula R, Ansari S. A supervised machine learning approach for the credibility assessment of user-generated content. Wireless Personal Communications 118:2469–85

  7. Zhang D, Li W, Niu B, Wu C. A deep learning approach for detecting fake reviewers: Exploiting reviewing behavior and textual information. Decision Support Systems 166:113911

  8. Jain PK, Prakash A (2022) Predicting consumer recommendation decisions from online reviews: A rough set approach. Wireless Pers Commun 125(4):3443–60

    Article  Google Scholar 

  9. Walther M, Jakobi T, Watson SJ, Stevens G (2023) A systematic literature review about the consumers’ side of fake review detection-Which cues do consumers use to determine the veracity of online user reviews?. Computers in Human Behavior Reports 100278

  10. Sahut JM, Hajek P (2022) Mining behavioural and sentiment-dependent linguistic patterns from restaurant reviews for fake review detection. Technol Forecast Soc Chang 177:121532

    Article  Google Scholar 

  11. Kumar N, Venugopal D, Qiu L, Kumar S (2018) Detecting review manipulation on online platforms with hierarchical supervised learning. J Manag Inf Syst 35(1):350–80

    Article  Google Scholar 

  12. Hussain N, Mirza HT, Hussain I, Iqbal F, Memon I (2020) Spam review detection using the linguistic and spammer behavioral methods. IEEE Access 8:53801–16

    Article  Google Scholar 

  13. Wu Z, Cao J, Wang Y, Wang Y, Zhang L, Wu J (2018) hPSD: a hybrid PU-learning-based spammer detection model for product reviews. IEEE transactions on cybernetics 50(4):1595–606

    Article  Google Scholar 

  14. Hearst MA, Dumais ST, Osuna E, Platt J, Scholkopf B (1998) Support vector machines. IEEE Intelligent Systems and their applications 13(4):18–28

    Article  Google Scholar 

  15. Safavian SR, Landgrebe D (1991) A survey of decision tree classifier methodology. IEEE Trans Syst Man Cybern 3:660–74

    Article  MathSciNet  Google Scholar 

  16. Likas A, Vlassis N, Verbeek JJ (2003) The global k-means clustering algorithm. Pattern Recogn 36(2):451–61

    Article  Google Scholar 

  17. Lim EP, Nguyen VA, Jindal N, Liu B, Lauw HW (2010) Detecting product review spammers using rating behaviors. In Proceedings of the 19th ACM international conference on Information and knowledge management (pp. 939–948)

  18. Xue H, Li F, Seo H, Pluretti R (2015) Trust-aware review spam detection. In 2015 IEEE Trustcom/BigDataSE/ISPA (Vol. 1, pp. 726–733). IEEE

  19. Fei G, Mukherjee A, Liu B, Hsu M, Castellanos M, Ghosh R (2013) Exploiting burstiness in reviews for review spammer detection. In Proceedings of the international AAAI conference on web and social media (Vol. 7, No. 1, pp. 175–184)

  20. Kim J, Kang J, Shin S, Myaeng SH (2021) Can you distinguish truthful from fake reviews? User analysis and assistance tool for fake review detection. In Proceedings of the first workshop on bridging human–computer interaction and natural language processing (pp. 53–59)

  21. Mukherjee A, Kumar A, Liu B, Wang J, Hsu M, Castellanos M, Ghosh R (2013) Spotting opinion spammers using behavioral footprints. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 632–640)

  22. Salehan M, Kim DJ (2016) Predicting the performance of online consumer reviews: A sentiment mining approach to big data analytics. Decis Support Syst 81:30–40

    Article  Google Scholar 

  23. Batanović V, Cvetanović M, Nikolić B (2020) A versatile framework for resource-limited sentiment articulation, annotation, and analysis of short texts. PLoS ONE 15(11):e0242050

    Article  Google Scholar 

  24. Ott M, Cardie C, Hancock J (2012) Estimating the prevalence of deception in online review communities. In Proceedings of the 21st international conference on World Wide Web 201–210

  25. Kuang L, Zhang H, Shi R, Liao Z, Yang X (2020) A spam worker detection approach based on heterogeneous network embedding in crowdsourcing platforms. Comput Netw 183:107587

    Article  Google Scholar 

  26. Ott M, Choi Y, Cardie C, Hancock JT (2011) Finding deceptive opinion spam by any stretch of the imagination. In Proceedings of the 49th annual meeting of the association for computational linguistics: Human language technologies-volume 1, pages 309–319. Association for Computational Linguistics

  27. Jindal N, Liu B (2007) Analyzing and detecting review spam. In Seventh IEEE international conference on data mining (ICDM 2007) (pp. 547–552). IEEE

  28. Ji SJ, Zhang Q, Li J, Chiu DK, Xu S, Yi L, Gong M (2020) A burst-based unsupervised method for detecting review spammer groups. Inf Sci 536:454–69

    Article  MathSciNet  Google Scholar 

  29. Rayana S, Akoglu L (2015) Collective opinion spam detection: Bridging review networks and metadata. In Proceedings of the 21th acm sigkdd international conference on knowledge discovery and data mining (pp. 985–994)

  30. Rostami M, Oussalah M, Berahmand K, Farrahi V (2023) Community Detection Algorithms in Healthcare Applications: A Systematic Review. IEEE Access

  31. Fang Y, Wang H, Zhao L, Yu F, Wang C (2020) Dynamic knowledge graph based fake-review detection. Appl Intell 50:4281–4295

    Article  Google Scholar 

  32. Zhang D, Zhou L, Kehoe JL, Kilic IY (2016) What online reviewer behaviors really matter? Effects of verbal and nonverbal behaviors on detection of fake online reviews. J Manag Inf Syst 33(2):456–81

    Article  Google Scholar 

  33. Liu P, Xu Z, Ai J, Wang F (2017) Identifying indicators of fake reviews based on spammer’s behavior features. In2017 IEEE international conference on software quality, reliability and security companion (QRS-C) (pp. 396–403). IEEE

  34. Rastogi A, Mehrotra M, Ali SS (2020) Effective opinion spam detection: A study on review metadata versus content. Journal of Data and Information Science 2:76–110

    Article  Google Scholar 

  35. Kumar J (2020) Fake Review Detection Using Behavioral and Contextual Features. arXiv:2003.00807

  36. Kumar A, Gopal RD, Shankar R, Tan KH (2022) Fraudulent review detection model focusing on emotional expressions and explicit aspects: investigating the potential of feature engineering. Decis Support Syst 155:113728

    Article  Google Scholar 

  37. Jain A, Nandakumar K, Ross A (2005) Score normalization in multimodal biometric systems. Pattern Recogn 38(12):2270–85

    Article  Google Scholar 

  38. Rao VM, Sastry VN (2012) Unsupervised feature ranking based on representation entropy. In 2012 1st International Conference on Recent Advances in Information Technology (RAIT) (pp. 421–425). IEEE

  39. Rosenberg C, Hebert M, Schneiderman H (2005) Semi-supervised self-training of object detection models. 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION’05) 36(2)

  40. Sheikhpour R, Berahmand K, Forouzandeh S (2023) Hessian-based semi-supervised feature selection using generalized uncorrelated constraint. Knowl-Based Syst 269:110521

    Article  Google Scholar 

  41. Zhai X, Oliver A, Kolesnikov A, Beyer L (2019) S4l: Self-supervised semi-supervised learning. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 1476–1485)

  42. Dematis I, Karapistoli E, Vakali A (2018) Fake review detection via exploitation of spam indicators and reviewer behavior characteristics. In SOFSEM 2018: Theory and Practice of Computer Science: 44th International Conference on Current Trends in Theory and Practice of Computer Science, Krems, Austria, January 29-February 2, 2018, Proceedings 44, pages 581–595. Springer

  43. Hussain N, Mirza HT, Hussain I et al (2019) Detecting spam review through spammer’s behavior analysis. Advances in Distributed Computing and Artificial Intelligence Journal 8

  44. Khalifa MB, Elouedi Z, Lefevre E (2020) An evidential spammer detection based on the suspicious behaviors’ indicators. In 2020 International Multi-Conference on:“Organization of Knowledge and Advanced Technologies”(OCTA), pages 1–8. IEEE

Download references

Acknowledgements

Not Applicable

Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed equally to this work. Each author was involved in the conceptualization, data collection, analysis and interpretation, and writing of the manuscript. All authors have read and approved the final manuscript and have provided their consent for its publication.

Corresponding author

Correspondence to Arvind Mewada.

Ethics declarations

Competing interests:

The authors declare that they have no conflict of interest.

Ethical Approval:

This article does not involve studies with human or animal subjects.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mewada, A., Dewang, R.K. SUH-AIFRD: A self-training-based hybrid approach for individual fake reviewer detection. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-18192-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11042-024-18192-1

Keywords

Navigation