Fraud and Deception Detection: Text-Based Data Analytics

Zhang, Qingquan Tony; Li, Beibei; Xie, Danxia

doi:10.1007/978-3-031-11612-4_10

Fraud and Deception Detection: Text-Based Data Analytics

Qingquan Tony Zhang⁵,
Beibei Li⁶ &
Danxia Xie⁷

Chapter
First Online: 01 November 2022

442 Accesses

Part of the book series: Palgrave Studies in Risk and Insurance ((PSRIIN))

Abstract

With the trend of increasingly complex big data, how to handle and improve the authenticity of data has become an important issue related to the credibility of data. This chapter discusses how to imitate and detect similar applications and how to identify fake reviews by machine learning and various statistical methods using deceptive applications and fake reviews as examples.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Belgiu, Mariana, and Drăguţ, Lucian. (2016). Random Forest in Remote Sensing: A Review of Applications and Future Directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114.
Google Scholar
Chen, Y. F., and Liz, Y. (2014). Research on Product Review AttributeG Based of Emotion Evaluate Review Spam Detection. New Technology of Library and Information Service, (9), 81‒90.
Google Scholar
Derek A. Pisner, and David M. Schnyer, (2020). Chapter 6 - Support Vector Machine, Editor(s): Andrea Mechelli, Sandra Vieira, Machine Learning, Academic Press.
Google Scholar
Esuli, A. A. (2006). Bibliography on Sentiment Classification. Available online: http://liinwww.ira.uka.de/ bibliography/Misc/Sentiment.html (accessed on 27 June 2019).
Feng, S., Banerjee, R., and Choi, Y. (2012). Syntactic Stylometry for Deception Detection. Meeting of the Association for Computational Linguistics: Short Papers, 8–14.
Google Scholar
Fuller, C., Biros, D., and Delen, D. (2011). An Investigation of Data and Text Mining Methods for Real World Deception Detection. Expert Systems with Applications, 38(7).
Google Scholar
Guduru, N. (2006). Text Mining with Support Vector Machines and Non-Negative Matrix Factorization Algorithms. Ph.D. Thesis, University of Rhodes Island, Rhodes Island, Greece.
Google Scholar
Hassani, Hossein, Christina Beneki, Stephan Unger, Maedeh T. Mazinani, and Mohammad R. Yeganegi. 2020. Text Mining in Big Data Analytics. Big Data and Cognitive Computing, 4(1), 1.https://doi.org/10.3390/bdcc4010001.
Jindal, N., and Liu, B. (2007). Analyzing and Detecting Review Spam. IEEE International Conference on Data Mining, pp. 547–552.
Google Scholar
Jindal, N., and Liu, B. (2008). Opinion Spam and Analysis. International Conference on Web Search& Data Mining, pp. 219–230.
Google Scholar
Li, L., Qin, B., Liu, T. (2018). Survey on Fake Review Detection Research. Chinese Journal of Computers, 41(4), 946‒948.
Google Scholar
Luca, M. (2011). Reviews, Reputation, and Revenue: The Case of Yelp. Boston: Harvard Business School.
Google Scholar
Meng. M. R., and Ding, S. C. (2013). Motivation And Behavior Of The Fraud Reviews’ Publishers. Information Science, 31(10), 100‒104.
Google Scholar
Ott, M., and Choiy, Cardiec, et al. (2011). Finding Deceptive Opinion Spam by Any Stretch of the Imagination. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (HLT’11), pp. 309–319.
Google Scholar
Popescu, A. M., Nguyen, B., and Etzioni, O. (2005). Extracting Product Feature Sand Opinions from Reviews. Proceedings of HLT/EMNLP on Interactive Demonstrations, pp. 32–33.
Google Scholar
Praveena, M., and Jaiganesh, V. (2017). A Literature Review on Supervised Machine Learning Algorithms and Boosting Process. International Journal of Computer Applications, 169(8), 975–8887.
Google Scholar
Quan Wang, Beibei Li, and Param Vir Singh. (2018). Copycats vs. Original Mobile Apps: A Machine Learning Copycat-Detection Method and Empirical Analysis. Information Systems Research.
Google Scholar
Ren, Y., Ji, D., and Yin, L. (2014). Deceptive Reviews Detection Base don Semi-supervised Learning Algorithm. Journal of Sichuan University (Engineering Science Edition), 46(3), 62‒69.
Google Scholar
Wang, G., Xie, S., Liu, B., et al. (2011). Review Graph Based Online Store Review Spammer Detection. Proceedings of the 2011 IEEE 11th International Conference on Data Mining, pp. 1242–1247.
Google Scholar
Zhang, H., and Li, D. (2007). Naïve Bayes Text Classifier. 2007 IEEE International Conference on Granular Computing (GRC 2007), pp. 708–708. https://doi.org/10.1109/GrC.2007.40.
Zhao, J., and Wang, H. (2016). Detection of Fake Reviews Based on Emotional Orientation and Logistic Regression. CAAI Transactions on Intelligent Systems, 11 (3) , 336‒342.
Google Scholar
袁禄. (2021). 虚假评论识别研究综述. 计算机科学, (1), 111–118.
Google Scholar

Download references

Author information

Authors and Affiliations

University of Illinois Urbana-Champaign, Palatine, IL, USA
Qingquan Tony Zhang
Carnegie Mellon University, Pittsburgh, PA, USA
Beibei Li
Tsinghua University, Beijing, China
Danxia Xie

Authors

Qingquan Tony Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Beibei Li
View author publications
You can also search for this author in PubMed Google Scholar
Danxia Xie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qingquan Tony Zhang .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zhang, Q.T., Li, B., Xie, D. (2022). Fraud and Deception Detection: Text-Based Data Analytics. In: Alternative Data and Artificial Intelligence Techniques. Palgrave Studies in Risk and Insurance. Palgrave Macmillan, Cham. https://doi.org/10.1007/978-3-031-11612-4_10

Download citation

DOI: https://doi.org/10.1007/978-3-031-11612-4_10
Published: 01 November 2022
Publisher Name: Palgrave Macmillan, Cham
Print ISBN: 978-3-031-11611-7
Online ISBN: 978-3-031-11612-4
eBook Packages: Economics and FinanceEconomics and Finance (R0)

Publish with us

Policies and ethics