Skip to main content

Fraud and Deception Detection: Text-Based Data Analytics

  • Chapter
  • First Online:
  • 442 Accesses

Part of the book series: Palgrave Studies in Risk and Insurance ((PSRIIN))

Abstract

With the trend of increasingly complex big data, how to handle and improve the authenticity of data has become an important issue related to the credibility of data. This chapter discusses how to imitate and detect similar applications and how to identify fake reviews by machine learning and various statistical methods using deceptive applications and fake reviews as examples.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  • Belgiu, Mariana, and Drăguţ, Lucian. (2016). Random Forest in Remote Sensing: A Review of Applications and Future Directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114.

    Google Scholar 

  • Chen, Y. F., and Liz, Y. (2014). Research on Product Review AttributeG Based of Emotion Evaluate Review Spam Detection. New Technology of Library and Information Service, (9), 81‒90.

    Google Scholar 

  • Derek A. Pisner, and David M. Schnyer, (2020). Chapter 6 - Support Vector Machine, Editor(s): Andrea Mechelli, Sandra Vieira, Machine Learning, Academic Press.

    Google Scholar 

  • Esuli, A. A. (2006). Bibliography on Sentiment Classification. Available online: http://liinwww.ira.uka.de/ bibliography/Misc/Sentiment.html (accessed on 27 June 2019).

  • Feng, S., Banerjee, R., and Choi, Y. (2012). Syntactic Stylometry for Deception Detection. Meeting of the Association for Computational Linguistics: Short Papers, 8–14.

    Google Scholar 

  • Fuller, C., Biros, D., and Delen, D. (2011). An Investigation of Data and Text Mining Methods for Real World Deception Detection. Expert Systems with Applications, 38(7).

    Google Scholar 

  • Guduru, N. (2006). Text Mining with Support Vector Machines and Non-Negative Matrix Factorization Algorithms. Ph.D. Thesis, University of Rhodes Island, Rhodes Island, Greece.

    Google Scholar 

  • Hassani, Hossein, Christina Beneki, Stephan Unger, Maedeh T. Mazinani, and Mohammad R. Yeganegi. 2020. Text Mining in Big Data Analytics. Big Data and Cognitive Computing, 4(1), 1.https://doi.org/10.3390/bdcc4010001.

  • Jindal, N., and Liu, B. (2007). Analyzing and Detecting Review Spam. IEEE International Conference on Data Mining, pp. 547–552.

    Google Scholar 

  • Jindal, N., and Liu, B. (2008). Opinion Spam and Analysis. International Conference on Web Search& Data Mining, pp. 219–230.

    Google Scholar 

  • Li, L., Qin, B., Liu, T. (2018). Survey on Fake Review Detection Research. Chinese Journal of Computers, 41(4), 946‒948.

    Google Scholar 

  • Luca, M. (2011). Reviews, Reputation, and Revenue: The Case of Yelp. Boston: Harvard Business School.

    Google Scholar 

  • Meng. M. R., and Ding, S. C. (2013). Motivation And Behavior Of The Fraud Reviews’ Publishers. Information Science, 31(10), 100‒104.

    Google Scholar 

  • Ott, M., and Choiy, Cardiec, et al. (2011). Finding Deceptive Opinion Spam by Any Stretch of the Imagination. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (HLT’11), pp. 309–319.

    Google Scholar 

  • Popescu, A. M., Nguyen, B., and Etzioni, O. (2005). Extracting Product Feature Sand Opinions from Reviews. Proceedings of HLT/EMNLP on Interactive Demonstrations, pp. 32–33.

    Google Scholar 

  • Praveena, M., and Jaiganesh, V. (2017). A Literature Review on Supervised Machine Learning Algorithms and Boosting Process. International Journal of Computer Applications, 169(8), 975–8887.

    Google Scholar 

  • Quan Wang, Beibei Li, and Param Vir Singh. (2018). Copycats vs. Original Mobile Apps: A Machine Learning Copycat-Detection Method and Empirical Analysis. Information Systems Research.

    Google Scholar 

  • Ren, Y., Ji, D., and Yin, L. (2014). Deceptive Reviews Detection Base don Semi-supervised Learning Algorithm. Journal of Sichuan University (Engineering Science Edition), 46(3), 62‒69.

    Google Scholar 

  • Wang, G., Xie, S., Liu, B., et al. (2011). Review Graph Based Online Store Review Spammer Detection. Proceedings of the 2011 IEEE 11th International Conference on Data Mining, pp. 1242–1247.

    Google Scholar 

  • Zhang, H., and Li, D. (2007). Naïve Bayes Text Classifier. 2007 IEEE International Conference on Granular Computing (GRC 2007), pp. 708–708. https://doi.org/10.1109/GrC.2007.40.

  • Zhao, J., and Wang, H. (2016). Detection of Fake Reviews Based on Emotional Orientation and Logistic Regression. CAAI Transactions on Intelligent Systems, 11 (3) , 336‒342.

    Google Scholar 

  • 袁禄. (2021). 虚假评论识别研究综述. 计算机科学, (1), 111–118.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qingquan Tony Zhang .

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Zhang, Q.T., Li, B., Xie, D. (2022). Fraud and Deception Detection: Text-Based Data Analytics. In: Alternative Data and Artificial Intelligence Techniques. Palgrave Studies in Risk and Insurance. Palgrave Macmillan, Cham. https://doi.org/10.1007/978-3-031-11612-4_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-11612-4_10

  • Published:

  • Publisher Name: Palgrave Macmillan, Cham

  • Print ISBN: 978-3-031-11611-7

  • Online ISBN: 978-3-031-11612-4

  • eBook Packages: Economics and FinanceEconomics and Finance (R0)

Publish with us

Policies and ethics