Abstract
This paper focuses on Taobao cheater detection. At present the phenomenon of fake trading is widespread in Taobao, which makes it difficult for consumers to distinguish between true and fake product reviews. To solve this problem, we collect a total number of 50,285 historical review data from 100 cheaters and 100 real buyers to create a dataset. By using these data, we extract 8 features from three dimensions that are reviewer, commodity, and review. Then we use the SVM algorithm to construct the classification model and choose the RFB kernel function, which has a better performance to identify the cheater. The precision of the final classification model we built to identify the cheater reaches up to 89%. The experimental result shows that extracting features from the historical review data can recognize the cheaters effectively. It can be applied to the recognition of the cheaters in Taobao.
Chapter PDF
References
China Internet Network Information Center. The 33rd Statistical Report on Internet Development in China, pp. 38–39 (2014)
Jindal, N., Liu, B., Lim, E.-P.: Finding unusual review patterns using unexpected rules. In: CIKM 2010, pp. 1549–1552. ACM, New York (2010)
Wu, G., Greene, D., Smyth, B., Cunningham, P.: Distortion as a validation criterion in the identification of suspicious reviews. In: Proceedings of the First Workshop on Social Media Analytics, SOMA 2010, pp. 10–13. ACM, New York (2010)
Xie, S., Wang, G., Lin, S., Yu, P.S.: Review spam detection via temporal pattern discovery. In: KDD, pp. 823–831 (2012)
Wang, G., Xie, S., Liu, B., Yu, P.S.: Identify Online Store Review Spammers via Social Review Graph. Journal of ACM Transactions on Intelligent Systems and Technology (TIST) 3(4) (September 2012)
Mukherjee, A., Liu, B., Wang, J., Glance, N., Jindal, N.: Detecting Group Review Spam. In: Proceedings of the 20th International Conference on World Wide Web. ACM (2011)
Mukherjee, A., Liu, B., Glance, N.: Spotting fake reviewer groups in consumer reviews. In: Proceedings of the 21st International Conference on World Wide Web. ACM (2012)
Zhang, R.: Exploiting shopping and reviewing behavior to re-score online evaluations. In: Proceedings of the 21st International Conference Companion on World Wide Web (2012)
Lim, E.-P., Nguyen, V.-A., Jindal, N., Liu, B., Lauw, H.: Detecting Product Review Spammers using Rating Behaviors. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management (2010)
Mukherjee, A., Kumar, A., Liu, B., et al.: Spotting opinion spammers using behavioral footprints. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2013)
Mukherjee, A., Liu, B., Glance, N.: Spotting fake reviewer groups in consumer reviews. In: Proceedings of the 21st International Conference on World Wide Web. ACM (2012)
Mukherjee, A., Venkataraman, V., Liu, B., et al.: What Yelp Fake Review Filter Might Be Doing? In: ICWSM (2013)
Liu, J., Cao, Y., Lin, C.Y., Huang, Y., Zhou, M.: Low-Quality Product Review Detection in Opinion Summarization. In: EMNLP-CoNLL, pp. 334–342 (2007)
Vapnik, V.N.: An overview of statistical learning theory. IEEE Transactions on Neural Networks 10(5), 988–999 (1999)
Yang, Y.: An evaluation of statistical approaches to text categorization. Information Retrieval Journal (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 IFIP International Federation for Information Processing
About this paper
Cite this paper
Dong, B., Liu, Q., Fu, Y., Zhang, L. (2014). A Research of Taobao Cheater Detection. In: Li, H., Mäntymäki, M., Zhang, X. (eds) Digital Services and Information Intelligence. I3E 2014. IFIP Advances in Information and Communication Technology, vol 445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45526-5_31
Download citation
DOI: https://doi.org/10.1007/978-3-662-45526-5_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45525-8
Online ISBN: 978-3-662-45526-5
eBook Packages: Computer ScienceComputer Science (R0)