Skip to main content

Social Spammer Detection Based on PSO-CatBoost

  • Conference paper
  • First Online:
Security, Privacy, and Anonymity in Computation, Communication, and Storage (SpaCCS 2020)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12382))

Abstract

With the rapid development of social networks, more and more organizations or individuals use social media to communicate with each other, passing on information and getting information, etc. However, while bringing convenience to people, social media has also become the main target of malicious attackers who try to take advantage of the system vulnerability and cause harm to other normal users, they obtain benefits mainly through sending false information, advertising links, phishing, etc. In this paper, firstly, we collect the features of spammers from the four views (profile, behavior, relationship, and interaction) for a more comprehensive analysis of spammers, secondly, we creatively combine the features of Particle Swarm Optimization (PSO) and CatBoost algorithm, and finally, we propose a novel PSO-CatBoost model based on the CatBoost model for detecting spammers. In order to validate the effectiveness of our proposed model, some ensemble learning algorithms are compared, and the experimental results show that our model outperforms other models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Inuwa-Dutse, I., Liptrott, M., Korkontzelos, I.: Detection of spam-posting accounts on Twitter. Neurocomputing 315, 496–511 (2018)

    Article  Google Scholar 

  2. Fakhraei, S., Foulds, J., Shashanka, M., Getoor, L.: Collective spammer detection in evolving multi-relational social networks. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 1769–1778 (2015)

    Google Scholar 

  3. Zheng, X., Zeng, Z., Chen, Z., Yu, Y., Rong, C.: Detecting spammers on social networks. Neurocomputing 159, 27–34 (2015)

    Article  Google Scholar 

  4. Li, Z., Zhang, X., Shen, H., Liang, W., He, Z.: A Semi-supervised framework for social spammer detection. In: Cao, T., Lim, E.-P., Zhou, Z.-H., Ho, T.-B., Cheung, D., Motoda, H. (eds.) PAKDD 2015. LNCS (LNAI), vol. 9078, pp. 177–188. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18032-8_14

    Chapter  Google Scholar 

  5. NexGate: State of Social Media Spam Research Report. Nexgate (2013)

    Google Scholar 

  6. Liu, D., Mei, B., Chen, J., Lu, Z., Du, X.: Community based spammer detection in social networks. In: Dong, X.L., Yu, X., Li, J., Sun, Y. (eds.) WAIM 2015. LNCS, vol. 9098, pp. 554–558. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21042-1_61

    Chapter  Google Scholar 

  7. Ng, C.K., Jiang, F., Zhang, L.Y., Zhou, W.: Static malware clustering using enhanced deep embedding method. Concurrency Comput. Pract. Exp. 31(19), e5234 (2019)

    Article  Google Scholar 

  8. Wu, F., Shu, J., Huang, Y., Yuan, Z.: Co-detecting social spammers and spam messages in microblogging via exploiting social contexts. Neurocomputing 201, 51–65 (2016)

    Article  Google Scholar 

  9. Jiang, F., Dong, D., Cao, L., Frater, M.R.: Agent-based self-adaptable context-aware network vulnerability assessment. IEEE Trans. Netw. Serv. Manag. 10(3), 255–270 (2013)

    Article  Google Scholar 

  10. Wang, J., Li, H., Zhao, J.: Micro-blog spammer detection based on characteristics of social behaviors. In: 2017 8th IEEE International Conference on Software Engineering and Service Science (ICSESS), pp. 358–362 (2017)

    Google Scholar 

  11. Dorogush, A.V., Ershov, V., Gulin, A.: CatBoost: gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363 (2018)

  12. Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A.V., Gulin, A.: CatBoost: unbiased boosting with categorical features. In: Advances in Neural Information Processing Systems (NIPS), pp. 6638–6648 (2018)

    Google Scholar 

  13. Massaoudi, M., Refaat, S.S., Abu-Rub, H., Chihi, I., Wesleti, F.S.: A hybrid bayesian ridge regression-CWT-Catboost model for PV power forecasting. In: 2020 IEEE Kansas Power and Energy Conference (KPEC), pp. 1–5 (2020)

    Google Scholar 

  14. Diao, L., Niu, D., Zang, Z., Chen, C.: Short-term weather forecast based on wavelet denoising and Catboost. In: 2019 Chinese Control Conference (CCC), pp. 3760–3764 (2019)

    Google Scholar 

  15. Postnikov, E.B., Esmedljaeva, D.A., Lavrova, A.I.: A CatBoost machine learning for prognosis of pathogen’s drug resistance in pulmonary tuberculosis. In: 2020 IEEE 2nd Global Conference on Life Sciences and Technologies (LifeTech), pp. 86–87 (2020)

    Google Scholar 

  16. Cao, C., Caverlee, J.: Detecting Spam URLs in Social media via behavioral analysis. In: Hanbury, A., Kazai, G., Rauber, A., Fuhr, N. (eds.) ECIR 2015. LNCS, vol. 9022, pp. 703–714. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16354-3_77

    Chapter  Google Scholar 

  17. Soiraya, M., Thanalerdmongkol, S., Chantrapornchai, C.: Using a data mining approach: spam detection on Facebook. Int. J. Comput. Appl. 58(13) (2012)

    Google Scholar 

  18. McCord, M., Chuah, M.: Spam detection on Twitter using traditional classifiers. In: Calero, J.M.A., Yang, L.T., Mármol, F.G., García Villalba, L.J., Li, A.X., Wang, Y. (eds.) ATC 2011. LNCS, vol. 6906, pp. 175–186. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23496-5_13

    Chapter  Google Scholar 

  19. Lee, K., Caverlee, J., Webb, S.: Uncovering social spammers: social honeypots+ machine learning. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 435–442 (2010)

    Google Scholar 

  20. Chen, C., Wen, S., Zhang, J., Xiang, Y., Oliver, J.: Investigating the deceptive information in Twitter spam. Future Gener. Comput. Syst. 72, 319–326 (2017)

    Article  Google Scholar 

  21. Yang, X., Yang, Q., Wilson, C.: Penny for your thoughts: searching for the 50 cent party on Sina Weibo. In: The 9th International AAAI Conference on Web and Social Media (ICWSM), pp. 694–697 (2015)

    Google Scholar 

  22. Hu, X., Tang, J., Liu, H.: Leveraging knowledge across media for spammer detection in microblogging. In: Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 547–556 (2014)

    Google Scholar 

  23. Yu, D., Chen, N., Jiang, F., Fu, B., Qin, A.: Constrained NMF-based semi-supervised learning for social media spammer detection. Knowl.-Based Syst. 125, 64–73 (2017)

    Article  Google Scholar 

  24. Sohrabi, M.K., Karimi, F.: A feature selection approach to detect spam in the Facebook social network. Arab. J. Sci. Eng. 43(2), 949–958 (2018)

    Article  Google Scholar 

  25. Aslan, Ç.B., Sağlam, R.B., Li, S.: Automatic detection of cyber security related accounts on online social networks: Twitter as an example. In: Proceedings of the 9th International Conference on Social Media and Society (SMSociety 2018), pp. 236–240 (2018)

    Google Scholar 

  26. Agarwal, B., Mittal, N.: Comparative study of feature reduction and machine learning methods for spam detection. In: Babu, B.V., Nagar, A., Deep, K., Pant, M., Bansal, J.C., Ray, K., Gupta, U. (eds.) Proceedings of the Second International Conference on Soft Computing for Problem Solving (SocProS 2012), December 28-30, 2012. AISC, vol. 236, pp. 761–769. Springer, New Delhi (2014). https://doi.org/10.1007/978-81-322-1602-5_81

    Chapter  Google Scholar 

  27. Mussa, D.J., Jameel, N.G.M.: Relevant SMS spam feature selection using wrapper approach and XGBoost algorithm. Kurdistan J. Appl. Res. 4(2), 110–120 (2019)

    Article  Google Scholar 

  28. Jiang, F., Xia, H., Tran, Q.A., Ha, Q.M., Tran, N.Q., Hu, J.: A new binary hybrid particle swarm optimization with wavelet mutation. Knowl.-Based Syst. 130, 90–101 (2017)

    Article  Google Scholar 

  29. Zhang, Y., Wang, S., Wu, L.: Spam detection via feature selection and decision tree. Adv. Sci. Lett. 5(2), 726–730 (2012)

    Article  Google Scholar 

  30. Al Daoud, E.: Comparison between XGBoost, LightGBM and CatBoost using a home credit dataset. Int. J. Comput. Inf. Eng. 13(1), 6–10 (2019)

    Google Scholar 

  31. Wang, D., Tan, D., Liu, L.: Particle swarm optimization algorithm: an overview. Soft. Comput. 22(2), 387–408 (2017). https://doi.org/10.1007/s00500-016-2474-6

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by National Natural Science Foundation of China (61762018), the Guangxi 100 Youth Talent Program (F-KA16016) and the Colleges and Universities Key Laboratory of Intelligent Integrated Automation, Guilin University of Electronic Technology, China (GXZDSY2016-03),the research funding of Guangxi Key Lab of Multi-source Information Mining & Security (18-A-02-02), Natural Science Foundation of Guangxi (2018GXNSFAA281310), the Guangxi Key Research and Development Funding (2019AB35004). This work was supported in part by the Innovation special project of Zhongshan Science and Technology Bureau under Grant 2019AG001.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to F. Jiang or Yunbai Qin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, S., Jiang, F., Qin, Y., Zheng, K. (2021). Social Spammer Detection Based on PSO-CatBoost. In: Wang, G., Chen, B., Li, W., Di Pietro, R., Yan, X., Han, H. (eds) Security, Privacy, and Anonymity in Computation, Communication, and Storage. SpaCCS 2020. Lecture Notes in Computer Science(), vol 12382. Springer, Cham. https://doi.org/10.1007/978-3-030-68851-6_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-68851-6_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-68850-9

  • Online ISBN: 978-3-030-68851-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics