Abstract
With the rapid development of social networks, more and more organizations or individuals use social media to communicate with each other, passing on information and getting information, etc. However, while bringing convenience to people, social media has also become the main target of malicious attackers who try to take advantage of the system vulnerability and cause harm to other normal users, they obtain benefits mainly through sending false information, advertising links, phishing, etc. In this paper, firstly, we collect the features of spammers from the four views (profile, behavior, relationship, and interaction) for a more comprehensive analysis of spammers, secondly, we creatively combine the features of Particle Swarm Optimization (PSO) and CatBoost algorithm, and finally, we propose a novel PSO-CatBoost model based on the CatBoost model for detecting spammers. In order to validate the effectiveness of our proposed model, some ensemble learning algorithms are compared, and the experimental results show that our model outperforms other models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Inuwa-Dutse, I., Liptrott, M., Korkontzelos, I.: Detection of spam-posting accounts on Twitter. Neurocomputing 315, 496–511 (2018)
Fakhraei, S., Foulds, J., Shashanka, M., Getoor, L.: Collective spammer detection in evolving multi-relational social networks. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 1769–1778 (2015)
Zheng, X., Zeng, Z., Chen, Z., Yu, Y., Rong, C.: Detecting spammers on social networks. Neurocomputing 159, 27–34 (2015)
Li, Z., Zhang, X., Shen, H., Liang, W., He, Z.: A Semi-supervised framework for social spammer detection. In: Cao, T., Lim, E.-P., Zhou, Z.-H., Ho, T.-B., Cheung, D., Motoda, H. (eds.) PAKDD 2015. LNCS (LNAI), vol. 9078, pp. 177–188. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18032-8_14
NexGate: State of Social Media Spam Research Report. Nexgate (2013)
Liu, D., Mei, B., Chen, J., Lu, Z., Du, X.: Community based spammer detection in social networks. In: Dong, X.L., Yu, X., Li, J., Sun, Y. (eds.) WAIM 2015. LNCS, vol. 9098, pp. 554–558. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21042-1_61
Ng, C.K., Jiang, F., Zhang, L.Y., Zhou, W.: Static malware clustering using enhanced deep embedding method. Concurrency Comput. Pract. Exp. 31(19), e5234 (2019)
Wu, F., Shu, J., Huang, Y., Yuan, Z.: Co-detecting social spammers and spam messages in microblogging via exploiting social contexts. Neurocomputing 201, 51–65 (2016)
Jiang, F., Dong, D., Cao, L., Frater, M.R.: Agent-based self-adaptable context-aware network vulnerability assessment. IEEE Trans. Netw. Serv. Manag. 10(3), 255–270 (2013)
Wang, J., Li, H., Zhao, J.: Micro-blog spammer detection based on characteristics of social behaviors. In: 2017 8th IEEE International Conference on Software Engineering and Service Science (ICSESS), pp. 358–362 (2017)
Dorogush, A.V., Ershov, V., Gulin, A.: CatBoost: gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363 (2018)
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A.V., Gulin, A.: CatBoost: unbiased boosting with categorical features. In: Advances in Neural Information Processing Systems (NIPS), pp. 6638–6648 (2018)
Massaoudi, M., Refaat, S.S., Abu-Rub, H., Chihi, I., Wesleti, F.S.: A hybrid bayesian ridge regression-CWT-Catboost model for PV power forecasting. In: 2020 IEEE Kansas Power and Energy Conference (KPEC), pp. 1–5 (2020)
Diao, L., Niu, D., Zang, Z., Chen, C.: Short-term weather forecast based on wavelet denoising and Catboost. In: 2019 Chinese Control Conference (CCC), pp. 3760–3764 (2019)
Postnikov, E.B., Esmedljaeva, D.A., Lavrova, A.I.: A CatBoost machine learning for prognosis of pathogen’s drug resistance in pulmonary tuberculosis. In: 2020 IEEE 2nd Global Conference on Life Sciences and Technologies (LifeTech), pp. 86–87 (2020)
Cao, C., Caverlee, J.: Detecting Spam URLs in Social media via behavioral analysis. In: Hanbury, A., Kazai, G., Rauber, A., Fuhr, N. (eds.) ECIR 2015. LNCS, vol. 9022, pp. 703–714. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16354-3_77
Soiraya, M., Thanalerdmongkol, S., Chantrapornchai, C.: Using a data mining approach: spam detection on Facebook. Int. J. Comput. Appl. 58(13) (2012)
McCord, M., Chuah, M.: Spam detection on Twitter using traditional classifiers. In: Calero, J.M.A., Yang, L.T., Mármol, F.G., GarcÃa Villalba, L.J., Li, A.X., Wang, Y. (eds.) ATC 2011. LNCS, vol. 6906, pp. 175–186. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23496-5_13
Lee, K., Caverlee, J., Webb, S.: Uncovering social spammers: social honeypots+ machine learning. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 435–442 (2010)
Chen, C., Wen, S., Zhang, J., Xiang, Y., Oliver, J.: Investigating the deceptive information in Twitter spam. Future Gener. Comput. Syst. 72, 319–326 (2017)
Yang, X., Yang, Q., Wilson, C.: Penny for your thoughts: searching for the 50 cent party on Sina Weibo. In: The 9th International AAAI Conference on Web and Social Media (ICWSM), pp. 694–697 (2015)
Hu, X., Tang, J., Liu, H.: Leveraging knowledge across media for spammer detection in microblogging. In: Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 547–556 (2014)
Yu, D., Chen, N., Jiang, F., Fu, B., Qin, A.: Constrained NMF-based semi-supervised learning for social media spammer detection. Knowl.-Based Syst. 125, 64–73 (2017)
Sohrabi, M.K., Karimi, F.: A feature selection approach to detect spam in the Facebook social network. Arab. J. Sci. Eng. 43(2), 949–958 (2018)
Aslan, Ç.B., Sağlam, R.B., Li, S.: Automatic detection of cyber security related accounts on online social networks: Twitter as an example. In: Proceedings of the 9th International Conference on Social Media and Society (SMSociety 2018), pp. 236–240 (2018)
Agarwal, B., Mittal, N.: Comparative study of feature reduction and machine learning methods for spam detection. In: Babu, B.V., Nagar, A., Deep, K., Pant, M., Bansal, J.C., Ray, K., Gupta, U. (eds.) Proceedings of the Second International Conference on Soft Computing for Problem Solving (SocProS 2012), December 28-30, 2012. AISC, vol. 236, pp. 761–769. Springer, New Delhi (2014). https://doi.org/10.1007/978-81-322-1602-5_81
Mussa, D.J., Jameel, N.G.M.: Relevant SMS spam feature selection using wrapper approach and XGBoost algorithm. Kurdistan J. Appl. Res. 4(2), 110–120 (2019)
Jiang, F., Xia, H., Tran, Q.A., Ha, Q.M., Tran, N.Q., Hu, J.: A new binary hybrid particle swarm optimization with wavelet mutation. Knowl.-Based Syst. 130, 90–101 (2017)
Zhang, Y., Wang, S., Wu, L.: Spam detection via feature selection and decision tree. Adv. Sci. Lett. 5(2), 726–730 (2012)
Al Daoud, E.: Comparison between XGBoost, LightGBM and CatBoost using a home credit dataset. Int. J. Comput. Inf. Eng. 13(1), 6–10 (2019)
Wang, D., Tan, D., Liu, L.: Particle swarm optimization algorithm: an overview. Soft. Comput. 22(2), 387–408 (2017). https://doi.org/10.1007/s00500-016-2474-6
Acknowledgments
This work was supported by National Natural Science Foundation of China (61762018), the Guangxi 100 Youth Talent Program (F-KA16016) and the Colleges and Universities Key Laboratory of Intelligent Integrated Automation, Guilin University of Electronic Technology, China (GXZDSY2016-03),the research funding of Guangxi Key Lab of Multi-source Information Mining & Security (18-A-02-02), Natural Science Foundation of Guangxi (2018GXNSFAA281310), the Guangxi Key Research and Development Funding (2019AB35004). This work was supported in part by the Innovation special project of Zhongshan Science and Technology Bureau under Grant 2019AG001.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, S., Jiang, F., Qin, Y., Zheng, K. (2021). Social Spammer Detection Based on PSO-CatBoost. In: Wang, G., Chen, B., Li, W., Di Pietro, R., Yan, X., Han, H. (eds) Security, Privacy, and Anonymity in Computation, Communication, and Storage. SpaCCS 2020. Lecture Notes in Computer Science(), vol 12382. Springer, Cham. https://doi.org/10.1007/978-3-030-68851-6_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-68851-6_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-68850-9
Online ISBN: 978-3-030-68851-6
eBook Packages: Computer ScienceComputer Science (R0)