Social Spammer Detection Based on PSO-CatBoost

Li, Shupeng; Jiang, F.; Qin, Yunbai; Zheng, Kunkun

doi:10.1007/978-3-030-68851-6_28

Shupeng Li¹⁴,
F. Jiang¹⁴,
Yunbai Qin¹⁴ &
…
Kunkun Zheng¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12382))

Included in the following conference series:

International Conference on Security, Privacy and Anonymity in Computation, Communication and Storage

1 Citations

Abstract

With the rapid development of social networks, more and more organizations or individuals use social media to communicate with each other, passing on information and getting information, etc. However, while bringing convenience to people, social media has also become the main target of malicious attackers who try to take advantage of the system vulnerability and cause harm to other normal users, they obtain benefits mainly through sending false information, advertising links, phishing, etc. In this paper, firstly, we collect the features of spammers from the four views (profile, behavior, relationship, and interaction) for a more comprehensive analysis of spammers, secondly, we creatively combine the features of Particle Swarm Optimization (PSO) and CatBoost algorithm, and finally, we propose a novel PSO-CatBoost model based on the CatBoost model for detecting spammers. In order to validate the effectiveness of our proposed model, some ensemble learning algorithms are compared, and the experimental results show that our model outperforms other models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

PUED: A Social Spammer Detection Method Based on PU Learning and Ensemble Learning

LSSL-SSD: Social Spammer Detection with Laplacian Score and Semi-supervised Learning

Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning

References

Inuwa-Dutse, I., Liptrott, M., Korkontzelos, I.: Detection of spam-posting accounts on Twitter. Neurocomputing 315, 496–511 (2018)
Article Google Scholar
Fakhraei, S., Foulds, J., Shashanka, M., Getoor, L.: Collective spammer detection in evolving multi-relational social networks. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 1769–1778 (2015)
Google Scholar
Zheng, X., Zeng, Z., Chen, Z., Yu, Y., Rong, C.: Detecting spammers on social networks. Neurocomputing 159, 27–34 (2015)
Article Google Scholar
Li, Z., Zhang, X., Shen, H., Liang, W., He, Z.: A Semi-supervised framework for social spammer detection. In: Cao, T., Lim, E.-P., Zhou, Z.-H., Ho, T.-B., Cheung, D., Motoda, H. (eds.) PAKDD 2015. LNCS (LNAI), vol. 9078, pp. 177–188. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18032-8_14
Chapter Google Scholar
NexGate: State of Social Media Spam Research Report. Nexgate (2013)
Google Scholar
Liu, D., Mei, B., Chen, J., Lu, Z., Du, X.: Community based spammer detection in social networks. In: Dong, X.L., Yu, X., Li, J., Sun, Y. (eds.) WAIM 2015. LNCS, vol. 9098, pp. 554–558. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21042-1_61
Chapter Google Scholar
Ng, C.K., Jiang, F., Zhang, L.Y., Zhou, W.: Static malware clustering using enhanced deep embedding method. Concurrency Comput. Pract. Exp. 31(19), e5234 (2019)
Article Google Scholar
Wu, F., Shu, J., Huang, Y., Yuan, Z.: Co-detecting social spammers and spam messages in microblogging via exploiting social contexts. Neurocomputing 201, 51–65 (2016)
Article Google Scholar
Jiang, F., Dong, D., Cao, L., Frater, M.R.: Agent-based self-adaptable context-aware network vulnerability assessment. IEEE Trans. Netw. Serv. Manag. 10(3), 255–270 (2013)
Article Google Scholar
Wang, J., Li, H., Zhao, J.: Micro-blog spammer detection based on characteristics of social behaviors. In: 2017 8th IEEE International Conference on Software Engineering and Service Science (ICSESS), pp. 358–362 (2017)
Google Scholar
Dorogush, A.V., Ershov, V., Gulin, A.: CatBoost: gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363 (2018)
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A.V., Gulin, A.: CatBoost: unbiased boosting with categorical features. In: Advances in Neural Information Processing Systems (NIPS), pp. 6638–6648 (2018)
Google Scholar
Massaoudi, M., Refaat, S.S., Abu-Rub, H., Chihi, I., Wesleti, F.S.: A hybrid bayesian ridge regression-CWT-Catboost model for PV power forecasting. In: 2020 IEEE Kansas Power and Energy Conference (KPEC), pp. 1–5 (2020)
Google Scholar
Diao, L., Niu, D., Zang, Z., Chen, C.: Short-term weather forecast based on wavelet denoising and Catboost. In: 2019 Chinese Control Conference (CCC), pp. 3760–3764 (2019)
Google Scholar
Postnikov, E.B., Esmedljaeva, D.A., Lavrova, A.I.: A CatBoost machine learning for prognosis of pathogen’s drug resistance in pulmonary tuberculosis. In: 2020 IEEE 2nd Global Conference on Life Sciences and Technologies (LifeTech), pp. 86–87 (2020)
Google Scholar
Cao, C., Caverlee, J.: Detecting Spam URLs in Social media via behavioral analysis. In: Hanbury, A., Kazai, G., Rauber, A., Fuhr, N. (eds.) ECIR 2015. LNCS, vol. 9022, pp. 703–714. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16354-3_77
Chapter Google Scholar
Soiraya, M., Thanalerdmongkol, S., Chantrapornchai, C.: Using a data mining approach: spam detection on Facebook. Int. J. Comput. Appl. 58(13) (2012)
Google Scholar
McCord, M., Chuah, M.: Spam detection on Twitter using traditional classifiers. In: Calero, J.M.A., Yang, L.T., Mármol, F.G., García Villalba, L.J., Li, A.X., Wang, Y. (eds.) ATC 2011. LNCS, vol. 6906, pp. 175–186. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23496-5_13
Chapter Google Scholar
Lee, K., Caverlee, J., Webb, S.: Uncovering social spammers: social honeypots+ machine learning. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 435–442 (2010)
Google Scholar
Chen, C., Wen, S., Zhang, J., Xiang, Y., Oliver, J.: Investigating the deceptive information in Twitter spam. Future Gener. Comput. Syst. 72, 319–326 (2017)
Article Google Scholar
Yang, X., Yang, Q., Wilson, C.: Penny for your thoughts: searching for the 50 cent party on Sina Weibo. In: The 9th International AAAI Conference on Web and Social Media (ICWSM), pp. 694–697 (2015)
Google Scholar
Hu, X., Tang, J., Liu, H.: Leveraging knowledge across media for spammer detection in microblogging. In: Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 547–556 (2014)
Google Scholar
Yu, D., Chen, N., Jiang, F., Fu, B., Qin, A.: Constrained NMF-based semi-supervised learning for social media spammer detection. Knowl.-Based Syst. 125, 64–73 (2017)
Article Google Scholar
Sohrabi, M.K., Karimi, F.: A feature selection approach to detect spam in the Facebook social network. Arab. J. Sci. Eng. 43(2), 949–958 (2018)
Article Google Scholar
Aslan, Ç.B., Sağlam, R.B., Li, S.: Automatic detection of cyber security related accounts on online social networks: Twitter as an example. In: Proceedings of the 9th International Conference on Social Media and Society (SMSociety 2018), pp. 236–240 (2018)
Google Scholar
Agarwal, B., Mittal, N.: Comparative study of feature reduction and machine learning methods for spam detection. In: Babu, B.V., Nagar, A., Deep, K., Pant, M., Bansal, J.C., Ray, K., Gupta, U. (eds.) Proceedings of the Second International Conference on Soft Computing for Problem Solving (SocProS 2012), December 28-30, 2012. AISC, vol. 236, pp. 761–769. Springer, New Delhi (2014). https://doi.org/10.1007/978-81-322-1602-5_81
Chapter Google Scholar
Mussa, D.J., Jameel, N.G.M.: Relevant SMS spam feature selection using wrapper approach and XGBoost algorithm. Kurdistan J. Appl. Res. 4(2), 110–120 (2019)
Article Google Scholar
Jiang, F., Xia, H., Tran, Q.A., Ha, Q.M., Tran, N.Q., Hu, J.: A new binary hybrid particle swarm optimization with wavelet mutation. Knowl.-Based Syst. 130, 90–101 (2017)
Article Google Scholar
Zhang, Y., Wang, S., Wu, L.: Spam detection via feature selection and decision tree. Adv. Sci. Lett. 5(2), 726–730 (2012)
Article Google Scholar
Al Daoud, E.: Comparison between XGBoost, LightGBM and CatBoost using a home credit dataset. Int. J. Comput. Inf. Eng. 13(1), 6–10 (2019)
Google Scholar
Wang, D., Tan, D., Liu, L.: Particle swarm optimization algorithm: an overview. Soft. Comput. 22(2), 387–408 (2017). https://doi.org/10.1007/s00500-016-2474-6
Article Google Scholar

Download references

Acknowledgments

This work was supported by National Natural Science Foundation of China (61762018), the Guangxi 100 Youth Talent Program (F-KA16016) and the Colleges and Universities Key Laboratory of Intelligent Integrated Automation, Guilin University of Electronic Technology, China (GXZDSY2016-03),the research funding of Guangxi Key Lab of Multi-source Information Mining & Security (18-A-02-02), Natural Science Foundation of Guangxi (2018GXNSFAA281310), the Guangxi Key Research and Development Funding (2019AB35004). This work was supported in part by the Innovation special project of Zhongshan Science and Technology Bureau under Grant 2019AG001.

Author information

Authors and Affiliations

College of Electronics and Engineering, Guangxi Normal University, Guilin, 541004, China
Shupeng Li, F. Jiang & Yunbai Qin
Guangdong Aohong Technology Co., LTD., Zhongshan, 528437, China
Kunkun Zheng

Authors

Shupeng Li
View author publications
You can also search for this author in PubMed Google Scholar
F. Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Yunbai Qin
View author publications
You can also search for this author in PubMed Google Scholar
Kunkun Zheng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to F. Jiang or Yunbai Qin .

Editor information

Editors and Affiliations

Guangzhou University, Guangzhou, China
Guojun Wang
Aeronautics and Astronautics, Nanjing University, Nanjing, China
Bing Chen
Computer Science, Georgia State University, Atlanta, GA, USA
Wei Li
College of Science and Engineering, Qatar Foundation Education City, Doha, Qatar
Roberto Di Pietro
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Xuefeng Yan
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Hao Han

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, S., Jiang, F., Qin, Y., Zheng, K. (2021). Social Spammer Detection Based on PSO-CatBoost. In: Wang, G., Chen, B., Li, W., Di Pietro, R., Yan, X., Han, H. (eds) Security, Privacy, and Anonymity in Computation, Communication, and Storage. SpaCCS 2020. Lecture Notes in Computer Science(), vol 12382. Springer, Cham. https://doi.org/10.1007/978-3-030-68851-6_28

Download citation

DOI: https://doi.org/10.1007/978-3-030-68851-6_28
Published: 05 February 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-68850-9
Online ISBN: 978-3-030-68851-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Social Spammer Detection Based on PSO-CatBoost

Abstract

Access this chapter

Similar content being viewed by others

PUED: A Social Spammer Detection Method Based on PU Learning and Ensemble Learning

LSSL-SSD: Social Spammer Detection with Laplacian Score and Semi-supervised Learning

Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Social Spammer Detection Based on PSO-CatBoost

Abstract

Access this chapter

Similar content being viewed by others

PUED: A Social Spammer Detection Method Based on PU Learning and Ensemble Learning

LSSL-SSD: Social Spammer Detection with Laplacian Score and Semi-supervised Learning

Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation