Dynamic Feature Selection for Spam Detection in Twitter

Karakaşlı, M. Salih; Aydin, Muhammed Ali; Yarkan, Serhan; Boyaci, Ali

doi:10.1007/978-981-13-0408-8_20

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 504)

Abstract

Social networks continue to grow in popularity. As Internet access has become widespread, interest in social networks has increased significantly. This popularity also makes social media platforms attractive targets for misuse: malicious actors attempt to gain unfair profit through fake accounts and various other techniques. Among these, spam is one of the most frequently used methods. Spam attacks on social networks are increasing, and many users are exposed to them and to similar attacks. Identifying spam users among billions of social network users requires examining massive amounts of data, which poses a challenging large-scale data analysis problem. In this study, we group similar Twitter users and introduce a dynamic feature selection technique that uses a different feature set for each user group, rather than a single static feature set, and we apply machine learning algorithms to classify spam users on Twitter.
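The pipeline outlined in the abstract (group users, select features per group, then classify) can be illustrated with a minimal Python sketch. This is not the chapter's implementation: the abstract does not name the clustering algorithm, the selection criterion, the classifier, or the features, so everything here is an assumption made for illustration. Plain k-means stands in for the grouping step, a simple class-separation score stands in for feature selection, a nearest-class-mean rule stands in for the machine learning classifier, and the three features (`follower_ratio`, `urls_per_tweet`, `mentions_per_tweet`) and the toy data are hypothetical.

```python
from statistics import mean, pstdev

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def assign(row, centroids):
    """Index of the centroid closest to row."""
    return min(range(len(centroids)), key=lambda c: sq_dist(row, centroids[c]))

def kmeans(rows, k, iters=10):
    """Plain k-means with deterministic farthest-point initialisation."""
    centroids = [list(rows[0])]
    while len(centroids) < k:
        far = max(rows, key=lambda r: min(sq_dist(r, c) for c in centroids))
        centroids.append(list(far))
    for _ in range(iters):
        labels = [assign(r, centroids) for r in rows]
        for c in range(k):
            members = [r for r, l in zip(rows, labels) if l == c]
            if members:
                centroids[c] = [mean(col) for col in zip(*members)]
    return centroids, [assign(r, centroids) for r in rows]

def select_features(rows, y, n_keep):
    """Rank features by |mean(spam) - mean(ham)| / spread within one group."""
    spam = [r for r, t in zip(rows, y) if t == 1]
    ham = [r for r, t in zip(rows, y) if t == 0]
    if not spam or not ham:
        return list(range(n_keep))  # single-class group: nothing to rank
    scores = []
    for j in range(len(rows[0])):
        gap = abs(mean(r[j] for r in spam) - mean(r[j] for r in ham))
        scores.append(gap / (pstdev(r[j] for r in rows) + 1e-9))
    return sorted(range(len(scores)), key=lambda j: -scores[j])[:n_keep]

def fit(rows, y, k=2, n_keep=1):
    """Cluster users, then keep a feature subset and class means per cluster."""
    centroids, labels = kmeans(rows, k)
    models = []
    for c in range(k):
        grp = [(r, t) for r, t, l in zip(rows, y, labels) if l == c]
        if not grp:
            continue
        g_rows, g_y = zip(*grp)
        feats = select_features(g_rows, g_y, n_keep)
        class_means = {}
        for cls in set(g_y):
            sel = [[r[j] for j in feats] for r, t in grp if t == cls]
            class_means[cls] = [mean(col) for col in zip(*sel)]
        models.append((centroids[c], feats, class_means))
    return models

def predict(row, models):
    """Route the user to the nearest group, then use that group's features."""
    _, feats, class_means = min(models, key=lambda m: sq_dist(row, m[0]))
    x = [row[j] for j in feats]
    return min(class_means, key=lambda cls: sq_dist(x, class_means[cls]))

# Hypothetical toy data, scaled to [0, 1]:
# [follower_ratio, urls_per_tweet, mentions_per_tweet]; label 1 = spam.
ROWS = [
    [0.10, 0.10, 0.50], [0.12, 0.15, 0.40], [0.08, 0.10, 0.60],  # group A, ham
    [0.10, 0.90, 0.50], [0.11, 0.85, 0.45], [0.09, 0.95, 0.55],  # group A, spam
    [0.90, 0.50, 0.10], [0.92, 0.40, 0.15], [0.88, 0.60, 0.10],  # group B, ham
    [0.90, 0.50, 0.90], [0.91, 0.45, 0.85], [0.89, 0.55, 0.95],  # group B, spam
]
LABELS = [0, 0, 0, 1, 1, 1, 0, 0, 0, 1, 1, 1]
MODELS = fit(ROWS, LABELS)

for user in ([0.1, 0.9, 0.5], [0.9, 0.9, 0.1]):
    print(user, "->", "spam" if predict(user, MODELS) else "not spam")
```

In this toy setup the two groups end up with different selected features (URL rate for the first, mention rate for the second), so the second test user is labeled as not spam despite a high URL rate, because URL rate is not the selected feature for that user's group. A static feature set would apply the same features to every user regardless of group.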

Acknowledgement

This work is part of the M.Sc. thesis titled "Big Data Analysis in Social Media" at Istanbul University, Department of Computer Engineering.

Author information

Authors and Affiliations

Istanbul University, Istanbul, Turkey
M. Salih Karakaşlı & Muhammed Ali Aydin
Center for Applied Research on Informatics Technologies, Istanbul Commerce University, Istanbul, Turkey
Serhan Yarkan
Istanbul Commerce University, Istanbul, Turkey
Ali Boyaci

Corresponding author

Correspondence to Serhan Yarkan.

Editor information

Editors and Affiliations

Department of Electrical-Electronics Engineering, Istanbul Commerce University, Istanbul, Turkey
Ali Boyaci
Department of Information Technology, Balıkesir University, Balıkesir, Turkey
Ali Riza Ekti
Computer Engineering Department, Istanbul University, Istanbul, Turkey
Muhammed Ali Aydin
Istanbul Commerce University, Istanbul, Turkey
Serhan Yarkan

About this paper

Cite this paper

Karakaşlı, M.S., Aydin, M.A., Yarkan, S., Boyaci, A. (2019). Dynamic Feature Selection for Spam Detection in Twitter. In: Boyaci, A., Ekti, A., Aydin, M., Yarkan, S. (eds) International Telecommunications Conference. Lecture Notes in Electrical Engineering, vol 504. Springer, Singapore. https://doi.org/10.1007/978-981-13-0408-8_20

DOI: https://doi.org/10.1007/978-981-13-0408-8_20
Published: 06 July 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0407-1
Online ISBN: 978-981-13-0408-8
eBook Packages: Engineering, Engineering (R0)
