Abstract
As social media platforms like Twitter continue to evolve, the proliferation of spam content has become a pressing issue, undermining the credibility of shared messages. Traditional spam detection methods, such as black-and-white listing and rule-based learning techniques, struggle to efficiently handle large datasets and adapt to dynamic environments. To address these challenges, we propose a novel spam detection model that leverages generative learning techniques, offering improved performance on vast datasets and changing circumstances. Using a substantial Twitter dataset with an 80% training and 20% testing split, our innovative model demonstrates remarkable effectiveness. Experimental results show a G-Loss score of 8.1207, significantly outperforming the D-Loss score of 0.0081, indicating the model’s exceptional accuracy and low error rate. Consequently, our groundbreaking approach emerges as a highly promising solution for real-world spam identification, raising the bar for spam detection research.
Similar content being viewed by others
Data availability
The dataset comes from the NSClab/Resources Twitter Spam. (http://nsclab.org/nsclab/resources/).
References
Elmendili F, Idrissi YEBE (2020) A framework for spam detection in twitter based on recommendation system. Int J Intell Eng Syst. https://doi.org/10.22266/ijies2020.1031.09
Inuwa-Dutse I, Liptrott M, Korkontzelos I (2018) Detection of spam-posting accounts on Twitter. Neurocomputing. https://doi.org/10.1016/j.neucom.2018.07.044
Wu T et al (2017) Detecting spamming activities in twitter based on deep-learning technique. Concurrency Computat. https://doi.org/10.1002/cpe.4209
Fazil M, Abulaish M (2018) A Hybrid Approach for Detecting Automated Spammers in Twitter. IEEE Transact Informat Forensics Sec. https://doi.org/10.1109/TIFS.2018.2825958
Karakaşlı MS, Aydin MA, Yarkan S, Boyaci A (2019) Dynamic feature selection for spam detection in twitter. Lecture Notes Elect Eng. https://doi.org/10.1007/978-981-13-0408-8_20
Li C, Liu S (2018) A comparative study of the class imbalance problem in Twitter spam detection. Concurr Computat. https://doi.org/10.1002/cpe.4281
Çıtlak O, Dörterler M, Doğru İA (2019) A survey on detecting spam accounts on Twitter network. Soc Netw Anal Min 9(1):35. https://doi.org/10.1007/s13278-019-0582-x
Zheng X, Zeng Z, Chen Z, Yu Y, Rong C (2015) Detecting spammers on social networks. Neurocomputing. https://doi.org/10.1016/j.neucom.2015.02.047
Yurtseven I, Bagriyanik S, Ayvaz S (2021) “A Review of Spam Detection in Social Media,” In: Proceedings - 6th International Conference on Computer Science and Engineering, UBMK 2021. doi: https://doi.org/10.1109/UBMK52708.2021.9558993.
Raza M, Jayasinghe ND, Muslam MMA (2021) A comprehensive review on email spam classification using machine learning algorithms. Int Conf Informat Netw. https://doi.org/10.1109/ICOIN50884.2021.9334020
Bhuvaneshwari P, Rao AN, Robinson YH (2021) Spam review detection using self attention based CNN and bi-directional LSTM. Multimedia Tools Applicat 80(12):18107–18124. https://doi.org/10.1007/s11042-021-10602-y
Kaddoura S, Chandrasekaran G, Popescu DE, Duraisamy JH (2022) A systematic literature review on spam content detection and classification. Peer J Comp Sci. https://doi.org/10.7717/PEERJ-CS.830
Wapet L, Tchana A, Tran GS, Hagimont D (2019) Preventing the propagation of a new kind of illegitimate apps. Future Generat Comp Syst. https://doi.org/10.1016/j.future.2018.11.051
Jain G, Sharma M, Agarwal B (2019) Spam detection in social media using convolutional and long short term memory neural network. Annals Mathemat Artif Intell. https://doi.org/10.1007/s10472-018-9612-z
Faris H et al (2019) An intelligent system for spam detection and identification of the most relevant features based on evolutionary random weight networks. Informat Fusion. https://doi.org/10.1016/j.inffus.2018.08.002
Fu Q, Feng B, Guo D, Li Q (2018) Combating the evolving spammers in online social networks. Comput Secur 72:60–73. https://doi.org/10.1016/j.cose.2017.08.014
Kaur R, Singh S, Kumar H (2018) Rise of spam and compromised accounts in online social networks: a state-of-the-art review of different combating approaches. J Netw Comput Appl 112:53–88. https://doi.org/10.1016/j.jnca.2018.03.015
Barushka A, Hajek P (2020) Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks. Neural Comput Appl 32(9):4239–4257. https://doi.org/10.1007/s00521-019-04331-5
Sharaff A, Jain M, Modugula G (2022) Feature based cluster ranking approach for single document summarization. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-021-00853-1
Shahariar GM, Biswas S, Omar F, Shah FM, Binte Hassan S (2019) “Spam review detection using deep learning”, in IEEE 10th annual information technology. Electron Mobile Commun Conf (IEMCON) 2019:0027–0033. https://doi.org/10.1109/IEMCON.2019.8936148
Gopi AP, Jyothi RNS, Narayana VL, Sandeep KS (2020) Classification of tweets data based on polarity using improved RBF kernel of SVM. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-019-00409-4
Lin G, Sun N, Nepal S, Zhang J, Xiang Y, Hassan H (2017) Statistical twitter spam detection demystified: performance, stability and scalability. IEEE Access. https://doi.org/10.1109/ACCESS.2017.2710540
Cao C, Caverlee J (2015) Detecting spam URLs in social media via behavioral analysis. Lecture Notes Comp Sci (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). https://doi.org/10.1007/978-3-319-16354-3_77
Diqi M, Mulyani SH, Pradila R (2023) DeepCov: effective prediction model of COVID-19 using CNN algorithm. SN Comp Sci 4(4):396. https://doi.org/10.1007/s42979-023-01834-w
Wanda P, Jie HJ (2020) DeepProfile: Finding fake profile in online social network using dynamic CNN. J Informat Sec Appl 52:102465. https://doi.org/10.1016/j.jisa.2020.102465
Jain G, Sharma M, Agarwal B (2019) Optimizing semantic LSTM for spam detection. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-018-0157-5
Ghourabi A, Mahmood MA, Alzubi QM (2020) A hybrid CNN-LSTM model for SMS spam detection in Arabic and English messages. Future Internet. https://doi.org/10.3390/FI12090156
Bang D, Kang S, Shim H (2020) Discriminator feature-based inference by recycling the discriminator of GANs. Int J Comp Vision. https://doi.org/10.1007/s11263-020-01311-4
Diqi M, Hiswati ME, Nur AS (2022) StockGAN: robust stock price prediction using GAN algorithm. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-022-00929-6
Barigye SJ, de la Vega JMG, Perez-Castillo Y (2020) Generative adversarial networks (GANs) based synthetic sampling for predictive modeling. Molecular Inform. https://doi.org/10.1002/minf.202000086
Madisetty S, Desarkar MS (2018) A neural network-based ensemble approach for spam detection in twitter. IEEE Transact Computat Soc Syst. https://doi.org/10.1109/TCSS.2018.2878852
Lee JY, Choi SI (2020) Improvement of learning stability of generative adversarial network using variational learning. Appl Sci (Switzerland). https://doi.org/10.3390/app10134528
Lu PH, Wang PC, Yu CM (2019) Empirical evaluation on synthetic data generation with generative adversarial network. ACM Int Conf Proc Series. https://doi.org/10.1145/3326467.3326474
Xu C, Ren J, Zhang D, Zhang Y, Qin Z, Ren K (2019) GANobfuscator: Mitigating information leakage under GAN via differential privacy. IEEE Transact Informat Forens Sec. https://doi.org/10.1109/TIFS.2019.2897874
Tang X, Qian T, You Z (2020) Generating behavior features for cold-start spam review detection with adversarial learning. Informat Sci. https://doi.org/10.1016/j.ins.2020.03.063
Kumar A, Dabas V, Hooda P (2020) Text classification algorithms for mining unstructured data: a SWOT analysis. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-017-0072-1
Li M, Lin J, Ding Y, Liu Z, Zhu JY, Han S (2022) GAN Compression: Efficient Architectures for Interactive Conditional GANs. IEEE Transact Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2021.3126742
Miller Z, Dickinson B, Deitrick W, Hu W, Wang AH (2014) Twitter spammer detection using data stream clustering. Informat Sci. https://doi.org/10.1016/j.ins.2013.11.016
Fitni QRS, Ramli K (2020) “Implementation of ensemble learning and feature selection for performance improvements in anomaly-based intrusion detection systems,” In: Proceedings - 2020 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology, IAICT 2020, doi: https://doi.org/10.1109/IAICT50021.2020.9172014.
Soares E, Garcia C, Poucas R, Camargo H, Leite D (2019) Evolving fuzzy set-based and cloud-based unsupervised classifiers for spam detection. IEEE Lat Am Trans 17(09):1449–1457. https://doi.org/10.1109/TLA.2019.8931138
Rezaeinia SM, Rahmani R, Ghodsi A, Veisi H (2019) Sentiment analysis based on improved pre-trained word embeddings. Expert Syst Appl. https://doi.org/10.1016/j.eswa.2018.08.044
Mohammed MA et al (2019) An anti-spam detection model for emails of multi-natural language. Xinan Jiaotong Daxue Xuebao/J Southwest Jiaotong Univ. https://doi.org/10.35741/issn.0258-2724.54.3.6
Singh AB, Singh KM, Chanu YJ, Thongam K, Singh KJ (2022) An improved image spam classification model based on deep learning techniques. Sec Communicat Net. https://doi.org/10.1155/2022/8905424
Wanda P (2022) RunMax: fake profile classification using novel nonlinear activation in CNN. Soc Netw Anal Min 12(1):158. https://doi.org/10.1007/s13278-022-00983-9
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
None declared.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Diqi, M. TwitterGAN: robust spam detection in twitter using novel generative adversarial networks. Int. j. inf. tecnol. 15, 3103–3111 (2023). https://doi.org/10.1007/s41870-023-01352-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-023-01352-1