Skip to main content
Log in

TwitterGAN: robust spam detection in twitter using novel generative adversarial networks

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

As social media platforms like Twitter continue to evolve, the proliferation of spam content has become a pressing issue, undermining the credibility of shared messages. Traditional spam detection methods, such as black-and-white listing and rule-based learning techniques, struggle to efficiently handle large datasets and adapt to dynamic environments. To address these challenges, we propose a novel spam detection model that leverages generative learning techniques, offering improved performance on vast datasets and changing circumstances. Using a substantial Twitter dataset with an 80% training and 20% testing split, our innovative model demonstrates remarkable effectiveness. Experimental results show a G-Loss score of 8.1207, significantly outperforming the D-Loss score of 0.0081, indicating the model’s exceptional accuracy and low error rate. Consequently, our groundbreaking approach emerges as a highly promising solution for real-world spam identification, raising the bar for spam detection research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Data availability

The dataset comes from the NSClab/Resources Twitter Spam. (http://nsclab.org/nsclab/resources/).

References

  1. Elmendili F, Idrissi YEBE (2020) A framework for spam detection in twitter based on recommendation system. Int J Intell Eng Syst. https://doi.org/10.22266/ijies2020.1031.09

    Article  Google Scholar 

  2. Inuwa-Dutse I, Liptrott M, Korkontzelos I (2018) Detection of spam-posting accounts on Twitter. Neurocomputing. https://doi.org/10.1016/j.neucom.2018.07.044

    Article  Google Scholar 

  3. Wu T et al (2017) Detecting spamming activities in twitter based on deep-learning technique. Concurrency Computat. https://doi.org/10.1002/cpe.4209

    Article  Google Scholar 

  4. Fazil M, Abulaish M (2018) A Hybrid Approach for Detecting Automated Spammers in Twitter. IEEE Transact Informat Forensics Sec. https://doi.org/10.1109/TIFS.2018.2825958

    Article  Google Scholar 

  5. Karakaşlı MS, Aydin MA, Yarkan S, Boyaci A (2019) Dynamic feature selection for spam detection in twitter. Lecture Notes Elect Eng. https://doi.org/10.1007/978-981-13-0408-8_20

    Article  Google Scholar 

  6. Li C, Liu S (2018) A comparative study of the class imbalance problem in Twitter spam detection. Concurr Computat. https://doi.org/10.1002/cpe.4281

    Article  Google Scholar 

  7. Çıtlak O, Dörterler M, Doğru İA (2019) A survey on detecting spam accounts on Twitter network. Soc Netw Anal Min 9(1):35. https://doi.org/10.1007/s13278-019-0582-x

    Article  Google Scholar 

  8. Zheng X, Zeng Z, Chen Z, Yu Y, Rong C (2015) Detecting spammers on social networks. Neurocomputing. https://doi.org/10.1016/j.neucom.2015.02.047

    Article  Google Scholar 

  9. Yurtseven I, Bagriyanik S, Ayvaz S (2021) “A Review of Spam Detection in Social Media,” In: Proceedings - 6th International Conference on Computer Science and Engineering, UBMK 2021. doi: https://doi.org/10.1109/UBMK52708.2021.9558993.

  10. Raza M, Jayasinghe ND, Muslam MMA (2021) A comprehensive review on email spam classification using machine learning algorithms. Int Conf Informat Netw. https://doi.org/10.1109/ICOIN50884.2021.9334020

    Article  Google Scholar 

  11. Bhuvaneshwari P, Rao AN, Robinson YH (2021) Spam review detection using self attention based CNN and bi-directional LSTM. Multimedia Tools Applicat 80(12):18107–18124. https://doi.org/10.1007/s11042-021-10602-y

    Article  Google Scholar 

  12. Kaddoura S, Chandrasekaran G, Popescu DE, Duraisamy JH (2022) A systematic literature review on spam content detection and classification. Peer J Comp Sci. https://doi.org/10.7717/PEERJ-CS.830

    Article  Google Scholar 

  13. Wapet L, Tchana A, Tran GS, Hagimont D (2019) Preventing the propagation of a new kind of illegitimate apps. Future Generat Comp Syst. https://doi.org/10.1016/j.future.2018.11.051

    Article  Google Scholar 

  14. Jain G, Sharma M, Agarwal B (2019) Spam detection in social media using convolutional and long short term memory neural network. Annals Mathemat Artif Intell. https://doi.org/10.1007/s10472-018-9612-z

    Article  Google Scholar 

  15. Faris H et al (2019) An intelligent system for spam detection and identification of the most relevant features based on evolutionary random weight networks. Informat Fusion. https://doi.org/10.1016/j.inffus.2018.08.002

    Article  Google Scholar 

  16. Fu Q, Feng B, Guo D, Li Q (2018) Combating the evolving spammers in online social networks. Comput Secur 72:60–73. https://doi.org/10.1016/j.cose.2017.08.014

    Article  Google Scholar 

  17. Kaur R, Singh S, Kumar H (2018) Rise of spam and compromised accounts in online social networks: a state-of-the-art review of different combating approaches. J Netw Comput Appl 112:53–88. https://doi.org/10.1016/j.jnca.2018.03.015

    Article  Google Scholar 

  18. Barushka A, Hajek P (2020) Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks. Neural Comput Appl 32(9):4239–4257. https://doi.org/10.1007/s00521-019-04331-5

    Article  Google Scholar 

  19. Sharaff A, Jain M, Modugula G (2022) Feature based cluster ranking approach for single document summarization. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-021-00853-1

    Article  Google Scholar 

  20. Shahariar GM, Biswas S, Omar F, Shah FM, Binte Hassan S (2019) “Spam review detection using deep learning”, in IEEE 10th annual information technology. Electron Mobile Commun Conf (IEMCON) 2019:0027–0033. https://doi.org/10.1109/IEMCON.2019.8936148

    Article  Google Scholar 

  21. Gopi AP, Jyothi RNS, Narayana VL, Sandeep KS (2020) Classification of tweets data based on polarity using improved RBF kernel of SVM. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-019-00409-4

    Article  Google Scholar 

  22. Lin G, Sun N, Nepal S, Zhang J, Xiang Y, Hassan H (2017) Statistical twitter spam detection demystified: performance, stability and scalability. IEEE Access. https://doi.org/10.1109/ACCESS.2017.2710540

    Article  Google Scholar 

  23. Cao C, Caverlee J (2015) Detecting spam URLs in social media via behavioral analysis. Lecture Notes Comp Sci (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). https://doi.org/10.1007/978-3-319-16354-3_77

    Article  Google Scholar 

  24. Diqi M, Mulyani SH, Pradila R (2023) DeepCov: effective prediction model of COVID-19 using CNN algorithm. SN Comp Sci 4(4):396. https://doi.org/10.1007/s42979-023-01834-w

    Article  Google Scholar 

  25. Wanda P, Jie HJ (2020) DeepProfile: Finding fake profile in online social network using dynamic CNN. J Informat Sec Appl 52:102465. https://doi.org/10.1016/j.jisa.2020.102465

    Article  Google Scholar 

  26. Jain G, Sharma M, Agarwal B (2019) Optimizing semantic LSTM for spam detection. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-018-0157-5

    Article  Google Scholar 

  27. Ghourabi A, Mahmood MA, Alzubi QM (2020) A hybrid CNN-LSTM model for SMS spam detection in Arabic and English messages. Future Internet. https://doi.org/10.3390/FI12090156

    Article  Google Scholar 

  28. Bang D, Kang S, Shim H (2020) Discriminator feature-based inference by recycling the discriminator of GANs. Int J Comp Vision. https://doi.org/10.1007/s11263-020-01311-4

    Article  MATH  Google Scholar 

  29. Diqi M, Hiswati ME, Nur AS (2022) StockGAN: robust stock price prediction using GAN algorithm. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-022-00929-6

    Article  Google Scholar 

  30. Barigye SJ, de la Vega JMG, Perez-Castillo Y (2020) Generative adversarial networks (GANs) based synthetic sampling for predictive modeling. Molecular Inform. https://doi.org/10.1002/minf.202000086

    Article  Google Scholar 

  31. Madisetty S, Desarkar MS (2018) A neural network-based ensemble approach for spam detection in twitter. IEEE Transact Computat Soc Syst. https://doi.org/10.1109/TCSS.2018.2878852

    Article  Google Scholar 

  32. Lee JY, Choi SI (2020) Improvement of learning stability of generative adversarial network using variational learning. Appl Sci (Switzerland). https://doi.org/10.3390/app10134528

    Article  Google Scholar 

  33. Lu PH, Wang PC, Yu CM (2019) Empirical evaluation on synthetic data generation with generative adversarial network. ACM Int Conf Proc Series. https://doi.org/10.1145/3326467.3326474

    Article  Google Scholar 

  34. Xu C, Ren J, Zhang D, Zhang Y, Qin Z, Ren K (2019) GANobfuscator: Mitigating information leakage under GAN via differential privacy. IEEE Transact Informat Forens Sec. https://doi.org/10.1109/TIFS.2019.2897874

    Article  Google Scholar 

  35. Tang X, Qian T, You Z (2020) Generating behavior features for cold-start spam review detection with adversarial learning. Informat Sci. https://doi.org/10.1016/j.ins.2020.03.063

    Article  Google Scholar 

  36. Kumar A, Dabas V, Hooda P (2020) Text classification algorithms for mining unstructured data: a SWOT analysis. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-017-0072-1

    Article  Google Scholar 

  37. Li M, Lin J, Ding Y, Liu Z, Zhu JY, Han S (2022) GAN Compression: Efficient Architectures for Interactive Conditional GANs. IEEE Transact Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2021.3126742

    Article  Google Scholar 

  38. Miller Z, Dickinson B, Deitrick W, Hu W, Wang AH (2014) Twitter spammer detection using data stream clustering. Informat Sci. https://doi.org/10.1016/j.ins.2013.11.016

    Article  Google Scholar 

  39. Fitni QRS, Ramli K (2020) “Implementation of ensemble learning and feature selection for performance improvements in anomaly-based intrusion detection systems,” In: Proceedings - 2020 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology, IAICT 2020, doi: https://doi.org/10.1109/IAICT50021.2020.9172014.

  40. Soares E, Garcia C, Poucas R, Camargo H, Leite D (2019) Evolving fuzzy set-based and cloud-based unsupervised classifiers for spam detection. IEEE Lat Am Trans 17(09):1449–1457. https://doi.org/10.1109/TLA.2019.8931138

    Article  Google Scholar 

  41. Rezaeinia SM, Rahmani R, Ghodsi A, Veisi H (2019) Sentiment analysis based on improved pre-trained word embeddings. Expert Syst Appl. https://doi.org/10.1016/j.eswa.2018.08.044

    Article  Google Scholar 

  42. Mohammed MA et al (2019) An anti-spam detection model for emails of multi-natural language. Xinan Jiaotong Daxue Xuebao/J Southwest Jiaotong Univ. https://doi.org/10.35741/issn.0258-2724.54.3.6

    Article  Google Scholar 

  43. Singh AB, Singh KM, Chanu YJ, Thongam K, Singh KJ (2022) An improved image spam classification model based on deep learning techniques. Sec Communicat Net. https://doi.org/10.1155/2022/8905424

    Article  Google Scholar 

  44. Wanda P (2022) RunMax: fake profile classification using novel nonlinear activation in CNN. Soc Netw Anal Min 12(1):158. https://doi.org/10.1007/s13278-022-00983-9

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Diqi.

Ethics declarations

Conflict of interest

None declared.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Diqi, M. TwitterGAN: robust spam detection in twitter using novel generative adversarial networks. Int. j. inf. tecnol. 15, 3103–3111 (2023). https://doi.org/10.1007/s41870-023-01352-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-023-01352-1

Keywords

Navigation