TwitterGAN: robust spam detection in twitter using novel generative adversarial networks

Diqi, Mohammad

doi:10.1007/s41870-023-01352-1

TwitterGAN: robust spam detection in twitter using novel generative adversarial networks

Original Research
Published: 23 June 2023

Volume 15, pages 3103–3111, (2023)
Cite this article

International Journal of Information Technology Aims and scope Submit manuscript

Mohammad Diqi ORCID: orcid.org/0000-0002-9012-9080¹

158 Accesses
5 Citations
Explore all metrics

Abstract

As social media platforms like Twitter continue to evolve, the proliferation of spam content has become a pressing issue, undermining the credibility of shared messages. Traditional spam detection methods, such as black-and-white listing and rule-based learning techniques, struggle to efficiently handle large datasets and adapt to dynamic environments. To address these challenges, we propose a novel spam detection model that leverages generative learning techniques, offering improved performance on vast datasets and changing circumstances. Using a substantial Twitter dataset with an 80% training and 20% testing split, our innovative model demonstrates remarkable effectiveness. Experimental results show a G-Loss score of 8.1207, significantly outperforming the D-Loss score of 0.0081, indicating the model’s exceptional accuracy and low error rate. Consequently, our groundbreaking approach emerges as a highly promising solution for real-world spam identification, raising the bar for spam detection research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fake news, disinformation and misinformation in social media: a review

Article 09 February 2023

A comprehensive survey of AI-enabled phishing attacks detection techniques

Article 23 October 2020

Fake news detection based on news content and social contexts: a transformer-based approach

Article 30 January 2022

Data availability

The dataset comes from the NSClab/Resources Twitter Spam. (http://nsclab.org/nsclab/resources/).

References

Elmendili F, Idrissi YEBE (2020) A framework for spam detection in twitter based on recommendation system. Int J Intell Eng Syst. https://doi.org/10.22266/ijies2020.1031.09
Article Google Scholar
Inuwa-Dutse I, Liptrott M, Korkontzelos I (2018) Detection of spam-posting accounts on Twitter. Neurocomputing. https://doi.org/10.1016/j.neucom.2018.07.044
Article Google Scholar
Wu T et al (2017) Detecting spamming activities in twitter based on deep-learning technique. Concurrency Computat. https://doi.org/10.1002/cpe.4209
Article Google Scholar
Fazil M, Abulaish M (2018) A Hybrid Approach for Detecting Automated Spammers in Twitter. IEEE Transact Informat Forensics Sec. https://doi.org/10.1109/TIFS.2018.2825958
Article Google Scholar
Karakaşlı MS, Aydin MA, Yarkan S, Boyaci A (2019) Dynamic feature selection for spam detection in twitter. Lecture Notes Elect Eng. https://doi.org/10.1007/978-981-13-0408-8_20
Article Google Scholar
Li C, Liu S (2018) A comparative study of the class imbalance problem in Twitter spam detection. Concurr Computat. https://doi.org/10.1002/cpe.4281
Article Google Scholar
Çıtlak O, Dörterler M, Doğru İA (2019) A survey on detecting spam accounts on Twitter network. Soc Netw Anal Min 9(1):35. https://doi.org/10.1007/s13278-019-0582-x
Article Google Scholar
Zheng X, Zeng Z, Chen Z, Yu Y, Rong C (2015) Detecting spammers on social networks. Neurocomputing. https://doi.org/10.1016/j.neucom.2015.02.047
Article Google Scholar
Yurtseven I, Bagriyanik S, Ayvaz S (2021) “A Review of Spam Detection in Social Media,” In: Proceedings - 6th International Conference on Computer Science and Engineering, UBMK 2021. doi: https://doi.org/10.1109/UBMK52708.2021.9558993.
Raza M, Jayasinghe ND, Muslam MMA (2021) A comprehensive review on email spam classification using machine learning algorithms. Int Conf Informat Netw. https://doi.org/10.1109/ICOIN50884.2021.9334020
Article Google Scholar
Bhuvaneshwari P, Rao AN, Robinson YH (2021) Spam review detection using self attention based CNN and bi-directional LSTM. Multimedia Tools Applicat 80(12):18107–18124. https://doi.org/10.1007/s11042-021-10602-y
Article Google Scholar
Kaddoura S, Chandrasekaran G, Popescu DE, Duraisamy JH (2022) A systematic literature review on spam content detection and classification. Peer J Comp Sci. https://doi.org/10.7717/PEERJ-CS.830
Article Google Scholar
Wapet L, Tchana A, Tran GS, Hagimont D (2019) Preventing the propagation of a new kind of illegitimate apps. Future Generat Comp Syst. https://doi.org/10.1016/j.future.2018.11.051
Article Google Scholar
Jain G, Sharma M, Agarwal B (2019) Spam detection in social media using convolutional and long short term memory neural network. Annals Mathemat Artif Intell. https://doi.org/10.1007/s10472-018-9612-z
Article Google Scholar
Faris H et al (2019) An intelligent system for spam detection and identification of the most relevant features based on evolutionary random weight networks. Informat Fusion. https://doi.org/10.1016/j.inffus.2018.08.002
Article Google Scholar
Fu Q, Feng B, Guo D, Li Q (2018) Combating the evolving spammers in online social networks. Comput Secur 72:60–73. https://doi.org/10.1016/j.cose.2017.08.014
Article Google Scholar
Kaur R, Singh S, Kumar H (2018) Rise of spam and compromised accounts in online social networks: a state-of-the-art review of different combating approaches. J Netw Comput Appl 112:53–88. https://doi.org/10.1016/j.jnca.2018.03.015
Article Google Scholar
Barushka A, Hajek P (2020) Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks. Neural Comput Appl 32(9):4239–4257. https://doi.org/10.1007/s00521-019-04331-5
Article Google Scholar
Sharaff A, Jain M, Modugula G (2022) Feature based cluster ranking approach for single document summarization. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-021-00853-1
Article Google Scholar
Shahariar GM, Biswas S, Omar F, Shah FM, Binte Hassan S (2019) “Spam review detection using deep learning”, in IEEE 10th annual information technology. Electron Mobile Commun Conf (IEMCON) 2019:0027–0033. https://doi.org/10.1109/IEMCON.2019.8936148
Article Google Scholar
Gopi AP, Jyothi RNS, Narayana VL, Sandeep KS (2020) Classification of tweets data based on polarity using improved RBF kernel of SVM. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-019-00409-4
Article Google Scholar
Lin G, Sun N, Nepal S, Zhang J, Xiang Y, Hassan H (2017) Statistical twitter spam detection demystified: performance, stability and scalability. IEEE Access. https://doi.org/10.1109/ACCESS.2017.2710540
Article Google Scholar
Cao C, Caverlee J (2015) Detecting spam URLs in social media via behavioral analysis. Lecture Notes Comp Sci (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). https://doi.org/10.1007/978-3-319-16354-3_77
Article Google Scholar
Diqi M, Mulyani SH, Pradila R (2023) DeepCov: effective prediction model of COVID-19 using CNN algorithm. SN Comp Sci 4(4):396. https://doi.org/10.1007/s42979-023-01834-w
Article Google Scholar
Wanda P, Jie HJ (2020) DeepProfile: Finding fake profile in online social network using dynamic CNN. J Informat Sec Appl 52:102465. https://doi.org/10.1016/j.jisa.2020.102465
Article Google Scholar
Jain G, Sharma M, Agarwal B (2019) Optimizing semantic LSTM for spam detection. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-018-0157-5
Article Google Scholar
Ghourabi A, Mahmood MA, Alzubi QM (2020) A hybrid CNN-LSTM model for SMS spam detection in Arabic and English messages. Future Internet. https://doi.org/10.3390/FI12090156
Article Google Scholar
Bang D, Kang S, Shim H (2020) Discriminator feature-based inference by recycling the discriminator of GANs. Int J Comp Vision. https://doi.org/10.1007/s11263-020-01311-4
Article MATH Google Scholar
Diqi M, Hiswati ME, Nur AS (2022) StockGAN: robust stock price prediction using GAN algorithm. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-022-00929-6
Article Google Scholar
Barigye SJ, de la Vega JMG, Perez-Castillo Y (2020) Generative adversarial networks (GANs) based synthetic sampling for predictive modeling. Molecular Inform. https://doi.org/10.1002/minf.202000086
Article Google Scholar
Madisetty S, Desarkar MS (2018) A neural network-based ensemble approach for spam detection in twitter. IEEE Transact Computat Soc Syst. https://doi.org/10.1109/TCSS.2018.2878852
Article Google Scholar
Lee JY, Choi SI (2020) Improvement of learning stability of generative adversarial network using variational learning. Appl Sci (Switzerland). https://doi.org/10.3390/app10134528
Article Google Scholar
Lu PH, Wang PC, Yu CM (2019) Empirical evaluation on synthetic data generation with generative adversarial network. ACM Int Conf Proc Series. https://doi.org/10.1145/3326467.3326474
Article Google Scholar
Xu C, Ren J, Zhang D, Zhang Y, Qin Z, Ren K (2019) GANobfuscator: Mitigating information leakage under GAN via differential privacy. IEEE Transact Informat Forens Sec. https://doi.org/10.1109/TIFS.2019.2897874
Article Google Scholar
Tang X, Qian T, You Z (2020) Generating behavior features for cold-start spam review detection with adversarial learning. Informat Sci. https://doi.org/10.1016/j.ins.2020.03.063
Article Google Scholar
Kumar A, Dabas V, Hooda P (2020) Text classification algorithms for mining unstructured data: a SWOT analysis. Int J Informat Technol (Singapore). https://doi.org/10.1007/s41870-017-0072-1
Article Google Scholar
Li M, Lin J, Ding Y, Liu Z, Zhu JY, Han S (2022) GAN Compression: Efficient Architectures for Interactive Conditional GANs. IEEE Transact Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2021.3126742
Article Google Scholar
Miller Z, Dickinson B, Deitrick W, Hu W, Wang AH (2014) Twitter spammer detection using data stream clustering. Informat Sci. https://doi.org/10.1016/j.ins.2013.11.016
Article Google Scholar
Fitni QRS, Ramli K (2020) “Implementation of ensemble learning and feature selection for performance improvements in anomaly-based intrusion detection systems,” In: Proceedings - 2020 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology, IAICT 2020, doi: https://doi.org/10.1109/IAICT50021.2020.9172014.
Soares E, Garcia C, Poucas R, Camargo H, Leite D (2019) Evolving fuzzy set-based and cloud-based unsupervised classifiers for spam detection. IEEE Lat Am Trans 17(09):1449–1457. https://doi.org/10.1109/TLA.2019.8931138
Article Google Scholar
Rezaeinia SM, Rahmani R, Ghodsi A, Veisi H (2019) Sentiment analysis based on improved pre-trained word embeddings. Expert Syst Appl. https://doi.org/10.1016/j.eswa.2018.08.044
Article Google Scholar
Mohammed MA et al (2019) An anti-spam detection model for emails of multi-natural language. Xinan Jiaotong Daxue Xuebao/J Southwest Jiaotong Univ. https://doi.org/10.35741/issn.0258-2724.54.3.6
Article Google Scholar
Singh AB, Singh KM, Chanu YJ, Thongam K, Singh KJ (2022) An improved image spam classification model based on deep learning techniques. Sec Communicat Net. https://doi.org/10.1155/2022/8905424
Article Google Scholar
Wanda P (2022) RunMax: fake profile classification using novel nonlinear activation in CNN. Soc Netw Anal Min 12(1):158. https://doi.org/10.1007/s13278-022-00983-9
Article Google Scholar

Download references

Author information

Authors and Affiliations

Universitas Respati Yogyakarta, Yogyakarta, Indonesia
Mohammad Diqi

Authors

Mohammad Diqi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohammad Diqi.

Ethics declarations

Conflict of interest

None declared.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Diqi, M. TwitterGAN: robust spam detection in twitter using novel generative adversarial networks. Int. j. inf. tecnol. 15, 3103–3111 (2023). https://doi.org/10.1007/s41870-023-01352-1

Download citation

Received: 05 November 2022
Accepted: 13 June 2023
Published: 23 June 2023
Issue Date: August 2023
DOI: https://doi.org/10.1007/s41870-023-01352-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

TwitterGAN: robust spam detection in twitter using novel generative adversarial networks

Abstract

Access this article

Similar content being viewed by others

Fake news, disinformation and misinformation in social media: a review

A comprehensive survey of AI-enabled phishing attacks detection techniques

Fake news detection based on news content and social contexts: a transformer-based approach

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

TwitterGAN: robust spam detection in twitter using novel generative adversarial networks

Abstract

Access this article

Similar content being viewed by others

Fake news, disinformation and misinformation in social media: a review

A comprehensive survey of AI-enabled phishing attacks detection techniques

Fake news detection based on news content and social contexts: a transformer-based approach

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation