Skip to main content
Log in

A novel deep learning model-based optimization algorithm for text message spam detection

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Mobile texting has increased social engineering assaults like phishing. Because spam, or unsolicited text messages, spread phishing attempts that steal personal information. Traditional methods of spam detection, often based on statistical models or human rule-based systems, have difficulties in keeping up with the growing complexity of spamming strategies. Gathering pertinent data from social networks is a challenging task, mostly due to the limits imposed by privacy concerns and time constraints. The inefficiency and time-consuming nature of conventional frequency-based techniques to word encoding are generally recognized. Text classification has shown promising outcomes with the use of word embeddings and deep learning techniques. The proposed approach involves integrating deep learning with the Remora optimization algorithm framework (DL–ROA) to autonomously extract intricate patterns and nuanced information from text messages. The system’s capacity to adapt to new spamming strategies enhances the DL–ROA. The proposed technique improves the accuracy of detection while reducing the inefficiency and time required to create contextual word vectors based on word frequency. Spam detection is achieved by using a hybrid deep model that combines long short-term memory (LSTM) and deep convolutional neural networks (DCNN) architectures. Empirical data demonstrate that the DL–ROA technique surpasses existing deep learning models in terms of accuracy, f1-score, and recall. In addition, the DL–ROA achieved an unprecedented accuracy rate of 98.25%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Algorithm - 1
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Kaur R, Singh S, Kumar H (2018) Rise of spam and compromised accounts in online social networks: A state-of-the-art review of different combating approaches. J Netw Comput Appl 112:53–88

    Article  Google Scholar 

  2. Rao S, Verma AK, Bhatia T (2021) A review on social spam detection: Challenges, open issues, and future directions. Expert Syst Appl 186:115742

    Article  Google Scholar 

  3. Alam F, Cresci S, Chakraborty T, Silvestri F, Dimitrov D, Martino GDS, and Nakov P (2021) A survey on multimodal disinformation detection. arXiv preprint arXiv:2103.12541

  4. Barushka A, Hajek P (2020) Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks. Neural Comput Appl 32:4239–4257

    Article  Google Scholar 

  5. Shaaban MA, Hassan YF, Guirguis SK (2022) Deep convolutional forest: a dynamic deep ensemble approach for spam detection in text. Complex Intell Syst 8(6):4897–4909

    Article  Google Scholar 

  6. Akinyelu AA (2021) Advances in spam detection for email spam, web spam, social network spam, and review spam: ML-based and nature-inspired-based techniques. J Comput Secur 29(5):473–529

    Google Scholar 

  7. Liu X, Lu H, Nayak A (2021) A spam transformer model for SMS spam detection. IEEE Access 9:80253–80263

    Article  Google Scholar 

  8. Taheri R and Javidan R (2017) Spam filtering in SMS using recurrent neural networks. In 2017 Artificial Intelligence and Signal Processing Conference (AISP). IEEE, pp 331–336

  9. Lee H, Jeong S, Cho S, Choi E (2023) Visualization technology and deep-learning for multilingual spam message detection. Electronics 12(3):582

    Article  Google Scholar 

  10. Amaar A, Aljedaani W, Rustam F, Ullah S, Rupapara V, Ludi S (2022) Detection of fake job postings by utilizing machine learning and natural language processing approaches. Neural Process Lett 54:2219–2247

    Article  Google Scholar 

  11. Jia H, Peng X, Lang C (2021) Remora optimization algorithm. Expert Syst Appl 185:115665

    Article  Google Scholar 

  12. Greff K, Srivastava RK, Koutník J, Steunebrink BR, Schmidhuber J (2016) LSTM: a search space odyssey. IEEE trans neural netw learn syst 28(10):2222–2232

    Article  MathSciNet  Google Scholar 

  13. Wei F, and Nguyen T (2020) A lightweight deep neural model for sms spam detection. In 2020 International Symposium on Networks, Computers and Communications (ISNCC). IEEE, pp 1–6

  14. Mansoor RAZA, Jayasinghe ND, and Muslam MMA (2021) A comprehensive review on email spam classification using machine learning algorithms. In 2021 International Conference on Information Networking (ICOIN).IEEE, pp 327–332

  15. Sun W, Su F, Wang L (2018) Improving deep neural networks with multi-layer maxout networks and a novel initialization method. Neurocomputing 278:34–40

    Article  Google Scholar 

  16. Feng B, Fu Q, Dong M, Guo D, Li Q (2018) Multistage and elastic spam detection in mobile social networks through deep learning. IEEE Netw 32(4):15–21

    Article  Google Scholar 

  17. Gadde S, Lakshmanarao A and Satyanarayana S (2021) SMS spam detection using machine learning and deep learning techniques. In 2021 7th International Conference on Advanced Computing and Communication Systems (ICACCS). IEEE, 1 pp 358–362

  18. Al Bataineh A, Kaur D (2021) Immunocomputing-based approach for optimizing the topologies of LSTM networks. IEEE Access 9:78993–79004

    Article  Google Scholar 

  19. Roy PK, Singh JP, Banerjee S (2020) Deep learning to filter SMS Spam. Future Gener Comput Syst 102:524–533

    Article  Google Scholar 

  20. Xia T, Chen X (2021) A weighted feature enhanced hidden Markov model for spam SMS filtering. Neurocomputing 444:48–58

    Article  Google Scholar 

  21. Gaurav D, Tiwari SM, Goyal A, Gandhi N, Abraham A (2020) Machine intelligence-based algorithms for spam filtering on document labeling. Soft Comput 24:9625–9638

    Article  Google Scholar 

  22. Popovac M, Karanovic M, Sladojevic S, Arsenovic M and Anderla A (2018) Convolutional neural network based SMS spam detection. In 2018 26th Telecommunications Forum (TELFOR) IEEE, pp 1–4

  23. Barushka A, Hajek P (2018) Spam filtering using integrated distribution-based balancing approach and regularized deep neural networks. Appl Intell 48:3538–3556

    Article  Google Scholar 

  24. Jain G, Sharma M, Agarwal B (2019) Spam detection in social media using convolutional and long short term memory neural network. Ann Math Artif Intell 85(1):21–44

    Article  Google Scholar 

  25. Kim D, Seo D, Cho S, Kang P (2019) Multi-co-training for document classification using various document representations: TF–IDF, LDA, and Doc2Vec. Inf Sci 477:15–29

    Article  Google Scholar 

  26. Merugu S, Reddy MCS, Goyal E, and Piplani L (2019) Text message classification using supervised machine learning algorithms. In ICCCE 2018: Proceedings of the International Conference on Communications and Cyber Physical Engineering 2018. Springer, Singapore, pp 141–150

  27. Abid MA, Ullah S, Siddique MA, Mushtaq MF, Aljedaani W, Rustam F (2022) Spam SMS filtering based on text features and supervised machine learning techniques. Multimed Tools Appl 81(28):39853–39871

    Article  Google Scholar 

  28. Ahmad SBS, Rafie M, Ghorabie SM (2021) Spam detection on Twitter using a support vector machine and users’ features by identifying their interactions. Multimed Tools Appl 80(8):11583–11605

    Article  Google Scholar 

  29. SenthilMurugan N, Usha Devi G (2018) Detecting streaming of Twitter spam using hybrid method. Wirel Pers Commun 103:1353–1374

    Article  Google Scholar 

  30. Inuwa-Dutse I, Liptrott M, Korkontzelos I (2018) Detection of spam-posting accounts on Twitter. Neurocomputing 315:496–511

    Article  Google Scholar 

  31. Manasa P, Malik A, Alqahtani KN, Alomar MA, Basingab MS et al. (2022) Tweet spam detection using machine learning and swarm optimization techniques. IEEE Transactions on Computational Social Systems

  32. BazzazAbkenar S, Mahdipour E, Jameii SM, HaghiKashani M (2021) A hybrid classification method for Twitter spam detection based on differential evolution and random forest. Concurr Comput: Pract Exp 33(21):e6381

    Article  Google Scholar 

  33. Xia T, Chen X (2020) A discrete hidden Markov model for SMS spam detection. Appl Sci 10(14):5011

    Article  Google Scholar 

  34. Sisodia DS Mahapatra S and Sharma A (2020) Automated sms classification and spam analysis using topic modeling. In 2nd International Conference on Data, Engineering and Applications (IDEA) IEEE, pp 1–6

  35. Ayo FE, Ogundele LA, Olakunle S, Awotunde JB, Kasali FA (2024) A hybrid correlation-based deep learning model for email spam classification using fuzzy inference system. Decis Anal J 10:100390

    Article  Google Scholar 

  36. Kumar S, Gupta S (2024) Legitimate and spam SMS classification employing novel Ensemble feature selection algorithm. Multimed Tools Appl 83(7):19897–19927

    Article  Google Scholar 

  37. Nosrati V, Rahmani M, Jolfaei A, Seifollahi S (2023) A weak-region enhanced Bayesian classification for spam content-based filtering. ACM Trans Asian Low-Res Lang Inf Process 22(3):1–18

    Article  Google Scholar 

  38. Shaik CM, Penumaka NM, Abbireddy SK, Kumar V and Aravinth SS (2023) Bi-LSTM and Conventional Classifiers for Email Spam Filtering. In 2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS). IEEE, pp 1350–1355

  39. Alshahrani HJ, Tarmissi K, Yafoz A, Mohamed A, Motwakel A, Yaseen I, Mahzari M (2023) Improved fruitfly optimization with stacked residual deep learning based email classification. Intell Autom Soft Comput 36(3):3139–3155

    Article  Google Scholar 

  40. Borra SR, Yukthika M, Bhargavi M, Samskruthi M, Saisri PV, Akhila Y, Alekhya S (2024) OECNet: optimal feature selection-based email classification network using unsupervised learning with deep CNN model. e-Prime-Adv Electr Eng Electr Energy 7:100415

    Article  Google Scholar 

  41. Mani S, Gunasekaran G, Geetha S (2023) Email spam detection using gated recurrent neural network. Int J Prograss Res Eng Manag Sci (IJPREMS) 3(2023):90–99

    Google Scholar 

  42. Mambina IS, Ndibwile JD, Uwimpuhwe D, Michael KF (2024) Uncovering SMS spam in swahili text using deep learning approaches. IEEE Access 12:25164–25175

    Article  Google Scholar 

  43. Krishnamoorthy P, Sathiyanarayanan M, Proença HP (2024) A novel and secured email classification and emotion detection using hybrid deep neural network. Int J Cogn Comput Eng 5:44–57

    Google Scholar 

  44. Pudasaini S, Shakya A, Pandey SP, Paudel P, Ghimire S, Ale P (2023) SMS spam detection using relevance vector machine. Proced Comput Sci 230:337–346

    Article  Google Scholar 

  45. Pirozmand P, Sadeghilalimi M, Hosseinabadi AAR, Sadeghilalimi F, Mirkamali S, Slowik A (2023) A feature selection approach for spam detection in social networks using gravitational force-based heuristic algorithm. J Ambient Intell Human Comput 14:1633–1646

    Article  Google Scholar 

  46. Ouhame S, Hadi Y, Ullah A (2021) An efficient forecasting approach for resource utilization in cloud data center using CNN-LSTM model. Neural Comput Appl 33:10043–10055

    Article  Google Scholar 

  47. Zhu Y, Zhang W, Chen Y, Gao H (2019) A novel approach to workload prediction using attention-based LSTM encoder-decoder network in cloud environment. EURASIP J Wirel Commun Netw 2019:1–18

    Article  Google Scholar 

  48. Mirjalili S, Lewis A (2016) The whale optimization algorithm. Adv eng softw 95:51–67

    Article  Google Scholar 

  49. Almeida TA, Hidalgo J M G and Yamakami A (2011) Contributions to the study of SMS spam filtering: new collection and results. In Proceedings of the 11th ACM symposium on Document engineering pp 259–262

  50. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J artif intell res 16:321–357

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Contributions

LD, LA, and AP involved in concept, design, analysis, writing—original draft, and writing—review and editing.

Corresponding author

Correspondence to Lipsa Das.

Ethics declarations

Conflict of interest

All authors have no conflict of interest to report and declare that they have no known competing financial interests in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Das, L., Ahuja, L. & Pandey, A. A novel deep learning model-based optimization algorithm for text message spam detection. J Supercomput (2024). https://doi.org/10.1007/s11227-024-06148-z

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11227-024-06148-z

Keywords

Navigation