Skip to main content
Log in

Cyberbullying detection and machine learning: a systematic literature review

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

The rise in research work focusing on detection of cyberbullying incidents on social media platforms particularly reflect how dire cyberbullying consequences are, regardless of age, gender or location. This paper examines scholarly publications (i.e., 2011–2022) on cyberbullying detection using machine learning through a systematic literature review approach. Specifically, articles were sought from six academic databases (Web of Science, ScienceDirect, IEEE Xplore, Association for Computing Machinery, Scopus, and Google Scholar), resulting in the identification of 4126 articles. A redundancy check followed by eligibility screening and quality assessment resulted in 68 articles included in this review. This review focused on three key aspects, namely, machine learning algorithms used to detect cyberbullying, features, and performance measures, and further supported with classification roles, language of study, data source and type of media. The findings are discussed, and research challenges and future directions are provided for researchers to explore.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. https://trends.google.com/trends/explore?date=2011-01-01%202022-12-31&q=deep%20learning&hl=en.

References

  • Agrawal S, Awekar A (2018) Deep learning for detecting cyberbullying across multiple social media platforms. In European Conference on Information Retrieval (pp. 141–153). Springer, Cham

  • Aizenkot D, Kashy-Rosenbaum G (2018) Cyberbullying in WhatsApp classmates’ groups: evaluation of an intervention program implemented in israeli elementary and middle schools. New Media & Society 20(12):4709–4727

    Article  Google Scholar 

  • Akhter MP, Zheng JB, Naqvi IR, Abdelmajeed M, Sadiq MT (2020) Automatic Detection of Offensive Language for Urdu and Roman Urdu. IEEE Access 8:91213–91226.

  • Aldhyani TH, Al-Adhaileh MH, Alsubari SN (2022) Cyberbullying identification system based deep learning algorithms. Electronics 11(20):3273

    Article  Google Scholar 

  • Al-Garadi MA, Hussain MR, Khan N, Murtaza G, Nweke HF, Ali I, …, Gani A (2019) Predicting cyberbullying on social media in the big data era using machine learning algorithms: review of literature and open challenges. IEEE Access 7:70701–70718

    Article  Google Scholar 

  • Al-garadi MA, Varathan KD, Ravana SD (2016) Cybercrime detection in online communications: the experimental case of cyberbullying detection in the Twitter network. Comput Hum Behav 63:433–443

    Article  Google Scholar 

  • Al-Harigy LM, Al-Nuaim HA, Moradpoor N, Tan Z (2022) Building towards Automated Cyberbullying Detection: A Comparative Analysis. Computational Intelligence and Neuroscience, 2022

  • Alom Z, Carminati B, Ferrari E (2020) A deep learning model for Twitter spam detection. Online Social Networks and Media 18:100079

    Article  Google Scholar 

  • Alpaydin E (2010) Introduction to machine learning, 2nd edn. MIT Press

  • Ates EC, Bostanci E, Guzel MS (2021) Comparative performance of machine learning algorithms in cyberbullying detection: using turkish language preprocessing techniques. arXiv preprint arXiv :2101.12718

  • Ayo FE, Folorunso O, Ibharalu FT, Osinuga IA (2020) Machine learning techniques for hate speech classification of twitter data: state-of-the-art, future challenges and research directions. Comput Sci Rev 38:100311

    Article  Google Scholar 

  • Balakrishnan V (2015) Cyberbullying among young adults in Malaysia: the roles of gender, age and internet frequency. Comput Hum Behav 46:149–157

    Article  Google Scholar 

  • Balakrishnan V, Khan S, Arabnia HR (2020a) Improving cyberbullying detection using Twitter users’ psychological features and machine learning. Computers & Security 90:101710

    Article  Google Scholar 

  • Balakrishnan V, Khan S, Arabnia HR (2020b) Improving cyberbullying detection using Twitter users’ psychological features and machine learning. Computers & Security 90:101710

    Article  Google Scholar 

  • Balakrishnan V, Khan S, Fernandez T, Arabnia HR (2019) Cyberbullying detection on twitter using Big Five and Dark Triad features. Pers Individ Differ 141, 252–257.

  • Bretschneider U, Wöhner T, Peters R (2014) Detecting online harassment in social networks.

  • Buan TA, Ramachandra R (2020) Automated Cyberbullying Detection in Social Media Using an SVM Activated Stacked Convolution LSTM Network. In Proceedings of the 2020 the 4th International Conference on Compute and Data Analysis (pp. 170–174)

  • Camerini AL, Marciano L, Carrara A, Schulz PJ (2020) Cyberbullying perpetration and victimization among children and adolescents: a systematic review of longitudinal studies. Telematics Inform 49:101362

    Article  Google Scholar 

  • Chatzakou D, Kourtellis N, Blackburn J, De Cristofaro E, Stringhini G, Vakali A (2017) Mean birds: Detecting aggression and bullying on twitter. In Proceedings of the 2017 ACM on web science conference (pp. 13–22)

  • Chavan VS, Shylaja SS (2015) Machine learning approach for detection of cyber-aggressive comments by peers on social media network. In 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) (pp. 2354–2358). IEEE

  • Cheng L, Guo R, Silva YN, Hall D, Liu H (2021) Modeling temporal patterns of cyberbullying detection with hierarchical attention networks. ACM/IMS Trans Data Sci 2(2):1–23

    Article  Google Scholar 

  • Cheng L, Li J, Silva YN, Hall DL, Liu H (2019) Xbully: Cyberbullying detection within a multi-modal context. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining (pp. 339–347)

  • Chen Y, Zhou Y, Zhu S, Xu H (2012) Detecting offensive language in social media to protect adolescent online safety. In 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing (pp. 71–80). IEEE

  • Dadvar M, De Jong F (2012) Cyberbullying detection: a step toward a safer internet yard. In Proceedings of the 21st International Conference on World Wide Web (pp. 121–126)

  • Dadvar M, Jong FD, Ordelman R, Trieschnigg D (2012) Improved cyberbullying detection using gender information. In Proceedings of the Twelfth Dutch-Belgian Information Retrieval Workshop (DIR 2012). University of Ghent

  • Dadvar M, Trieschnigg D, Ordelman R, de Jong F (2013) Improving cyberbullying detection with user context. In European Conference on Information Retrieval (pp. 693–696). Springer, Berlin, Heidelberg

  • Dey R, Bag S, Sarkar RR (2021) Identification of stable housekeeping genes for normalization of qPCR data in a pathogenic fungus. J Microbiol Methods 180:106106

    Google Scholar 

  • Dinakar K, Picard R, Lieberman H (2015) Common sense reasoning for detection, prevention, and mitigation of cyberbullying. In IJCAI International Joint Conference on Artificial Intelligence.

  • Dinakar K, Reichart R, Lieberman H (2011) Modeling the detection of textual cyberbullying. In Proceedings of the International Conference on Weblog and Social Media 2011

  • Divyashree VH, Deepashree NS (2016) An effective approach for cyberbullying detection and avoidance. International Journal of Innovative Research in Computer and Communication Engineering, 14

  • Djuraskovic O, Cyberbullying Statistics F (2020) and Trends with Charts: First Site Guide; 2020. Available from: https://firstsiteguide.com/cyberbullying-stats/

  • Elmezain M, Malki A, Gad I, Atlam ES (2022) Hybrid deep learning model–based prediction of images related to Cyberbullying. Int J Appl Math Comput Sci 32(2):323–334

    MATH  Google Scholar 

  • Fahrnberger G, Nayak D, Martha VS, Ramaswamy S (2014) SafeChat: A tool to shield children’s communication from explicit messages. In 2014 14th International Conference on Innovations for Community Services (I4CS) (pp. 80–86). IEEE

  • Fang Y, Yang S, Zhao B, Huang C (2021) Cyberbullying detection in social networks using bi-gru with self-attention mechanism. Information 12(4):171

    Article  Google Scholar 

  • Foong YJ, Oussalah M (2017), September Cyberbullying system detection and analysis. In 2017 European Intelligence and Security Informatics Conference (EISIC) (pp. 40–46). IEEE

  • Galán-García P, Puerta JGDL, Gómez CL, Santos I, Bringas PG (2016) Supervised machine learning for the detection of troll profiles in twitter social network: application to a real case of cyberbullying. Log J IGPL 24(1):42–53

    MathSciNet  Google Scholar 

  • García-Recuero Á (2016) Discouraging abusive behavior in privacy-preserving online social networking applications. In Proceedings of the 25th International Conference Companion on World Wide Web (pp. 305–309)

  • Ge S, Cheng L, Liu H (2021) Improving cyberbullying detection with user interaction. In Proceedings of the Web Conference 2021 (pp. 496–506)

  • Goodboy AK, Martin MM (2015) The personality profile of a cyberbully: examining the Dark Triad. Comput Hum Behav 49:1–4

    Article  Google Scholar 

  • Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press

  • Haidar B, Chamoun M, Serhrouchni A (2017a) Multilingual cyberbullying detection system: Detecting cyberbullying in Arabic content. In 2017 1st Cyber Security in Networking Conference (CSNet) (pp. 1–8). IEEE

  • Haidar B, Chamoun M, Serhrouchni A (2017b) A multilingual system for cyberbullying detection: arabic content detection using machine learning. Adv Sci Technol Eng Syst J 2(6):275–284

    Article  Google Scholar 

  • Hani J, Nashaat M, Ahmed M, Emad Z, Amer E, Mohammed A (2019) Social media cyberbullying detection using machine learning. Int J Adv Comput Sci Appl 10(5):703–707

    Google Scholar 

  • Hinduja S, Patchin JW (2010) Bullying, cyberbullying, and suicide. Archives of suicide research 14(3):206–221

    Article  Google Scholar 

  • Hosmer DW Jr, Lemeshow S, Sturdivant RX (2013) Applied Logistic Regression. Wiley

  • Hosseinmardi H, Mattson SA, Rafiq RI, Han R, Lv Q, Mishra S (2015b) Detection of cyberbullying incidents on the instagram social network. arXiv preprint arXiv:1503.03909

  • Hosseinmardi H, Mattson SA, Rafiq RI, Han R, Lv Q, Mishr S (2015a) Prediction of cyberbullying incidents on the instagram social network. arXiv preprint arXiv:1508.06257

  • Hosseinmardi H, Rafiq RI, Han R, Lv Q, Mishra S (2016) Prediction of cyberbullying incidents in a media-based social network. In 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (pp. 186–192). IEEE

  • Huang Q, Singh VK, Atrey PK (2014) Cyber bullying detection using social and textual analysis. In Proceedings of the 3rd International Workshop on Socially-Aware Multimedia (pp. 3–6)

  • Hutter F, Kotthoff L, Vanschoren J (2019) Automated machine learning: methods, systems, challenges. Springer

  • Kaity M, Balakrishnan V (2019) An automatic non-english sentiment lexicon builder using unannotated corpus. J Supercomputing 75(4):2243–2268

    Article  Google Scholar 

  • Kelleher JD, Tierney B, Tierney B (2018) Data science: an introduction. CRC Press

  • Kitchenham B, Charters S (2007) Guidelines for performing systematic literature reviews in software engineering. Tech Rep EBSE 1:1–57

  • Koutsou A, Tjortjis C (2018) Predicting hospital readmissions using random forests. IEEE J Biomedical Health Inf 22(1):122–130

    Google Scholar 

  • Kumar A, Nayak S, Chandra N (2019) Empirical analysis of supervised machine learning techniques for Cyberbullying detection. In International Conference on Innovative Computing and Communications (pp. 223–230). Springer, Singapore

  • Kumar A, Sachdeva N (2020) Multi-input integrative learning using deep neural networks and transfer learning for cyberbullying detection in real-time code-mix data. Multimedia Systems

  • Kumar A, Sachdeva N (2021) Multimodal cyberbullying detection using capsule network with dynamic routing and deep convolutional neural network. Multimedia Syst, 1–10

  • LeCun Y, Bengio Y, Hinton G (2015) Deep Learn Nat 521(7553):436–444

    Google Scholar 

  • Li W, Li X (2021) Cyberbullying among college students: the roles of individual, familial, and cultural factors. Int J Environ Res Public Health 18(11):1–17

    Google Scholar 

  • López-Vizcaíno MF, Nóvoa FJ, Carneiro V, Cacheda F (2021) Early detection of cyberbullying on social media networks. Future Generation Computer Systems 118:219–229

    Article  Google Scholar 

  • Lu N, Wu G, Zhang Z, Zheng Y, Ren Y, Choo KKR (2020) Cyberbullying detection in social media text based on character-level convolutional neural network with shortcuts. Concurrency and Computation: Practice and Experience, e5627

  • Maity K, Sen T, Saha S, Bhattacharyya P (2022) MTBullyGNN: a graph neural network-based Multitask Framework for Cyberbullying Detection. IEEE Transactions on Computational Social Systems

  • Malik CI, Radwan RB (2020) Adolescent victims of cyberbullying in Bangladesh- prevalence and relationship with psychiatric disorders. Asian J Psychiatr 48:101893

  • Mangaonkar A, Hayrapetian A, Raje R (2015) Collaborative detection of cyberbullying behavior in Twitter data. In 2015 IEEE international conference on electro/information technology (EIT) (pp. 611–616). IEEE

  • Manning CD, Raghavan P, Schütze H (2008) Introduction to Information Retrieval. Cambridge University Press

  • McEvoy MP, Williams MT (2021) Quality Assessment of systematic reviews and Meta-analyses of physical therapy interventions: a systematic review. Phys Ther 101(4):pzaa226

    Google Scholar 

  • Mercado RNM, Chuctaya HFC, Gutierrez EGC (2018) Automatic cyberbullying detection in spanish-language social networks using sentiment analysis techniques. Int J Adv Comput Sci Appl 9(7):228–235

    Google Scholar 

  • Monteiro RP, Santana MC, Santos RM, Pereira FC (2022) Cyberbullying victimization and mental health in higher education students: the mediating role of perceived social support. J interpers Violence, 1–23

  • Nahar V, Al-Maskari S, Li X, Pang C (2014) Semi-supervised learning for cyberbullying detection in social networks. In Australasian Database Conference (pp. 160–171). Springer, Cham

  • Nahar V, Unankard S, Li X, Pang C (2012) Sentiment analysis for effective detection of cyber bullying. Asia-Pacific Web Conference

  • Nandhini BS, Sheeba JI (2015) Online social network bullying detection using intelligence techniques. Procedia Comput Sci 45:485–492

    Article  Google Scholar 

  • Niu M, Yu L, Tian S, Wang X, Zhang Q (2020) Personal-bullying detection based on Multi-Attention and Cognitive Feature. Autom Control Comput Sci 54(1):52–61

    Article  Google Scholar 

  • Noviantho, Isa SM, Ashianti L (2018) Cyberbullying classification using text mining. In Proceedings - 2017 1st International Conference on Informatics and Computational Sciences, ICICoS 2017

  • Patil C, Salmalge S, Nartam P (2020) Cyberbullying detection on multiple SMPs using modular neural network. Advances in Cybernetics, Cognition, and machine learning for Communication Technologies. Springer, Singapore, pp 181–188

    Chapter  Google Scholar 

  • Pawar R, Raje RR (2019) Multilingual Cyberbullying Detection System. In 2019 IEEE International Conference on Electro Information Technology (EIT) (pp. 040–044). IEEE

  • Pires TM, Nunes IL (2019) Support vector machine for human activity recognition: a comprehensive review. Artif Intell Rev 52(3):1925–1962

    Google Scholar 

  • Pradhan A, Yatam VM, Bera P (2020) Self-Attention for Cyberbullying Detection. In 2020 International Conference on Cyber Situational Awareness, Data Analytics and Assessment (CyberSA) (pp. 1–6). IEEE

  • Pérez PJC, Valdez CJL, Ortiz MDGC, Barrera JPS, Pérez PF (2012) MISAAC: Instant messaging tool for cyberbullying detection. In Proceedings of the 2012 International Conference on Artificial Intelligence, ICAI 2012 (pp. 1049–1052)

  • Rafiq RI, Hosseinmardi H, Han R, Lv Q, Mishra S (2018) Scalable and timely detection of cyberbullying in online social networks. In Proceedings of the 33rd Annual ACM Symposium on Applied Computing (pp. 1738–1747)

  • Raisi E, Huang B (2018) Weakly supervised cyberbullying detection with participant-vocabulary consistency. Social Netw Anal Min 8(1):38

    Article  Google Scholar 

  • Reynolds K, Kontostathis A, Edwards L (2011) Using machine learning to detect cyberbullying. In 2011 10th International Conference on Machine learning and applications and workshops (Vol. 2, pp. 241–244). IEEE

  • Rosa H, Matos D, Ribeiro R, Coheur L, Carvalho JP (2018) A “deeper” look at detecting cyberbullying in social networks. In 2018 International Joint Conference on Neural Networks (IJCNN) (pp. 1–8). IEEE

  • Rosa H, Pereira N, Ribeiro R, Ferreira PC, Carvalho JP, Oliveira S, Coheur L, Paulino P, Veiga Simão AM, Trancoso I (2019) Automatic cyberbullying detection: A systematic review. Computers in Human Behavior, 93, 333–345

  • Salawu S, He Y, Lumsden J (2017) Approaches to automated detection of cyberbullying: a survey. IEEE Trans Affect Comput.

  • Sanchez H, Kumar S (2011) Twitter bullying detection. ser. NSDI, 12(2011), 15

  • Shah N, Maqbool A, Abbasi AF (2021) Predictive modeling for cyberbullying detection in social media. J Ambient Intell Humaniz Comput 12(6):5579–5594

    Google Scholar 

  • Singh A, Kaur, M (2020) Intelligent content-based cybercrime detection in online social networks using cuckoo search metaheuristic approach [Article]. J Supercomput 76(7):5402–5424

  • Singh VK, Ghosh S, Jose C (2017) Toward multimodal cyberbullying detection. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems (pp. 2090–2099)

  • Soni D, Singh VK (2018) See no evil, hear no evil: Audio-visual-textual cyberbullying detection. Proceedings of the ACM on Human-Computer Interaction, 2(CSCW), 1–26

  • Squicciarini A, Rajtmajer S, Liu Y, Griffin C (2015) Identification and characterization of cyberbullying dynamics in an online social network. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015 (pp. 280–285)

  • Sugandhi R, Pande A, Agrawal A, Bhagat H (2016) Automatic monitoring and prevention of cyberbullying. Int J Comput Appl 8:17–19

    Google Scholar 

  • Tahmasbi N, Rastegari E (2018) A socio-contextual approach in automated detection of public cyberbullying on Twitter. ACM Trans Social Comput 1(4):1–22

    Article  Google Scholar 

  • Tan SH, Zou W, Zhang J, Zhou Y (2020) Evaluation of machine learning algorithms for prediction of ground-level PM2.5 concentration using satellite-derived aerosol optical depth over China. Environ Sci Pollut Res 27(29):36155–36170

    Google Scholar 

  • Tarwani S, Jethanandani M, Kant V (2019) Cyberbullying Detection in Hindi-English Code-Mixed Language Using Sentiment Classification. In International Conference on Advances in Computing and Data Sciences (pp. 543–551). Springer, Singapore

  • Tomkins S, Getoor L, Chen Y, Zhang Y (2018) A socio-linguistic model for cyberbullying detection. In 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (pp. 53–60). IEEE

  • van Geel M, Goemans A, Toprak F, Vedder P (2017) Which personality traits are related to traditional bullying and cyberbullying? A study with the big five, Dark Triad and sadism. Pers Indiv Differ 106:231–235

    Article  Google Scholar 

  • Van Hee C, Jacobs G, Emmery C, Desmet B, Lefever E, Verhoeven B, …, Hoste V (2018) Automatic detection of cyberbullying in social media text. PLoS ONE, 13(10), e0203794

  • Van Hee C, Lefever E, Verhoeven B, Mennes J, Desmet B, De Pauw G, …, Hoste V (2015) Detection and fine-grained classification of cyberbullying events. In International Conference Recent Advances in Natural Language Processing (RANLP) (pp. 672–680)

  • Wang W, Xie X, Wang X, Lei L, Hu Q, Jiang S (2019) Cyberbullying and depression among chinese college students: a moderated mediation model of social anxiety and neuroticism. J Affect Disord 256:54–61

    Article  Google Scholar 

  • Whiting P, Savović J, Higgins JP et al (2016) ROBIS: a new tool to assess risk of bias in systematic reviews was developed. J Clin Epidemiol 69:225–234

    Article  Google Scholar 

  • Witten IH, Frank E, Hall MA (2016) Data Mining: practical machine learning tools and techniques, 4th edn. Morgan Kaufmann Publishers

  • Wright MF (2017) Cyberbullying in cultural context. J Cross-Cult Psychol 48(8):1136–1137

    Article  Google Scholar 

  • Wu J, Wen M, Lu R, Li B, Li J (2020) Toward efficient and effective bullying detection in online social network. Peer-to-Peer Netw Appl, 1–10

  • Wu T, Wen S, Xiang Y, Zhou W (2018) Twitter spam detection: survey of new approaches and comparative study. Computers & Security 76:265–284

    Article  Google Scholar 

  • Yin D, Xue Z, Hong L, Davison BD, Kontostathis A, Edwards L (2009) Detection of harassment on web 2.0. Proceedings of the Content Analysis in the WEB, 2, 1–7

  • Zhang X, Tong J, Vishwamitra N, Whittaker E, Mazer JP, Kowalski R, Hu H, Luo F, Macbeth J, Dillon E (2017) Cyberbullying detection with a pronunciation based convolutional neural network. In Proceedings - 2016 15th IEEE International Conference on Machine Learning and Applications, ICMLA 2016

  • Zhao R, Mao K (2017) Cyberbullying detection based on semantic-enhanced marginalized denoising auto-encoder. IEEE Trans Affect Comput 8(3), 328–339. Article 7412690

  • Zhao R, Zhou A, Mao K (2016) Automatic detection of cyberbullying on social networks based on bullying features. In Proceedings of the 17th international conference on distributed computing and networking (pp. 1–6)

  • Zhong H, Li H, Squicciarini AC, Rajtmajer SM, Griffin C, Miller DJ, Caragea C (2016) Content-Driven Detection of Cyberbullying on the Instagram Social Network. In IJCAI (pp. 3952–3958)

Download references

Author information

Authors and Affiliations

Authors

Contributions

VB wrote the original draft; performed analysis; revised the article; MK performed data collection; performed analysis; revised the article; All authors reviewed the manuscript.

Corresponding author

Correspondence to Vimala Balakrisnan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Balakrisnan, V., Kaity, M. Cyberbullying detection and machine learning: a systematic literature review. Artif Intell Rev 56 (Suppl 1), 1375–1416 (2023). https://doi.org/10.1007/s10462-023-10553-w

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10462-023-10553-w

Keywords

Navigation