Abstract
The widespread use of Social Media Platforms (SMP) such as Twitter, Instagram, Facebook, etc. by individuals has recently led to a remarkable increase in Cyberbullying (CB). It is a challenging task to prevent CB in such platforms since bullies use sarcasm or passive-aggressiveness strategies. This article proposes a new CB detection model named FAEO-ECNN for detecting and classifying cyberbullying on social media platforms. The proposed approach integrates Fuzzy Adaptive Equilibrium Optimization (FAEO) clustering-based topic modelling and Extended Convolutional Neural Network (ECNN) to enhance the accuracy of CB detection process. Initially, pre-processing is performed in order to cleanse the dataset. Next, the features are extracted using multiple models. The unsupervised Fuzzy Adaptive Equilibrium Optimization (FAEO) is utilized for discovering the latent topics from the pre-processed input data, which automatically examines the text data and creates clusters of words. Finally, the cyberbullying classification makes use of the ECNN and Rain Optimization (RO) algorithm to detect CB from posts/texts. We evaluated the proposed FAEO-ECNN thoroughly with two short text datasets: Real-world CB Twitter (RW-CB-Twitter) and Cyberbullying Menedely (CB-MNDLY) datasets in comparison to State of The Art (SoTA) models like Long Short Term Memory (LSTM), Bi-directional LSTM (BLSTM), RNN, and CNN-LSTM. The proposed FAEO-ECNN model outperformed the SoTA models in detecting Cyberbullying on SMP. It has obtained 92.91% of accuracy, 92.28% of recall, 92.53% of precision, and 92.40% of F-Measure over CB-MNDLY dataset. Moreover, it has achieved 91.89% of accuracy, 91.32% of recall, 91.81% of precision, and 91.56% of F-Measure on RW-CB-Twitter dataset.
Similar content being viewed by others
Data Availability
No data Availability.
References
Abdulwahab HM, Ajitha S, Saif MAN (2022) Feature selection techniques in the context of big data: taxonomy and analysis. Appl Intell 52:13568–13613. https://doi.org/10.1007/s10489-021-03118-3
Agarwal A, Chivukula AS, Bhuyan MH, Jan T, Narayan B, and Prasad M (2020) Identification and Classification of Cyberbullying Posts: A Recurrent Neural Network Approach Using Under-Sampling and Class Weighting. in Yang H. et al. (eds) Neural Information Processing ICONIP 2020. Communications in Computer and Information Science. Cham: Springer, Cham, vol. 1333, pp. 113–120. https://doi.org/10.1007/978-3-030-63823-8_14.
Agrawal S, Awekar A (2018) Deep learning for detecting cyberbullying across multiple social media platforms. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer, Cham, vol. 10772 LNCS, pp. 141–153. https://doi.org/10.1007/978-3-319-76941-7_11
Aind AT, Ramnaney A, and Sethia D (2020) Q-Bully: A reinforcement learning based cyberbullying detection framework, in 2020 International Conference for Emerging Technology, INCET 2020. IEEE, pp. 1–6. https://doi.org/10.1109/INCET49848.2020.9154092
Akhter MP, Jiangbin Z, Naqvi IR, AbdelMajeed M, Zia T (2021) Abusive language detection from social media comments using conventional machine learning and deep learning approaches. Multimed Syst, (0123456789). https://doi.org/10.1007/s00530-021-00784-8
Al-Ajlan MA, Ykhlef M (2018) Optimized Twitter Cyberbullying Detection based on Deep Learning. in 2018 21st Saudi Computer Society National Computer Conference (NCC). IEEE, pp. 1–5. https://doi.org/10.1109/NCG.2018.8593146
Alam KS, Bhowmik S and Prosun PRK. (2021) Cyberbullying Detection: An Ensemble Based Machine Learning Approach. In 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), IEEE. IEEE, pp. 710–715. https://doi.org/10.1109/ICICV50876.2021.9388499
Alduailaj AM, Belghith A (2023) Detecting Arabic Cyberbullying Tweets Using Machine Learning. Mach Learn Knowl Extract 5(1):29–42. https://doi.org/10.3390/make5010003
Al-Hassan A, Al-Dossari H (2021) Detection of hate speech in Arabic tweets using deep learning. Multimed Syst 28:1963–1974. https://doi.org/10.1007/s00530-020-00742-w
Balakrishnan V, Khan S, Arabnia HR (2020) Improving cyberbullying detection using Twitter users’ psychological features and machine learning. Comput Secur 90:101710. https://doi.org/10.1016/j.cose.2019.101710
BalakrishnanV Khan S, Fernandez T, Arabnia HR (2019) Cyberbullying detection on twitter using Big Five and Dark Triad features. Person Individ Differ 141(September 2018):252–257. https://doi.org/10.1016/j.paid.2019.01.024
Banerjee V, Telavane J, Gaikwad P, Vartak P (2019) Detection of Cyberbullying Using Deep Neural Network. In 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS). IEEE, pp. 604–607. https://doi.org/10.1109/ICACCS.2019.8728378
Bezdek JC, Ehrlich R, Full W (1984) FCM: The fuzzy c-means clustering algorithm’. Comput Geosci 10(2–3):191–203. https://doi.org/10.1109/IGARSS.1988.569600
Chen H, Li C-T (2020) HENIN: Learning Heterogeneous Neural Interaction Networks for Explainable Cyberbullying Detection on Social Media. Available at: http://arxiv.org/abs/2010.04576
Chen Z, Mukherjee A, Liu B, Hsu M, Castellanos M, Ghosh R (2013) Leveraging multi-domain prior knowledge in topic modelsIJCAI Twenty-Third International Joint Conference on Artificial Intelligence International Joint Conference on. Artif Intell 13:2071–2077
Chen J, Yan S, Wong KC (2020) Verbal aggression detection on Twitter comments: convolutional neural network for short-text sentiment analysis. Neural Comput Appl 32(15):10809–10818. https://doi.org/10.1007/s00521-018-3442-0
Cheng L, Li J, Silva Y, Hall D, Liu H (2019) PI-Bully: Personalized Cyberbullying Detection with Peer Influence. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence. Calif Int Joint Conf Artif Intell Organ, pp. 5829–5835. https://doi.org/10.24963/ijcai.2019/808
Cheng X, Yan X, Lan Y, Guo J (2014) BTM: Topic Modeling over Short Texts. IEEE Trans Knowl Data Eng 26(12):2928–2941. https://doi.org/10.1109/TKDE.2014.2313872
Chia ZL, Ptaszynski M, Masui F, Leliwa G, Wroczynski M (2021) Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection. Inf Process Manag 58(4):102600. https://doi.org/10.1016/j.ipm.2021.102600
Cortis K, Handschuh S (2015) Analysis of cyberbullying tweets in trending world events. In Proceedings of the 15th International Conference on Knowledge Technologies and Data-driven Business. New York, NY, USA: ACM, pp. 1–8. https://doi.org/10.1145/2809563.2809605
Dadvar M, Eckert K (2018) Cyberbullying Detection in Social Networks Using Deep Learning Based Models: A Reproducibility Study. In arXiv preprint arXiv:1812.08046. arXiv:1812.08046
Dadvar M, Trieschnigg D, Ordelman R, Jong FD (2013) Improving Cyberbullying Detection with User Context. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 7814 LNCS, pp. 693–696. https://doi.org/10.1007/978-3-642-36973-5_62
Dalvi RR, Baliram Chavan S, Halbe A (2020) Detecting A Twitter Cyberbullying Using Machine Learning. In: Proceedings of the International Conference on Intelligent Computing and Control Systems, ICICCS 2020. ICICCS, pp 297–301. https://doi.org/10.1109/ICICCS48265.2020.9120893
Edo-Osagie O, Lake I, Edeghere O, Iglesia BDL (2019) Attention-Based Recurrent Neural Networks (RNNs) for Short Text Classification: An Application in Public Health Monitoring. In 15th International Work-Conference on Artificial Neural Networks. Springer, Cham., pp. 895–911. https://doi.org/10.1007/978-3-030-20521-8_73
Elsafoury F (2020) Cyberbullying datasets. Mendeley Data V1. https://doi.org/10.17632/jf4pzyvnpj.1
Eronen J, Ptaszynski M, Masui F, Pohl A, Leliwa G, Wroczynski M (2021) Improving classifier training efficiency for automatic cyberbullying detection with Feature Density. Inf Process Manage 58(5):02616. https://doi.org/10.1016/j.ipm.2021.102616
Fang Y, Yang S, Zhao B, Huang C (2021) Cyberbullying Detection in Social Networks Using Bi-GRU with Self-Attention Mechanism. Inf 12(4):171. https://doi.org/10.3390/info12040171
Galán-García P, Puerta JGDL, Gómez CL, Santos I, Bringas PG (2014) Supervised machine learning for the detection of troll profiles in twitter social network: Application to a real case of cyberbullying. Logic J IGPL 24(1):42–53. https://doi.org/10.1093/jigpal/jzv048
Gambäck B, Sikdar UK (2017) Using Convolutional Neural Networks to Classify Hate-Speech. In Proceedings of the first workshop on abusive language online. ACL, pp. 85–90
Gao Z, Zhao J, LI S-R, HU R-R (2020) The Improved Equilibrium Optimization Algorithm with Tent Map. In 2020 5th International Conference on Computer and Communication Systems (ICCCS). IEEE, pp. 343–346. https://doi.org/10.1109/ICCCS49078.2020.9118477
Hee C et al (2018) Automatic detection of cyberbullying in social media text’. Plos One 13(10):1–22. https://doi.org/10.1371/journal.pone.0203794
Huang Q, Inkpen D, Zhang J, Van Bruwaene D (2018) Cyberbullying Intervention Based on Convolutional Neural Networks. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018). Association for Computational Linguistics, Santa Fe, New Mexico, USA, pp 42–51. Available at: https://www.bbc.co.uk/news/10302550. Accessed 7 Nov 2014
Huang Q, Singh VK, Atrey PK. (2014) Cyber Bullying Detection Using Social and Textual Analysis. In Proceedings of the 3rd International Workshop on Socially-Aware Multimedia - SAM ’14. ACM Press, New York, USA, pp 3–6. https://doi.org/10.1145/2661126.2661133
Iwendi C, Srivastava G, Khan S, and Maddikunta PKR (2020) Cyberbullying detection solutions based on deep learning architectures. Multimed Syst 1–14. https://doi.org/10.1007/s00530-020-00701-5
Kaur S, Kumar P, Kumaraguru P (2020) Automating fake news detection system using multi-level voting model’. Soft Comput 24(12):9049–9069. https://doi.org/10.1007/s00500-019-04436-y
Khodabakhsh M, Kahani M, Bagheri E (2020) Predicting future personal life events on twitter via recurrent neural networks. J Intell Inf Syst 54(1):101–127. https://doi.org/10.1007/s10844-018-0519-2
Kumar A, Sachdeva N (2021b) Multimodal cyberbullying detection using capsule network with dynamic routing and deep convolutional neural network. Multimed Syst (0123456789). https://doi.org/10.1007/s00530-020-00747-5
Kumar A, Sachdeva N (2021) A Bi-GRU with attention and CapsNet hybrid model for cyberbullying detection on social media. World Wide Web. https://doi.org/10.1007/s11280-021-00920-4
Kumari K, Singh JP (2021) Identification of cyberbullying on multi-modal social media posts using genetic algorithm. Trans Emerg Telecommun Technol 32(2):1–13. https://doi.org/10.1002/ett.3907
Kumari K, Singh JP, Dwivedi YK, Rana NP (2020) Towards Cyberbullying-free social media in smart cities: a unified multi-modal approach. Soft Comput 24(15):11059–11070. https://doi.org/10.1007/s00500-019-04550-x
Kumari K, Singh JP, Dwivedi YK, Rana NP (2021) Bilingual Cyber-aggression detection on social media using LSTM autoencoder. Soft Comput 25(14):8999–9012. https://doi.org/10.1007/s00500-021-05817-y
l-garadi MA, Varathan KD, Ravana SD (2016) Cybercrime detection in online communications: The experimental case of cyberbullying detection in the Twitter network. Comput Hum Behav 63:433–443. https://doi.org/10.1016/j.chb.2016.05.051
Liang W, Feng R, Liu X, Li Y, Zhang X (2018) GLTM: A Global and Local Word Embedding-Based Topic Model for Short Texts. IEEE Access 6:43612–43621. https://doi.org/10.1109/ACCESS.2018.2863260
Liu Z, Qin T, Chen K-J, Li Y (2020) Collaboratively Modeling and Embedding of Latent Topics for Short Texts. IEEE Access 8:99141–99153. https://doi.org/10.1109/ACCESS.2020.2997973
Lu N, Wu G, Zhang Z, Zheng Y, Ren Y, Choo KR (2020) Cyberbullying detection in social media text based on character-level convolutional neural network with shortcuts. Concurr Comput Practice Exp 32(23):1–11. https://doi.org/10.1002/cpe.5627
Mishna F, Khoury-Kassabri M, Gadalla T, Daciuk J (2012) ‘Risk factors for involvement in cyber bullying: Victims, bullies and bully-victims. Child Youth Serv Rev 34(1):63–70. https://doi.org/10.1016/j.childyouth.2011.08.032
Moazzeni AR, Khamehchi E (2020) Rain optimization algorithm (ROA): A new metaheuristic method for drilling optimization solutions’. J Pet Sci Eng 195:107512. https://doi.org/10.1016/j.petrol.2020.107512
Muneer A, Fati SM (2020) A comparative analysis of machine learning techniques for cyberbullying detection on twitter. Future Int 12(11):1–21. https://doi.org/10.3390/fi12110187
Murshed BAH, Abawajy J, Mallappa S, Saif MAN, Al-ariki HDE (2022) DEA-RNN: A Hybrid Deep Learning Approach for Cyberbullying Detection in Twitter Social Media Platform. IEEE Access 10:25857–25871. https://doi.org/10.1109/ACCESS.2022.3153675
Murshed BAH, Abawajy J, Mallappa S, Saif MAN, Al-Ghuribi SM, Ghanem FA (2020) Enhancing Big Social Media Data Quality for Use in Short-Text Topic Modeling. IEEE Access 10:105328–105351. https://doi.org/10.1109/ACCESS.2022.3211396
Murshed BAH, Al-ariki HDE, Mallappa S (2020) Semantic Analysis Techniques using Twitter Datasets on Big Data: Comparative Analysis Study. Comput Syst Sci Eng 35(6):495–512. https://doi.org/10.32604/csse.2020.35.495
Murshed BAH, Mallappa S, Abawajy J, Mallappa S, Saif MAN, Al-ariki HDE, Abdulwahab HM (2022) Short text topic modelling approaches in the context of big data: taxonomy, survey, and analysis. Artif Intell Rev. https://doi.org/10.1007/s10462-022-10254-w
Murshed BAH, Mallappa S, Ghaleb OAM, Al-ariki HDE,. (2021) Efficient Twitter Data Cleansing Model for Data Analysis of the Pandemic Tweets. In Studies in Systems, Decision and Control. Springer, Cham, 348, pp. 93–114https://doi.org/10.1007/978-3-030-67716-9_7
Nand P, Perera R, Kasture A (2016) “ How Bullying is this Message ?”: A Psychometric Thermometer for Bullying. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. The COLING 2016 Organizing Committee, pp. 695–706. Available at: https://aclanthology.org/C16-1067
Paul S, Saha S (2020) CyberBERT: BERT for cyberbullying identification. Multimed Syst (0123456789). https://doi.org/10.1007/s00530-020-00710-4
Paul S, Saha S, Hasanuzzaman M (2020) ‘Identification of cyberbullying: A deep learning based multimodal approach. Multimed Tools Appl 81(19):26989–27008. https://doi.org/10.1007/s11042-020-09631-w
Pennington J, Socher R, Manning CD (2014) GloVe: Global Vectors for Word Representation Jeffrey. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp. 1532–1543
Pericherla S, Ilavarasan E (2021) Transformer network-based word embeddings approach for autonomous cyberbullying detection. Int J Intell Unmanned Syst. https://doi.org/10.1108/IJIUS-02-2021-0011
Pitsilis GK, Ramampiaro H, Langseth H (2018) Effective hate-speech detection in Twitter data using recurrent neural networks. Appl Intell 48(12):4730–4742. https://doi.org/10.1007/s10489-018-1242-y
Purnamasari NMGD, Fauzi MA, Indriati Dewi LS (2020) Cyberbullying identification in twitter using support vector machine and information gain based feature selection. Indones J Electric Eng Comput Sci 18(3):1494–1500. https://doi.org/10.11591/ijeecs.v18.i3.pp1494-1500
rochier R, Guille A, Velcin J (2019) Global Vectors for Node Representations. In The World Wide Web Conference on - WWW ’19. New York, New York, USA: ACM Press, pp. 2587–2593. https://doi.org/10.1145/3308558.3313595
Rosa H, Matos D, Ribeiro R, Coheur L, Carvalho JP (2018) A 'Deeper' Look at Detecting Cyberbullying in Social Networks. In 2018 International Joint Conference on Neural Networks (IJCNN). IEEE, pp. 1–8. https://doi.org/10.1109/IJCNN.2018.8489211
Roy PK, Mali FU (2022) Cyberbullying detection using deep transfer learning. Complex Intell Syst 8(6):5449–5467. https://doi.org/10.1007/s40747-022-00772-z
Singh J, Singh AK (2020) NSLPCD: Topic based tweets clustering using Node significance based label propagation community detection algorithm. Ann Math Artif Intell 1–37. https://doi.org/10.1007/s10472-020-09709-z.
Squicciarini A, Rajtmajer S, Liu Y, and Griffin C (2015) Identification and characterization of cyberbullying dynamics in an online social network. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015. New York, NY, USA: ACM, pp. 280–285. https://doi.org/10.1145/2808797.2809398
Srinath AS, Johnson H, Dagher GG, Long M (2021) BullyNet: Unmasking Cyberbullies on Social Networks. IEEE Trans Comput Soc Syst 8(2):332–344. https://doi.org/10.1109/TCSS.2021.3049232
Talpur BA, O’Sullivan D (2020) Multi-Class Imbalance in Text Classification: A Feature Engineering Approach to Detect Cyberbullying in Twitter. Inf 7(4):52. https://doi.org/10.3390/informatics7040052
Tripathy JK, Chakkaravarthy SS, Satapathy SC, Sahoo M, and Vaidehi V (2020) ALBERT-based fine-tuning model for cyberbullying analysis. Multimed Syst 28:1941–1949. https://doi.org/10.1007/s00530-020-00690-5
Vivolo-Kantor AM, Martell BN, Holland KM, Westby R (2014) A systematic review and content analysis of bullying and cyber-bullying measurement strategies. Aggress Violent Beh 19(4):423–434. https://doi.org/10.1016/j.avb.2014.06.008
Wang XD, Chen RC, Yan F, Zeng ZQ, Hong CQ (2019) Fast Adaptive K-Means Subspace Clustering for High-Dimensional Data. IEEE Access 7:42639–42651. https://doi.org/10.1109/ACCESS.2019.2907043
Wang K, Xiong Q, Wu C, Gao M, Yu Y (2020) Multi-modal cyberbullying detection on social networks. Proceed Int Joint Conf Neural Netw. https://doi.org/10.1109/IJCNN48605.2020.9206663
Yan F, X-dong W, Z-qiang Z, C-qun H (2020) Adaptive multi-view subspace clustering for high-dimensional data’. Pattern Recogn Lett 130:299–305. https://doi.org/10.1016/j.patrec.2019.01.016
Yuvaraj N et al (2021) Automatic detection of cyberbullying using multi-feature based artificial intelligence with deep decision tree classification. Comput Electric Eng 92:107186. https://doi.org/10.1016/j.compeleceng.2021.107186
Yuvaraj N et al (2021) Nature-Inspired-Based Approach for Automated Cyberbullying Classification on Multimedia Social Networking. Math Probl Eng 2021:1–12. https://doi.org/10.1155/2021/6644652
Zhang X et al (2016) Cyberbullying Detection with a Pronunciation Based Convolutional Neural Network. In 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA). IEEE, pp. 740–745. https://doi.org/10.1109/ICMLA.2016.0132
Zhang Y, Ramesh A (2018) Fine-Grained Analysis of Cyberbullying Using Weakly-Supervised Topic Models. In 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA). IEEE, pp. 504–513. https://doi.org/10.1109/DSAA.2018.00065
Zhang Z, Robinson D, Tepper J (2018) Detecting Hate Speech on Twitter Using a Convolution-GRU Based Deep Neural Network. In GangemiAnna, A. et al. (eds) The European semantic web conference. ESWC 2018. Lecture Notes in Computer Science. Springer, Cham, pp. 745–760. https://doi.org/10.1007/978-3-319-93417-4_48
Zhao Y, Karypis G (2001) Criterion functions for document clustering: Experiments and analysis
Zhao R, Mao K (2016) Cyberbullying Detection based on Semantic-Enhanced Marginalized Denoising Auto-Encoder. IEEE Trans Affect Comput 8(3):328–339. https://doi.org/10.1109/TAFFC.2016.2531682
Zhou C, Sun C, Liu Z, Lau FCM (2015) A C-LSTM Neural Network for Text Classification. arXiv preprint arXiv:1511.08630. arXiv:1511.08630
Zhou K, Yang S (2020) Effect of cluster size distribution on clustering: a comparative study of k-means and fuzzy c-means clustering. Pattern Anal Appl 23(1):455–466. https://doi.org/10.1007/s10044-019-00783-6
Zuo Y, Wu J, Zhang H, Lin H, Xu K, Xiong H (2016) Topic Modeling of Short Texts: A Pseudo-Document View. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD 2016), pp. 2105–2114. https://doi.org/10.1145/2939672.2939880
Funding
No funding is provided for the preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
Belal Abdullah Hezam Murshed: Methodology, Conceptualization, Validation, Data curation, Formal analysis, Investigation, Software, Visualization, Writing—Original Draft, Writing—Review & Editing. Suresha: Supervision, Methodology, Resources, Data curation. Jemal Abawajy: Methodology, Supervision, Formal analysis, Data curation, Writing—Review & Editing, Supervision. Mufeed Ahmed Naji: Investigation, Formal analysis, Data curation, Writing—Review & Editing. Hudhaifa Mohammed Abdulwahab: Resources, Writing—Review & Editing. Fahd A Ghanem: Resources, Writing—Review & Editing.
Corresponding author
Ethics declarations
Conflict of Interest
Authors declare that they have no conflict of interest.
Ethical Approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Consent to Publish.
Reviewer and Editors can publish this work.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Murshed, B.A.H., Suresha, Abawajy, J. et al. FAEO-ECNN: cyberbullying detection in social media platforms using topic modelling and deep learning. Multimed Tools Appl 82, 46611–46650 (2023). https://doi.org/10.1007/s11042-023-15372-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15372-3