BERT-based ensemble learning for multi-aspect hate speech detection

Mazari, Ahmed Cherif; Boudoukhani, Nesrine; Djeffal, Abdelhamid

doi:10.1007/s10586-022-03956-x

BERT-based ensemble learning for multi-aspect hate speech detection

Published: 03 January 2023

Volume 27, pages 325–339, (2024)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Ahmed Cherif Mazari¹,
Nesrine Boudoukhani^1,2 &
Abdelhamid Djeffal³

1658 Accesses
13 Citations
Explore all metrics

Abstract

The social media world nowadays is overwhelmed with unfiltered content ranging from cyberbullying and cyberstalking to hate speech. Therefore, identifying and cleaning up such toxic language presents a big challenge and an active area of research. This study is dedicated to multi-aspect hate speech detection based on classifying text in multi-labels including ‘identity hate’, ‘threat’, ‘insult’, ‘obscene’, ‘toxic’ and ‘severe toxic’. The proposed approach is based on the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model combined with Deep Learning (DL) models to compose several ensemble learning architectures. The DL models used are built by stacking Bidirectional Long-Short Term Memory (Bi-LSTM) and/or Bidirectional Gated Recurrent Unit (Bi-GRU) on GloVe and FastText word embeddings. Whereby, these models and BERT are trained individually on multi-label hateful dataset and used in combination for hate speech detection tasks on social media. Thus, we demonstrate that encoding texts by using recent word embedding techniques as FastText and GloVe alongside Bi-LSTM and Bi-GRU can create models that, when combined with BERT, can enhance the ROC-AUC score to 98.63%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fake news, disinformation and misinformation in social media: a review

Article 09 February 2023

Sentiment Analysis in Social Media Data for Depression Detection Using Artificial Intelligence: A Review

Article 19 November 2021

Deepfakes generation and detection: state-of-the-art, open challenges, countermeasures, and way forward

Article 04 June 2022

Data availability

The data that support the findings of this study are publicly available in kaggle Datasets (www.kaggle.com/competitions/jigsaw-toxic-comment-classification-challenge/data).

Notes

References

Kaplan, A.M., Haenlein, M.: Users of the world, unite! The challenges and opportunities of Social Media. Bus. Horiz. 53, 59–68 (2010). https://doi.org/10.1016/j.bushor.2009.09.003
Article Google Scholar
Smith, P.K., Mahdavi, J., Carvalho, M., Fisher, S., Russell, S., Tippett, N.: Cyberbullying: its nature and impact in secondary school pupils. J. Child Psychol. Psychiatry. 49, 376–385 (2008). https://doi.org/10.1111/j.1469-7610.2007.01846.x
Article PubMed Google Scholar
Grigg, D.W.: Cyber-aggression: definition and concept of cyberbullying. J. Psychol. Couns. Sch. 20, 143–156 (2010). https://doi.org/10.1375/ajgc.20.2.143
Article Google Scholar
Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated Hate Speech Detection and the Problem of Offensive Language. In: Proceedings of the 11th International Conference on Web and Social Media, ICWSM 2017. 512–515 (2017)
Fortuna, P., Nunes, S.: A survey on automatic detection of hate speech in text. ACM Comput. Surv. 51, 1–30 (2018)
Article Google Scholar
Basile, V., Bosco, C., Fersini, E., Debora, N., Patti, V., Pardo, F.M.R., Rosso, P., Sanguinetti, M.: Semeval-2019 task 5: Multilingual detection of hate speech against immigrants and women in twitter. In: 13th International Workshop on Semantic Evaluation. pp. 54–63. Association for Computational Linguistics (2019)
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Advances in neural information processing systems. pp. 5998–6008 (2017)
Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv Prepr. http://arxiv.org/1810.04805. (2018)
Badjatiya, P., Gupta, S., Gupta, M., Varma, V.: Deep Learning for Hate Speech Detection in Tweets. In: Proceedings of the 26th International Conference on World Wide Web Companion - WWW ’17 Companion. pp. 759–760. ACM Press, New York, New York, USA (2017). https://doi.org/10.1145/3041021.3054223
Waseem, Z., Hovy, D.: Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter. In: Proceedings of the NAACL Student Research Workshop. pp. 88–93. Association for Computational Linguistics, Stroudsburg, PA, USA, PA, USA (2016). https://doi.org/10.18653/v1/N16-2013
Srivastava, S., Khurana, P., Tewari, V.: Identifying aggression and toxicity in comments using capsule network. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018). pp. 98–105 (2018)
Aroyehun, S.T., Gelbukh, A.: Aggression detection in social media Using deep neural networks, data augmentation, and pseudo labeling. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018). pp. 90–97 (2018)
Agrawal, S., Awekar, A.: Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms. In: European Conference on Information Retrieval. pp. 303–315 (2018). https://doi.org/10.1007/978-3-319-76941-7
Founta, A.M., Chatzakou, D., Kourtellis, N., Blackburn, J., Vakali, A., Leontiadis, I.: A Unified Deep Learning Architecture for Abuse Detection. In: Proceedings of the 10th ACM Conference on Web Science - WebSci ’19. pp. 105–114. ACM Press, New York, New York, USA (2019). https://doi.org/10.1145/3292522.3326028
Mossie, Z., Wang, J.H.: Vulnerable community identification using hate speech detection on social media. Inf. Process. Manag. 57, 102087 (2020)
Article Google Scholar
Kapil, P., Ekbal, A.: A deep neural network based multi-task learning approach to hate speech detection. Knowledge-Based Syst. 210, 106458 (2020). https://doi.org/10.1016/j.knosys.2020.106458
Article Google Scholar
Mangaonkar, A., Pawar, R., Chowdhury, N.S., Raje, R.R.: Enhancing collaborative detection of cyberbullying behavior in Twitter data. Cluster Comput. 25, 1263–1277 (2022). https://doi.org/10.1007/s10586-021-03483-1
Article Google Scholar
Kammakomati, M., Tarun Kumar, P. V, Radhika, K.: Comparison of Machine Learning Algorithms for Hate and Offensive Speech Detection. In: Evolutionary Computing and Mobile Sustainable Networks. pp. 873–881. Springer (2022)
Liu, P., Li, W., Zou, L.: NULI at SemEval-2019 Task 6: Transfer Learning for Offensive Language Detection using Bidirectional Transformers. In: Proceedings of the 13th International Workshop on Semantic Evaluation. pp. 87–91. Association for Computational Linguistics, Stroudsburg, PA, USA (2019). https://doi.org/10.18653/v1/S19-2011
Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval). In: Proceedings of the 13th International Workshop on Semantic Evaluation. 652–656 (2019). https://doi.org/10.18653/v1/S19-2116
Mozafari, M., Farahbakhsh, R., Crespi, N.: A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media. In: International Conference on Complex Networks and Their Applications. pp. 928–940 (2020). https://doi.org/10.1007/978-3-030-36687-2_77
Waseem, Z.: Are you a racist or am i seeing things annotator influence on hate speech detection on twitter. In: Proceedings of the first workshop on NLP and computational social science. pp. 138–142 (2016)
Modha, S., Majumder, P., Mandl, T., Mandalia, C.: Detecting and visualizing hate speech in social media: a cyber Watchdog for surveillance. Expert Syst. Appl. 161, 113725 (2020). https://doi.org/10.1016/j.eswa.2020.113725
Article Google Scholar
Pamungkas, E.W., Basile, V., Patti, V.: Misogyny detection in twitter: a multilingual and cross-domain study. Inf. Process. Manag. 57, 102360 (2020). https://doi.org/10.1016/j.ipm.2020.102360
Article Google Scholar
Wei, B., Li, J., Gupta, A., Umair, H., Vovor, A., Durzynski, N.: Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning. arXiv Prepr. http://arxiv.org/2108.03305. (2021)
Plaza-del-Arco, F.M., Molina-González, M.D., Urena-López, L.A., Martín-Valdivia, M.T.: Comparing pre-trained language models for Spanish hate speech detection. Expert Syst. Appl. 166, 114120 (2021)
Article Google Scholar
Rosa, H., Matos, D., Ribeiro, R., Coheur, L., Carvalho, J.P.: A “Deeper” Look at Detecting Cyberbullying in Social Networks. In: 2018 International Joint Conference on Neural Networks (IJCNN). pp. 1–8. IEEE (2018). https://doi.org/10.1109/IJCNN.2018.8489211
Kim, Y.: Convolutional Neural Networks for Sentence Classification. arXiv Prepr. http://arxiv.org/1408.5882. (2014)
Zhou, C., Sun, C., Liu, Z., Lau, F.C.M.: A C-LSTM Neural Network for Text Classification. arXiv Prepr. http://arxiv.org/1511.08630. (2015)
Ghosh, A., Veale, T.: Fracking sarcasm using neural network. In: Proceedings of the 7th workshop on computational approaches to subjectivity, sentiment and social media analysis. pp. 161–169 (2016)
Pitsilis, G.K., Ramampiaro, H., Langseth, H.: Effective hate-speech detection in Twitter data using recurrent neural networks. Appl. Intell. 48, 4730–4742 (2018). https://doi.org/10.1007/s10489-018-1242-y
Article Google Scholar
Mahata, D., Zhang, H., Uppal, K., Kumar, Y., Shah, R., Shahid, S., Mehnaz, L., Anand, S.: MIDAS at SemEval-2019 task 6 Identifying offensive posts and targeted offense from twitter. In: Proceedings of the 13th International Workshop on Semantic Evaluation. pp. 683–690 (2019)
Sun, X., Zhang, C., Li, L.: Dynamic emotion modelling and anomaly detection in conversation based on emotional transition tensor. Inf. Fusion. 46, 11–22 (2019)
Article Google Scholar
Sadiq, S., Mehmood, A., Ullah, S., Ahmad, M., Choi, G.S., On, B.-W.: Aggression detection through deep neural model on Twitter. Futur. Gener. Comput. Syst. 114, 120–129 (2020)
Article Google Scholar
Nascimento, F.R.S., Cavalcanti, G.D.C., Da Costa-Abreu, M.: Unintended bias evaluation: an analysis of hate speech detection and gender bias mitigation on social media using ensemble learning. Expert Syst. Appl. 201, 117032 (2022)
Article Google Scholar
Lin, S.Y., Kung, Y.C., Leu, F.Y.: Predictive intelligence in harmful news identification by BERT-based ensemble learning model with text sentiment analysis. Inf. Process. Manag. 59, 102872 (2022)
Article Google Scholar
Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017)
Article Google Scholar
Pennington, J., Socher, R., Manning, C.D.: Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). pp. 1532–1543 (2014)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. 1st International Conference Learn. Represent. ICLR 2013 - Work. Track Proc. 1–12 (2013)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735
Article CAS PubMed Google Scholar
Cho, K., van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y.: Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv Prepr. http://arxiv.org/1406.1078. (2014)
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323, 533–536 (1986)
Article ADS Google Scholar
Hochreiter, S., Bengio, Y., Frasconi, P., Schmidhuber, J.: Gradient flow in recurrent nets the difficulty of learning long-term dependencies. IEEE Press, Piscataway (2001)
Google Scholar
Schuster, M., Paliwal, K.K.: Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45, 2673–2681 (1997)
Article ADS Google Scholar
Lu, N., Wu, G., Zhang, Z., Zheng, Y., Ren, Y., Choo, K.K.R.: Cyberbullying detection in social media text based on character-level convolutional neural network with shortcuts. Concurr. Comput. Pract. Exp. (2020). https://doi.org/10.1002/cpe.5627
Article Google Scholar
Ratadiya, P., Mishra, D.: An Attention Ensemble Based Approach for Multilabel Profanity Detection. In: 2019 International Conference on Data Mining Workshops (ICDMW). pp. 544–550. IEEE (2019). https://doi.org/10.1109/ICDMW.2019.00083
Saeed, H.H., Shahzad, K., Kamiran, F.: Overlapping Toxic Sentiment Classification Using Deep Neural Architectures. In: 2018 IEEE International Conference on Data Mining Workshops (ICDMW). pp. 1361–1366. IEEE (2018). https://doi.org/10.1109/ICDMW.2018.00193

Download references

Funding

No funding sources are applicable for this research.

Author information

Authors and Affiliations

LSEA Laboratory, Mathematics and Computer Science Department, University of Médéa, Médéa, Algeria
Ahmed Cherif Mazari & Nesrine Boudoukhani
EEDIS Laboratory, Computer Science Department, University of Sidi Bel Abbès, Sidi Bel Abbès, Algeria
Nesrine Boudoukhani
LESIA Laboratory, Computer Science Department, University of Biskra, Biskra, Algeria
Abdelhamid Djeffal

Authors

Ahmed Cherif Mazari
View author publications
You can also search for this author in PubMed Google Scholar
Nesrine Boudoukhani
View author publications
You can also search for this author in PubMed Google Scholar
Abdelhamid Djeffal
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ACM, NB: Conceptualization. NB: Testing. ACM, NB: Analysis and investigation, Writing original draft. ACM, NB & AD: Review and editing.

Corresponding author

Correspondence to Ahmed Cherif Mazari.

Ethics declarations

Conflict of interest

Not applicable.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Mazari, A.C., Boudoukhani, N. & Djeffal, A. BERT-based ensemble learning for multi-aspect hate speech detection. Cluster Comput 27, 325–339 (2024). https://doi.org/10.1007/s10586-022-03956-x

Download citation

Received: 13 July 2022
Revised: 19 September 2022
Accepted: 21 December 2022
Published: 03 January 2023
Issue Date: February 2024
DOI: https://doi.org/10.1007/s10586-022-03956-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

BERT-based ensemble learning for multi-aspect hate speech detection

Abstract

Access this article

Similar content being viewed by others

Fake news, disinformation and misinformation in social media: a review

Sentiment Analysis in Social Media Data for Depression Detection Using Artificial Intelligence: A Review

Deepfakes generation and detection: state-of-the-art, open challenges, countermeasures, and way forward

Data availability

Notes

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

BERT-based ensemble learning for multi-aspect hate speech detection

Abstract

Access this article

Similar content being viewed by others

Fake news, disinformation and misinformation in social media: a review

Sentiment Analysis in Social Media Data for Depression Detection Using Artificial Intelligence: A Review

Deepfakes generation and detection: state-of-the-art, open challenges, countermeasures, and way forward

Data availability

Notes

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation