Hate Speech Detection in Social Media Using Ensemble Method in Classifiers

Sathishkumar, R.; Govindarajan, M.; Deepankumar, R.

doi:10.1007/978-981-97-0700-3_16

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 915))

Included in the following conference series:

International Conference on Mobile Radio Communications & 5G Networks

15 Accesses

Abstract

Artificial intelligence has reached a stage where machines possess the capability to engage in tasks that traditionally demanded human intelligence. The integral component of this advancement lies in “machine learning,” where algorithms are trained to generate predictions or make decisions by analyzing data. In hate speech identification using machine learning, a number of methods are used to automatically find text that uses vocabulary that is considered to be derogatory, discriminatory, or motivated by hatred. Supervised learning techniques like neural networks, decision trees, and SVMs need a labelled dataset comprising samples of hate speech and non-hate speech. Unsupervised techniques, such k-means clustering, use word frequency and other variables to group comparable text data. CNNs and RNNs are examples of deep learning systems that learn complex word associations and spot trends indicating hate speech. Text data is subjected to the utilization of N-grams, word embeddings, and sentiment analysis in order to derive distinctive attributes. Accurate detection is improved by ensemble methods that combine predictions from various models. Identifying hate speech, avoiding bias in data and algorithms, and the necessity for large and diverse datasets are among the difficulties. Ultimately, machine learning-based hate speech identification is an essential tool for preventing hate speech online and fostering an inclusive and secure online environment for all users. So we did research on detecting hate speech using various algorithms. The TF-IDF representation prioritises textual terms, whereas the ensemble method uses classifier diversity to capture distinctive patterns. Results from experiments show the strategy's effectiveness, with a 90% average accuracy rate for detecting hate speech. By successfully utilizing AI's capacity to fight hate speech, this research helps the development of a diverse and secure online environment. The suggested approach works well for automatically identifying hate speech, making the internet a safer and more welcoming place for all users.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

References

Davidson T, Warmsley D, Macy M, Weber I (2018) Automated hate speech detection and the problem of offensive language. In: Proceedings of the international AAAI conference on web social media, vol 11, no. 1, pp 512–515
Google Scholar
Watanabe H, Bouazizi M, Ohtsuki T (2018) Hate speech ontwitter: a pragmatic approach to collect hateful and offensive expressions and perform hate speech detection. IEEE Access 6:13825–13835
Google Scholar
Roy PK, Tripathy AK, Das TK, Gao X-Z (2020) A framework for hate speech detection using deep convolutional neural network. IEEE Access 8:204951–204962
Google Scholar
Oriola O, Kotzé E (2020) Evaluating machine learning techniques for detecting offensive and hate speech in South African Tweets. IEEE Access 8:21496–21509
Google Scholar
Mollas I, Chrysopoulou Z, Karlos S, Tsoumakas G (2020) ETHOS: An online hate speech detection dataset. arXiv:2006.08328
Zhou Y, Yang Y, Liu H, Liu X, Savage N (2020) Deep learning-based fusion approach for hate speech detection. IEEE Access 8:128923–128929
Article Google Scholar
Baydogan C, Alatas B (2021) Metaheuristic ant lion and moth flame optimization-based novel approach for automatic detection hate speech in online social networks. IEEE Access 9:110047–110062
Google Scholar
Ali MZ, Ehsan-Ul-Haq, Rauf S, Javed K, Hussain S (2021) Improving hate speech detection of Urdu tweets using sentiment analysis. IEEE Access 9:84296–84305
Google Scholar
Plaza-Del-Arco FM, Molina-González MD, UreñaLópez LA, Martín-Valdivia MT (2021) A multi-task learning approach to hate speech detection leveraging sentiment analysis. IEEE Access 9:112478–112489
Google Scholar
Alatawi HS, Alhothali AM, Moria KM (2021) Detecting white supremacist hate speech using domain-specific word embedding with deep learning and BERT. IEEE Access 9:106363–106374
Google Scholar
Rodriguez A, Chen Y-L, Argueta C (2022) FADOHS: framework for detection and integration of unstructured data of hate speech on Facebook using sentiment and emotion analysis. IEEE Access 10:22400–22419
Google Scholar
Ilie V-I, Truică C-O, Apostol E-S, Paschke A (2021) Context-aware misinformation detection: a benchmark of deep learning architectures using word embeddings. IEEE Access 9:162122–162146
Google Scholar
Lee E, Rustam F, Washington PB, Barakaz FE, Aljedaani W, Ashraf I (2022) Racism detection by analyzing differential opinions through sentiment analysis of tweets using stacked ensemble GCR-NN model. IEEE Access 10:9717–9728
Google Scholar
Briliani A, Irawan B, Setianingsih C (2019) Hate speech detection in indonesian language on instagram comment section using K-nearest neighbor classification method. In: Proceedings of the 2019 IEEE international conference on internet of things intelligence system. IoTaIS, pp 98–104
Google Scholar
Fauzi MA, Yuniarti A (2018) Ensemble method for Indonesian twitter hate speech detection. IJEECS 294–299
Google Scholar
Badjatiya P, Gupta S, Gupta M, Varma V (2017) Deep learning for hate speech detection in tweets. In: Proceedings of the 26th international conference on World Wide Web companion (WWW Companion), pp 759–760
Google Scholar
Di Capua M, Di Nardo E, Petrosino A (2016) Unsupervised cyber bullying detection in social networks. In: Proceedings of the 23rd international conference on pattern recognition, pp 432–437
Google Scholar
Sathishkumar R, Kalaiarasan K, Prabhakaran A, Aravind M (2019) Detection of lung cancer using SVM classifier and KNN algorithm. In: 2019 IEEE international conference on system, computation, automation and networking (ICSCAN). IEEE, pp 1–7
Google Scholar
Baydogan C (2022) Deep-Cov19-hate: a textual-based novel approach for automatic detection of hate speech in online social networks throughout COVID-19 with shallow and deep learning models. Tehnički Vjesnik 29(1):149–156
Google Scholar
Jaki S, De Smedt T (2019) Right-wing German hate speech on Twitter: analysis and automatic detection. arXiv 1910-07518
Google Scholar
Abro S, Shaikh S, Hussain Z, Ali Z, Khan S, Mujtaba G (2020) Automatic hate speech detection using machine learning: a comparative study. Int J Adv Comput Sci Appl 11(8):1–8
Google Scholar
Agrawal S, Awekar A (2018) Deep learning for detecting cyberbullying across multiple social media platforms. In: Proceedings of the European Conference on Information Retrieval. Springer, pp 141–153
Google Scholar

Download references

Author information

Authors and Affiliations

Manakula Vinayagar Institute of Technology, Puducherry, India
R. Sathishkumar & R. Deepankumar
Annamalai University, Chidambaram, Tamilnadu, India
M. Govindarajan

Authors

R. Sathishkumar
View author publications
You can also search for this author in PubMed Google Scholar
M. Govindarajan
View author publications
You can also search for this author in PubMed Google Scholar
R. Deepankumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Sathishkumar .

Editor information

Editors and Affiliations

University Institute of Engineering and Technology, Kurukshetra University, Kurukshetra, Haryana, India
Nikhil Kumar Marriwala
University Institute of Engineering and Technology, Kurukshetra University, Kurukshetra, India
Sunil Dhingra
Electronics and Communication Engineering, Jaypee University of Information Technology, Solan, Himachal Pradesh, India
Shruti Jain
Electrical and Computer System Engineering, RMIT University, Melbourne, VIC, Australia
Dinesh Kumar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sathishkumar, R., Govindarajan, M., Deepankumar, R. (2024). Hate Speech Detection in Social Media Using Ensemble Method in Classifiers. In: Marriwala, N.K., Dhingra, S., Jain, S., Kumar, D. (eds) Mobile Radio Communications and 5G Networks. MRCN 2023. Lecture Notes in Networks and Systems, vol 915. Springer, Singapore. https://doi.org/10.1007/978-981-97-0700-3_16

Download citation

DOI: https://doi.org/10.1007/978-981-97-0700-3_16
Published: 30 April 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0699-0
Online ISBN: 978-981-97-0700-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics