Abstract
Artificial intelligence has reached a stage where machines possess the capability to engage in tasks that traditionally demanded human intelligence. The integral component of this advancement lies in “machine learning,” where algorithms are trained to generate predictions or make decisions by analyzing data. In hate speech identification using machine learning, a number of methods are used to automatically find text that uses vocabulary that is considered to be derogatory, discriminatory, or motivated by hatred. Supervised learning techniques like neural networks, decision trees, and SVMs need a labelled dataset comprising samples of hate speech and non-hate speech. Unsupervised techniques, such k-means clustering, use word frequency and other variables to group comparable text data. CNNs and RNNs are examples of deep learning systems that learn complex word associations and spot trends indicating hate speech. Text data is subjected to the utilization of N-grams, word embeddings, and sentiment analysis in order to derive distinctive attributes. Accurate detection is improved by ensemble methods that combine predictions from various models. Identifying hate speech, avoiding bias in data and algorithms, and the necessity for large and diverse datasets are among the difficulties. Ultimately, machine learning-based hate speech identification is an essential tool for preventing hate speech online and fostering an inclusive and secure online environment for all users. So we did research on detecting hate speech using various algorithms. The TF-IDF representation prioritises textual terms, whereas the ensemble method uses classifier diversity to capture distinctive patterns. Results from experiments show the strategy's effectiveness, with a 90% average accuracy rate for detecting hate speech. By successfully utilizing AI's capacity to fight hate speech, this research helps the development of a diverse and secure online environment. The suggested approach works well for automatically identifying hate speech, making the internet a safer and more welcoming place for all users.
References
Davidson T, Warmsley D, Macy M, Weber I (2018) Automated hate speech detection and the problem of offensive language. In: Proceedings of the international AAAI conference on web social media, vol 11, no. 1, pp 512–515
Watanabe H, Bouazizi M, Ohtsuki T (2018) Hate speech ontwitter: a pragmatic approach to collect hateful and offensive expressions and perform hate speech detection. IEEE Access 6:13825–13835
Roy PK, Tripathy AK, Das TK, Gao X-Z (2020) A framework for hate speech detection using deep convolutional neural network. IEEE Access 8:204951–204962
Oriola O, Kotzé E (2020) Evaluating machine learning techniques for detecting offensive and hate speech in South African Tweets. IEEE Access 8:21496–21509
Mollas I, Chrysopoulou Z, Karlos S, Tsoumakas G (2020) ETHOS: An online hate speech detection dataset. arXiv:2006.08328
Zhou Y, Yang Y, Liu H, Liu X, Savage N (2020) Deep learning-based fusion approach for hate speech detection. IEEE Access 8:128923–128929
Baydogan C, Alatas B (2021) Metaheuristic ant lion and moth flame optimization-based novel approach for automatic detection hate speech in online social networks. IEEE Access 9:110047–110062
Ali MZ, Ehsan-Ul-Haq, Rauf S, Javed K, Hussain S (2021) Improving hate speech detection of Urdu tweets using sentiment analysis. IEEE Access 9:84296–84305
Plaza-Del-Arco FM, Molina-González MD, UreñaLópez LA, Martín-Valdivia MT (2021) A multi-task learning approach to hate speech detection leveraging sentiment analysis. IEEE Access 9:112478–112489
Alatawi HS, Alhothali AM, Moria KM (2021) Detecting white supremacist hate speech using domain-specific word embedding with deep learning and BERT. IEEE Access 9:106363–106374
Rodriguez A, Chen Y-L, Argueta C (2022) FADOHS: framework for detection and integration of unstructured data of hate speech on Facebook using sentiment and emotion analysis. IEEE Access 10:22400–22419
Ilie V-I, Truică C-O, Apostol E-S, Paschke A (2021) Context-aware misinformation detection: a benchmark of deep learning architectures using word embeddings. IEEE Access 9:162122–162146
Lee E, Rustam F, Washington PB, Barakaz FE, Aljedaani W, Ashraf I (2022) Racism detection by analyzing differential opinions through sentiment analysis of tweets using stacked ensemble GCR-NN model. IEEE Access 10:9717–9728
Briliani A, Irawan B, Setianingsih C (2019) Hate speech detection in indonesian language on instagram comment section using K-nearest neighbor classification method. In: Proceedings of the 2019 IEEE international conference on internet of things intelligence system. IoTaIS, pp 98–104
Fauzi MA, Yuniarti A (2018) Ensemble method for Indonesian twitter hate speech detection. IJEECS 294–299
Badjatiya P, Gupta S, Gupta M, Varma V (2017) Deep learning for hate speech detection in tweets. In: Proceedings of the 26th international conference on World Wide Web companion (WWW Companion), pp 759–760
Di Capua M, Di Nardo E, Petrosino A (2016) Unsupervised cyber bullying detection in social networks. In: Proceedings of the 23rd international conference on pattern recognition, pp 432–437
Sathishkumar R, Kalaiarasan K, Prabhakaran A, Aravind M (2019) Detection of lung cancer using SVM classifier and KNN algorithm. In: 2019 IEEE international conference on system, computation, automation and networking (ICSCAN). IEEE, pp 1–7
Baydogan C (2022) Deep-Cov19-hate: a textual-based novel approach for automatic detection of hate speech in online social networks throughout COVID-19 with shallow and deep learning models. Tehnički Vjesnik 29(1):149–156
Jaki S, De Smedt T (2019) Right-wing German hate speech on Twitter: analysis and automatic detection. arXiv 1910-07518
Abro S, Shaikh S, Hussain Z, Ali Z, Khan S, Mujtaba G (2020) Automatic hate speech detection using machine learning: a comparative study. Int J Adv Comput Sci Appl 11(8):1–8
Agrawal S, Awekar A (2018) Deep learning for detecting cyberbullying across multiple social media platforms. In: Proceedings of the European Conference on Information Retrieval. Springer, pp 141–153
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sathishkumar, R., Govindarajan, M., Deepankumar, R. (2024). Hate Speech Detection in Social Media Using Ensemble Method in Classifiers. In: Marriwala, N.K., Dhingra, S., Jain, S., Kumar, D. (eds) Mobile Radio Communications and 5G Networks. MRCN 2023. Lecture Notes in Networks and Systems, vol 915. Springer, Singapore. https://doi.org/10.1007/978-981-97-0700-3_16
Download citation
DOI: https://doi.org/10.1007/978-981-97-0700-3_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0699-0
Online ISBN: 978-981-97-0700-3
eBook Packages: EngineeringEngineering (R0)