Skip to main content

Hate Speech Detection in Social Media Using Ensemble Method in Classifiers

  • Conference paper
  • First Online:
Mobile Radio Communications and 5G Networks (MRCN 2023)

Abstract

Artificial intelligence has reached a stage where machines possess the capability to engage in tasks that traditionally demanded human intelligence. The integral component of this advancement lies in “machine learning,” where algorithms are trained to generate predictions or make decisions by analyzing data. In hate speech identification using machine learning, a number of methods are used to automatically find text that uses vocabulary that is considered to be derogatory, discriminatory, or motivated by hatred. Supervised learning techniques like neural networks, decision trees, and SVMs need a labelled dataset comprising samples of hate speech and non-hate speech. Unsupervised techniques, such k-means clustering, use word frequency and other variables to group comparable text data. CNNs and RNNs are examples of deep learning systems that learn complex word associations and spot trends indicating hate speech. Text data is subjected to the utilization of N-grams, word embeddings, and sentiment analysis in order to derive distinctive attributes. Accurate detection is improved by ensemble methods that combine predictions from various models. Identifying hate speech, avoiding bias in data and algorithms, and the necessity for large and diverse datasets are among the difficulties. Ultimately, machine learning-based hate speech identification is an essential tool for preventing hate speech online and fostering an inclusive and secure online environment for all users. So we did research on detecting hate speech using various algorithms. The TF-IDF representation prioritises textual terms, whereas the ensemble method uses classifier diversity to capture distinctive patterns. Results from experiments show the strategy's effectiveness, with a 90% average accuracy rate for detecting hate speech. By successfully utilizing AI's capacity to fight hate speech, this research helps the development of a diverse and secure online environment. The suggested approach works well for automatically identifying hate speech, making the internet a safer and more welcoming place for all users.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  1. Davidson T, Warmsley D, Macy M, Weber I (2018) Automated hate speech detection and the problem of offensive language. In: Proceedings of the international AAAI conference on web social media, vol 11, no. 1, pp 512–515

    Google Scholar 

  2. Watanabe H, Bouazizi M, Ohtsuki T (2018) Hate speech ontwitter: a pragmatic approach to collect hateful and offensive expressions and perform hate speech detection. IEEE Access 6:13825–13835

    Google Scholar 

  3. Roy PK, Tripathy AK, Das TK, Gao X-Z (2020) A framework for hate speech detection using deep convolutional neural network. IEEE Access 8:204951–204962

    Google Scholar 

  4. Oriola O, Kotzé E (2020) Evaluating machine learning techniques for detecting offensive and hate speech in South African Tweets. IEEE Access 8:21496–21509

    Google Scholar 

  5. Mollas I, Chrysopoulou Z, Karlos S, Tsoumakas G (2020) ETHOS: An online hate speech detection dataset. arXiv:2006.08328

  6. Zhou Y, Yang Y, Liu H, Liu X, Savage N (2020) Deep learning-based fusion approach for hate speech detection. IEEE Access 8:128923–128929

    Article  Google Scholar 

  7. Baydogan C, Alatas B (2021) Metaheuristic ant lion and moth flame optimization-based novel approach for automatic detection hate speech in online social networks. IEEE Access 9:110047–110062

    Google Scholar 

  8. Ali MZ, Ehsan-Ul-Haq, Rauf S, Javed K, Hussain S (2021) Improving hate speech detection of Urdu tweets using sentiment analysis. IEEE Access 9:84296–84305

    Google Scholar 

  9. Plaza-Del-Arco FM, Molina-González MD, UreñaLópez LA, Martín-Valdivia MT (2021) A multi-task learning approach to hate speech detection leveraging sentiment analysis. IEEE Access 9:112478–112489

    Google Scholar 

  10. Alatawi HS, Alhothali AM, Moria KM (2021) Detecting white supremacist hate speech using domain-specific word embedding with deep learning and BERT. IEEE Access 9:106363–106374

    Google Scholar 

  11. Rodriguez A, Chen Y-L, Argueta C (2022) FADOHS: framework for detection and integration of unstructured data of hate speech on Facebook using sentiment and emotion analysis. IEEE Access 10:22400–22419

    Google Scholar 

  12. Ilie V-I, Truică C-O, Apostol E-S, Paschke A (2021) Context-aware misinformation detection: a benchmark of deep learning architectures using word embeddings. IEEE Access 9:162122–162146

    Google Scholar 

  13. Lee E, Rustam F, Washington PB, Barakaz FE, Aljedaani W, Ashraf I (2022) Racism detection by analyzing differential opinions through sentiment analysis of tweets using stacked ensemble GCR-NN model. IEEE Access 10:9717–9728

    Google Scholar 

  14. Briliani A, Irawan B, Setianingsih C (2019) Hate speech detection in indonesian language on instagram comment section using K-nearest neighbor classification method. In: Proceedings of the 2019 IEEE international conference on internet of things intelligence system. IoTaIS, pp 98–104

    Google Scholar 

  15. Fauzi MA, Yuniarti A (2018) Ensemble method for Indonesian twitter hate speech detection. IJEECS 294–299

    Google Scholar 

  16. Badjatiya P, Gupta S, Gupta M, Varma V (2017) Deep learning for hate speech detection in tweets. In: Proceedings of the 26th international conference on World Wide Web companion (WWW Companion), pp 759–760

    Google Scholar 

  17. Di Capua M, Di Nardo E, Petrosino A (2016) Unsupervised cyber bullying detection in social networks. In: Proceedings of the 23rd international conference on pattern recognition, pp 432–437

    Google Scholar 

  18. Sathishkumar R, Kalaiarasan K, Prabhakaran A, Aravind M (2019) Detection of lung cancer using SVM classifier and KNN algorithm. In: 2019 IEEE international conference on system, computation, automation and networking (ICSCAN). IEEE, pp 1–7

    Google Scholar 

  19. Baydogan C (2022) Deep-Cov19-hate: a textual-based novel approach for automatic detection of hate speech in online social networks throughout COVID-19 with shallow and deep learning models. Tehnički Vjesnik 29(1):149–156

    Google Scholar 

  20. Jaki S, De Smedt T (2019) Right-wing German hate speech on Twitter: analysis and automatic detection. arXiv 1910-07518

    Google Scholar 

  21. Abro S, Shaikh S, Hussain Z, Ali Z, Khan S, Mujtaba G (2020) Automatic hate speech detection using machine learning: a comparative study. Int J Adv Comput Sci Appl 11(8):1–8

    Google Scholar 

  22. Agrawal S, Awekar A (2018) Deep learning for detecting cyberbullying across multiple social media platforms. In: Proceedings of the European Conference on Information Retrieval. Springer, pp 141–153

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Sathishkumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sathishkumar, R., Govindarajan, M., Deepankumar, R. (2024). Hate Speech Detection in Social Media Using Ensemble Method in Classifiers. In: Marriwala, N.K., Dhingra, S., Jain, S., Kumar, D. (eds) Mobile Radio Communications and 5G Networks. MRCN 2023. Lecture Notes in Networks and Systems, vol 915. Springer, Singapore. https://doi.org/10.1007/978-981-97-0700-3_16

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0700-3_16

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0699-0

  • Online ISBN: 978-981-97-0700-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics