Skip to main content
Log in

Automatic hate speech detection using aspect based feature extraction and Bi-LSTM model

  • Original Article
  • Published:
International Journal of System Assurance Engineering and Management Aims and scope Submit manuscript

Abstract

The various social media platforms are used for easy access of information from the distinctive field that might constitute an offensive discussion. Therefore, existing studies are examined to reduce offensive harassment cases online. The spread of hate speech is growing with the ubiquity and anonymity through the means of social media for many years. Thus, the increase in demand showed an automated model for the detection of hate speech. The existing models utilized deep learning models failed to analyze the syntax and grammar or even modify original data’s meaning due to its complex patterns. Therefore, the present research work utilizes an Activation Function known as Soft-plus in Bidirectional Long Short Term Memory (Bi-LSTM) models for hate speech detection which helps the network to learn complex patterns in the data. The proposed Soft-plus Bi-LSTM learned the c complex patterns present in the network data and the activation function made an end decision that should be fired out into the next neuron. The classification results showed that the proposed Soft-plus Bi-LSTM classified the reviews as abusive or non-abusive speech. The results obtained better precision values of 60.09% when compared to the existing models Auto-Encoder and Multi-task learning model showed a precision of 53.9% and 55.7% respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data Availability

The datasets generated during and/or analyzed during the current study are available in the [OLID, SOLID] repository,

[https://sites.google.com/site/offensevalsharedtask/olid.

https://sites.google.com/site/offensevalsharedtask/solid].

References

  • Akhter MP, Jiangbin Z, Naqvi IR, Abdelmajeed M, Sadiq MT (2020) Automatic Detection of Offensive Language for Urdu and Roman Urdu. IEEE Access 8:91213–91226. DOI: https://doi.org/10.1109/ACCESS.2020.2994950

    Article  Google Scholar 

  • Beddiar DR, Jahan MS, Oussalah M (2021) Data expansion using back translation and paraphrasing for hate speech detection. Online Social Networks and Media 24:100153. https://doi.org/10.1016/j.osnem.2021.100153

    Article  Google Scholar 

  • El-Alami FZ, El Alaoui SO, Nahnahi NE (2021) A multilingual offensive language detection method based on transfer learning from transformer fine-tuning model.Journal of King Saud University-Computer and Information Sciences

  • Ghanghor N, Krishnamurthy P, Thavareesan S, Priyadharshini R, Chakravarthi BR (2021) IIITK@ DravidianLangTech-EACL2021: Offensive Language Identification and Meme Classification in Tamil, Malayalam and Kannada. In Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, Association for Computational Linguistics, Kyiv, 222–229

  • Hande A, Priyadharshini R, Chakravarthi BR (2020) KanCMD: Kannada CodeMixed Dataset for Sentiment Analysis and Offensive Language Detection. In Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media, Barcelona, Spain (Online), Association for Computational Linguistics [publisher], 54–63

  • Isaksen V, Gambäck B (2020) Using Transfer-based Language Models to Detect Hateful and Offensive Language Online. In [Online] Proceedings of the Fourth Workshop on Online Abuse and Harms, Association for Computational Linguistics, 16–27. DOI: https://doi.org/10.18653/v1/2020.alw-1.3

  • Kapil P, Ekbal A (2020) A deep neural network based multi-task learning approach to hate speech detection. Knowl Based Syst 210:106458. https://doi.org/10.1016/j.knosys.2020.106458

    Article  Google Scholar 

  • Liu J, Yang Y, Fan X, Ren G, Yang L, Ning Q (2022) Offensive-Language Detection on Multi-Semantic Fusion Based on Data Augmentation. Applied System Innovation, 5(1), p.9

  • Mandl T, Modha S, Kumar MA, Chakravarthi BR (2020) Overview of the HASOC Track at FIRE 2020: Hate Speech and Offensive Language Identification in Tamil, Malayalam, Hindi, English and German. In Forum for Information Retrieval Evaluation, Virtual event hosted by IDRBT, 16th – 20th December, Hyderabad, 29–32. https://doi.org/10.1145/3441501.3441517

  • Mossie Z, Wang JH (2020) Vulnerable community identification using hate speech detection on social media. Inf Process Manag 57(3):102087. https://doi.org/10.1016/j.ipm.2019.102087

    Article  Google Scholar 

  • Pathak AR, Pandey M, Rautaray S (2021) Topic-level sentiment analysis of social media data using deep learning. Appl Soft Comput 108:107440. https://doi.org/10.1016/j.asoc.2021.107440

    Article  Google Scholar 

  • Ranasinghe T, Zampieri M (2021) An Evaluation of Multilingual Offensive Language Identification Methods for the Languages of India. Information 12(8):306. https://doi.org/10.3390/info12080306

    Article  Google Scholar 

  • Ranasinghe T, Zampieri M, Hettiarachchi H (2019) BRUMS at HASOC 2019: Deep Learning Models for Multilingual Hate Speech and Offensive Language Identification. In FIRE 2019 - Forum for Information Retrieval Evaluation (Working Notes), 12–15 December 2019, Kolkata, India, 199–207

  • Rawat C, Sarkar A, Singh S, Alvarado R, Rasberry L (2019) Automatic Detection of Online Abuse and Analysis of Problematic Users in Wikipedia. In 2019 Systems and Information Engineering Design Symposium (SIEDS), IEEE, Charlottesville, VA, USA, 1–6. DOI: https://doi.org/10.1109/SIEDS.2019.8735592

  • Sharma M, Kandasamy I, Kandasamy V (2021) Deep Learning for predicting neutralities in Offensive Language Identification Dataset. Expert Syst Appl 185:115458. https://doi.org/10.1016/j.eswa.2021.115458

    Article  Google Scholar 

  • Vashishtha S, Susan S (2019) Fuzzy rule based unsupervised sentiment analysis from social media posts. Expert Syst Appl 138:112834. https://doi.org/10.1016/j.eswa.2019.112834

    Article  Google Scholar 

  • Zhang K, Zhu Y, Zhang W, Zhang W, Zhu Y (2020) Transfer Correlation Between Textual Content to Images for Sentiment Analysis. IEEE Access 8:35276–35289. DOI: https://doi.org/10.1109/ACCESS.2020.2975036

    Article  Google Scholar 

Download references

Funding

This research received no external funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Srinivasulu Kothuru.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kothuru, S., Santhanavijayan, A. Automatic hate speech detection using aspect based feature extraction and Bi-LSTM model. Int J Syst Assur Eng Manag 13, 2934–2943 (2022). https://doi.org/10.1007/s13198-022-01763-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13198-022-01763-6

Keywords

Navigation