Multilabel Toxic Comment Classification Using Supervised Machine Learning Algorithms

Shah, Darshin Kalpesh; Sanghvi, Meet Ashok; Mehta, Raj Paresh; Shah, Prasham Sanjay; Singh, Artika

doi:10.1007/978-981-15-7106-0_3

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 141)

Abstract

Online life has become an integral part of the daily routine of a large number of people around the globe. Online commenting spaces generate a wealth of expressive content in the public domain, which contributes to a healthy environment for discussion. However, these spaces also carry the dangers of cyberbullying, personal attacks, and abusive language. This motivates industry researchers to build automated processes to curb the problem. The aim of this paper is to perform multi-label text categorization, where each comment can belong to multiple toxic labels at the same time. We tested two models: RNN and LSTM. Their performance is significantly better than that of the baseline models, Logistic Regression and ExtraTrees.
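To make the multi-label setting concrete, the following minimal sketch shows one common way to represent it: each comment gets a binary indicator vector with one slot per label, and each label is decided independently from a per-label score. The label names, the 0.5 threshold, and the helper functions are hypothetical illustrations, not taken from the paper, which does not list its labels or decision rule in the abstract.

```python
from typing import List

# Hypothetical label set, for illustration only; the abstract does not
# name the actual toxic labels used in the paper.
LABELS: List[str] = ["toxic", "insult", "threat"]


def encode_labels(tags: List[str], labels: List[str] = LABELS) -> List[int]:
    """Turn a comment's tags into a binary indicator vector (one slot per label)."""
    present = set(tags)
    return [1 if label in present else 0 for label in labels]


def decode_scores(
    scores: List[float],
    threshold: float = 0.5,  # assumed cut-off, not specified by the source
    labels: List[str] = LABELS,
) -> List[str]:
    """Decide each label independently: keep labels whose score meets the threshold."""
    return [label for label, score in zip(labels, scores) if score >= threshold]


# A comment tagged with two labels at once.
print(encode_labels(["toxic", "insult"]))  # [1, 1, 0]

# Per-label scores, as a model with one output per label might produce.
print(decode_scores([0.9, 0.2, 0.7]))  # ['toxic', 'threat']
```

Unlike single-label (multiclass) classification, the scores are not forced to sum to one, so a comment may receive several labels or none.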

Author information

Authors and Affiliations

Mukesh Patel School of Technology Management & Engineering, NMIMS, Mumbai, India
Darshin Kalpesh Shah, Meet Ashok Sanghvi, Raj Paresh Mehta, Prasham Sanjay Shah & Artika Singh

Corresponding author

Correspondence to Darshin Kalpesh Shah.

Editor information

Editors and Affiliations

Global Knowledge Research Foundation, Gujarat, India
Amit Joshi
University of Osaka, Suita, Japan
Mahdi Khosravy
School of Engineering and Computer Science, Oakland University, Rochester Hills, MI, USA
Neeraj Gupta

About this paper

Cite this paper

Shah, D.K., Sanghvi, M.A., Mehta, R.P., Shah, P.S., Singh, A. (2021). Multilabel Toxic Comment Classification Using Supervised Machine Learning Algorithms. In: Joshi, A., Khosravy, M., Gupta, N. (eds) Machine Learning for Predictive Analysis. Lecture Notes in Networks and Systems, vol 141. Springer, Singapore. https://doi.org/10.1007/978-981-15-7106-0_3

DOI: https://doi.org/10.1007/978-981-15-7106-0_3
Published: 23 October 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7105-3
Online ISBN: 978-981-15-7106-0
eBook Packages: Intelligent Technologies and Robotics (R0)
