Automated Tool for Toxic Comments Identification on Live Streaming YouTube

Tarafder, Tuhin; Vashisth, Harsh Kumar; Arora, Mamta

doi:10.1007/978-981-99-8129-8_5

Tuhin Tarafder¹³,
Harsh Kumar Vashisth¹³ &
Mamta Arora¹³

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 832))

Included in the following conference series:

International Conference on MAchine inTelligence for Research & Innovations

45 Accesses

Abstract

The necessity for content moderation on social media websites is increasing every day. The reason behind this is the anonymity of an individual which is provided by the internet and that they can exercise on any streaming website like YouTube. This project aids the creators/moderators in maintaining and stabilizing the toxicity of comments on their channel/page. A moderator has the authority to delete/hide any inappropriate comment posted by any user. During the pandemic of Covid-19, a significant, large user base immigrated to social media websites particularly streaming websites like YouTube. This surge in users resulted in an increased need for moderators. Any user can post any hateful/toxic or obscene comment that moderators can miss due to the huge volume of comments. Using NLP (Natural Language Processing), a model can be implemented directly onto any live-streaming chat session which can identify any toxic/obscene or threat comment and can flag them under the respective category. This model is real time and formulated using NLP techniques that are used in order to achieve the task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Tang T, Tang X, Yuan T (2020) Fine-tuning BERT for Multi-label sentiment analysis in unbalanced code-switching text. IEEE Access 8(2020):193248–193256. https://doi.org/10.1109/ACCESS.2020.3030468
Article Google Scholar
Chakravarthi BR (2020) HopeEDI: a multilingual hope speech detection dataset for equality, diversity, and inclusion. In Proceedings of the third workshop on computational modeling of people’s opinions, personality, and emotion’s in social media, Association for Computational Linguistics, Barcelona, Spain (Online), 41–53. Retrieved February 3, 2023. https://aclanthology.org/2020.peoples-1.5
Rupapara V, Rustam F, Shahzad HF, Mehmood A, Ashraf I, Choi GS (2021) Impact of SMOTE on imbalanced text features for toxic comments classification using RVVC model. IEEE Access 9(2021):78621–78634. https://doi.org/10.1109/ACCESS.2021.3083638
Article Google Scholar
Asif M, Ishtiaq A, Ahmad H, Aljuaid H, Shah J (2020) Sentiment analysis of extremism in social media from textual information. Telemat Inform 48:101345. https://doi.org/10.1016/j.tele.2020.101345
Kanfoud MR, Bouramoul A (2022) SentiCode: a new paradigm for one-time training and global prediction in multilingual sentiment analysis. J Intell Inf Syst 59(2):501–522. https://doi.org/10.1007/s10844-022-00714-8
Salminen J, Hopf M, Chowdhury SA, Jung S, Almerekhi H, Jansen BJ (2020) Developing an online hate classifier for multiple social media platforms. Hum Centric Comput Inf Sci 10, 1:1. https://doi.org/10.1186/s13673-019-0205-6
Kobs K, Zehe A, Bernstetter A, Chibane J, Pfister J, Tritscher J, Hotho A (2020) Emote-controlled: obtaining implicit viewer feedback through emote-based sentiment analysis on comments of popular twitch.tv channels. ACM Trans Soc Comput 3, 2:7:1–7:34. https://doi.org/10.1145/3365523
Nandakumar R, Pallavi MS, Harithas PP, Hegde V (2022) Sentimental analysis on student feedback using NLP & POS tagging. In 2022 International conference on edge computing and applications (ICECAA), 309–313. https://doi.org/10.1109/ICECAA55415.2022.9936569
Xu G, Meng Y, Qiu X, Yu Z, Wu X (2019) Sentiment analysis of comment texts based on BiLSTM. IEEE Access 7:51522–51532.https://doi.org/10.1109/ACCESS.2019.2909919
Alhujaili RF, Yafooz WMS (2022) Sentiment analysis for youtube educational videos using machine and deep learning approaches. In 2022 IEEE 2nd international conference on electronic technology, communication and information (ICETCI), 238–244. https://doi.org/10.1109/ICETCI55101.2022.9832284
Hasan MR, Maliha M, Arifuzzaman M (2019) Sentiment analysis with NLP on twitter data. In 2019 International conference on computer, communication, chemical, materials and electronic engineering (IC4ME2), 1–4. https://doi.org/10.1109/IC4ME247184.2019.9036670
Javed Mehedi Shamrat FM, Chakraborty S, Imran MM, Naeem Muna J, Billah M, Das P, Rahman O (2021) Sentiment analysis on twitter tweets about COVID-19 vaccines usi ng NLP and supervised KNN classification algorithm. Indones J Electr Eng Comput Sci 23, 1:463. https://doi.org/10.11591/ijeecs.v23.i1.pp463-470
Basiri ME, Nemati S, Abdar M, Cambria E, Rajendra Acharya U (2021) ABCDM: an attention-based bidirectional CNN-RNN deep model for sentiment analysis. Future Gener Comput Syst 115:279–294. https://doi.org/10.1016/j.future.2020.08.005
Pavel MI, Razzak R, Sengupta K, Niloy DK, Muqith MB, Tan SY (2021) Toxic comment classification implementing CNN combining word embedding technique. In Inventive computation and information technologies (Lecture Notes in Networks and Systems). Springer, Singapore, 897–909. https://doi.org/10.1007/978-981-33-4305-4_65
Toxic Comment Classification Challenge. https://kaggle.com/competitions/jigsaw-toxic-comment-classification-challenge. Accessed 9 May 2023
Mahara T, Josephine VLH, Srinivasan R, Prakash P, Algarni AD, Verma OP (2023) Deep vs. shallow: a comparative study of machine learning and deep learning approaches for fake health news detection. IEEE Access 11:79330–79340. https://doi.org/10.1109/ACCESS.2023.3298441
Article Google Scholar

Download references

Author information

Authors and Affiliations

Manav Rachna University, Faridabad, Haryana, India
Tuhin Tarafder, Harsh Kumar Vashisth & Mamta Arora

Authors

Tuhin Tarafder
View author publications
You can also search for this author in PubMed Google Scholar
Harsh Kumar Vashisth
View author publications
You can also search for this author in PubMed Google Scholar
Mamta Arora
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mamta Arora .

Editor information

Editors and Affiliations

Dept of Instrumentation and Control Engg, Dr. B. R. Ambedkar National Institute of, Jalandhar, Punjab, India
Om Prakash Verma
School of Electrical and Electronics Engineering, Nanyang Technological University, Singapore, Singapore
Lipo Wang
Department of Electrical Engineering, Malaviya National Institute of Technolog, Jaipur, Rajasthan, India
Rajesh Kumar
Department of Mathematics, DR BR Ambedkar NIT Jalandhar, Jalandhar, Punjab, India
Anupam Yadav

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tarafder, T., Vashisth, H.K., Arora, M. (2024). Automated Tool for Toxic Comments Identification on Live Streaming YouTube. In: Verma, O.P., Wang, L., Kumar, R., Yadav, A. (eds) Machine Intelligence for Research and Innovations. MAiTRI 2023. Lecture Notes in Networks and Systems, vol 832. Springer, Singapore. https://doi.org/10.1007/978-981-99-8129-8_5

Download citation

DOI: https://doi.org/10.1007/978-981-99-8129-8_5
Published: 03 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8128-1
Online ISBN: 978-981-99-8129-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics