Skip to main content

Automated Tool for Toxic Comments Identification on Live Streaming YouTube

  • Conference paper
  • First Online:
Machine Intelligence for Research and Innovations (MAiTRI 2023)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 832))

  • 45 Accesses

Abstract

The necessity for content moderation on social media websites is increasing every day. The reason behind this is the anonymity of an individual which is provided by the internet and that they can exercise on any streaming website like YouTube. This project aids the creators/moderators in maintaining and stabilizing the toxicity of comments on their channel/page. A moderator has the authority to delete/hide any inappropriate comment posted by any user. During the pandemic of Covid-19, a significant, large user base immigrated to social media websites particularly streaming websites like YouTube. This surge in users resulted in an increased need for moderators. Any user can post any hateful/toxic or obscene comment that moderators can miss due to the huge volume of comments. Using NLP (Natural Language Processing), a model can be implemented directly onto any live-streaming chat session which can identify any toxic/obscene or threat comment and can flag them under the respective category. This model is real time and formulated using NLP techniques that are used in order to achieve the task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Tang T, Tang X, Yuan T (2020) Fine-tuning BERT for Multi-label sentiment analysis in unbalanced code-switching text. IEEE Access 8(2020):193248–193256. https://doi.org/10.1109/ACCESS.2020.3030468

    Article  Google Scholar 

  2. Chakravarthi BR (2020) HopeEDI: a multilingual hope speech detection dataset for equality, diversity, and inclusion. In Proceedings of the third workshop on computational modeling of people’s opinions, personality, and emotion’s in social media, Association for Computational Linguistics, Barcelona, Spain (Online), 41–53. Retrieved February 3, 2023. https://aclanthology.org/2020.peoples-1.5

  3. Rupapara V, Rustam F, Shahzad HF, Mehmood A, Ashraf I, Choi GS (2021) Impact of SMOTE on imbalanced text features for toxic comments classification using RVVC model. IEEE Access 9(2021):78621–78634. https://doi.org/10.1109/ACCESS.2021.3083638

    Article  Google Scholar 

  4. Asif M, Ishtiaq A, Ahmad H, Aljuaid H, Shah J (2020) Sentiment analysis of extremism in social media from textual information. Telemat Inform 48:101345. https://doi.org/10.1016/j.tele.2020.101345

  5. Kanfoud MR, Bouramoul A (2022) SentiCode: a new paradigm for one-time training and global prediction in multilingual sentiment analysis. J Intell Inf Syst 59(2):501–522. https://doi.org/10.1007/s10844-022-00714-8

  6. Salminen J, Hopf M, Chowdhury SA, Jung S, Almerekhi H, Jansen BJ (2020) Developing an online hate classifier for multiple social media platforms. Hum Centric Comput Inf Sci 10, 1:1. https://doi.org/10.1186/s13673-019-0205-6

  7. Kobs K, Zehe A, Bernstetter A, Chibane J, Pfister J, Tritscher J, Hotho A (2020) Emote-controlled: obtaining implicit viewer feedback through emote-based sentiment analysis on comments of popular twitch.tv channels. ACM Trans Soc Comput 3, 2:7:1–7:34. https://doi.org/10.1145/3365523

  8. Nandakumar R, Pallavi MS, Harithas PP, Hegde V (2022) Sentimental analysis on student feedback using NLP & POS tagging. In 2022 International conference on edge computing and applications (ICECAA), 309–313. https://doi.org/10.1109/ICECAA55415.2022.9936569

  9. Xu G, Meng Y, Qiu X, Yu Z, Wu X (2019) Sentiment analysis of comment texts based on BiLSTM. IEEE Access 7:51522–51532.https://doi.org/10.1109/ACCESS.2019.2909919

  10. Alhujaili RF, Yafooz WMS (2022) Sentiment analysis for youtube educational videos using machine and deep learning approaches. In 2022 IEEE 2nd international conference on electronic technology, communication and information (ICETCI), 238–244. https://doi.org/10.1109/ICETCI55101.2022.9832284

  11. Hasan MR, Maliha M, Arifuzzaman M (2019) Sentiment analysis with NLP on twitter data. In 2019 International conference on computer, communication, chemical, materials and electronic engineering (IC4ME2), 1–4. https://doi.org/10.1109/IC4ME247184.2019.9036670

  12. Javed Mehedi Shamrat FM, Chakraborty S, Imran MM, Naeem Muna J, Billah M, Das P, Rahman O (2021) Sentiment analysis on twitter tweets about COVID-19 vaccines usi ng NLP and supervised KNN classification algorithm. Indones J Electr Eng Comput Sci 23, 1:463. https://doi.org/10.11591/ijeecs.v23.i1.pp463-470

  13. Basiri ME, Nemati S, Abdar M, Cambria E, Rajendra Acharya U (2021) ABCDM: an attention-based bidirectional CNN-RNN deep model for sentiment analysis. Future Gener Comput Syst 115:279–294. https://doi.org/10.1016/j.future.2020.08.005

  14. Pavel MI, Razzak R, Sengupta K, Niloy DK, Muqith MB, Tan SY (2021) Toxic comment classification implementing CNN combining word embedding technique. In Inventive computation and information technologies (Lecture Notes in Networks and Systems). Springer, Singapore, 897–909. https://doi.org/10.1007/978-981-33-4305-4_65

  15. Toxic Comment Classification Challenge. https://kaggle.com/competitions/jigsaw-toxic-comment-classification-challenge. Accessed 9 May 2023

  16. Mahara T, Josephine VLH, Srinivasan R, Prakash P, Algarni AD, Verma OP (2023) Deep vs. shallow: a comparative study of machine learning and deep learning approaches for fake health news detection. IEEE Access 11:79330–79340. https://doi.org/10.1109/ACCESS.2023.3298441

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mamta Arora .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tarafder, T., Vashisth, H.K., Arora, M. (2024). Automated Tool for Toxic Comments Identification on Live Streaming YouTube. In: Verma, O.P., Wang, L., Kumar, R., Yadav, A. (eds) Machine Intelligence for Research and Innovations. MAiTRI 2023. Lecture Notes in Networks and Systems, vol 832. Springer, Singapore. https://doi.org/10.1007/978-981-99-8129-8_5

Download citation

Publish with us

Policies and ethics