Irrelevant Racist Tweets Identification Using Data Mining Techniques

Kodali, Jyothirlatha; Kandikatla, Vyshnavi; Nagati, Princy; Nerendla, Veena; Sreedevi, M.

doi:10.1007/978-981-16-3728-5_15

Jyothirlatha Kodali⁶,
Vyshnavi Kandikatla⁶,
Princy Nagati⁶,
Veena Nerendla⁶ &
…
M. Sreedevi⁶

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 75))

1026 Accesses

Abstract

In recent times, Twitter is one of the major sources to access information. Its feature of the hashtag is something that grabs more attention from the users. One can write one’s mind and heart out on Twitter at any given minute. Due to which there is a rapid increase in the generation of irrelevant content on Twitter. Lately, a new hashtag called “#whitelivesmatter” was used as a counter for another hashtag “#blacklivesmatter”. A lot of anti-government protests and various other violent activities were conducted, recorded, and posted on Twitter with this hashtag. A lot of Kpop fans had taken over this hashtag and flooded Twitter with extremely irrelevant content. Due to which the main and important content of the protests was drowned in these irrelevant tweets, which made it extremely hard for the officials to find and reinforce the law and order. This paper aims at building a model that helps in finding the relevance of text content in the tweet and its hashtag #whitelivesmatter in specific. In this paper, supervised data analysis techniques like text classification are used to get the required output.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 299.00; Price excludes VAT (USA)

Softcover Book: USD 379.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Rumour Detection in the Political Domain from Twitter Using Machine Learning Techniques

Hong Kong Protests: Using Natural Language Processing for Fake News Detection on Twitter

Mining Text Patterns over Fake and Real Tweets

References

Cooper, Jr., G.P., Yeager, V., Burkle, Jr., F.M., Subbarao, I.: Twitter as a potential disaster risk reduction tool. Part I: introduction, terminology, research and operational applications. PLoS Curr. 7 (2015)
Google Scholar
Halawi, B., Mourad, A., Otrok, H., Damiani, E.: Few are as good as many: an ontology-based tweet spam detection approach. IEEE Access 6, 63890–63904 (2018)
Article Google Scholar
Vosoughi, S.: Automatic detection and verification of rumors on Twitter. Mit.edu. Url: https://www.media.mit.edu/cogmac/publications/Soroush_Vosoughi_PHD_thesis.pdf
Kolos, S.: Hashtag as a Way of Archiving and Distributing Information on the Internet
Google Scholar
Davidov, D., Tsur, O., Rappoport, A.: Enhanced sentiment learning using Twitter hashtags and smileys. In: Coling 2010: Posters, Aug 2010, pp. 241–249
Google Scholar
Sedhai, S., Sun, A.: An analysis of 14 million tweets on hashtag-oriented spamming. J. Assoc. Inf. Sci. Technol. 68(7), 1638–1651 (2017)
Article Google Scholar
Pervin, N., Phan, T.Q., Datta, A., Takeda, H., Toriumi, F.: Hashtag popularity on Twitter: analyzing co-occurrence of multiple hashtags. In: International Conference on Social Computing and Social Media, Aug 2015, pp. 169–182. Springer, Cham (2015)
Google Scholar
Vijayakumar, T., Vinothkanna, M.R.: Capsule network on font style classification. J. Artif. Intell. 2(02), 64–76 (2020)
Google Scholar
Dann, S.: Twitter data acquisition and analysis: methodology and best practice. In: Maximizing Commerce and Marketing Strategies Through Micro-Blogging, pp. 280–296. IGI Global. Twitter, 2020, Terms of Service. Url: https://twitter.com/en/tos
Herzallah, W., Faris, H., Adwan, O.: Feature engineering for detecting spammers on Twitter: modelling and analysis. J. Inf. Sci. 44(2), 230–247 (2018)
Article Google Scholar
Inuwa-Dutse, I., Liptrott, M., Korkontzelos, I.: Detection of spam-posting accounts on Twitter. Neurocomputing 315, 496–511 (2018)
Article Google Scholar
Narasamma, V.L., Sreedevi, M.: A Comparative Approach for Classification and Combined Cluster Based Classification Method for Tweets Data Analysis. Url: https://link.springer.com/chapter/10.1007%2F978-981-32-9690-9_33
Kumar, S., Morstatter, F., Liu, H.: Twitter Data Analytics, pp. 1041–4347. Springer New York, New York, NY (2014)
Google Scholar
Sungheetha, A., Sharma, R.: Transcapsule model for sentiment classification. J. Artif. Intell. 2(03), 163–169 (2020)
Google Scholar
Twitter, 2020, Terms of Service. Url: https://twitter.com/en/tos
Wolny, W.: Knowledge gained from Twitter data. In: 2016 Federated Conference on Computer Science and Information Systems (FedCSIS), Gdansk, pp. 1133–1136 (2016). https://doi.org/10.15439/2016F149
Uysal, A.K., Gunal, S.: The impact of preprocessing on text classification. Inf. Process. Manage. 50(1), 104–112 (2014)
Article Google Scholar
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008). https://doi.org/10.1017/CBO9780511809071

Download references

Author information

Authors and Affiliations

Koneru Lakshmaiah Education Foundation, Green Fields, Vaddeswaram, Guntur, Andhra Pradesh, India
Jyothirlatha Kodali, Vyshnavi Kandikatla, Princy Nagati, Veena Nerendla & M. Sreedevi

Authors

Jyothirlatha Kodali
View author publications
You can also search for this author in PubMed Google Scholar
Vyshnavi Kandikatla
View author publications
You can also search for this author in PubMed Google Scholar
Princy Nagati
View author publications
You can also search for this author in PubMed Google Scholar
Veena Nerendla
View author publications
You can also search for this author in PubMed Google Scholar
M. Sreedevi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. Sreedevi .

Editor information

Editors and Affiliations

Department of Information Technology, RVS Technical Campus, Coimbatore, Tamil Nadu, India
S. Smys
Department of Telecommunication Engineering, Czech Technical University in Prague, Praha, Czech Republic
Robert Bestak
Gerald Schwartz School of Business, St. Francis Xavier University, Antigonish, NS, Canada
Ram Palanisamy
Faculty of Informatics and Information Technology, Slovak University Technology, Bratislava, Slovakia
Ivan Kotuliak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kodali, J., Kandikatla, V., Nagati, P., Nerendla, V., Sreedevi, M. (2022). Irrelevant Racist Tweets Identification Using Data Mining Techniques. In: Smys, S., Bestak, R., Palanisamy, R., Kotuliak, I. (eds) Computer Networks and Inventive Communication Technologies . Lecture Notes on Data Engineering and Communications Technologies, vol 75. Springer, Singapore. https://doi.org/10.1007/978-981-16-3728-5_15

Download citation

DOI: https://doi.org/10.1007/978-981-16-3728-5_15
Published: 14 September 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3727-8
Online ISBN: 978-981-16-3728-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Irrelevant Racist Tweets Identification Using Data Mining Techniques

Abstract

Access this chapter

Similar content being viewed by others

Rumour Detection in the Political Domain from Twitter Using Machine Learning Techniques

Hong Kong Protests: Using Natural Language Processing for Fake News Detection on Twitter

Mining Text Patterns over Fake and Real Tweets

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Irrelevant Racist Tweets Identification Using Data Mining Techniques

Abstract

Access this chapter

Similar content being viewed by others

Rumour Detection in the Political Domain from Twitter Using Machine Learning Techniques

Hong Kong Protests: Using Natural Language Processing for Fake News Detection on Twitter

Mining Text Patterns over Fake and Real Tweets

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation