Information Credibility on Twitter Using Machine Learning Techniques

Ahmad, Faraz; Rizvi, Syed Afzal Murtaza

doi:10.1007/978-981-15-4451-4_29

Faraz Ahmad¹³ &
Syed Afzal Murtaza Rizvi

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1206))

Included in the following conference series:

International Conference on Futuristic Trends in Networks and Computing Technologies

670 Accesses
3 Citations
1 Altmetric

Abstract

In today’s world, people are highly inclined towards social networking sites like Twitter, which provides a platform where users can have easy access to the high impact occasions/events rising worldwide. Users can share views and retweet the contents posted by other users with respect to such high impact event. However, users have diverse interests and hold strong opinions pertaining to any political party, caste, culture, religion, etc. So, while sharing any information, it is very essential for the social media users that they do not post any abusive or absurd content which might hurt the emotions of others and can end up into dreadful situations. So, there is a dire need of some filtering techniques to build a credibility analysis model which can filter out all such uncredible and questionable contents from social media. In this paper, a machine learning model has been developed to detect the credibility of tweets over four distinct credibility classes. Approximately 5k tweets were crawled from Twitter and preprocessed, which helped in developing a model. Initially, the K-Means clustering algorithm was applied on the dataset to find which tweet falls in which cluster, based on its similarity measures of feature set. The total variance in the dataset explained by K Mean Clustering Algorithm is found to be 86.6%. Furthermore, the Support Vector Machine algorithm is applied to build a classification model and classify the tweets into their respective credibility classes. It provides 96.53% accuracy with 99.51% area under the curve.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Xia, X., Yang, X., Wu, C., Li, S., Bao, L.: Information credibility on Twitter in emergency situation. In: Chau, M., Wang, G.A., Yue, W.T., Chen, H. (eds.) PAISI 2012. LNCS, vol. 7299, pp. 45–59. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30428-6_4
Chapter Google Scholar
Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: real-time credibility assessment of content on Twitter. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 228–243. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-13734-6_16
Chapter Google Scholar
Zhang, Q., Zhang, S., Dong, J., Xiong, J., Cheng, X.: Automatic detection of rumor on social network. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds.) NLPCC 2015. LNCS (LNAI), vol. 9362, pp. 113–122. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25207-0_10
Chapter Google Scholar
Westerman, D., Spence, P.R., Van Der Heide, B.: A social network as information: the effect of system generated reports of connectedness on credibility on Twitter. Comput. Hum. Behav. 28(1), 199–206 (2012). https://doi.org/10.1016/j.chb.2011.09.001
Article Google Scholar
Morris, M.R., Counts, S., Roseway, A., Hoff, A., Schwarz, J.: Tweeting is believing?: understanding microblog credibility perceptions. In: Proceedings of the ACM Conference on Computer Supported Cooperative Work, pp. 441–450. ACM (2012). https://doi.org/10.1145/2145204.2145274
Cha, M., Haddadi, H., Benevenuto, F., Gummadi, P.K.: Measuring user influence in Twitter: the million follower fallacy. In: ICWSM, 10(10–17), 30. (PASSAT), 2012 International Conference on and 2012 International Conference on Social Computing (SocialCom), pp. 91–100. IEEE (2010)
Google Scholar
O’Donovan, J., Kang, B., Meyer, G., Hollerer, T., Adalii, S.: Credibility in context: an analysis of feature distributions in Twitter. In: Proceedings of the 12th ASE/IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT) and ASE/IEEE International Conference on Social Computing (SocialCom), pp. 293–301. IEEE (2012). https://doi.org/10.1109/socialcom-passat.2012.128
Lorek, K., Suehiro-Wiciński, J., Jankowski-Lorek, M., Gupta, A.: Automated credibility assessment on Twitter. Comput. Sci. 16(2), 157–168 (2015). https://doi.org/10.7494/csci.2015.16.2.157
Article Google Scholar
Resnick, P., Carton, S., Park, S., Shen, Y., Zeffer, N.: RumorLens: a system for analyzing the impact of rumors and corrections in social media. In: Proceedings of the Computational Journalism Conference, p. 10121-0701 (2014)
Google Scholar
Thakur, H.K., Gupta, A., Bhardwaj, A., Verma, D.: Rumor detection on Twitter using a supervised machine learning framework. Int. J. Inf. Retrieval Res. (IJIRR) 8(3), 1–13 (2018). https://doi.org/10.4018/IJIRR.2018070101
Article Google Scholar
Sicilia, R., Giudice, S.L., Pei, Y., Pechenizkiy, M., Soda, P.: Twitter rumour detection in the health domain. Expert Syst. Appl. 110, 33–40 (2018). https://doi.org/10.1016/j.eswa.2018.05.019
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Jamia Millia Islamia University, New Delhi, India
Faraz Ahmad

Authors

Faraz Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Syed Afzal Murtaza Rizvi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Faraz Ahmad .

Editor information

Editors and Affiliations

Jaypee University of Information Technology, Waknaghat, Himachal Pradesh, India
Pradeep Kumar Singh
CDAC, Mohali, India
Sanjay Sood
Jaypee University of Information Technology, Solan, Himachal Pradesh, India
Yugal Kumar
Polish Academy of Sciences, Warsaw, Poland
Marcin Paprzycki
Southern Federal University, Rostov-on-Don, Russia
Anton Pljonkin
Jiangsu Normal University, Xuzhou, China
Wei-Chiang Hong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ahmad, F., Rizvi, S.A.M. (2020). Information Credibility on Twitter Using Machine Learning Techniques. In: Singh, P., Sood, S., Kumar, Y., Paprzycki, M., Pljonkin, A., Hong, WC. (eds) Futuristic Trends in Networks and Computing Technologies. FTNCT 2019. Communications in Computer and Information Science, vol 1206. Springer, Singapore. https://doi.org/10.1007/978-981-15-4451-4_29

Download citation

DOI: https://doi.org/10.1007/978-981-15-4451-4_29
Published: 22 April 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4450-7
Online ISBN: 978-981-15-4451-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics