Skip to main content

Information Credibility on Twitter Using Machine Learning Techniques

  • Conference paper
  • First Online:
Futuristic Trends in Networks and Computing Technologies (FTNCT 2019)

Abstract

In today’s world, people are highly inclined towards social networking sites like Twitter, which provides a platform where users can have easy access to the high impact occasions/events rising worldwide. Users can share views and retweet the contents posted by other users with respect to such high impact event. However, users have diverse interests and hold strong opinions pertaining to any political party, caste, culture, religion, etc. So, while sharing any information, it is very essential for the social media users that they do not post any abusive or absurd content which might hurt the emotions of others and can end up into dreadful situations. So, there is a dire need of some filtering techniques to build a credibility analysis model which can filter out all such uncredible and questionable contents from social media. In this paper, a machine learning model has been developed to detect the credibility of tweets over four distinct credibility classes. Approximately 5k tweets were crawled from Twitter and preprocessed, which helped in developing a model. Initially, the K-Means clustering algorithm was applied on the dataset to find which tweet falls in which cluster, based on its similarity measures of feature set. The total variance in the dataset explained by K Mean Clustering Algorithm is found to be 86.6%. Furthermore, the Support Vector Machine algorithm is applied to build a classification model and classify the tweets into their respective credibility classes. It provides 96.53% accuracy with 99.51% area under the curve.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Xia, X., Yang, X., Wu, C., Li, S., Bao, L.: Information credibility on Twitter in emergency situation. In: Chau, M., Wang, G.A., Yue, W.T., Chen, H. (eds.) PAISI 2012. LNCS, vol. 7299, pp. 45–59. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30428-6_4

    Chapter  Google Scholar 

  2. Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: real-time credibility assessment of content on Twitter. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 228–243. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-13734-6_16

    Chapter  Google Scholar 

  3. Zhang, Q., Zhang, S., Dong, J., Xiong, J., Cheng, X.: Automatic detection of rumor on social network. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds.) NLPCC 2015. LNCS (LNAI), vol. 9362, pp. 113–122. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25207-0_10

    Chapter  Google Scholar 

  4. Westerman, D., Spence, P.R., Van Der Heide, B.: A social network as information: the effect of system generated reports of connectedness on credibility on Twitter. Comput. Hum. Behav. 28(1), 199–206 (2012). https://doi.org/10.1016/j.chb.2011.09.001

    Article  Google Scholar 

  5. Morris, M.R., Counts, S., Roseway, A., Hoff, A., Schwarz, J.: Tweeting is believing?: understanding microblog credibility perceptions. In: Proceedings of the ACM Conference on Computer Supported Cooperative Work, pp. 441–450. ACM (2012). https://doi.org/10.1145/2145204.2145274

  6. Cha, M., Haddadi, H., Benevenuto, F., Gummadi, P.K.: Measuring user influence in Twitter: the million follower fallacy. In: ICWSM, 10(10–17), 30. (PASSAT), 2012 International Conference on and 2012 International Conference on Social Computing (SocialCom), pp. 91–100. IEEE (2010)

    Google Scholar 

  7. O’Donovan, J., Kang, B., Meyer, G., Hollerer, T., Adalii, S.: Credibility in context: an analysis of feature distributions in Twitter. In: Proceedings of the 12th ASE/IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT) and ASE/IEEE International Conference on Social Computing (SocialCom), pp. 293–301. IEEE (2012). https://doi.org/10.1109/socialcom-passat.2012.128

  8. Lorek, K., Suehiro-Wiciński, J., Jankowski-Lorek, M., Gupta, A.: Automated credibility assessment on Twitter. Comput. Sci. 16(2), 157–168 (2015). https://doi.org/10.7494/csci.2015.16.2.157

    Article  Google Scholar 

  9. Resnick, P., Carton, S., Park, S., Shen, Y., Zeffer, N.: RumorLens: a system for analyzing the impact of rumors and corrections in social media. In: Proceedings of the Computational Journalism Conference, p. 10121-0701 (2014)

    Google Scholar 

  10. Thakur, H.K., Gupta, A., Bhardwaj, A., Verma, D.: Rumor detection on Twitter using a supervised machine learning framework. Int. J. Inf. Retrieval Res. (IJIRR) 8(3), 1–13 (2018). https://doi.org/10.4018/IJIRR.2018070101

    Article  Google Scholar 

  11. Sicilia, R., Giudice, S.L., Pei, Y., Pechenizkiy, M., Soda, P.: Twitter rumour detection in the health domain. Expert Syst. Appl. 110, 33–40 (2018). https://doi.org/10.1016/j.eswa.2018.05.019

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Faraz Ahmad .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ahmad, F., Rizvi, S.A.M. (2020). Information Credibility on Twitter Using Machine Learning Techniques. In: Singh, P., Sood, S., Kumar, Y., Paprzycki, M., Pljonkin, A., Hong, WC. (eds) Futuristic Trends in Networks and Computing Technologies. FTNCT 2019. Communications in Computer and Information Science, vol 1206. Springer, Singapore. https://doi.org/10.1007/978-981-15-4451-4_29

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-4451-4_29

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-4450-7

  • Online ISBN: 978-981-15-4451-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics