Abstract
In today’s world, people are highly inclined towards social networking sites like Twitter, which provides a platform where users can have easy access to the high impact occasions/events rising worldwide. Users can share views and retweet the contents posted by other users with respect to such high impact event. However, users have diverse interests and hold strong opinions pertaining to any political party, caste, culture, religion, etc. So, while sharing any information, it is very essential for the social media users that they do not post any abusive or absurd content which might hurt the emotions of others and can end up into dreadful situations. So, there is a dire need of some filtering techniques to build a credibility analysis model which can filter out all such uncredible and questionable contents from social media. In this paper, a machine learning model has been developed to detect the credibility of tweets over four distinct credibility classes. Approximately 5k tweets were crawled from Twitter and preprocessed, which helped in developing a model. Initially, the K-Means clustering algorithm was applied on the dataset to find which tweet falls in which cluster, based on its similarity measures of feature set. The total variance in the dataset explained by K Mean Clustering Algorithm is found to be 86.6%. Furthermore, the Support Vector Machine algorithm is applied to build a classification model and classify the tweets into their respective credibility classes. It provides 96.53% accuracy with 99.51% area under the curve.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Xia, X., Yang, X., Wu, C., Li, S., Bao, L.: Information credibility on Twitter in emergency situation. In: Chau, M., Wang, G.A., Yue, W.T., Chen, H. (eds.) PAISI 2012. LNCS, vol. 7299, pp. 45–59. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30428-6_4
Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: real-time credibility assessment of content on Twitter. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 228–243. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-13734-6_16
Zhang, Q., Zhang, S., Dong, J., Xiong, J., Cheng, X.: Automatic detection of rumor on social network. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds.) NLPCC 2015. LNCS (LNAI), vol. 9362, pp. 113–122. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25207-0_10
Westerman, D., Spence, P.R., Van Der Heide, B.: A social network as information: the effect of system generated reports of connectedness on credibility on Twitter. Comput. Hum. Behav. 28(1), 199–206 (2012). https://doi.org/10.1016/j.chb.2011.09.001
Morris, M.R., Counts, S., Roseway, A., Hoff, A., Schwarz, J.: Tweeting is believing?: understanding microblog credibility perceptions. In: Proceedings of the ACM Conference on Computer Supported Cooperative Work, pp. 441–450. ACM (2012). https://doi.org/10.1145/2145204.2145274
Cha, M., Haddadi, H., Benevenuto, F., Gummadi, P.K.: Measuring user influence in Twitter: the million follower fallacy. In: ICWSM, 10(10–17), 30. (PASSAT), 2012 International Conference on and 2012 International Conference on Social Computing (SocialCom), pp. 91–100. IEEE (2010)
O’Donovan, J., Kang, B., Meyer, G., Hollerer, T., Adalii, S.: Credibility in context: an analysis of feature distributions in Twitter. In: Proceedings of the 12th ASE/IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT) and ASE/IEEE International Conference on Social Computing (SocialCom), pp. 293–301. IEEE (2012). https://doi.org/10.1109/socialcom-passat.2012.128
Lorek, K., Suehiro-Wiciński, J., Jankowski-Lorek, M., Gupta, A.: Automated credibility assessment on Twitter. Comput. Sci. 16(2), 157–168 (2015). https://doi.org/10.7494/csci.2015.16.2.157
Resnick, P., Carton, S., Park, S., Shen, Y., Zeffer, N.: RumorLens: a system for analyzing the impact of rumors and corrections in social media. In: Proceedings of the Computational Journalism Conference, p. 10121-0701 (2014)
Thakur, H.K., Gupta, A., Bhardwaj, A., Verma, D.: Rumor detection on Twitter using a supervised machine learning framework. Int. J. Inf. Retrieval Res. (IJIRR) 8(3), 1–13 (2018). https://doi.org/10.4018/IJIRR.2018070101
Sicilia, R., Giudice, S.L., Pei, Y., Pechenizkiy, M., Soda, P.: Twitter rumour detection in the health domain. Expert Syst. Appl. 110, 33–40 (2018). https://doi.org/10.1016/j.eswa.2018.05.019
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ahmad, F., Rizvi, S.A.M. (2020). Information Credibility on Twitter Using Machine Learning Techniques. In: Singh, P., Sood, S., Kumar, Y., Paprzycki, M., Pljonkin, A., Hong, WC. (eds) Futuristic Trends in Networks and Computing Technologies. FTNCT 2019. Communications in Computer and Information Science, vol 1206. Springer, Singapore. https://doi.org/10.1007/978-981-15-4451-4_29
Download citation
DOI: https://doi.org/10.1007/978-981-15-4451-4_29
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4450-7
Online ISBN: 978-981-15-4451-4
eBook Packages: Computer ScienceComputer Science (R0)