CbI: Improving Credibility of User-Generated Content on Facebook

Gupta, Sonu; Sachdeva, Shelly; Dewan, Prateek; Kumaraguru, Ponnurangam

doi:10.1007/978-3-030-04780-1_12

Conference paper
First Online: 22 November 2018

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11297)

Abstract

Online Social Networks (OSNs) have become a popular platform for sharing information. Fake news often spreads rapidly in OSNs, especially during news-making events, e.g. the earthquake in Chile (2010) and Hurricane Sandy in the USA (2012). A potential solution is to use machine learning techniques to assess the credibility of a post automatically, i.e. whether a person would consider the post believable or trustworthy. In this paper, we provide a fine-grained definition of credibility: we call a post credible if it is accurate, clear, and timely. Hence, we propose a system which calculates the Accuracy, Clarity, and Timeliness (A-C-T) of a Facebook post, which in turn are used to rank the post for its credibility. We experiment with 1,056 posts created by 107 pages that claim to belong to the news category. We use a set of 152 features to train a classification model for each of A-C-T using supervised algorithms. We use the best-performing features and models to develop a RESTful API and a Chrome browser extension that rank posts for their credibility in real time. The random forest algorithm performed best, achieving ROC AUC of 0.916, 0.875, and 0.851 for Accuracy, Clarity, and Timeliness, respectively.
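To make the reported metric and the ranking step concrete, the following is a minimal, standard-library sketch and not the authors' implementation. It computes ROC AUC in its pairwise form (the probability that a randomly chosen positive post receives a higher score than a randomly chosen negative one) and then orders posts by their A-C-T scores. The function names, the sample scores, and the use of an unweighted mean to combine Accuracy, Clarity, and Timeliness are all illustrative assumptions; the abstract does not state how the three scores are combined.

```python
from itertools import product
from statistics import mean


def roc_auc(labels, scores):
    """ROC AUC as the chance that a random positive outscores a random negative.

    Ties count as half a win (the Mann-Whitney U formulation).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


def rank_posts(posts):
    """Sort posts from most to least credible by the mean of their A-C-T scores.

    The unweighted mean is an assumption made for illustration only.
    """
    def combined(post):
        return mean((post["accuracy"], post["clarity"], post["timeliness"]))

    return sorted(posts, key=combined, reverse=True)


# Hypothetical classifier outputs for six posts (1 = labelled accurate).
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
auc = roc_auc(labels, scores)

# Hypothetical per-dimension scores for three posts.
posts = [
    {"id": "p1", "accuracy": 0.9, "clarity": 0.6, "timeliness": 0.3},
    {"id": "p2", "accuracy": 0.5, "clarity": 0.5, "timeliness": 0.5},
    {"id": "p3", "accuracy": 0.8, "clarity": 0.9, "timeliness": 0.7},
]
ranked_ids = [p["id"] for p in rank_posts(posts)]
```

In this toy data, eight of the nine positive/negative pairs are ordered correctly, so `auc` is 8/9. An AUC of 0.5 corresponds to random ordering and 1.0 to perfect separation, which is the scale on which the reported 0.916, 0.875, and 0.851 should be read.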

Notes

1. https://en.wikipedia.org/wiki/Facebook.
2. https://www.cnbc.com/2016/12/30/read-all-about-it-the-biggest-fake-news-stories-of-2016.html.
3. https://developers.facebook.com/docs/graph-api.
4. https://dev.twitter.com/overview/api.
5. https://www.cs.cornell.edu/people/tj/svm_light/svm_rank.html.
6. Both the tools are in the development stage; hence, they are not available online.
7. http://flask.pocoo.org.
8. https://aws.amazon.com/ec2/.
9. http://scikit-learn.org.

Author information

Authors and Affiliations

Jaypee Institute of Information Technology, Noida, India
Sonu Gupta
National Institute of Technology, Delhi, New Delhi, India
Shelly Sachdeva
Indraprastha Institute of Information Technology, Delhi, New Delhi, India
Prateek Dewan & Ponnurangam Kumaraguru

Corresponding author

Correspondence to Sonu Gupta.

Editor information

Editors and Affiliations

Ashoka University, Sonepat, India
Anirban Mondal
IBM Research - India, New Delhi, India
Himanshu Gupta
University of Minnesota, Minneapolis, MN, USA
Jaideep Srivastava
IIIT, Hyderabad, India
P. Krishna Reddy
National Institute of Technology, Warangal, India
D.V.L.N. Somayajulu

Cite this paper

Gupta, S., Sachdeva, S., Dewan, P., Kumaraguru, P. (2018). CbI: Improving Credibility of User-Generated Content on Facebook. In: Mondal, A., Gupta, H., Srivastava, J., Reddy, P., Somayajulu, D. (eds) Big Data Analytics. BDA 2018. Lecture Notes in Computer Science, vol 11297. Springer, Cham. https://doi.org/10.1007/978-3-030-04780-1_12

DOI: https://doi.org/10.1007/978-3-030-04780-1_12
Published: 22 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04779-5
Online ISBN: 978-3-030-04780-1
eBook Packages: Computer Science, Computer Science (R0)
