Detecting False Information in Medical and Healthcare Domains: A Text Mining Approach
In recent years, a lot of false information in medical and healthcare domains has emerged and spread over the Internet. Such false information has become a big risk to public health and safety. This study investigates this problem by analyzing data collected from two fact-checking websites, 416 medical claims from Snopes.com and 1,692 healthcare-related statements from PolitiFact.com. Topic analysis reveals frequent words and common topics occurring in these claims spread online. Furthermore, using text-mining and machine-learning techniques, this study builds prediction models for detecting false information and shows promising performance. Several textual and source features are identified as good indicators for true or false information in medical and healthcare domains.
KeywordsFalse information Medical Healthcare Text mining
- 1.Kumar, S., Shah, N.: False Information on Web and Social Media: A Survey, April 2018Google Scholar
- 2.Sloan, L., Quan-Haase, A., Rubin, V.L.: Deception detection and rumor debunking for social media. In: The SAGE Handbook of Social Media Research Methods, pp. 342–363 (2017)Google Scholar
- 4.Shu, K., Wang, S., Liu, H.: Beyond news contents: the role of social context for fake news detection. In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, pp. 312–320 (2019)Google Scholar
- 5.Ott, M., Choi, Y., Cardie, C., Hancock, J.T.: Finding deceptive opinion spam by any stretch of the imagination. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 309–319 (2011)Google Scholar
- 6.Li, J., Ott, M., Cardie, C., Hovy, E.: Towards a general rule for identifying deceptive opinion spam. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (vol. 1: Long Papers), pp. 1566–1576 (2014)Google Scholar