Abstract
Social networking services (SNS), such as Twitter, disseminate not only useful information, but also false information. Identifying this false information is crucial in order to keep the information on a SNS reliable. The aim of this paper is to develop a method of extracting false information from among a large collection of tweets. We do so by using a set of linguistic patterns formulated to correct false information. More specifically, the proposed method extracts text passages that match specified correction patterns, clusters the passages into topics of false information, and selects a passage that represents each topic of false information. In the experiment we conduct, we build an evaluation set manually, and demonstrate the effectiveness of the proposed method.
This paper was partly supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI Grants No. 23240018 and 23700159 and by the Precursory Research for Embryonic Science and Technology (PRESTO), Japan Science and Technology Agency (JST). We are grateful to Twitter Japan for its provision of invaluable data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Castillo, C., Mendoza, M., Poblete, B.: Information credibility on Twitter. In: Proceedings of the 20th International Conference on World Wide Web, pp. 675–684 (2011)
Ennals, R., Trushkowsky, B., Agosta, J.M.: Highlighting disputed claims on the web. In: Proceedings of the 19th International Conference on World Wide Web, pp. 341–350 (2010)
Giampiccolo, D., Magnini, B., Dagan, I., Dolan, B.: The third pascal recognizing textual entailment challenge. In: Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, pp. 1–9 (2007)
Harabagiu, S., Hickl, A., Lacatusu, F.: Negation, contrast and contradiction in text processing. In: Proceedings of the 21st National Conference on Artificial Intelligence, pp. 755–762 (2006)
Hsu, C.F., Khabiri, E., Caverlee, J.: Ranking comments on the social web. In: International Conference on Computational Science and Engineering, CSE 2009, vol. 4, pp. 90–97 (2009)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM) 46(5), 604–632 (1999)
Lex, E., Voelske, M., Errecalde, M., Ferretti, E., Cagnina, L., Horn, C., Stein, B., Granitzer, M.: Measuring the quality of web content using factual information. In: Proceedings of the 2nd Joint WICOW/AIRWeb Workshop on Web Quality, pp. 7–10 (2012)
MacCartney, B., Galley, M., Manning, C.D.: A phrase-based alignment model for natural language inference. In: Proceedings of 2008 Conference on Empirical Methods in Natural Language Processing, pp. 802–811 (2008)
de Marneffe, M.C., Rafferty, A.N., Manning, C.D.: Finding contradictions in text. In: Proceedings of ACL 2008: HLT, pp. 1039–1047 (2008)
de Marneffe, M.C., Rafferty, A.R., Manning, C.D.: Identifying Conflicting Information in Texts. In: Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation (2011)
Pasternack, J., Roth, D.: Making better informed trust decisions with generalized fact-finding. In: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, pp. 2324–2329 (2011)
Qazvinian, V., Rosengren, E., Radev, D.R., Mei, Q.: Rumor has it: Identifying misinformation in microblogs. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1589–1599 (2011)
Ratkiewicz, J., Conover, M., Meiss, M., Gonçalves, B., Patil, S., Flammini, A., Menczer, F.: Truthy: Mapping the spread of astroturf in microblog streams. In: Proceedings of the 20th International Conference Companion on World Wide Web, pp. 249–252 (2011)
Ritter, A., Soderland, S., Downey, D., Etzioni, O.: It’s a contradiction – no, it’s not: A case study using functional relations. In: Proceedings of 2008 Conference on Empirical Methods in Natural Language Processing, pp. 11–20 (2008)
Watanabe, Y., Miyao, Y., Mizuno, J., Shibata, T., Kanayama, H., Lee, C.W., Lin, C.J., Shi, S., Mitamura, T., Kando, N., Shima, H., Takeda, K.: Overview of the recognizing inference in text (RITE-2) at NTCIR-10. In: Proceedings of the NTCIR-10 Conference, pp. 385–404 (2013)
Watanabe, Y., Mizuno, J., Inui, K.: THK’s natural logic-based compositional textual entailment model at NTCIR-10 RITE-2. In: Proceedings of the NTCIR-10 Conference, pp. 531–536 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Nabeshima, K., Mizuno, J., Okazaki, N., Inui, K. (2014). Mining False Information on Twitter for a Major Disaster Situation. In: Ślȩzak, D., Schaefer, G., Vuong, S.T., Kim, YS. (eds) Active Media Technology. AMT 2014. Lecture Notes in Computer Science, vol 8610. Springer, Cham. https://doi.org/10.1007/978-3-319-09912-5_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-09912-5_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09911-8
Online ISBN: 978-3-319-09912-5
eBook Packages: Computer ScienceComputer Science (R0)