Abstract
Disaster management urgently requires mechanisms for achieving situation awareness (SA) in a timely manner, allowing authorities to react in an appropriate way to reduce the impact on affected people and infrastructure. In such situations, no matter if they are human-induced like shootings or natural ones like earthquakes or floods, social media such as Twitter are frequently used communication channels, making them a highly valuable additional data source for enhancing SA. The challenge is, however, to identify out of the tremendous mass of irrelevant and non informative social media data those messages being really “informative”, i.e., contributing to SA in a certain disaster situation. Existing approaches on machine-learning driven informativeness classification most often focus on specific disaster types, such as shootings or floods, thus lacking general applicability and falling short in classification of new disaster events. Therefore, this article puts forward the following three contributions: First, in order to better understand the underlying social media data source, an in-depth analysis of existing Twitter data on 26 different disaster events is provided along temporal, spatial, linguistic, and source dimensions. Second, based thereupon, a cross-domain informativeness classifier is proposed being not focused on specific disaster types but rather allowing for classifications across different types. Third, the applicability of this cross-domain classifier is demonstrated, showing its accuracy compared to other disaster type specific approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
It has to be noted that a considerably shorter pre-version of this article has already been published in Proceeding of the 10th International Conference on Management of Digital EcoSystems. ACM, Tokyo, Japan, Sept. 2018.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
References
Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisc. Rev. Comput. Stat. 2(4), 433–459 (2010)
Acerbo, F., Rossi, C.: Filtering informative tweets during emergencies: a machine learning approach. In: Proceedings of the 1st CoNEXT Workshop on ICT Tools for Emergency Networks and Disaster Relief, I-TENDER 2017, pp. 1–6. ACM, New York (2017)
Avvenuti, M., Cimino, M.G.C.A., Cresci, S., Marchetti, A., Tesconi, M.: A framework for detecting unfolding emergencies using humans as sensors. SpringerPlus 5(1), 1–23 (2016). https://doi.org/10.1186/s40064-016-1674-y
Cameron, M., Power, R., Robinson, B., Yin, J.: Emergency situation awareness from twitter for crisis management. In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012, pp. 695–698. ACM, New York (2012)
Cresci, S., Tesconi, M., Cimino, A., Dell’Orletta, F.: A linguistically-driven approach to cross-event damage assessment of natural disasters from social media messages. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015, pp. 1195–1200. ACM (2015)
Dai, W., Xue, G., Yang, Q., Yu, Y.: Transferring Naive Bayes classifiers for text classification. In: Proceedings of the 22nd International Conference on Association for the Advancement of Artificial Intelligence, AAAI 2007, vol. 7, pp. 540–545 (2007)
Derczynski, L., Meesters, K., Bontcheva, K., Maynard, D.: Helping crisis responders find the informative needle in the tweet haystack. arXiv preprint arXiv:1801.09633 (2018)
Girtelschmid, S., Salfinger, A., Pröll, B., Retschitzegger, W., Schwinger, W.: Near real-time detection of crisis situations. In: Proceedings of 39th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2016, pp. 247–252. IEEE (2016)
Horn, C., Zhila, A., Gelbukh, A., Kern, R., Lex, E.: Using factual density to measure informativeness of web documents. In: Proceedings of the 19th Nordic Conference of Computational Linguistics, NODALIDA 2013, pp. 227–238 (2013)
Imran, M., Castillo, C., Diaz, F., Vieweg, S.: Processing social media messages in mass emergency: a survey. ACM Comput. Surv. (CSUR) 47, 1–38 (2015)
Imran, M., Elbassuoni, S., Castillo, C., Diaz, F., Meier, P.: Extracting information nuggets from disaster-related messages in social media. In: Proceedings of the 10th Conference for Information Systems for Crisis Response and Management, ISCRAM 2013 (2013)
Imran, M., Mitra, P., Srivastava, J.: Cross-language domain adaptation for classifying crisis-related short messages. arXiv preprint arXiv:1602.05388 (2016)
Khare, P., Burel, G., Alani, H.: Classifying crises-information relevancy with semantics. In: Gangemi, A., et al. (eds.) ESWC 2018. LNCS, vol. 10843, pp. 367–383. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_24
Khare, P., Fernandez, M., Alani, H.: Statistical semantic classification of crisis information. In: 1st workshop of Hybrid Statistical Semantic Understanding and Emerging Semantics (HSSUES), 16th International Semantic Web Conference (2017)
Khare, P., Burel, G., Maynard, D., Alani, H.: Cross-lingual classification of crisis data. In: Vrandečić, D.D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 617–633. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_36
Kireyev, K.: Semantic-based estimation of term informativeness. In: Proceedings of the 2009 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL-HLT 2009, pp. 530–538. Association for Computational Linguistics (2009)
Li, H., et al.: Twitter mining for disaster response: a domain adaptation approach. In: Proceedings of the 12th Conference for Information Systems for Crisis Response and Management, ISCRAM 2015 (2015)
Lloret, E., Palomar, M.: Analysing and evaluating the task of automatic tweet generation: knowledge to business. Comput. Ind. 78, 3–15 (2016)
Longhini, J., Rossi, C., Casetti, C., Angaramo, F.: A language-agnostic approach to exact informative tweets during emergency situations. In: International Conference on Big Data, Big Data 2017, pp. 3475–3739. IEEE (2017)
Mohammad, S., Kiritchenko, S., Zhu, X.: NRC-Canada: building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:1308.6242 (2013)
Ning, X., Yao, L., Wang, X., Benatallah, B.: Calling for response: automatically distinguishing situation-aware tweets during crises. In: Cong, G., Peng, W.-C., Zhang, W.E., Li, C., Sun, A. (eds.) ADMA 2017. LNCS (LNAI), vol. 10604, pp. 195–208. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69179-4_14
Olteanu, A., Vieweg, S., Castillo, C.: What to expect when the unexpected happens: social media communications across crises. In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, CSCW 2015, pp. 994–1009. ACM (2015)
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Pekar, V., Binner, J., Najafi, H., Hale, C.: Selecting classification features for detection of mass emergency events on social media. In: Proceedings of the 15th International Conference on Security and Management, SAM 2016, The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (WorldComp), p. 192 (2016)
Pekar, V., Binner, J., Najafi, H., Hale, C., Schmidt, V.: Early detection of heterogeneous disaster events using social media. J. Assoc. Inf. Sci. Technol. 71, 43–54 (2020)
Ren, X., et al.: CoType: joint extraction of typed entities and relations with knowledge bases. In: Proceedings of the 26th International Conference on World Wide Web, WWW 2017, International World Wide Web Conference on Steering Committee, pp. 1015–1024 (2017)
Rossi, C., et al.: Early detection and information extraction for weather-induced floods using social media streams. Int. J. Disaster Risk Reduct. 30, 145–157 (2018)
Salfinger, A.: Staying aware in an evolving world. Ph.D. thesis, Johannes Kepler University of Linz (2016)
Salfinger, A., Salfinger, C., Pröll, B., Retschitzegger, W., Schwinger, W.: Pinpointing the eye of the hurricane-creating a gold-standard corpus for situative geo-coding of crisis tweets based on linked open data. In: LDL 2016 5th Workshop on Linked Data in Linguistics: Managing, Building and Using Linked Language Resources, p. 27 (2016)
Salfinger, A., Schwinger, W., Retschitzegger, W., Pröll, B.: Mining the disaster hotspots-situation-adaptive crowd knowledge extraction for crisis management. In: Proceedings of the 2016 Multi-Disciplinary International Conference on Cognitive Methods in Situation Awareness and Decision Support, CogSIMA 2016, pp. 212–218. IEEE (2016)
Stowe, K., Paul, M., Palmer, M., Palen, L., Anderson, K.: Identifying and categorizing disaster-related tweets. In: Proceedings of The 4th International Workshop on Natural Language Processing for Social Media, pp. 1–6 (2016)
Tsuchida, T., Kato, D., Endo, M., Hirota, M., Araki, T., Ishikawa, H.: Analyzing Relationship of words using biased LexRank from geotagged tweets. In: Proceedings of the 9th International Conference on Management of Digital Ecosystems, MEDES 2017, pp. 42–49. ACM, New York (2017)
Verma, S., et al.: Natural language processing to the rescue? Extracting “situational awareness” tweets during mass emergency. In: Proceedings of the 5th Conference on Weblogs and Social Media, ICWSM 2011 (2011)
Vieweg, S.: Situational awareness in mass emergency: a behavioral and linguistic analysis of microblogged communications. Ph.D. thesis, University of Colorado at Boulder (2012)
Vieweg, S., Hughes, A.L., Starbird, K., Palen, L.: Microblogging during two natural hazards events: what twitter may contribute to situational awareness. In: Proceedings of the Conference on Human Factors in Computing Systems, CHI 2010, pp. 1079–1088. ACM (2010)
Wong, B., Kit, C.: Comparative evaluation of term informativeness measures for machine translation evaluation metrics. In: Proceedings of the 13th Conference of Machine Translation Summit, vol. 2011, pp. 537–544 (2011)
Wu, Z., Giles, C.: Measuring term informativeness in context. In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013, pp. 259–269 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer-Verlag GmbH Germany, part of Springer Nature
About this chapter
Cite this chapter
Graf, D., Retschitzegger, W., Schwinger, W., Pröll, B., Kapsammer, E. (2020). Exploiting Twitter for Informativeness Classification in Disaster Situations. In: Hameurlain, A., et al. Transactions on Large-Scale Data- and Knowledge-Centered Systems XLV. Lecture Notes in Computer Science(), vol 12390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-62308-4_2
Download citation
DOI: https://doi.org/10.1007/978-3-662-62308-4_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-62307-7
Online ISBN: 978-3-662-62308-4
eBook Packages: Computer ScienceComputer Science (R0)