Abstract
Social Network systems, such as Twitter, can serve as important data sources to provide collective intelligence and awareness of health problems in real time. The challenges of utilizing social media data include that the volume of data is large but distributed and of a highly unstructured form. Appropriate data gathering, scrubbing and aggregating efforts for these data are required to transform them for meaningful use. In this paper, we discuss such a social media data ETL (Extract-Transform-Load) method, to provide a user-friendly, dynamic method for visualizing outbreaks and the spread of developing epidemics in space and time. We have developed the Epidemics Outbreak and Spread Detection System (EOSDS) as a prototype that makes use of the rich information retrievable in real time from Twitter. EOSDS provides three different visualization methods of spreading epidemics, static map, distribution map, and filter map, to investigate public health threats in the space and time dimensions. The results of these visualizations in our experiments correlate well with relevant CDC official reports, a gold standard used by health informatics scientists. In our experiments, the EOSDS also detected an unusual situation not shown in the CDC reports, but confirmed by online news media.
Keywords
- Social Network
- Epidemics Detection
- Health Information Visualization
- Epidemics Distribution
- Epidemics Spread
This is a preview of subscription content, access via your institution.
Buying options
Preview
Unable to display preview. Download preview PDF.
References
Ginsberg, J., Mohebbi, M.H., Patel, R.S., Brammer, L., Smolinski, M.S., Brilliant, L.: Detecting influenza epidemics using search engine query data. Nature 457, 1012–1014 (2009)
Sipping from the fire hose: Making sense of a torrent of tweets. The Economist, p. 68 (2011)
Twitter developers documentation, https://dev.twitter.com/docs
Aramaki, E., Maskawa, S., Morita, M.: Twitter Catches The Flu: Detecting Influenza Epidemics using Twitter. In: Conference on Empirical Methods in Natural Language Processing, EMNLP 2011 (2011)
140dev libraries, http://140dev.com/ (accessed on February 8, 2012)
Carley, K.M., Columbus, D., Bigrigg, M., Kunkel, F.: Automap user guide (2011)
Google Map API, http://code.google.com/apis/maps/documentation/geocoding/
Aurousseau, M.: On Lists of Words and Lists of Names. The Geographical Journal 105, 61–67 (1945)
National Places Gazetteer, http://www.census.gov/geo/www/gazetteer/files/Gaz_places_national.txt (accessed on February 8, 2012)
Mazzocchi, S., Garland, S., Lee, R.: SIMILE: practical metadata for the semantic web. O’Reilly (2005)
CDC Listeria report on September 30, http://www.cdc.gov/listeria/outbreaks/cantaloupes-jensen-farms/093011/index.html
Wyoming news report, http://www.health.wyo.gov/news.aspx?NewsID=498 (accessed on February 8, 2012)
CDC Listeria report on October 7, http://www.cdc.gov/listeria/outbreaks/cantaloupes-jensen-farms/100711/index.html (accessed on February 8, 2012)
Brownstein, J.S., Freifeld, C.C., Reis, B.Y., Mandl, K.D.: Surveillance Sans Frontières: Internet-Based Emerging Infectious Disease Intelligence and the HealthMap Project. PLoS Med. 5(7), e151 (2008), doi:10.1371/journal.pmed.0050151
Cheng, Z., Caverlee, J., Lee, K.: Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM 2010), Toronto, Canada, October 26-30 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ji, X., Chun, S.A., Geller, J. (2012). Epidemic Outbreak and Spread Detection System Based on Twitter Data. In: He, J., Liu, X., Krupinski, E.A., Xu, G. (eds) Health Information Science. HIS 2012. Lecture Notes in Computer Science, vol 7231. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29361-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-29361-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29360-3
Online ISBN: 978-3-642-29361-0
eBook Packages: Computer ScienceComputer Science (R0)