Predicting social response to infectious disease outbreaks from internet-based news streams
Infectious disease outbreaks often have consequences beyond human health, including concern among the population, economic instability, and sometimes violence. A warning system capable of anticipating social disruptions resulting from disease outbreaks is urgently needed to help decision makers prepare appropriately. We designed a system that operates in near real-time to identify and predict social response. Over 150,000 Internet-based news articles related to outbreaks of 16 diseases in 72 countries and territories were provided by HealthMap. These articles were automatically tagged with indicators of the disease activity and population reaction. An anomaly detection algorithm was implemented on the population reaction indicators to identify periods of unusually severe social response. Then a model was developed to predict the probability of these periods of unusually severe social response occurring in the coming week, 2 and 3 weeks. This model exhibited remarkably strong performance for diseases with substantial media coverage. For country-disease pairs with a median of 20 or more articles per year, the onset of social response in the next week was correctly predicted over 60% of the time, and 87% of weeks were correctly predicted. Performance was weaker for diseases with little media coverage, and, for these diseases, the main utility of our system is in identifying social response when it occurs, rather than predicting when it will happen in the future. Overall, the developed near real-time prediction approach is a promising step toward developing predictive models to inform responders of the likely social consequences of disease spread.
KeywordsBiosurveillance Social response Epidemics Anomaly detection Near real-time prediction
- Beck, N., Epstein, D., Jackman, S., & O’Halloran, S. (2001). Alternative models of dynamics in binary time-series-cross-section models: The example of state failure. http://hdl.handle.net/10022/AC:P:9718.
- Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16(1), 321–357.Google Scholar
- Cheng, C. (2004). To be paranoid is the standard? Panic responses to SARS outbreak in the Hong Kong Special Administrative Region. Asian Perspective, 28(1), 67–98.Google Scholar
- Fast, S. M., González, M. C., Wilson, J. M., & Markuzon, N. (2015). Modelling the propagation of social response during a disease outbreak. Journal of The Royal Society Interface, 12(104), 20141105. doi: 10.1098/rsif.2014.1105.
- International Federation of Red Cross and Red Crescent Societies (2015) Red Cross Red Crescent denounces countinued violence against volunteers working to stop the spread of Ebola. http://www.ifrc.org/en/news-and-media/press-releases/africa/guinea/red-cross-denounces-continued-violence-against-volunteers-working-to-stop-the-spread-of-ebola
- Jackman, S. (2000). In and out of war and peace: Transitional models of international conflict. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.200.5895&rank=1.
- Lau, J. T. F., Griffiths, S., Choi, K. C., & Tsui, H. Y. (2010). Avoidance behaviors and negative psychological responses in the general population in the initial stage of the H1N1 pandemic in Hong Kong. BMC Infectious Diseases, 10(1), 139. doi: 10.1186/1471-2334-10-139.
- Lozano, R., Naghavi, M., Foreman, K., Lim, S., Shibuya, K., Aboyans, V., et al. (2012). Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: A systematic analysis for the global burden of disease study 2010. Lancet, 380(9859), 2095–2128. doi: 10.1016/S0140-6736(12)61728-0.CrossRefGoogle Scholar
- Montgomery, D. C. (2009). Introduction to Statistical Quality Control (6th ed.). New Jersey: Wiley.Google Scholar
- Mykhalovskiy, E., & Weir, L. (2006). The global public health intelligence network and early warning outbreak detection: A Canadian contribution to global public health. Canadian Journal of Public Health/Revue Canadienne de SantéPublique, 97(1), 42–44.Google Scholar
- Racette, M. P., Smith, C. T., Cunningham, M. P., Heekin, T. A., Lemley, J. P., & Mathieu, R. S. (2014). Improving situational awareness for humanitarian logistics through predictive modeling. Systems and Information Engineering Design Symposium (SIEDS), 2014, 334–339.Google Scholar
- Truvé, S. (2013). Big data for the future: Unlocking the predictive power of the web. http://www.slideshare.net/RecordedFuture/big-data-for-the-future-unlocking-the-predictive-power-of-the-web
- Vaisman, E., Fast, S. M., Cunha, M. G., Postlethwaite, T., Wilson, J. M., & Mekaru, S. R. (2014). Predicting negative social response to disease outbreaks using biosurveillance and news data. In: 2014 INFORMS Workshop on Data Mining and Analytics.Google Scholar
- Wong, W.K., Moore, A., Cooper, G., & Wagner, M. (2003). Bayesian network anomaly pattern detection for disease outbreaks. In Proceedings of the Twentieth International Conference on Machine Learning (pp. 808–815).Google Scholar