Flu Detector - Tracking Epidemics on Twitter
We present an automated tool with a web interface for tracking the prevalence of Influenza-like Illness (ILI) in several regions of the United Kingdom using the contents of Twitter’s microblogging service. Our data is comprised by a daily average of approximately 200,000 geolocated tweets collected by targeting 49 urban centres in the UK for a time period of 40 weeks. Official ILI rates from the Health Protection Agency (HPA) form our ground truth. Bolasso, the bootstrapped version of LASSO, is applied in order to extract a consistent set of features, which are then used for learning a regression model.
KeywordsGround Truth Candidate Feature Health Protection Agency Perform Feature Selection Really Simple Syndication
- 1.Fleming, D., Elliot, A.: Lessons from 40 years surveillance of influenza in England and Wales. Epidemiology and Infection 136(7), 866–875 (2007)Google Scholar
- 2.Neuzil, K.M., Hohlbein, C., Zhu, Y.: Illness among schoolchildren during influenza season: effect on school absenteeism, parental absenteeism from work, and secondary illness in families. Arch. Pediatr. Adolesc. Med. 156(10), 986–991 (2002)Google Scholar
- 5.Lampos, V., Cristianini, N.: Tracking the flu pandemic by monitoring the Social Web. In: 2nd IAPR Workshop on Cognitive Information Processing, pp. 411–416 (2010)Google Scholar
- 8.Asur, S., Huberman, B.A.: Predicting the Future with Social Media. Arxiv preprint arXiv:1003.5699 (2010)Google Scholar
- 9.Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)Google Scholar