Flu Detector - Tracking Epidemics on Twitter

  • Vasileios Lampos
  • Tijl De Bie
  • Nello Cristianini
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6323)


We present an automated tool with a web interface for tracking the prevalence of Influenza-like Illness (ILI) in several regions of the United Kingdom using the contents of Twitter’s microblogging service. Our data is comprised by a daily average of approximately 200,000 geolocated tweets collected by targeting 49 urban centres in the UK for a time period of 40 weeks. Official ILI rates from the Health Protection Agency (HPA) form our ground truth. Bolasso, the bootstrapped version of LASSO, is applied in order to extract a consistent set of features, which are then used for learning a regression model.


Ground Truth Candidate Feature Health Protection Agency Perform Feature Selection Really Simple Syndication 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Fleming, D., Elliot, A.: Lessons from 40 years surveillance of influenza in England and Wales. Epidemiology and Infection 136(7), 866–875 (2007)Google Scholar
  2. 2.
    Neuzil, K.M., Hohlbein, C., Zhu, Y.: Illness among schoolchildren during influenza season: effect on school absenteeism, parental absenteeism from work, and secondary illness in families. Arch. Pediatr. Adolesc. Med. 156(10), 986–991 (2002)Google Scholar
  3. 3.
    Ginsberg, J., Mohebbi, M.H., et al.: Detecting influenza epidemics using search engine query data. Nature 457(7232), 1012–1014 (2008)CrossRefGoogle Scholar
  4. 4.
    Polgreen, P.M., Chen, Y., et al.: Using internet searches for influenza surveillance. Clinical Infectious Diseases 47, 1443–1448 (2008)CrossRefGoogle Scholar
  5. 5.
    Lampos, V., Cristianini, N.: Tracking the flu pandemic by monitoring the Social Web. In: 2nd IAPR Workshop on Cognitive Information Processing, pp. 411–416 (2010)Google Scholar
  6. 6.
    Bach, F.R.: Bolasso: model consistent Lasso estimation through the bootstrap. ICML 25, 33–40 (2008)CrossRefGoogle Scholar
  7. 7.
    Tibshirani, R.: Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society 58B, 267–288 (1996)MathSciNetGoogle Scholar
  8. 8.
    Asur, S., Huberman, B.A.: Predicting the Future with Social Media. Arxiv preprint arXiv:1003.5699 (2010)Google Scholar
  9. 9.
    Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Vasileios Lampos
    • 1
  • Tijl De Bie
    • 1
  • Nello Cristianini
    • 1
  1. 1.Intelligent Systems LaboratoryUniversity of BristolUK

Personalised recommendations