Temporal and Spatial Clustering of Bacterial Genotypes

  • Blanca Gallego


Genotypic characterization of bacterial isolates provides valuable information for epidemiological surveillance and microbial population biology. In particular, the ability to discern clonal relatedness among isolates can be used to identify links and sites of transmission, some of which are not easily traced using conventional contact investigation. The spatial and temporal clustering of isolates that share the same or closely related genotypes can add further value to the use of molecular fingerprinting in the detection and management of infectious disease outbreaks. This chapter reviews and discusses the use of both spatio-temporal clustering and bacterial genotypes in public health biosurveillance and includes examples of temporal and spatial clustering of bacterial genotypes that allow for the integration of bacterial genotyping into public health decision making.


Spatial Cluster Exponentially Weighted Move Average Statistical Process Control Public Health Action Outbreak Detection 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



The author acknowledges substantial contributions from Qinning Wang, Gwendolyn L Gilbert, Vitali Sintchenko and Peter Howard of the Centre for Infectious Diseases and Microbiology, Institute of Clinical Pathology and Medical Research, Sydney West Area Health Service and The University of Sydney. This work was supported by the Australian Research Council.


  1. Brookmeyer R, Stroup D (2004) Monitoring the health of populations: statistical methods for public health surveillance. Oxford University Press, OxfordGoogle Scholar
  2. Buckeridge D (2007) Outbreak detection through automated surveillance: A review of the determinants of detection. J Biomed Inform 40:370–379CrossRefPubMedGoogle Scholar
  3. Buckeridge D, Burkom H et al. (2005) Algorithms for rapid outbreak detection: a research synthesis. J Biomed Inform 38:99–113CrossRefPubMedGoogle Scholar
  4. Burkom H, Murphy S (2007) Data classification for selection of temporal alerting methods for biosurveillance. In: Zeng D et al. (eds) Intelligence and Security Information: Biosurveillance. Lecture Notes in Computer Science 4506. Springer, pp. 59–70Google Scholar
  5. Chan M-S, Maiden M et al. (2001) Database-driven multi locus sequence typing (MLST) of bacterial pathogens. Bioinformatics 17:1077–1083CrossRefPubMedGoogle Scholar
  6. Clayton D, Kaldor J (1987) Empirical Bayes estimates of age-standardised relative risks for use in disease mapping. Biometrics 43:671–681CrossRefPubMedGoogle Scholar
  7. Crawford J, Braden C et al. (2002) National Tuberculosis Genotyping and Surveillance Network: design and methods. Emerg Infect Dis 8:1192–1196PubMedGoogle Scholar
  8. Cuzick J, Edwards R (1990) Spatial clustering for inhomogeneous populations. J R Stat Soc Series B 52:73–104Google Scholar
  9. Edgerton J, Burkom H et al. (2007) Modifications to spatial scan statistics for estimated probabilities at fine-resolution in highly skewed spatial distributions. Adv Dis Surv 4:89Google Scholar
  10. Elliot P, Cuzick J, et al. (2000) Geographical and environmental epidemiology: methods for Small area studies. Oxford University Press, OxfordGoogle Scholar
  11. Frisen M (2003) Statistical surveillance: optimality and methods. Int Stat Rev 71:403–434CrossRefGoogle Scholar
  12. Frisen M, Demare J (1991) Optimal surveillance. Biometrika 78:271–290Google Scholar
  13. Gallego B, Sintchenko V et al. (2009) Biosurveillance of emerging biothreats using scalable genotype clustering. J Biomed Inform 42:66–73CrossRefPubMedGoogle Scholar
  14. Harvey A (1993) Time series models. The MIT Press, BostonGoogle Scholar
  15. Hawkins D, Olwell D (1998) Cumulative sum charts and charting for quality improvement. Springer, New YorkGoogle Scholar
  16. Howard S, Burkom YE 2A, Feldman 1J, Lin1 A (2004) Role of data aggregation in biosurveillance detection strategies with applications from ESSENCE. MMWR Morb Mortal Wkly Rep 53(Suppl):67–73Google Scholar
  17. Kelsall J, Diggle P (1998) Spatial variation in risk of disease: a nonparametric binary regression approach. J R Stat Soc Series C 47:559–573CrossRefGoogle Scholar
  18. Kulldorff M (1997) A spatial scan statistic. Commun Stat Theory Methods 26:1481–1496CrossRefGoogle Scholar
  19. Kulldorff M (2001) Prospective time-periodic geographical disease surveillance using a scan statistic. J R Stat Soc Series A 164:61–72CrossRefGoogle Scholar
  20. Kulldorff M, Heffernan R et al. (2005) A space-time permutation scan statistic for disease outbreak detection. PLoS Med 2:e59CrossRefPubMedGoogle Scholar
  21. Kulldorff M, Mostashari F et al. (2007) Multivariate scan statistics for disease surveillance. Stat Med 26:1824–1833CrossRefPubMedGoogle Scholar
  22. Lawson A (2001) Statistical methods in spatial epidemiology. Wiley, LondonGoogle Scholar
  23. Lawson A, Denison D (2002) Spatial cluster modelling: an overview. Spatial Cluster Modelling. CRC Press, New YorkCrossRefGoogle Scholar
  24. LeSage J, Banerjee S et al. (2009) Spatial statistics: methods, models & computation. Comput Stat Data Anal 53:2781–2785CrossRefGoogle Scholar
  25. Lindstedt B, Vardund T et al. (2004) Multiple-locus variable-number tandem-repeats analysis of Salmonella enterica subsp. enterica serovar Typhimurium using PCR multiplexing and multicolor capillary electrophoresis. J Microbiol Methods 59:163–172CrossRefPubMedGoogle Scholar
  26. McNabb S, Kammerer J et al. (2004) Added epidemiologic value to tuberculosis prevention and control of the investigation of clustered genotypes of Mycobacterium tuberculosis isolates. Am J Epidemiol 160:589–597CrossRefPubMedGoogle Scholar
  27. Naus J (1965) The distribution of the size of the maximum cluster of points on a line. J Am Stat Assoc 60:532–538CrossRefGoogle Scholar
  28. Neill D, Moore A (2004) Rapid detection of significant spatial clusters. In: Proceedings of the 10th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp 256–265Google Scholar
  29. Overhage J, Suico J, et al. (2001) Electronic laboratory reporting: barriers, solutions and findings. J Public Health Manag Pract 7:60–66PubMedGoogle Scholar
  30. Panackal A, Mikanatha N, et al. (2002) Automatic electronic laboratory-based reporting of notifiable infectious diseases at a large health system. Emerg Infect Dis 8:685–691PubMedGoogle Scholar
  31. Potthoff R, Whittinghill M (1966) Testing for homogeneity. II. The Poisson distribution. Biometrika 53:183–190PubMedGoogle Scholar
  32. Roberts SW (2000) Control chart tests based on geometric moving averages. Technometrics 42(1):97–101Google Scholar
  33. Shmueli G (2005) Wavelet-based monitoring in modern biosurveillance. Robert H Smith School Research Paper No RHS-06–002 Available at SSRN: http://ssrncom/abstract=902878
  34. Shmueli G, Fienberg S (2005) Current and potential statistical methods for monitoring multiple data streams for biosurveillance. In: Alyson G, Wilson D, Olwell A (eds) Statistical methods in counterterrorism: game theory, modeling, syndromic surveillance, and biometric authentication. Springer, New YorkGoogle Scholar
  35. Sintchenko V, Iredell JR, et al. (2007) Genomic profiling of pathogens for disease management and surveillance. Nat Microbiol Rev 5:464–470CrossRefGoogle Scholar
  36. Sonesson C (2007) A CUSUM framework for detection of space-time disease clusters using scan statistics. Stat Med 26:4770–4789CrossRefPubMedGoogle Scholar
  37. Sonesson C, Bock D (2003) A review and discussion of prospective statistical surveillance in public health. J R Stat Soc 166:5–21CrossRefGoogle Scholar
  38. Song C, Kulldorff M (2003) Power evaluation of disease clustering tests. Int J Health Geogr 2:1–8CrossRefGoogle Scholar
  39. Tango T (1995) A class of tests for detecting “general” and “focused” clustering of rare diseases. Stat Med 14:2323–2334CrossRefPubMedGoogle Scholar
  40. Tauxe RV (2006) Molecular subtyping and the transformation of public health. Foodborne Pathog Dis 3:4–8CrossRefPubMedGoogle Scholar
  41. Torpdahl M, Sorensen G et al. (2007) Tandem repeat analysis for surveillance of human Salmonella typhimurium Infections. Emerg Infect Dis 13:388–395CrossRefPubMedGoogle Scholar
  42. Wagner M, Moore A et al. (2006) Handbook of biosurveillance. Elsevier Academic PressGoogle Scholar
  43. Williamson GD, Weatherby HG (1999) A monitoring system for detecting aberrations in public health surveillance reports. Stat Med 18:3283–3298CrossRefPubMedGoogle Scholar
  44. Wong W, Moore A et al. (2002) Rule-based anomaly pattern detection for detecting disease outbreaks. AAAI-02. Edmonton, Alberta, pp 217–223Google Scholar
  45. Wong W, Moore A et al. (2003) What’s strange about recent events? J Urban Health 80:66–75Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  • Blanca Gallego
    • 1
  1. 1.Centre for Health InformaticsUniversity of New South WalesSydneyAustralia

Personalised recommendations