Boundaries, links and clusters: a new paradigm in spatial analysis?

Jacquez, Geoff M.; Kaufmann, Andy; Goovaerts, Pierre

doi:10.1007/s10651-007-0066-4

Published: 04 December 2007

Volume 15, pages 403–419, (2008)

Environmental and Ecological Statistics

Abstract

This paper develops and applies new techniques for the simultaneous detection of boundaries and clusters within a probabilistic framework. The new statistic “little b” (written b_ij) evaluates boundaries between adjacent areas with different values, as well as links between adjacent areas with similar values. Clusters of high values (hotspots) and low values (coldspots) are then constructed by joining areas that abut locations with significantly high or low values (e.g., an unusually high disease rate) and that are connected to them through a “link”, meaning that the values in the adjoining areas are not significantly different. Two techniques are proposed and evaluated for accomplishing cluster construction: “big B” and the “ladder” approach. We compare the statistical power and empirical Type I and Type II error of these approaches to those of wombling and the local Moran test. Significance may be evaluated using distribution theory based on the product of two continuous (i.e., non-discrete) variables. We also provide a “distribution free” algorithm based on resampling of the observed values. The methods are applied to simulated data for which the locations of boundaries and clusters are known, and are compared and contrasted with clusters found using the local Moran statistic and with polygon Womble boundaries. The little b approach to boundary detection is comparable to polygon wombling in terms of Type I error, Type II error and empirical statistical power. For cluster detection, both the big B and ladder approaches have lower Type I and Type II error and are more powerful than the local Moran statistic. The new methods are not constrained to find clusters of a pre-specified shape, such as circles, ellipses and donuts, and yield a more accurate description of geographic variation than alternative cluster tests that presuppose a specific cluster shape.
We recommend these techniques over existing cluster and boundary detection methods that do not provide such a comprehensive description of spatial pattern.
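
The “distribution free” resampling idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula for b_ij, so the code below assumes, purely for illustration, that the statistic for a pair of adjacent areas is the product of their standardized values; the function names, the edge-list representation of adjacency and the choice of 999 resamples are likewise hypothetical and are not taken from the paper. Under this assumed form, an unusually negative product suggests a boundary (values on opposite sides of the mean) and an unusually positive product suggests a link (both values high or both low).

```python
import random
from statistics import mean, pstdev


def z_scores(values):
    """Standardize values to zero mean and unit (population) standard deviation."""
    m = mean(values)
    s = pstdev(values)
    return [(v - m) / s for v in values]


def pair_stat(z, i, j):
    """ASSUMED statistic: product of the standardized values of areas i and j."""
    return z[i] * z[j]


def resampling_p_values(values, edges, n_perm=999, seed=0):
    """Permutation p-values for each adjacent pair in `edges`.

    values: one observed value per area (e.g., a disease rate).
    edges:  list of (i, j) index pairs for areas that share a border.
    Returns {edge: {"stat", "p_low", "p_high"}}, where p_low is the share of
    resampled statistics <= the observed one and p_high the share >= it.
    """
    rng = random.Random(seed)
    z = z_scores(values)
    observed = {e: pair_stat(z, *e) for e in edges}
    # Counts start at 1 so the observed arrangement is included in the reference set.
    low = {e: 1 for e in edges}
    high = {e: 1 for e in edges}
    shuffled = list(z)
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # reassign the observed values to areas at random
        for e in edges:
            s = pair_stat(shuffled, *e)
            if s <= observed[e]:
                low[e] += 1
            if s >= observed[e]:
                high[e] += 1
    total = n_perm + 1
    return {
        e: {"stat": observed[e], "p_low": low[e] / total, "p_high": high[e] / total}
        for e in edges
    }


if __name__ == "__main__":
    # Six areas along a line: three high values followed by three low values.
    rates = [10, 11, 12, 1, 2, 3]
    chain = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]
    for edge, res in resampling_p_values(rates, chain).items():
        print(edge, res)
```

In this toy example the pair (2, 3) straddles the step between high and low values, so its statistic is negative and its lower-tail p-value is small, while pairs inside either group have positive statistics. The sketch makes no adjustment for multiple comparisons and does not implement the “big B” or “ladder” cluster-construction steps.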

Author information

Authors and Affiliations

BioMedware, 516 North State Street, Ann Arbor, MI, 48104-1236, USA
Geoff M. Jacquez, Andy Kaufmann & Pierre Goovaerts

Corresponding author

Correspondence to Geoff M. Jacquez.

About this article

Cite this article

Jacquez, G.M., Kaufmann, A. & Goovaerts, P. Boundaries, links and clusters: a new paradigm in spatial analysis?. Environ Ecol Stat 15, 403–419 (2008). https://doi.org/10.1007/s10651-007-0066-4

Received: 01 January 2006
Revised: 01 June 2006
Published: 04 December 2007
Issue Date: December 2008
DOI: https://doi.org/10.1007/s10651-007-0066-4
