Normality Tests for Spatially Correlated Data

Pardo-Igúzquiza, Eulogio; Dowd, Peter A.

doi:10.1023/B:MATG.0000039540.43774.2b

Published: August 2004

Volume 36, pages 659–681 (2004)

Mathematical Geology

Eulogio Pardo-Igúzquiza¹ &
Peter A. Dowd²

423 Accesses
22 Citations

Abstract

In studies that involve a finite sample size of spatial data it is often of interest to test (statistically) the assumption that the marginal (or univariate) distribution of the data is Gaussian (normal). This may be important per se because, for example, a data transformation may be desired if the normality hypothesis is rejected, or it may provide a way of testing other hypotheses, such as lognormality, by testing the normality of the logarithms of the observations. The most commonly used tests, such as the Kolmogorov–Smirnov (K–S), chi-square (χ²), and Shapiro–Wilk (S–W) tests, are designed on the assumption that the observations are independent and identically distributed (iid). In geostatistical applications, however, this is not usually the case unless the spatial covariance (semivariogram) function is a pure nugget variance. If the covariance structure has a (practical) range greater than the minimum distance between observations, the data are correlated and the standard tests cannot be applied to the probability density function (pdf) or cumulative distribution function (cdf) estimated directly from the data. The problem with correlated data arises not from the correlation per se but from cases in which correlated data are clustered rather than being located on a regular grid. In these cases inferences requiring iid assumptions may be seriously biased because of the spatial correlation among the observations. If unbiased (i.e., de-clustered) estimates of the pdf or cdf are obtained, then normality tests, such as K–S, χ², or S–W, can be applied using the unbiased estimates and an effective number of samples equivalent to the iid case. There are three questions to be addressed in these cases:

• Is the distribution ergodic?

• How are unbiased estimates of the pdf and cdf obtained from clustered samples?

• What is the effective number of samples equivalent to the iid case?
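
To make the third question concrete, the following minimal Python sketch computes one common definition of an effective sample size for a weighted mean. It rests on the identity Var(Σ wᵢZᵢ) = σ² wᵀRw for weights that sum to one, where R is the correlation matrix of the observations; equating this to the iid value σ²/n gives n_eff = 1 / (wᵀRw). This is an illustrative assumption and not necessarily the definition adopted in the paper. The function name, the exponential correlation model and the `corr_range` parameter are hypothetical choices made for the example.

```python
import numpy as np

def effective_sample_size(coords, weights, corr_range):
    """Effective number of iid-equivalent samples for a weighted mean.

    Assumes (for illustration only) an exponential correlation model
    rho(h) = exp(-3 h / corr_range), so that corr_range acts as the
    practical range. corr_range must be positive.
    """
    coords = np.asarray(coords, dtype=float)
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()  # normalise the weights to sum to one

    # Pairwise Euclidean distances between sample locations.
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)

    # Correlation matrix under the assumed exponential model.
    corr = np.exp(-3.0 * dist / corr_range)

    # Var(weighted mean) = sigma^2 * w' R w, hence n_eff = 1 / (w' R w).
    return 1.0 / (w @ corr @ w)
```

With equal weights, widely separated samples return a value close to the actual sample count, while samples that share a single location return 1, which is the behaviour expected of clustered, strongly correlated data.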

Working within the framework of the universal model (generalized linear model) in which a spatial process, Z(x), is composed of a deterministic drift m(x) and an (auto-) correlated residual e(x), Z(x) = m(x) + e(x), the assumption of distribution ergodicity (an assumption that can be checked from the experimental data) implies that the normality test should be applied to the variable, Z(x), if the drift is constant (m(x) = m), and to the residual variable if the drift is variable. We show that an efficient method for obtaining unbiased estimates of the pdf or cdf is by weighting the observations (i.e., de-clustering) using block kriging. Block kriging requires an estimate of the semivariogram and we present a new method of semivariogram estimation that is robust with respect to data clustering. In addition, we discuss a way of determining the effective number of samples required for the application of a normality test and for constructing confidence intervals for statistics such as the mean and variance. The method is illustrated using a published data set.
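
As a rough illustration of how de-clustering weights enter a normality test, the sketch below builds a weighted empirical cdf and measures its maximum distance from a normal cdf fitted with the weighted mean and variance (a K–S-type statistic). The weights are taken as given and assumed non-negative; in the approach summarised above they would come from block kriging, which is not implemented here. The function name is hypothetical, and the code is a sketch under these assumptions rather than the authors' procedure.

```python
import numpy as np
from math import erf, sqrt

def weighted_ks_normal(values, weights):
    """Return (weighted mean, weighted std, K-S-type distance to a normal cdf).

    Assumes non-negative weights and at least two distinct values.
    """
    z = np.asarray(values, dtype=float)
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()  # normalise the weights to sum to one

    # Sort the observations and carry the weights along.
    order = np.argsort(z)
    z, w = z[order], w[order]

    # Weighted (de-clustered) estimates of the mean and standard deviation.
    mean = float(np.sum(w * z))
    std = sqrt(float(np.sum(w * (z - mean) ** 2)))

    # Weighted empirical cdf just after and just before each observation.
    cdf_after = np.cumsum(w)
    cdf_before = cdf_after - w

    # Normal cdf with the estimated parameters, evaluated at each observation.
    normal = np.array([0.5 * (1.0 + erf((v - mean) / (std * sqrt(2.0)))) for v in z])

    # Largest vertical gap between the step function and the normal cdf.
    d = max(np.max(np.abs(cdf_after - normal)), np.max(np.abs(normal - cdf_before)))
    return mean, std, float(d)
```

The resulting distance would then be compared with a critical value appropriate for estimated parameters, using an effective number of samples in place of the raw sample count. Giving one observation twice the weight of the others yields the same statistic as duplicating that observation under equal weights.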

Author information

Authors and Affiliations

Department of Mining and Mineral Engineering, University of Leeds, Leeds, LS2 9JT, United Kingdom
Eulogio Pardo-Igúzquiza
Faculty of Engineering, Computer and Mathematical Sciences, University of Adelaide, Australia
Peter A. Dowd

About this article

Cite this article

Pardo-Igúzquiza, E., Dowd, P.A. Normality Tests for Spatially Correlated Data. Mathematical Geology 36, 659–681 (2004). https://doi.org/10.1023/B:MATG.0000039540.43774.2b

Issue Date: August 2004
DOI: https://doi.org/10.1023/B:MATG.0000039540.43774.2b
