Abstract
A Gaussian conditional autoregressive (CAR) formulation is presented that permits the modelling of the spatial dependence and the dependence between multivariate random variables at irregularly spaced sites so capturing some of the modelling advantages of the geostatistical approach. The model benefits not only from the explicit availability of the full conditionals but also from the computational simplicity of the precision matrix determinant calculation using a closed form expression involving the eigenvalues of a precision matrix submatrix. The introduction of covariates into the model adds little computational complexity to the analysis and thus the method can be straightforwardly extended to regression models. The model, because of its computational simplicity, is well suited to application involving the fully Bayesian analysis of large data sets involving multivariate measurements with a spatial ordering. An extension to spatio-temporal data is also considered. Here, we demonstrate use of the model in the analysis of bivariate binary data where the observed data is modelled as the sign of the hidden CAR process. A case study involving over 450 irregularly spaced sites and the presence or absence of each of two species of rain forest trees at each site is presented; Markov chain Monte Carlo (MCMC) methods are implemented to obtain posterior distributions of all unknowns. The MCMC method works well with simulated data and the tree biodiversity data set.
Similar content being viewed by others
References
Adler R.J. 1981. The Geometry of Random Fields. Wiley, Chichester.
Belyaev Yu.K. 1961. Continuity and Hölder continous conditions for sample functions of stationary Gaussian processes. In: Proc. Fourth Berkeley Symp. Math. Statist. Prob. University of California Press, Berkely, Vol. 2, pp. 23-33.
Besag J.E. 1974. Spatial interaction and the statistical analysis of lattice systems (with discussion). J. Roy. Statist. Soc. B 36: 192-236.
Besag J. and Higdon D. 1999. Bayesian analysis of agricultural field experiments (with discussion). J. Roy. Statist. Soc. B 61: 691-746.
Besag J.E. and Kooperberg C. 1995. On conditional and intrinsic autoregressions. Biometrika 82: 733-746.
Best N.G., Cowles M.K., and Vines S.K. 1995. CODA Manual version 0.30. MRC Biostatistics Unit, Cambridge, UK.
Chib S. and Greenberg E. 1998. Analysis of multivariate probit models. Biometrika 85: 347-361.
Cressie N.A.C. 1993. Statistics for Spatial Data, Rev. Edition. Wiley, New York.
Cressie N.A.C. and Huang H.-C. 1999. Classes of nonseparable, spatiotemporal stationary covarince functions. J. Am. Statist. Assoc. 94: 1330-1340.
Dempster A.P. 1972. Covariance selection. Biometrics 28: 157-175.
Diggle P.J., Tawn J.A., and Moyeed R.A. 1998. Model-based geostatistics (with discussion). Appl. Statist. 47: 299-350.
Ecker M.D. and Gelfand A.E. 1997. Bayesian variogram modelling for an isotropic spatial process. J. Agric. Biol. Environ. Statist. 2: 347-369.
Gilks W.R., Richardson S., and Spiegelhalter D.J. 1996. Markov Chain Monte Carlo in Practice. Chapman & Hall, London.
Graybill F.A. 1983. Matrices with Applications in Statistics, 2nd Edition. Wadsworth, California.
He Z. and Sun D. 2000. Hierarchical bayes estimation of hunting success rates with spatial correlations. Biometrics 56: 360-367.
Heidelberger P. and Welch P. 1983. Simulation run length control in the presence of an initial transient. Operations Research 7: 493-497.
Mardia K.V. 1988. Multi-dimensional multivariate Gaussian Markov random fields with application to image processing. J. Multivariate Analysis 24(2): 265-284.
McCormack B. 1995. Timber inventory manual for the native forests of Queensland. Technical Report, Queensland Dept. Primary Industries-Forest Service, Brisbane, Queensland.
Thompson J., Bean A., Dillewaard H., Sparshott K., Grimshaw P., Dowling R., Stephens K., Price R., and Stanley T. 1996. Methodolgy for vegetation survey and mapping for eastern Queensland. Technical Report, Queensland Dept. Environment and Heritage, Brisbane, Queensland.
Weir I.S. and Pettitt A.N. 1999. Spatial modelling for binary data using a hidden conditional autoregressive Gaussian process: A multivariate extension of the probit model. Statistics and Computing 9: 77-86.
Weir I.S. and Pettitt A.N. 2000. Binary probability maps using a hidden conditional autoregressive Gaussian process with an application to Finnish common toad data. Appl. Statist. 49: 473-484.
Wikle C.R. and Cressie N. 1999. A dimension-reduced approach to space-time Kalman filtering. Biometrika 86: 815-829.
Rights and permissions
About this article
Cite this article
Pettitt, A.N., Weir, I.S. & Hart, A.G. A Conditional Autoregressive Gaussian Process for Irregularly Spaced Multivariate Data with Application to Modelling Large Sets of Binary Data. Statistics and Computing 12, 353–367 (2002). https://doi.org/10.1023/A:1020792130229
Issue Date:
DOI: https://doi.org/10.1023/A:1020792130229