
A Conditional Autoregressive Gaussian Process for Irregularly Spaced Multivariate Data with Application to Modelling Large Sets of Binary Data

Statistics and Computing

Abstract

A Gaussian conditional autoregressive (CAR) formulation is presented that permits modelling of both the spatial dependence and the dependence between multivariate random variables at irregularly spaced sites, thereby capturing some of the modelling advantages of the geostatistical approach. The model benefits not only from the explicit availability of the full conditionals but also from the computational simplicity of the precision matrix determinant, which has a closed-form expression involving the eigenvalues of a submatrix of the precision matrix. Introducing covariates adds little computational complexity to the analysis, so the method extends straightforwardly to regression models. Because of its computational simplicity, the model is well suited to applications involving the fully Bayesian analysis of large data sets of multivariate measurements with a spatial ordering. An extension to spatio-temporal data is also considered. Here, we demonstrate the use of the model in the analysis of bivariate binary data, where the observed data are modelled as the sign of the hidden CAR process. A case study is presented involving over 450 irregularly spaced sites and the presence or absence of each of two species of rain forest trees at each site; Markov chain Monte Carlo (MCMC) methods are implemented to obtain posterior distributions of all unknowns. The MCMC method works well with simulated data and the tree biodiversity data set.
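
To make the two computational ideas in the abstract concrete, the following is a minimal sketch rather than the paper's own specification. It assumes a generic univariate proper CAR precision matrix Q = tau * (D - rho * W), where W is a hypothetical distance-threshold neighbour matrix for irregularly spaced sites and D holds the neighbour counts. For that assumed form, log det Q = n log(tau) + sum_i log(d_i) + sum_i log(1 - rho * lambda_i), with lambda_i the eigenvalues of D^{-1/2} W D^{-1/2}, so the eigenvalues are computed once and reused for every value of rho and tau. Binary observations are then generated as the sign of the hidden Gaussian field. The function names, the `radius` rule, and the parameters `rho` and `tau` are illustrative assumptions; the paper's bivariate model and its exact closed form may differ.

```python
import numpy as np


def car_precision(coords, radius, rho, tau=1.0):
    """Assumed proper CAR precision Q = tau * (D - rho * W), with |rho| < 1.

    W[i, j] = 1 when sites i and j lie within `radius` of each other
    (an illustrative neighbour rule for irregularly spaced sites).
    """
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    W = (dist <= radius).astype(float)
    np.fill_diagonal(W, 0.0)  # a site is not its own neighbour
    d = W.sum(axis=1)         # number of neighbours per site
    if np.any(d == 0):
        raise ValueError("every site needs at least one neighbour; increase radius")
    Q = tau * (np.diag(d) - rho * W)
    return Q, W, d


def car_logdet(W, d, rho, tau=1.0):
    """log det Q from the eigenvalues of the scaled neighbour matrix."""
    s = 1.0 / np.sqrt(d)
    lam = np.linalg.eigvalsh(W * s[:, None] * s[None, :])  # D^{-1/2} W D^{-1/2}
    return len(d) * np.log(tau) + np.log(d).sum() + np.log1p(-rho * lam).sum()


def simulate_binary(Q, rng):
    """Draw x ~ N(0, Q^{-1}) and return it with y = 1 where x > 0, else 0."""
    L = np.linalg.cholesky(Q)          # Q = L L^T
    z = rng.standard_normal(Q.shape[0])
    x = np.linalg.solve(L.T, z)        # Cov(x) = (L L^T)^{-1} = Q^{-1}
    return x, (x > 0).astype(int)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    coords = rng.uniform(size=(200, 2))  # irregularly spaced sites in the unit square
    Q, W, d = car_precision(coords, radius=0.25, rho=0.95)
    print("log det via eigenvalues:", car_logdet(W, d, rho=0.95))
    print("log det via slogdet:    ", np.linalg.slogdet(Q)[1])
    x, y = simulate_binary(Q, rng)
    print("proportion of presences:", y.mean())
```

In an MCMC scheme the eigenvalues `lam` would be computed once outside the sampler, so each update of `rho` costs only a sum over n terms instead of a fresh determinant.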




Cite this article

Pettitt, A.N., Weir, I.S. & Hart, A.G. A Conditional Autoregressive Gaussian Process for Irregularly Spaced Multivariate Data with Application to Modelling Large Sets of Binary Data. Statistics and Computing 12, 353–367 (2002). https://doi.org/10.1023/A:1020792130229
