Abstract
We propose a covariance specification for modeling spatially continuous multivariate data. This model is based on a reformulation of Kronecker’s product of covariance matrices for Gaussian random fields. The structure holds for different choices of covariance functions with parameters varying in their usual domains. In comparison with classical models from the literature, we used the Matérn correlation function to specify the marginal covariances. We also assess the reparametrized generalized Wendland model as an option for efficient calculation of the Cholesky decomposition, improving the model’s ability to deal with large data sets. The reduced computational time and flexible generalization for increasing number of variables, make it an attractive alternative for modelling spatially continuous data. The proposed model is fitted to a soil chemistry properties dataset, and adequacy measures, forecast errors and estimation times are compared with the ones obtained based on classical models. In addition, the model is fitted to a North African temperature dataset to illustrate the model’s flexibility in dealing with large data. A simulation study is performed considering different parametric scenarios to evaluate the properties of the maximum likelihood estimators. The simple structure and reduced estimation time make the proposed model a candidate approach for multivariate analysis of spatial data.
Similar content being viewed by others
References
Alegría A, Porcu E, Furrer R (2018) Asymmetric matrix-valued covariances for multivariate random fields on spheres. J Stat Comput Simul 88(10):1850–1862. https://doi.org/10.1080/00949655.2017.1406488
Alegría A, Porcu E, Furrer R et al (2019) Covariance functions for multivariate gaussian fields evolving temporally over planet earth. Stoch Env Res Risk Assess 33(8):1593–1608. https://doi.org/10.1007/s00477-019-01707-w
Banerjee S, Roy A (2014) Linear algebra and matrix analysis for statistics. CRC Press, Boca Raton. https://doi.org/10.1201/b17040
Banerjee S, Carlin BP, Gelfand AE (2003) Hierarchical modeling and analysis for spatial data. Chapman and Hall/CRC, Boca Raton. https://doi.org/10.1201/9780203487808
Bates D, Maechler M (2021) Matrix: Sparse and Dense Matrix Classes and Methods. https://CRAN.R-project.org/package=Matrix, R package version 1.3-4
Bevilacqua M, Morales-Oñate V (2018) GeoModels: A Package for Geostatistical Gaussian and non Gaussian Data Analysis. https://vmoprojs.github.io/GeoModels-page/, R package version 1.0.3-4
Bevilacqua M, Vallejos R, Velandia D (2015) Assessing the significance of the correlation between the components of a bivariate gaussian random field. Environmetrics 26(8):545–556. https://doi.org/10.1002/env.2367
Bevilacqua M, Alegría A, Velandia D et al (2016) Composite likelihood inference for multivariate gaussian random fields. J Agric Biol Environ Stat 21(3):448–469. https://doi.org/10.1007/s13253-016-0256-3
Bevilacqua M, Fassò A, Gaetan C et al (2016) Covariance tapering for multivariate gaussian random fields estimation. Stat Methods & Appl 25(1):21–37. https://doi.org/10.1007/s10260-015-0338-3
Bevilacqua M, Faouzi T, Furrer R et al (2019) Estimation and prediction using generalized wendland covariance functions under fixed domain asymptotics. Ann Stat 47(2):828–856
Bevilacqua M, Diggle P, Porcu E (2020) Families of covariance functions for bivariate random fields on spheres. Spatial Stat 40(100448):1–29. https://doi.org/10.1016/j.spasta.2020.100448
Bevilacqua M, Caamaño-Carrillo C, Porcu E (2022) Unifying compactly supported and matern covariance functions in spatial statistics. J Multivariate Anal 104949
Bivand RS, Pebesma EJ, Gómez-Rubio V et al (2008) Applied spatial data analysis with R. Springer, New York. https://doi.org/10.1007/978-0-387-78171-6
Bonat WH, Jørgensen B (2016) Multivariate covariance generalized linear models. J Roy Stat Soc: Ser C (Appl Stat) 65(5):649–675. https://doi.org/10.1111/rssc.12145
Bonat WH, Petterle RR, Balbinot P et al (2020) Modelling multiple outcomes in repeated measures studies: comparing aesthetic eyelid surgery techniques. Stat Model 21:564–582. https://doi.org/10.1177/1471082X20943312
Bourgault G, Marcotte D (1991) Multivariable variogram and its application to the linear model of coregionalization. Math Geol 23(7):899–928. https://doi.org/10.1007/BF02066732
Cecconi L, Grisotto L, Catelan D et al (2016) Preferential sampling and bayesian geostatistics: Statistical modeling and examples. Stat Methods Med Res 25(4):1224–1243. https://doi.org/10.1177/0962280216660409
Chilès JP, Delfiner P (2012) Geostatistics: Modeling Spatial Uncertainty. Wiley, New York. https://doi.org/10.1007/s11004-012-9429-y
Cressie N (1993) Statistics for spatial data. Wiley, New York. https://doi.org/10.1002/9781119115151
Diggle P, Ribeiro PJ Jr (2007) Model-based Geostatistics. Springer, New York. https://doi.org/10.1007/978-0-387-48536-2
Emery X, Porcu E (2019) Simulating isotropic vector-valued gaussian random fields on the sphere through finite harmonics approximations. Stoch Env Res Risk Assess 33(8):1659–1667. https://doi.org/10.1007/s00477-019-01717-8
Emery X, Porcu E, Bissiri PG (2019) A semiparametric class of axially symmetric random fields on the sphere. Stoch Env Res Risk Assess 33(10):1863–1874. https://doi.org/10.1007/s00477-019-01725-8
Fasiolo M (2016) An introduction to mvnfast. https://CRAN.R-project.org/package=mvnfast, R package version 0.1.6
Fick SE, Hijmans RJ (2017) Worldclim 2: new 1-km spatial resolution climate surfaces for global land areas. Int J Climatol 37(12):4302–4315
Finley AO, Banerjee S, Gelfand AE (2015) spBayes for large univariate and multivariate point-referenced spatio-temporal data models. J Stat Softw, Art 63(13):1–28. https://doi.org/10.18637/jss.v063.i13
Furrer R, Sain SR (2010) spam: a sparse matrix r package with emphasis on mcmc methods for gaussian markov random fields. J Stat Softw 36(10):1–25
Gelfand AE, Schmidt AM, Banerjee S et al (2004) Nonstationary multivariate process modeling through spatially varying coregionalization. TEST 13(2):263–312. https://doi.org/10.1007/BF02595775
Gentle JE (2017) Matrix algebra: theory, computations, and applications in statistics. Springer, New York. https://doi.org/10.1007/978-3-319-64867-5
Genton MG, Kleiber W (2015) Cross-covariance functions for multivariate geostatistics. Stat Sci 30(2):147–163. https://doi.org/10.1214/14-STS487
Gneiting T (1999) Correlation functions for atmospheric data analysis. Q J R Meteorol Soc 125(559):2449–2464. https://doi.org/10.1002/qj.49712555906
Gneiting T, Kleiber W, Schlather M (2010) Matérn cross-covariance functions for multivariate random fields. J Am Stat Assoc 105(491):1167–1177. https://doi.org/10.1198/jasa.2010.tm09420
Goovaerts P et al (1997) Geostatistics for natural resources evaluation. Oxford University Press, New York
Goulard M, Voltz M (1992) Linear coregionalization model: tools for estimation and choice of cross-variogram matrix. Math Geol 24(3):269–286. https://doi.org/10.1007/BF00893750
Guttorp P, Gneiting T (2006) Studies in the history of probability and statistics XLIX on the matérn correlation family. Biometrika 93(4):989–995
Hardy Y, Steeb WH (2019) Matrix Calculus, Kronecker Product and Tensor Product: A Practical Approach to Linear Algebra, Multilinear Algebra and Tensor Calculus with Software Implementations. World Scientific, Singapore, https://doi.org/10.1142/11338
MacNab YC (2016) Linear models of coregionalization for multivariate lattice data: Order-dependent and order-free cMCARs. Stat Methods Med Res 25(4):1118–1144. https://doi.org/10.1177/0962280216660419
MacNab YC (2018) Some recent work on multivariate gaussian markov random fields. TEST 27(3):497–541. https://doi.org/10.1007/s11749-018-0605-3
Martinez-Beneito MA (2013) A general modelling framework for multivariate disease mapping. Biometrika 100(3):539–553. https://doi.org/10.1093/biomet/ast023
Martinez-Beneito MA (2020) Some links between conditional and coregionalized multivariate gaussian markov random fields. Spatial Stat 40(100383):1–17. https://doi.org/10.1016/j.spasta.2019.100383
Martins ABT, Bonat WH, Ribeiro PJ Jr (2016) Likelihood analysis for a class of spatial geostatistical compositional models. Spatial Stat 17:121–130. https://doi.org/10.1016/j.spasta.2016.06.008
Matérn B (1986) Spatial variation. Springer, Berlin. https://doi.org/10.1002/bimj.4710300514
Pebesma E, Bivand RS (2005) Classes and methods for spatial data: the sp package. R News 5(2):9–13
Pebesma EJ (2004) Multivariable geostatistics in S: the gstat package. Comp & Geosci 30:683–691. https://doi.org/10.1016/j.cageo.2004.03.012
Petersen KB, Pedersen MS, et al. (2008) The matrix cookbook. Technical University of Denmark
Petterle RR, Bonat WH, Scarpin CT (2019) Quasi-beta longitudinal regression model applied to water quality index data. J Agric Biol Environ Stat 24(2):346–368. https://doi.org/10.1007/s13253-019-00360-8
Porcu E, Daley DJ, Buhmann M et al (2013) Radial basis functions with compact support for multivariate geostatistics. Stoch Env Res Risk Assess 27(4):909–922. https://doi.org/10.1007/s00477-012-0656-z
Qadir GA, Euán C, Sun Y (2021) Flexible modeling of variable asymmetries in cross-covariance functions for multivariate random fields. J Agric Biol Environ Stat 26(1):1–22. https://doi.org/10.1007/s13253-020-00414-2
R Core Team (2021) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/
Ribeiro AMT, Ribeiro PJ Jr, Bonat WH (2021) Comparison of exponential covariance functions for bivariate geostatistical data. Revista Brasileira de Biometria 39(1):89–102. https://doi.org/10.28951/rbb.v39i1.558
Ribeiro Jr PJ, Diggle PJ, Schlather M, et al. (2020) geoR: Analysis of Geostatistical Data. https://CRAN.R-project.org/package=geoR, R package version 1.8-1
Rue H, Held L (2005) Gaussian Markov random fields: theory and applications. CRC Press, Boca Raton. https://doi.org/10.1201/9780203492024
Salvaña MLO, Genton MG (2020) Nonstationary cross-covariance functions for multivariate spatio-temporal random fields. Spatial Stat 37(100411):1–24. https://doi.org/10.1016/j.spasta.2020.100411
Särkkä S (2013) Bayesian filtering and smoothing. Cambridge University Press, Cambridge. https://doi.org/10.1017/CBO9781139344203
Schmidt AM, Gelfand AE (2003) A bayesian coregionalization approach for multivariate pollutant data. J Geophys Res 108(D24):1–9. https://doi.org/10.1029/2002JD002905
Teichmann J, Menzel P, Heinig T et al (2021) Modeling and fitting of three-dimensional mineral microstructures by multinary random fields. Math Geosci 53(5):877–904. https://doi.org/10.1007/s11004-020-09871-4
Vallejos R, Osorio F, Bevilacqua M (2020) Spatial relationships between two georeferenced variables: With applications in R. Springer, New York. https://doi.org/10.1007/978-3-030-56681-4
Ver Hoef JM, Cressie N (1993) Multivariable spatial prediction. Math Geol 25(2):219–240. https://doi.org/10.1007/BF00893273
Wackernagel H (2003) Multivariate geostatistics: an introduction with applications. Springer, Berlin. https://doi.org/10.1007/978-3-662-05294-5
Wand M (2002) Vector differential calculus in statistics. Am Stat 56(1):55–62. https://doi.org/10.1198/000313002753631376
Zastavnyi VP, Trigub RM (2002) Positive-definite splines of special form. Sbornik: Math 193(12):1771
Funding
The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.
Author information
Authors and Affiliations
Contributions
All authors contributed equally to the design and preparation of the material, with comments and suggestions for improvements. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix: Derivatives of the covariance matrix
Based on matrix properties (Wand 2002; Bonat et al. 2020), the score functions with respect to the \(\pmb {\beta }\) and \(\pmb {\lambda }\) parameters, are given, respectively, by:
with \({\mathbf {r}}(\pmb {\beta }) = ({\mathbf {y}}-\pmb {\mu }(\pmb {\beta }))\), \({\mathbf {D}}=\frac{\partial \pmb {\mu }(\pmb {\beta })}{\partial \pmb {\beta }} = \text {Bdiag}({\mathbf {X}}_1, \dots , {\mathbf {X}}_p)\) and the inverse calculation was described earlier.
We achieve the maximum likelihood estimator of \(\pmb {\beta }\) by solving \({\mathcal {U}}_{\pmb {\beta }}\), which results in:
Making similar calculations, we find the Fisher information matrix which, for \(\pmb {\beta }\), is given by:
For \(\pmb {\lambda }\), the \((i,j)^{th}\) entry of the Fisher information matrix, is given by:
where \({\mathbf {W}}_{\lambda _i}=-\partial \varvec{\Sigma }(\pmb {\lambda })^{-1}/\partial \lambda _i\).
Considering \(\hat{\pmb {\theta }} = (\hat{\pmb {\beta }}^\top , \hat{\pmb {\lambda }}^\top )^\top\) the maximum likelihood estimator of \(\pmb {\theta }\) parameter, the asymptotic distribution of \(\hat{\pmb {\theta }}\) is \(\hat{\pmb {\theta }} \sim N(\pmb {\theta }, {\mathcal {F}}_\theta ^{-1})\), where \({\mathcal {F}}_{\pmb {\theta }} = \left( \begin{array}{cc} {\mathcal {F}}_{\pmb {\beta }} &{} \pmb {0} \\ \pmb {0} &{} {\mathcal {F}}_{\pmb {\lambda }} \end{array}\right)\) denotes the Fisher information matrix of \(\pmb {\theta }\). This result is compatible with the increasing domain regime (Cressie 1993).
The maximum likelihood estimates of \(\pmb {\lambda }\) can be found through Newton’s scoring iterative algorithm (Bonat et al. 2020):
where \(\tilde{\pmb {\theta }}=(\hat{\pmb {\beta }}^\top , \pmb {\lambda }^\top )^\top\) and \(\alpha\) controls the step length.
Now, let \(\rho _r\), for \(r = 1,\dots , p(p-1)/2\), denoting the correlation parameters of \(\varvec{\Sigma }_{b}\), \(\sigma _{i}^2\), \(\phi _i\) and \(\nu _i\), denoting the variance, scale and smoothness parameters of the marginal-covariance matrix, \(\Sigma _{ii}\), for \(i=1,\dots ,p\).
The partial derivative of the matrix-valued covariance function \(\varvec{\Sigma }\), with respect to each correlation parameter, \(\rho _r\), is given by:
To obtain the partial derivative with respect to variance parameter, \(\sigma _{i}^2\), we will use matrix properties, that is,
An analogous procedure to the Eq. (12) can be used to obtain the derivatives with respect to \(\phi _i\) and \(\nu _i\). Thus, to obtain the derivatives of \(\varvec{\Sigma }\) with respect to each parameter, we must calculate the parcial derivatives in (12). Using the result of partial derivatives of Cholesky’s factorization (Särkkä 2013; Bonat and Jørgensen 2016), follows:
where \(\Phi (.)\) is the strictly lower triangular part of the argument and half of its diagonal.
Appendix: Computational results
Rights and permissions
About this article
Cite this article
Ribeiro, A.M.T., Ribeiro Junior, P.J. & Bonat, W.H. A Kronecker-based covariance specification for spatially continuous multivariate data. Stoch Environ Res Risk Assess 36, 4087–4102 (2022). https://doi.org/10.1007/s00477-022-02252-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-022-02252-9