Composite likelihood approach to the regression analysis of spatial multivariate ordinal data and spatial compositional data with exact zero values
In many environmental and ecological studies, it is of interest to model compositional data. One approach is to consider positive random vectors that are subject to a unit-sum constraint. In landscape ecological studies, it is common that compositional data are also sampled in space with some elements of the composition absent at certain sampling sites. In this paper, we first propose a practical spatial multivariate ordered probit model for multivariate ordinal data, where the response variables can be viewed as the discretized non-negative compositions without the unit-sum constraint. We then propose a novel two-stage spatial mixture Dirichlet regression model. The first stage models the spatial dependence and the presence of exact zero values, and the second stage models all the non-zero compositional data. A maximum composite likelihood approach is developed for parameter estimation and inference in both the spatial multivariate ordered probit model and the two-stage spatial mixture Dirichlet regression model. The standard errors of the parameter estimates are computed by an estimate of the Godambe information matrix. A simulation study is conducted to evaluate the performance of the proposed models and methods. A land cover data example in landscape ecology further illustrates that accounting for spatial dependence can improve the accuracy in the prediction of presence/absence of different land covers as well as the magnitude of land cover compositions.
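For orientation, the generic form of a pairwise composite log-likelihood and of the Godambe (sandwich) information can be written as below. This is a standard textbook sketch, not the paper's exact specification: the abstract does not say which composite likelihood is used, so the pair set $\mathcal{P}$, the weights $w_{ij}$, and the bivariate density $f$ are assumed notation.

```latex
% Generic pairwise composite log-likelihood (assumed notation):
%   \mathcal{P}: set of site pairs included, w_{ij}: non-negative weights
\ell_c(\theta) = \sum_{(i,j) \in \mathcal{P}} w_{ij} \, \log f(y_i, y_j; \theta)

% Sensitivity and variability matrices
H(\theta) = \mathrm{E}\!\left[ -\nabla^2_\theta \, \ell_c(\theta) \right],
\qquad
J(\theta) = \mathrm{Var}\!\left[ \nabla_\theta \, \ell_c(\theta) \right]

% Godambe information and the resulting approximate covariance of the estimator
G(\theta) = H(\theta) \, J(\theta)^{-1} \, H(\theta),
\qquad
\widehat{\mathrm{Var}}(\hat\theta) \approx G(\hat\theta)^{-1}
  = H(\hat\theta)^{-1} J(\hat\theta) H(\hat\theta)^{-1}
```

Because a composite likelihood is not a full likelihood, $H$ and $J$ generally differ, which is why standard errors come from $G^{-1}$ rather than from $H^{-1}$ alone.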
Keywords: Dirichlet regression model; Gaussian latent variable; Godambe information; Mixture model; Multivariate ordered probit model; Spatial prediction
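The two-stage idea in the abstract (first model which components are exactly zero, then model the non-zero parts) can be illustrated with a deliberately simplified, non-spatial toy. The function names, the independent Bernoulli presence probabilities, and the fixed Dirichlet parameters are all assumptions made for illustration. The paper's first stage additionally models spatial dependence, and its parameters depend on covariates, neither of which is attempted here.

```python
import math


def dirichlet_logpdf(y, alpha):
    """Log density of Dirichlet(alpha) at a composition y (all y > 0, sum(y) == 1)."""
    log_norm = math.lgamma(sum(alpha)) - sum(math.lgamma(a) for a in alpha)
    return log_norm + sum((a - 1.0) * math.log(v) for a, v in zip(alpha, y))


def two_stage_loglik(y, presence_prob, alpha):
    """Toy two-stage log-likelihood for one site (hypothetical helper).

    Stage 1: independent Bernoulli presence/absence for each component.
             (Simplification: no spatial dependence, and the impossible
             all-absent pattern is not excluded.)
    Stage 2: Dirichlet density on the sub-composition of present components.
    """
    present = [v > 0 for v in y]

    # Stage 1: probability of the observed pattern of zeros and non-zeros.
    ll = sum(math.log(p if z else 1.0 - p) for z, p in zip(present, presence_prob))

    # Stage 2: the non-zero parts already sum to one because the zeros add nothing.
    idx = [k for k, z in enumerate(present) if z]
    if len(idx) >= 2:  # with a single present component the composition is degenerate
        ll += dirichlet_logpdf([y[k] for k in idx], [alpha[k] for k in idx])
    return ll


# Usage: a three-part land cover composition with the third cover absent.
value = two_stage_loglik([0.5, 0.5, 0.0], [0.9, 0.9, 0.5], [2.0, 2.0, 2.0])
```

In a regression setting, `presence_prob` and `alpha` would be functions of site covariates, and summing such terms over sites would only be valid under independence, which is the assumption the spatial model in the paper relaxes.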
Funding has been provided for this research from a USDA Cooperative State Research, Education and Extension Service (CSREES) McIntire-Stennis project and the National Science Foundation PalEON MacroSystems Biology under grant no. DEB1241868. The authors thank Dr. Mark D.O. Adams for database development assistance. We also thank the co-editor, an associate editor, and three anonymous referees for constructive comments that improved the content and presentation of this paper.