A Dirichlet Regression Model for Compositional Data with Zeros
- 140 Downloads
Compositional data are met in many different fields, such as economics, archaeometry, ecology, geology and political sciences. Regression where the dependent variable is a composition is usually carried out via a log-ratio transformation of the composition or via the Dirichlet distribution. However, when there are zero values in the data these two ways are not readily applicable. Suggestions for this problem exist, but most of them rely on substituting the zero values. In this paper we adjust the Dirichlet distribution when covariates are present, in order to allow for zero values to be present in the data, without modifying any values. To do so, we modify the log-likelihood of the Dirichlet distribution to account for zero values. Examples and simulation studies exhibit the performance of the zero adjusted Dirichlet regression.
KeywordsCompositional data regression Dirichlet distribution zero values
Unable to display preview. Download preview PDF.
- 3.I. J. Bear and D. Billheimer, “A logistic normal mixture model allowing essential zeros,” in Proceedings of the 6th Compositional Data Analysis Workshop, Girona, Spain, 2015.Google Scholar
- 5.G. Campbell and J. E. Mosimann, “Multivariate analysis of size and shape: modelling with the Dirichlet distribution,” in ASA Proceedings of Section on Statistical Graphics (San Francisco, USA, 1987), pp. 93–101.Google Scholar
- 10.R. H. Hijazi, “An EM-algorithm based method to deal with rounded zeros in compositional data under Dirichlet models,” in Proceedings of the 1st Compositional Data Analysis Workshop, Girona, Spain, 2011.Google Scholar
- 14.J. M. Maier, DirichletReg: Dirichlet Regression in R (2014). http://dirichletreg.r-forge.r-project.org/.Google Scholar
- 27.M. Templ, K. Hron, and P. Filzmoser, robCompositions: Robust Estimation for Compositional Data, R PackageVersion 0.8-4.Google Scholar
- 28.H. Theil, Economics and Information Theory (North-Holland, Amsterdam, 1967).Google Scholar
- 29.T. W. Yee, VGAM: Vector Generalized Linear and Additive Models. R Package Version 0.8-4 (2011). http://CRAN. R-project.org/package=VGAM.Google Scholar
- 31.M. Tsagris and G. Athineou, Compositional: Compositional Data Analysis. R package version 2.8 (2017). https://CRAN.R-project.org/package=Compositional.Google Scholar