Abstract
This chapter provides the concepts from compositional data analysis required to prepare compositional data for geostatistical treatment. Specifically we define the term closure, its rationale and caveats, and the various ways of escaping from its curse, i.e. the various forms of log-ratio transformation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
There is an incompatible function called select in package “MASS”. To ensure using the right function in such cases, the syntax package::command is useful.
- 2.
Note the use of the pipe %>% from package “magrittr”, where X %>% fun is equivalent to fun(X). Piping is a comfortable way of nesting functions in R, easy to write and to read: from the data set australia, select some variables, do absolute values, compute the row sums and return their summaries.
- 3.
Gangue: part of a mineral deposit considered as not valuable.
References
Aitchison, J. (1986). The statistical analysis of compositional data (416 pp.). Monographs on Statistics and Applied Probability. London, UK: Chapman & Hall Ltd. (Reprinted in 2003 with additional material by The Blackburn Press).
Aitchison, J. (1997). The one-hour course in compositional data analysis or compositional data analysis is simple. In V. Pawlowsky-Glahn (Ed.), Proceedings of IAMG’97 - The III Annual Conference of the Int. Association for Mathematical Geology, Volume I, II and addendum, Barcelona (E) (pp. 3–35, 1100 pp.). International Center for Numerical Methods in Engineering (CIMNE), Barcelona (E).
Barceló-Vidal, C. (2000). Fundamentación matemática del análisis de datos composicionales (77 pp.). Number IMA 00-02-RR.
Barceló-Vidal, C., Martín-Fernández, J. A., & Pawlowsky-Glahn, V. (2001). Mathematical foundations of compositional data analysis. In G. Ross (Ed.), Proceedings of IAMG’01 – The VII Annual Conference of the Int. Association for Mathematical Geology, Cancun (Mex), 20 p.
Billheimer, D., Guttorp, P., & Fagan, W. (2001). Statistical interpretation of species composition. Journal of the American Statistical Association, 96(456), 1205–1214.
Boogaart, K. G. v. d., & Tolosana-Delgado, R. (2008). “compositions”: a unified R package to analyze compositional data. Computers and Geosciences, 34(4), 320–338.
Boogaart, K. G. v. d., & Tolosana-Delgado, R. (2013). Analysing compositional data with R (280 pp.). Heidelberg: Springer.
Boogaart, K. G. v. d., Tolosana-Delgado, R., & Bren, M. (2011). The compositional meaning of a detection limit. In J. J. Egozcue, R. Tolosana-Delgado, M. I. Ortego (Eds.), Proceedings of the 4th International Workshop on Compositional Data Analysis (2011). CIMNE, Barcelona, Spain. ISBN:978-84-87867-76-7.
Buccianti, A., Mateu-Figueras, G., & Pawlowsky-Glahn, V. E. (2006). Compositional data analysis in the geosciences: From theory to practice. Special publications (Vol. 264, 212 pp.). London: Geological Society.
Butler, J. C. (1978). Visual bias in R-mode dendrograms due to the effect of closure. Mathematical Geology, 10(2), 243–252.
Butler, J. C. (1979). The effects of closure on the moments of a distribution. Mathematical Geology, 11(1), 75–84.
Chayes, F. (1960). On correlation between variables of constant sum. Journal of Geophysical Research, 65(12), 4185–4193.
Chayes, F., & Trochimczyk, J. (1978). An effect of closure on the structure of principal components. Mathematical Geology, 10(4), 323–333.
Egozcue, J. J., Pawlowsky-Glahn, V., Mateu-Figueras, G., & Barceló-Vidal, C. (2003). Isometric logratio transformations for compositional data analysis. Mathematical Geology, 35(3), 279–300.
Martín-Fernández, J. A., Palarea-Albadalejo, J., & Gómez-García, J. (2003). Markov chain Montecarlo method applied to rounding zeros of compositional data: first approach. In S. Thió-Henestrosa, J. A. Martín-Fernández (Eds.), Proceedings of CoDaWork’03, The 1st Compositional Data Analysis Workshop, Girona (E). Universitat de Girona. ISBN:84-8458-111-X. http://ima.udg.edu/Activitats/CoDaWork2003/.
Palarea-Albaladejo, J., & Martín-Fernández, J. A. (2008). A modified EM alr-algorithm for replacing rounded zeros in compositional data sets. Computers and Geosciences, 34(8), 2233–2251.
Pawlowsky, V. (1984). On spurious spatial covariance between variables of constant sum. Science de la Terre, Sér. Informatique, 21, 107–113.
Pawlowsky, V. (1989). Cokriging of regionalised compositions. Mathematical Geology, 21(5), 513–521.
Pawlowsky-Glahn, V., & Egozcue, J. J. (2001). Geometric approach to statistical analysis on the simplex. Stochastic Environmental Research and Risk Assessment (SERRA), 15(5), 384–398.
Pawlowsky-Glahn, V., Egozcue, J. J., & Lovell, D. (2015). Tools for compositional data with a total. Statistical Modelling, 15(2), 175–190.
Pawlowsky-Glahn, V., Egozcue, J. J., & Tolosana-Delgado, R. (2015). Modeling and analysis of compositional data (272 pp.). Chichester, UK: John Wiley & Sons.
Pearson, K. (1897). Mathematical contributions to the theory of evolution. On a form of spurious correlation which may arise when indices are used in the measurement of organs. Proceedings of the Royal Society of London, LX, 489–502.
Shurtz, R. F. (2003). Compositional geometry and mass conservation. Mathematical Geology, 35(8), 927–937.
Tjelmeland, H., & Lund, K. V. (2003). Bayesian modelling of spatial compositional data. Journal of Applied Statistics, 30(1), 87–100.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Tolosana-Delgado, R., Mueller, U. (2021). A Review of Compositional Data Analysis. In: Geostatistics for Compositional Data with R. Use R!. Springer, Cham. https://doi.org/10.1007/978-3-030-82568-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-82568-3_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-82567-6
Online ISBN: 978-3-030-82568-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)