Abstract
Compositional data were defined traditionally as constrained data, like proportions or percentages, with a fixed constant sum constraint (1 or 100, respectively). Nevertheless, from a practical perspective it is much more intuitive to consider them as observations carrying relative information, where proportions stand just for one possible representation. Equivalently, all relevant information in compositional data is contained in ratios between components (parts). According to this broader definition, the decision whether the data at hand are compositional or not depends primarily on the purpose of the analysis, i.e. if the relative structure of the compositional parts is of interest or not. As a consequence, the use of standard statistical methods for the analysis of compositional data that obey specific geometrical properties leads inevitably to biased results. A reasonable way out is to set up an algebraic-geometrical structure that follows the principles of compositional data analysis (scale invariance, permutation invariance, and subcompositional coherence). Nowadays, this is called the Aitchison geometry and it enables to express compositional data in interpretable real coordinates, where standard statistical procedures can directly be applied. These coordinates are formed by logratios of pairs of compositional parts and their aggregations: the logratio methodology was born.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
J. Aitchison, The Statistical Analysis of Compositional Data (Chapman & Hall, London, 1986). Reprinted in 2003 with additional material by The Blackburn Press
D. Billheimer, P. Guttorp, W.F. Fagan, Statistical interpretation of species composition. J. Am. Stat. Assoc. 96(456), 1205–1214 (2001)
A. Buccianti, G. Mateu-Figueras, V. Pawlowsky-Glahn (eds.), Compositional Data Analysis: Theory and Applications (Wiley, Chichester, 2011)
F. Chayes, On correlation between variables of constant sum. J. Geophys. Res. 65(12), 4185–4193 (1960)
J.A. Cortés, On the Harker variation diagrams; a comment on “The statistical analysis of compositional data. Where are we and where should we be heading?” by Aitchison and Egozcue (2005). Math. Geosci. 41(7), 817–828 (2009)
M.L. Eaton, Multivariate Statistics. A Vector Space Approach (Wiley, New York, 1983)
J.J. Egozcue, Reply to “On the Harker variation diagrams; …” by J.A. Cortés. Math. Geosci. 41(7), 829–834 (2009)
V. Pawlowsky-Glahn, J.J. Egozcue, Geometric approach to statistical analysis on the simplex. Stoch. Env. Res. Risk A. 15(5), 384–398 (2001)
V. Pawlowsky-Glahn, J.J. Egozcue, R. Tolosana-Delgado, Modeling and Analysis of Compositional Data (Wiley, Chichester, 2015)
K. Pearson, Mathematical contributions to the theory of evolution. On a form of spurious correlation which may arise when indices are used in the measurement of organs. Proc. R. Soc. Lond. LX, 489–502 (1897)
R Development Core Team, R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, Vienna, 2017). http://www.R-project.org/, ISBN 3-900051-07-0
C. Reimann, P. Filzmoser, K. Fabian, K. Hron, M. Birke, A. Demetriades, E. Dinelli, A. Ladenberger, The GEMAS Project Team, The concept of compositional data analysis in practice-Total major element concentrations in agricultural and grazing land soils of Europe. Sci. Total Environ. 426, 196–210 (2012)
J.L. Scealy, A.H. Welsch, Colours and coctails: compositional data analysis 2013 Lancaster Lecture. Aust. N. Z. J. Stat. 56(2), 145–169 (2014)
K.G. van den Boogaart, R. Tolosana-Delgado, Analyzing Compositional Data with R (Springer, Heidelberg, 2013)
K. Varmuza, I. Steiner, H. Glinsner, H. Klein, Chemometric evaluation of concentration profiles from compounds relevant in beer ageing. Eur. Food Res. Technol. 215(3), 235–239 (2002)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Filzmoser, P., Hron, K., Templ, M. (2018). Compositional Data as a Methodological Concept. In: Applied Compositional Data Analysis. Springer Series in Statistics. Springer, Cham. https://doi.org/10.1007/978-3-319-96422-5_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-96422-5_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96420-1
Online ISBN: 978-3-319-96422-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)