Compositional Data as a Methodological Concept

Applied Compositional Data Analysis

Part of the book series: Springer Series in Statistics (SSS)

Abstract

Compositional data were traditionally defined as constrained data, such as proportions or percentages, subject to a constant-sum constraint (1 or 100, respectively). From a practical perspective, however, it is more intuitive to regard them as observations carrying relative information, with proportions being just one possible representation. Equivalently, all relevant information in compositional data is contained in the ratios between components (parts). Under this broader definition, whether the data at hand are compositional depends primarily on the purpose of the analysis, that is, on whether the relative structure of the compositional parts is of interest. Because compositional data obey specific geometrical properties, applying standard statistical methods directly to them inevitably leads to biased results. A reasonable way out is to set up an algebraic-geometrical structure that respects the principles of compositional data analysis (scale invariance, permutation invariance, and subcompositional coherence). This structure is now called the Aitchison geometry; it makes it possible to express compositional data in interpretable real coordinates, to which standard statistical procedures can be applied directly. These coordinates are formed by logratios of pairs of compositional parts and of their aggregations, and so the logratio methodology was born.
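
The claim that all relevant information sits in the ratios between parts can be checked numerically. The following is a minimal sketch in Python, not code from the chapter: the function names (`closure`, `pairwise_logratios`, `clr`) and the three-part measurement vector are assumptions chosen for illustration. It shows that pairwise logratios do not depend on whether the data are given as raw amounts, proportions, or percentages (scale invariance), and that a logratio of two parts is unchanged when only a subcomposition is observed. The centred logratio is included as one commonly used example of a logratio against an aggregation of parts (their geometric mean).

```python
import math


def closure(x, kappa=1.0):
    """Rescale positive parts so that they sum to kappa (1 or 100)."""
    total = sum(x)
    return [kappa * xi / total for xi in x]


def pairwise_logratios(x):
    """Return ln(x_i / x_j) for every pair of parts with i < j."""
    d = len(x)
    return {
        (i, j): math.log(x[i] / x[j])
        for i in range(d)
        for j in range(i + 1, d)
    }


def clr(x):
    """Centred logratio: log of each part over the geometric mean of all parts."""
    log_x = [math.log(xi) for xi in x]
    mean_log = sum(log_x) / len(log_x)
    return [lx - mean_log for lx in log_x]


# Hypothetical three-part observation in arbitrary units (illustration only).
raw = [12.0, 30.0, 18.0]
proportions = closure(raw)         # sums to 1
percentages = closure(raw, 100.0)  # sums to 100

# Scale invariance: the logratios agree for all three representations.
lr_raw = pairwise_logratios(raw)
lr_prop = pairwise_logratios(proportions)
lr_pct = pairwise_logratios(percentages)

# Subcompositional coherence: dropping the third part and re-closing
# leaves the logratio between the first two parts unchanged.
sub = closure(raw[:2])
lr_sub = math.log(sub[0] / sub[1])

print(lr_raw[(0, 1)], lr_prop[(0, 1)], lr_pct[(0, 1)], lr_sub)
print(clr(raw), clr(percentages))
```

The raw proportions themselves do change under re-closure of a subcomposition, which is why the sketch compares logratios rather than the parts.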

Copyright information

© 2018 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Filzmoser, P., Hron, K., Templ, M. (2018). Compositional Data as a Methodological Concept. In: Applied Compositional Data Analysis. Springer Series in Statistics. Springer, Cham. https://doi.org/10.1007/978-3-319-96422-5_1
