Skip to main content
Log in

Dealing with Compositional Data: The Freeware CoDaPack

  • Published:
Mathematical Geology Aims and scope Submit manuscript

Abstract

The statistical analysis of compositional data based on logratios of parts is commonly used in geological studies. In particular, descriptive methods of reduction of dimensionality as the biplot or Principal Component Analysis are very useful tools in a multivariate context. These methods are difficult to use correctly with compositional data in standard packages. In this paper we describe a freeware package, named CoDaPack, which implements most of the basic statistical methods suitable for compositional data. An example using real data is presented to illustrate the use of the package.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Aitchison, J., 1997, The one-hour course in compositional data analysis is easy, in Pawlowsky-Glahn, ed., Proceedings of IAMG'97, Vol. 1: The Third Annual Conference of the International Association for Mathematical Geology, CIMNE Barcelona, Spain, p. 3–35.

  • Aitchison, J., 2001, Simplicial inference, in Viana, M. A. G., and Richards, D. S. P., eds., Contemporary mathematics series: Algebraic methods in statistics and probability, v. 287: American Mathematical Society, Providence, RI, p. 1–22.

  • Aitchison, J., 2003, The statistical analysis of compositional data: (Reprint) Blackburn, Caldwell, NJ, 416 p.

  • Aitchison, J., Barceló-Vidal, C., Egozcue, J. J., and Pawlowsky-Glahn, V., 2002, A concise guide for the algebraic–geometric structure of the simplex, the sample space for compositional data analysis, in Burger, H., and Skala, W., eds., Proceedings of IAMG'02: The 8th Annual Meeting of the International Association for Mathematical Geology, Terra Nostro, no. 3, p. 387–392.

  • Aitchison, J., and Greenacre, M., 2002, Biplots of compositional data: Appl. Stat., v. 51, p. 375–392.

    Google Scholar 

  • Aitchison, J., and Thomas, C. W., 1998, Differential perturbation processes: A tool for the study of compositional processes, in Buccianti A., Nardi, G., and Potenza, R., eds., Proceedings of IAMG'98: The Fourth Annual Conference of the International Association for Mathematical Geology, De Frede, Napoli, Italy, p. 499–504.

  • Barceló-Vidal, C., Martín-Fernández, J. A., and Pawlowsky-Glahn, V., 2001, Mathematical foundations for compositional data analysis, in Ross, G., ed., Proceedings of IAMG'01: The Annual Conference of the International Association for Mathematical Geology, Cancún, Mexico, 20 p. (CD, electronic publication).

  • Cox, N. J., 2001, Triangular plots, in United Kingdom Stata Users' Group Meetings 2001, no. 7. Available at http://ideas.repec.org/p/boc/usug01/7.html. (on April 26, 2004).

  • Furrer, M., 1997, Trixcel pour créer des diagrammes ternaires dans excel. Available at http://www-sst.unil.ch/perso_pages/jfurrer/Applications/Trixcel/Trixcel.htm. (on April 26, 2004).

  • John, C. M., 2004, New Freeware makes plotting and analyzing data trends in ternary diagrams easy. Available at http://www.agu.org/eos_elec/000562e.shtml (on April 26, 2004).

  • Marshall, D., 1996, Ternplot: An Excel Spreadsheet for ternary diagrams: Comput. Geosci., v. 22, no. 6, p. 697–699. Available at http://www.sfu.ca/∼marshall/ternplot.htm. (on April 26, 2004).

  • Martín-Fernández, J. A., Barceló-Vidal, C., and Pawlowsky-Glahn, V., 2003, Dealing with zeros and missing values in compositional data sets: Math. Geol., v. 35, no. 3, p. 253–278.

    Google Scholar 

  • Martín-Fernández, J. A., Bren, M., Barceló-Vidal, C., and Pawlowsky-Glahn, V., 1999, A measure of difference for compositional data based on measures of divergence, in Lippard, S. J., Naess, A., and Sinding-Larsen, R., eds., Proceedings of IAMG'99, Vol. 1: The Fifth Annual Conference of the International Association for Mathematical Geology, Tapir, Trondheim, Norway, p. 211–216.

  • Mateu-Figueras, G., Barceló-Vidal, C., and Pawlowsky-Glahn, V., 1998, Modeling compositional data with multivariate skew-normal distributions, in Buccianti, A., Nardi, G., and Potenza, R., eds., Proceedings of IAMG'98, v. 1: The Fourth Annual Conference of the International Association for Mathematical Geology, De Frede, Napoli, Italy, p. 532–537.

  • Pawlowsky-Glahn, V., and Buccianti, A., 2002, The statistical analysis of compositional data: From theory to practice, in Buccianti, A., Marini, L., Ottonello, G., and Vaselli, O., eds., Proceedings of the Arezzo Seminar on Fluid Geochemistry: Genova, Italy, p. 61–73.

  • Pawlowsky-Glahn, V., and Egozcue, J. J., 2002, About BLU estimators and compositional data: Math. Geol., v. 34, no. 3, p. 259–274.

    Article  Google Scholar 

  • Reyment, R. A., and Savazzi, E., 1999, Aspects of multivariate statistical analysis in geology: Elsevier, Amsterdam, 285 p.

    Google Scholar 

  • Reynolds, J. H., and Billheimer, D., 2002, Basic compositional data analysis functions for S+/R. Available at http://www.biostat.wustl.edu/archives/html/s-news/2003-12/msg00139.html. (on April 26, 2004).

  • Sidder, G. B., 1994, Petro.Calc.Plot, Microsoft Excel macros to aid petrologic interpretation. Comput. Geosci., v. 20, no. 6, p. 1041–1061. Available at http://www.ndsu.nodak.edu/instruct/sainieid/software/petro-calc-plot/. (on April 26, 2004).

  • Thió-Henestrosa, S., Barceló-Vidal, C., Martín-Fernández, J. A., and Pawlowsky-Glahn, V., 2003, CoDaPack. An Excel and Visual Basic based sotfware of compositional data analysis. Current version and discussion for upcoming versions, in Thió-Henestrosa, S., and Martín-Fernández, J. A., eds., Proceedings of CODAWORK'03: The First Compositional Data Analysis Workshop, Girona, Spain, 8 p. (CD, electronic publication).

  • Thomson, T. A., Triplot Version 4.02, 2004. Available at http://home.earthlink.net/∼baedke/triplot/. (on April 26, 2004).

  • von Eynatten, H., Barceló-Vidal, C., and Pawlowsky-Glahn, V., 2003, Modelling compositional change: The example of chemical weathering of granitoid rocks: Math. Geol., v. 35, no. 3, p. 231–251.

    Article  Google Scholar 

  • von Eynatten, H., Pawlowsky-Glahn, V., and Egozcue, J. J., 2002, Understanding perturbation on the simplex: A simple method to better visualize and interpret compositional data in ternary diagrams: Math. Geol., v. 34, no. 3, p. 249–257.

    Article  Google Scholar 

  • Zippi, P. A., 2001, Ternary Plot 4.0 for Macintosh OS. Available at http://www.pazsoftware.com/Ternary4.html. (on April 26, 2004).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Thió-Henestrosa.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Thió-Henestrosa, S., Martín-Fernández, J.A. Dealing with Compositional Data: The Freeware CoDaPack. Math Geol 37, 773–793 (2005). https://doi.org/10.1007/s11004-005-7379-3

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11004-005-7379-3

Keywords

Navigation