Skip to main content
Log in

Statistical properties of the total variation estimator for compositional data

  • Published:
Metrika Aims and scope Submit manuscript

Abstract

The sample space of compositional data, a simplex, induces a different kind of geometry, known as Aitchison geometry, with the Euclidean space property. For this reason, the standard statistical analysis is not meaningful here, and this is also true for measures of location and covariance. The measure of location, called centre, is the best linear unbiased estimator of the central tendency of the distribution of a random composition with respect to the geometry on the simplex (Pawlowsky-Glahn and Egozcue in Stoch Envir Res Risk Ass, 15:384–398, 2001; Math Geol, 34:259–274, 2002). Its covariance structure is described through a variation matrix, which induces the so called total variation as a measure of dispersion. The aim of the paper is to show that its sample counterpart has theoretical properties, corresponding to the standard multivariate case, like unbiasedness and convergence in probability. Moreover, its distribution in the case of normality on the simplex is developed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Aitchison J (1986) The statistical analysis of compositional data. Monographs on statistics and applied probability. Chapman and Hall, London

    Google Scholar 

  • Aitchison J, Shen SM (1980) Logistic-normal distibutions. Some properties and uses. Biometrika 67: 261–272

    Article  MathSciNet  MATH  Google Scholar 

  • Anděl J (1978) Mathematical statistics (in Czech). SNTL/Alfa, Prague

    Google Scholar 

  • Buccianti A, Pawlowsky-Glahn V (2005) New perspectives on water chemistry and compositional data analysis. Math Geol 37: 703–727

    Article  MATH  Google Scholar 

  • Buccianti A, Mateu-Figueras G, Pawlowsky-Glahn V (eds) (2006) Compositional data analysis in the geosciences: from theory to practice. Geological Society, London (Special Publications), p 264

  • Daunis-i-Estadella J, Barceló-Vidal C, Buccianti A (2006) Exploratory compositional data analysis. In: Buccianti A, Mateu-Figueras G, Pawlowsky-Glahn V (eds) Compositional data analysis in the geosciences: from theory to practice, vol 264. Geological Society, London (Special Publications), pp 161–174

  • Egozcue JJ, Pawlowsky-Glahn V, Mateu-Figueras G, Barceló-Vidal C (2003) Isometric logratio transformations for compositional data analysis. Math Geol 35: 279–300

    Article  MathSciNet  Google Scholar 

  • Egozcue JJ, Pawlowsky-Glahn V (2005) Groups of parts and their balances in compositional data analysis. Math Geol 37: 795–828

    Article  MathSciNet  MATH  Google Scholar 

  • Egozcue JJ, Pawlowsky-Glahn V (2006) Simplicial geometry for compositional data. In: Buccianti A, Mateu-Figueras G, Pawlowsky-Glahn V (eds) Compositional data analysis in the geosciences: from theory to practice, vol 264. Geological Society, London (Special Publications), pp 145–160

  • Filzmoser P, Hron K (2008) Outlier detection for compositional data using robust methods. Math Geosci 40: 233–248

    Article  MATH  Google Scholar 

  • Filzmoser P, Hron K, Reimann C (2009) Principal component analysis for compositional data with outliers. Environmetrics 20: 621–632

    Article  Google Scholar 

  • Glueck DH, Muller KE (1998) On the trace of a Wishart. Commun Stat Theory Methods 21: 2137–2141

    MathSciNet  Google Scholar 

  • Halmos PR (1974) Measure theory. Springer, New York

    MATH  Google Scholar 

  • Kshirsagar AM (1972) Multivariate analysis. Dekker, New York

    MATH  Google Scholar 

  • Mateu-Figueras G, Pawlowsky-Glahn V (2008) A critical approach to probability laws in geochemistry. Math Geosci 40: 489–502

    Article  MATH  Google Scholar 

  • Mathai AM (1980) Moments of the trace of a noncentral Wishart matrix. Commun Stat Theory Methods 9: 795–801

    Article  MathSciNet  Google Scholar 

  • Mathai AM, Pillai KCS (1982) Further results on the trace of a noncentral Wishart matrix. Commun Stat Theory Methods 11: 1077–1086

    Article  MathSciNet  MATH  Google Scholar 

  • Pawlowsky-Glahn V, Egozcue JJ (2001) Geometric approach to statistical analysis on the simplex. Stoch Envir Res Risk Ass 15: 384–398

    Article  MATH  Google Scholar 

  • Pawlowsky-Glahn V, Egozcue JJ (2002) BLU estimators and compositional data. Math Geol 34: 259–274

    Article  MathSciNet  MATH  Google Scholar 

  • Pawlowsky-Glahn V, Egozcue JJ, Tolosana-Delgado J (2008) Lecture notes on compositional data analysis. http://hdl.handle.net/10256/297. Accessed 17 Feb 2009

  • Reimann C, Filzmoser P, Garrett RG, Dutter R (2008) Statistical data analysis explained. Applied environmental statistics with R. Wiley, Chichester

    Book  Google Scholar 

  • Tolosana-Delgado R, Otero N, Pawlowsky-Glahn V, Soler A (2005) Latent compositional factors in the Llobragat river basin (Spain) hydrochemistry. Math Geol 37: 681–702

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Karel Hron.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hron, K., Kubáček, L. Statistical properties of the total variation estimator for compositional data. Metrika 74, 221–230 (2011). https://doi.org/10.1007/s00184-010-0299-3

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00184-010-0299-3

Keywords

Navigation