Abstract
The sample space of compositional data, a simplex, induces a different kind of geometry, known as Aitchison geometry, with the Euclidean space property. For this reason, the standard statistical analysis is not meaningful here, and this is also true for measures of location and covariance. The measure of location, called centre, is the best linear unbiased estimator of the central tendency of the distribution of a random composition with respect to the geometry on the simplex (Pawlowsky-Glahn and Egozcue in Stoch Envir Res Risk Ass, 15:384–398, 2001; Math Geol, 34:259–274, 2002). Its covariance structure is described through a variation matrix, which induces the so called total variation as a measure of dispersion. The aim of the paper is to show that its sample counterpart has theoretical properties, corresponding to the standard multivariate case, like unbiasedness and convergence in probability. Moreover, its distribution in the case of normality on the simplex is developed.
Similar content being viewed by others
References
Aitchison J (1986) The statistical analysis of compositional data. Monographs on statistics and applied probability. Chapman and Hall, London
Aitchison J, Shen SM (1980) Logistic-normal distibutions. Some properties and uses. Biometrika 67: 261–272
Anděl J (1978) Mathematical statistics (in Czech). SNTL/Alfa, Prague
Buccianti A, Pawlowsky-Glahn V (2005) New perspectives on water chemistry and compositional data analysis. Math Geol 37: 703–727
Buccianti A, Mateu-Figueras G, Pawlowsky-Glahn V (eds) (2006) Compositional data analysis in the geosciences: from theory to practice. Geological Society, London (Special Publications), p 264
Daunis-i-Estadella J, Barceló-Vidal C, Buccianti A (2006) Exploratory compositional data analysis. In: Buccianti A, Mateu-Figueras G, Pawlowsky-Glahn V (eds) Compositional data analysis in the geosciences: from theory to practice, vol 264. Geological Society, London (Special Publications), pp 161–174
Egozcue JJ, Pawlowsky-Glahn V, Mateu-Figueras G, Barceló-Vidal C (2003) Isometric logratio transformations for compositional data analysis. Math Geol 35: 279–300
Egozcue JJ, Pawlowsky-Glahn V (2005) Groups of parts and their balances in compositional data analysis. Math Geol 37: 795–828
Egozcue JJ, Pawlowsky-Glahn V (2006) Simplicial geometry for compositional data. In: Buccianti A, Mateu-Figueras G, Pawlowsky-Glahn V (eds) Compositional data analysis in the geosciences: from theory to practice, vol 264. Geological Society, London (Special Publications), pp 145–160
Filzmoser P, Hron K (2008) Outlier detection for compositional data using robust methods. Math Geosci 40: 233–248
Filzmoser P, Hron K, Reimann C (2009) Principal component analysis for compositional data with outliers. Environmetrics 20: 621–632
Glueck DH, Muller KE (1998) On the trace of a Wishart. Commun Stat Theory Methods 21: 2137–2141
Halmos PR (1974) Measure theory. Springer, New York
Kshirsagar AM (1972) Multivariate analysis. Dekker, New York
Mateu-Figueras G, Pawlowsky-Glahn V (2008) A critical approach to probability laws in geochemistry. Math Geosci 40: 489–502
Mathai AM (1980) Moments of the trace of a noncentral Wishart matrix. Commun Stat Theory Methods 9: 795–801
Mathai AM, Pillai KCS (1982) Further results on the trace of a noncentral Wishart matrix. Commun Stat Theory Methods 11: 1077–1086
Pawlowsky-Glahn V, Egozcue JJ (2001) Geometric approach to statistical analysis on the simplex. Stoch Envir Res Risk Ass 15: 384–398
Pawlowsky-Glahn V, Egozcue JJ (2002) BLU estimators and compositional data. Math Geol 34: 259–274
Pawlowsky-Glahn V, Egozcue JJ, Tolosana-Delgado J (2008) Lecture notes on compositional data analysis. http://hdl.handle.net/10256/297. Accessed 17 Feb 2009
Reimann C, Filzmoser P, Garrett RG, Dutter R (2008) Statistical data analysis explained. Applied environmental statistics with R. Wiley, Chichester
Tolosana-Delgado R, Otero N, Pawlowsky-Glahn V, Soler A (2005) Latent compositional factors in the Llobragat river basin (Spain) hydrochemistry. Math Geol 37: 681–702
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Hron, K., Kubáček, L. Statistical properties of the total variation estimator for compositional data. Metrika 74, 221–230 (2011). https://doi.org/10.1007/s00184-010-0299-3
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-010-0299-3