Abstract
The sample space of compositional data is the open simplex. Zeros in a compositional data set are therefore interpreted in one of two ways: as values below the detection limit (rounded zeros), or as grounds for dividing the data set into subpopulations, each with its own lower-dimensional sample space. Most multivariate data analysis techniques require complete data matrices, so the first case calls for a strategy for imputing the zeros. Existing methods for replacing rounded zeros are reviewed, and a new method is proposed whose properties are analyzed and illustrated. The method is applied in a hierarchical cluster analysis of compositional data.
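The abstract does not give the replacement formula, so the following is only a minimal sketch of one way a rounded zero could be imputed while keeping the composition on the simplex. It assumes a multiplicative scheme: each zero part is set to a small value `delta`, and the nonzero parts are shrunk by a common factor so that the total is unchanged and the ratios among the nonzero parts are preserved. The function name `replace_rounded_zeros`, the parameter `delta`, and the example values are hypothetical and are not taken from the chapter.

```python
def replace_rounded_zeros(x, delta, total=1.0):
    """Impute zeros in a single composition (illustrative sketch only).

    x     -- parts of one composition, assumed to sum to `total`
    delta -- small positive value assigned to every zero part (assumption)
    total -- closure constant, e.g. 1.0 or 100.0
    """
    n_zeros = sum(1 for part in x if part == 0)
    # Common shrink factor for the nonzero parts, chosen so that the
    # imputed composition still sums to `total`.
    shrink = 1.0 - n_zeros * delta / total
    return [delta if part == 0 else part * shrink for part in x]


# Hypothetical four-part composition with one rounded zero.
composition = [0.5, 0.3, 0.2, 0.0]
imputed = replace_rounded_zeros(composition, delta=0.01)
```

In this sketch the nonzero parts are all multiplied by the same factor, so their pairwise ratios are unchanged. That matters for logratio-based analyses, which cannot be applied to parts equal to zero.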
Copyright information
© 2000 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Martín-Fernández, J.A., Barceló-Vidal, C., Pawlowsky-Glahn, V. (2000). Zero Replacement in Compositional Data Sets. In: Kiers, H.A.L., Rasson, JP., Groenen, P.J.F., Schader, M. (eds) Data Analysis, Classification, and Related Methods. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-59789-3_25
DOI: https://doi.org/10.1007/978-3-642-59789-3_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67521-1
Online ISBN: 978-3-642-59789-3
eBook Packages: Springer Book Archive