Skip to main content

Zero Replacement in Compositional Data Sets

  • Conference paper
Data Analysis, Classification, and Related Methods

Abstract

The sample space of compositional data is the open simplex. Therefore, zeros in a compositional data set are identified either with below detection limit values, or lead to a division of the data set into different subpopulations with the corresponding lower dimensional sample space. Most multivariate data analysis techniques require complete data matrices, thus calling for a strategy of imputation of zeros in the first case. Existing replacement methods of rounded zeros are reviewed, and a new method is proposed, who’s properties are analyzed and illustrated. The method is applied in a hierarchical cluster analysis of compositional data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • AITCHISON, J. (1986): The Statistical Analysis of Compositional Data. Chapman and Hall, New York (USA), 416

    Google Scholar 

  • AITCHISON, J., BARCELÓ-VIDAL, C., MARTÍN-FERNÁNDEZ, J.A., and PAWLOWSKY-GLAHN, V. (2000): Logratio analysis and compositional distance. Mathematical Geology, (in press).

    Google Scholar 

  • KRZANOWSKI, W.J. (1988): Principles of Multivariate Analysis. A User’s Perspective. Clarendon Press, Oxford (GB), 563 p. (reprinted 1996).

    Google Scholar 

  • LITTLE, R.J.A and RUBIN, D.B. (1987): Statistical Analysis with Missing Data. John Wiley & Sons, New York(USA), 278p.

    Google Scholar 

  • MARTÍN-FERNÁNDEZ, J.A., BARCELÓ-VIDAL, C., and PAWLOWSKY-GLAHN, V. (1998): A Critical Approach to Non-parametric Classification of Compositional Data. In: A. Rizzi, M. Vichi, and H.-H. Bock (Eds.): Advances in Data Science and Classification. Springer, Heidelberg, pp. 49–56.

    Google Scholar 

  • SANDFORD, R.F, PIERSON, C.T., and CROVELLI, R.A. (1993): An Objective Replacement Method for Censored Geochemical Data. Mathematical Geology, Vol. 25:1, pp 59–80.

    Article  Google Scholar 

  • TAUBER, F. (1999): Spurious clusters in Granulometric Data Caused by LogratioTransformation. Mathematical Geology, Vol. 31:5, pp. 491–504

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Martín-Fernández, J.A., Barceló-Vidal, C., Pawlowsky-Glahn, V. (2000). Zero Replacement in Compositional Data Sets. In: Kiers, H.A.L., Rasson, JP., Groenen, P.J.F., Schader, M. (eds) Data Analysis, Classification, and Related Methods. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-59789-3_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-59789-3_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67521-1

  • Online ISBN: 978-3-642-59789-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics