Skip to main content
Log in

Some aspects of transformations of compositional data and the identification of outliers

  • Published:
Mathematical Geology Aims and scope Submit manuscript

Abstract

The statistical analysis of compositional data is based on determining an appropriate transformation from the simplex to real space. Possible transfonnations and outliers strongly interact: parameters of transformations may be influenced particularly by outliers, and the result of goodness-of-fit tests will reflect their presence. Thus, the identification of outliers in compositional datasets and the selection of an appropriate transformation of the same data, are problems that cannot be separated. A robust method for outlier detection together with the likelihood of transformed data is presented as a first approach to solve those problems when the additive-logratio and multivariate Box-Cox transformations are used. Three examples illustrate the proposed methodology.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Aitchison, J., 1986. The statistical analysis ol compositional data: Chapman and Hall, London. 416 p.

    Google Scholar 

  • Andrews, D. F., Gnanadesikan, R., and Warner, J. L., 1971. Transformations of multivariate data: Biometrics, v. 27. p. 825–840. Quoted in Ganandesikan, R., 1977. Methods for statistical data analysis ol multivariate observations: John Wiley & Sons, New York, 311 p.

    Google Scholar 

  • Bamett, V., and Lewis, T., 1994. Outliers in statistical data (3rd ed I: John Wiles & Sons, Chichester. 584 p.

    Google Scholar 

  • Coakley, J. P., and Rust, B. R., 1968, Sedimentation in an Arctic lake: Jour. Sed. Pet., v. 38. no. 4. p. 1290–1300: Quoted in Aitchison, J., 1986. The statistical analysis of compositional data: Chapman and Hall, London, 416 p.

    Google Scholar 

  • Gnanadesikan, R., 1977. Methods for statistical data analysis of multivariale ohservations: John Wiley & Sons, New York, 311 p.

    Google Scholar 

  • Hadi, A. S., 1992, Identifying multiple outliers in multivariate data: Jour. Roy. Statist. Soc. B. v. 54, no. 3, p. 761–771.

    Google Scholar 

  • Hadi, A. S., 1994, A modification of a method for the detection of outliers in multivariate samples: Jour. Roy. Statist. Soc. B. v. 56. no. 2. p. 393–346.

    Google Scholar 

  • McLachlan, G. J., and Basford, K. E., 1988, Mixture models, inference and applications to clustering: Marcel Dekker, New York, 253 p.

    Google Scholar 

  • Rayens, W. S., and Srinivasan, C., 1991, Box-Cox transformations in the analysis of compositional data: Jour. Chemometrics, v. 5, no. 3, p. 227–239.

    Google Scholar 

  • Rousseeuw, P. J., and Leroy, A. M., 1987, Robust regression and outlier detection: John Wiley & Sons, New York, 329 p.

    Google Scholar 

  • Rousseeuw, P. J., and Zomeren, B. C., 1990, Unmasking multivariate outliers and leverage points (with comments): Jour. Am. Statist. Assoc., v. 85, no. 411, p. 633–651.

    Google Scholar 

  • Seber, G. A. F., 1984, Multivariate observations: John Wiley & Sons, New York, 686 p.

    Google Scholar 

  • Thompson, R. N., Esson, J., and Duncan, A. C., 1972, Major element chemical variation in the Eocene lavas of the Isle of Skye, Scotland: Jour. Petrology, v. 13. no. 2. p. 219–253; Quoted in Aitchison, J., 1986. The statistical analysis of compositional data: Chapman and Hall, London, 416 p.

    Google Scholar 

  • Wilks, S. S., 1938. The large-sample distribution of the likelihood ratio for testing composite hypotheses: Ann. Math. Statis., v. 9, no. 1, p. 60–62.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Barceló, C., Pawlowsky, V. & Grunsky, E. Some aspects of transformations of compositional data and the identification of outliers. Math Geol 28, 501–518 (1996). https://doi.org/10.1007/BF02083658

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02083658

Key words

Navigation