Spline-based nonlinear biplots

  • Patrick J. F. Groenen
  • Niël J. Le Roux
  • Sugnet Gardner-Lubbe
Regular Article

Abstract

Biplots are helpful tools to establish the relations between samples and variables in a single plot. Most biplots use a projection interpretation of sample points onto linear lines representing variables. These lines can have marker points to make it easy to find the reconstructed value of the sample point on that variable. For classical multivariate techniques such as principal components analysis, such linear biplots are well established. Other visualization techniques for dimension reduction, such as multidimensional scaling, focus on an often nonlinear mapping in a low dimensional space with emphasis on the representation of the samples. In such cases, the linear biplot can be too restrictive to properly describe the relations between the samples and the variables. In this paper, we propose a simple nonlinear biplot that represents the marker points of a variable on a curved line that is governed by splines. Its main attraction is its simplicity of interpretation: the reconstructed value of a sample point on a variable is the value of the closest marker point on the smooth curved line representing the variable. The proposed spline-based biplot can never lead to a worse overall sample fit of the variable as it contains the linear biplot as a special case.

Keywords

Biplot Multidimensional scaling Principal components analysis Splines 

Mathematics Subject Classification

62H25 

References

  1. Borg I, Groenen PJF (2005) Modern multidimensional scaling. Springer Science + Business Media Inc, New YorkMATHGoogle Scholar
  2. De Boor C (1978) A practical guide to splines. Springer-Verlag, New YorkCrossRefMATHGoogle Scholar
  3. Gifi A (1990) Nonlinear multivariate analysis. Wiley, ChichesterMATHGoogle Scholar
  4. Gower JC (1966) Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika 53:588–589MathSciNetGoogle Scholar
  5. Gower JC (1982) Euclidean distance geometry. Math Sci 7:1–14MATHMathSciNetGoogle Scholar
  6. Gower JC, Hand DJ (1996) Biplots. Chapman and Hall, LondonMATHGoogle Scholar
  7. Gower JC, Legendre P (1986) Metric and Euclidean properties of dissimilarity coefficients. J Classif 3:5–48CrossRefMATHMathSciNetGoogle Scholar
  8. Gower JC, Ngouenet RF (2005) Nonlinearity effects in multidimensional scaling. J Multivar Anal 94: 344–365Google Scholar
  9. Gower JC, Meulman JJ, Arnold GM (1999) Nonmetric linear biplots. J Classif 16:181–196CrossRefMATHGoogle Scholar
  10. Gower JC, Lubbe S, Le Roux NJ (2011) Understanding biplots. Wiley, ChichesterCrossRefGoogle Scholar
  11. Hand DJ, Daly F, Lunn AD, McConway KJ, Ostrowski E (1994) A handbook of small data sets. Chapman & Hall, LondonCrossRefGoogle Scholar
  12. Hastie T, Stuetzle W (1989) Principal curves. J Am Stat Assoc 84:502–516CrossRefMATHMathSciNetGoogle Scholar
  13. Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning, 2nd edn. Springer, New YorkCrossRefMATHGoogle Scholar
  14. Jolliffe IT (2002) Principal component analysis, 2nd edn. Springer, New YorkMATHGoogle Scholar
  15. Kruskal JB, Wish MW (1978) Multidimensional scaling. Sage Publications, Beverley HillsGoogle Scholar
  16. Nelder JA, Mead R (1965) A simplex method for function minimization. Comput J 7:308–313CrossRefMATHGoogle Scholar
  17. Vapnik V (1996) The nature of statistical learning theory. Springer, New YorkGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  • Patrick J. F. Groenen
    • 1
  • Niël J. Le Roux
    • 2
  • Sugnet Gardner-Lubbe
    • 3
  1. 1.Econometric InstituteErasmus UniversityRotterdamThe Netherlands
  2. 2.Department of Statistics and Actuarial ScienceStellenbosch UniversityStellenboschSouth Africa
  3. 3.Department of Statistical SciencesUniversity of Cape TownCape TownSouth Africa

Personalised recommendations