Skip to main content

A Collinearity Based Hierarchical Method to Identify Clusters of Variables

  • Conference paper
New Developments in Classification and Data Analysis

Abstract

The most frequently used hierarchical methods for clustering of quantitative variables are based on bivariate or multivariate correlation measures. These solutions can be unsuitable in presence of uncorrelated but collinear variables. In this paper we propose a hierarchical agglomerative algorithm based on a similarity measure which takes into account the collinearity between two groups of variables. Its main theoretical features are described and its performance is evaluated both on simulated and real data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • ANDERBERG, M.R. (1973): Cluster Analysis for Applications. Academic Press, New York.

    Google Scholar 

  • BELSLEY, D.A. (1991): Conditioning Diagnostics. John Wiley and Sons, New York.

    Google Scholar 

  • GORDON, A.D. (1999): Classification. Chapman and Hall/CRC, London.

    Google Scholar 

  • HARTIGAN, J.A. (1975): Clustering Algorithms. John Wiley, New York.

    Google Scholar 

  • NICOLAU, F.C. and BACELAR-NICOLAU, H. (1998): Some trends in the classification of variables. In: C. Hayashi, N. Ohsumi, K. Yajima, Y. Tanaka, H. Bock and Y. Baba (Eds.): Data Science, Classification and Related Methods. Springer, Berlin, 89–98.

    Google Scholar 

  • MULLER, K.E. (1982): Understanding canonical correlation through the general linear model and principal components. The American Statistician, 36, 342–354

    Article  MATH  Google Scholar 

  • SOFFRITTI, G. (1999): Hierarchical clustering of variables: a comparison among strategies of analysis. Communications in Statistics-Simulation and Computation, 28, 977–999.

    MATH  Google Scholar 

  • SOFFRITTI, G. (2003): Identifying multiple cluster structures in a data matrix. Communications in Statistics-Simulation and Computation, 32, 1151–1177.

    Article  MATH  MathSciNet  Google Scholar 

  • VIGNEAU, E. and QANNARI, E.M. (2003): Clustering of variables around latent components. Communications in Statistics-Simulation and Computation, 32, 1131–1150

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Laghi, A., Soffritti, G. (2005). A Collinearity Based Hierarchical Method to Identify Clusters of Variables. In: Bock, HH., et al. New Developments in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27373-5_7

Download citation

Publish with us

Policies and ethics