Abstract
The most frequently used hierarchical methods for clustering quantitative variables are based on bivariate or multivariate correlation measures. These methods can be unsuitable in the presence of uncorrelated but collinear variables. In this paper we propose a hierarchical agglomerative algorithm based on a similarity measure that takes into account the collinearity between two groups of variables. Its main theoretical features are described, and its performance is evaluated on both simulated and real data sets.
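To illustrate the motivating problem, the following minimal sketch builds a simulated data set in which every pairwise correlation is modest while the variables as a group are almost exactly linearly dependent. It is not the similarity measure proposed in the chapter, which the abstract does not define. The function name `collinearity_demo`, its parameters, and the use of the condition number of the correlation matrix as a collinearity diagnostic are all assumptions made for illustration.

```python
import numpy as np


def collinearity_demo(n=500, p=10, noise_sd=0.01, seed=0):
    """Simulate weakly correlated but nearly collinear variables.

    Hypothetical helper, not taken from the chapter: p independent
    variables plus one variable that is (almost) their sum.
    """
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, p))            # p independent variables
    y = X.sum(axis=1) + noise_sd * rng.standard_normal(n)  # near-exact sum
    data = np.column_stack([X, y])             # n x (p + 1) data matrix

    R = np.corrcoef(data, rowvar=False)        # correlation matrix of the columns
    off_diag = R[~np.eye(p + 1, dtype=bool)]   # drop the unit diagonal
    max_abs_corr = np.abs(off_diag).max()      # strongest pairwise correlation

    eigvals = np.linalg.eigvalsh(R)            # eigenvalues in ascending order
    cond = eigvals[-1] / eigvals[0]            # large ratio signals collinearity
    return max_abs_corr, cond


max_abs_corr, cond = collinearity_demo()
print(f"largest |pairwise correlation|: {max_abs_corr:.3f}")
print(f"condition number of correlation matrix: {cond:.1f}")
```

Under these assumptions, a method that looks only at pairwise correlations sees no strong relationship between any two variables, whereas the eigenvalue ratio exposes the near-linear dependence among the whole group.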
Copyright information
© 2005 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Laghi, A., Soffritti, G. (2005). A Collinearity Based Hierarchical Method to Identify Clusters of Variables. In: Bock, HH., et al. New Developments in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27373-5_7
DOI: https://doi.org/10.1007/3-540-27373-5_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23809-6
Online ISBN: 978-3-540-27373-8
eBook Packages: Mathematics and Statistics (R0)