A Collinearity Based Hierarchical Method to Identify Clusters of Variables

Laghi, Annalisa; Soffritti, Gabriele

doi:10.1007/3-540-27373-5_7

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization

Abstract

The most frequently used hierarchical methods for clustering quantitative variables are based on bivariate or multivariate correlation measures. These solutions can be unsuitable in the presence of uncorrelated but collinear variables. In this paper we propose a hierarchical agglomerative algorithm based on a similarity measure that takes into account the collinearity between two groups of variables. Its main theoretical features are described, and its performance is evaluated on both simulated and real data sets.
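
The abstract's point that variables can be collinear without being strongly correlated pairwise can be illustrated with a small simulation. The sketch below is not the authors' similarity measure or algorithm, which this page does not describe. It is a minimal NumPy illustration under assumed settings: the function name `collinearity_demo`, the sample size, and the number of variables are all hypothetical. Ten independent variables are generated together with their sum. Every pairwise correlation stays modest (in theory about 1/sqrt(10) between the sum and each component, and about zero among the components), yet the smallest eigenvalue of the correlation matrix is numerically zero, which signals an exact linear dependency.

```python
import numpy as np


def collinearity_demo(n_obs=2000, n_vars=10, seed=0):
    """Return (largest absolute pairwise correlation, smallest eigenvalue
    of the correlation matrix) for n_vars independent variables plus their sum.

    All settings here are illustrative assumptions, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n_obs, n_vars))      # independent variables
    total = x.sum(axis=1, keepdims=True)          # exact linear combination
    data = np.hstack([x, total])                  # n_vars + 1 columns

    corr = np.corrcoef(data, rowvar=False)        # columns are variables
    off_diagonal = corr[~np.eye(corr.shape[0], dtype=bool)]
    max_abs_corr = float(np.abs(off_diagonal).max())

    # A (near-)zero eigenvalue reveals a linear dependency among the columns,
    # even when no single pairwise correlation is large.
    min_eigenvalue = float(np.linalg.eigvalsh(corr).min())
    return max_abs_corr, min_eigenvalue


max_abs_corr, min_eig = collinearity_demo()
print(f"largest |pairwise correlation|: {max_abs_corr:.3f}")
print(f"smallest eigenvalue of correlation matrix: {min_eig:.2e}")
```

A clustering method driven only by pairwise correlations would see no strong link among these eleven variables, which is the kind of situation the abstract describes as problematic.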

Author information

Authors and Affiliations

Servizio Controllo di Gestione e Sistemi Statistici, Regione Emilia-Romagna, Bologna, Italy
Annalisa Laghi
Dipartimento di Scienze Statistiche, Università di Bologna, Italy
Gabriele Soffritti

Editor information

Editors and Affiliations

Aachen
H.-H. Bock
Karlsruhe
W. Gaul
Rome
M. Vichi
Newark
Ph. Arabie
Cottbus
D. Baier
Milton Keynes
F. Critchley
Bielefeld
R. Decker
Paris
E. Diday
Barcelona
M. Greenacre
Naples
C. Lauro
Leiden
J. Meulman
Bologna
P. Monari
Toronto
S. Nishisato
Tokyo
N. Ohsumi
Augsburg
O. Opitz
Passau
G. Ritter
Mannheim
M. Schader
Dortmund
C. Weihs
Department of Statistics, Probability and Applied Statistics, University of Rome “La Sapienza”, Piazzale Aldo Moro 5, 00185, Rome, Italy
Maurizio Vichi
Department of Statistical Sciences, University of Bologna, Via Belle Arti 41, 40126, Bologna, Italy
Paola Monari, Stefania Mignani & Angela Montanari

Cite this paper

Laghi, A., Soffritti, G. (2005). A Collinearity Based Hierarchical Method to Identify Clusters of Variables. In: Bock, H.-H., et al. (eds.) New Developments in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27373-5_7

DOI: https://doi.org/10.1007/3-540-27373-5_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23809-6
Online ISBN: 978-3-540-27373-8
eBook Packages: Mathematics and Statistics (R0)