Abstract
The evaluation of the effectiveness of organisations can be aided by the use of cluster analysis, suggesting and clarifying differences in structure between successful and failing organisations. Unfortunately, traditional methods of cluster analysis are highly sensitive to the presence of atypical observations and departures from normality. We describe a form of robust clustering using the forward search that allows the data to determine the number of clusters and so allows for outliers. An example is given of the successful clustering of customers of a bank into groups that are decidedly non-normal.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Atkinson, A. C., & Riani, M. (2007). Exploratory tools for clustering multivariate data. Computational Statistics and Data Analysis, 52, 272–285 doi:10.1016/j.csda.2006.12.034
Atkinson, A. C., Riani, M., & Cerioli, A. (2004). Exploring multivariate data with the forward search. New York: Springer
Atkinson, A. C., Riani, M., & Cerioli, A. (2006). Random start forward searches with envelopes for detecting clusters in multivariate data. In S. Zani, A. Cerioli, M. Riani, & M. Vichi (eds.), Data analysis, classification and the forward search (pp. 163–171). Berlin: Springer
Atkinson, A. C., Riani, M., & Laurini, F. (2007). Approximate envelopes for finding an unknown number of multivariate outliers in large data sets. In S. Aivazian, P. Filzmoser, & Y. Kharin (eds.), Proceedings of the Eighth International Conference on Computer Data Analysis and Modeling (pp. 11–18). Russian Federation : Artia, Minsk
Bini, M., Riani, M., Atkinson, A., & Cerioli, A. (2004). Analisi di efficienza e di efficacia del sistema universitario italiano attraverso nuove metodologie statistiche multivariate robuste. Research report 03, Comitato Nazionale per la Valutazione del Sistema Universitario (CNVSU), MIUR, Ministero dell’Istruzione dell’Universit e della Ricerca. RDR document produced on behalf of CNVSU. http://www.cnvsu.it/{ _}library/downloadfile.asp?id=11265
Cerioli, A., Riani, M., & Atkinson, A. C. (2006). Robust classification with categorical variables. In A. Rizzi & M. Vichi (eds.), COMPSTAT 2006: Proceedings in Computational Statistics (pp. 507–519). Heidelberg: Physica
Fraley, C., & Raftery, A. E. (2006). MCLUST version 3: an R package for normal mixture modeling and model-based clustering. Tech. Rep. 504, University of Washington, Department of Statistics, Seattle, WA
Peck, L. (2005). Using cluster analysis in program evaluation. Evaluation Review, 29, 178–196
Riani, M., & Atkinson, A. C. (2007). Fast calibrations of the forward search for testing multiple outliers in regression. Advances in data analysis and classification, 1, 123–141
Riani, M., Atkinson, A. C., & Cerioli, A. (2009). Finding an unknown number of multivariate outliers. Journal of the Royal Statistical Society, Series B, 71, 447–466
Riani, M., Cerioli, A., Atkinson, A., Perrotta, D., & Torti, F. (2008). Fitting mixtures of regression lines with the forward search. In F. Fogelman-Soulié, D. Perrotta, J. Piskorski, & R. Steinberger (eds.), Mining massive data sets for security (pp. 271–286). Amsterdam: IOS
Acknowledgements
This work was supported by the grants “Metodi statistici multivariati per la valutazione integrata della qualità dei servizi di pubblica utilità: efficacia-efficienza, rischio del fornitore, soddisfazione degli utenti” and “Metodologie statistiche per lanalisi di impatto e la valutazione della regolamentazione” of Ministero dell’Università e della Ricerca PRIN 2006.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Atkinson, A.C., Riani, M., Cerioli, A. (2010). Robust Clustering for Performance Evaluation. In: Palumbo, F., Lauro, C., Greenacre, M. (eds) Data Analysis and Classification. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03739-9_43
Download citation
DOI: https://doi.org/10.1007/978-3-642-03739-9_43
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03738-2
Online ISBN: 978-3-642-03739-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)