Skip to main content

Robust Clustering for Performance Evaluation

  • Conference paper
  • First Online:
Data Analysis and Classification

Abstract

The evaluation of the effectiveness of organisations can be aided by the use of cluster analysis, suggesting and clarifying differences in structure between successful and failing organisations. Unfortunately, traditional methods of cluster analysis are highly sensitive to the presence of atypical observations and departures from normality. We describe a form of robust clustering using the forward search that allows the data to determine the number of clusters and so allows for outliers. An example is given of the successful clustering of customers of a bank into groups that are decidedly non-normal.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Atkinson, A. C., & Riani, M. (2007). Exploratory tools for clustering multivariate data. Computational Statistics and Data Analysis, 52, 272–285 doi:10.1016/j.csda.2006.12.034

    Article  MATH  MathSciNet  Google Scholar 

  • Atkinson, A. C., Riani, M., & Cerioli, A. (2004). Exploring multivariate data with the forward search. New York: Springer

    MATH  Google Scholar 

  • Atkinson, A. C., Riani, M., & Cerioli, A. (2006). Random start forward searches with envelopes for detecting clusters in multivariate data. In S. Zani, A. Cerioli, M. Riani, & M. Vichi (eds.), Data analysis, classification and the forward search (pp. 163–171). Berlin: Springer

    Chapter  Google Scholar 

  • Atkinson, A. C., Riani, M., & Laurini, F. (2007). Approximate envelopes for finding an unknown number of multivariate outliers in large data sets. In S. Aivazian, P. Filzmoser, & Y. Kharin (eds.), Proceedings of the Eighth International Conference on Computer Data Analysis and Modeling (pp. 11–18). Russian Federation : Artia, Minsk

    Google Scholar 

  • Bini, M., Riani, M., Atkinson, A., & Cerioli, A. (2004). Analisi di efficienza e di efficacia del sistema universitario italiano attraverso nuove metodologie statistiche multivariate robuste. Research report 03, Comitato Nazionale per la Valutazione del Sistema Universitario (CNVSU), MIUR, Ministero dell’Istruzione dell’Universit e della Ricerca. RDR document produced on behalf of CNVSU. http://www.cnvsu.it/{ _}library/downloadfile.asp?id=11265

  • Cerioli, A., Riani, M., & Atkinson, A. C. (2006). Robust classification with categorical variables. In A. Rizzi & M. Vichi (eds.), COMPSTAT 2006: Proceedings in Computational Statistics (pp. 507–519). Heidelberg: Physica

    Chapter  Google Scholar 

  • Fraley, C., & Raftery, A. E. (2006). MCLUST version 3: an R package for normal mixture modeling and model-based clustering. Tech. Rep. 504, University of Washington, Department of Statistics, Seattle, WA

    Google Scholar 

  • Peck, L. (2005). Using cluster analysis in program evaluation. Evaluation Review, 29, 178–196

    Article  Google Scholar 

  • Riani, M., & Atkinson, A. C. (2007). Fast calibrations of the forward search for testing multiple outliers in regression. Advances in data analysis and classification, 1, 123–141

    Article  MathSciNet  Google Scholar 

  • Riani, M., Atkinson, A. C., & Cerioli, A. (2009). Finding an unknown number of multivariate outliers. Journal of the Royal Statistical Society, Series B, 71, 447–466

    Article  Google Scholar 

  • Riani, M., Cerioli, A., Atkinson, A., Perrotta, D., & Torti, F. (2008). Fitting mixtures of regression lines with the forward search. In F. Fogelman-Soulié, D. Perrotta, J. Piskorski, & R. Steinberger (eds.), Mining massive data sets for security (pp. 271–286). Amsterdam: IOS

    Google Scholar 

Download references

Acknowledgements

This work was supported by the grants “Metodi statistici multivariati per la valutazione integrata della qualità dei servizi di pubblica utilità: efficacia-efficienza, rischio del fornitore, soddisfazione degli utenti” and “Metodologie statistiche per lanalisi di impatto e la valutazione della regolamentazione” of Ministero dell’Università e della Ricerca PRIN 2006.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anthony C. Atkinson .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Atkinson, A.C., Riani, M., Cerioli, A. (2010). Robust Clustering for Performance Evaluation. In: Palumbo, F., Lauro, C., Greenacre, M. (eds) Data Analysis and Classification. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03739-9_43

Download citation

Publish with us

Policies and ethics