A Robust Methodology for Comparing Performances of Clustering Validity Criteria
Many clustering validity measures are available and are useful in practice as quantitative criteria for evaluating the quality of data partitions. Faced with such a variety, however, users find it hard to choose a specific measure. This paper introduces an alternative, robust methodology for comparing clustering validity measures, designed specifically to get around some conceptual flaws of the comparison paradigm traditionally adopted in the literature. An illustrative example is presented that compares the performance of four well-known validity measures over a collection of 7776 data partitions of 324 different data sets.