Advertisement

On External Measures for Validation of Fuzzy Partitions

  • Alessandro G. Di Nuovo
  • Vincenzo Catania
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4529)

Abstract

The procedure of evaluating the results of a clustering algorithm is know under the term cluster validity. In general terms, cluster validity criteria can be classified in three categories: internal, external and relative. In this work we focus on the external criteria, which evaluate the results of a clustering algorithm based on a pre-specified structure S, that pertains to the data but which is independent of it. Usually S is a crisp partition (i.e. the data labels), and the most common approach for external validation of fuzzy partitions is to apply measures defined for crisp partitions to fuzzy partitions, using crisp partitions derived (hardened) from them. In this paper we discuss fuzzy generalizations of two well known crisp external measures, which are able to assess the quality of a partition U without the hardening of U. We also define a new external validity measure, that we call DNC index, useful for comparing a fuzzy U to a crisp S. Numerical examples based on four real world data sets are given, demonstrating the higher reliability of the DNC index.

Keywords

Data mining Fuzzy Clustering Fuzzy validity index External Validity Criteria Fuzzy Rand Index Partition Assessment DNC index 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Baraldi, A., Blonda, P.: A survey of fuzzy clustering algorithms for pattern recognition. part I-II. IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics) 29, 778–801 (1999)CrossRefGoogle Scholar
  2. 2.
    Bezdek, J.C., Keller, J., Krishnapuram, R., Pal, N.R.: Fuzzy Models and algorithms for pattern recognition and image processing. The handbooks of fuzzy sets series. Springer, Heidelberg (1999)zbMATHGoogle Scholar
  3. 3.
    Mitra, S., Pal, S.K.: Fuzzy sets in pattern recognition and machine intelligence. Fuzzy Sets and Systems 156, 381–386 (2005)CrossRefMathSciNetGoogle Scholar
  4. 4.
    Höppner, F., Klawonn, F., Kruse, R., Runkler, T.: Fuzzy Cluster Analysis: Methods for Classification Data Analysis and Image Recognition. Wiley, New York (1999)zbMATHGoogle Scholar
  5. 5.
    Akay, M.: Nonlinear Biomedical Signal Processing, Fuzzy Logic, Neural Networks, and New Algorithms. Series on Biomedical Engineering. IEEE Press, Los Alamitos (2000)Google Scholar
  6. 6.
    Jain, L.C., Sato-Ilic, M.: Innovations in Fuzzy Clustering: Theory and Applications Theory And Applications. Studies In Fuzziness And Soft Computing. Springer, Heidelberg (2006)zbMATHGoogle Scholar
  7. 7.
    de Oliveira, J.V., Pedrycz, W.: Advances in Fuzzy Clustering and its Applications. Wiley, New York (2007)Google Scholar
  8. 8.
    Theodoridis, S., Koutroubas, K.: Pattern recognition. Academic Press, London (1999)Google Scholar
  9. 9.
    Milligan, G., Cooper, M.: A study of comparability of external criteria for hierarchical cluster analysis. Multivariate Behavioral Research 21, 441–458 (1986)CrossRefGoogle Scholar
  10. 10.
    Hubert, L., Arabie, P.: Comparing partitions. Journal of Classification, 193–218 (1985)Google Scholar
  11. 11.
    Fowlkes, E., Mallows, C.: A method of comparing two hieararchical clusterings. Journal of American Statistical Society 78, 553–569 (1983)zbMATHGoogle Scholar
  12. 12.
    Rand, W.: Objective criteria for evaluation of clustering methods. Journal of the American Statistical Association 66, 846–850 (1971)CrossRefGoogle Scholar
  13. 13.
    Back, C., Hussain, M.: Validity measures for fuzzy partitions. In: Bock, H., Polasek, W. (eds.) Data analysis and information systems, pp. 114–125. Springer, Heidelberg (1995)Google Scholar
  14. 14.
    University of california machine learning repository, http://www.ics.uci.edu/~mlearn/MLRepository.html
  15. 15.
    Pal, N.R., Bezdek, J.C.: On cluster validity for the fuzzy c-means model. Transactions on Fuzzy Systems 3, 370–379 (1995)CrossRefGoogle Scholar

Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Alessandro G. Di Nuovo
    • 1
  • Vincenzo Catania
    • 1
  1. 1.Università degli Studi di Catania, Dipartimento di Ingegneria Informatica e delle Telecomunicazioni, Viale Andrea Doria 6, 95125 CataniaItaly

Personalised recommendations