Skip to main content
Log in

Attribute weighted mercer kernel based fuzzy clustering algorithm for general non-spherical datasets

  • Original Paper
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Clustering analysis is an important topic in artificial intelligence, data mining and pattern recognition research. Conventional clustering algorithms, for instance, the famous Fuzzy C-means clustering algorithm (FCM), assume that all the attributes are equally relevant to all the clusters. However in most domains, especially for high-dimensional dataset, some attributes are irrelevant, and some relevant ones are less important than others with respect to a specific class. In this paper, such imbalances between the attributes are considered and a new weighted fuzzy kernel-clustering algorithm (WFKCA) is presented. WFKCA performs clustering in a kernel feature space mapped by mercer kernels. Compared with the conventional hard kernel-clustering algorithm, WFKCA can yield the meaningful prototypes (cluster centers) of the clusters. Numerical convergence properties of WFKCA are also discussed. For in-depth studies, WFKCA is extended to WFKCA2, which has been demonstrated as a useful tool for clustering incomplete data. Numerical examples demonstrate the effectiveness of the new WFKCA algorithm

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum Press, New York

    MATH  Google Scholar 

  2. Bezdek JC (1980) A convergence theorem for the fuzzy ISODATA clustering algorithms. IEEE Trans Pattern Anal Mach Intell 2:1–8

    Article  MATH  Google Scholar 

  3. Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser 39:1–38

    MATH  MathSciNet  Google Scholar 

  4. Duda RO, Hart PE (1973) Pattern classiacation and scene analysis. Wiley, New York

    Google Scholar 

  5. Elaine YC, Wai KC (2004) An optimization algorithm for clustering using weighted dissimilarity measures. Pattern Recognit 37:943–952

    Article  MATH  Google Scholar 

  6. Frigui H, Nasraoui O (2004) Unsupervised learning of prototypes and attribute weights. Pattern Recognit 37:567–581

    Article  Google Scholar 

  7. Gary K, Honaker J, Joseph A, Scheve K (2000) Listwise deletion is evil: what to do about missing data in political science. http://Gking.Harvard.edu

  8. Girolami M (2002) Mercer kernel-based clustering in feature space. IEEE Trans Neural Netw 13:780–784

    Article  Google Scholar 

  9. Gordon AD, Henderson JT (1977) An algorithm for Euclidean sum-of-squares classification. Biometrics 33:355–362

    Article  MATH  Google Scholar 

  10. Han JW, Kamber M (2000) Data mining: concept and techniques. Morgan Kanfmann, San Mateo

    Google Scholar 

  11. Hathaway RJ, Bezdek JC (2001) Fuzzy c-means clustering of incomplete data. IEEE T Sys Man Cybern B: Cybern 31:735–744

    Article  Google Scholar 

  12. Hopper F (1999) Fuzzy cluster analysis. John Wiley, Chichester

    Google Scholar 

  13. Leski J (2003) Towards a robust fuzzy clustering. Fuzzy Sets Syst 137:215–233

    Article  MATH  MathSciNet  Google Scholar 

  14. Little RJ, Rubin DB (1987) Statistical analysis with missing data. Wiley, New York

    MATH  Google Scholar 

  15. Myrtveit I, Stensrud E, Olsson UH (2001) Analyzing data sets with missing data: an empirical evaluation of imputation methods and likelihood-based methods. IEEE Trans Softw Eng 27:999–1013

    Article  Google Scholar 

  16. Pedrycz W (2001) Fuzzy equalization in the construction of fuzzy sets. Fuzzy Sets Syst 119:329–335

    Article  MATH  MathSciNet  Google Scholar 

  17. Portes M (2004) Image thresholding using Tsallis entropy. Pattern Recognit Lett 25:1059–1065

    Article  Google Scholar 

  18. Roth V, Steinhage V (1999) Nonlinear discriminant analysis using kernel functions. In: Solla SA, Leen TK, Muller K-R (eds) Advances in neural information processing systems. MIT Press, Cambridge, pp 568–574

    Google Scholar 

  19. Shen HB, Wang ST (2004) Fuzzy kernel clustering with outliers. J Softw 15:1021–1029 (in Chinese with English abstract)

    MATH  Google Scholar 

  20. Vapnik VN (1998) Statistical learning theory. Wiley, New York

    MATH  Google Scholar 

  21. Wang Q, Chi ZR (2002) Image thresholding by maximizing the index of nonfuzziness of the 2-D grayscale histogram. Comput Vis Image Underst 85:100–116

    Article  MATH  Google Scholar 

  22. Wang K, Lu WC, Chen NY (2004) Investigation of Ru-porcelain of song dynasty by trace element-SVM method. Comput Appl Chem 21:191–194 (in Chinese with English abstract)

    Google Scholar 

  23. Wang ST, Chung KF, Shen HB (2004) Note on the relationship between probabilistic and fuzzy clustering. Soft Comput 8:366–369

    Google Scholar 

  24. Wu KL, Yang MS (2002) Alternative c-means clustering algorithms. Pattern Recognit 35:2267–2278

    Article  MATH  Google Scholar 

  25. Yeung DS, Wang XZ (2002) Improving performance of similarity-based clustering by feature weight learning. IEEE Trans PAMI 24:556–561

    Google Scholar 

  26. Yu J, Cheng QS, Huang HK (2002) Analysis of the weighting exponent in the FCM. IEEE Trans Syst Man Cybern B 34:634–639

    Article  Google Scholar 

  27. Zhang L, Da ZW, Jiao LC (2002) Kernel clustering algorithm. Chinese J Comput 25:587–590

    MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hongbin Shen.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shen, H., Yang, J., Wang, S. et al. Attribute weighted mercer kernel based fuzzy clustering algorithm for general non-spherical datasets. Soft Comput 10, 1061–1073 (2006). https://doi.org/10.1007/s00500-005-0043-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-005-0043-5

Keywords

Navigation