Clustering Incomplete Data Using Kernel-Based Fuzzy C-means Algorithm

Zhang, Dao-Qiang; Chen, Song-Can

doi:10.1023/B:NEPL.0000011135.19145.1b

Clustering Incomplete Data Using Kernel-Based Fuzzy C-means Algorithm

Published: December 2003

Volume 18, pages 155–162, (2003)
Cite this article

Neural Processing Letters Aims and scope Submit manuscript

Dao-Qiang Zhang¹ &
Song-Can Chen¹

1230 Accesses
245 Citations
Explore all metrics

Abstract

There is a trend in recent machine learning community to construct a nonlinear version of a linear algorithm using the 'kernel method', e.g. Support Vector Machines (SVMs), kernel principal component analysis, kernel fisher discriminant analysis and the recent kernel clustering algorithms. In unsupervised clustering algorithms using kernel method, typically, a nonlinear mapping is used first to map the data into a potentially much higher feature space, where clustering is then performed. A drawback of these kernel clustering algorithms is thatthe clustering prototypes lie in high dimensional feature space and hence lack clear and intuitive descriptions unless using additional projection approximation from the feature to the data space as done in the existing literatures. In this paper, a novel clustering algorithm using the 'kernel method' based on the classical fuzzy clustering algorithm (FCM) is proposed and called as kernel fuzzy c-means algorithm (KFCM). KFCM adopts a new kernel-induced metric in the data space to replace the original Euclidean norm metric in FCM and the clustered prototypes still lie in the data space so that the clustering results can be reformulated and interpreted in the original space. Our analysis shows that KFCM is robust to noise and outliers and also tolerates unequal sized clusters. And finally this property is utilized to cluster incomplete data. Experiments on two artificial and one real datasets show that KFCM hasbetter clustering performance and more robust than several modifications of FCM for incomplete data clustering.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bezdek, J. C.: Pattern Recognition with Fuzzy Objective Function Algorithms, Plenum Press, New York, 1981.
Google Scholar
Wu, K. L. and Yang, M. S.: Alternative c-means clustering algorithms, Pattern Recognition vol. 35, pp. 2267–2278, 2002.
Google Scholar
Girolami, M.: Mercer kernel-based clustering in feature space, IEEE Trans. Neual Networks 13(3) (2002), 780–784.
Google Scholar
Zhang, L., Zhou, W. D. and Jiao, L. C.: Kernel clustering algorithm, Chinese J. Computers 25(6) (2002), 587–590 (in Chinese).
Google Scholar
Zhang, D. Q. and Chen, S. C.: Fuzzy clustering using kernel methods, In: Procedings of Inter. Conf. Controland Automatation (ICCA'02), June 16–19, pp. 123–128, Xiamen, China, 2002.
Muller, K. R. and Mika, S. et al.: An introduction to Kernel-based learning algorithms, IEEE Trans. NeuralNetworks 12(2) (2001), 181–202.
Google Scholar
Vapnik, V. N.: StatisticalLearning Theory. Wiley, New York, 1998.
Google Scholar
Hathaway, R. J. and Bezdek, J. C.: Clustering incomplete relational data using the non-Euclidean relational fuzzy c-means algorithm. Pattern Recognition Letters 23 (2002), 151–160.
Google Scholar
Jain, A. K. and Dubes, R. C.: Algorithms for Clustering Data, Englewood Cliffs, NJ, 1988.
Google Scholar
Gaul, W. and Schader, M.: Pyramidal classi.cation based on incomplete data, J. Classification 11 (1994), 171–193.
Google Scholar
Schafer, J. L.: Analysis of Incomplete Multivariate Data, Chapman & Hall, London, 1997.
Google Scholar
Hathaway, R. J. and Bezdek, J. C.: Fuzzy c-means clustering of incomplete data, IEEE Trans. Syst. Man. Cybernetics 31(5) (2001), 735–744.
Google Scholar
Jolliffe, L. T.: PrincipalComponent Analysis, Springer-Verlag, 1986.
Rudin, W.: Principles of Mathematical Analysis, McGraw-Hill Book Company, New York, 1976.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 210016, People's Republic of China
Dao-Qiang Zhang & Song-Can Chen

Authors

Dao-Qiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Song-Can Chen
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, DQ., Chen, SC. Clustering Incomplete Data Using Kernel-Based Fuzzy C-means Algorithm. Neural Processing Letters 18, 155–162 (2003). https://doi.org/10.1023/B:NEPL.0000011135.19145.1b

Download citation

Issue Date: December 2003
DOI: https://doi.org/10.1023/B:NEPL.0000011135.19145.1b

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Clustering Incomplete Data Using Kernel-Based Fuzzy C-means Algorithm

Abstract

Access this article

Similar content being viewed by others

Kernel Functions for Clustering of Incomplete Data: A Comparative Study

Fuzzy c-Means Clustering of Incomplete Data Using Dimension-Wise Fuzzy Variances of Clusters

Multiple Kernel Based Collaborative Fuzzy Clustering Algorithm

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Clustering Incomplete Data Using Kernel-Based Fuzzy C-means Algorithm

Abstract

Access this article

Similar content being viewed by others

Kernel Functions for Clustering of Incomplete Data: A Comparative Study

Fuzzy c-Means Clustering of Incomplete Data Using Dimension-Wise Fuzzy Variances of Clusters

Multiple Kernel Based Collaborative Fuzzy Clustering Algorithm

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation