Abstract
Clustering is a widely applied tool of data mining to detect the hidden structure of complex multivariate datasets. Hence, clustering solves two kinds of problems simultaneously, it partitions the datasets into cluster of objects that are similar to each other and describes the clusters by cluster prototypes to provide some information about the distribution of the data. In most of the cases these cluster prototypes describe the clusters as simple geometrical objects, like spheres, ellipsoids, lines, linear subspaces etc., and the cluster prototype defines a special distance function. Unfortunately in most of the cases the user does not have prior knowledge about the number of clusters and not even about the proper shape of prototypes. The real distribution of data is generally much more complex than these simple geometrical objects, and the number of clusters depends much more on how well the chosen cluster prototypes fit the distribution of data than on the real groups within the data. This is especially true when the clusters are used for local linear modeling purposes.
The aim of this paper is not to define a new distance norm based on a problem dependent cluster prototype but to show how the so called geodesic distance that is based on the exploration of the manifold the data lie on, can be used in the clustering instead of the classical Euclidean distance. The paper presents how this distance measure can be integrated within fuzzy clustering and some examples are presented to demonstrate the advantages of the proposed new methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Naud, A.: Neural and statistical methods for the visualization of multidimensional data. PhD thesis (2001)
Gustafson, D.E., Kessel, W.C.: Fuzzy clustering with fuzzy covariance matrix. In: Proceedings of the IEEE CDC, pp. 761–766. IEEE Computer Society Press, Los Alamitos (1979)
Gath, I., Geva, A.B.: Unsupervised optimal fuzzy clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 7, 773–781 (1989)
Abonyi, J., et al.: Modified gath-geva clustering for fuzzy segmentation of multivariate time-series. Fuzzy Sets and Systems - Fuzzy Sets in Knowledge Discovery 149(1), 39–56 (2005)
Abonyi, J., Szeifert, F., Babuska, R.: Modified gath-geva fuzzy clustering for identification of takagi-sugeno fuzzy models. IEEE Trans. on Systems, Man and Cybernetics, Part B 32(5), 612–621 (2002)
Tenenbaum, J.B., Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–2323 (2000)
Bezdek, J.C.: Pattern recognition with fuzzy objective function algorithms. Plenum Press, New York (1981)
Kaufman, L., Rousseeuw, P.J.: Finding groups in data: An introduction to cluster analysis. John Wiley & Sons, Chichester (1990)
Saul, L.K., Roweis, S.T.: An introduction to locally linear embedding. Technical report, AT&T Labs - Research (2001)
Bernstein, M., et al.: Graph approximations to geodesics on embedded manifolds. Technical report, Department of Psychology, Stanford University (2000)
Souvenir, R., Pless, R.: Manifold clustering. In: 10th International Conference on Computer Vision, pp. 648–653 (2005)
Roweis, S.T., Saul, L.K.: Nonlinear dimensionality reduction by locally linear embedding. Science 290, 2323–2326 (2000)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Feil, B., Abonyi, J. (2007). Geodesic Distance Based Fuzzy Clustering. In: Saad, A., Dahal, K., Sarfraz, M., Roy, R. (eds) Soft Computing in Industrial Applications. Advances in Soft Computing, vol 39. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70706-6_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-70706-6_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70704-2
Online ISBN: 978-3-540-70706-6
eBook Packages: EngineeringEngineering (R0)