Abstract
Viewing data processing problems from a geometric point of view, previous work has shown that the intrinsic dimension of data can carry semantic information. In this paper, we start from this inherent topological property and propose using it as a semantic criterion for clustering. We present the corresponding learning algorithms, together with their theoretical justification and analysis. Experiments on data sets where conventional clustering algorithms generally fail report promising results.
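As a minimal sketch of the idea behind such a criterion, the snippet below estimates the local intrinsic dimension around each data point with the standard Levina–Bickel maximum-likelihood estimator and separates points whose neighborhoods look one-dimensional (a noisy curve) from those that look two-dimensional (a uniform patch). This is an illustrative assumption-laden toy, not the paper's actual algorithm; the data, the threshold of 1.5, and the brute-force distance computation are all choices made here for brevity.

```python
import numpy as np

def local_intrinsic_dim(X, k=10):
    """Levina-Bickel MLE estimate of intrinsic dimension at each point.

    For each point, uses the distances T_1 <= ... <= T_k to its k nearest
    neighbors:  m_hat = (k - 1) / sum_{j<k} log(T_k / T_j).
    """
    n = X.shape[0]
    # brute-force pairwise Euclidean distances (fine for a toy example)
    D = np.sqrt(((X[:, None, :] - X[None, :, :]) ** 2).sum(-1))
    dims = np.empty(n)
    for i in range(n):
        d = np.sort(D[i])[1:k + 1]          # skip the zero distance to self
        dims[i] = (k - 1) / np.log(d[-1] / d[:-1]).sum()
    return dims

# Toy data: a noisy 1-D curve and a 2-D uniform patch, both embedded in R^2.
rng = np.random.default_rng(0)
t = np.sort(rng.uniform(0, 4 * np.pi, 200))
curve = np.c_[t, np.sin(t)] + 0.01 * rng.normal(size=(200, 2))
patch = rng.uniform(8, 12, size=(200, 2))
X = np.vstack([curve, patch])

dims = local_intrinsic_dim(X, k=10)
# Curve points should receive estimates near 1, patch points near 2;
# thresholding the estimate yields a dimensionality-based grouping.
labels = (dims > 1.5).astype(int)
```

Note that the two groups here are distinguished by the dimensionality of their local neighborhoods rather than by inter-point distances, which is exactly the kind of structure a purely distance-based clustering criterion cannot express.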
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
Cite this paper
Li, W., Lee, KH., Leung, KS. (2006). Clustering with a Semantic Criterion Based on Dimensionality Analysis. In: King, I., Wang, J., Chan, LW., Wang, D. (eds) Neural Information Processing. ICONIP 2006. Lecture Notes in Computer Science, vol 4233. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11893257_88
Print ISBN: 978-3-540-46481-5
Online ISBN: 978-3-540-46482-2