Abstract
Measuring similarity between objects is a fundamental issue for numerous applications in the data-mining and machine-learning domains. In this paper, we are interested in kernels. We particularly focus on kernel normalization methods, which aim at designing proximity measures that better fit the definition and intuition of a similarity index. To this end, we introduce a new family of normalization techniques that extends cosine normalization. Our approach refines the cosine measure between vectors in the feature space by incorporating another geometry-based score, the ratio of the mapped vectors' norms. We show that the resulting normalized kernels satisfy the basic axioms of a similarity index, unlike most unnormalized kernels. Furthermore, we prove that the proposed normalized kernels are themselves kernels. Finally, we assess these similarity measures on clustering tasks using a kernel PCA based clustering approach. Experiments on several real-world datasets show the potential benefits of the proposed normalized kernels over cosine normalization and the Gaussian RBF kernel.
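The abstract describes extending cosine normalization of a kernel with a second geometric score, the ratio of the mapped vectors' norms. The paper's exact family of normalizations is not reproduced here; the sketch below shows standard cosine normalization of a Gram matrix and one plausible norm-ratio refinement, where the exponent `t` and the min/max ratio form are illustrative assumptions rather than the authors' definition.

```python
import numpy as np

def cosine_normalize(K):
    """Cosine normalization: K(x, y) / sqrt(K(x, x) * K(y, y))."""
    d = np.sqrt(np.diag(K))          # norms of the mapped vectors
    return K / np.outer(d, d)

def norm_ratio_normalize(K, t=1.0):
    """Hypothetical refinement: scale the cosine measure by the
    mapped vectors' norm ratio min(|x|,|y|)/max(|x|,|y|), raised
    to an illustrative power t (t is an assumption, not from the paper)."""
    d = np.sqrt(np.diag(K))
    ratio = np.minimum.outer(d, d) / np.maximum.outer(d, d)
    return cosine_normalize(K) * ratio ** t
```

Both variants leave a unit diagonal and values bounded by 1 in absolute value, consistent with the basic axioms of a similarity index mentioned in the abstract.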
© 2010 Springer-Verlag Berlin Heidelberg
Cite this paper
Ah-Pine, J. (2010). Normalized Kernels as Similarity Indices. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2010. Lecture Notes in Computer Science(), vol 6119. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13672-6_36
Print ISBN: 978-3-642-13671-9
Online ISBN: 978-3-642-13672-6