Data-dependent and Scale-Invariant Kernel for Support Vector Machine Classification

Malgi, Vinayaka Vivekananda; Aryal, Sunil; Rasool, Zafaryab; Tay, David

doi:10.1007/978-3-031-33374-3_14

Vinayaka Vivekananda Malgi¹⁰,
Sunil Aryal¹⁰,
Zafaryab Rasool¹⁰ &
…
David Tay¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13935))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

1120 Accesses

Abstract

Kernel similarity function allows a Support Vector Machine (SVM) classifier to learn the maximum margin hyperplane in a higher dimensional space where two classes are linearly separable without explicitly mapping the data. Most existing kernel functions (e.g., RBF) use spatial positions of two data instances in the input space to compute their similarity. These kernels are data distribution independent and sensitive to data representation (i.e., units/scales used to measure/express data). Since this can be unknown in many real-world applications, a careful selection of a suitable kernel is required for a given problem. In this paper, we present a new kernel function based on probability data mass that is both data-dependent and scale-invariant. Our empirical results show that the proposed SVM kernel outperforms popular existing kernels.

V. V. Malgi and S. Aryal—They contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aryal, S., Ting, K.M., Haffari, G., Washio, T.: Mp-dissimilarity: a data dependent dissimilarity measure. In: 2014 IEEE International Conference on Data Mining, pp. 707–712. IEEE (2014)
Google Scholar
Aryal, S., Ting, K.M., Washio, T., Haffari, G.: A comparative study of data-dependent approaches without learning in measuring similarities of data objects. Data Min. Knowl. Disc. 34(1), 124–162 (2020)
Article MathSciNet Google Scholar
Cristianini, N., Shawe-Taylor, J.: An introduction to support vector machines and other Kernel-based learning methods. Cambridge University Press (2000)
Google Scholar
Dua, D., Graff, C.: UCI machine learning repository. http://archive.ics.uci.edu/ml. University of california, Irvine, CA. School Inf. Comput. Sci. 25, 27 (2019)
Fernando, T.L., Webb, G.I.: SimUSF: an efficient and effective similarity measure that is invariant to violations of the interval scale assumption. Data Min. Knowl. Disc. 31(1), 264–286 (2017)
Article MathSciNet Google Scholar
Krumhansl, C.L.: Concerning the applicability of geometric models to similarity data: the interrelationship between similarity and spatial density. Psychol. Rev. 85(5), 445–463 (1978)
Article Google Scholar
Lin, D., et al.: An information-theoretic definition of similarity. In: International Conference on Machine Learning (ICML), pp. 296–304 (1998)
Google Scholar
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
MathSciNet Google Scholar
Ting, K.M., Zhu, Y., Zhou, Z.H.: Isolation kernel and its effect on SVM. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2329–2337 (2018)
Google Scholar

Download references

Acknowledgment

This material is based upon work supported by the U.S Air Force Office of Scientific Research under award number FA2386-20-1-4005.

Author information

Authors and Affiliations

Deakin University, Geelong, Waurn Ponds, VIC, 3216, Australia
Vinayaka Vivekananda Malgi, Sunil Aryal, Zafaryab Rasool & David Tay

Authors

Vinayaka Vivekananda Malgi
View author publications
You can also search for this author in PubMed Google Scholar
Sunil Aryal
View author publications
You can also search for this author in PubMed Google Scholar
Zafaryab Rasool
View author publications
You can also search for this author in PubMed Google Scholar
David Tay
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Vinayaka Vivekananda Malgi or Sunil Aryal .

Editor information

Editors and Affiliations

Kyoto University, Kyoto, Japan
Hisashi Kashima
IBM Research, Thomas J. Watson Research Center, Yorktown Heights, NY, USA
Tsuyoshi Ide
National Chiao Tung University, Hsinchu, Taiwan
Wen-Chih Peng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Malgi, V.V., Aryal, S., Rasool, Z., Tay, D. (2023). Data-dependent and Scale-Invariant Kernel for Support Vector Machine Classification. In: Kashima, H., Ide, T., Peng, WC. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2023. Lecture Notes in Computer Science(), vol 13935. Springer, Cham. https://doi.org/10.1007/978-3-031-33374-3_14

Download citation

DOI: https://doi.org/10.1007/978-3-031-33374-3_14
Published: 27 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33373-6
Online ISBN: 978-3-031-33374-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Data-dependent and Scale-Invariant Kernel for Support Vector Machine Classification