Abstract
Support Vector Machines (SVM) have been applied successfully in a wide variety of fields in the last decade. The SVM problem is formulated as a convex objective function subject to box constraints that needs to be maximized, a quadratic programming (QP) problem. In order to solve the QP problem on larger data sets specialized algorithms and heuristics are required. In this paper we present a new data-squashing method for selecting training instances in support vector learning. Inspired by the growing neural gas algorithm and learning vector quantization we introduce a new, parameter robust neural gas variant to retrieve an initial approximation of the training set containing only those samples that will likely become support vectors in the final classifier. This first approximation is refined in the border areas, defined by neighboring neurons of different classes, yielding the final training set. We evaluate our approach on synthetic as well as real-life datasets, comparing run-time complexity and accuracy to a random sampling approach and the exact solution of the support vector machine. Results show that runtime-complexity can be significantly reduced while achieving the same accuracy as the exact solution and that furthermore our approach does not not rely on data set specific parameterization of the sampling rate like random sampling for doing so. Source code, binary executables as well as the reformatted standard data sets are available for download at http://www.know-center.tugraz.at/forschung/ knowledge_relationship_discovery/downloads_demos/ sngsvm_source_executables
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vapnik, V.N.: Statistical Learning Theory. Wiley, Chichester (1998)
Shin, H., Cho, S.: Fast pattern selection for support vector classifiers. In: Whang, K.-Y., Jeon, J., Shim, K., Srivastava, J. (eds.) PAKDD 2003. LNCS (LNAI), vol. 2637, pp. 376–387. Springer, Heidelberg (2003)
Yu, H., Yang, J., Han, J.: Classifying large data sets using svms with hierarchical clusters (2003)
Boley, D., Cao, D.: Training support vector machine using adaptive clustering. In: Proceedings of the SIAM International Conference on Data Mining, April 2004, pp. 126–137 (2004)
Kohonen, T.: Learning vector quantization. Neural Networks 1(suppl. 1), 3–16 (1988)
Fritzke, B.: A growing neural gas network learns topologies. In: Advances in Neural Information Processing Systems, vol. 7, pp. 625–632. MIT Press, Cambridge (1995)
Syed, N.A., Liu, H., Sung, K.K.: A study of support vectors on model independent example selection. In: KDD, pp. 272–276 (1999)
Gersho, A., Gray, R.M.: Vector quantization and signal compression. Kluwer Academic Publishers, Dordrecht (1992)
Fritzke, B.: A growing neural gas network learns topologies. In: Neural Information Processing Systems, pp. 625–632 (1994)
Shin, H., Cho, S.: Invariance of neighborhood relation under input space to feature space mapping. Pattern Recognition Letters 26(6), 707–718 (2005)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zechner, M., Granitzer, M. (2009). A Competitive Learning Approach to Instance Selection for Support Vector Machines. In: Karagiannis, D., Jin, Z. (eds) Knowledge Science, Engineering and Management. KSEM 2009. Lecture Notes in Computer Science(), vol 5914. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10488-6_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-10488-6_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10487-9
Online ISBN: 978-3-642-10488-6
eBook Packages: Computer ScienceComputer Science (R0)