A kernel extension to handle missing data
An extension of univariate kernels that handles missing values is proposed. The extended kernels are shown to be valid Mercer kernels and can adapt to many types of variables, such as categorical or continuous. They are tested against standard RBF kernels on a variety of benchmark problems with different amounts of missing values and different variable types. The experimental results are satisfactory: the extended kernels usually yield improvements, ranging from slight to substantial, over those achieved with standard methods.
Keywords: Waste Water Treatment Plant; Normalize Root Mean Square Error; Kernel Machine; Univariate Kernel; Kernel Extension