Abstract
The proposed MEMPM model obtains the decision hyperplane by using only global information, e.g. the mean and covariance matrices. However, although these moments can be more reliably obtained than estimating the distribution, they may still be inaccurate in many cases, e.g. when the data are very sparse.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bertsekas DP (1999) Nonlinear Programming. Belmont, MA: Athena Scientific 2nd edition
Blake CL, Merz CJ (1998) Repository of machine learning databases, University of California, Irvine, http://www.ics.uci.edu/~mlearn/MLRepository.html
Breiman L (1997) Arcing classifiers. Technical Report 460, Statistics Department, University of California
Fukunaga K(1990). Introduction to Statistical Pattern Recognition. San Diego, CA: Academic Press, 2nd edition
Huang K, Yang H, King I, Lyu MR (2004) Learning large margin classifiers locally and globally. In The 21st International Conference on Machine Learning (ICML-2004)
Huang K, Yang H, King I, Lyu MR, Chan L (2004) The minimum error minimax probability machine. Journal of Machine Learning Research 5: 1253–1286
Huang K, Yang H, King I, Lyu MR, Chan L (2007). Maxi-Min Margin Machine: Learning large margin classifiers globally and locally. To appear in IEEE Trans. Neural Networks
Ivannov VV (1962) On linear problems which are not well-posed. Soviet Math. Docl. 3(4): 981–983
Kohavi R (1995). A study of cross validation and bootstrap for accuracy estimation and model selection. In Proceedings of the Fourtheenth International Joint Conference on Artificial Intelligence (IJCAI-1995). San Francisco, CA: Morgan Kaufmann 338–345
Kruk S, Wolkowicz H (2000) General nonlinear programming. In H. Wolkowicz, R. Saigal, and L. Vandenberghe, editors, Handbook of Semidefinite Programming: Theory, Algorithms, and Applications. Boston, MA: Kluwer Academic Publishers 563–575
Lanckriet GRG, Ghaoui LE, Bhattacharyya C, Jordan MI (2002) A robust minimax approach to classification. Journal of Machine Learning Research 3: 555–582
Lobo M, Vandenberghe L, Boyd S, Lebret H (1998) Applications of second order cone programming. Linear Algebra and its Applications 284: 193–228
Luntz A, Brailovsky V(1969) On estimation of characters obtained in statistical procedure of recognition (in Russian). Technicheskaya Kibernetica 3(6)
Marshall AW, Olkin I (1960) Multivariate Chebyshev inequalities. Annals of Mathematical Statistics 31(4): 1001–1014
Nesterov Y, Nemirovsky A (1994) Interior point polynomial methods in convex programming: Theory and applications. Philadelphia, PA: SIAM
Platt J(1998) Sequential minimal optimization: A fast algorithm for training support vector machines. Technical Report MSR-TR-98-14
Pruessner A(2003). Conic programming in GAMS. In Optimization Software-The State of the Art. INFORMS Atlanta, http://www.gamsworld.org/cone/links.htm
Schölkopf, Smola A(2002) Learning with Kernels. Cambridge, MA: The MIT Press
Smola AJ, Bartlett PL, Scholkopf B, Schuurmans D (2000) Advances in Large Margin Classifiers. Cambridge, MA: The MIT Press
Sturm JF (1999) Using sedumi 1.02, a matlab toolbox for optimization over symmetric cones. Optimization Methods and Software 11: 625–653
Sturm JF (2000) Central region method. In J.B.G. Frenk, C. Roos, T. Terlaky, and S. Zhang, editors, High Performance Optimization. Boston, MA: Kluwer Academic Publishers 157–194
Tikhonov AN (1963) On solving ill-posed problem and method of regularization. Doklady Akademii Nauk USSR 153: 501–504
Tong S, Koller D (2000) Restricted Bayes optimal classifiers. In Proceedings of the 17th National Conference on Artificial Intelligence (AAAI), Austin, Texas 658–664
Vapnik VN (1998) Statistical Learning Theory. New York, NY: John Wiley & Sons
Vapnik VN (1999) The Nature of Statistical Learning Theory. New York, NY: Springer Verlag, 2nd edition
Vasin VV (1970) Relationship of several variational methods for approximate solutions of ill-posed problems. Math. Notes 7: 161–166
Zhu J, Rosset S, Hastie T, Tibshirani R(2003) 1-norm support vector machines. In Advances in Neural Information Processing Systems (NIPS 16)
Rights and permissions
Copyright information
© 2008 Zhejiang University Press, Hangzhou and Springer-Verlag GmbH Berlin Heidelberg
About this chapter
Cite this chapter
(2008). Learning Locally and Globally: Maxi-Min Margin Machine. In: Machine Learning. Advanced Topics in Science and Technology in China. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79452-3_4
Download citation
DOI: https://doi.org/10.1007/978-3-540-79452-3_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-79451-6
Online ISBN: 978-3-540-79452-3
eBook Packages: Computer ScienceComputer Science (R0)