Learning Locally and Globally: Maxi-Min Margin Machine

doi:10.1007/978-3-540-79452-3_4

Part of the book series: Advanced Topics in Science and Technology in China ((ATSTC))

6155 Accesses

Abstract

The proposed MEMPM model obtains the decision hyperplane by using only global information, e.g. the mean and covariance matrices. However, although these moments can be more reliably obtained than estimating the distribution, they may still be inaccurate in many cases, e.g. when the data are very sparse.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bertsekas DP (1999) Nonlinear Programming. Belmont, MA: Athena Scientific 2nd edition
MATH Google Scholar
Blake CL, Merz CJ (1998) Repository of machine learning databases, University of California, Irvine, http://www.ics.uci.edu/~mlearn/MLRepository.html
Google Scholar
Breiman L (1997) Arcing classifiers. Technical Report 460, Statistics Department, University of California
Google Scholar
Fukunaga K(1990). Introduction to Statistical Pattern Recognition. San Diego, CA: Academic Press, 2nd edition
Google Scholar
Huang K, Yang H, King I, Lyu MR (2004) Learning large margin classifiers locally and globally. In The 21st International Conference on Machine Learning (ICML-2004)
Google Scholar
Huang K, Yang H, King I, Lyu MR, Chan L (2004) The minimum error minimax probability machine. Journal of Machine Learning Research 5: 1253–1286
MathSciNet Google Scholar
Huang K, Yang H, King I, Lyu MR, Chan L (2007). Maxi-Min Margin Machine: Learning large margin classifiers globally and locally. To appear in IEEE Trans. Neural Networks
Google Scholar
Ivannov VV (1962) On linear problems which are not well-posed. Soviet Math. Docl. 3(4): 981–983
Google Scholar
Kohavi R (1995). A study of cross validation and bootstrap for accuracy estimation and model selection. In Proceedings of the Fourtheenth International Joint Conference on Artificial Intelligence (IJCAI-1995). San Francisco, CA: Morgan Kaufmann 338–345
Google Scholar
Kruk S, Wolkowicz H (2000) General nonlinear programming. In H. Wolkowicz, R. Saigal, and L. Vandenberghe, editors, Handbook of Semidefinite Programming: Theory, Algorithms, and Applications. Boston, MA: Kluwer Academic Publishers 563–575
Google Scholar
Lanckriet GRG, Ghaoui LE, Bhattacharyya C, Jordan MI (2002) A robust minimax approach to classification. Journal of Machine Learning Research 3: 555–582
Article Google Scholar
Lobo M, Vandenberghe L, Boyd S, Lebret H (1998) Applications of second order cone programming. Linear Algebra and its Applications 284: 193–228
Article MATH MathSciNet Google Scholar
Luntz A, Brailovsky V(1969) On estimation of characters obtained in statistical procedure of recognition (in Russian). Technicheskaya Kibernetica 3(6)
Google Scholar
Marshall AW, Olkin I (1960) Multivariate Chebyshev inequalities. Annals of Mathematical Statistics 31(4): 1001–1014
Article MATH MathSciNet Google Scholar
Nesterov Y, Nemirovsky A (1994) Interior point polynomial methods in convex programming: Theory and applications. Philadelphia, PA: SIAM
Google Scholar
Platt J(1998) Sequential minimal optimization: A fast algorithm for training support vector machines. Technical Report MSR-TR-98-14
Google Scholar
Pruessner A(2003). Conic programming in GAMS. In Optimization Software-The State of the Art. INFORMS Atlanta, http://www.gamsworld.org/cone/links.htm
Schölkopf, Smola A(2002) Learning with Kernels. Cambridge, MA: The MIT Press
Google Scholar
Smola AJ, Bartlett PL, Scholkopf B, Schuurmans D (2000) Advances in Large Margin Classifiers. Cambridge, MA: The MIT Press
MATH Google Scholar
Sturm JF (1999) Using sedumi 1.02, a matlab toolbox for optimization over symmetric cones. Optimization Methods and Software 11: 625–653
Article MathSciNet Google Scholar
Sturm JF (2000) Central region method. In J.B.G. Frenk, C. Roos, T. Terlaky, and S. Zhang, editors, High Performance Optimization. Boston, MA: Kluwer Academic Publishers 157–194
Google Scholar
Tikhonov AN (1963) On solving ill-posed problem and method of regularization. Doklady Akademii Nauk USSR 153: 501–504
Google Scholar
Tong S, Koller D (2000) Restricted Bayes optimal classifiers. In Proceedings of the 17th National Conference on Artificial Intelligence (AAAI), Austin, Texas 658–664
Google Scholar
Vapnik VN (1998) Statistical Learning Theory. New York, NY: John Wiley & Sons
MATH Google Scholar
Vapnik VN (1999) The Nature of Statistical Learning Theory. New York, NY: Springer Verlag, 2nd edition
Google Scholar
Vasin VV (1970) Relationship of several variational methods for approximate solutions of ill-posed problems. Math. Notes 7: 161–166
MATH Google Scholar
Zhu J, Rosset S, Hastie T, Tibshirani R(2003) 1-norm support vector machines. In Advances in Neural Information Processing Systems (NIPS 16)
Google Scholar

Download references

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

(2008). Learning Locally and Globally: Maxi-Min Margin Machine. In: Machine Learning. Advanced Topics in Science and Technology in China. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79452-3_4

Download citation

DOI: https://doi.org/10.1007/978-3-540-79452-3_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-79451-6
Online ISBN: 978-3-540-79452-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics