Abstract
This paper concerns online algorithms for online binary linear classification (OBLC) problems in Machine learning. In a sense of “online” classification, an instance sequence is given step by step and on each round, these problems consist in finding a linear classifier for predicting to which label a new instance belongs. In OBCL, the quality of predictions is assessed by a loss function, specifically 0–1 loss function. In fact, this loss function is nonconvex, nonsmooth and thus, such problems become intractable. In literature, Perceptron is a well-known online classification algorithm, in which one substitutes a surrogate convex loss function for the 0–1 loss function. In this paper, we investigate an efficient DC loss function which is a suitable approximation of the usual 0–1 loss function. Basing on Online DC (Difference of Convex functions) programming and Online DCA (DC Algorithms) [10], we develop an online classification algorithm. Numerical experiments on several test problems show the efficiency of our proposed algorithm with respect to Perceptron.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Azoury, K., Warmuth, M.: Relative loss bounds for on-line density estimation with the exponential family of distributions. Mach. Learn. 43(3), 211–246 (2001). http://dx.doi.org/10.1023/A3A1010896012157
Crammer, K., Singer, Y.: Ultraconservative online algorithms for multiclass problems. J. Mach. Learn. Res. 3, 951–991 (2003). http://dx.doi.org/10.1162/jmlr.2003.3.4-5.951
Gentile, C.: A new approximate maximal margin classification algorithm. J. Mach. Learn. Res. 2, 213–242 (2002). http://dl.acm.org/citation.cfm?id=944790.944811
Gentile, C.: The robustness of the p-norm algorithms. Mach. Learn. 53(3), 265–299 (2003). http://dx.doi.org/10.1023/A:1026319107706
Hoi, S.C.H., Wang, J., Zhao, P.: Libol: a library for online learning algorithms. J. Mach. Learn. Res. 15(1), 495–499 (2014). http://dl.acm.org/citation.cfm?id=2627435.2627450
Kalai, A., Vempala, S.: Efficient algorithms for online decision problems. J. Comput. Syst. Sci. 71(3), 291–307 (2005). http://www.sciencedirect.com/science/article/pii/S0022000004001394
Kivinen, J., Warmuth, M.: Relative loss bounds for multidimensional regression problems. Mach. Learn. 45(3), 301–329 (2001). http://dx.doi.org/10.1023/A:1017938623079
Kivinen, J., Warmuth, M.K.: Exponentiated gradient versus gradient descent for linear predictors. Inf. Comput. 132(1), 1–63 (1997). http://www.sciencedirect.com/science/article/pii/S0890540196926127
Le Thi, H.A.: DC programming and DCA (2012). http://www.lita.univ-lorraine.fr/~lethi
Le Thi, H.A., Ho, V.T.: Online DC programming and DCA. LITA, Technical report, University of Lorraine, Metz, France (2015)
Le Thi, H.A., Pham Dinh, T.: The DC (difference of convex functions) programming and DCA revisited with DC models of real world nonconvex optimization problems. Ann. Oper. Res. 133(1–4), 23–46 (2005)
Li, Y., Long, P.: The relaxed online maximum margin algorithm. Mach. Learn. 46(1–3), 361–387 (2002). http://dx.doi.org/10.1023/A:1012435301888
Novikoff, A.B.: On convergence proofs for perceptrons. Proc. Symp. Math. Theor. Automata 12, 615–622 (1963). http://citeseer.comp.nus.edu.sg/context/494822/0
Pham Dinh, T., Le Thi, H.A.: Convex analysis approach to d.c. programming: theory, algorithms and applications. Acta Mathematica Vietnamica 22(1), 289–355 (1997)
Pham Dinh, T., Le Thi, H.A.: DC optimization algorithms for solving the trust region subproblem. SIAM J. Optim. 8(2), 476–505 (1998)
Rosenblatt, F.: The perceptron: a probabilistic model for information storage and organization in the brain. Psychol. Rev. 65(6), 386–408 (1958)
Shalev-Shwartz, S.: Online learning: theory, algorithms, and applications. Ph.D. thesis, The Hebrew University of Jerusalem (2007)
Shalev-Shwartz, S., Singer, Y.: A primal-dual perspective of online learning algorithms. Mach. Learn. 69(2–3), 115–142 (2007). http://dx.doi.org/10.1007/s10994-007-5014-x
Van Der Malsburg, C.: Frank rosenblatt: principles of neurodynamics: perceptrons and the theory of brain mechanisms. In: Palm, G., Aertsen, A. (eds.) Brain Theory, pp. 245–248. Springer, Heidelberg (1986). http://dx.doi.org/10.1007/978-3-642-70911-1_20
Zinkevich, M.: Online convex programming and generalized infinitesimal gradient ascent. In: Fawcett, T., Mishra, N. (eds.) Proceedings of the 20th International Conference on Machine Learning (ICML-03), pp. 928–936 (2003). http://www.aaai.org/Papers/ICML/2003/ICML03-120.pdf
Acknowledgements
This research is funded by Foundation for Science and Technology Development of Ton Duc Thang University (FOSTECT), website: http://fostect.tdt.edu.vn, under Grant FOSTECT.2015.BR.15.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Thanh, H.V., An, L.T.H., Chien, B.D. (2016). Online DC Optimization for Online Binary Linear Classification. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, TP. (eds) Intelligent Information and Database Systems. ACIIDS 2016. Lecture Notes in Computer Science(), vol 9622. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-49390-8_64
Download citation
DOI: https://doi.org/10.1007/978-3-662-49390-8_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-49389-2
Online ISBN: 978-3-662-49390-8
eBook Packages: Computer ScienceComputer Science (R0)