Subgradient methods for huge-scale optimization problems

Nesterov, Yu.

doi:10.1007/s10107-013-0686-4

Subgradient methods for huge-scale optimization problems

Full Length Paper
Series A
Published: 25 May 2013

Volume 146, pages 275–297, (2014)
Cite this article

Mathematical Programming Submit manuscript

Yu. Nesterov¹

2173 Accesses
37 Citations
1 Altmetric
Explore all metrics

Abstract

We consider a new class of huge-scale problems, the problems with sparse subgradients. The most important functions of this type are piece-wise linear. For optimization problems with uniform sparsity of corresponding linear operators, we suggest a very efficient implementation of subgradient iterations, which total cost depends logarithmically in the dimension. This technique is based on a recursive update of the results of matrix/vector products and the values of symmetric functions. It works well, for example, for matrices with few nonzero diagonals and for max-type functions. We show that the updating technique can be efficiently coupled with the simplest subgradient methods, the unconstrained minimization method by B.Polyak, and the constrained minimization scheme by N.Shor. Similar results can be obtained for a new nonsmooth random variant of a coordinate descent scheme. We present also the promising results of preliminary computational experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Frank-Wolfe Algorithm: A Short Introduction

Article Open access 13 December 2023

Random Gradient-Free Minimization of Convex Functions

Article 30 November 2015

An Overview of Stochastic Quasi-Newton Methods for Large-Scale Machine Learning

Article Open access 25 February 2023

Notes

We count as one operation the pair of operations formed by real multiplication and addition.

References

Khachiyan, L., Tarasov, S., Erlich, E.: The inscribed ellipsoid method. Sov. Math. Dokl. 37, 226–230 (1988)
Google Scholar
Luo, Z.Q., Tseng, P.: On the convergence rate of dual ascent methods for linearly constrained convex minimization. Math. Oper. Res. 18(2), 846–867 (1993)
Article MATH MathSciNet Google Scholar
Nesterov, Yu.: Smooth minimization of non-smooth functions. Math. Program. A 103(1), 127–152 (2005)
Article MATH MathSciNet Google Scholar
Nesterov, Yu.: Primal-dual subgradient methods for convex problems. Math. Program. 120(1), 261–283 (2009)
Article MathSciNet Google Scholar
Nesterov, Yu.: Efficiency of coordinate descent methods on huge-scale optimization problems. CORE iscussion paper 2010/2. Accepted by SIOPT
Nesterov, Yu., Nemirovskii, A.: Interior Point Polynomial Methods in Convex Programming: Theory and Applications. SIAM, Philadelphia (1994)
Book Google Scholar
Gilpin, A., Peña, J., Sandholm, T.: First-order algorithm with \(O(\ln (1/\epsilon ))\) convergence for \(\epsilon \)-equilibrium in two-person zero-sum games. Math. Program. 133(2), 279–296 (2012)
Polyak, B.: Introduction to Optimization. Optimization Software, Inc., New York (1987)
Richtárik, P., Takac, M.: Iteration complexity of randomized block-coordinate descent methods for minimizing a composite function. April 2011 (revised July 4, 2011). Math. Program. doi:10.1007/s10107-012-0614-z
Shor, N.: Minimization Methods for Non-differentiable Functions. Springer Series in Computational Mathematics. Springer, Berlin (1985)
Book Google Scholar

Download references

Acknowledgments

The author would like to thank two the anonymous referees and associated editor for their very useful comments.

Author information

Authors and Affiliations

Center for Operations Research and Econometrics (CORE), Catholic University of Louvain (UCL), 34 voie du Roman Pays, 1348 , Louvain-la-Neuve, Belgium
Yu. Nesterov

Authors

Yu. Nesterov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yu. Nesterov.

Additional information

The research presented in this paper was partially supported by the Laboratory of Structural Methods of Data Analysis in Predictive Modeling, MIPT, through the RF government grant, ag.11.G34.31.0073.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nesterov, Y. Subgradient methods for huge-scale optimization problems. Math. Program. 146, 275–297 (2014). https://doi.org/10.1007/s10107-013-0686-4

Download citation

Received: 30 January 2012
Accepted: 10 May 2013
Published: 25 May 2013
Issue Date: August 2014
DOI: https://doi.org/10.1007/s10107-013-0686-4

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Subgradient methods for huge-scale optimization problems

Abstract

Access this article

Similar content being viewed by others

The Frank-Wolfe Algorithm: A Short Introduction

Random Gradient-Free Minimization of Convex Functions

An Overview of Stochastic Quasi-Newton Methods for Large-Scale Machine Learning

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Subgradient methods for huge-scale optimization problems

Abstract

Access this article

Similar content being viewed by others

The Frank-Wolfe Algorithm: A Short Introduction

Random Gradient-Free Minimization of Convex Functions

An Overview of Stochastic Quasi-Newton Methods for Large-Scale Machine Learning

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation