Abstract
In this paper, we propose a trust-region-based stochastic Levenberg–Marquardt algorithm for stochastic nonlinear least squares problems, in which stochastic Jacobians and gradients replace their exact counterparts. We show that the function estimates and the models of the objective function are probabilistically accurate provided the number of samples at each iteration is chosen appropriately. Furthermore, we prove that, with probability one, at least one accumulation point of the sequence generated by the proposed algorithm is a stationary point of the objective function.
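The iteration described above can be illustrated with a minimal sketch. This is not the authors' algorithm; it is a generic stochastic Levenberg–Marquardt step with a trust-region-style ratio test, under assumed interfaces: `residual(x, idx)` and `jacobian(x, idx)` are hypothetical callbacks returning the residual vector and Jacobian evaluated on a sampled subset `idx` of the data.

```python
import numpy as np

def stochastic_lm(residual, jacobian, x0, n_data, batch=32,
                  lam=1.0, eta=1e-3, max_iter=200, rng=None):
    """Sketch of a stochastic Levenberg-Marquardt method with a
    trust-region-style acceptance test. `residual` and `jacobian`
    are hypothetical user-supplied callbacks evaluated on a sample."""
    rng = rng or np.random.default_rng(0)
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        # Draw a random subset of the data (the stochastic estimates).
        idx = rng.choice(n_data, size=batch, replace=False)
        r = residual(x, idx)
        J = jacobian(x, idx)
        g = J.T @ r  # stochastic gradient of 0.5*||r||^2
        # Solve the regularized (LM) Gauss-Newton subproblem:
        # (J^T J + lam*I) d = -g, lam acting as the trust-region control.
        d = np.linalg.solve(J.T @ J + lam * np.eye(x.size), -g)
        # Predicted reduction of the sampled model vs. the estimated
        # actual reduction of the sampled objective.
        pred = -(g @ d + 0.5 * d @ (J.T @ J) @ d)
        r_new = residual(x + d, idx)
        ared = 0.5 * (r @ r - r_new @ r_new)
        rho = ared / max(pred, 1e-16)
        if rho >= eta:              # successful step: accept, relax lam
            x = x + d
            lam = max(lam / 2.0, 1e-8)
        else:                       # unsuccessful: reject, increase lam
            lam = min(lam * 4.0, 1e8)
    return x
```

For example, fitting a scalar location parameter to data points `d_i` via residuals `F_i(x) = x - d_i` drives the iterate toward the sample mean. The update of `lam` here mirrors the usual trust-region mechanism: shrinking `lam` enlarges the implicit trust region after a successful step, and growing it contracts the region after a rejected one.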
Author information
Contributions
J. -Y. Fan made the research plan and W.-Y. Shao performed the research.
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest regarding this work.
Additional information
The authors are supported by Shanghai Municipal Science and Technology Key Project (No. 22JC1401500), the National Natural Science Foundation of China (Nos. 1971309 and 12371307), and the Fundamental Research Funds for the Central Universities.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shao, W.-Y., Fan, J.-Y.: Global Convergence of a Stochastic Levenberg–Marquardt Algorithm Based on Trust Region. J. Oper. Res. Soc. China (2024). https://doi.org/10.1007/s40305-023-00529-6