Abstract
Considering a general linear ill-posed equation, we explore the duality arising from the requirement that the discrepancy should take a given value based on the estimation of the noise level, as is notably the case when using the Morozov principle. We show that, under reasonable assumptions, the dual function is smooth, and that its maximization points out the appropriate value of Tikhonov’s regularization parameter. The numerical relevance of our approach is established by means of an illustrative example from nonparametric instrumental regression, a standard problem in statistics.
References
Borwein, J., Lewis, A.: Convex Analysis and Nonlinear Optimization, CMS Books in Mathematics, 2nd edn. Springer, Berlin (2005)
Engl, H.W., Hanke, M., Neubauer, A.: Regularization of Inverse Problems. Springer, Berlin (1996)
Fletcher, R.: Practical Methods of Optimization: Unconstrained Optimization. Wiley, New York (1980)
Frick, K., Grasmair, M.: Regularization of linear ill-posed problems by the augmented Lagrangian method and variational inequalities. Inverse Prob. 28, 1–16 (2012)
Hall, P., Horowitz, J.: Nonparametric methods for inference in the presence of instrumental variables. Ann. Stat. 33(6), 2904–2929 (2005)
Hantoute, A., López, M.A., Zălinescu, C.: Subdifferential calculus rules in convex analysis: a unifying approach via pointwise supremum functions. SIAM J. Optim. 19(2), 863–882 (2008)
Hiriart-Urruty, J.-B., Lemaréchal, C.: Convex Analysis and Minimization Algorithms I. A Series of Comprehensive Studies in Mathematics. Springer, Berlin (1993)
Hiriart-Urruty, J.-B., Lemaréchal, C.: Convex Analysis and Minimization Algorithms II. A Series of Comprehensive Studies in Mathematics. Springer, Berlin (1993)
Hnětynková, I., Plešinger, M., Strakoš, Z.: The regularizing effect of the Golub-Kahan iterative bidiagonalization and revealing the noise level in the data. BIT Numer. Math. 49, 669–696 (2009)
Kirsch, A.: An Introduction to the Mathematical Theory of Inverse Problems. Springer, Berlin (2011)
Lemaréchal, C.: A view of line-searches. In: Auslender, A., Oettli, W., Stoer, J. (eds.) Optimization and Optimal Control, Lecture Notes in Control and Information Sciences, vol. 30. Springer, Berlin (1981)
Morozov, V.A.: Choice of parameter for the solution of functional equations by the regularization method. Sov. Math. Dokl. 8, 1000–1003 (1967)
Rockafellar, R.T.: Convex Analysis. Princeton University Press, Princeton (1970)
Tikhonov, A.N., Arsenin, V.Y.: Solutions of Ill-Posed Problems. Wiley, New York (1977)
Wolfe, P.: Convergence conditions for ascent methods. SIAM Rev. 11, 226–235 (1969)
Zălinescu, C.: Convex Analysis in General Vector Spaces. World Scientific, Singapore (2002)
Acknowledgements
We thank the anonymous referees for their helpful comments and suggestions which have enabled us to improve the manuscript.
Appendix: Algorithmic Details
In order to maximize D, which we reformulate here as the minimization of \(\bar {D}:=-D\), we combine a quasi-Newton method with a Wolfe-Lemaréchal line-search (see Algorithms 1 and 2 below).
Recall that at each iteration k, the update takes the form \(\lambda _{k+1} = \lambda _{k} + \alpha _{k} d_{k}\), where \(d_{k}\) denotes the descent direction computed from \(\bar {D}^{\prime }\) and an approximation of \(\bar {D}^{\prime \prime }\). The stepsize \(\alpha _{k}\) is chosen so as to satisfy the two Wolfe conditions (C1) and (C2) (see [11]), which guarantee the monotonicity of the sequence \(\bar {D}(\lambda _{k})\):
(C1) \(\bar {D}(\lambda _{k} + \alpha _{k} d_{k}) \leq \bar {D}(\lambda _{k}) + \beta _{1}\, \alpha _{k}\, \bar {D}^{\prime }(\lambda _{k})\, d_{k}\);
(C2) \(\bar {D}^{\prime }(\lambda _{k} + \alpha _{k} d_{k})\, d_{k} \geq \beta _{2}\, \bar {D}^{\prime }(\lambda _{k})\, d_{k}\).
In (C1) and (C2), the parameters \(\beta _{1}\) and \(\beta _{2}\) are taken in (0,1) (see [3, 15]). In Algorithm 1, the stopping criterion is \(| \bar {D}^{\prime }(\lambda _{k})| < \epsilon \), where \(\epsilon > 0\) is the tolerance. For computing \(\alpha _{k}\) at line 6, we propose Algorithm 2, which is based on the line-search scheme of [11].
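To fix ideas, here is a minimal Python sketch of the outer iteration just described. This is not the authors' code: the function names are ours, the curvature of \(\bar {D}\) is approximated by a secant update (the one-dimensional analogue of BFGS, since the regularization parameter is scalar), and, for brevity, the step is computed by Armijo backtracking enforcing only (C1); a complete implementation would use the Wolfe search of Algorithm 2.

```python
def armijo_step(f, lam, d, f0, slope, beta1=0.25):
    """Backtracking step enforcing only condition (C1); assumes d is a
    descent direction, i.e. slope = f'(lam) * d < 0."""
    alpha = 1.0
    while f(lam + alpha * d) > f0 + beta1 * alpha * slope:
        alpha *= 0.5
    return alpha

def minimize_Dbar(f, fprime, lam0, eps=1e-8, max_iter=100):
    """Sketch of Algorithm 1 for a scalar parameter: quasi-Newton
    iteration lam_{k+1} = lam_k + alpha_k * d_k, stopping as soon as
    |Dbar'(lam_k)| < eps."""
    lam = lam0
    g = fprime(lam)
    H = 1.0                               # approximation of 1 / Dbar''
    for _ in range(max_iter):
        if abs(g) < eps:                  # stopping criterion
            break
        d = -H * g                        # descent direction
        alpha = armijo_step(f, lam, d, f(lam), g * d)
        lam_new = lam + alpha * d
        g_new = fprime(lam_new)
        if g_new != g:
            H = (lam_new - lam) / (g_new - g)   # secant (1-D BFGS) update
        lam, g = lam_new, g_new
    return lam
```

On a smooth convex objective, the secant update makes H converge to the true inverse curvature, so the iteration is locally superlinear, as expected from a quasi-Newton scheme.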
In Algorithm 2, \((\alpha _{g}, \alpha _{d})\) is the interval in which the step \(\alpha _{k}\) will be chosen. Here, M is a large number which emulates ∞; for example, one may set M = 1010. Recall that the failure of Condition (C1) means that \(\alpha _{k}\) is too large, while the failure of Condition (C2) means that \(\alpha _{k}\) is too small. If it happens that \(|\alpha _{d} - \alpha _{g}| \approx 0\), indicating that \(\beta _{1}\) is too big or \(\beta _{2}\) is too small, one may simply adjust these parameters. This situation occurs rarely. For instance, \(\beta _{1} = 0.25\) and \(\beta _{2} = 0.75\) worked perfectly well for all our simulations.
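The bracketing mechanism of Algorithm 2 can be sketched as follows. This is our own reconstruction, not the published pseudocode: the trial step is halved into \((\alpha _{g}, \alpha _{d})\) when (C1) fails, the lower end is raised when (C2) fails, and the step is extrapolated as long as the upper end still equals M.

```python
def wolfe_linesearch(f, fprime, lam, d, beta1=0.25, beta2=0.75,
                     M=1e10, max_iter=100):
    """Bracketing Wolfe line-search in the spirit of Algorithm 2.
    f, fprime: the objective Dbar and its derivative (scalar argument);
    lam, d: current point and descent direction (fprime(lam) * d < 0)."""
    f0 = f(lam)
    slope = fprime(lam) * d             # directional derivative, negative
    alpha_g, alpha_d = 0.0, M           # bracketing interval (alpha_g, alpha_d)
    alpha = 1.0
    for _ in range(max_iter):
        if f(lam + alpha * d) > f0 + beta1 * alpha * slope:
            alpha_d = alpha             # (C1) fails: step too large
        elif fprime(lam + alpha * d) * d < beta2 * slope:
            alpha_g = alpha             # (C2) fails: step too small
        else:
            return alpha                # both Wolfe conditions hold
        # bisect once the interval is finite; extrapolate otherwise
        alpha = 0.5 * (alpha_g + alpha_d) if alpha_d < M else 2.0 * alpha_g
    return alpha
```

Each rejected trial strictly shrinks (or, during extrapolation, doubles) the bracket, so for a smooth objective bounded below a step satisfying both conditions is found after finitely many trials, unless the interval collapses as discussed above.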
Cite this article
Bonnefond, X., Maréchal, P. & Lee, W.C.S.T. A Note on the Morozov Principle via Lagrange Duality. Set-Valued Var. Anal 26, 265–275 (2018). https://doi.org/10.1007/s11228-018-0470-y