First-order inertial algorithms involving dry friction damping

Adly, Samir; Attouch, Hedy

doi:10.1007/s10107-020-01613-y

First-order inertial algorithms involving dry friction damping

Full Length Paper
Series A
Published: 13 January 2021

Volume 193, pages 405–445, (2022)
Cite this article

Mathematical Programming Submit manuscript

979 Accesses
8 Citations
1 Altmetric
Explore all metrics

Abstract

In a Hilbert space $ {\mathcal H}$, we introduce a new class of first-order algorithms which naturally occur as discrete temporal versions of an inertial differential inclusion jointly involving viscous friction and dry friction. The function $f:{\mathcal H}\rightarrow {\mathbb {R}}$ to be minimized is supposed to be differentiable (not necessarily convex), and enters the algorithm via its gradient. The dry friction damping function $\phi :{\mathcal H}\rightarrow {\mathbb {R}}_+$ is convex with a sharp minimum at the origin, (typically $\phi (x) = r \Vert x\Vert $ with $r >0$). It enters the algorithm via its proximal mapping, which acts as a soft threshold operator on the velocities. As a result, we obtain a new class of splitting algorithms involving separately the proximal and gradient steps. The sequence of iterates has a finite length, and therefore strongly converges towards an approximate critical point $x_{\infty }$ of f (typically $\Vert \nabla f(x_{\infty })\Vert \le r$). Under a geometric property satisfied by the limit point $x_{\infty }$, we obtain geometric and finite rates of convergence. The convergence results tolerate the presence of errors, under the sole assumption of their asymptotic convergence towards zero. By replacing the function f by its Moreau envelope, we extend the results to the case of nonsmooth convex functions. In this case, the algorithm involves the proximal operators of f and $\phi $ separately. Several variants of this algorithm are considered, including the case of the Nesterov accelerated gradient method. We then consider the extension in the case of additive composite optimization, thus leading to new splitting methods. Numerical experiments are given for Lasso-type problems. The performance profiles, as a comparison tool, demonstrate the efficiency of the Nesterov accelerated method with asymptotic vanishing damping combined with dry friction.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

First order inertial optimization algorithms with threshold effects associated with dry friction

Article 10 August 2023

A Doubly Nonlinear Evolution System with Threshold Effects Associated with Dry Friction

Article 10 April 2024

Newton-Type Inertial Algorithms for Solving Monotone Equations Governed by Sums of Potential and Nonpotential Operators

Article 10 May 2022

Notes

This interesting suggestion was made to us by one of the two anonymous reviewers.
We thank the anonymous reviewer for suggesting it.
https://sparse.tamu.edu.

References

Adly, S.: A Variational Approach to Nonsmooth Dynamics: Applications in Unilateral Mechanics and Electronics, Springer Briefs in Mathematics. Springer, Berlin (2017)
Book Google Scholar
Adly, S., Attouch, H., Cabot, A.: Finite-time stabilization of nonlinear oscillators subject to dry friction, nonsmooth mechanics and analysis. In: Advances in Mechanics and Mathematics, vol. 12, pp. 289–304. Springer, New York (2006)
Adly, S., Brogliato, B., Le, B.K.: Well-posednesss, robustness and stability analysis of a set-valued controller for Lagrangian systems. SIAM J. Control Optim. 51(2), 1592–1614 (2013)
Article MathSciNet Google Scholar
Alvarez, F.: On the minimizing property of a second-order dissipative system in Hilbert spaces. SIAM J. Control Optim. 38(4), 1102–1119 (2000)
Article MathSciNet Google Scholar
Amann, H., Díaz, J.I.: A note on the dynamics of an oscillator in the presence of strong friction. Nonlinear Anal. 55, 209–216 (2003)
Article MathSciNet Google Scholar
Apidopoulos, V., Aujol, J.-F., Dossal, Ch.: Convergence Rate of Inertial Forward-Backward Algorithm Beyond Nesterov’s Rule, Mathematical Programming, Series A, pp. 1–20. Springer, Berlin (2018)
Attouch, H., Buttazzo, G., Michaille, G.: Variational analysis in Sobolev and BV spaces. Applications to PDEs and optimization, 2nd Edn, MOS/SIAM Series on Optimization, MO 17. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA (2014)
Attouch, H., Cabot, A.: Asymptotic stabilization of inertial gradient dynamics with time-dependent viscosity. J. Differ. Equ. 263, 5412–5458 (2017)
Article MathSciNet Google Scholar
Attouch, H., Cabot, A.: Convergence rates of inertial forward–backward algorithms. SIAM J. Optim. 28(1), 849–874 (2018)
Article MathSciNet Google Scholar
Attouch, H., Cabot, A., Chbani, Z., Riahi, H.: Rate of convergence of inertial gradient dynamics with time-dependent viscous damping coefficient. Evol. Equ. Control Theory 7(3), 353–371 (2018)
Article MathSciNet Google Scholar
Attouch, H., Chbani, Z., Peypouquet, J., Redont, P.: Fast convergence of inertial dynamics and algorithms with asymptotic vanishing viscosity. Math. Program. Ser. B 168, 123–175 (2018)
Article MathSciNet Google Scholar
Attouch, H., Chbani, Z., Riahi, H.: Rate of convergence of the Nesterov accelerated gradient method in the subcritical case $\alpha \le 3$, ESAIM-COCV, 25, published electronically (2019)
Attouch, H., Peypouquet, J.: The rate of convergence of Nesterov’s accelerated forward–backward method is actually faster than $1/k^2$. SIAM J. Optim. 26(3), 1824–1834 (2016)
Article MathSciNet Google Scholar
Attouch, H., Chbani, Z., Fadili, J., Riahi, H.: First-order optimization algorithms via inertial systems with Hessian driven damping. Math. Program. Ser. A (2020). https://doi.org/10.1007/s10107-020-01591-1
Article Google Scholar
Aujol, J.-F., Dossal, Ch.: Stability of over-relaxations for the forward–backward algorithm, application to FISTA. SIAM J. Optim. 25(4), 2408–2433 (2015)
Article MathSciNet Google Scholar
Baji, B., Cabot, A.: An inertial proximal algorithm with dry friction: finite convergence results. Set Valued Anal. 9(1), 1–23 (2006)
Article MathSciNet Google Scholar
Bauschke, H., Combettes, P.L.: Convex Analysis and Monotone Operator Theory in Hilbert Spaces. CMS Books in Mathematics. Springer, Berlin (2011)
MATH Google Scholar
Beck, A., Teboulle, M.: A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2(1), 183–202 (2009)
Article MathSciNet Google Scholar
Bot, R. I., Csetnek, E. R., László, S. C.: A second order dynamical approach with variable damping to nonconvex smooth minimization. Appl. Anal. (2018) (to appear)
Bot, R.I., Csetnek, E.R.: Second order forward–backward dynamical systems for monotone inclusion problems. SIAM J. Control Optim. 54(3), 1423–1443 (2016)
Article MathSciNet Google Scholar
Brézis, H.: Opérateurs maximaux monotones dans les espaces de Hilbert et équations d’évolution, Lecture Notes, vol. 5. North Holland, Amsterdam (1972)
Google Scholar
Chambolle, A., Dossal, Ch.: On the convergence of the iterates of the fast iterative shrinkage thresholding algorithm. J. Optim. Theory Appl. 166, 968–982 (2015)
Article MathSciNet Google Scholar
Chambolle, A., Pock, T.: An introduction to continuous optimization for imaging. Acta Numer. 25, 161–319 (2016)
Article MathSciNet Google Scholar
Díaz, J.I., Liñán, A.: On the asymptotic behavior of a damped oscillator under a sublinear friction term. Rev. R. Acad. Cien. Serie A. Mat. 95(1), 155–160 (2001)
MATH Google Scholar
Dolan, E.D., Moré, J.J.: Benchmarking optimization software with performance profiles. Math. Program. 91, 201–213 (2002)
Article MathSciNet Google Scholar
Ghadimi, E., Feyzmahdavian, H. R., Johansson, M.: Global convergence of the heavy-ball method for convex optimization. In: 2015 European Control Conference, July, pp. 310–315 (2015)
Haraux, A., Jendoubi, M.A.: Convergence of solutions of second-order gradient-like systems with analytic nonlinearities. J. Differ. Equ. 144(2), 313–320 (1998)
Article MathSciNet Google Scholar
Haraux, A., Jendoubi, M.A.: The Convergence Problem for Dissipative Autonomous Systems, Classical Methods and Recent Advances. Springer, Berlin (2015)
Book Google Scholar
Lemaréchal, C., Sagastizábal, C.: Practical aspects of the Moreau–Yosida regularization: theoretical preliminaries. SIAM J. Optim. 7(2), 367–385 (1997)
Article MathSciNet Google Scholar
May, R.: Asymptotic for a second-order evolution equation with convex potential and vanishing damping term. Turk. J. Math. 41(3), 681–685 (2017)
Article MathSciNet Google Scholar
Nesterov, Y.: A method of solving a convex programming problem with convergence rate O(1/k2). Sov. Math. Dokl. 27, 372–376 (1983)
MATH Google Scholar
Nesterov, Y.: Introductory Lectures on Convex Optimization: A Basic Course, Applied Optimization, vol. 87. Kluwer Academic Publishers, Boston, MA (2004)
Book Google Scholar
Peypouquet, J., Sorin, S.: Evolution equations for maximal monotone operators: asymptotic analysis in continuous and discrete time. J. Convex Anal. 17(3–4), 1113–1163 (2010)
MathSciNet MATH Google Scholar
Polyak, B.T.: Some methods of speeding up the convergence of iterative methods. Z. Vylist Math. Fiz. 4, 1–17 (1964)
MathSciNet Google Scholar
Polyak, B.T.: Introduction to Optimization. Optimization Software, New York (1987)
MATH Google Scholar
Siegel, J. W.: Accelerated first-order methods: Differential equations and Lyapunov functions, arXiv:1903.05671v5 [math.OC] (2019)
Su, W., Boyd, S., Candès, E.J.: A differential equation for modeling Nesterov’s accelerated gradient method. J. Mach. Learn. Res. 17, 1–43 (2016)
MathSciNet MATH Google Scholar

Download references

Acknowledgements

The authors would like to thank the two anonymous reviewers as well as the associated editor for their careful reading and their relevant suggestions and comments that helped considerably to improve this paper.

Author information

Authors and Affiliations

Laboratoire XLIM, Université de Limoges, 123 Avenue Albert Thomas, 87060, Limoges Cedex, France
Samir Adly
IMAG, Univ. Montpellier, CNRS, Place Eugène Bataillon, 34095, Montpellier Cedex 5, France
Hedy Attouch

Authors

Samir Adly
View author publications
You can also search for this author in PubMed Google Scholar
Hedy Attouch
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Samir Adly.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Auxiliary results

1.1 Finite-time convergence of the continuous dynamic

Theorem 7

Let $f:{\mathcal H}\rightarrow {\mathbb {R}}$ be a $\mathcal C^1$ function whose gradient is Lipschitz continuous, and let $\phi : {\mathcal H}\rightarrow {\mathbb {R}}$ be a convex continuous function that satisfies (DF). Suppose that the function $\gamma : [t_0, +\infty [ \rightarrow {\mathbb {R}}_+$ belongs to $L^1 ([t_0, T])$ for any $T>t_0$. Then, the following properties hold:

a) For any Cauchy data $(x_0, \dot{x}_0 ) \in {\mathcal H}\times {\mathcal H}$, there exists a unique strong global solution of the Heavy Ball system with Dry Friction

$$\begin{aligned} \mathrm{(HBDF)} \qquad \ddot{x}(t) + \gamma (t)\dot{x}(t) + \partial \phi (\dot{x}(t)) + \nabla f (x(t)) \ni 0, \end{aligned}$$

(65)

satisfying $x(t_0) = x_0$, and $\dot{x}(t_0)=\dot{x}_0 $.

b) For any solution trajectory x of $\mathrm{(HBDF)} $ we have:

(i) $\Vert \dot{x}\Vert \in L^1([t_0,+\infty [,{\mathbb {R}})$, and therefore $x_\infty :=$ $\lim _{t\rightarrow +\infty } x(t)$ exists.

(ii) The limit point $x_\infty $ is an equilibrium point of $\mathrm{(HBDF)} $, i.e.

$$\begin{aligned} -\nabla f(x_\infty )\in \partial \phi (0). \end{aligned}$$

(66)

(iii) If

$$\begin{aligned} -\nabla f(x_\infty )\not \in \text{ boundary }(\partial \phi (0)),\end{aligned}$$

then there exists $t_1\ge 0$ such that $x(t)=x_\infty $ for every $t\ge t_1$.

Proof

An existence proof based on a regularization technique, by using the Moreau-Yosida approximation of $\phi $, was given in [2] in a finite dimensional setting. We present here an original proof of the existence and uniqueness part a) of Theorem 7, in a general Hilbert space, which is based on the study of evolution equations governed by the Lipschitz perturbation of maximally monotone operators (see [21]). It is used in an essential way that $\nabla f$ is Lipschitz continuous over the entire space ${\mathcal H}$.

Write $\mathrm{(HBDF)} $ as

$$\begin{aligned} \ddot{x}(t) + \gamma (t)\dot{x}(t)+ \partial \phi (\dot{x}(t)) \ni -\nabla f \left( x_0 + \int _{t_0}^t \dot{x}(\tau )d\tau \right) . \end{aligned}$$

Setting $u(t):= \dot{x}(t)$, this amounts to solving the first-order evolution equation

$$\begin{aligned} \dot{u}(t) + \gamma (t) u(t)+ \partial \phi (u(t)) + \nabla f \left( x_0 + \int _{t_0}^t u(\tau )d\tau \right) \ni 0 \end{aligned}$$

with the Cauchy data $u(t_0)= \dot{x}_0$. Let us introduce the (non-local) operator

$$\begin{aligned} F(u)(t)= \nabla f \left( x_0 + \int _{t_0}^t u(\tau )d\tau \right) . \end{aligned}$$

Thus, we have to solve

$$\begin{aligned} \dot{u}(t) + \gamma (t) u(t) + \partial \phi (u(t)) + F(u)(t) \ni 0. \end{aligned}$$

(67)

For any two trajectories u and v, we have

$$\begin{aligned} \Vert F(u)(t) - F(v)(t) \Vert \le L \int _{t_0}^t \Vert u(\tau )- v(\tau )\Vert d\tau , \end{aligned}$$

where L is the Lipschitz constant of $\nabla f $. Following the approach developed in [21, Proposition 3.12, page 106], we consider the sequence $(u_n)$ defined recursively by

$$\begin{aligned} \dot{u}_{n+1}(t)+ \gamma (t)u_{n+1}(t) + \partial \phi (u_{n+1}(t)) + F(u_n)(t) \ni 0. \end{aligned}$$

(68)

Given $u_n$, the existence and uniqueness of $u_{n+1}$ solution of (68 )with $u_{n+1}(0) = \dot{x}_0$ is ensured by the classical results concerning the evolution equations governed by subdifferentials of convex functions (see [21, Theorem 3.6, page 72], [7, Theorem 17.2.5]). Let’s give $T >t_0$. According to the above Lipschitz continuity property of F, the monotonicity of $\partial \phi $, and $\gamma (t) \ge 0$, we have for all $0 \le t \le T$

$$\begin{aligned} \Vert u_{n+1}(t) - u_{n}(t)\Vert \le L(t-t_0) \int _{t_0}^t \Vert u_{n}(\tau ) - u_{n-1}(\tau )\Vert d\tau , \end{aligned}$$

which gives

$$\begin{aligned} \Vert u_{n+1}(t) - u_{n}(t)\Vert \le \frac{(L (t-t_0)^n}{n!}t^2 \Vert u_1-u_0 \Vert _{L^{\infty }(t_0, T)}. \end{aligned}$$

This implies that $(u_n)$ is a Cauchy sequence for the uniform convergence on $[t_0,T]$. Consequently, it converges uniformly on $[t_0,T]$ to a solution u of (67). So, this uniquely defines $u=\dot{x}$, and at the same time x which is given by to $x(t)= x_0 + \int _{t_0}^t u(\tau )d\tau $.

For part b), we refer to [2, Theorem 3.2 ].

Remark 8

With the condition $-\nabla f(x_\infty )\not \in \text{ boundary }(\partial \phi (0)),$ the finite-time convergence of the trajectory to a stationary point of the dynamic (HBDF) is ensured, i.e. there exists $t_1\ge 0$ such that $x(t)=x_\infty $ for every $t\ge t_1$. In addition, an estimate of the final time could be given. In fact, we can show, by integrating the differential inequality satisfied by $\alpha (t)= \Vert \dot{x}(t)\Vert ^2$

$$\begin{aligned} \dot{\alpha }(t) + 2\epsilon h \sqrt{\alpha (t)}\le 0,\;t\in [0,+\infty [ \end{aligned}$$

(69)

that

$$\begin{aligned} t_1\le t_0 +\frac{2\Vert \dot{x}(t_0)\Vert }{\mathrm{dist}\Big (-\nabla f(x_\infty ), \mathrm{boundary}(\partial \phi (0) \Big )},\end{aligned}$$

where $t_0$ is the first time instant such that

$$\begin{aligned}&\nabla f(x(t))\in \nabla f(x_\infty )+B(0,\varepsilon ),\hbox { for all } t\ge t_0,\hbox { with }\varepsilon \\&\quad =\frac{1}{2} \mathrm{dist}\Big (-\nabla f(x_\infty ), \mathrm{boundary}(\partial \phi (0) \Big ). \end{aligned}$$

We refer to [3] for Lagrangian systems.

Remark 9

The conclusions of Theorem 7 are valid under the key assumption $-\nabla f(x_\infty ) \not \in \text{ boundary }(\partial \phi (0))$. Since the boundary of the convex set $\partial \phi (0)$ has an empty interior, it is reasonable to think that the circumstances leading to the relation $-\nabla f(x_\infty )\in \text{ boundary }(\partial \phi (0))$ are “exceptional”. More precisely, we conjecture that generically with respect to the initial data $(x_0,\dot{x}_0)\in {\mathbb {R}}^n \times {\mathbb {R}}^n$, the point $x_\infty =\lim _{t\rightarrow +\infty }x(t)$ satisfies the condition $-\nabla f(x_\infty ) \not \in \text{ boundary }(\partial \phi (0))$. Consequently, this would give a generic finite-time stabilization result in the case of dry friction.

Let us give a counter-example to convergence in finite-time when the condition $-\nabla f(x_\infty ) \not \in \text{ boundary }(\partial \phi (0))$ is not satisfied, i.e. $-\nabla f(x_\infty ) \in \text{ boundary }(\partial \phi (0))$. For that purpose, take ${\mathcal H}={\mathbb {R}}$, $\phi :=|\,.\,|$ (so that $\partial \phi (0)=[-1,1]$), $\gamma =2$ and $f:=|\,.\,|^2/2$. The differential inclusion $\mathrm{(HBDF)}$ then reads

$$\begin{aligned} \ddot{x}(t)+ \text{ sign }(\dot{x}(t))+2\, \dot{x}(t)+x(t)\ni 0. \end{aligned}$$

Let us choose as initial conditions $x(0)=-2$ and $\dot{x}(0)=1$. The unique solution of $\mathrm{(HBDF)}$ is given by $x(t)=-1-e^{-t}$, $t\ge 0$. The trajectory tends toward the value $x_\infty =-1$, which satisfies $-f'(x_\infty )=1 \in \text{ boundary }(\partial \phi (0))$. However the convergence does not hold in a finite-time.

Remark 10

It is natural to know if convergence in finite-time is specific to the dry friction situation $0\in \mathrm{int}(\partial \phi (0))$. To answer this question Amann-Diaz [5] and Diaz-Linan [24] considered the damped oscillator in ${\mathcal H}= {\mathbb {R}}$

$$\begin{aligned} \ddot{x}(t)+ |\dot{x}(t)|^{\alpha -1} \dot{x}(t)) +x(t)= 0, \end{aligned}$$

where $\alpha \in ]0,1[$. This corresponds to a sub-linear friction, the case of dry friction corresponds to the limiting case $\alpha =0$. They have shown the existence of two curves in the phase space such that, for the solution trajectories with initial data $(x_0, \dot{x}_0)$ belonging to these two curves, there is finite-time stabilization at the origin. Using both energetic and geometrical arguments, they showed that for many other initial data, the solution tends to zero in infinite time, at the rate $t^{-\frac{\alpha }{1-\alpha }}$.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Adly, S., Attouch, H. First-order inertial algorithms involving dry friction damping. Math. Program. 193, 405–445 (2022). https://doi.org/10.1007/s10107-020-01613-y

Download citation

Received: 23 November 2019
Accepted: 29 December 2020
Published: 13 January 2021
Issue Date: May 2022
DOI: https://doi.org/10.1007/s10107-020-01613-y

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

First-order inertial algorithms involving dry friction damping

Abstract

Access this article

Similar content being viewed by others

First order inertial optimization algorithms with threshold effects associated with dry friction

A Doubly Nonlinear Evolution System with Threshold Effects Associated with Dry Friction

Newton-Type Inertial Algorithms for Solving Monotone Equations Governed by Sums of Potential and Nonpotential Operators

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Auxiliary results

1.1 Finite-time convergence of the continuous dynamic

Theorem 7

Proof

Remark 8

Remark 9

Remark 10

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

First-order inertial algorithms involving dry friction damping

Abstract

Access this article

Similar content being viewed by others

First order inertial optimization algorithms with threshold effects associated with dry friction

A Doubly Nonlinear Evolution System with Threshold Effects Associated with Dry Friction

Newton-Type Inertial Algorithms for Solving Monotone Equations Governed by Sums of Potential and Nonpotential Operators

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Auxiliary results

Auxiliary results

1.1 Finite-time convergence of the continuous dynamic

Theorem 7

Proof

Remark 8

Remark 9

Remark 10

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation