# Proximal Gradient Methods for Machine Learning and Imaging

## Abstract

Convex optimization plays a key role in data science. The objective of this work is to provide the basic tools and methods at the core of modern nonlinear convex optimization. Starting from the gradient descent method, we focus on a comprehensive convergence analysis of the proximal gradient algorithm and its state-of-the-art variants, including accelerated, stochastic, and block-wise implementations, which are now widely used to solve machine learning and inverse problems.

## Notes

1. Note that if $$\inf \Phi = -\infty$$, it follows from (18) that $$\inf \Phi =\sup (-\Psi )= - \inf \Psi =-\infty$$. In this case, $$\Psi \equiv +\infty$$ and the expression $$\inf \Phi + \inf \Psi = -\infty + \infty$$ is undefined. However, since there is no gap between $$\Phi$$ and $$-\Psi$$, we set $$\inf \Phi + \inf \Psi = 0$$ by convention. The same situation occurs if $$\inf \Psi =-\infty$$.
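
The argument in this note can be written out as a short chain. This is only a sketch, under the assumption that (18), which is not reproduced in this section, is the weak duality inequality $$\Phi(x) \geq -\Psi(u)$$ for all $$x$$ and $$u$$; the symbols $$x$$ and $$u$$ are placeholder names for the primal and dual variables.

$$
-\infty = \inf \Phi \;\geq\; \sup(-\Psi) = -\inf \Psi
\quad\Longrightarrow\quad \inf \Psi = +\infty
\quad\Longrightarrow\quad \Psi \equiv +\infty .
$$

Writing the duality gap as $$\inf \Phi - \sup(-\Psi) = \inf \Phi + \inf \Psi$$, the convention amounts to assigning the value $$0$$ to the otherwise undefined expression $$-\infty + \infty$$.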

## Acknowledgements

The work of S. Villa has been supported by the ITN-ETN project TraDE-OPT, funded by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska–Curie grant agreement No 861137, and by the project “Processi evolutivi con memoria descrivibili tramite equazioni integro-differenziali”, funded by the Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni (GNAMPA) of the Istituto Nazionale di Alta Matematica (INdAM).

Saverio Salzo

