CONVERGENCE AND DYNAMICAL BEHAVIOR OF THE ADAM ALGORITHM FOR NONCONVEX STOCHASTIC OPTIMIZATION

Anas Barakat; Pascal Bianchi

doi:10.1137/19M1263443

Back

CONVERGENCE AND DYNAMICAL BEHAVIOR OF THE ADAM ALGORITHM FOR NONCONVEX STOCHASTIC OPTIMIZATION

Journal article

Peer reviewed

CONVERGENCE AND DYNAMICAL BEHAVIOR OF THE ADAM ALGORITHM FOR NONCONVEX STOCHASTIC OPTIMIZATION

Anas Barakat and Pascal Bianchi

SIAM journal on optimization, Vol.31(1), pp.244-274

01/01/2021

DOI: https://doi.org/10.1137/19M1263443

Abstract

Mathematics

Mathematics, Applied

Physical Sciences

Science & Technology

Adam is a popular variant of stochastic gradient descent for finding a local minimizer of a function. In the constant stepsize regime, assuming that the objective function is differentiable and nonconvex, we establish the convergence in the long run of the iterates to a stationary point under a stability condition. The key ingredient is the introduction of a continuous-time version of Adam, under the form of a nonautonomous ordinary differential equation. This continuous-time system is a relevant approximation of the Adam iterates, in the sense that the interpolated Adam process converges weakly toward the solution to the ODE. The existence and the uniqueness of the solution are established. We further show the convergence of the solution toward the critical points of the objective function and quantify its convergence rate under a Lojasiewicz assumption. Then, we introduce a novel decreasing stepsize version of Adam. Under mild assumptions, it is shown that the iterates are almost surely bounded and converge almost surely to critical points of the objective function. Finally, we analyze the fluctuations of the algorithm by means of a conditional central limit theorem.

Metrics

1 Record Views

Details

Title: CONVERGENCE AND DYNAMICAL BEHAVIOR OF THE ADAM ALGORITHM FOR NONCONVEX STOCHASTIC OPTIMIZATION
Creators - without role: Anas Barakat - Laboratoire Traitement et Communication de l’Information
Pascal Bianchi - Laboratoire Traitement et Communication de l’Information
Publication Details: SIAM journal on optimization, Vol.31(1), pp.244-274
Publisher: Siam Publications
Number of pages: 31
Identifiers: 9911270709846
Academic Unit: ESD Pillar
Language: English
Resource Type: Journal article

CONVERGENCE AND DYNAMICAL BEHAVIOR OF THE ADAM ALGORITHM FOR NONCONVEX STOCHASTIC OPTIMIZATION

Abstract

Metrics

Details

Singapore University of Technology and Design Social media