Convergence and Dynamical Behavior of the Adam Algorithm for Non Convex  Stochastic Optimization

Anas Barakat; Pascal Bianchi

Pré-Publication, Document De Travail Année : 2019

Convergence and Dynamical Behavior of the Adam Algorithm for Non Convex Stochastic Optimization

(1, 2, 3) , (1, 2, 3)

1
2
3

Anas Barakat

Fonction : Auteur
PersonId : 1106661

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Laboratoire Traitement et Communication de l'Information

Pascal Bianchi

Fonction : Auteur
PersonId : 846458

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Laboratoire Traitement et Communication de l'Information

Résumé

Adam is a popular variant of the stochastic gradient descent for finding a local minimizer of a function. The objective function is unknown but a random estimate of the current gradient vector is observed at each round of the algorithm. Assuming that the objective function is differentiable and non-convex, we establish the convergence in the long run of the iterates to a stationary point. The key ingredient is the introduction of a continuous-time version of Adam, under the form of a non-autonomous ordinary differential equation. The existence and the uniqueness of the solution are established, as well as the convergence of the solution towards the stationary points of the objective function. The continuous-time system is a relevant approximation of the Adam iterates, in the sense that the interpolated Adam process converges weakly to the solution to the ODE.

Domaines

Machine Learning [stat.ML] Systèmes dynamiques [math.DS] Optimisation et contrôle [math.OC] Mathématiques [math]

Fichier principal

convergence_adam.pdf (1.07 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Anas Barakat : Connectez-vous pour contacter le contributeur

https://telecom-paris.hal.science/hal-02366280

Soumis le : vendredi 15 novembre 2019-18:05:47

Dernière modification le : mercredi 13 décembre 2023-16:28:27

Archivage à long terme le : dimanche 16 février 2020-19:30:49

Dates et versions

hal-02366280 , version 1 (15-11-2019)

hal-02366280 , version 2 (18-11-2022)

Identifiants

HAL Id : hal-02366280 , version 1

Citer

Anas Barakat, Pascal Bianchi. Convergence and Dynamical Behavior of the Adam Algorithm for Non Convex Stochastic Optimization. 2019. ⟨hal-02366280v1⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

98 Consultations

442 Téléchargements