SGD Algorithms based on Incomplete U-statistics: Large-Scale Minimization of Empirical Risk

Abstract: In many learning problems, ranging from clustering to ranking through metric learning, empirical estimates of the risk functional consist of an average over tuples (e.g., pairs or triplets) of observations, rather than over individual observations. In this paper, we focus on how to best implement a stochastic approximation approach to solve such risk minimization problems. We argue that in the large-scale setting, gradient estimates should be obtained by sampling tuples of data points with replacement (incomplete U-statistics) instead of sampling data points without replacement (complete U-statistics based on subsamples). We develop a theoretical framework accounting for the substantial impact of this strategy on the generalization ability of the prediction model returned by the Stochastic Gradient Descent (SGD) algorithm. It reveals that the method we promote achieves a much better trade-off between statistical accuracy and computational cost. Beyond the rate bound analysis, experiments on AUC maximization and metric learning provide strong empirical evidence of the superiority of the proposed approach.
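The sampling scheme promoted in the abstract can be illustrated with a small sketch. The code below is not the authors' implementation; it is a minimal, hypothetical example of an SGD step for a pairwise risk (here a hinge surrogate for AUC maximization), where the gradient is estimated from a fixed budget of index pairs drawn with replacement, i.e., an incomplete U-statistic, instead of averaging over all O(n²) pairs. All function names and hyperparameters are illustrative.

```python
import numpy as np

def incomplete_ustat_gradient(w, X, y, loss_grad, batch_pairs=50, rng=None):
    """Estimate the gradient of a pairwise empirical risk from
    `batch_pairs` index pairs drawn with replacement (an incomplete
    U-statistic), rather than averaging over all O(n^2) pairs."""
    rng = np.random.default_rng() if rng is None else rng
    n = len(y)
    i = rng.integers(0, n, size=batch_pairs)
    j = rng.integers(0, n, size=batch_pairs)
    grads = [loss_grad(w, X[a], y[a], X[b], y[b]) for a, b in zip(i, j)]
    return np.mean(grads, axis=0)

def pairwise_hinge_grad(w, xi, yi, xj, yj):
    """Gradient of a hinge surrogate for AUC: penalize pairs where a
    positive example is not scored above a negative one by margin 1."""
    if yi == yj:
        return np.zeros_like(w)  # same-class pairs do not contribute
    pos, neg = (xi, xj) if yi > yj else (xj, xi)
    margin = 1.0 - w @ (pos - neg)
    return -(pos - neg) if margin > 0 else np.zeros_like(w)

# Toy SGD loop on synthetic linearly scored data (illustrative only).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
true_w = np.array([1.0, -1.0, 0.5, 0.0, 0.0])
y = (X @ true_w > 0).astype(int)

w = np.zeros(5)
for t in range(500):
    g = incomplete_ustat_gradient(w, X, y, pairwise_hinge_grad,
                                  batch_pairs=50, rng=rng)
    w -= 0.1 * g
```

Each SGD iteration costs O(batch_pairs) gradient evaluations regardless of n, which is the computational appeal of sampling pairs with replacement; the paper's analysis compares the statistical accuracy of this estimator against the complete U-statistic built on a subsample of the same size.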


https://hal.telecom-paristech.fr/hal-02107492
Contributor: Stephan Clémençon
Submitted on: Tuesday, April 23, 2019 - 4:37:57 PM
Last modification on: Wednesday, September 11, 2019 - 1:28:17 AM


Identifiers

  • HAL Id: hal-02107492, version 1

Citation

Guillaume Papa, Stéphan Clémençon, Aurélien Bellet. SGD Algorithms based on Incomplete U-statistics: Large-Scale Minimization of Empirical Risk. Advances in Neural Information Processing Systems 28 (NIPS 2015), 2015. ⟨hal-02107492⟩

