Stochastique Gradient Descent#
Ou descente de gradient stochastique en français.
(à venir)
Lectures
HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
Stochastic Majorization-Minimization Algorithms for Large-Scale Optimization
Accelerating Stochastic Gradient Descent using Predictive Variance Reduction
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes