2022-02-28 Lectures

Large Batch Optimization for Deep Learning: Training BERT in 76 minutes