Hitting the High-D(imensional) Notes: An ODE for SGD Learning Dynamics in High-dimensions

When and Where

Thursday, October 26, 2023 3:30 pm to 4:30 pm

9014

Ontario Power Building

700 University Avenue, Toronto, ON M5G 1Z5

Speakers

Courtney Paquette, McGill University

Description

In this talk, I will present a framework, inspired by random matrix theory, for analyzing the dynamics of stochastic optimization algorithms, such as stochastic gradient descent (SGD) and SGD with momentum (SGD+M), when both the number of samples and the number of dimensions are large. Using this framework, we show that the dynamics of optimization algorithms on generalized linear models and multi-index problems with random data become deterministic in the large-sample, high-dimensional limit. In particular, the limiting dynamics of the stochastic algorithms are governed by an ODE. In the least squares setting, this model lets us identify a stability measurement, the implicit conditioning ratio (ICR), which regulates the ability of SGD+M to accelerate the algorithm. When the batch size exceeds the ICR, SGD+M converges linearly at a rate of O(1/√κ), matching optimal full-batch momentum (in particular, performing as well as full-batch momentum but with a fraction of the batch size). For batch sizes smaller than the ICR, in contrast, SGD+M has rates that scale like a multiple of the single-batch SGD rate. We give explicit choices for the learning rate and momentum parameter, in terms of the Hessian spectra, that achieve this performance. Finally, we show that this model matches performance on real data sets.
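For readers unfamiliar with the algorithm under study, the following is a minimal sketch of mini-batch SGD with heavy-ball momentum on a random least squares problem. It is not the speaker's framework or code: the function name, the problem sizes, the noiseless planted-signal setup, and the learning rate and momentum values are all illustrative assumptions, chosen only so that the run is stable and fast in pure Python.

```python
import random


def sgd_momentum_least_squares(n=200, d=20, batch=10, lr=0.01, beta=0.5,
                               steps=1000, seed=0):
    """Run mini-batch SGD with heavy-ball momentum on min_x ||Ax - b||^2 / (2n).

    All default values are illustrative assumptions, not tuned choices
    from the talk. Returns the full-data loss after every step.
    """
    rng = random.Random(seed)

    # Random Gaussian data with a planted signal and noiseless targets.
    A = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    x_star = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = [sum(a * xs for a, xs in zip(row, x_star)) for row in A]

    def loss(x):
        # Full-data least squares loss, used only for monitoring.
        total = 0.0
        for row, b_i in zip(A, b):
            residual = sum(a * x_j for a, x_j in zip(row, x)) - b_i
            total += residual * residual
        return total / (2 * n)

    x = [0.0] * d          # iterate
    v = [0.0] * d          # momentum (velocity) buffer
    history = [loss(x)]

    for _ in range(steps):
        # Sample a mini-batch of row indices without replacement.
        idx = rng.sample(range(n), batch)

        # Mini-batch gradient: average of a_i * (a_i . x - b_i).
        grad = [0.0] * d
        for i in idx:
            residual = sum(a * x_j for a, x_j in zip(A[i], x)) - b[i]
            for j in range(d):
                grad[j] += residual * A[i][j] / batch

        # Heavy-ball update: v <- beta * v - lr * grad, then x <- x + v.
        v = [beta * v_j - lr * g_j for v_j, g_j in zip(v, grad)]
        x = [x_j + v_j for x_j, v_j in zip(x, v)]
        history.append(loss(x))

    return history


if __name__ == "__main__":
    losses = sgd_momentum_least_squares()
    print(f"initial loss: {losses[0]:.4f}, final loss: {losses[-1]:.3e}")
```

Varying `batch` while holding the other parameters fixed is one simple way to see how the batch size affects the loss curve, though this toy run makes no attempt to compute the ICR or reproduce the rates stated in the abstract.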

Please join the event.

About Courtney Paquette

Courtney Paquette is an assistant professor at McGill University and a Canada CIFAR AI Chair at Mila. Paquette’s research broadly focuses on designing and analyzing algorithms for large-scale optimization problems, motivated by applications in data science. She received her PhD from the mathematics department at the University of Washington (2017), held postdoctoral positions at Lehigh University (2017-2018) and the University of Waterloo (NSF postdoctoral fellowship, 2018-2019), and was a research scientist at Google Research, Brain Montreal (2019-2020).
