
Statistical mechanics of neural networks: From the geometry of high dimensional error landscapes to beating power law neural scaling

Start: 2022-09-28T14:00:00-05:00
End: 2022-09-28T15:00:00-05:00
Location: CMSA Room G10

September 28, 2022 @ 2:00 pm - 3:00 pm

New Technologies in Mathematics

Speaker: Surya Ganguli, Stanford University

Title: Statistical mechanics of neural networks: From the geometry of high dimensional error landscapes to beating power law neural scaling

Abstract: Statistical mechanics and neural network theory have long enjoyed fruitful interactions. We will review some of our recent work in this area and then focus on two vignettes. First, we will analyze the high-dimensional geometry of neural network error landscapes that happen to arise as the classical limit of a dissipative many-body quantum optimizer. In particular, we will use the Kac-Rice formula and the replica method to calculate the number, location, energy levels, and Hessian eigenspectra of all critical points of any index. Second, we will review recent work on neural power laws, which reveal that the error of many neural networks falls off as a power law with network size or dataset size. Such power laws have motivated significant societal investments in large-scale model training and data collection efforts. Inspired by statistical mechanics calculations, we show both in theory and in practice how to beat neural power law scaling with respect to dataset size, sometimes achieving exponential scaling, by collecting small, carefully curated datasets rather than large random ones.

References: Y. Bahri, J. Kadmon, J. Pennington, S. Schoenholz, J. Sohl-Dickstein, and S. Ganguli, Statistical mechanics of deep learning, Annual Review of Condensed Matter Physics, 2020.

B. Sorscher, R. Geirhos, S. Shekhar, S. Ganguli, and A. S. Morcos, Beyond neural scaling laws: beating power law scaling via data pruning, NeurIPS 2022. https://arxiv.org/abs/2206.14486

Details

Date: September 28, 2022
Time: 2:00 pm - 3:00 pm
Event Category: New Technologies in Mathematics Seminar

Venue

CMSA Room G10; CMSA, 20 Garden Street
Cambridge, MA 02138 United States
Phone: 617-496-7132