Scaling Stochastic Momentum from Theory to LLMs
New Technologies in Mathematics Seminar Speaker: Courtney Paquette, McGill University Title: Scaling Stochastic Momentum from Theory to LLMs Abstract: Given the massive scale of modern ML models, we now often get only […]