Minerva: Solving Quantitative Reasoning Problems with Language Models

10/05/2022 2:00 pm - 4:00 pm
CMSA Room G10
Address: CMSA, 20 Garden Street, Cambridge, MA 02138 USA

New Technologies in Mathematics Seminar

Speaker: Guy Gur-Ari, Google Research

Title: Minerva: Solving Quantitative Reasoning Problems with Language Models

Abstract: Quantitative reasoning tasks which can involveĀ mathematics, science, and programming are often challenging for machine learning models in general and for language models in particular. We show that transformer-based language models obtain significantly better performance on math and science questions when trained in an unsupervised way on a large, math-focused dataset. Performance can be further improved using prompting and sampling techniques including chain-of-thought and majority voting. Minerva, a model that combines these techniques, achieves SOTA on several math and science benchmarks. I will describe the model, its capabilities and limitations.