A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
https://youtu.be/OoimTbnSe7I Speaker: Nikunj Saunshi, Dept. of Computer Science, Princeton University Title: A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks Abstract: Autoregressive language models pretrained on large corpora have been successful at solving downstream tasks, even with zero-shot usage. However, there is little theoretical justification for their success. This paper considers the following […]