Asymptotic Theory of Attention: In-Context Learning and Sparse Token Detection
Colloquium Speaker: Yue M. Lu, Harvard University Title: Asymptotic Theory of Attention: In-Context Learning and Sparse Token Detection Abstract: Attention-based architectures exhibit striking emergent abilities—from learning tasks directly from context to detecting rare, […]