Toward Demystifying Transformers and Attention
https://youtu.be/MSw8HV0eHo8 Speaker: Ben Edelman, Harvard Computer Science Title: Toward Demystifying Transformers and Attention Abstract: Over the past several years, attention mechanisms (primarily in the form of the Transformer architecture) have revolutionized deep learning, leading to advances in natural language processing, computer vision, code synthesis, protein structure prediction, and beyond. Attention has a remarkable ability to enable the […]