Thinking Like Transformers – A Practical Session
Virtual
New Technologies in Mathematics Seminar
Speaker: Gail Weiss, EPFL
Title: Thinking Like Transformers - A Practical Session
Abstract: With the help of the RASP programming language, we can better imagine how transformers (the powerful attention-based sequence processing architecture) solve certain tasks. Some tasks, such as simply repeating or reversing an input sequence, have reasonably straightforward solutions, […]