An introduction to mixture of experts in deep learning
December 1, 2023 @ 12:00 pm - 1:00 pm
Member Seminar
Speaker: Samy Jelassi
Title: An introduction to mixture of experts in deep learning
Abstract: Scale has opened new frontiers in natural language processing – but at a high cost. Mixture-of-Experts (MoE) models have been proposed as a path to even larger and more capable language models. They select different parameters for each incoming example. By doing so, the parameter count is decoupled from the compute per example, leading to very large but efficient models. In this talk, I will review the concept of mixture of experts, provide a basic description of the Switch Transformers model, characterize some of their behaviors, and conclude by highlighting some open problems in the field. This talk is mainly based on the following papers: https://arxiv.org/pdf/2101.03961.pdf and https://arxiv.org/pdf/2209.01667.pdf.
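To make the routing idea concrete, here is a minimal sketch (not the speaker's code, and not the exact Switch Transformers implementation) of a top-1 routed MoE layer in PyTorch; all class and variable names are illustrative assumptions. Each token is sent to a single expert, so total parameters grow with the number of experts while the compute per token stays roughly that of one feed-forward block.

```python
# Minimal sketch of a Switch-style (top-1) mixture-of-experts layer.
# Names and dimensions are illustrative assumptions, not a reference implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SwitchMoELayer(nn.Module):
    def __init__(self, d_model: int, d_ff: int, num_experts: int):
        super().__init__()
        # Each expert is an independent feed-forward block; parameter count
        # scales with num_experts, but each token only uses one expert.
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        ])
        # The router scores every expert for every token.
        self.router = nn.Linear(d_model, num_experts)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (num_tokens, d_model); tokens are routed independently.
        gate_logits = self.router(x)                 # (num_tokens, num_experts)
        gate_probs = F.softmax(gate_logits, dim=-1)
        expert_idx = gate_probs.argmax(dim=-1)       # top-1 expert per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = expert_idx == e
            if mask.any():
                # Scale by the gate probability so the router receives gradients.
                out[mask] = gate_probs[mask, e].unsqueeze(-1) * expert(x[mask])
        return out


# Example: 16 tokens, model dim 32, 4 experts — compute per token is one expert's FFN.
layer = SwitchMoELayer(d_model=32, d_ff=64, num_experts=4)
tokens = torch.randn(16, 32)
print(layer(tokens).shape)  # torch.Size([16, 32])
```

This sketch omits pieces a production MoE layer would need, such as expert capacity limits and a load-balancing auxiliary loss, which are among the topics the referenced papers discuss.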