BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//CMSA - ECPv6.15.18//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:CMSA
X-ORIGINAL-URL:https://cmsa.fas.harvard.edu
X-WR-CALDESC:Events for CMSA
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20260308T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20261101T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20270314T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20271107T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20260506T140000
DTEND;TZID=America/New_York:20260506T150000
DTSTAMP:20260426T044201
CREATED:20260421T144955Z
LAST-MODIFIED:20260421T150144Z
UID:10003935-1778076000-1778079600@cmsa.fas.harvard.edu
SUMMARY:New directions in synthetic data
DESCRIPTION:New Technologies in Mathematics Seminar \nSpeaker: Tatsunori Hashimoto\, Stanford \nTitle: New directions in synthetic data \nAbstract: Synthetic data has been an effective\, if boring set of techniques: prompt some language model to restructure your corpus to match some downstream task\, with occasionally some distillation. In this talk\, we will take a more expansive view of synthetic data as a general algorithmic tool for generative modeling\, arguing that the design space and possibilities of synthetic data are much bigger than it might seem. Through a few recent works\, we will show that synthetic data has major benefits beyond transforming the data – improving in-domain perplexities\, and enabling unique algorithmic primitives\, such as neighborhood smoothing and concatenated ‘mega’ documents. With this broader view\, we will point towards a nascent but interesting possibility of treating data itself as an algorithmic object to be engineered and optimized end-to-end. \n 
URL:https://cmsa.fas.harvard.edu/event/newtech_5626/
LOCATION:Virtual
CATEGORIES:New Technologies in Mathematics Seminar
ATTACH;FMTTYPE=image/png:https://cmsa.fas.harvard.edu/media/CMSA-NTM-Seminar-5.6.2026.docx.png
END:VEVENT
END:VCALENDAR