This event has passed.

On Provable Copyright Protection for Generative Model

Name: On Provable Copyright Protection for Generative Model
Start: 2023-10-23T16:30:00-05:00
End: 2023-10-23T17:30:00-05:00
Location: CMSA Room G10

October 23, 2023 @ 4:30 pm - 5:30 pm

Speaker: Boaz Barak (Harvard)

Title: On Provable Copyright Protection for Generative Model

Abstract: There is a growing concern that learned conditional generative models may output samples that are substantially similar to some copyrighted data C that was in their training set. We give a formal definition of near access-freeness (NAF) and prove bounds on the probability that a model satisfying this definition outputs a sample similar to C, even if C is included in its training set.

Roughly speaking, a generative model p is k-NAF if for every potentially copyrighted data C, the output of p diverges by at most k-bits from the output of a model q that did not access C at all. We also give generative model learning algorithms, which efficiently modify the original generative model learning algorithm in a black box manner, that output generative models with strong bounds on the probability of sampling protected content. Furthermore, we provide promising experiments for both language (transformers) and image (diffusion) generative models, showing minimal degradation in output quality while ensuring strong protections against sampling protected content.

Joint work with Nikhil Vyas and Sham Kakade. Paper appeared in ICML 2023 and is on https://arxiv.org/abs/2302.10870

Details

Date:: October 23, 2023
Time:: 4:30 pm - 5:30 pm
Event Category:: Colloquium

Organizer

: Alejandro Poveda
: Kai Xu

Venue

CMSA Room G10; CMSA, 20 Garden Street
Cambridge, MA 02138 United States + Google Map
Phone:: 6174967132