Workshop
Machine Unlearning for Generative AI
Vaidehi Patil · Mantas Mazeika · Will Hodgkins · Steven Basart · Yang Liu · Katherine Lee · Mohit Bansal · Bo Li
West Meeting Room 202-204
Fri 18 Jul, 9 a.m. PDT
Generative AI models are trained on internet-scale datasets, yielding powerful capabilities but also introducing risks such as copyright infringement, PII leakage, and the retention of harmful knowledge. Targeted removal, or unlearning, of sensitive data is challenging because retraining on curated sets is computationally expensive, which has driven research into machine unlearning and model editing. Yet approaches like RLHF only suppress undesirable outputs, leaving the underlying knowledge vulnerable to adversarial extraction. This raises urgent privacy, security, and legal concerns, especially under the EU’s GDPR “right to be forgotten”. Because neural networks encode information across millions of parameters, precise deletion without degrading performance is difficult, and adversarial or white-box attacks can recover ostensibly erased data. This workshop brings together experts in AI safety, privacy, and policy to advance robust, verifiable unlearning methods, standardized evaluation frameworks, and theoretical foundations. By working toward true erasure, we aim to ensure AI can ethically and legally forget sensitive data while preserving broader utility.
Schedule
Fri 9:00 a.m. - 9:10 a.m. | Opening Remarks (Opening Remarks)
Fri 9:10 a.m. - 9:40 a.m. | Sijia Liu's Talk (Invited Talk)
Fri 9:40 a.m. - 10:25 a.m. | Live Poster Session 1 (Poster Session)
Fri 10:25 a.m. - 10:45 a.m. | Coffee Break
Fri 10:45 a.m. - 11:15 a.m. | Nicholas Carlini's Talk (Invited Talk)
Fri 11:15 a.m. - 11:30 a.m. | Contributed Talk 1 (Contributed Talk)
Fri 11:30 a.m. - 11:45 a.m. | Contributed Talk 2 (Contributed Talk)
Fri 11:45 a.m. - 12:00 p.m. | Contributed Talk 3 (Contributed Talk)
Fri 12:00 p.m. - 12:30 p.m. | Peter Hase's Talk (Invited Talk)
Fri 12:30 p.m. - 1:30 p.m. | Lunch Break
Fri 1:30 p.m. - 2:00 p.m. | Eleni Triantafillou's Talk (Invited Talk)
Fri 2:00 p.m. - 2:30 p.m. | Shagufta Mehnaz's Talk (Invited Talk) | Shagufta Mehnaz
Fri 2:30 p.m. - 3:00 p.m. | Ling Liu's Talk (Invited Talk) | Ling Liu
Fri 3:00 p.m. - 3:45 p.m. | Live Poster Session 2 (Poster Session)
Fri 3:45 p.m. - 4:00 p.m. | Coffee Break
Fri 4:00 p.m. - 4:55 p.m. | Live Panel Discussion with Speakers and Panelists (Panel)
Fri 4:55 p.m. - 5:00 p.m. | Closing Remarks (Closing Remarks)