ICML Composition and Alignment of Diffusion Models using Constrained Learning

Poster
in
Workshop: 2nd Workshop on Models of Human Feedback for AI Alignment (MoFA)

Composition and Alignment of Diffusion Models using Constrained Learning

Shervin Khalafi · Ignacio Hounie · Dongsheng Ding · Alejandro Ribeiro

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

Diffusion models have become prevalent in generative modeling due to their ability to sample from complex distributions. To improve the quality of generated samples and their compliance with user requirements, two commonly used methods are: (i) Alignment, which involves fine-tuning a diffusion model to align it with a reward; and (ii) Composition, which combines several pre-trained diffusion models together, each emphasizing a desirable attribute in the generated outputs. However, trade-offs often arise when optimizing for multiple rewards or combining multiple models, as they can often represent competing properties. Existing methods cannot guarantee that the resulting model faithfully generates samples with all the desired properties. To address this gap, we propose a constrained optimization framework that unifies alignment and composition of diffusion models by enforcing that the aligned model satisfies reward constraints and/or remains close to each pre-trained model. We provide a theoretical characterization of the solutions to the constrained alignment and composition problems and develop a Lagrangian-based primal-dual training algorithm to approximate these solutions. Empirically, we demonstrate our proposed approach in image generation, applying it to alignment and composition, and show that our aligned or composed model satisfies constraints effectively.

Chat is not available.

Poster in Workshop: 2nd Workshop on Models of Human Feedback for AI Alignment (MoFA)

Composition and Alignment of Diffusion Models using Constrained Learning

Shervin Khalafi · Ignacio Hounie · Dongsheng Ding · Alejandro Ribeiro

Poster
in
Workshop: 2nd Workshop on Models of Human Feedback for AI Alignment (MoFA)