ICML In-situ Autoguidance: Eliciting Self-Correction in Diffusion Models

Poster
in
Affinity Workshop: New In ML

In-situ Autoguidance: Eliciting Self-Correction in Diffusion Models

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

The generation of high-quality, diverse, and prompt-aligned images is a central goal in image-generating diffusion models. The popularclassifier-free guidance (CFG) approach improves quality and alignment at the cost of reduced variation, creating an inherent entanglement of these effects. Recent work has successfully disentangled these properties by guiding a model with a separately trained, inferior counterpart, yet this solution introduces the considerable overhead of requiring an auxiliary model. We challenge this prerequisiteby introducing In-situ Autoguidance, a method that elicits guidance from the model itself without any auxiliary components. Our approachdynamically generates an inferior prediction on-the-fly using a stochastic forward pass, reframing guidance as a form of inference-time self-correction. We demonstrate that this zero-cost approach is not only viable but also establishes a powerful new baseline for cost-efficient guidance, proving that the benefits of self-guidance can be achieved without external models.

Chat is not available.

Poster in Affinity Workshop: New In ML

In-situ Autoguidance: Eliciting Self-Correction in Diffusion Models

Poster
in
Affinity Workshop: New In ML