Skip to yearly menu bar Skip to main content


Spotlight
in
Workshop: 2nd Generative AI for Biology Workshop

To Optimize, Not to Invent: Controlled On-Manifold Diffusion Walk for mRNA Sequence Generation and Optimization Without $\textit{de novo}$ Design

Danqi Liao · Chen Liu · Xingzhi Sun · Die Tang · Haochen Wang · Scott Youlten · Antonio Giraldez · Smita Krishnaswamy

Keywords: [ Langevin dynamics ] [ biological sequence optimization ] [ manifold learning ] [ mRNA design ]


Abstract:

Designing mRNA sequences with optimized biological properties remains a fundamental challenge in synthetic biology and therapeutic development. Deep generative models have enabled data driven sequence generation, but most are designed for \textit{de novo} generation, meaning generating entirely from scratch. This nature makes it difficult for them to refine existing sequences, interpolate between sequences, or produce interpretable optimization steps. In this work, we introduce Controlled On-Manifold Diffusion Walk (C-ManifoldWalk), a score-free framework for mRNA design that combines Langevin dynamics with a learned manifold projector. Operating entirely in the latent space of a pretrained encoder, C-ManifoldWalk updates latent codes using property guided gradients and then projects each noisy step back onto the learned manifold to ensure biological plausibility. This approach enables property guided optimization, smooth interpolation between arbitrary sequences, and tracking of interpretable latent trajectories, all without requiring explicit density estimation or score learning. We demonstrate on real zebrafish mRNA datasets that C-ManifoldWalk can continuously steer sequences toward target properties while remaining close to natural sequences, and can generate biologically plausible intermediate variants along each trajectory. Our results establish a scalable and generalizable paradigm for controllable mRNA design and latent space exploration in biological sequence modeling.

Chat is not available.