Showing 1–10 of 10 results for author: Biancalani, T

  1. arXiv:2502.14944

    cs.LG cs.AI cs.NE q-bio.QM stat.ML

    Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design

    Authors: Masatoshi Uehara, Xingyu Su, Yulai Zhao, Xiner Li, Aviv Regev, Shuiwang Ji, Sergey Levine, Tommaso Biancalani

    Abstract: To fully leverage the capabilities of diffusion models, we are often interested in optimizing downstream reward functions during inference. While numerous algorithms for reward-guided generation have been recently proposed due to their significance, current approaches predominantly focus on single-shot generation, transitioning from fully noised to denoised states. We propose a novel framework for…

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: Under review. If you have any suggestions/missing references, please let us know

  2. arXiv:2501.09685

    cs.AI cs.LG q-bio.QM stat.ML

    Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review

    Authors: Masatoshi Uehara, Yulai Zhao, Chenyu Wang, Xiner Li, Aviv Regev, Sergey Levine, Tommaso Biancalani

    Abstract: This tutorial provides an in-depth guide on inference-time guidance and alignment methods for optimizing downstream reward functions in diffusion models. While diffusion models are renowned for their generative modeling capabilities, practical applications in fields such as biology often require sample generation that maximizes specific metrics (e.g., stability, affinity in proteins, closeness to…

    Submitted 20 January, 2025; v1 submitted 16 January, 2025; originally announced January 2025.

    Comments: We plan to add more content and codes. Please let us know if there are any comments or missing citations

  3. arXiv:2408.08252

    cs.LG cs.AI q-bio.GN stat.ML

    Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding

    Authors: Xiner Li, Yulai Zhao, Chenyu Wang, Gabriele Scalia, Gokcen Eraslan, Surag Nair, Tommaso Biancalani, Shuiwang Ji, Aviv Regev, Sergey Levine, Masatoshi Uehara

    Abstract: Diffusion models excel at capturing the natural design spaces of images, molecules, DNA, RNA, and protein sequences. However, rather than merely generating designs that are natural, we often aim to optimize downstream reward functions while preserving the naturalness of these design spaces. Existing methods for achieving this goal often require "differentiable" proxy models (e.g., class…

    Submitted 24 October, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: The code is available at https://github.com/masa-ue/SVDD

  4. arXiv:2407.13734

    cs.LG cs.AI q-bio.QM stat.ML

    Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review

    Authors: Masatoshi Uehara, Yulai Zhao, Tommaso Biancalani, Sergey Levine

    Abstract: This tutorial provides a comprehensive survey of methods for fine-tuning diffusion models to optimize downstream reward functions. While diffusion models are widely known to provide excellent generative modeling capability, practical applications in domains such as biology require generating samples that maximize some desired metric (e.g., translation efficiency in RNA, docking score in molecules,…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: We plan to add more content/codes. Please let us know if there are any comments

  5. arXiv:2406.12120

    cs.LG cs.AI stat.ML

    Adding Conditional Control to Diffusion Models with Reinforcement Learning

    Authors: Yulai Zhao, Masatoshi Uehara, Gabriele Scalia, Sunyuan Kung, Tommaso Biancalani, Sergey Levine, Ehsan Hajiramezanali

    Abstract: Diffusion models are powerful generative models that allow for precise control over the characteristics of the generated samples. While these diffusion models trained on large datasets have achieved success, there is often a need to introduce additional controls in downstream fine-tuning processes, treating these powerful models as pre-trained diffusion models. This work presents a novel method ba…

    Submitted 23 February, 2025; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: ICLR 2025

  6. arXiv:2405.19673

    cs.LG cs.AI stat.ML

    Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

    Authors: Masatoshi Uehara, Yulai Zhao, Ehsan Hajiramezanali, Gabriele Scalia, Gökcen Eraslan, Avantika Lal, Sergey Levine, Tommaso Biancalani

    Abstract: AI-driven design problems, such as DNA/protein sequence design, are commonly tackled from two angles: generative modeling, which efficiently captures the feasible design space (e.g., natural images or biological sequences), and model-based optimization, which utilizes reward models for extrapolation. To combine the strengths of both approaches, we adopt a hybrid method that fine-tunes cutting-edge…

    Submitted 31 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Under review

  7. arXiv:2402.16359

    cs.LG cs.AI q-bio.QM stat.ML

    Feedback Efficient Online Fine-Tuning of Diffusion Models

    Authors: Masatoshi Uehara, Yulai Zhao, Kevin Black, Ehsan Hajiramezanali, Gabriele Scalia, Nathaniel Lee Diamant, Alex M Tseng, Sergey Levine, Tommaso Biancalani

    Abstract: Diffusion models excel at modeling complex data distributions, including those of images, proteins, and small molecules. However, in many cases, our goal is to model parts of the distribution that maximize certain properties: for example, we may want to generate images with high aesthetic quality, or molecules with high bioactivity. It is natural to frame this as a reinforcement learning (RL) prob…

    Submitted 18 July, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  8. arXiv:2402.15194

    cs.LG cs.AI stat.ML

    Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control

    Authors: Masatoshi Uehara, Yulai Zhao, Kevin Black, Ehsan Hajiramezanali, Gabriele Scalia, Nathaniel Lee Diamant, Alex M Tseng, Tommaso Biancalani, Sergey Levine

    Abstract: Diffusion models excel at capturing complex data distributions, such as those of natural images and proteins. While diffusion models are trained to represent the distribution in the training dataset, we often are more concerned with other properties, such as the aesthetic quality of the generated images or the functional properties of generated proteins. Diffusion models can be finetuned in a goal…

    Submitted 28 February, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Under review (codes will be released soon)

  9. arXiv:2306.02957

    cs.LG stat.ML

    Complex Preferences for Different Convergent Priors in Discrete Graph Diffusion

    Authors: Alex M. Tseng, Nathaniel Diamant, Tommaso Biancalani, Gabriele Scalia

    Abstract: Diffusion models have achieved state-of-the-art performance in generating many different kinds of data, including images, text, and videos. Despite their success, there has been limited research on how the underlying diffusion process and the final convergent prior can affect generative performance; this research has also been limited to continuous data types and a score-based diffusion framework.…

    Submitted 21 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

  10. arXiv:2301.01849

    cs.LG stat.ME stat.ML

    NODAGS-Flow: Nonlinear Cyclic Causal Structure Learning

    Authors: Muralikrishnna G. Sethuraman, Romain Lopez, Rahul Mohan, Faramarz Fekri, Tommaso Biancalani, Jan-Christian Hütter

    Abstract: Learning causal relationships between variables is a well-studied problem in statistics, with many important applications in science. However, modeling real-world systems remains challenging, as most existing algorithms assume that the underlying causal graph is acyclic. While this is a convenient framework for developing theoretical developments about causal reasoning and inference, the underlying…

    Submitted 4 January, 2023; originally announced January 2023.