Showing 1–11 of 11 results for author: Didi, K

  1. arXiv:2504.08051  [pdf, other]

    cs.LG cs.AI

    Compositional Flows for 3D Molecule and Synthesis Pathway Co-design

    Authors: Tony Shen, Seonghwan Seo, Ross Irwin, Kieran Didi, Simon Olsson, Woo Youn Kim, Martin Ester

    Abstract: Many generative applications, such as synthesis-based 3D molecular design, involve constructing compositional objects with continuous features. Here, we introduce Compositional Generative Flows (CGFlow), a novel framework that extends flow matching to generate objects in compositional steps while modeling continuous states. Our key insight is that modeling compositional state transitions can be fo…

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Spotlighted at ICLR 2025 GEM and AI4Mat workshops, 29 pages, 7 figures

  2. arXiv:2503.00710  [pdf, other]

    cs.LG

    Proteina: Scaling Flow-based Protein Structure Generative Models

    Authors: Tomas Geffner, Kieran Didi, Zuobai Zhang, Danny Reidenbach, Zhonglin Cao, Jason Yim, Mario Geiger, Christian Dallago, Emine Kucukbenli, Arash Vahdat, Karsten Kreis

    Abstract: Recently, diffusion- and flow-based generative models of protein structures have emerged as a powerful tool for de novo protein design. Here, we develop Proteina, a new large-scale flow-based protein backbone generator that utilizes hierarchical fold class labels for conditioning and relies on a tailored scalable transformer architecture with up to 5x as many parameters as previous models. To mean…

    Submitted 1 March, 2025; originally announced March 2025.

    Comments: ICLR 2025 Oral. Project page: https://research.nvidia.com/labs/genair/proteina/

  3. arXiv:2502.12479  [pdf, other]

    cs.LG q-bio.BM

    MotifBench: A standardized protein design benchmark for motif-scaffolding problems

    Authors: Zhuoqi Zheng, Bo Zhang, Kieran Didi, Kevin K. Yang, Jason Yim, Joseph L. Watson, Hai-Feng Chen, Brian L. Trippe

    Abstract: The motif-scaffolding problem is a central task in computational protein design: Given the coordinates of atoms in a geometry chosen to confer a desired biochemical function (a motif), the task is to identify diverse protein structures (scaffolds) that include the motif and maintain its geometry. Significant recent progress on motif-scaffolding has been made due to computational evaluation with re…

    Submitted 19 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Associated content available at github.com/blt2114/MotifBench

  4. arXiv:2411.10548  [pdf, ps, other]

    cs.LG q-bio.BM

    BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery

    Authors: Peter St. John, Dejun Lin, Polina Binder, Malcolm Greaves, Vega Shah, John St. John, Adrian Lange, Patrick Hsu, Rajesh Illango, Arvind Ramanathan, Anima Anandkumar, David H Brookes, Akosua Busia, Abhishaike Mahajan, Stephen Malina, Neha Prasad, Sam Sinai, Lindsay Edwards, Thomas Gaudelet, Cristian Regep, Martin Steinegger, Burkhard Rost, Alexander Brace, Kyle Hippe, Luca Naef, et al. (68 additional authors not shown)

    Abstract: Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational bio…

    Submitted 12 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

  5. arXiv:2406.13864  [pdf, other]

    cs.LG q-bio.BM

    Evaluating representation learning on the protein structure universe

    Authors: Arian R. Jamasb, Alex Morehead, Chaitanya K. Joshi, Zuobai Zhang, Kieran Didi, Simon V. Mathis, Charles Harris, Jian Tang, Jianlin Cheng, Pietro Lio, Tom L. Blundell

    Abstract: We introduce ProteinWorkshop, a comprehensive benchmark suite for representation learning on protein structures with Geometric Graph Neural Networks. We consider large-scale pre-training and downstream tasks on both experimental and predicted structures to enable the systematic evaluation of the quality of the learned structural representation and their usefulness in capturing functional relations…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ICLR 2024

  6. arXiv:2406.13839  [pdf, other]

    q-bio.BM cs.LG q-bio.GN

    RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone Design

    Authors: Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian R. Jamasb, Charles Harris, Simon V. Mathis, Kieran Didi, Rex Ying, Bryan Hooi, Pietro Liò

    Abstract: We introduce RNA-FrameFlow, the first generative model for 3D RNA backbone design. We build upon SE(3) flow matching for protein backbone generation and establish protocols for data preparation and evaluation to address unique challenges posed by RNA modeling. We formulate RNA structures as a set of rigid-body frames and associated loss functions which account for larger, more conformationally fle…

    Submitted 18 March, 2025; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Oral presentation at Machine Learning in Computational Biology (MLCB), 2024. Also presented as an Oral at ICML 2024 Structured Probabilistic Inference & Generative Modeling Workshop, and a Spotlight at ICML 2024 AI4Science Workshop

  7. arXiv:2406.01781  [pdf, other]

    cs.LG

    DEFT: Efficient Fine-Tuning of Diffusion Models by Learning the Generalised $h$-transform

    Authors: Alexander Denker, Francisco Vargas, Shreyas Padhy, Kieran Didi, Simon Mathis, Vincent Dutordoir, Riccardo Barbano, Emile Mathieu, Urszula Julia Komorowska, Pietro Lio

    Abstract: Generative modelling paradigms based on denoising diffusion processes have emerged as a leading candidate for conditional sampling in inverse problems. In many real-world applications, we often have access to large, expensively trained unconditional diffusion models, which we aim to exploit for improving conditional sampling. Most recent approaches are motivated heuristically and lack a unifying f…

    Submitted 20 May, 2025; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.09236

  8. arXiv:2312.09236  [pdf, other]

    cs.LG q-bio.BM

    A framework for conditional diffusion modelling with applications in motif scaffolding for protein design

    Authors: Kieran Didi, Francisco Vargas, Simon V Mathis, Vincent Dutordoir, Emile Mathieu, Urszula J Komorowska, Pietro Lio

    Abstract: Many protein design applications, such as binder or enzyme design, require scaffolding a structural motif with high precision. Generative modelling paradigms based on denoising diffusion processes emerged as a leading candidate to address this motif scaffolding problem and have shown early experimental success in some cases. In the diffusion paradigm, motif scaffolding is treated as a conditional…

    Submitted 13 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 9 pages

  9. arXiv:2308.07413  [pdf, other]

    q-bio.BM

    Benchmarking Generated Poses: How Rational is Structure-based Drug Design with Generative Models?

    Authors: Charles Harris, Kieran Didi, Arian R. Jamasb, Chaitanya K. Joshi, Simon V. Mathis, Pietro Lio, Tom Blundell

    Abstract: Deep generative models for structure-based drug design (SBDD), where molecule generation is conditioned on a 3D protein pocket, have received considerable interest in recent years. These methods offer the promise of higher-quality molecule generation by explicitly modelling the 3D interaction between a potential drug and a protein receptor. However, previous work has primarily focused on the quali…

    Submitted 14 August, 2023; originally announced August 2023.

  10. arXiv:2212.12560  [pdf, other]

    cs.AI

    On How AI Needs to Change to Advance the Science of Drug Discovery

    Authors: Kieran Didi, Matej Zečević

    Abstract: Research around AI for Science has seen significant success since the rise of deep learning models over the past decade, even with longstanding challenges such as protein structure prediction. However, this fast development inevitably made their flaws apparent -- especially in domains of reasoning where understanding the cause-effect relationship is important. One such domain is drug discovery, in…

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Main paper: 6 pages, References: 1.5 pages. Main paper: 3 figures

  11. arXiv:2210.13695  [pdf, other]

    q-bio.BM cs.LG

    Structure-based Drug Design with Equivariant Diffusion Models

    Authors: Arne Schneuing, Charles Harris, Yuanqi Du, Kieran Didi, Arian Jamasb, Ilia Igashov, Weitao Du, Carla Gomes, Tom Blundell, Pietro Lio, Max Welling, Michael Bronstein, Bruno Correia

    Abstract: Structure-based drug design (SBDD) aims to design small-molecule ligands that bind with high affinity and specificity to pre-determined protein targets. Generative SBDD methods leverage structural data of drugs in complex with their protein targets to propose new drug candidates. These approaches typically place one atom at a time in an autoregressive fashion using the binding pocket as well as pr…

    Submitted 23 September, 2024; v1 submitted 24 October, 2022; originally announced October 2022.