Showing 1–4 of 4 results for author: Ruffolo, J

Searching in archive cs.
  1. arXiv:2410.03634  [pdf, ps, other]

    q-bio.BM cs.LG

    Function-Guided Conditional Generation Using Protein Language Models with Adapters

    Authors: Jason Yang, Aadyot Bhatnagar, Jeffrey A. Ruffolo, Ali Madani

    Abstract: The conditional generation of proteins with desired functions is a key goal for generative models. Existing methods based on prompting of protein language models (PLMs) can generate proteins conditioned on a target functionality, such as a desired enzyme family. However, these methods are limited to simple, tokenized conditioning and have not been shown to generalize to unseen functions. In this s…

    Submitted 11 June, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

  2. arXiv:2401.06151  [pdf, other]

    q-bio.BM cs.AI cs.LG q-bio.QM

    Towards Joint Sequence-Structure Generation of Nucleic Acid and Protein Complexes with SE(3)-Discrete Diffusion

    Authors: Alex Morehead, Jeffrey Ruffolo, Aadyot Bhatnagar, Ali Madani

    Abstract: Generative models of macromolecules carry abundant and impactful implications for industrial and biomedical efforts in protein engineering. However, existing methods are currently limited to modeling protein structures or sequences, independently or jointly, without regard to the interactions that commonly occur between proteins and other macromolecules. In this work, we introduce MMDiff, a genera…

    Submitted 21 December, 2023; originally announced January 2024.

    Comments: 15 pages, 11 figures, presented at the NeurIPS 2023 Machine Learning in Structural Biology (MLSB) workshop. Code available at https://github.com/Profluent-Internships/MMDiff

    ACM Class: I.2.1; J.3

  3. arXiv:2206.13517  [pdf, other]

    cs.LG q-bio.QM

    ProGen2: Exploring the Boundaries of Protein Language Models

    Authors: Erik Nijkamp, Jeffrey Ruffolo, Eli N. Weinstein, Nikhil Naik, Ali Madani

    Abstract: Attention-based models trained on protein sequences have demonstrated incredible success at classification and generation tasks relevant for artificial intelligence-driven protein design. However, we lack a sufficient understanding of how very large-scale models and data play a role in effective protein model development. We introduce a suite of protein language models, named ProGen2, that are sca…

    Submitted 27 June, 2022; originally announced June 2022.

  4. arXiv:2112.07782  [pdf, other]

    q-bio.BM cs.LG

    Deciphering antibody affinity maturation with language models and weakly supervised learning

    Authors: Jeffrey A. Ruffolo, Jeffrey J. Gray, Jeremias Sulam

    Abstract: In response to pathogens, the adaptive immune system generates specific antibodies that bind and neutralize foreign antigens. Understanding the composition of an individual's immune repertoire can provide insights into this process and reveal potential therapeutic antibodies. In this work, we explore the application of antibody-specific language models to aid understanding of immune repertoires. W…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: Presented at Machine Learning for Structural Biology Workshop, NeurIPS 2021