Skip to main content

Showing 1–2 of 2 results for author: Gokaslan, A

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2410.16735  [pdf, other

    q-bio.BM

    MeMDLM: De Novo Membrane Protein Design with Masked Discrete Diffusion Protein Language Models

    Authors: Shrey Goel, Vishrut Thoutam, Edgar Mariano Marroquin, Aaron Gokaslan, Arash Firouzbakht, Sophia Vincoff, Volodymyr Kuleshov, Huong T. Kratochvil, Pranam Chatterjee

    Abstract: Masked Diffusion Language Models (MDLMs) have recently emerged as a strong class of generative models, paralleling state-of-the-art (SOTA) autoregressive (AR) performance across natural language modeling domains. While there have been advances in AR as well as both latent and discrete diffusion-based approaches for protein sequence design, masked diffusion language modeling with protein language m… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

  2. arXiv:2403.03234  [pdf, other

    q-bio.GN cs.LG

    Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling

    Authors: Yair Schiff, Chia-Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov

    Abstract: Large-scale sequence modeling has sparked rapid advances that now extend into biology and genomics. However, modeling genomic sequences introduces challenges such as the need to model long-range token interactions, the effects of upstream and downstream regions of the genome, and the reverse complementarity (RC) of DNA. Here, we propose an architecture motivated by these challenges that builds off… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: ICML 2024; Code to reproduce our experiments is available at https://github.com/kuleshov-group/caduceus