Showing 1–3 of 3 results for author: Nadeem, M

Searching in archive q-bio.

  1. arXiv:2504.04453

    q-bio.BM cs.AI cs.CL cs.LG

    Prot42: a Novel Family of Protein Language Models for Target-aware Protein Binder Generation

    Authors: Mohammad Amaan Sayeed, Engin Tekin, Maryam Nadeem, Nancy A. ElNaker, Aahan Singh, Natalia Vassilieva, Boulbaba Ben Amor

    Abstract: Unlocking the next generation of biotechnology and therapeutic innovation demands overcoming the inherent complexity and resource-intensity of conventional protein engineering methods. Recent GenAI-powered computational techniques often rely on the availability of the target protein's 3D structures and specific binding sites to generate high-affinity binders, constraints exhibited by models such a…

    Submitted 18 May, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

  2. arXiv:2503.16565

    cs.LG cs.AI cs.CL q-bio.GN

    Gene42: Long-Range Genomic Foundation Model With Dense Attention

    Authors: Kirill Vishniakov, Boulbaba Ben Amor, Engin Tekin, Nancy A. ElNaker, Karthik Viswanathan, Aleksandr Medvedev, Aahan Singh, Maryam Nadeem, Mohammad Amaan Sayeed, Praveenkumar Kanithi, Tiago Magalhaes, Natalia Vassilieva, Dwarikanath Mahapatra, Marco Pimentel, and Shadab Khan

    Abstract: We introduce Gene42, a novel family of Genomic Foundation Models (GFMs) designed to manage context lengths of up to 192,000 base pairs (bp) at a single-nucleotide resolution. Gene42 models utilize a decoder-only (LLaMA-style) architecture with a dense self-attention mechanism. Initially trained on fixed-length sequences of 4,096 bp, our models underwent continuous pretraining to extend the context…

    Submitted 20 March, 2025; originally announced March 2025.

  3. arXiv:2503.16563

    cs.LG cs.AI cs.CL q-bio.BM

    Chem42: a Family of chemical Language Models for Target-aware Ligand Generation

    Authors: Aahan Singh, Engin Tekin, Maryam Nadeem, Nancy A. ElNaker, Mohammad Amaan Sayeed, Natalia Vassilieva, Boulbaba Ben Amor

    Abstract: Revolutionizing drug discovery demands more than just understanding molecular interactions - it requires generative models that can design novel ligands tailored to specific biological targets. While chemical Language Models (cLMs) have made strides in learning molecular properties, most fail to incorporate target-specific insights, restricting their ability to drive de-novo ligand generation. Che…

    Submitted 11 June, 2025; v1 submitted 20 March, 2025; originally announced March 2025.