Skip to main content

Showing 1–5 of 5 results for author: Shehu, A

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2505.11610  [pdf, ps, other

    cs.AI cs.LG q-bio.BM q-bio.GN

    Foundation Models for AI-Enabled Biological Design

    Authors: Asher Moldwin, Amarda Shehu

    Abstract: This paper surveys foundation models for AI-enabled biological design, focusing on recent developments in applying large-scale, self-supervised models to tasks such as protein engineering, small molecule design, and genomic sequence design. Though this domain is evolving rapidly, this survey presents and discusses a taxonomy of current models and methods. The focus is on challenges and solutions i… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Published as part of the workshop proceedings at AAAI 2025 in the workshop "Foundation Models for Biological Discoveries"

  2. arXiv:2206.11057  [pdf, other

    cs.LG cs.AI q-bio.QM

    Transformer Neural Networks Attending to Both Sequence and Structure for Protein Prediction Tasks

    Authors: Anowarul Kabir, Amarda Shehu

    Abstract: The increasing number of protein sequences decoded from genomes is opening up new avenues of research on linking protein sequence to function with transformer neural networks. Recent research has shown that the number of known protein sequences supports learning useful, task-agnostic sequence representations via transformers. In this paper, we posit that learning joint sequence-structure represent… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 8 pages, 4 figures, 3 tables

  3. arXiv:2010.01441  [pdf, other

    q-bio.BM cs.LG

    Decoy Selection for Protein Structure Prediction Via Extreme Gradient Boosting and Ranking

    Authors: Nasrin Akhter, Gopinath Chennupati, Hristo Djidjev, Amarda Shehu

    Abstract: Identifying one or more biologically-active/native decoys from millions of non-native decoys is one of the major challenges in computational structural biology. The extreme lack of balance in positive and negative samples (native and non-native decoys) in a decoy set makes the problem even more complicated. Consensus methods show varied success in handling the challenge of decoy selection despite… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: Accepted for BMC Bioinformatics

  4. arXiv:2004.07119  [pdf, other

    q-bio.BM cs.LG stat.ML

    Generating Tertiary Protein Structures via an Interpretative Variational Autoencoder

    Authors: Xiaojie Guo, Yuanqi Du, Sivani Tadepalli, Liang Zhao, Amarda Shehu

    Abstract: Much scientific enquiry across disciplines is founded upon a mechanistic treatment of dynamic systems that ties form to function. A highly visible instance of this is in molecular biology, where an important goal is to determine functionally-relevant forms/structures that a protein molecule employs to interact with molecular partners in the living cell. This goal is typically pursued under the umb… ▽ More

    Submitted 16 June, 2021; v1 submitted 8 April, 2020; originally announced April 2020.

  5. arXiv:1905.08331  [pdf, other

    q-bio.BM cs.HC

    ROMEO: A Plug-and-play Software Platform of Robotics-inspired Algorithms for Modeling Biomolecular Structures and Motions

    Authors: Kevin Molloy, Erion Plaku, Amarda Shehu

    Abstract: Motivation: Due to the central role of protein structure in molecular recognition, great computational efforts are devoted to modeling protein structures and motions that mediate structural rearrangements. The size, dimensionality, and non-linearity of the protein structure space present outstanding challenges. Such challenges also arise in robot motion planning, and robotics-inspired treatments o… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: 6 pages, 5 figures