Showing 1–4 of 4 results for author: Amin, A N

Searching in archive q-bio.
  1. arXiv:2506.19598  [pdf, ps, other]

    cs.LG q-bio.PE

    Training Flexible Models of Genetic Variant Effects from Functional Annotations using Accelerated Linear Algebra

    Authors: Alan N. Amin, Andres Potapczynski, Andrew Gordon Wilson

    Abstract: To understand how genetic variants in human genomes manifest in phenotypes -- traits like height or diseases like asthma -- geneticists have sequenced and measured hundreds of thousands of individuals. Geneticists use this data to build models that predict how a genetic variant impacts phenotype given genomic features of the variant, like DNA accessibility or the presence of nearby DNA-bound prote…

    Submitted 28 June, 2025; v1 submitted 24 June, 2025; originally announced June 2025.

    Comments: For example: ICML 2025. Code available at: https://github.com/AlanNawzadAmin/DeepWAS

  2. arXiv:2412.07763  [pdf, other]

    stat.ML cs.LG q-bio.BM

    Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences

    Authors: Alan Nawzad Amin, Nate Gruver, Yilun Kuang, Lily Li, Hunter Elliott, Calvin McCarter, Aniruddh Raghu, Peyton Greenside, Andrew Gordon Wilson

    Abstract: To build effective therapeutics, biologists iteratively mutate antibody sequences to improve binding and stability. Proposed mutations can be informed by previous measurements or by learning from large antibody databases to predict only typical antibodies. Unfortunately, the space of typical antibodies is enormous to search, and experiments often fail to find suitable antibodies on a budget. We in…

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Code available at https://github.com/AlanNawzadAmin/CloneBO

  3. arXiv:2304.03775  [pdf, other]

    stat.ML cs.LG q-bio.QM

    Biological Sequence Kernels with Guaranteed Flexibility

    Authors: Alan Nawzad Amin, Eli Nathan Weinstein, Debora Susan Marks

    Abstract: Applying machine learning to biological sequences - DNA, RNA and protein - has enormous potential to advance human health, environmental sustainability, and fundamental biological understanding. However, many existing machine learning methods are ineffective or unreliable in this problem domain. We study these challenges theoretically, through the lens of kernels. Methods based on kernels are ubiq…

    Submitted 6 April, 2023; originally announced April 2023.

  4. Analytical Theory for Sequence-Specific Binary Fuzzy Complexes of Charged Intrinsically Disordered Proteins

    Authors: Alan N. Amin, Yi-Hsuan Lin, Suman Das, Hue Sun Chan

    Abstract: Intrinsically disordered proteins (IDPs) are important for biological functions. In contrast to folded proteins, molecular recognition among certain IDPs is "fuzzy" in that their binding and/or phase separation are stochastically governed by the interacting IDPs' amino acid sequences while their assembled conformations remain largely disordered. To help elucidate a basic aspect of this fascinating…

    Submitted 7 July, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: 51 pages, 11 figures. Accepted for Publication in J. Phys. Chem. B

    Journal ref: J. Phys. Chem. B 124, 6709--6720 (2020)