Skip to main content

Showing 1–4 of 4 results for author: Amin, A N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.08316  [pdf, ps, other

    cs.LG stat.ML

    Why Masking Diffusion Works: Condition on the Jump Schedule for Improved Discrete Diffusion

    Authors: Alan N. Amin, Nate Gruver, Andrew Gordon Wilson

    Abstract: Discrete diffusion models, like continuous diffusion models, generate high-quality samples by gradually undoing noise applied to datapoints with a Markov process. Gradual generation in theory comes with many conceptual benefits; for example, inductive biases can be incorporated into the noising Markov process, and access to improved sampling algorithms. In practice, however, the consistently best… ▽ More

    Submitted 27 September, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

    Comments: Published at Neurips 2025. Code available at: https://github.com/AlanNawzadAmin/SCUD

  2. arXiv:2412.07763  [pdf, other

    stat.ML cs.LG q-bio.BM

    Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences

    Authors: Alan Nawzad Amin, Nate Gruver, Yilun Kuang, Lily Li, Hunter Elliott, Calvin McCarter, Aniruddh Raghu, Peyton Greenside, Andrew Gordon Wilson

    Abstract: To build effective therapeutics, biologists iteratively mutate antibody sequences to improve binding and stability. Proposed mutations can be informed by previous measurements or by learning from large antibody databases to predict only typical antibodies. Unfortunately, the space of typical antibodies is enormous to search, and experiments often fail to find suitable antibodies on a budget. We in… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Code available at https://github.com/AlanNawzadAmin/CloneBO

  3. arXiv:2406.09177  [pdf, other

    stat.ML cs.LG

    Scalable and Flexible Causal Discovery with an Efficient Test for Adjacency

    Authors: Alan Nawzad Amin, Andrew Gordon Wilson

    Abstract: To make accurate predictions, understand mechanisms, and design interventions in systems of many variables, we wish to learn causal graphs from large scale data. Unfortunately the space of all possible causal graphs is enormous so scalably and accurately searching for the best fit to the data is a challenge. In principle we could substantially decrease the search space, or learn the graph entirely… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: ICML 2024; Code at https://github.com/AlanNawzadAmin/DAT-graph

  4. arXiv:2304.03775  [pdf, other

    stat.ML cs.LG q-bio.QM

    Biological Sequence Kernels with Guaranteed Flexibility

    Authors: Alan Nawzad Amin, Eli Nathan Weinstein, Debora Susan Marks

    Abstract: Applying machine learning to biological sequences - DNA, RNA and protein - has enormous potential to advance human health, environmental sustainability, and fundamental biological understanding. However, many existing machine learning methods are ineffective or unreliable in this problem domain. We study these challenges theoretically, through the lens of kernels. Methods based on kernels are ubiq… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.