Skip to main content

Showing 1–17 of 17 results for author: Stanton, S

.
  1. arXiv:2502.14607  [pdf, other

    astro-ph.IM

    A Millimeter-Wave Photometric Camera for Long-Range Imaging Through Optical Obscurants Using Kinetic Inductance Detectors

    Authors: Jack Sayers, Daniel Cunnane, Sage Crystian, Peter K. Day, Fabien Defrance, Byeong Ho Eom, Jonathan Greenfield, Matthew Hollister, Bradley R. Johnson, Henry G. LeDuc, Philip Mauskopf, Nia McNichols, Cody Roberson, Marcus C. Runyan, Adhitya B. Sriram, Sage Stanton, Ryan C. Stephenson, Liam C. Walters, Eric Weeks

    Abstract: Passive imaging through optical obscurants is a promising application for mm-wave sensing. We have thus developed the Superconducting Kinetic Inductance Passive Radiometer (SKIPR), a 150 GHz polarization-sensitive photometric camera optimized for terrestrial imaging using a focal plane array with 3,840 kinetic inductance detectors (KIDs). We present a full description of the instrument design, wit… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  2. arXiv:2411.06090  [pdf, other

    cs.LG

    Concept Bottleneck Language Models For protein design

    Authors: Aya Abdelsalam Ismail, Tuomas Oikarinen, Amy Wang, Julius Adebayo, Samuel Stanton, Taylor Joren, Joseph Kleinhenz, Allen Goodman, Héctor Corrada Bravo, Kyunghyun Cho, Nathan C. Frey

    Abstract: We introduce Concept Bottleneck Protein Language Models (CB-pLM), a generative masked language model with a layer where each neuron corresponds to an interpretable concept. Our architecture offers three key benefits: i) Control: We can intervene on concept values to precisely control the properties of generated proteins, achieving a 3 times larger change in desired concept values compared to basel… ▽ More

    Submitted 11 December, 2024; v1 submitted 9 November, 2024; originally announced November 2024.

  3. arXiv:2410.22296  [pdf, other

    cs.LG q-bio.QM

    Generalists vs. Specialists: Evaluating LLMs on Highly-Constrained Biophysical Sequence Optimization Tasks

    Authors: Angelica Chen, Samuel D. Stanton, Frances Ding, Robert G. Alberstein, Andrew M. Watkins, Richard Bonneau, Vladimir Gligorijević, Kyunghyun Cho, Nathan C. Frey

    Abstract: Although large language models (LLMs) have shown promise in biomolecule optimization problems, they incur heavy computational costs and struggle to satisfy precise constraints. On the other hand, specialized solvers like LaMBO-2 offer efficiency and fine-grained control but require more domain expertise. Comparing these approaches is challenging due to expensive laboratory validation and inadequat… ▽ More

    Submitted 2 April, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: Supercedes arXiv:2407.00236v1. arXiv admin note: text overlap with arXiv:2407.00236

  4. arXiv:2407.00236  [pdf, other

    cs.LG cs.NE

    Closed-Form Test Functions for Biophysical Sequence Optimization Algorithms

    Authors: Samuel Stanton, Robert Alberstein, Nathan Frey, Andrew Watkins, Kyunghyun Cho

    Abstract: There is a growing body of work seeking to replicate the success of machine learning (ML) on domains like computer vision (CV) and natural language processing (NLP) to applications involving biophysical data. One of the key ingredients of prior successes in CV and NLP was the broad acceptance of difficult benchmarks that distilled key subproblems into approachable tasks that any junior researcher… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  5. arXiv:2405.06627  [pdf, other

    cs.LG cs.AI stat.ML

    Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)

    Authors: Drew Prinster, Samuel Stanton, Anqi Liu, Suchi Saria

    Abstract: As artificial intelligence (AI) / machine learning (ML) gain widespread adoption, practitioners are increasingly seeking means to quantify and control the risk these systems incur. This challenge is especially salient when such systems have autonomy to collect their own data, such as in black-box optimization and active learning, where their actions induce sequential feedback-loop shifts in the da… ▽ More

    Submitted 5 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Code available at https://github.com/drewprinster/conformal-mfcs

  6. arXiv:2305.20009  [pdf, other

    cs.LG q-bio.BM

    Protein Design with Guided Discrete Diffusion

    Authors: Nate Gruver, Samuel Stanton, Nathan C. Frey, Tim G. J. Rudner, Isidro Hotzel, Julien Lafrance-Vanasse, Arvind Rajpal, Kyunghyun Cho, Andrew Gordon Wilson

    Abstract: A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling. The generative model samples plausible sequences while the discriminative model guides a search for sequences with high fitness. Given its broad success in conditional sampling, classifier-guided diffusion modeling is a promising foundation for protein design, leading many to… ▽ More

    Submitted 12 December, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Advances in Neural Information Processing Systems 36, December 10-16, 2023

  7. arXiv:2212.04450  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    GAUCHE: A Library for Gaussian Processes in Chemistry

    Authors: Ryan-Rhys Griffiths, Leo Klarner, Henry B. Moss, Aditya Ravuri, Sang Truong, Samuel Stanton, Gary Tom, Bojana Rankovic, Yuanqi Du, Arian Jamasb, Aryan Deshwal, Julius Schwartz, Austin Tripp, Gregory Kell, Simon Frieder, Anthony Bourached, Alex Chan, Jacob Moss, Chengzhi Guo, Johannes Durholt, Saudamini Chaurasia, Felix Strieth-Kalthoff, Alpha A. Lee, Bingqing Cheng, Alán Aspuru-Guzik , et al. (2 additional authors not shown)

    Abstract: We introduce GAUCHE, a library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to chemical representations, however, is nontrivial, necessitating kernels defined over structured inputs such as graphs, strings… ▽ More

    Submitted 21 February, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

  8. arXiv:2210.12496  [pdf, other

    cs.LG stat.ML

    Bayesian Optimization with Conformal Prediction Sets

    Authors: Samuel Stanton, Wesley Maddox, Andrew Gordon Wilson

    Abstract: Bayesian optimization is a coherent, ubiquitous approach to decision-making under uncertainty, with applications including multi-arm bandits, active learning, and black-box optimization. Bayesian optimization selects decisions (i.e. objective function queries) with maximal expected utility with respect to the posterior distribution of a Bayesian model, which quantifies reducible, epistemic uncerta… ▽ More

    Submitted 12 December, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: For code, see https://www.github.com/samuelstanton/conformal-bayesopt.git

    Journal ref: Proceedings of Machine Learning Research, Volume 206, 959-986, PMLR, 2023

  9. arXiv:2210.04096  [pdf, other

    cs.LG q-bio.QM

    PropertyDAG: Multi-objective Bayesian optimization of partially ordered, mixed-variable properties for biological sequence design

    Authors: Ji Won Park, Samuel Stanton, Saeed Saremi, Andrew Watkins, Henri Dwyer, Vladimir Gligorijevic, Richard Bonneau, Stephen Ra, Kyunghyun Cho

    Abstract: Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarch… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures. Submitted to NeurIPS 2022 AI4Science Workshop

  10. arXiv:2203.12742  [pdf, other

    cs.LG cs.NE q-bio.QM stat.ML

    Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders

    Authors: Samuel Stanton, Wesley Maddox, Nate Gruver, Phillip Maffettone, Emily Delaney, Peyton Greenside, Andrew Gordon Wilson

    Abstract: Bayesian optimization (BayesOpt) is a gold standard for query-efficient continuous optimization. However, its adoption for drug design has been hindered by the discrete, high-dimensional nature of the decision variables. We develop a new approach (LaMBO) which jointly trains a denoising autoencoder with a discriminative multi-task Gaussian process head, allowing gradient-based optimization of mult… ▽ More

    Submitted 12 July, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: ICML 2022. Code available at https://github.com/samuelstanton/lambo

  11. arXiv:2202.04836  [pdf, other

    cs.LG math.DS physics.data-an stat.ML

    Deconstructing the Inductive Biases of Hamiltonian Neural Networks

    Authors: Nate Gruver, Marc Finzi, Samuel Stanton, Andrew Gordon Wilson

    Abstract: Physics-inspired neural networks (NNs), such as Hamiltonian or Lagrangian NNs, dramatically outperform other learned dynamics models by leveraging strong inductive biases. These models, however, are challenging to apply to many real world systems, such as those that don't conserve energy or contain contacts, a common setting for robotics and reinforcement learning. In this paper, we examine the in… ▽ More

    Submitted 11 February, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: ICLR 2022. Code available at https://github.com/ngruver/decon-hnn

  12. arXiv:2110.15172  [pdf, other

    cs.LG stat.ML

    Conditioning Sparse Variational Gaussian Processes for Online Decision-making

    Authors: Wesley J. Maddox, Samuel Stanton, Andrew Gordon Wilson

    Abstract: With a principled representation of uncertainty and closed form posterior updates, Gaussian processes (GPs) are a natural choice for online decision making. However, Gaussian processes typically require at least $\mathcal{O}(n^2)$ computations for $n$ training points, limiting their general applicability. Stochastic variational Gaussian processes (SVGPs) can provide scalable inference for a datase… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  13. arXiv:2106.05945  [pdf, other

    cs.LG stat.ML

    Does Knowledge Distillation Really Work?

    Authors: Samuel Stanton, Pavel Izmailov, Polina Kirichenko, Alexander A. Alemi, Andrew Gordon Wilson

    Abstract: Knowledge distillation is a popular technique for training a small student network to emulate a larger teacher model, such as an ensemble of networks. We show that while knowledge distillation can improve student generalization, it does not typically work as it is commonly understood: there often remains a surprisingly large discrepancy between the predictive distributions of the teacher and the s… ▽ More

    Submitted 6 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021. Code available at https://github.com/samuelstanton/gnosis

  14. arXiv:2103.01454  [pdf, other

    stat.ML cs.LG

    Kernel Interpolation for Scalable Online Gaussian Processes

    Authors: Samuel Stanton, Wesley J. Maddox, Ian Delbridge, Andrew Gordon Wilson

    Abstract: Gaussian processes (GPs) provide a gold standard for performance in online settings, such as sample-efficient control and black box optimization, where we need to update a posterior distribution as we acquire data in a sequential fashion. However, updating a GP posterior to accommodate even a single new observation after having observed $n$ points incurs at least $O(n)$ computations in the exact s… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: AISTATS 2021

  15. arXiv:2011.11698  [pdf, other

    astro-ph.EP

    Ultra Short Period Planets in K2 III: Neighbors are Common with 13 New Multi-Planet Systems and 10 Newly Validated Planets in Campaigns 0-8, 10

    Authors: Elisabeth R. Adams, Brian Jackson, Samantha Johnson, David R. Ciardi, William D. Cochran, Michael Endl, Mark E. Everett, Elise Furlan, Steve B. Howell, Prasanna Jayanthi, Phillip J. MacQueen, Rachel A. Matson, Ciera Partyka-Worley, Joshua Schlieder, Nicholas J. Scott, Sevio M. Stanton, Carl Ziegler

    Abstract: Using the EVEREST photometry pipeline, we have identified 74 candidate ultra-short-period planets (orbital period P<1 d) in the first half of the K2 data (Campaigns 0-8 and 10). Of these, 33 candidates have not previously been reported. A systematic search for additional transiting planets found 13 new multi-planet systems, doubling the number known and representing a third (32%) of USPs. We also… ▽ More

    Submitted 19 May, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Accepted to PSJ (5/19/21)

  16. arXiv:2008.12775  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    On the model-based stochastic value gradient for continuous reinforcement learning

    Authors: Brandon Amos, Samuel Stanton, Denis Yarats, Andrew Gordon Wilson

    Abstract: For over a decade, model-based reinforcement learning has been seen as a way to leverage control-based domain knowledge to improve the sample-efficiency of reinforcement learning agents. While model-based agents are conceptually appealing, their policies tend to lag behind those of model-free agents in terms of final reward, especially in non-trivial environments. In response, researchers have pro… ▽ More

    Submitted 27 May, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: L4DC 2021

  17. arXiv:2002.12880  [pdf, other

    stat.ML cs.LG

    Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data

    Authors: Marc Finzi, Samuel Stanton, Pavel Izmailov, Andrew Gordon Wilson

    Abstract: The translation equivariance of convolutional layers enables convolutional neural networks to generalize well on image problems. While translation equivariance provides a powerful inductive bias for images, we often additionally desire equivariance to other transformations, such as rotations, especially for non-image data. We propose a general method to construct a convolutional layer that is equi… ▽ More

    Submitted 24 September, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: ICML 2020. Code available at https://github.com/mfinzi/LieConv