Showing 1–8 of 8 results for author: Benjamin, A S

  1. arXiv:2505.22994  [pdf, ps, other]

    cs.LG cs.NE

    Walking the Weight Manifold: a Topological Approach to Conditioning Inspired by Neuromodulation

    Authors: Ari S. Benjamin, Kyle Daruwalla, Christian Pehle, Anthony M. Zador

    Abstract: One frequently wishes to learn a range of similar tasks as efficiently as possible, re-using knowledge across tasks. In artificial neural networks, this is typically accomplished by conditioning a network upon task context by injecting context as input. Brains have a different strategy: the parameters themselves are modulated as a function of various neuromodulators such as serotonin. Here, we tak…

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 17 pages, 4 figures

  2. arXiv:2503.16511  [pdf, ps, other]

    cs.CL cs.AI

    Token-Level Uncertainty-Aware Objective for Language Model Post-Training

    Authors: Tingkai Liu, Ari S. Benjamin, Anthony M. Zador

    Abstract: In the current work, we connect token-level uncertainty in causal language modeling to two types of training objectives: 1) masked maximum likelihood (MLE), 2) self-distillation. We show that masked MLE is effective in reducing epistemic uncertainty, and serves as an effective token-level automatic curriculum learning technique. However, masked MLE is prone to overfitting and requires self-distilla…

    Submitted 14 March, 2025; originally announced March 2025.

  3. arXiv:2408.17394  [pdf, other]

    cs.LG cs.NE

    Continual learning with the neural tangent ensemble

    Authors: Ari S. Benjamin, Christian Pehle, Kyle Daruwalla

    Abstract: A natural strategy for continual learning is to weigh a Bayesian ensemble of fixed functions. This suggests that if a (single) neural network could be interpreted as an ensemble, one could design effective algorithms that learn without forgetting. To realize this possibility, we observe that a neural network classifier with N parameters can be interpreted as a weighted ensemble of N classifiers, a…

    Submitted 27 February, 2025; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: Presented as a spotlight paper at NeurIPS, 2024

    Journal ref: Neural Information Processing Systems 34 (2024)

  4. arXiv:2106.04540  [pdf, other]

    q-bio.NC cs.AI cs.CV cs.LG cs.NE

    Object Based Attention Through Internal Gating

    Authors: Jordan Lei, Ari S. Benjamin, Konrad P. Kording

    Abstract: Object-based attention is a key component of the visual system, relevant for perception, learning, and memory. Neurons tuned to features of attended objects tend to be more active than those associated with non-attended objects. There is a rich set of models of this phenomenon in computational neuroscience. However, there is currently a divide between models that successfully match physiological d…

    Submitted 8 June, 2021; originally announced June 2021.

  5. arXiv:2006.10811  [pdf, other]

    q-bio.NC cs.NE stat.ML

    Learning to infer in recurrent biological networks

    Authors: Ari S. Benjamin, Konrad P. Kording

    Abstract: A popular theory of perceptual processing holds that the brain learns both a generative model of the world and a paired recognition model using variational Bayesian inference. Most hypotheses of how the brain might learn these models assume that neurons in a population are conditionally independent given their common inputs. This simplification is likely not compatible with the type of local recur…

    Submitted 31 May, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

  6. arXiv:1805.08289  [pdf, other]

    cs.NE cs.LG stat.ML

    Measuring and regularizing networks in function space

    Authors: Ari S. Benjamin, David Rolnick, Konrad Kording

    Abstract: To optimize a neural network one often thinks of optimizing its parameters, but it is ultimately a matter of optimizing the function that maps inputs to outputs. Since a change in the parameters might serve as a poor proxy for the change in the function, it is of some concern that primacy is given to parameters but that the correspondence has not been tested. Here, we show that it is simple and co…

    Submitted 26 June, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: Presented at ICLR 2019

    Journal ref: International Conference on Learning Representations, 2019, https://openreview.net/pdf?id=SkMwpiR9Y7

  7. arXiv:1805.08239  [pdf]

    q-bio.NC cs.LG stat.ML

    The Roles of Supervised Machine Learning in Systems Neuroscience

    Authors: Joshua I. Glaser, Ari S. Benjamin, Roozbeh Farhoodi, Konrad P. Kording

    Abstract: Over the last several years, the use of machine learning (ML) in neuroscience has been rapidly increasing. Here, we review ML's contributions, both realized and potential, across several areas of systems neuroscience. We describe four primary roles of ML within neuroscience: 1) creating solutions to engineering problems, 2) identifying predictive variables, 3) setting benchmarks for simple models…

    Submitted 26 November, 2018; v1 submitted 21 May, 2018; originally announced May 2018.

  8. arXiv:1708.00909  [pdf]

    q-bio.NC cs.LG stat.ML

    Machine learning for neural decoding

    Authors: Joshua I. Glaser, Ari S. Benjamin, Raeed H. Chowdhury, Matthew G. Perich, Lee E. Miller, Konrad P. Kording

    Abstract: Despite rapid advances in machine learning tools, the majority of neural decoding approaches still use traditional methods. Modern machine learning tools, which are versatile and easy to use, have the potential to significantly improve decoding performance. This tutorial describes how to effectively apply these algorithms for typical decoding problems. We provide descriptions, best practices, and…

    Submitted 3 July, 2020; v1 submitted 2 August, 2017; originally announced August 2017.