Skip to main content

Showing 1–4 of 4 results for author: Daruwalla, K

.
  1. arXiv:2505.22994  [pdf, ps, other

    cs.LG cs.NE

    Walking the Weight Manifold: a Topological Approach to Conditioning Inspired by Neuromodulation

    Authors: Ari S. Benjamin, Kyle Daruwalla, Christian Pehle, Anthony M. Zador

    Abstract: One frequently wishes to learn a range of similar tasks as efficiently as possible, re-using knowledge across tasks. In artificial neural networks, this is typically accomplished by conditioning a network upon task context by injecting context as input. Brains have a different strategy: the parameters themselves are modulated as a function of various neuromodulators such as serotonin. Here, we tak… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 17 pages, 4 figures

  2. arXiv:2408.17394  [pdf, other

    cs.LG cs.NE

    Continual learning with the neural tangent ensemble

    Authors: Ari S. Benjamin, Christian Pehle, Kyle Daruwalla

    Abstract: A natural strategy for continual learning is to weigh a Bayesian ensemble of fixed functions. This suggests that if a (single) neural network could be interpreted as an ensemble, one could design effective algorithms that learn without forgetting. To realize this possibility, we observe that a neural network classifier with N parameters can be interpreted as a weighted ensemble of N classifiers, a… ▽ More

    Submitted 27 February, 2025; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: Presented as a spotlight paper at NeurIPS, 2024

    Journal ref: Neural Information Processing Systems 34 (2024)

  3. arXiv:2111.13187  [pdf, other

    cs.NE cs.AI cs.LG

    Information Bottleneck-Based Hebbian Learning Rule Naturally Ties Working Memory and Synaptic Updates

    Authors: Kyle Daruwalla, Mikko Lipasti

    Abstract: Artificial neural networks have successfully tackled a large variety of problems by training extremely deep networks via back-propagation. A direct application of back-propagation to spiking neural networks contains biologically implausible components, like the weight transport problem or separate inference and learning phases. Various methods address different components individually, but a compl… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 21 pages, 10 figures, under review

  4. arXiv:2111.12621  [pdf, other

    cs.LG

    Accelerating Deep Learning with Dynamic Data Pruning

    Authors: Ravi S Raju, Kyle Daruwalla, Mikko Lipasti

    Abstract: Deep learning's success has been attributed to the training of large, overparameterized models on massive amounts of data. As this trend continues, model training has become prohibitively costly, requiring access to powerful computing systems to train state-of-the-art networks. A large body of research has been devoted to addressing the cost per iteration of training through various model compress… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 11 pages, 13 figures, under review