Skip to main content

Showing 1–4 of 4 results for author: Bencomo, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.20237  [pdf, other

    cs.LG cs.AI

    Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks

    Authors: Gianluca Bencomo, Max Gupta, Ioana Marinescu, R. Thomas McCoy, Thomas L. Griffiths

    Abstract: Artificial neural networks can acquire many aspects of human knowledge from data, making them promising as models of human learning. But what those networks can learn depends upon their inductive biases -- the factors other than the data that influence the solutions they discover -- and the inductive biases of neural networks remain poorly understood, limiting our ability to draw conclusions about… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: 11 pages, 6 figures, 6 tables

  2. arXiv:2405.19420  [pdf, other

    cs.LG cs.AI q-bio.NC

    Learning Human-Aligned Representations with Contrastive Learning and Generative Similarity

    Authors: Raja Marjieh, Sreejan Kumar, Declan Campbell, Liyi Zhang, Gianluca Bencomo, Jake Snell, Thomas L. Griffiths

    Abstract: Humans rely on effective representations to learn from few examples and abstract useful information from sensory data. Inducing such representations in machine learning models has been shown to improve their performance on various benchmarks such as few-shot learning and robustness. However, finding effective training procedures to achieve that goal can be challenging as psychologically rich train… ▽ More

    Submitted 31 January, 2025; v1 submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2311.14601  [pdf, other

    cs.LG cs.NE stat.ML

    A Metalearned Neural Circuit for Nonparametric Bayesian Inference

    Authors: Jake C. Snell, Gianluca Bencomo, Thomas L. Griffiths

    Abstract: Most applications of machine learning to classification assume a closed set of balanced classes. This is at odds with the real world, where class occurrence statistics often follow a long-tailed power-law distribution and it is unlikely that all classes are seen in a single sample. Nonparametric Bayesian models naturally capture this phenomenon, but have significant practical barriers to widesprea… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 13 pages, 3 figures. Code available at https://github.com/jakesnell/neural-circuits

  4. arXiv:2311.10580  [pdf, other

    cs.LG eess.SY stat.ML

    Implicit Maximum a Posteriori Filtering via Adaptive Optimization

    Authors: Gianluca M. Bencomo, Jake C. Snell, Thomas L. Griffiths

    Abstract: Bayesian filtering approximates the true underlying behavior of a time-varying system by inverting an explicit generative model to convert noisy measurements into state estimates. This process typically requires either storage, inversion, and multiplication of large matrices or Monte Carlo estimation, neither of which are practical in high-dimensional state spaces such as the weight spaces of arti… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Under review at ICLR 2024