Skip to main content

Showing 1–17 of 17 results for author: Stachenfeld, K

.
  1. arXiv:2506.04289  [pdf, ps, other

    cs.LG q-bio.NC

    Relational reasoning and inductive bias in transformers trained on a transitive inference task

    Authors: Jesse Geerts, Stephanie Chan, Claudia Clopath, Kimberly Stachenfeld

    Abstract: Transformer-based models have demonstrated remarkable reasoning abilities, but the mechanisms underlying relational reasoning in different learning regimes remain poorly understood. In this work, we investigate how transformers perform a classic relational reasoning task from the Psychology literature, \textit{transitive inference}, which requires inference about indirectly related items by integr… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 13 pages, 6 figures

  2. arXiv:2407.20535  [pdf, other

    cs.NE cs.SD eess.AS

    DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs

    Authors: Cynthia R. Steinhardt, Menoua Keshishian, Nima Mesgarani, Kim Stachenfeld

    Abstract: Cochlear implants(CIs) are arguably the most successful neural implant, having restored hearing to over one million people worldwide. While CI research has focused on modeling the cochlear activations in response to low-level acoustic features, we hypothesize that the success of these implants is due in large part to the role of the upstream network in extracting useful features from a degraded si… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: NEURIPS preprint

  3. arXiv:2405.16391  [pdf, other

    cs.LG q-bio.NC

    When does compositional structure yield compositional generalization? A kernel theory

    Authors: Samuel Lippl, Kim Stachenfeld

    Abstract: Compositional generalization (the ability to respond correctly to novel combinations of familiar components) is thought to be a cornerstone of intelligent behavior. Compositionally structured (e.g. disentangled) representations support this ability; however, the conditions under which they are sufficient for the emergence of compositional generalization remain unclear. To address this gap, we pres… ▽ More

    Submitted 8 April, 2025; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: Published at ICLR 2025

  4. arXiv:2405.14045  [pdf, other

    cs.LG cs.CV

    Learning rigid-body simulators over implicit shapes for large-scale scenes and vision

    Authors: Yulia Rubanova, Tatiana Lopez-Guevara, Kelsey R. Allen, William F. Whitney, Kimberly Stachenfeld, Tobias Pfaff

    Abstract: Simulating large scenes with many rigid objects is crucial for a variety of applications, such as robotics, engineering, film and video games. Rigid interactions are notoriously hard to model: small changes to the initial state or the simulation parameters can lead to large changes in the final state. Recently, learned simulators based on graph networks (GNNs) were developed as an alternative to h… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  5. arXiv:2401.11985  [pdf, other

    cs.LG cs.CV cs.RO

    Scaling Face Interaction Graph Networks to Real World Scenes

    Authors: Tatiana Lopez-Guevara, Yulia Rubanova, William F. Whitney, Tobias Pfaff, Kimberly Stachenfeld, Kelsey R. Allen

    Abstract: Accurately simulating real world object dynamics is essential for various applications such as robotics, engineering, graphics, and design. To better capture complex real dynamics such as contact and friction, learned simulators based on graph networks have recently shown great promise. However, applying these learned simulators to real scenes comes with two major challenges: first, scaling learne… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 16 pages, 12 figures

  6. arXiv:2401.06005  [pdf, other

    q-bio.NC cs.AI cs.CV cs.LG

    How does the primate brain combine generative and discriminative computations in vision?

    Authors: Benjamin Peters, James J. DiCarlo, Todd Gureckis, Ralf Haefner, Leyla Isik, Joshua Tenenbaum, Talia Konkle, Thomas Naselaris, Kimberly Stachenfeld, Zenna Tavares, Doris Tsao, Ilker Yildirim, Nikolaus Kriegeskorte

    Abstract: Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remo… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  7. arXiv:2312.05359  [pdf, other

    cs.LG

    Learning 3D Particle-based Simulators from RGB-D Videos

    Authors: William F. Whitney, Tatiana Lopez-Guevara, Tobias Pfaff, Yulia Rubanova, Thomas Kipf, Kimberly Stachenfeld, Kelsey R. Allen

    Abstract: Realistic simulation is critical for applications ranging from robotics to animation. Traditional analytic simulators sometimes struggle to capture sufficiently realistic simulation which can lead to problems including the well known "sim-to-real" gap in robotics. Learned simulators have emerged as an alternative for better capturing real-world physical dynamics, but require access to privileged g… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  8. arXiv:2310.06089  [pdf, other

    cs.AI

    Predictive auxiliary objectives in deep RL mimic learning in the brain

    Authors: Ching Fang, Kimberly L Stachenfeld

    Abstract: The ability to predict upcoming events has been hypothesized to comprise a key aspect of natural and machine cognition. This is supported by trends in deep reinforcement learning (RL), where self-supervised auxiliary objectives such as prediction are widely used to support representation learning and improve task performance. Here, we study the effects predictive auxiliary objectives have on repre… ▽ More

    Submitted 29 October, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  9. arXiv:2309.02040  [pdf, other

    cs.LG cs.AI

    Diffusion Generative Inverse Design

    Authors: Marin Vlastelica, Tatiana López-Guevara, Kelsey Allen, Peter Battaglia, Arnaud Doucet, Kimberley Stachenfeld

    Abstract: Inverse design refers to the problem of optimizing the input of an objective function in order to enact a target outcome. For many real-world engineering problems, the objective function takes the form of a simulator that predicts how the system state will evolve over time, and the design challenge is to optimize the initial conditions that lead to a target outcome. Recent developments in learned… ▽ More

    Submitted 18 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: ICML workshop on Structured Probabilistic Inference & Generative Modeling

  10. arXiv:2305.06160  [pdf

    q-bio.NC

    Neuroscience needs Network Science

    Authors: Dániel L Barabási, Ginestra Bianconi, Ed Bullmore, Mark Burgess, SueYeon Chung, Tina Eliassi-Rad, Dileep George, István A. Kovács, Hernán Makse, Christos Papadimitriou, Thomas E. Nichols, Olaf Sporns, Kim Stachenfeld, Zoltán Toroczkai, Emma K. Towlson, Anthony M Zador, Hongkui Zeng, Albert-László Barabási, Amy Bernard, György Buzsáki

    Abstract: The brain is a complex system comprising a myriad of interacting elements, posing significant challenges in understanding its structure, function, and dynamics. Network science has emerged as a powerful tool for studying such intricate systems, offering a framework for integrating multiscale data and complexity. Here, we discuss the application of network science in the study of the brain, address… ▽ More

    Submitted 11 May, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: 19 pages, 1 figure, 1 box

  11. arXiv:2202.00728  [pdf, other

    cs.LG

    Physical Design using Differentiable Learned Simulators

    Authors: Kelsey R. Allen, Tatiana Lopez-Guevara, Kimberly Stachenfeld, Alvaro Sanchez-Gonzalez, Peter Battaglia, Jessica Hamrick, Tobias Pfaff

    Abstract: Designing physical artifacts that serve a purpose - such as tools and other functional structures - is central to engineering as well as everyday human behavior. Though automating design has tremendous promise, general-purpose methods do not yet exist. Here we explore a simple, fast, and robust approach to inverse design which combines learned forward simulators based on graph neural networks with… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: First three authors contributed equally

  12. arXiv:2112.15275  [pdf, other

    physics.flu-dyn cs.LG physics.comp-ph

    Learned Coarse Models for Efficient Turbulence Simulation

    Authors: Kimberly Stachenfeld, Drummond B. Fielding, Dmitrii Kochkov, Miles Cranmer, Tobias Pfaff, Jonathan Godwin, Can Cui, Shirley Ho, Peter Battaglia, Alvaro Sanchez-Gonzalez

    Abstract: Turbulence simulation with classical numerical solvers requires high-resolution grids to accurately resolve dynamics. Here we train learned simulators at low spatial and temporal resolutions to capture turbulent dynamics generated at high resolution. We show that our proposed model can simulate turbulent dynamics more accurately than classical numerical solvers at the comparably low resolutions ac… ▽ More

    Submitted 22 April, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

    Journal ref: (2022) International Conference on Learning Representations

  13. arXiv:2101.00079  [pdf, other

    stat.ML cs.LG

    Graph Networks with Spectral Message Passing

    Authors: Kimberly Stachenfeld, Jonathan Godwin, Peter Battaglia

    Abstract: Graph Neural Networks (GNNs) are the subject of intense focus by the machine learning community for problems involving relational reasoning. GNNs can be broadly divided into spatial and spectral approaches. Spatial approaches use a form of learned message-passing, in which interactions among vertices are computed locally, and information propagates over longer distances on the graph with greater n… ▽ More

    Submitted 31 December, 2020; originally announced January 2021.

  14. arXiv:1910.14361  [pdf, other

    cs.LG cs.AI stat.ML

    Object-oriented state editing for HRL

    Authors: Victor Bapst, Alvaro Sanchez-Gonzalez, Omar Shams, Kimberly Stachenfeld, Peter W. Battaglia, Satinder Singh, Jessica B. Hamrick

    Abstract: We introduce agents that use object-oriented reasoning to consider alternate states of the world in order to more quickly find solutions to problems. Specifically, a hierarchical controller directs a low-level agent to behave as if objects in the scene were added, deleted, or modified. The actions taken by the controller are defined over a graph-based representation of the scene, with actions corr… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 8 pages; accepted to the Perception as Generative Reasoning workshop of the 33rd Conference on Neural InformationProcessing Systems (NeurIPS 2019)

  15. Probabilistic Successor Representations with Kalman Temporal Differences

    Authors: Jesse P. Geerts, Kimberly L. Stachenfeld, Neil Burgess

    Abstract: The effectiveness of Reinforcement Learning (RL) depends on an animal's ability to assign credit for rewards to the appropriate preceding stimuli. One aspect of understanding the neural underpinnings of this process involves understanding what sorts of stimulus representations support generalisation. The Successor Representation (SR), which enforces generalisation over states that predict similar… ▽ More

    Submitted 6 October, 2019; originally announced October 2019.

    Comments: Conference on Cognitive Computational Neuroscience

  16. arXiv:1904.03177  [pdf, other

    cs.LG cs.AI

    Structured agents for physical construction

    Authors: Victor Bapst, Alvaro Sanchez-Gonzalez, Carl Doersch, Kimberly L. Stachenfeld, Pushmeet Kohli, Peter W. Battaglia, Jessica B. Hamrick

    Abstract: Physical construction---the ability to compose objects, subject to physical dynamics, to serve some function---is fundamental to human intelligence. We introduce a suite of challenging physical construction tasks inspired by how children play with blocks, such as matching a target configuration, stacking blocks to connect objects together, and creating shelter-like structures over target objects.… ▽ More

    Submitted 13 May, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: ICML 2019

  17. arXiv:1806.02215  [pdf, other

    cs.LG cs.AI stat.ML

    Spectral Inference Networks: Unifying Deep and Spectral Learning

    Authors: David Pfau, Stig Petersen, Ashish Agarwal, David G. T. Barrett, Kimberly L. Stachenfeld

    Abstract: We present Spectral Inference Networks, a framework for learning eigenfunctions of linear operators by stochastic optimization. Spectral Inference Networks generalize Slow Feature Analysis to generic symmetric operators, and are closely related to Variational Monte Carlo methods from computational physics. As such, they can be a powerful tool for unsupervised representation learning from video or… ▽ More

    Submitted 16 January, 2020; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: Fixed typo in math in section 4

    Journal ref: Seventh International Conference on Learning Representations (ICLR 2019)