Showing 1–7 of 7 results for author: Gilra, A

  1. arXiv:2506.06248  [pdf, ps, other]

    cs.LG

    Lagrangian-based Equilibrium Propagation: generalisation to arbitrary boundary conditions & equivalence with Hamiltonian Echo Learning

    Authors: Guillaume Pourcel, Debabrota Basu, Maxence Ernoult, Aditya Gilra

    Abstract: Equilibrium Propagation (EP) is a learning algorithm for training Energy-based Models (EBMs) on static inputs which leverages the variational description of their fixed points. Extending EP to time-varying inputs is a challenging problem, as the variational description must apply to the entire system trajectory rather than just fixed points, and careful consideration of boundary conditions becomes…

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2411.07832  [pdf, other]

    cs.LG stat.ML

    Dynamical-VAE-based Hindsight to Learn the Causal Dynamics of Factored-POMDPs

    Authors: Chao Han, Debabrota Basu, Michael Mangan, Eleni Vasilaki, Aditya Gilra

    Abstract: Learning representations of underlying environmental dynamics from partial observations is a critical challenge in machine learning. In the context of Partially Observable Markov Decision Processes (POMDPs), state representations are often inferred from the history of past observations and actions. We demonstrate that incorporating future information is essential to accurately capture causal dynam…

    Submitted 12 November, 2024; originally announced November 2024.

  3. arXiv:2402.09113  [pdf, other]

    cs.LG

    How does Your RL Agent Explore? An Optimal Transport Analysis of Occupancy Measure Trajectories

    Authors: Reabetswe M. Nkhumise, Debabrota Basu, Tony J. Prescott, Aditya Gilra

    Abstract: The rising successes of RL are propelled by combining smart algorithmic strategies and deep architectures to optimize the distribution of returns and visitations over the state-action space. A quantitative framework to compare the learning processes of these eclectic RL algorithms is currently absent but desired in practice. We address this gap by representing the learning process of an RL algorit…

    Submitted 16 October, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  4. arXiv:2304.04640  [pdf, other]

    cs.AI

    NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems

    Authors: Jason Yik, Korneel Van den Berghe, Douwe den Blanken, Younes Bouhadjar, Maxime Fabre, Paul Hueber, Weijie Ke, Mina A Khoei, Denis Kleyko, Noah Pacik-Nelson, Alessandro Pierro, Philipp Stratmann, Pao-Sheng Vincent Sun, Guangzhi Tang, Shenqi Wang, Biyan Zhou, Soikat Hasan Ahmed, George Vathakkattil Joseph, Benedetto Leto, Aurora Micheli, Anurag Kumar Mishra, Gregor Lenz, Tao Sun, Zergham Ahmed, Mahmoud Akl, et al. (75 additional authors not shown)

    Abstract: Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neu…

    Submitted 14 January, 2025; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: To appear in Nature Neuromorphic Hardware and Computing collection

  5. arXiv:1712.10158  [pdf, other]

    q-bio.NC cs.LG cs.NE eess.SY stat.ML

    Non-linear motor control by local learning in spiking neural networks

    Authors: Aditya Gilra, Wulfram Gerstner

    Abstract: Learning weights in a spiking neural network with hidden neurons, using local, stable and online rules, to control non-linear body dynamics is an open problem. Here, we employ a supervised scheme, Feedback-based Online Local Learning Of Weights (FOLLOW), to train a network of heterogeneous spiking neurons with hidden layers, to control a two-link arm so as to reproduce a desired state trajectory.…

    Submitted 29 December, 2017; originally announced December 2017.

    Journal ref: Proceedings of the 35th International Conference on Machine Learning, PMLR 80:1773-1782, 2018

  6. arXiv:1712.10062  [pdf, other]

    q-bio.NC cs.LG cs.NE stat.ML

    Multi-timescale memory dynamics in a reinforcement learning network with attention-gated memory

    Authors: Marco Martinolli, Wulfram Gerstner, Aditya Gilra

    Abstract: Learning and memory are intertwined in our brain and their relationship is at the core of several recent neural network models. In particular, the Attention-Gated MEmory Tagging model (AuGMEnT) is a reinforcement learning network with an emphasis on biological plausibility of memory dynamics and learning. We find that the AuGMEnT network does not solve some hierarchical tasks, where higher-level s…

    Submitted 28 December, 2017; originally announced December 2017.

    Journal ref: Frontiers in Computational Neuroscience, 12 July 2018 | https://doi.org/10.3389/fncom.2018.00050

  7. arXiv:1702.06463  [pdf, other]

    q-bio.NC cs.LG cs.NE eess.SY

    Predicting non-linear dynamics by stable local learning in a recurrent spiking neural network

    Authors: Aditya Gilra, Wulfram Gerstner

    Abstract: Brains need to predict how the body reacts to motor commands. It is an open question how networks of spiking neurons can learn to reproduce the non-linear body dynamics caused by motor commands, using local, online and stable learning rules. Here, we present a supervised learning scheme for the feedforward and recurrent connections in a network of heterogeneous spiking neurons. The error in the ou…

    Submitted 26 April, 2017; v1 submitted 21 February, 2017; originally announced February 2017.

    Journal ref: eLife 2017;6:e28295