Skip to main content

Showing 1–4 of 4 results for author: Raju, R V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.05364  [pdf, other

    cs.LG cs.AI

    Diffusion Model Predictive Control

    Authors: Guangyao Zhou, Sivaramakrishnan Swaminathan, Rajkumar Vasudeva Raju, J. Swaroop Guntupalli, Wolfgang Lehrach, Joseph Ortiz, Antoine Dedieu, Miguel Lázaro-Gredilla, Kevin Murphy

    Abstract: We propose Diffusion Model Predictive Control (D-MPC), a novel MPC approach that learns a multi-step action proposal and a multi-step dynamics model, both using diffusion models, and combines them for use in online MPC. On the popular D4RL benchmark, we show performance that is significantly better than existing model-based offline planning methods using MPC (e.g. MBOP) and competitive with state-… ▽ More

    Submitted 22 May, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: Published at TMLR

  2. arXiv:2310.03186  [pdf, other

    q-bio.NC cs.AI

    Inferring Inference

    Authors: Rajkumar Vasudeva Raju, Zhe Li, Scott Linderman, Xaq Pitkow

    Abstract: Patterns of microcircuitry suggest that the brain has an array of repeated canonical computational units. Yet neural representations are distributed, so the relevant computations may only be related indirectly to single-neuron transformations. It thus remains an open challenge how to define canonical distributed computations. We integrate normative and algorithmic theories of neural computation in… ▽ More

    Submitted 13 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 26 pages, 4 figures and 1 supplementary figure

  3. arXiv:2307.01201  [pdf, other

    cs.CL cs.AI

    Schema-learning and rebinding as mechanisms of in-context learning and emergence

    Authors: Sivaramakrishnan Swaminathan, Antoine Dedieu, Rajkumar Vasudeva Raju, Murray Shanahan, Miguel Lazaro-Gredilla, Dileep George

    Abstract: In-context learning (ICL) is one of the most powerful and most unexpected capabilities to emerge in recent transformer-based large language models (LLMs). Yet the mechanisms that underlie it are poorly understood. In this paper, we demonstrate that comparable ICL capabilities can be acquired by an alternative sequence prediction learning method using clone-structured causal graphs (CSCGs). Moreove… ▽ More

    Submitted 15 June, 2023; originally announced July 2023.

  4. arXiv:2302.07350  [pdf, other

    cs.AI cs.LG q-bio.NC

    Graph schemas as abstractions for transfer learning, inference, and planning

    Authors: J. Swaroop Guntupalli, Rajkumar Vasudeva Raju, Shrinu Kushagra, Carter Wendelken, Danny Sawyer, Ishan Deshpande, Guangyao Zhou, Miguel Lázaro-Gredilla, Dileep George

    Abstract: Transferring latent structure from one environment or problem to another is a mechanism by which humans and animals generalize with very little data. Inspired by cognitive and neurobiological insights, we propose graph schemas as a mechanism of abstraction for transfer learning. Graph schemas start with latent graph learning where perceptually aliased observations are disambiguated in the latent s… ▽ More

    Submitted 12 December, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 14 pages, 4 figures in main paper, 13 pages and 8 figures in appendix