Skip to main content

Showing 1–25 of 25 results for author: Gupta, J K

.
  1. arXiv:2405.13063  [pdf, other

    physics.ao-ph cs.LG

    A Foundation Model for the Earth System

    Authors: Cristian Bodnar, Wessel P. Bruinsma, Ana Lucic, Megan Stanley, Anna Vaughan, Johannes Brandstetter, Patrick Garvan, Maik Riechert, Jonathan A. Weyn, Haiyu Dong, Jayesh K. Gupta, Kit Thambiratnam, Alexander T. Archibald, Chun-Chieh Wu, Elizabeth Heider, Max Welling, Richard E. Turner, Paris Perdikaris

    Abstract: Reliable forecasts of the Earth system are crucial for human progress and safety from natural disasters. Artificial intelligence offers substantial potential to improve prediction accuracy and computational efficiency in this field, however this remains underexplored in many domains. Here we introduce Aurora, a large-scale foundation model for the Earth system trained on over a million hours of di… ▽ More

    Submitted 21 November, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2310.02437  [pdf, other

    cs.CV

    EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields

    Authors: Anish Bhattacharya, Ratnesh Madaan, Fernando Cladera, Sai Vemprala, Rogerio Bonatti, Kostas Daniilidis, Ashish Kapoor, Vijay Kumar, Nikolai Matni, Jayesh K. Gupta

    Abstract: We present EvDNeRF, a pipeline for generating event data and training an event-based dynamic NeRF, for the purpose of faithfully reconstructing eventstreams on scenes with rigid and non-rigid deformations that may be too fast to capture with a standard camera. Event cameras register asynchronous per-pixel brightness changes at MHz rates with high dynamic range, making them ideal for observing fast… ▽ More

    Submitted 6 December, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 16 pages, 20 figures, 2 tables

  3. arXiv:2302.06594  [pdf, other

    cs.LG cs.AI cs.CV

    Geometric Clifford Algebra Networks

    Authors: David Ruhe, Jayesh K. Gupta, Steven de Keninck, Max Welling, Johannes Brandstetter

    Abstract: We propose Geometric Clifford Algebra Networks (GCANs) for modeling dynamical systems. GCANs are based on symmetry group transformations using geometric (Clifford) algebras. We first review the quintessence of modern (plane-based) geometric algebra, which builds on isometries encoded as elements of the $\mathrm{Pin}(p,q,r)$ group. We then propose the concept of group action layers, which linearly… ▽ More

    Submitted 29 May, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

  4. arXiv:2301.10343  [pdf, other

    cs.LG cs.AI

    ClimaX: A foundation model for weather and climate

    Authors: Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover

    Abstract: Most state-of-the-art approaches for weather and climate modeling are based on physics-informed numerical models of the atmosphere. These approaches aim to model the non-linear dynamics and complex interactions between multiple variables, which are challenging to approximate. Additionally, many such numerical models are computationally intensive, especially when modeling the atmospheric phenomenon… ▽ More

    Submitted 18 December, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

    Comments: International Conference on Machine Learning 2023

  5. arXiv:2210.17540  [pdf, other

    cs.LG cs.MA

    Agent-Time Attention for Sparse Rewards Multi-Agent Reinforcement Learning

    Authors: Jennifer She, Jayesh K. Gupta, Mykel J. Kochenderfer

    Abstract: Sparse and delayed rewards pose a challenge to single agent reinforcement learning. This challenge is amplified in multi-agent reinforcement learning (MARL) where credit assignment of these rewards needs to happen not only across time, but also across agents. We propose Agent-Time Attention (ATA), a neural network model with auxiliary losses for redistributing sparse and delayed rewards in collabo… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Full version of the Extended Abstract accepted at the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2022

  6. arXiv:2210.16294  [pdf

    cs.LG cs.MA

    Learning Modular Simulations for Homogeneous Systems

    Authors: Jayesh K. Gupta, Sai Vemprala, Ashish Kapoor

    Abstract: Complex systems are often decomposed into modular subsystems for engineering tractability. Although various equation based white-box modeling techniques make use of such structure, learning based methods have yet to incorporate these ideas broadly. We present a modular simulation framework for modeling homogeneous multibody dynamical systems, which combines ideas from graph neural networks and neu… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: First two authors contributed equally. Accepted at NeurIPS 2022

  7. arXiv:2209.15616  [pdf, other

    cs.LG cs.CV

    Towards Multi-spatiotemporal-scale Generalized PDE Modeling

    Authors: Jayesh K. Gupta, Johannes Brandstetter

    Abstract: Partial differential equations (PDEs) are central to describing complex physical system simulations. Their expensive solution techniques have led to an increased interest in deep neural network based surrogates. However, the practical utility of training such surrogates is contingent on their ability to model complex multi-scale spatio-temporal phenomena. Various neural network architectures have… ▽ More

    Submitted 15 November, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

  8. arXiv:2209.10986  [pdf, other

    cs.RO cs.CV

    Learning to Simulate Realistic LiDARs

    Authors: Benoit Guillard, Sai Vemprala, Jayesh K. Gupta, Ondrej Miksik, Vibhav Vineet, Pascal Fua, Ashish Kapoor

    Abstract: Simulating realistic sensors is a challenging part in data generation for autonomous systems, often involving carefully handcrafted sensor design, scene properties, and physics modeling. To alleviate this, we introduce a pipeline for data-driven simulation of a realistic LiDAR sensor. We propose a model that learns a mapping between RGB images and corresponding LiDAR features such as raydrop or pe… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: IROS2022 paper

  9. arXiv:2209.04934  [pdf, other

    cs.LG cs.CV physics.flu-dyn

    Clifford Neural Layers for PDE Modeling

    Authors: Johannes Brandstetter, Rianne van den Berg, Max Welling, Jayesh K. Gupta

    Abstract: Partial differential equations (PDEs) see widespread use in sciences and engineering to describe simulation of physical processes as scalar and vector fields interacting and coevolving over time. Due to the computationally expensive nature of their standard solution methods, neural PDE surrogates have become an active research topic to accelerate these simulations. However, current methods do not… ▽ More

    Submitted 2 March, 2023; v1 submitted 8 September, 2022; originally announced September 2022.

    Comments: Accepted at ICLR-2023

  10. arXiv:2203.15788  [pdf, other

    cs.RO

    COMPASS: Contrastive Multimodal Pretraining for Autonomous Systems

    Authors: Shuang Ma, Sai Vemprala, Wenshan Wang, Jayesh K. Gupta, Yale Song, Daniel McDuff, Ashish Kapoor

    Abstract: Learning representations that generalize across tasks and domains is challenging yet necessary for autonomous systems. Although task-driven approaches are appealing, designing models specific to each application can be difficult in the face of limited data, especially when dealing with highly variable multimodal input spaces arising from different tasks in different environments.We introduce the f… ▽ More

    Submitted 19 February, 2022; originally announced March 2022.

  11. arXiv:2203.02844  [pdf, other

    cs.LG cs.AI cs.MA

    Recursive Reasoning Graph for Multi-Agent Reinforcement Learning

    Authors: Xiaobai Ma, David Isele, Jayesh K. Gupta, Kikuo Fujimura, Mykel J. Kochenderfer

    Abstract: Multi-agent reinforcement learning (MARL) provides an efficient way for simultaneously learning policies for multiple agents interacting with each other. However, in scenarios requiring complex interactions, existing algorithms can suffer from an inability to accurately anticipate the influence of self-actions on other agents. Incorporating an ability to reason about other agents' potential respon… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

    Comments: AAAI 2022

  12. arXiv:2105.01811  [pdf, other

    cs.RO cs.LG

    Training Structured Mechanical Models by Minimizing Discrete Euler-Lagrange Residual

    Authors: Kunal Menda, Jayesh K. Gupta, Zachary Manchester, Mykel J. Kochenderfer

    Abstract: Model-based paradigms for decision-making and control are becoming ubiquitous in robotics. They rely on the ability to efficiently learn a model of the system from data. Structured Mechanical Models (SMMs) are a data-efficient black-box parameterization of mechanical systems, typically fit to data by minimizing the error between predicted and observed accelerations or next states. In this work, we… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

  13. arXiv:2101.04788  [pdf, other

    cs.AI cs.MA

    Scalable Anytime Planning for Multi-Agent MDPs

    Authors: Shushman Choudhury, Jayesh K. Gupta, Peter Morales, Mykel J. Kochenderfer

    Abstract: We present a scalable tree search planning algorithm for large multi-agent sequential decision problems that require dynamic collaboration. Teams of agents need to coordinate decisions in many domains, but naive approaches fail due to the exponential growth of the joint action space with the number of agents. We circumvent this complexity through an anytime approach that allows us to trade computa… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: First two authors contributed equally. Accepted at AAMAS 2021

  14. arXiv:2006.11615  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Scalable Identification of Partially Observed Systems with Certainty-Equivalent EM

    Authors: Kunal Menda, Jean de Becdelièvre, Jayesh K. Gupta, Ilan Kroo, Mykel J. Kochenderfer, Zachary Manchester

    Abstract: System identification is a key step for model-based control, estimator design, and output prediction. This work considers the offline identification of partially observed nonlinear systems. We empirically show that the certainty-equivalent approximation to expectation-maximization can be a reliable and scalable approach for high-dimensional deterministic systems, which are common in robotics. We f… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

    Comments: First three authors contributed equally. Accepted at ICML 2020. Website: https://sites.google.com/stanford.edu/ceem/

  15. arXiv:2006.11438  [pdf, other

    cs.LG cs.AI cs.MA

    Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning

    Authors: Sheng Li, Jayesh K. Gupta, Peter Morales, Ross Allen, Mykel J. Kochenderfer

    Abstract: Multi-agent reinforcement learning (MARL) requires coordination to efficiently solve certain tasks. Fully centralized control is often infeasible in such domains due to the size of joint action spaces. Coordination graph based formalization allows reasoning about the joint action based on the structure of interactions. However, they often require domain expertise in their design. This paper introd… ▽ More

    Submitted 3 February, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

  16. arXiv:2005.13109  [pdf, other

    cs.RO cs.AI

    Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal Constraints

    Authors: Shushman Choudhury, Jayesh K. Gupta, Mykel J. Kochenderfer, Dorsa Sadigh, Jeannette Bohg

    Abstract: We consider the problem of dynamically allocating tasks to multiple agents under time window constraints and task completion uncertainty. Our objective is to minimize the number of unsuccessful tasks at the end of the operation horizon. We present a multi-robot allocation algorithm that decouples the key computational challenges of sequential decision-making under uncertainty and multi-agent coord… ▽ More

    Submitted 25 July, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: Robotics Science and Systems (RSS) 2020; Source code at https://github.com/sisl/SCoBA.jl

  17. arXiv:2004.10301  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Structured Mechanical Models for Robot Learning and Control

    Authors: Jayesh K. Gupta, Kunal Menda, Zachary Manchester, Mykel J. Kochenderfer

    Abstract: Model-based methods are the dominant paradigm for controlling robotic systems, though their efficacy depends heavily on the accuracy of the model used. Deep neural networks have been used to learn models of robot dynamics from data, but they suffer from data-inefficiency and the difficulty to incorporate prior knowledge. We introduce Structured Mechanical Models, a flexible model class for mechani… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: First two authors contributed equally. Accepted at L4DC2020. Source code and videos at https://sites.google.com/stanford.edu/smm/

  18. arXiv:1908.01022  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning

    Authors: Ross E. Allen, Jayesh K. Gupta, Jaime Pena, Yutai Zhou, Javona White Bear, Mykel J. Kochenderfer

    Abstract: This paper proposes a definition of system health in the context of multiple agents optimizing a joint reward function. We use this definition as a credit assignment term in a policy gradient algorithm to distinguish the contributions of individual agents to the global reward. The health-informed credit assignment is then extended to a multi-agent variant of the proximal policy optimization algori… ▽ More

    Submitted 4 January, 2021; v1 submitted 2 August, 2019; originally announced August 2019.

  19. arXiv:1903.05766  [pdf, other

    cs.MA cs.AI cs.LG

    Simulating Emergent Properties of Human Driving Behavior Using Multi-Agent Reward Augmented Imitation Learning

    Authors: Raunak P. Bhattacharyya, Derek J. Phillips, Changliu Liu, Jayesh K. Gupta, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: Recent developments in multi-agent imitation learning have shown promising results for modeling the behavior of human drivers. However, it is challenging to capture emergent traffic behaviors that are observed in real-world datasets. Such behaviors arise due to the many local interactions between agents that are not commonly accounted for in imitation learning. This paper proposes Reward Augmented… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: Accepted for publication at ICRA 2019

  20. arXiv:1903.01567  [pdf, other

    cs.LG cs.AI cs.NE

    Model Primitive Hierarchical Lifelong Reinforcement Learning

    Authors: Bohan Wu, Jayesh K. Gupta, Mykel J. Kochenderfer

    Abstract: Learning interpretable and transferable subpolicies and performing task decomposition from a single, complex task is difficult. Some traditional hierarchical reinforcement learning techniques enforce this decomposition in a top-down manner, while meta-learning techniques require a task distribution at hand to learn such decompositions. This paper presents a framework for using diverse suboptimal w… ▽ More

    Submitted 4 March, 2019; originally announced March 2019.

    Comments: 9 pages, 10 figures. Accepted as a full paper at AAMAS 2019

    Journal ref: International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019)

  21. arXiv:1902.08705  [pdf, ps, other

    cs.RO cs.AI cs.LG eess.SY

    A General Framework for Structured Learning of Mechanical Systems

    Authors: Jayesh K. Gupta, Kunal Menda, Zachary Manchester, Mykel J. Kochenderfer

    Abstract: Learning accurate dynamics models is necessary for optimal, compliant control of robotic systems. Current approaches to white-box modeling using analytic parameterizations, or black-box modeling using neural networks, can suffer from high bias or high variance. We address the need for a flexible, gray-box model of mechanical systems that can seamlessly incorporate prior knowledge where it is avail… ▽ More

    Submitted 1 March, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: 10 pages, 7 figures. First two authors contributed equally. Submitted to IROS/RA-L. Code at https://github.com/sisl/mechamodlearn/

  22. arXiv:1806.06464  [pdf, other

    cs.MA cs.AI cs.LG cs.NE stat.ML

    Learning Policy Representations in Multiagent Systems

    Authors: Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yura Burda, Harrison Edwards

    Abstract: Modeling agent behavior is central to understanding the emergence of complex phenomena in multiagent systems. Prior work in agent modeling has largely been task-specific and driven by hand-engineering domain-specific prior knowledge. We propose a general learning framework for modeling agent behavior in any multiagent system using only a handful of interaction data. Our framework casts agent model… ▽ More

    Submitted 31 July, 2018; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  23. Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures

    Authors: John Mern, Jayesh K Gupta, Mykel Kochenderfer

    Abstract: Deep artificial neural networks (ANNs) can represent a wide range of complex functions. Implementing ANNs in Von Neumann computing systems, though, incurs a high energy cost due to the bottleneck created between CPU and memory. Implementation on neuromorphic systems may help to reduce energy demand. Conventional ANNs must be converted into equivalent Spiking Neural Networks (SNNs) in order to be d… ▽ More

    Submitted 19 February, 2018; originally announced February 2018.

    Comments: Submitted to IEEE Symposium Series on Computational Intelligence (SSCI) 2017

  24. arXiv:1605.08478  [pdf, other

    cs.LG cs.AI

    Model-Free Imitation Learning with Policy Optimization

    Authors: Jonathan Ho, Jayesh K. Gupta, Stefano Ermon

    Abstract: In imitation learning, an agent learns how to behave in an environment with an unknown cost function by mimicking expert demonstrations. Existing imitation learning algorithms typically involve solving a sequence of planning or reinforcement learning problems. Such algorithms are therefore not directly applicable to large, high-dimensional environments, and their performance can significantly degr… ▽ More

    Submitted 26 May, 2016; originally announced May 2016.

    Comments: In Proceedings of the 33rd International Conference on Machine Learning, 2016

    Journal ref: JMLR W&CP 48 (2016) 2760-2769

  25. arXiv:1406.2616  [pdf, other

    cs.RO cs.AI cs.LG

    PlanIt: A Crowdsourcing Approach for Learning to Plan Paths from Large Scale Preference Feedback

    Authors: Ashesh Jain, Debarghya Das, Jayesh K Gupta, Ashutosh Saxena

    Abstract: We consider the problem of learning user preferences over robot trajectories for environments rich in objects and humans. This is challenging because the criterion defining a good trajectory varies with users, tasks and interactions in the environment. We represent trajectory preferences using a cost function that the robot learns and uses it to generate good trajectories in new environments. We d… ▽ More

    Submitted 5 January, 2016; v1 submitted 10 June, 2014; originally announced June 2014.

    Comments: PlanIt Camera Ready ICRA'15