Skip to main content

Showing 1–12 of 12 results for author: Prajapat, M

.
  1. arXiv:2505.07594  [pdf, other

    eess.SY cs.LG math.OC

    Finite-Sample-Based Reachability for Safe Control with Gaussian Process Dynamics

    Authors: Manish Prajapat, Johannes Köhler, Amon Lahr, Andreas Krause, Melanie N. Zeilinger

    Abstract: Gaussian Process (GP) regression is shown to be effective for learning unknown dynamics, enabling efficient and safety-aware control strategies across diverse applications. However, existing GP-based model predictive control (GP-MPC) methods either rely on approximations, thus lacking guarantees, or are overly conservative, which limits their practical utility. To close this gap, we present a samp… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  2. arXiv:2503.08795  [pdf, other

    eess.SY math.OC

    Stochastic Model Predictive Control for Sub-Gaussian Noise

    Authors: Yunke Ao, Johannes Köhler, Manish Prajapat, Yarden As, Melanie Zeilinger, Philipp Fürnstahl, Andreas Krause

    Abstract: We propose a stochastic Model Predictive Control (MPC) framework that ensures closed-loop chance constraint satisfaction for linear systems with general sub-Gaussian process and measurement noise. By considering sub-Gaussian noise, we can provide guarantees for a large class of distributions, including time-varying distributions. Specifically, we first provide a new characterization of sub-Gaussia… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 15 pages, 6 figures, submitted to Automatica

    MSC Class: 93E20

  3. Performance-driven Constrained Optimal Auto-Tuner for MPC

    Authors: Albert Gassol Puigjaner, Manish Prajapat, Andrea Carron, Andreas Krause, Melanie N. Zeilinger

    Abstract: A key challenge in tuning Model Predictive Control (MPC) cost function parameters is to ensure that the system performance stays consistently above a certain threshold. To address this challenge, we propose a novel method, COAT-MPC, Constrained Optimal Auto-Tuner for MPC. With every tuning iteration, COAT-MPC gathers performance data and learns by updating its posterior belief. It explores the tun… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 8 pages

  4. arXiv:2410.16128  [pdf, other

    cs.AI cs.LG

    SMART: Self-learning Meta-strategy Agent for Reasoning Tasks

    Authors: Rongxing Liu, Kumar Shridhar, Manish Prajapat, Patrick Xia, Mrinmaya Sachan

    Abstract: Tasks requiring deductive reasoning, especially those involving multiple steps, often demand adaptive strategies such as intermediate generation of rationales or programs, as no single approach is universally optimal. While Language Models (LMs) can enhance their outputs through iterative self-refinement and strategy adjustments, they frequently fail to apply the most effective strategy in their f… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  5. arXiv:2409.08616  [pdf, other

    math.OC cs.LG eess.SY

    Towards safe and tractable Gaussian process-based MPC: Efficient sampling within a sequential quadratic programming framework

    Authors: Manish Prajapat, Amon Lahr, Johannes Köhler, Andreas Krause, Melanie N. Zeilinger

    Abstract: Learning uncertain dynamics models using Gaussian process~(GP) regression has been demonstrated to enable high-performance and safety-aware control strategies for challenging real-world applications. Yet, for computational tractability, most approaches for Gaussian process-based model predictive control (GP-MPC) are based on approximations of the reachable set that are either overly conservative o… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: to be published in 63rd IEEE Conference on Decision and Control (CDC 2024)

    ACM Class: G.1.6

  6. arXiv:2407.09905  [pdf, other

    cs.LG

    Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods

    Authors: Riccardo De Santi, Manish Prajapat, Andreas Krause

    Abstract: In classic Reinforcement Learning (RL), the agent maximizes an additive objective of the visited states, e.g., a value function. Unfortunately, objectives of this type cannot model many real-world applications such as experiment design, exploration, imitation learning, and risk-averse RL to name a few. This is due to the fact that additive objectives disregard interactions between states that are… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: ICML 2024

  7. arXiv:2402.06562  [pdf, other

    eess.SY cs.LG cs.RO math.OC

    Safe Guaranteed Exploration for Non-linear Systems

    Authors: Manish Prajapat, Johannes Köhler, Matteo Turchetta, Andreas Krause, Melanie N. Zeilinger

    Abstract: Safely exploring environments with a-priori unknown constraints is a fundamental challenge that restricts the autonomy of robots. While safety is paramount, guarantees on sufficient exploration are also crucial for ensuring autonomous task completion. To address these challenges, we propose a novel safe guaranteed exploration framework using optimal control, which achieves first-of-its-kind result… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  8. arXiv:2307.13372  [pdf, other

    cs.LG

    Submodular Reinforcement Learning

    Authors: Manish Prajapat, Mojmír Mutný, Melanie N. Zeilinger, Andreas Krause

    Abstract: In reinforcement learning (RL), rewards of states are typically considered additive, and following the Markov assumption, they are $\textit{independent}$ of states visited previously. In many important applications, such as coverage control, experiment design and informative path planning, rewards naturally have diminishing returns, i.e., their value decreases in light of similar states visited pr… ▽ More

    Submitted 24 May, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Spotlight paper at ICLR 2024

  9. arXiv:2210.06380  [pdf, other

    cs.LG cs.AI cs.MA cs.RO math.OC

    Near-Optimal Multi-Agent Learning for Safe Coverage Control

    Authors: Manish Prajapat, Matteo Turchetta, Melanie N. Zeilinger, Andreas Krause

    Abstract: In multi-agent coverage control problems, agents navigate their environment to reach locations that maximize the coverage of some density. In practice, the density is rarely known $\textit{a priori}$, further complicating the original NP-hard problem. Moreover, in many applications, agents cannot visit arbitrary locations due to $\textit{a priori}$ unknown safety constraints. In this paper, we aim… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  10. arXiv:2006.10611  [pdf, other

    cs.LG cs.GT cs.MA stat.ML

    Competitive Policy Optimization

    Authors: Manish Prajapat, Kamyar Azizzadenesheli, Alexander Liniger, Yisong Yue, Anima Anandkumar

    Abstract: A core challenge in policy optimization in competitive Markov decision processes is the design of efficient optimization methods with desirable convergence and stability properties. To tackle this, we propose competitive policy optimization (CoPO), a novel policy gradient approach that exploits the game-theoretic nature of competitive games to derive policy updates. Motivated by the competitive gr… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: 11 pages main paper, 6 pages references, and 31 pages appendix. 14 figures

  11. arXiv:1905.05150  [pdf, other

    cs.RO

    AMZ Driverless: The Full Autonomous Racing System

    Authors: Juraj Kabzan, Miguel de la Iglesia Valls, Victor Reijgwart, Hubertus Franciscus Cornelis Hendrikx, Claas Ehmke, Manish Prajapat, Andreas Bühler, Nikhil Gosala, Mehak Gupta, Ramya Sivanesan, Ankit Dhall, Eugenio Chisari, Napat Karnchanachari, Sonja Brits, Manuel Dangel, Inkyu Sa, Renaud Dubé, Abel Gawel, Mark Pfeiffer, Alexander Liniger, John Lygeros, Roland Siegwart

    Abstract: This paper presents the algorithms and system architecture of an autonomous racecar. The introduced vehicle is powered by a software stack designed for robustness, reliability, and extensibility. In order to autonomously race around a previously unknown track, the proposed solution combines state of the art techniques from different fields of robotics. Specifically, perception, estimation, and con… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

    Comments: 40 pages, 32 figures, submitted to Journal of Field Robotics

  12. Redundant Perception and State Estimation for Reliable Autonomous Racing

    Authors: Nikhil Bharadwaj Gosala, Andreas Bühler, Manish Prajapat, Claas Ehmke, Mehak Gupta, Ramya Sivanesan, Abel Gawel, Mark Pfeiffer, Mathias Bürki, Inkyu Sa, Renaud Dubé, Roland Siegwart

    Abstract: In autonomous racing, vehicles operate close to the limits of handling and a sensor failure can have critical consequences. To limit the impact of such failures, this paper presents the redundant perception and state estimation approaches developed for an autonomous race car. Redundancy in perception is achieved by estimating the color and position of the track delimiting objects using two sensor… ▽ More

    Submitted 26 September, 2018; originally announced September 2018.

    Comments: 7 pages, 21 figures, submitted to the International Conference on Robotics and Automation 2019, for accompanying video visit https://www.youtube.com/watch?v=ir_uqEYuT84