Skip to main content

Showing 1–50 of 73 results for author: Fridovich-Keil, D

.
  1. arXiv:2505.14817  [pdf, ps, other

    cs.GT cs.MA

    Cooperative Bargaining Games Without Utilities: Mediated Solutions from Direction Oracles

    Authors: Kushagra Gupta, Surya Murthy, Mustafa O. Karabag, Ufuk Topcu, David Fridovich-Keil

    Abstract: Cooperative bargaining games are widely used to model resource allocation and conflict resolution. Traditional solutions assume the mediator can access agents utility function values and gradients. However, there is an increasing number of settings, such as human AI interactions, where utility values may be inaccessible or incomparable due to unknown, nonaffine transformations. To model such setti… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  2. arXiv:2505.11494  [pdf, ps, other

    cs.RO

    SHIELD: Safety on Humanoids via CBFs In Expectation on Learned Dynamics

    Authors: Lizhi Yang, Blake Werner, Ryan K. Cosner, David Fridovich-Keil, Preston Culbertson, Aaron D. Ames

    Abstract: Robot learning has produced remarkably effective ``black-box'' controllers for complex tasks such as dynamic locomotion on humanoids. Yet ensuring dynamic safety, i.e., constraint satisfaction, remains challenging for such policies. Reinforcement learning (RL) embeds constraints heuristically through reward engineering, and adding or modifying constraints requires retraining. Model-based approache… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Video at https://vimeo.com/1061676063

  3. arXiv:2505.05519  [pdf, other

    cs.CV

    Real-Time Privacy Preservation for Robot Visual Perception

    Authors: Minkyu Choi, Yunhao Yang, Neel P. Bhatt, Kushagra Gupta, Sahil Shah, Aditya Rai, David Fridovich-Keil, Ufuk Topcu, Sandeep P. Chinchali

    Abstract: Many robots (e.g., iRobot's Roomba) operate based on visual observations from live video streams, and such observations may inadvertently include privacy-sensitive objects, such as personal identifiers. Existing approaches for preserving privacy rely on deep learning models, differential privacy, or cryptography. They lack guarantees for the complete concealment of all sensitive objects. Guarantee… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  4. arXiv:2505.01945  [pdf, other

    cs.MA cs.RO

    Act Natural! Extending Naturalistic Projection to Multimodal Behavior Scenarios

    Authors: Hamzah I. Khan, David Fridovich-Keil

    Abstract: Autonomous agents operating in public spaces must consider how their behaviors might affect the humans around them, even when not directly interacting with them. To this end, it is often beneficial to be predictable and appear naturalistic. Existing methods for this purpose use human actor intent modeling or imitation learning techniques, but these approaches rarely capture all possible motivation… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

  5. arXiv:2505.00213  [pdf, ps, other

    cs.RO math.OC

    PSN Game: Game-theoretic Planning via a Player Selection Network

    Authors: Tianyu Qiu, Eric Ouano, Fernando Palafox, Christian Ellis, David Fridovich-Keil

    Abstract: While game-theoretic planning frameworks are effective at modeling multi-agent interactions, they require solving optimization problems with hundreds or thousands of variables, resulting in long computation times that limit their use in large-scale, real-time systems. To address this issue, we propose PSN Game: a novel game-theoretic planning framework that reduces runtime by learning a Player Sel… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

  6. arXiv:2504.16923  [pdf, other

    cs.RO cs.LG eess.SY

    Meta-Learning Online Dynamics Model Adaptation in Off-Road Autonomous Driving

    Authors: Jacob Levy, Jason Gibson, Bogdan Vlahov, Erica Tevere, Evangelos Theodorou, David Fridovich-Keil, Patrick Spieler

    Abstract: High-speed off-road autonomous driving presents unique challenges due to complex, evolving terrain characteristics and the difficulty of accurately modeling terrain-vehicle interactions. While dynamics models used in model-based control can be learned from real-world data, they often struggle to generalize to unseen terrain, making real-time adaptation essential. We propose a novel framework that… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  7. arXiv:2503.18224  [pdf, other

    cs.LG cs.GT

    A Framework for Finding Local Saddle Points in Two-Player Zero-Sum Black-Box Games

    Authors: Shubhankar Agarwal, Hamzah I. Khan, Sandeep P. Chinchali, David Fridovich-Keil

    Abstract: Saddle point optimization is a critical problem employed in numerous real-world applications, including portfolio optimization, generative adversarial networks, and robotics. It has been extensively studied in cases where the objective function is known and differentiable. Existing work in black-box settings with unknown objectives that can only be sampled either assumes convexity-concavity in the… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

  8. arXiv:2503.15486  [pdf, ps, other

    cs.GT eess.SY

    More Information is Not Always Better: Connections between Zero-Sum Local Nash Equilibria in Feedback and Open-Loop Information Patterns

    Authors: Kushagra Gupta, Ross Allen, David Fridovich-Keil, Ufuk Topcu

    Abstract: Non-cooperative dynamic game theory provides a principled approach to modeling sequential decision-making among multiple noncommunicative agents. A key focus has been on finding Nash equilibria in two-agent zero-sum dynamic games under various information structures. A well-known result states that in linear-quadratic games, unique Nash equilibria under feedback and open-loop information structure… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 6 pages

  9. arXiv:2503.13790  [pdf, other

    cs.MA cs.GT

    A Convex Formulation of Game-theoretic Hierarchical Routing

    Authors: Dong Ho Lee, Kaitlyn Donnel, Max Z. Li, David Fridovich-Keil

    Abstract: Hierarchical decision-making is a natural paradigm for coordinating multi-agent systems in complex environments such as air traffic management. In this paper, we present a bilevel framework for game-theoretic hierarchical routing, where a high-level router assigns discrete routes to multiple vehicles who seek to optimize potentially noncooperative objectives that depend upon the assigned routes. T… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  10. arXiv:2503.05696  [pdf, other

    cs.LG cs.AI cs.RO

    Multi-Fidelity Policy Gradient Algorithms

    Authors: Xinjie Liu, Cyrus Neary, Kushagra Gupta, Christian Ellis, Ufuk Topcu, David Fridovich-Keil

    Abstract: Many reinforcement learning (RL) algorithms require large amounts of data, prohibiting their use in applications where frequent interactions with operational systems are infeasible, or high-fidelity simulations are expensive or unavailable. Meanwhile, low-fidelity simulators--such as reduced-order models, heuristic reward functions, or generative world models--can cheaply provide useful data for R… ▽ More

    Submitted 9 April, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

  11. arXiv:2502.03616  [pdf, other

    cs.GT cs.MA

    Noncooperative Equilibrium Selection via a Trading-based Auction

    Authors: Jaehan Im, Filippos Fotiadis, Daniel Delahaye, Ufuk Topcu, David Fridovich-Keil

    Abstract: Noncooperative multi-agent systems often face coordination challenges due to conflicting preferences among agents. In particular, agents acting in their own self-interest can settle on different equilibria, leading to suboptimal outcomes or even safety concerns. We propose an algorithm named trading auction for consensus (TACo), a decentralized approach that enables noncooperative agents to reach… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  12. arXiv:2412.14312  [pdf, other

    cs.LG

    Stealing That Free Lunch: Exposing the Limits of Dyna-Style Reinforcement Learning

    Authors: Brett Barkley, David Fridovich-Keil

    Abstract: Dyna-style off-policy model-based reinforcement learning (DMBRL) algorithms are a family of techniques for generating synthetic state transition data and thereby enhancing the sample efficiency of off-policy RL algorithms. This paper identifies and investigates a surprising performance gap observed when applying DMBRL algorithms across different benchmark environments with proprioceptive observati… ▽ More

    Submitted 20 December, 2024; v1 submitted 18 December, 2024; originally announced December 2024.

  13. arXiv:2412.01114  [pdf, other

    cs.LG

    Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations

    Authors: Cevahir Koprulu, Po-han Li, Tianyu Qiu, Ruihan Zhao, Tyler Westenbroek, David Fridovich-Keil, Sandeep Chinchali, Ufuk Topcu

    Abstract: Many continuous control problems can be formulated as sparse-reward reinforcement learning (RL) tasks. In principle, online RL methods can automatically explore the state space to solve each new task. However, discovering sequences of actions that lead to a non-zero reward becomes exponentially more difficult as the task horizon increases. Manually shaping rewards can accelerate learning for a fix… ▽ More

    Submitted 24 April, 2025; v1 submitted 1 December, 2024; originally announced December 2024.

  14. arXiv:2412.01017  [pdf, other

    cs.RO cs.GT cs.MA eess.SY

    Inferring Short-Sightedness in Dynamic Noncooperative Games

    Authors: Cade Armstrong, Ryan Park, Xinjie Liu, Kushagra Gupta, David Fridovich-Keil

    Abstract: Dynamic game theory is an increasingly popular tool for modeling multi-agent, e.g. human-robot, interactions. Game-theoretic models presume that each agent wishes to minimize a private cost function that depends on others' actions. These games typically evolve over a fixed time horizon, specifying how far into the future each agent plans. In practical settings, however, decision-makers may vary in… ▽ More

    Submitted 15 April, 2025; v1 submitted 1 December, 2024; originally announced December 2024.

  15. arXiv:2410.21447  [pdf, other

    cs.GT cs.MA

    You Can't Always Get What You Want: Games of Ordered Preference

    Authors: Dong Ho Lee, Lasse Peters, David Fridovich-Keil

    Abstract: We study noncooperative games, in which each player's objective is composed of a sequence of ordered- and potentially conflicting-preferences. Problems of this type naturally model a wide variety of scenarios: for example, drivers at a busy intersection must balance the desire to make forward progress with the risk of collision. Mathematically, these problems possess a nested structure, and to beh… ▽ More

    Submitted 21 January, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  16. arXiv:2410.21446  [pdf, other

    cs.GT cs.MA

    Improving DeFi Mechanisms with Dynamic Games and Optimal Control: A Case Study in Stablecoins

    Authors: Nicholas Strohmeyer, Sriram Vishwanath, David Fridovich-Keil

    Abstract: Stablecoins are a class of cryptocurrencies which aim at providing consistency and predictability, typically by pegging the token's value to that of a real world asset. Designing resilient decentralized stablecoins is a challenge, and prominent stablecoins today either (i) give up on decentralization, or (ii) rely on user-owned cryptocurrencies as collateral, exposing the token to exogenous price… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

  17. arXiv:2410.16441  [pdf, other

    cs.GT cs.MA cs.RO eess.SY

    Approximate Feedback Nash Equilibria with Sparse Inter-Agent Dependencies

    Authors: Xinjie Liu, Jingqi Li, Filippos Fotiadis, Mustafa O. Karabag, Jesse Milzman, David Fridovich-Keil, Ufuk Topcu

    Abstract: Feedback Nash equilibrium strategies in multi-agent dynamic games require availability of all players' state information to compute control actions. However, in real-world scenarios, sensing and communication limitations between agents make full state feedback expensive or impractical, and such strategies can become fragile when state information from other agents is inaccurate. To this end, we pr… ▽ More

    Submitted 9 April, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

  18. arXiv:2410.09163  [pdf, other

    cs.RO cs.LG math.OC

    Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models

    Authors: Jacob Levy, Tyler Westenbroek, David Fridovich-Keil

    Abstract: Traditionally, model-based reinforcement learning (MBRL) methods exploit neural networks as flexible function approximators to represent $\textit{a priori}$ unknown environment dynamics. However, training data are typically scarce in practice, and these black-box models often fail to generalize. Modeling architectures that leverage known physics can substantially reduce the complexity of system-id… ▽ More

    Submitted 28 October, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: v2: corrected typos in eqs (1) and (3); add CoRL footnote

  19. arXiv:2410.07409  [pdf, other

    eess.SY cs.LG cs.MA cs.RO

    Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions

    Authors: Isaac Remy, David Fridovich-Keil, Karen Leung

    Abstract: From autonomous driving to package delivery, ensuring safe yet efficient multi-agent interaction is challenging as the interaction dynamics are influenced by hard-to-model factors such as social norms and contextual cues. Understanding these influences can aid in the design and evaluation of socially-aware autonomous agents whose behaviors are aligned with human values. In this work, we seek to co… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 8 pages, 7 figures

  20. arXiv:2406.03565  [pdf, other

    cs.GT cs.MA eess.SY

    Second-Order Algorithms for Finding Local Nash Equilibria in Zero-Sum Games

    Authors: Kushagra Gupta, Xinjie Liu, Ross Allen, Ufuk Topcu, David Fridovich-Keil

    Abstract: Zero-sum games arise in a wide variety of problems, including robust optimization and adversarial learning. However, algorithms deployed for finding a local Nash equilibrium in these games often converge to non-Nash stationary points. This highlights a key challenge: for any algorithm, the stability properties of its underlying dynamical system can cause non-Nash points to be potential attractors.… ▽ More

    Submitted 3 October, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  21. arXiv:2405.19292  [pdf, other

    cs.MA

    Act Natural! Projecting Autonomous System Trajectories Into Naturalistic Behavior Sets

    Authors: Hamzah I. Khan, Adam J. Thorpe, David Fridovich-Keil

    Abstract: Autonomous agents operating around human actors must consider how their behaviors might affect those humans, even when not directly interacting with them. To this end, it is often beneficial to be predictable and appear naturalistic. Existing methods to address this problem use human actor intent modeling or imitation learning techniques, but these approaches rarely capture all possible motivation… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  22. arXiv:2404.02876  [pdf, other

    math.OC

    Sensing Resource Allocation Against Data-Poisoning Attacks in Traffic Routing

    Authors: Yue Yu, Adam J. Thorpe, Jesse Milzman, David Fridovich-Keil, Ufuk Topcu

    Abstract: Data-poisoning attacks can disrupt the efficient operations of transportation systems by misdirecting traffic flows via falsified data. One challenge in countering these attacks is to reduce the uncertainties on the types of attacks, such as the distribution of their targets and intensities. We introduce a resource allocation method in transportation networks to detect and distinguish different ty… ▽ More

    Submitted 10 September, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  23. arXiv:2404.00733  [pdf, other

    cs.GT cs.MA eess.SY

    Smooth Information Gathering in Two-Player Noncooperative Games

    Authors: Fernando Palafox, Jesse Milzman, Dong Ho Lee, Ryan Park, David Fridovich-Keil

    Abstract: We present a mathematical framework for modeling two-player noncooperative games in which one player is uncertain of the other player's costs but can preemptively allocate information-gathering resources to reduce this uncertainty. We refer to the players as the uncertain player (UP) and the certain player (CP), respectively. We obtain UP's decisions by solving a two-stage problem where, in Stage… ▽ More

    Submitted 24 October, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: https://github.com/CLeARoboticsLab/GamesVoI.jl

  24. arXiv:2403.12210  [pdf, other

    eess.SY cs.LG

    Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning

    Authors: Antonio Lopez, David Fridovich-Keil

    Abstract: Recent methods using Reinforcement Learning (RL) have proven to be successful for training intelligent agents in unknown environments. However, RL has not been applied widely in real-world robotics scenarios. This is because current state-of-the-art RL methods require large amounts of data to learn a specific task, leading to unreasonable costs when deploying the agent to collect data in real-worl… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  25. arXiv:2403.10384  [pdf, other

    cs.GT cs.MA eess.SY

    Coordination in Noncooperative Multiplayer Matrix Games via Reduced Rank Correlated Equilibria

    Authors: Jaehan Im, Yue Yu, David Fridovich-Keil, Ufuk Topcu

    Abstract: Coordination in multiplayer games enables players to avoid the lose-lose outcome that often arises at Nash equilibria. However, designing a coordination mechanism typically requires the consideration of the joint actions of all players, which becomes intractable in large-scale games. We develop a novel coordination mechanism, termed reduced rank correlated equilibria, which reduces the number of j… ▽ More

    Submitted 12 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  26. arXiv:2402.08902  [pdf, other

    cs.RO cs.GT cs.LG cs.MA eess.SY

    Auto-Encoding Bayesian Inverse Games

    Authors: Xinjie Liu, Lasse Peters, Javier Alonso-Mora, Ufuk Topcu, David Fridovich-Keil

    Abstract: When multiple agents interact in a common environment, each agent's actions impact others' future decisions, and noncooperative dynamic games naturally capture this coupling. In interactive motion planning, however, agents typically do not have access to a complete model of the game, e.g., due to unknown objectives of other players. Therefore, we consider the inverse game problem, in which some pr… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Journal ref: International Workshop on the Algorithmic Foundations of Robotics 2024 (WAFR)

  27. arXiv:2401.15745  [pdf, other

    math.OC eess.SY

    The computation of approximate feedback Stackelberg equilibria in multi-player nonlinear constrained dynamic games

    Authors: Jingqi Li, Somayeh Sojoudi, Claire Tomlin, David Fridovich-Keil

    Abstract: Solving feedback Stackelberg games with nonlinear dynamics and coupled constraints, a common scenario in practice, presents significant challenges. This work introduces an efficient method for computing approximate local feedback Stackelberg equilibria in multi-player general-sum dynamic games, with continuous state and action spaces. Different from existing (approximate) dynamic programming solut… ▽ More

    Submitted 2 April, 2025; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: This manuscript has been accepted by SIAM Journal on Optimization. In this arxiv version, we fix a typo in equation (4.3), \ell_{T+1}(x_T) -> \ell_{T+1}(x_{T+1}), and a typo in equation (4.7), L_{T+1} -> L_T. All main results are unchanged

  28. arXiv:2311.17008  [pdf, other

    cs.LG

    An Investigation of Time Reversal Symmetry in Reinforcement Learning

    Authors: Brett Barkley, Amy Zhang, David Fridovich-Keil

    Abstract: One of the fundamental challenges associated with reinforcement learning (RL) is that collecting sufficient data can be both time-consuming and expensive. In this paper, we formalize a concept of time reversal symmetry in a Markov decision process (MDP), which builds upon the established structure of dynamically reversible Markov chains (DRMCs) and time-reversibility in classical physics. Specific… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  29. arXiv:2311.09439  [pdf, other

    cs.RO cs.MA

    Learning Hyperplanes for Multi-Agent Collision Avoidance in Space

    Authors: Fernando Palafox, Yue Yu, David Fridovich-Keil

    Abstract: A core challenge of multi-robot interactions is collision avoidance among robots with potentially conflicting objectives. We propose a game-theoretic method for collision avoidance based on rotating hyperplane constraints. These constraints ensure collision avoidance by defining separating hyperplanes that rotate around a keep-out zone centered on certain robots. Since it is challenging to select… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  30. Leadership Inference for Multi-Agent Interactions

    Authors: Hamzah Khan, David Fridovich-Keil

    Abstract: Effectively predicting intent and behavior requires inferring leadership in multi-agent interactions. Dynamic games provide an expressive theoretical framework for modeling these interactions. Employing this framework, we propose a novel method to infer the leader in a two-agent game by observing the agents' behavior in complex, long-horizon interactions. We make two contributions. First, we intro… ▽ More

    Submitted 8 April, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures, accepted to IEEE Robotics and Automation Letters

  31. arXiv:2310.00468  [pdf, ps, other

    cs.GT cs.AI

    When Should a Leader Act Suboptimally? The Role of Inferability in Repeated Stackelberg Games

    Authors: Mustafa O. Karabag, Sophia Smith, Negar Mehr, David Fridovich-Keil, Ufuk Topcu

    Abstract: When interacting with other decision-making agents in non-adversarial scenarios, it is critical for an autonomous agent to have inferable behavior: The agent's actions must convey their intention and strategy. We model the inferability problem using Stackelberg games with observations where a leader and a follower repeatedly interact. During the interactions, the leader uses a fixed mixed strategy… ▽ More

    Submitted 31 May, 2025; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: Extended journal version of the ACC 2024 paper "Encouraging Inferable Behavior for Autonomy: Repeated Bimatrix Stackelberg Games with Observations"

  32. arXiv:2309.11076  [pdf, other

    cs.LG eess.SY

    Symbolic Regression on Sparse and Noisy Data with Gaussian Processes

    Authors: Junette Hsin, Shubhankar Agarwal, Adam Thorpe, Luis Sentis, David Fridovich-Keil

    Abstract: In this paper, we address the challenge of deriving dynamical models from sparse and noisy data. High-quality data is crucial for symbolic regression algorithms; limited and noisy data can present modeling challenges. To overcome this, we combine Gaussian process regression with a sparse identification of nonlinear dynamics (SINDy) method to denoise the data and identify nonlinear dynamical equati… ▽ More

    Submitted 10 October, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: Submitted to ACC 2025

  33. arXiv:2309.10901  [pdf, other

    cs.MA

    Game-theoretic Occlusion-Aware Motion Planning: an Efficient Hybrid-Information Approach

    Authors: Kushagra Gupta, David Fridovich-Keil

    Abstract: We present a novel algorithm for game-theoretic trajectory planning, tailored for settings in which agents can only observe one another in specific regions of the state space. Such problems arise naturally in the context of multi-robot navigation, where occlusions due to environment geometry naturally mask agents' view of one another. In this paper, we formalize these settings as dynamic games wit… ▽ More

    Submitted 16 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Key Words: Dynamic Game Theory, Multi-Agent Motion Planning

  34. arXiv:2309.07504  [pdf, other

    cs.RO cs.AI

    Connected Autonomous Vehicle Motion Planning with Video Predictions from Smart, Self-Supervised Infrastructure

    Authors: Jiankai Sun, Shreyas Kousik, David Fridovich-Keil, Mac Schwager

    Abstract: Connected autonomous vehicles (CAVs) promise to enhance safety, efficiency, and sustainability in urban transportation. However, this is contingent upon a CAV correctly predicting the motion of surrounding agents and planning its own motion safely. Doing so is challenging in complex urban environments due to frequent occlusions and interactions among many agents. One solution is to leverage smart… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)

  35. arXiv:2308.11546  [pdf, other

    math.OC eess.SY

    Risk-Minimizing Two-Player Zero-Sum Stochastic Differential Game via Path Integral Control

    Authors: Apurva Patil, Yujing Zhou, David Fridovich-Keil, Takashi Tanaka

    Abstract: This paper addresses a continuous-time risk-minimizing two-player zero-sum stochastic differential game (SDG), in which each player aims to minimize its probability of failure. Failure occurs in the event when the state of the game enters into predefined undesirable domains, and one player's failure is the other's success. We derive a sufficient condition for this game to have a saddle-point equil… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 8 pages, 4 figures, CDC 2023

  36. arXiv:2308.08017  [pdf, other

    cs.GT cs.LG eess.SY

    Active Inverse Learning in Stackelberg Trajectory Games

    Authors: William Ward, Yue Yu, Jacob Levy, Negar Mehr, David Fridovich-Keil, Ufuk Topcu

    Abstract: Game-theoretic inverse learning is the problem of inferring a player's objectives from their actions. We formulate an inverse learning problem in a Stackelberg game between a leader and a follower, where each player's action is the trajectory of a dynamical system. We propose an active inverse learning method for the leader to infer which hypothesis among a finite set of candidates best describes… ▽ More

    Submitted 11 October, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: 8 pages, 3 figures. Updated previous version to acknowledge funding

  37. arXiv:2307.08168  [pdf, other

    cs.LG cs.RO

    Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models

    Authors: Tyler Westenbroek, Jacob Levy, David Fridovich-Keil

    Abstract: We focus on developing efficient and reliable policy optimization strategies for robot learning with real-world data. In recent years, policy gradient methods have emerged as a promising paradigm for training control policies in simulation. However, these approaches often remain too data inefficient or unreliable to train on real robotic hardware. In this paper we introduce a novel policy gradient… ▽ More

    Submitted 6 November, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

  38. arXiv:2304.05483  [pdf, other

    cs.RO

    Contingency Games for Multi-Agent Interaction

    Authors: Lasse Peters, Andrea Bajcsy, Chih-Yuan Chiu, David Fridovich-Keil, Forrest Laine, Laura Ferranti, Javier Alonso-Mora

    Abstract: Contingency planning, wherein an agent generates a set of possible plans conditioned on the outcome of an uncertain event, is an increasingly popular way for robots to act under uncertainty. In this work we take a game-theoretic perspective on contingency planning, tailored to multi-agent scenarios in which a robot's actions impact the decisions of other agents and vice versa. The resulting contin… ▽ More

    Submitted 21 December, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

  39. arXiv:2304.01945  [pdf, other

    eess.SY

    Scenario-Game ADMM: A Parallelized Scenario-Based Solver for Stochastic Noncooperative Games

    Authors: Jingqi Li, Chih-Yuan Chiu, Lasse Peters, Fernando Palafox, Mustafa Karabag, Javier Alonso-Mora, Somayeh Sojoudi, Claire Tomlin, David Fridovich-Keil

    Abstract: Decision-making in multi-player games can be extremely challenging, particularly under uncertainty. In this work, we propose a new sample-based approximation to a class of stochastic, general-sum, pure Nash games, where each player has an expected-value objective and a set of chance constraints. This new approximation scheme inherits the accuracy of objective approximation from the established sam… ▽ More

    Submitted 5 November, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

  40. arXiv:2304.00163  [pdf, other

    cs.GT cs.LG

    Soft-Bellman Equilibrium in Affine Markov Games: Forward Solutions and Inverse Learning

    Authors: Shenghui Chen, Yue Yu, David Fridovich-Keil, Ufuk Topcu

    Abstract: Markov games model interactions among multiple players in a stochastic, dynamic environment. Each player in a Markov game maximizes its expected total discounted reward, which depends upon the policies of the other players. We formulate a class of Markov games, termed affine Markov games, where an affine reward function couples the players' actions. We introduce a novel solution concept, the soft-… ▽ More

    Submitted 8 September, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

  41. Inferring Occluded Agent Behavior in Dynamic Games from Noise Corrupted Observations

    Authors: Tianyu Qiu, David Fridovich-Keil

    Abstract: In mobile robotics and autonomous driving, it is natural to model agent interactions as the Nash equilibrium of a noncooperative, dynamic game. These methods inherently rely on observations from sensors such as lidars and cameras to identify agents participating in the game and, therefore, have difficulty when some agents are occluded. To address this limitation, this paper presents an occlusion-a… ▽ More

    Submitted 13 July, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

  42. arXiv:2302.01999  [pdf, other

    cs.RO

    Online and Offline Learning of Player Objectives from Partial Observations in Dynamic Games

    Authors: Lasse Peters, Vicenç Rubies-Royo, Claire J. Tomlin, Laura Ferranti, Javier Alonso-Mora, Cyrill Stachniss, David Fridovich-Keil

    Abstract: Robots deployed to the real world must be able to interact with other agents in their environment. Dynamic game theory provides a powerful mathematical framework for modeling scenarios in which agents have individual objectives and interactions evolve over time. However, a key limitation of such techniques is that they require a-priori knowledge of all players' objectives. In this work, we address… ▽ More

    Submitted 14 May, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2106.03611

  43. arXiv:2301.07822  [pdf, other

    eess.SY

    GrAVITree: Graph-based Approximate Value Function In a Tree

    Authors: Patrick H. Washington, David Fridovich-Keil, Mac Schwager

    Abstract: In this paper, we introduce GrAVITree, a tree- and sampling-based algorithm to compute a near-optimal value function and corresponding feedback policy for indefinite time-horizon, terminal state-constrained nonlinear optimal control problems. Our algorithm is suitable for arbitrary nonlinear control systems with both state and input constraints. The algorithm works by sampling feasible control inp… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: 6 pages. 8 figures. Submitted to the 2023 American Control Conference

  44. arXiv:2301.01398  [pdf, other

    cs.MA cs.RO eess.SY

    Cost Inference for Feedback Dynamic Games from Noisy Partial State Observations and Incomplete Trajectories

    Authors: Jingqi Li, Chih-Yuan Chiu, Lasse Peters, Somayeh Sojoudi, Claire Tomlin, David Fridovich-Keil

    Abstract: In multi-agent dynamic games, the Nash equilibrium state trajectory of each agent is determined by its cost function and the information pattern of the game. However, the cost and trajectory of each agent may be unavailable to the other agents. Prior work on using partial observations to infer the costs in dynamic games assumes an open-loop information pattern. In this work, we demonstrate that th… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: Accepted by AAMAS 2023. This is a preprint version

  45. arXiv:2210.01221  [pdf, other

    cs.GT math.OC

    Cost Design in Atomic Routing Games

    Authors: Yue Yu, Shenghui Chen, David Fridovich-Keil, Ufuk Topcu

    Abstract: An atomic routing game is a multiplayer game on a directed graph. Each player in the game chooses a path -- a sequence of links that connect its origin node to its destination node -- with the lowest cost, where the cost of each link is a function of all players' choices. We develop a novel numerical method to design the link cost function in atomic routing games such that the players' choices at… ▽ More

    Submitted 17 May, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  46. arXiv:2209.10802  [pdf, other

    cs.RO cs.LG

    Robust Forecasting for Robotic Control: A Game-Theoretic Approach

    Authors: Shubhankar Agarwal, David Fridovich-Keil, Sandeep P. Chinchali

    Abstract: Modern robots require accurate forecasts to make optimal decisions in the real world. For example, self-driving cars need an accurate forecast of other agents' future actions to plan safe trajectories. Current methods rely heavily on historical time series to accurately predict the future. However, relying entirely on the observed history is problematic since it could be corrupted by noise, have o… ▽ More

    Submitted 4 April, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  47. Alternating Direction Method of Multipliers for Decomposable Saddle-Point Problems

    Authors: Mustafa O. Karabag, David Fridovich-Keil, Ufuk Topcu

    Abstract: Saddle-point problems appear in various settings including machine learning, zero-sum stochastic games, and regression problems. We consider decomposable saddle-point problems and study an extension of the alternating direction method of multipliers to such saddle-point problems. Instead of solving the original saddle-point problem directly, this algorithm solves smaller saddle-point problems by e… ▽ More

    Submitted 27 December, 2022; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: Accepted to 58th Annual Allerton Conference on Communication, Control, and Computing

  48. arXiv:2207.08275  [pdf, other

    cs.GT

    Inverse Matrix Games with Unique Quantal Response Equilibrium

    Authors: Yue Yu, Jonathan Salfity, David Fridovich-Keil, Ufuk Topcu

    Abstract: In an inverse game problem, one needs to infer the cost function of the players in a game such that a desired joint strategy is a Nash equilibrium. We study the inverse game problem for a class of multiplayer matrix games, where the cost perceived by each player is corrupted by random noise. We provide sufficient conditions for the players' quantal response equilibrium -- a generalization of the N… ▽ More

    Submitted 13 October, 2022; v1 submitted 17 July, 2022; originally announced July 2022.

  49. arXiv:2207.06392  [pdf, other

    cs.MA eess.SY

    Relationship Design for Socially-Aware Behavior in Static Games

    Authors: Shenghui Chen, Yigit E. Bayiz, David Fridovich-Keil, Ufuk Topcu

    Abstract: Autonomous agents can adopt socially-aware behaviors to reduce social costs, mimicking the way animals interact in nature and humans in society. We present a new approach to model socially-aware decision-making that includes two key elements: bounded rationality and inter-agent relationships. We capture the interagent relationships by introducing a novel model called a relationship game and encode… ▽ More

    Submitted 25 January, 2024; v1 submitted 13 July, 2022; originally announced July 2022.

  50. arXiv:2205.00291  [pdf, other

    cs.GT cs.MA eess.SY

    Learning Mixed Strategies in Trajectory Games

    Authors: Lasse Peters, David Fridovich-Keil, Laura Ferranti, Cyrill Stachniss, Javier Alonso-Mora, Forrest Laine

    Abstract: In multi-agent settings, game theory is a natural framework for describing the strategic interactions of agents whose objectives depend upon one another's behavior. Trajectory games capture these complex effects by design. In competitive settings, this makes them a more faithful interaction model than traditional "predict then plan" approaches. However, current game-theoretic planning methods have… ▽ More

    Submitted 3 May, 2022; v1 submitted 30 April, 2022; originally announced May 2022.