-
To What Extent do Open-loop and Feedback Nash Equilibria Diverge in General-Sum Linear Quadratic Dynamic Games?
Authors:
Chih-Yuan Chiu,
Jingqi Li,
Maulik Bhatt,
Negar Mehr
Abstract:
Dynamic games offer a versatile framework for modeling the evolving interactions of strategic agents, whose steady-state behavior can be captured by the Nash equilibria of the games. Nash equilibria are often computed in feedback, with policies depending on the state at each time, or in open-loop, with policies depending only on the initial state. Empirically, open-loop Nash equilibria (OLNE) could be more efficient to compute, while feedback Nash equilibria (FBNE) often encode more complex interactions. However, it remains unclear exactly which dynamic games yield FBNE and OLNE that differ significantly and which do not. To address this problem, we present a principled comparison study of OLNE and FBNE in linear quadratic (LQ) dynamic games. Specifically, we prove that the OLNE strategies of an LQ dynamic game can be synthesized by solving the coupled Riccati equations of an auxiliary LQ game with perturbed costs. The construction of the auxiliary game allows us to establish conditions under which OLNE and FBNE coincide and derive an upper bound on the deviation between FBNE and OLNE of an LQ game.
Submitted 21 November, 2024; v1 submitted 17 September, 2024;
originally announced September 2024.
-
RAPID: Autonomous Multi-Agent Racing using Constrained Potential Dynamic Games
Authors:
Yixuan Jia,
Maulik Bhatt,
Negar Mehr
Abstract:
In this work, we consider the problem of autonomous racing with multiple agents where agents must interact closely and influence each other to compete. We model interactions among agents through a game-theoretical framework and propose an efficient algorithm for tractably solving the resulting game in real time. More specifically, we capture interactions among multiple agents through a constrained dynamic game. We show that the resulting dynamic game is an instance of a simple-to-analyze class of games. Namely, we show that our racing game is an instance of a constrained dynamic potential game. An important and appealing property of dynamic potential games is that a generalized Nash equilibrium of the underlying game can be computed by solving a single constrained optimal control problem instead of multiple coupled constrained optimal control problems. Leveraging this property, we show that the problem of autonomous racing is greatly simplified and develop RAPID (autonomous multi-agent RAcing using constrained PotentIal Dynamic games), a racing algorithm that can be solved tractably in real-time. Through simulation studies, we demonstrate that our algorithm outperforms the state-of-the-art approach. We further show the real-time capabilities of our algorithm in hardware experiments.
Submitted 30 April, 2023;
originally announced May 2023.
-
Deep Learning in Ultrasound Elastography Imaging
Authors:
Hongliang Li,
Manish Bhatt,
Zhen Qu,
Shiming Zhang,
Martin C. Hartel,
Ali Khademhosseini,
Guy Cloutier
Abstract:
It is known that changes in the mechanical properties of tissues are associated with the onset and progression of certain diseases. Ultrasound elastography is a technique to characterize tissue stiffness using ultrasound imaging either by measuring tissue strain using quasi-static elastography or natural organ pulsation elastography, or by tracing a propagated shear wave induced by a source or a natural vibration using dynamic elastography. In recent years, deep learning has begun to emerge in ultrasound elastography research. In this review, several common deep learning frameworks in the computer vision community, such as the multilayer perceptron, convolutional neural network, and recurrent neural network, are described. Then, recent advances in ultrasound elastography using such deep learning techniques are revisited in terms of algorithm development and clinical diagnosis. Finally, the current challenges and future developments of deep learning in ultrasound elastography are discussed.
Submitted 31 October, 2020; v1 submitted 14 October, 2020;
originally announced October 2020.
-
Asymptotically Stable Optimal Multi-rate Rigid Body Attitude Estimation based on Lagrange-d'Alembert Principle
Authors:
Maulik Bhatt,
Amit K. Sanyal,
Srikant Sukumar
Abstract:
The rigid body attitude estimation problem is treated using the discrete-time Lagrange-d'Alembert principle. Three different possibilities are considered for the multi-rate relation between angular velocity measurements and direction vector measurements for attitude: 1) integer relation between sampling rates, 2) time-varying sampling rates, 3) non-integer relation between sampling rates. In all cases, it is assumed that angular velocity measurements are sampled at a higher rate compared to the inertial vectors. The attitude determination problem from two or more vector measurements in the body-fixed frame is formulated as Wahba's problem. At instants when direction vector measurements are absent, a discrete-time model for attitude kinematics is used to propagate past measurements. A discrete-time Lagrangian is constructed as the difference between a kinetic energy-like term that is quadratic in the angular velocity estimation error and an artificial potential energy-like term obtained from Wahba's cost function. An additional dissipation term is introduced and the discrete-time Lagrange-d'Alembert principle is applied to the Lagrangian with this dissipation to obtain an optimal filtering scheme. A discrete-time Lyapunov analysis is carried out to show that the optimal filtering scheme is asymptotically stable in the absence of measurement noise and the domain of convergence is almost global. For a realistic evaluation of the scheme, numerical experiments are conducted with inputs corrupted by bounded measurement noise. These numerical simulations exhibit convergence of the estimated states to a bounded neighborhood of the actual states.
Submitted 17 June, 2022; v1 submitted 7 August, 2020;
originally announced August 2020.
-
Rigid Body Geometric Attitude Estimator using Multi-rate Sensors
Authors:
Maulik Bhatt,
Srikant Sukumar,
Amit K. Sanyal
Abstract:
A geometric estimator is proposed for the rigid body attitude under multi-rate measurements using discrete-time Lyapunov stability analysis in this work. The angular velocity measurements are assumed to be sampled at a higher rate compared to the attitude. The attitude determination problem from two or more vector measurements in the body-fixed frame is formulated as Wahba's problem. In the case when measurements are absent, a discrete-time model for attitude kinematics is assumed in order to propagate the measurements. A discrete-time Lyapunov function is constructed as the sum of a kinetic energy-like term that is quadratic in the angular velocity estimation error and an artificial potential energy-like term obtained from Wahba's cost function. A filtering scheme is obtained by discrete-time stability analysis using a suitable Lyapunov function. The analysis shows that the filtering scheme is exponentially stable in the absence of measurement noise and the domain of convergence is almost global. For a realistic evaluation of the scheme, numerical experiments are conducted with inputs corrupted by bounded measurement noise. Simulation results exhibit convergence of the estimated states to a bounded neighborhood of the actual states.
Submitted 16 July, 2020;
originally announced July 2020.
-
Modification of Hilbert's Space-Filling Curve to Avoid Obstacles: A Robotic Path-Planning Strategy
Authors:
Anant A. Joshi,
Maulik C. Bhatt,
Arpita Sinha
Abstract:
This paper addresses the problem of exploring a region using Hilbert's space-filling curve in the presence of obstacles. No prior knowledge of the region being explored is assumed. An online algorithm is proposed that can implement evasive strategies to avoid obstacles comprising one or two blocked unit squares placed side by side and successfully explore the entire region. The strategies are specified by the change in the waypoint array that the robot is going to follow. The fractal nature of Hilbert's space-filling curve has been exploited in proving the validity of the solution.
Submitted 6 October, 2019;
originally announced October 2019.