Search | arXiv e-print repository

To What Extent do Open-loop and Feedback Nash Equilibria Diverge in General-Sum Linear Quadratic Dynamic Games?

Authors: Chih-Yuan Chiu, Jingqi Li, Maulik Bhatt, Negar Mehr

Abstract: Dynamic games offer a versatile framework for modeling the evolving interactions of strategic agents, whose steady-state behavior can be captured by the Nash equilibria of the games. Nash equilibria are often computed in feedback, with policies depending on the state at each time, or in open-loop, with policies depending only on the initial state. Empirically, open-loop Nash equilibria (OLNE) coul… ▽ More Dynamic games offer a versatile framework for modeling the evolving interactions of strategic agents, whose steady-state behavior can be captured by the Nash equilibria of the games. Nash equilibria are often computed in feedback, with policies depending on the state at each time, or in open-loop, with policies depending only on the initial state. Empirically, open-loop Nash equilibria (OLNE) could be more efficient to compute, while feedback Nash equilibria (FBNE) often encode more complex interactions. However, it remains unclear exactly which dynamic games yield FBNE and OLNE that differ significantly and which do not. To address this problem, we present a principled comparison study of OLNE and FBNE in linear quadratic (LQ) dynamic games. Specifically, we prove that the OLNE strategies of an LQ dynamic game can be synthesized by solving the coupled Riccati equations of an auxiliary LQ game with perturbed costs. The construction of the auxiliary game allows us to establish conditions under which OLNE and FBNE coincide and derive an upper bound on the deviation between FBNE and OLNE of an LQ game. △ Less

Submitted 21 November, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

arXiv:2305.00579 [pdf, other]

RAPID: Autonomous Multi-Agent Racing using Constrained Potential Dynamic Games

Authors: Yixuan Jia, Maulik Bhatt, Negar Mehr

Abstract: In this work, we consider the problem of autonomous racing with multiple agents where agents must interact closely and influence each other to compete. We model interactions among agents through a game-theoretical framework and propose an efficient algorithm for tractably solving the resulting game in real time. More specifically, we capture interactions among multiple agents through a constrained… ▽ More In this work, we consider the problem of autonomous racing with multiple agents where agents must interact closely and influence each other to compete. We model interactions among agents through a game-theoretical framework and propose an efficient algorithm for tractably solving the resulting game in real time. More specifically, we capture interactions among multiple agents through a constrained dynamic game. We show that the resulting dynamic game is an instance of a simple-to-analyze class of games. Namely, we show that our racing game is an instance of a constrained dynamic potential game. An important and appealing property of dynamic potential games is that a generalized Nash equilibrium of the underlying game can be computed by solving a single constrained optimal control problem instead of multiple coupled constrained optimal control problems. Leveraging this property, we show that the problem of autonomous racing is greatly simplified and develop RAPID (autonomous multi-agent RAcing using constrained PotentIal Dynamic games), a racing algorithm that can be solved tractably in real-time. Through simulation studies, we demonstrate that our algorithm outperforms the state-of-the-art approach. We further show the real-time capabilities of our algorithm in hardware experiments. △ Less

Submitted 30 April, 2023; originally announced May 2023.

Comments: 8 pages

arXiv:2010.07360 [pdf]

Deep Learning in Ultrasound Elastography Imaging

Authors: Hongliang Li, Manish Bhatt, Zhen Qu, Shiming Zhang, Martin C. Hartel, Ali Khademhosseini, Guy Cloutier

Abstract: It is known that changes in the mechanical properties of tissues are associated with the onset and progression of certain diseases. Ultrasound elastography is a technique to characterize tissue stiffness using ultrasound imaging either by measuring tissue strain using quasi-static elastography or natural organ pulsation elastography, or by tracing a propagated shear wave induced by a source or a n… ▽ More It is known that changes in the mechanical properties of tissues are associated with the onset and progression of certain diseases. Ultrasound elastography is a technique to characterize tissue stiffness using ultrasound imaging either by measuring tissue strain using quasi-static elastography or natural organ pulsation elastography, or by tracing a propagated shear wave induced by a source or a natural vibration using dynamic elastography. In recent years, deep learning has begun to emerge in ultrasound elastography research. In this review, several common deep learning frameworks in the computer vision community, such as multilayer perceptron, convolutional neural network, and recurrent neural network are described. Then, recent advances in ultrasound elastography using such deep learning techniques are revisited in terms of algorithm development and clinical diagnosis. Finally, the current challenges and future developments of deep learning in ultrasound elastography are prospected. △ Less

Submitted 31 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

arXiv:2008.04104 [pdf, ps, other]

Asymptotically Stable Optimal Multi-rate Rigid Body Attitude Estimation based on Lagrange-d'Alembert Principle

Authors: Maulik Bhatt, Amit K. Sanyal, Srikant Sukumar

Abstract: The rigid body attitude estimation problem is treated using the discrete-time Lagrange-d'Alembert principle. Three different possibilities are considered for the multi-rate relation between angular velocity measurements and direction vector measurements for attitude: 1) integer relation between sampling rates, 2) time-varying sampling rates, 3) non-integer relation between sampling rates. In all c… ▽ More The rigid body attitude estimation problem is treated using the discrete-time Lagrange-d'Alembert principle. Three different possibilities are considered for the multi-rate relation between angular velocity measurements and direction vector measurements for attitude: 1) integer relation between sampling rates, 2) time-varying sampling rates, 3) non-integer relation between sampling rates. In all cases, it is assumed that angular velocity measurements are sampled at a higher rate compared to the inertial vectors. The attitude determination problem from two or more vector measurements in the body-fixed frame is formulated as Wahba's problem. At instants when direction vector measurements are absent, a discrete-time model for attitude kinematics is used to propagate past measurements. A discrete-time Lagrangian is constructed as the difference between a kinetic energy-like term that is quadratic in the angular velocity estimation error and an artificial potential energy-like term obtained from Wahba's cost function. An additional dissipation term is introduced and the discrete-time Lagrange-d'Alembert principle is applied to the Lagrangian with this dissipation to obtain an optimal filtering scheme. A discrete-time Lyapunov analysis is carried out to show that the optimal filtering scheme is asymptotically stable in the absence of measurement noise and the domain of convergence is almost global. For a realistic evaluation of the scheme, numerical experiments are conducted with inputs corrupted by bounded measurement noise. These numerical simulations exhibit convergence of the estimated states to a bounded neighborhood of the actual states. △ Less

Submitted 17 June, 2022; v1 submitted 7 August, 2020; originally announced August 2020.

Comments: arXiv admin note: substantial text overlap with arXiv:2007.08185

arXiv:2007.08185 [pdf, ps, other]

Rigid Body Geometric Attitude Estimator using Multi-rate Sensors

Authors: Maulik Bhatt, Srikant Sukumar, Amit K. Sanyal

Abstract: A geometric estimator is proposed for the rigid body attitude under multi-rate measurements using discrete-time Lyapunov stability analysis in this work. The angular velocity measurements are assumed to be sampled at a higher rate compared to the attitude. The attitude determination problem from two or more vector measurements in the body-fixed frame is formulated as Wahba's problem. In the case w… ▽ More A geometric estimator is proposed for the rigid body attitude under multi-rate measurements using discrete-time Lyapunov stability analysis in this work. The angular velocity measurements are assumed to be sampled at a higher rate compared to the attitude. The attitude determination problem from two or more vector measurements in the body-fixed frame is formulated as Wahba's problem. In the case when measurements are absent, a discrete-time model for attitude kinematics is assumed in order to propagate the measurements. A discrete-time Lyapunov function is constructed as the sum of a kinetic energy-like term that is quadratic in the angular velocity estimation error and an artificial potential energy-like term obtained from Wahba's cost function. A filtering scheme is obtained by discrete-time stability analysis using a suitable Lyapunov function. The analysis shows that the filtering scheme is exponentially stable in the absence of measurement noise and the domain of convergence is almost global. For a realistic evaluation of the scheme, numerical experiments are conducted with inputs corrupted by bounded measurement noise. Simulation results exhibit convergence of the estimated states to a bounded neighborhood of the actual states. △ Less

Submitted 16 July, 2020; originally announced July 2020.

arXiv:1910.03210 [pdf, other]

doi 10.1109/ICC47138.2019.9123166

Modification of Hilbert's Space-Filling Curve to Avoid Obstacles: A Robotic Path-Planning Strategy

Authors: Anant A. Joshi, Maulik C. Bhatt, Arpita Sinha

Abstract: This paper addresses the problem of exploring a region using the Hilbert's space-filling curve in the presence of obstacles. No prior knowledge of the region being explored is assumed. An online algorithm is proposed which can implement evasive strategies to avoid obstacles comprising a single or two blocked unit squares placed side by side and successfully explore the entire region. The strategie… ▽ More This paper addresses the problem of exploring a region using the Hilbert's space-filling curve in the presence of obstacles. No prior knowledge of the region being explored is assumed. An online algorithm is proposed which can implement evasive strategies to avoid obstacles comprising a single or two blocked unit squares placed side by side and successfully explore the entire region. The strategies are specified by the change in the waypoint array which robot going to follow. The fractal nature of the Hilbert's space-filling curve has been exploited in proving the validity of the solution. △ Less

Submitted 6 October, 2019; originally announced October 2019.

Showing 1–6 of 6 results for author: Bhatt, M