-
Low Frequency Sampling in Model Predictive Path Integral Control
Authors:
Bogdan Vlahov,
Jason Gibson,
David D. Fan,
Patrick Spieler,
Ali-akbar Agha-mohammadi,
Evangelos A. Theodorou
Abstract:
Sampling-based model-predictive controllers have become a powerful optimization tool for planning and control problems in various challenging environments. In this paper, we show how the default choice of uncorrelated Gaussian distributions can be improved upon with the use of a colored noise distribution. Our choice of distribution allows for the emphasis on low frequency control signals, which c…
▽ More
Sampling-based model-predictive controllers have become a powerful optimization tool for planning and control problems in various challenging environments. In this paper, we show how the default choice of uncorrelated Gaussian distributions can be improved upon with the use of a colored noise distribution. Our choice of distribution allows for the emphasis on low frequency control signals, which can result in smoother and more exploratory samples. We use this frequency-based sampling distribution with Model Predictive Path Integral (MPPI) in both hardware and simulation experiments to show better or equal performance on systems with various speeds of input response.
△ Less
Submitted 18 April, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
Deep Learning Tubes for Tube MPC
Authors:
David D. Fan,
Ali-akbar Agha-mohammadi,
Evangelos A. Theodorou
Abstract:
Learning-based control aims to construct models of a system to use for planning or trajectory optimization, e.g. in model-based reinforcement learning. In order to obtain guarantees of safety in this context, uncertainty must be accurately quantified. This uncertainty may come from errors in learning (due to a lack of data, for example), or may be inherent to the system. Propagating uncertainty fo…
▽ More
Learning-based control aims to construct models of a system to use for planning or trajectory optimization, e.g. in model-based reinforcement learning. In order to obtain guarantees of safety in this context, uncertainty must be accurately quantified. This uncertainty may come from errors in learning (due to a lack of data, for example), or may be inherent to the system. Propagating uncertainty forward in learned dynamics models is a difficult problem. In this work we use deep learning to obtain expressive and flexible models of how distributions of trajectories behave, which we then use for nonlinear Model Predictive Control (MPC). We introduce a deep quantile regression framework for control that enforces probabilistic quantile bounds and quantifies epistemic uncertainty. Using our method we explore three different approaches for learning tubes that contain the possible trajectories of the system, and demonstrate how to use each of them in a Tube MPC scheme. We prove these schemes are recursively feasible and satisfy constraints with a desired margin of probability. We present experiments in simulation on a nonlinear quadrotor system, demonstrating the practical efficacy of these ideas.
△ Less
Submitted 4 June, 2020; v1 submitted 4 February, 2020;
originally announced February 2020.
-
Schrödinger Approach to Optimal Control of Large-Size Populations
Authors:
Kaivalya Bakshi,
David D. Fan,
Evangelos A. Theodorou
Abstract:
Large-size populations consisting of a continuum of identical and non-cooperative agents with stochastic dynamics are useful in modeling various biological and engineered systems. This paper addresses the stochastic control problem of designing optimal state-feedback controllers which guarantee the closed-loop stability of the stationary density of such agents with nonlinear Langevin dynamics, und…
▽ More
Large-size populations consisting of a continuum of identical and non-cooperative agents with stochastic dynamics are useful in modeling various biological and engineered systems. This paper addresses the stochastic control problem of designing optimal state-feedback controllers which guarantee the closed-loop stability of the stationary density of such agents with nonlinear Langevin dynamics, under the action of their individual steady state controls. We represent the corresponding coupled forward-backward PDEs as decoupled Schrödinger equations, by applying two variable transforms. Spectral properties of the linear Schrödinger operator which underlie the stability analysis are used to obtain explicit control design constraints. Our interpretation of the Schrödinger potential as the cost function of a closely related optimal control problem motivates a quadrature based algorithm to compute the finite-time optimal control.
△ Less
Submitted 28 March, 2020; v1 submitted 14 October, 2018;
originally announced October 2018.