Skip to main content

Showing 1–11 of 11 results for author: Westenbroek, T

Searching in archive math. Search in all archives.
.
  1. arXiv:2410.09163  [pdf, other

    cs.RO cs.LG math.OC

    Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models

    Authors: Jacob Levy, Tyler Westenbroek, David Fridovich-Keil

    Abstract: Traditionally, model-based reinforcement learning (MBRL) methods exploit neural networks as flexible function approximators to represent $\textit{a priori}$ unknown environment dynamics. However, training data are typically scarce in practice, and these black-box models often fail to generalize. Modeling architectures that leverage known physics can substantially reduce the complexity of system-id… ▽ More

    Submitted 28 October, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: v2: corrected typos in eqs (1) and (3); add CoRL footnote

  2. arXiv:2305.09619  [pdf, other

    cs.LG math.OC stat.ML

    The Power of Learned Locally Linear Models for Nonlinear Policy Optimization

    Authors: Daniel Pfrommer, Max Simchowitz, Tyler Westenbroek, Nikolai Matni, Stephen Tu

    Abstract: A common pipeline in learning-based control is to iteratively estimate a model of system dynamics, and apply a trajectory optimization algorithm - e.g.~$\mathtt{iLQR}$ - on the learned model to minimize a target cost. This paper conducts a rigorous analysis of a simplified variant of this strategy for general nonlinear systems. We analyze an algorithm which iterates between estimating local linear… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  3. arXiv:2204.01986  [pdf, other

    eess.SY math.OC

    On the Computational Consequences of Cost Function Design in Nonlinear Optimal Control

    Authors: Tyler Westenbroek, Anand Siththaranjan, Mohsin Sarwari, Claire J. Tomlin, Shankar S. Sastry

    Abstract: Optimal control is an essential tool for stabilizing complex nonlinear systems. However, despite the extensive impacts of methods such as receding horizon control, dynamic programming and reinforcement learning, the design of cost functions for a particular system often remains a heuristic-driven process of trial and error. In this paper we seek to gain insights into how the choice of cost functio… ▽ More

    Submitted 17 November, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  4. arXiv:2103.15010  [pdf, other

    math.OC cs.LG eess.SY

    On the Stability of Nonlinear Receding Horizon Control: A Geometric Perspective

    Authors: Tyler Westenbroek, Max Simchowitz, Michael I. Jordan, S. Shankar Sastry

    Abstract: %!TEX root = LCSS_main_max.tex The widespread adoption of nonlinear Receding Horizon Control (RHC) strategies by industry has led to more than 30 years of intense research efforts to provide stability guarantees for these methods. However, current theoretical guarantees require that each (generally nonconvex) planning problem can be solved to (approximate) global optimality, which is an unrealis… ▽ More

    Submitted 25 January, 2024; v1 submitted 27 March, 2021; originally announced March 2021.

  5. arXiv:2004.10331  [pdf, other

    math.OC eess.SY

    Learning Min-norm Stabilizing Control Laws for Systems with Unknown Dynamics

    Authors: Tyler Westenbroek, Fernando Castaneda, Ayush Agrawal, S. Shankar Sastry, Koushil Sreenath

    Abstract: This paper introduces a framework for learning a minimum-norm stabilizing controller for a system with unknown dynamics using model-free policy optimization methods. The approach begins by first designing a Control Lyapunov Function (CLF) for a (possibly inaccurate) dynamics model for the system, along with a function which specifies a minimum acceptable rate of energy dissipation for the CLF at d… ▽ More

    Submitted 1 October, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

  6. arXiv:2004.02766  [pdf, other

    cs.LG math.DS math.OC stat.ML

    Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

    Authors: Tyler Westenbroek, Eric Mazumdar, David Fridovich-Keil, Valmik Prabhu, Claire J. Tomlin, S. Shankar Sastry

    Abstract: This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enab… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  7. arXiv:1910.13272  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Feedback Linearization for Unknown Systems via Reinforcement Learning

    Authors: Tyler Westenbroek, David Fridovich-Keil, Eric Mazumdar, Shreyas Arora, Valmik Prabhu, S. Shankar Sastry, Claire J. Tomlin

    Abstract: We present a novel approach to control design for nonlinear systems which leverages model-free policy optimization techniques to learn a linearizing controller for a physical plant with unknown dynamics. Feedback linearization is a technique from nonlinear control which renders the input-output dynamics of a nonlinear plant \emph{linear} under application of an appropriate feedback controller. Onc… ▽ More

    Submitted 21 April, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

  8. arXiv:1903.11781  [pdf, other

    math.DS

    Technical Report: Optimal Control of Piecwise-smooth Control Systems via Singular Perturbations

    Authors: Tyler Westenbroek, Xiaobin Xiong, Aaron D Ames, S Shankar Sastry

    Abstract: This paper investigates optimal control problems formulated over a class of piecewise-smooth vector fields. Instead of optimizing over the discontinuous system directly, we instead formulate optimal control problems over a family of regularizations which are obtained by "smoothing out" the discontinuity in the original system. It is shown that the smooth problems can be used to obtain accurate der… ▽ More

    Submitted 31 March, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

  9. arXiv:1803.08092  [pdf, other

    math.DS

    A New Solution Concept and Family of Relaxations for Hybrid Dynamical Systems

    Authors: Tyler Westenbroek, Humberto Gonzalez, S. Shankar Sastry

    Abstract: We introduce a holistic framework for the analysis, approximation and control of the trajectories of hybrid dynamical systems which display event-triggered discrete jumps in the continuous state. We begin by demonstrating how to explicitly represent the dynamics of this class of systems using a single piecewise-smooth vector field defined on a manifold, and then employ Filippov's solution concept… ▽ More

    Submitted 14 December, 2018; v1 submitted 21 March, 2018; originally announced March 2018.

    Comments: Final Version Appearing in CDC 2018

  10. arXiv:1710.08483  [pdf, other

    math.DS

    On the Relaxation of Hybrid Dynamical Systems

    Authors: Tyler Westenbroek, S. Shankar Sastry, Humberto Gonzalez

    Abstract: Hybrid dynamical systems have proven to be a powerful modeling abstraction, yet fundamental questions regarding the dynamical properties of these systems remain. In this paper, we develop a novel class of relaxations which we use to recover a number of classic systems theoretic properties for hybrid systems, such as existence and uniqueness of trajectories, even past the point of Zeno. Our relaxat… ▽ More

    Submitted 23 October, 2017; originally announced October 2017.

  11. arXiv:1510.09127  [pdf, other

    math.OC

    Optimal Control of Hybrid Systems Using a Feedback Relaxed Control Formulation

    Authors: Tyler Westenbroek, Humberto Gonzalez

    Abstract: We present a numerically tractable formulation for computing the optimal control of the class of hybrid dynamical systems whose trajectories are continuous. Our formulation, an extension of existing relaxed-control techniques for switched dynamical systems, incorporates the domain information of each discrete mode as part of the constraints in the optimization problem. Moreover, our numerical resu… ▽ More

    Submitted 25 May, 2016; v1 submitted 30 October, 2015; originally announced October 2015.