-
Universal Dynamics with Globally Controlled Analog Quantum Simulators
Authors:
Hong-Ye Hu,
Abigail McClain Gomez,
Liyuan Chen,
Aaron Trowbridge,
Andy J. Goldschmidt,
Zachary Manchester,
Frederic T. Chong,
Arthur Jaffe,
Susanne F. Yelin
Abstract:
Analog quantum simulators with global control fields have emerged as powerful platforms for exploring complex quantum phenomena. Recent breakthroughs, such as the coherent control of thousands of atoms, highlight the growing potential for quantum applications at scale. Despite these advances, a fundamental theoretical question remains unresolved: to what extent can such systems realize universal quantum dynamics under global control? Here we establish a necessary and sufficient condition for universal quantum computation using only global pulse control, proving that a broad class of analog quantum simulators is, in fact, universal. We further extend this framework to fermionic and bosonic systems, including modern platforms such as ultracold atoms in optical superlattices. Crucially, to connect the theoretical possibility with experimental reality, we introduce a new control technique into the experiment - direct quantum optimal control. This method enables the synthesis of complex effective Hamiltonians and allows us to incorporate realistic hardware constraints. To show its practical power, we experimentally engineer three-body interactions outside the blockade regime and demonstrate topological dynamics on a Rydberg atom array. Using the new control framework, we overcome key experimental challenges, including hardware limitations and atom position fluctuations in the non-blockade regime, by identifying smooth, short-duration pulses that achieve high-fidelity dynamics. Experimental measurements reveal dynamical signatures of symmetry-protected-topological edge modes, confirming both the expressivity and feasibility of our approach. Our work opens a new avenue for quantum simulation beyond native hardware Hamiltonians, enabling the engineering of effective multi-body interactions and advancing the frontier of quantum information processing with globally-controlled analog platforms.
Submitted 24 September, 2025; v1 submitted 26 August, 2025;
originally announced August 2025.
-
Multi-IMU Sensor Fusion for Legged Robots
Authors:
Shuo Yang,
Zixin Zhang,
John Z. Zhang,
Ibrahima Sory Sow,
Zachary Manchester
Abstract:
This paper presents a state-estimation solution for legged robots that uses a set of low-cost, compact, and lightweight sensors to achieve low-drift pose and velocity estimation under challenging locomotion conditions. The key idea is to leverage multiple inertial measurement units on different links of the robot to correct a major error source in standard proprioceptive odometry. We fuse the inertial sensor information and joint encoder measurements in an extended Kalman filter, then combine the velocity estimate from this filter with camera data in a factor-graph-based sliding-window estimator to form a visual-inertial-leg odometry method. We validate our state estimator through comprehensive theoretical analysis and hardware experiments performed using real-world robot data collected during a variety of challenging locomotion tasks. Our algorithm consistently achieves minimal position deviation, even in scenarios involving substantial ground impact, foot slippage, and sudden body rotations. A C++ implementation, along with a large-scale dataset, is available at https://github.com/ShuoYangRobotics/Cerberus2.0.
Submitted 15 July, 2025;
originally announced July 2025.
-
Whole-Body Model-Predictive Control of Legged Robots with MuJoCo
Authors:
John Z. Zhang,
Taylor A. Howell,
Zeji Yi,
Chaoyi Pan,
Guanya Shi,
Guannan Qu,
Tom Erez,
Yuval Tassa,
Zachary Manchester
Abstract:
We demonstrate the surprising real-world effectiveness of a very simple approach to whole-body model-predictive control (MPC) of quadruped and humanoid robots: the iterative LQR (iLQR) algorithm with MuJoCo dynamics and finite-difference approximated derivatives. Building upon the previous success of model-based behavior synthesis and control of locomotion and manipulation tasks with MuJoCo in simulation, we show that these policies can easily generalize to the real world with few sim-to-real considerations. Our baseline method achieves real-time whole-body MPC on a variety of hardware experiments, including dynamic quadruped locomotion, quadruped walking on two legs, and full-sized humanoid bipedal locomotion. We hope this easy-to-reproduce hardware baseline lowers the barrier to entry for real-world whole-body MPC research and contributes to accelerating research velocity in the community. Our code and experiment videos will be available online at: https://johnzhang3.github.io/mujoco_ilqr
Submitted 6 March, 2025;
originally announced March 2025.
-
PHODCOS: Pythagorean Hodograph-based Differentiable Coordinate System
Authors:
Jon Arrizabalaga,
Fausto Vega,
Zbyněk ŠÍR,
Zachary Manchester,
Markus Ryll
Abstract:
This paper presents PHODCOS, an algorithm that assigns a moving coordinate system to a given curve. The parametric functions underlying the coordinate system, i.e., the path function, the moving frame and its angular velocity, are exact -- approximation free -- differentiable, and sufficiently continuous. This allows for computing a coordinate system for highly nonlinear curves, while remaining compliant with autonomous navigation algorithms that require first and second order gradient information. In addition, the coordinate system obtained by PHODCOS is fully defined by a finite number of coefficients, which may then be used to compute additional geometric properties of the curve, such as arc-length, curvature, torsion, etc. Therefore, PHODCOS presents an appealing paradigm to enhance the geometrical awareness of existing guidance and navigation on-orbit spacecraft maneuvers. The PHODCOS algorithm is presented alongside an analysis of its error and approximation order, and thus, it is guaranteed that the obtained coordinate system matches the given curve within a desired tolerance. To demonstrate the applicability of the coordinate system resulting from PHODCOS, we present numerical examples in the Near Rectilinear Halo Orbit (NRHO) for the Lunar Gateway.
Submitted 10 October, 2024;
originally announced October 2024.
-
Optimal Attitude Control of Large Flexible Space Structures with Distributed Momentum Actuators
Authors:
Pedro Cachim,
Will Kraus,
Zachary Manchester,
Pedro Lourenco,
Rodrigo Ventura
Abstract:
Recent spacecraft mission concepts propose larger payloads that have lighter, less rigid structures. For large lightweight structures, the natural frequencies of their vibration modes may fall within the attitude controller bandwidth, threatening the stability and settling time of the controller and compromising performance. This work tackles this issue by proposing an attitude control design paradigm of distributing momentum actuators throughout the structure to have more control authority over vibration modes. The issue of jitter disturbances introduced by these actuators is addressed by expanding the bandwidth of the attitude controller to suppress excess vibrations. Numerical simulation results show that, at the expense of more control action, a distributed configuration can achieve lower settling times and reduce structural deformation compared to a more standard centralized configuration.
Submitted 31 October, 2024; v1 submitted 9 October, 2024;
originally announced October 2024.
-
A Universal Formulation for Path-Parametric Planning and Control
Authors:
Jon Arrizabalaga,
Zbyněk ŠÍR,
Zachary Manchester,
Markus Ryll
Abstract:
We present a unified framework for path-parametric planning and control. This formulation is universal as it standardizes the entire spectrum of path-parametric techniques -- from traditional path following to more recent contouring or progress-maximizing Model Predictive Control and Reinforcement Learning -- under a single framework. The ingredients underlying this universality are twofold: First, we present a compact and efficient technique capable of computing singularity-free, smooth and differentiable moving frames. Second, we derive a spatial path parameterization of the Cartesian coordinates for any arbitrary curve without prior assumptions on its parametric speed or moving frame, and that perfectly interplays with the aforementioned path parameterization method. The combination of these two ingredients leads to a planning and control framework that unites existing path-parametric techniques in literature.
Submitted 1 March, 2025; v1 submitted 6 October, 2024;
originally announced October 2024.
-
Building a Better B-Dot: Fast Detumbling with Non-Monotonic Lyapunov Functions
Authors:
Jacob B. Willis,
Paulo R. M. Fisch,
Aleksei Seletskiy,
Zachary Manchester
Abstract:
Spacecraft detumbling with magnetic torque coils is an inherently underactuated control problem. Contemporary and classical magnetorquer detumbling methods do not adequately consider this underactuation, and suffer from poor performance as a result. These controllers can get stuck on an uncontrollable manifold, resulting in long detumbling times and high power consumption. This work presents a novel detumble controller based on a non-monotonic Lyapunov function that predicts the future magnetic field along the satellite's orbit and avoids uncontrollable configurations. In comparison to other controllers in the literature, our controller detumbles a satellite in significantly less time while also converging to lower overall angular momentum. We provide a derivation and proof of convergence for our controller as well as Monte-Carlo simulation results demonstrating its performance in representative use cases.
Submitted 2 July, 2024;
originally announced July 2024.
-
Contingency-Aware Station-Keeping Control of Halo Orbits
Authors:
Fausto Vega,
Zachary Manchester,
Martin Lo,
Ricardo Restrepo
Abstract:
We present an algorithm to perform fuel-optimal stationkeeping for spacecraft in unstable halo orbits with additional constraints to ensure safety in the event of a control failure. We formulate a convex trajectory-optimization problem to generate impulsive spacecraft maneuvers to loosely track a halo orbit using a receding-horizon controller. Our solution also provides a safe exit strategy in the event that propulsion is lost at any point in the mission. We validate our algorithm in simulations of the three-body Earth-Moon and Saturn-Enceladus systems, demonstrating both low total delta-v and a safe contingency plan throughout the mission.
Submitted 30 May, 2024;
originally announced May 2024.
-
A Convex Formulation of the Soft-Capture Problem
Authors:
Ibrahima Sory Sow,
Geordan Gutow,
Howie Choset,
Zachary Manchester
Abstract:
We present a fast trajectory optimization algorithm for the soft capture of uncooperative tumbling space objects. Our algorithm generates safe, dynamically feasible, and minimum-fuel trajectories for a six-degree-of-freedom servicing spacecraft to achieve soft capture (near-zero relative velocity at contact) between predefined locations on the servicer spacecraft and target body. We solve a convex problem by enforcing a convex relaxation of the field-of-view constraint, followed by a sequential convex program correcting the trajectory for collision avoidance. The optimization problems can be solved with a standard second-order cone programming solver, making the algorithm both fast and practical for implementation in flight software. We demonstrate the performance and robustness of our algorithm in simulation over a range of object tumble rates up to 10°/s.
Submitted 1 May, 2024;
originally announced May 2024.
-
Code Generation for Conic Model-Predictive Control on Microcontrollers with TinyMPC
Authors:
Sam Schoedel,
Khai Nguyen,
Elakhya Nedumaran,
Brian Plancher,
Zachary Manchester
Abstract:
Conic constraints appear in many important control applications like legged locomotion, robotic manipulation, and autonomous rocket landing. However, current solvers for conic optimization problems have relatively heavy computational demands in terms of both floating-point operations and memory footprint, making them impractical for use on small embedded devices. We extend TinyMPC, an open-source, high-speed solver targeting low-power embedded control applications, to handle second-order cone constraints. We also present code-generation software to enable deployment of TinyMPC on a variety of microcontrollers. We benchmark our generated code against state-of-the-art embedded QP and SOCP solvers, demonstrating a two-order-of-magnitude speed increase over ECOS while consuming less memory. Finally, we demonstrate TinyMPC's efficacy on the Crazyflie, a lightweight, resource-constrained quadrotor with fast dynamics. TinyMPC and its code-generation tools are publicly available at https://tinympc.org.
Submitted 26 March, 2024;
originally announced March 2024.
-
TinyMPC: Model-Predictive Control on Resource-Constrained Microcontrollers
Authors:
Anoushka Alavilli,
Khai Nguyen,
Sam Schoedel,
Brian Plancher,
Zachary Manchester
Abstract:
Model-predictive control (MPC) is a powerful tool for controlling highly dynamic robotic systems subject to complex constraints. However, MPC is computationally demanding, and is often impractical to implement on small, resource-constrained robotic platforms. We present TinyMPC, a high-speed MPC solver with a low memory footprint targeting the microcontrollers common on small robots. Our approach is based on the alternating direction method of multipliers (ADMM) and leverages the structure of the MPC problem for efficiency. We demonstrate TinyMPC's effectiveness by benchmarking against the state-of-the-art solver OSQP, achieving nearly an order of magnitude speed increase, as well as through hardware experiments on a 27 gram quadrotor, demonstrating high-speed trajectory tracking and dynamic obstacle avoidance. TinyMPC is publicly available at https://tinympc.org.
Submitted 12 August, 2025; v1 submitted 25 October, 2023;
originally announced October 2023.
-
Propulsion-Free Cross-Track Control of a LEO Small-Satellite Constellation with Differential Drag
Authors:
Giusy Falcone,
Jacob B. Willis,
Zachary Manchester
Abstract:
In this work, we achieve propellantless control of both cross-track and along-track separation of a satellite formation by manipulating atmospheric drag. Increasing the differential drag of one satellite with respect to another directly introduces along-track separation, while cross-track separation can be achieved by taking advantage of higher-order terms in the Earth's gravitational field that are functions of altitude. We present an algorithm for solving an n-satellite formation flying problem based on linear programming. We demonstrate this algorithm in a receding-horizon control scheme in the presence of disturbances and modeling errors in a high-fidelity closed-loop orbital dynamics simulation. Our results show that separation distances of hundreds of kilometers can be achieved by a small-satellite formation in low-Earth orbit over a few months.
Submitted 23 June, 2023;
originally announced June 2023.
-
Cerberus: Low-Drift Visual-Inertial-Leg Odometry For Agile Locomotion
Authors:
Shuo Yang,
Zixin Zhang,
Zhengyu Fu,
Zachary Manchester
Abstract:
We present an open-source Visual-Inertial-Leg Odometry (VILO) state estimation solution, Cerberus, for legged robots that estimates position precisely on various terrains in real time using a set of standard sensors, including stereo cameras, IMU, joint encoders, and contact sensors. In addition to estimating robot states, we also perform online kinematic parameter calibration and contact outlier rejection to substantially reduce position drift. Hardware experiments in various indoor and outdoor environments validate that calibrating kinematic parameters within Cerberus can reduce estimation drift to lower than 1% during long-distance, high-speed locomotion. Our drift results are better than those of any other state estimation method using the same set of sensors reported in the literature. Moreover, our state estimator performs well even when the robot is experiencing large impacts and camera occlusion. The implementation of the state estimator, along with the datasets used to compute our results, is available at https://github.com/ShuoYangRobotics/Cerberus.
Submitted 15 September, 2022;
originally announced September 2022.
-
Low-latency Imaging and Inference from LoRa-enabled CubeSats
Authors:
Akshay Gadre,
Swarun Kumar,
Zachary Manchester
Abstract:
Recent years have seen the rapid deployment of low-cost CubeSats in low-Earth orbit, primarily for research, education, and Earth observation. The vast majority of these CubeSats experience significant latency (several hours) from the time an image is captured to the time it is available on the ground. This is primarily due to the limited availability of dedicated satellite ground stations, which tend to be bulky to deploy and expensive to rent. This paper explores using LoRa radios in the ISM band for low-latency downlink communication from CubeSats, primarily due to the availability of extensive ground LoRa infrastructure and minimal interference to terrestrial communication. However, the limited bandwidth of LoRa precludes sending rich satellite Earth images; instead, the CubeSats can at best send short messages (a few hundred bytes).
This paper details our experience in communicating with a LoRa-enabled CubeSat launched by our team. We present Vista, a communication system that makes software modifications to LoRa encoding onboard a CubeSat and decoding on commercial LoRa ground stations to allow for satellite imagery to be communicated, as well as wide-ranging machine learning inference on these images. This is achieved through a LoRa-channel-aware image encoding that is informed by the structure of satellite images, the tasks performed on them, as well as the Doppler variation of satellite signals. A detailed evaluation of Vista through trace-driven emulation with traces from the LoRa-CubeSat launch (in 2021) shows 4.56 dB improvement in LoRa image PSNR and 1.38x improvement in land-use classification over those images.
Submitted 21 June, 2022;
originally announced June 2022.
-
CALIPSO: A Differentiable Solver for Trajectory Optimization with Conic and Complementarity Constraints
Authors:
Taylor A. Howell,
Simon Le Cleac'h,
Kevin Tracy,
Zachary Manchester
Abstract:
We present a new solver for non-convex trajectory optimization problems that is specialized for robotics applications. CALIPSO, or the Conic Augmented Lagrangian Interior-Point SOlver, combines several strategies for constrained numerical optimization to natively handle second-order cones and complementarity constraints. It reliably solves challenging motion-planning problems that include contact-implicit formulations of impacts and Coulomb friction and state-triggered constraints where general-purpose non-convex solvers like SNOPT and Ipopt fail to converge. Additionally, CALIPSO supports efficient differentiation of solutions with respect to problem data, enabling bi-level optimization applications like auto-tuning of feedback policies. Reliable convergence of the solver is demonstrated on a range of problems from manipulation, locomotion, and aerospace domains. An open-source implementation of this solver is available.
Submitted 10 January, 2023; v1 submitted 18 May, 2022;
originally announced May 2022.
-
Trajectory Optimization with Optimization-Based Dynamics
Authors:
Taylor A. Howell,
Simon Le Cleac'h,
Sumeet Singh,
Pete Florence,
Zachary Manchester,
Vikas Sindhwani
Abstract:
We present a framework for bi-level trajectory optimization in which a system's dynamics are encoded as the solution to a constrained optimization problem and smooth gradients of this lower-level problem are passed to an upper-level trajectory optimizer. This optimization-based dynamics representation enables constraint handling, additional variables, and non-smooth behavior to be abstracted away from the upper-level optimizer, and allows classical unconstrained optimizers to synthesize trajectories for more complex systems. We provide an interior-point method for efficient evaluation of constrained dynamics and utilize implicit differentiation to compute smooth gradients of this representation. We demonstrate the framework by modeling systems from the locomotion, aerospace, and manipulation domains with optimization-based dynamics, including an acrobot with joint limits, a cart-pole subject to Coulomb friction, a Raibert hopper, a rocket landing with thrust limits, and a planar-push task, and then optimizing trajectories using iterative LQR.
Submitted 11 January, 2023; v1 submitted 10 September, 2021;
originally announced September 2021.
-
Fast Contact-Implicit Model-Predictive Control
Authors:
Simon Le Cleac'h,
Taylor Howell,
Shuo Yang,
Chi-Yen Lee,
John Zhang,
Arun Bishop,
Mac Schwager,
Zachary Manchester
Abstract:
We present a general approach for controlling robotic systems that make and break contact with their environments. Contact-implicit model predictive control (CI-MPC) generalizes linear MPC to contact-rich settings by utilizing a bi-level planning formulation with lower-level contact dynamics formulated as time-varying linear complementarity problems (LCPs) computed using strategic Taylor approximations about a reference trajectory. These dynamics enable the upper-level planning problem to reason about contact timing and forces, and generate entirely new contact-mode sequences online. To achieve reliable and fast numerical convergence, we devise a structure-exploiting interior-point solver for these LCP contact dynamics and a custom trajectory optimizer for the tracking problem. We demonstrate real-time solution rates for CI-MPC and the ability to generate and track non-periodic behaviours in hardware experiments on a quadrupedal robot. We also show that the controller is robust to model mismatch and can respond to disturbances by discovering and exploiting new contact modes across a variety of robotic systems in simulation, including a pushbot, planar hopper, planar quadruped, and planar biped.
Submitted 6 January, 2023; v1 submitted 12 July, 2021;
originally announced July 2021.
-
Scalable Identification of Partially Observed Systems with Certainty-Equivalent EM
Authors:
Kunal Menda,
Jean de Becdelièvre,
Jayesh K. Gupta,
Ilan Kroo,
Mykel J. Kochenderfer,
Zachary Manchester
Abstract:
System identification is a key step for model-based control, estimator design, and output prediction. This work considers the offline identification of partially observed nonlinear systems. We empirically show that the certainty-equivalent approximation to expectation-maximization can be a reliable and scalable approach for high-dimensional deterministic systems, which are common in robotics. We formulate certainty-equivalent expectation-maximization as block coordinate-ascent, and provide an efficient implementation. The algorithm is tested on a simulated system of coupled Lorenz attractors, demonstrating its ability to identify high-dimensional systems that can be intractable for particle-based approaches. Our approach is also used to identify the dynamics of an aerobatic helicopter. By augmenting the state with unobserved fluid states, a model is learned that predicts the acceleration of the helicopter better than state-of-the-art approaches. The codebase for this work is available at https://github.com/sisl/CEEM.
Submitted 20 June, 2020;
originally announced June 2020.
-
Structured Mechanical Models for Robot Learning and Control
Authors:
Jayesh K. Gupta,
Kunal Menda,
Zachary Manchester,
Mykel J. Kochenderfer
Abstract:
Model-based methods are the dominant paradigm for controlling robotic systems, though their efficacy depends heavily on the accuracy of the model used. Deep neural networks have been used to learn models of robot dynamics from data, but they suffer from data inefficiency and difficulty incorporating prior knowledge. We introduce Structured Mechanical Models, a flexible model class for mechanical systems that is data-efficient, easily amenable to prior knowledge, and easily usable with model-based control techniques. The goal of this work is to demonstrate the benefits of using Structured Mechanical Models in lieu of black-box neural networks when modeling robot dynamics. We demonstrate that they generalize better from limited data and yield more reliable model-based controllers on a variety of simulated robotic domains.
Submitted 21 April, 2020;
originally announced April 2020.
-
A General Framework for Structured Learning of Mechanical Systems
Authors:
Jayesh K. Gupta,
Kunal Menda,
Zachary Manchester,
Mykel J. Kochenderfer
Abstract:
Learning accurate dynamics models is necessary for optimal, compliant control of robotic systems. Current approaches to white-box modeling using analytic parameterizations, or black-box modeling using neural networks, can suffer from high bias or high variance. We address the need for a flexible, gray-box model of mechanical systems that can seamlessly incorporate prior knowledge where it is available, and train expressive function approximators where it is not. We propose to parameterize a mechanical system using neural networks to model its Lagrangian and the generalized forces that act on it. We test our method on a simulated, actuated double pendulum. We show that our method outperforms a naive, black-box model in terms of data-efficiency, as well as performance in model-based reinforcement learning. We also conduct a systematic study of our method's ability to incorporate available prior knowledge about the system to improve data efficiency.
Submitted 1 March, 2019; v1 submitted 22 February, 2019;
originally announced February 2019.