
Showing 1–50 of 73 results for author: Theodorou, E A

Searching in archive cs.
  1. arXiv:2506.10168  [pdf, ps, other]

    stat.ML cs.LG

    Momentum Multi-Marginal Schrödinger Bridge Matching

    Authors: Panagiotis Theodoropoulos, Augustinos D. Saravanos, Evangelos A. Theodorou, Guan-Horng Liu

    Abstract: Understanding complex systems by inferring trajectories from sparse sample snapshots is a fundamental challenge in a wide range of domains, e.g., single-cell biology, meteorology, and economics. Despite advancements in Bridge and Flow matching frameworks, current methodologies rely on pairwise interpolation between adjacent snapshots. This hinders their ability to capture long-range temporal depen…

    Submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2504.15453  [pdf, other]

    eess.SY cs.RO

    Nearly Optimal Nonlinear Safe Control with BaS-SDRE

    Authors: Hassan Almubarak, Maitham F. AL-Sunni, Justin T. Dubbin, Nader Sadegh, John M. Dolan, Evangelos A. Theodorou

    Abstract: The State-Dependent Riccati Equation (SDRE) approach has emerged as a systematic and effective means of designing nearly optimal nonlinear controllers. The Barrier States (BaS) embedding methodology was developed recently for safe multi-objective controls in which the safety condition is manifested as a state to be controlled along with other states of the system. The overall system, termed the sa…

    Submitted 21 April, 2025; originally announced April 2025.

  3. arXiv:2504.04605  [pdf, other]

    eess.SY cs.RO math.OC

    Nonlinear Robust Optimization for Planning and Control

    Authors: Arshiya Taj Abdul, Augustinos D. Saravanos, Evangelos A. Theodorou

    Abstract: This paper presents a novel robust trajectory optimization method for constrained nonlinear dynamical systems subject to unknown bounded disturbances. In particular, we seek optimal control policies that remain robustly feasible with respect to all possible realizations of the disturbances within prescribed uncertainty sets. To address this problem, we introduce a bi-level optimization algorithm.…

    Submitted 6 April, 2025; originally announced April 2025.

  4. arXiv:2412.20279  [pdf, other]

    stat.ML cs.LG math.OC

    Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou

    Abstract: Generalized Schrödinger Bridges (GSBs) are a fundamental mathematical framework used to analyze the most likely particle evolution based on the principle of least action including kinetic and potential energy. In parallel to their well-established presence in the theoretical realms of quantum mechanics and optimal transport, this paper focuses on an algorithmic perspective, aiming to enhance pract…

    Submitted 28 December, 2024; originally announced December 2024.

  5. arXiv:2412.12156  [pdf, other]

    math.OC cs.LG cs.MA

    Deep Distributed Optimization for Large-Scale Quadratic Programming

    Authors: Augustinos D. Saravanos, Hunter Kuperman, Alex Oshin, Arshiya Taj Abdul, Vincent Pacelli, Evangelos A. Theodorou

    Abstract: Quadratic programming (QP) forms a crucial foundation in optimization, encompassing a broad spectrum of domains and serving as the basis for more advanced algorithms. Consequently, as the scale and complexity of modern applications continue to grow, the development of efficient and reliable QP algorithms is becoming increasingly vital. In this context, this paper introduces a novel deep learning-a…

    Submitted 25 January, 2025; v1 submitted 11 December, 2024; originally announced December 2024.

  6. arXiv:2412.00581  [pdf, other]

    cs.RO

    Dynamics Modeling using Visual Terrain Features for High-Speed Autonomous Off-Road Driving

    Authors: Jason Gibson, Anoushka Alavilli, Erica Tevere, Evangelos A. Theodorou, Patrick Spieler

    Abstract: Rapid autonomous traversal of unstructured terrain is essential for scenarios such as disaster response, search and rescue, or planetary exploration. As a vehicle navigates at the limit of its capabilities over extreme terrain, its dynamics can change suddenly and dramatically. For example, high-speed and varying terrain can affect parameters such as traction, tire slip, and rolling resistance. To…

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: Jason Gibson and Anoushka Alavilli contributed equally

  7. arXiv:2411.11211  [pdf, other]

    cs.RO math.OC

    Operator Splitting Covariance Steering for Safe Stochastic Nonlinear Control

    Authors: Akash Ratheesh, Vincent Pacelli, Augustinos D. Saravanos, Evangelos A. Theodorou

    Abstract: Most robotics applications are typically accompanied by safety restrictions that need to be satisfied with a high degree of confidence even in environments under uncertainty. Controlling the state distribution of a system and enforcing such specifications as distribution constraints is a promising approach for meeting such requirements. In this direction, covariance steering (CS) is an increasin…

    Submitted 17 November, 2024; originally announced November 2024.

  8. arXiv:2410.14055  [pdf, other]

    stat.ML cs.LG

    Feedback Schrödinger Bridge Matching

    Authors: Panagiotis Theodoropoulos, Nikolaos Komianos, Vincent Pacelli, Guan-Horng Liu, Evangelos A. Theodorou

    Abstract: Recent advancements in diffusion bridges for distribution transport problems have heavily relied on matching frameworks, yet existing methods often face a trade-off between scalability and access to optimal pairings during training. Fully unsupervised methods make minimal assumptions but incur high computational costs, limiting their practicality. On the other hand, imposing full supervision of th…

    Submitted 20 February, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

  9. arXiv:2409.07563  [pdf, other]

    cs.MS cs.DC cs.RO eess.SY

    MPPI-Generic: A CUDA Library for Stochastic Trajectory Optimization

    Authors: Bogdan Vlahov, Jason Gibson, Manan Gandhi, Evangelos A. Theodorou

    Abstract: This paper introduces a new C++/CUDA library for GPU-accelerated stochastic optimization called MPPI-Generic. It provides implementations of Model Predictive Path Integral control, Tube-Model Predictive Path Integral Control, and Robust Model Predictive Path Integral Control, and allows for these algorithms to be used across many pre-existing dynamics models and cost functions. Furthermore, resear…

    Submitted 10 March, 2025; v1 submitted 11 September, 2024; originally announced September 2024.

  10. arXiv:2405.16381  [pdf, other]

    cs.LG cs.AI stat.ML

    Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups

    Authors: Yuchen Zhu, Tianrong Chen, Lingkai Kong, Evangelos A. Theodorou, Molei Tao

    Abstract: The generative modeling of data on manifolds is an important task, for which diffusion models in flat spaces typically need nontrivial adaptations. This article demonstrates how a technique called "trivialization" can transfer the effectiveness of diffusion models in Euclidean spaces to Lie groups. In particular, an auxiliary momentum variable was algorithmically introduced to help transport the p…

    Submitted 11 February, 2025; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: Accepted to ICLR 2025

  11. arXiv:2404.13430  [pdf, other]

    physics.chem-ph cs.LG

    React-OT: Optimal Transport for Generating Transition State in Chemical Reactions

    Authors: Chenru Duan, Guan-Horng Liu, Yuanqi Du, Tianrong Chen, Qiyuan Zhao, Haojun Jia, Carla P. Gomes, Evangelos A. Theodorou, Heather J. Kulik

    Abstract: Transition states (TSs) are transient structures that are key to understanding reaction mechanisms and designing catalysts but challenging to capture in experiments. Alternatively, many optimization algorithms have been developed to search for TSs computationally. Yet the cost of these algorithms driven by quantum chemistry methods (usually density functional theory) is still high, posing chal…

    Submitted 15 October, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  12. arXiv:2404.06336  [pdf, other]

    quant-ph cs.LG stat.ML

    Quantum State Generation with Structure-Preserving Diffusion Model

    Authors: Yuchen Zhu, Tianrong Chen, Evangelos A. Theodorou, Xie Chen, Molei Tao

    Abstract: This article considers the generative modeling of the (mixed) states of quantum systems, and an approach based on a denoising diffusion model is proposed. The key contribution is an algorithmic innovation that respects the physical nature of quantum states. More precisely, the commonly used density matrix representation of a mixed state has to be complex-valued Hermitian, positive semi-definite, and t…

    Submitted 25 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  13. Low Frequency Sampling in Model Predictive Path Integral Control

    Authors: Bogdan Vlahov, Jason Gibson, David D. Fan, Patrick Spieler, Ali-akbar Agha-mohammadi, Evangelos A. Theodorou

    Abstract: Sampling-based model-predictive controllers have become a powerful optimization tool for planning and control problems in various challenging environments. In this paper, we show how the default choice of uncorrelated Gaussian distributions can be improved upon with the use of a colored noise distribution. Our choice of distribution allows for the emphasis on low frequency control signals, which c…

    Submitted 18 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Published to RA-L

    Report number: 4543

    Journal ref: IEEE Robotics and Automation Letters, vol. 9, no. 5, pp. 4543-4550, 2024

  14. arXiv:2403.18130  [pdf, other]

    math.OC cs.IT

    Generalized Maximum Entropy Differential Dynamic Programming

    Authors: Yuichiro Aoyama, Evangelos A. Theodorou

    Abstract: We present a sampling-based trajectory optimization method derived from the maximum entropy formulation of Differential Dynamic Programming with Tsallis entropy. This method is a generalization of the legacy work with Shannon entropy, which leads to a Gaussian optimal control policy for exploration during optimization. With the Tsallis entropy, the policy takes the form of a $q$-Gaussian, which furt…

    Submitted 16 September, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures, This paper is for CDC 2024

    MSC Class: 34H05

  15. arXiv:2402.16227  [pdf, ps, other]

    cs.RO eess.SY math.OC

    Scaling Robust Optimization for Multi-Agent Robotic Systems: A Distributed Perspective

    Authors: Arshiya Taj Abdul, Augustinos D. Saravanos, Evangelos A. Theodorou

    Abstract: This paper presents a novel distributed robust optimization scheme for steering distributions of multi-agent systems under stochastic and deterministic uncertainty. Robust optimization is a subfield of optimization which aims to discover an optimal solution that remains robustly feasible for all possible realizations of the problem parameters within a given uncertainty set. Such approaches would n…

    Submitted 29 January, 2025; v1 submitted 25 February, 2024; originally announced February 2024.

  16. arXiv:2311.06978  [pdf, other]

    cs.LG cs.CV stat.ML

    Augmented Bridge Matching

    Authors: Valentin De Bortoli, Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Weilie Nie

    Abstract: Flow and bridge matching are a novel class of processes which encompass diffusion models. One of the main aspects of their increased flexibility is that these models can interpolate between arbitrary data distributions, i.e., they generalize beyond generative modeling and can be applied to learning stochastic (and deterministic) processes of arbitrary transfer tasks between two given distributions. I…

    Submitted 12 November, 2023; originally announced November 2023.

  17. arXiv:2310.07805  [pdf, other]

    cs.LG cs.AI

    Generative Modeling with Phase Stochastic Bridges

    Authors: Tianrong Chen, Jiatao Gu, Laurent Dinh, Evangelos A. Theodorou, Joshua Susskind, Shuangfei Zhai

    Abstract: Diffusion models (DMs) represent state-of-the-art generative models for continuous inputs. DMs work by constructing a Stochastic Differential Equation (SDE) in the input space (i.e., position space), and using a neural network to reverse it. In this work, we introduce a novel generative modeling framework grounded in phase space dynamics, where a phase space is defined as an augmented spac…

    Submitted 12 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  18. arXiv:2310.02233  [pdf, other]

    stat.ML cs.LG math.OC

    Generalized Schrödinger Bridge Matching

    Authors: Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen

    Abstract: Modern distribution matching algorithms for training diffusion or flow models directly prescribe the time evolution of the marginal distributions between two boundary distributions. In this work, we consider a generalized distribution matching setup, where these marginals are only implicitly described as a solution to some task-specific objective function. The problem setup, known as the Generaliz…

    Submitted 18 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Camera Ready

  19. arXiv:2310.01236  [pdf, other]

    stat.ML cs.CV cs.LG

    Mirror Diffusion Models for Constrained and Watermarked Generation

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Molei Tao

    Abstract: Modern successes of diffusion models in learning complex, high-dimensional data distributions are attributed, in part, to their capability to construct diffusion processes with analytic transition kernels and score functions. The tractability results in a simulation-free framework with stable regression losses, from which reversed, generative processes can be learned at scale. However, when data i…

    Submitted 29 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: submitted to NeurIPS on 5/18 but did not arxiv per NeurIPS policy, accepted on 9/22

  20. arXiv:2308.08426  [pdf, other]

    math.OC cs.RO

    Differentiable Robust Model Predictive Control

    Authors: Alex Oshin, Hassan Almubarak, Evangelos A. Theodorou

    Abstract: Deterministic model predictive control (MPC), while powerful, is often insufficient for effectively controlling autonomous systems in the real world. Factors such as environmental noise and model error can cause deviations from the expected nominal performance. Robust MPC algorithms aim to bridge this gap between deterministic and uncertain control. However, these methods are often excessively dif…

    Submitted 26 July, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted to Robotics: Science and Systems 2024

  21. arXiv:2305.18718  [pdf, other]

    cs.RO cs.MA eess.SY

    Distributed Hierarchical Distribution Control for Very-Large-Scale Clustered Multi-Agent Systems

    Authors: Augustinos D. Saravanos, Yihui Li, Evangelos A. Theodorou

    Abstract: As the scale and complexity of multi-agent robotic systems are subject to a continuous increase, this paper considers a class of systems labeled as Very-Large-Scale Multi-Agent Systems (VLMAS) with dimensionality that can scale up to the order of millions of agents. In particular, we consider the problem of steering the state distributions of all agents of a VLMAS to prescribed target distribution…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted at Robotics: Science and Systems 2023

  22. arXiv:2305.02241  [pdf, other]

    cs.RO eess.SY

    A Multi-step Dynamics Modeling Framework For Autonomous Driving In Multiple Environments

    Authors: Jason Gibson, Bogdan Vlahov, David Fan, Patrick Spieler, Daniel Pastor, Ali-akbar Agha-mohammadi, Evangelos A. Theodorou

    Abstract: Modeling dynamics is often the first step to making a vehicle autonomous. While on-road autonomous vehicles have been extensively studied, off-road vehicles pose many challenging modeling problems. An off-road vehicle encounters highly complex and difficult-to-model terrain/vehicle interactions, as well as having complex vehicle dynamics of its own. These complexities can create challenges for eff…

    Submitted 3 May, 2023; originally announced May 2023.

  23. arXiv:2303.03360  [pdf, other]

    cs.RO eess.SY

    Improved Exploration for Safety-Embedded Differential Dynamic Programming Using Tolerant Barrier States

    Authors: Joshua E. Kuperman, Hassan Almubarak, Augustinos D. Saravanos, Evangelos A. Theodorou

    Abstract: In this paper, we introduce Tolerant Discrete Barrier States (T-DBaS), a novel safety-embedding technique for trajectory optimization with enhanced exploratory capabilities. The proposed approach generalizes the standard discrete barrier state (DBaS) method by accommodating temporary constraint violation during the optimization process while still approximating its safety guarantees. Consequently,…

    Submitted 11 March, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

  24. arXiv:2303.01751  [pdf, other]

    stat.ML cs.LG

    Deep Momentum Multi-Marginal Schrödinger Bridge

    Authors: Tianrong Chen, Guan-Horng Liu, Molei Tao, Evangelos A. Theodorou

    Abstract: It is a crucial challenge to reconstruct population dynamics using unlabeled samples from distributions at coarse time intervals. Recent approaches such as flow-based models or Schrödinger Bridge (SB) models have demonstrated appealing performance, yet the inferred sample trajectories either fail to account for the underlying stochasticity or are $\underline{D}$eep $\underline{M}$omentum Multi-Mar…

    Submitted 5 October, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  25. arXiv:2302.05872  [pdf, other]

    cs.CV cs.LG stat.ML

    I$^2$SB: Image-to-Image Schrödinger Bridge

    Authors: Guan-Horng Liu, Arash Vahdat, De-An Huang, Evangelos A. Theodorou, Weili Nie, Anima Anandkumar

    Abstract: We propose Image-to-Image Schrödinger Bridge (I$^2$SB), a new class of conditional diffusion models that directly learn the nonlinear diffusion processes between two given distributions. These diffusion bridges are particularly useful for image restoration, as the degraded images are structurally informative priors for reconstructing the clean images. I$^2$SB belongs to a tractable class of Schröd…

    Submitted 25 May, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: ICML camera ready (high-resolution figures)

  26. Distributed Model Predictive Covariance Steering

    Authors: Augustinos D. Saravanos, Isin M. Balci, Efstathios Bakolas, Evangelos A. Theodorou

    Abstract: This paper proposes Distributed Model Predictive Covariance Steering (DiMPCS) for multi-agent control under stochastic uncertainty. The scope of our approach is to blend covariance steering theory, distributed optimization and model predictive control (MPC) into a single framework that is safe, scalable and decentralized. Initially, we pose a problem formulation that uses the Wasserstein distance…

    Submitted 25 January, 2025; v1 submitted 1 December, 2022; originally announced December 2022.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024

  27. arXiv:2212.00268  [pdf, other]

    eess.SY cs.RO

    Gaussian Process Barrier States for Safe Trajectory Optimization and Control

    Authors: Hassan Almubarak, Manan Gandhi, Yuichiro Aoyama, Nader Sadegh, Evangelos A. Theodorou

    Abstract: This paper proposes embedded Gaussian Process Barrier States (GP-BaS), a methodology to safely control unmodeled dynamics of nonlinear systems using Bayesian learning. Gaussian Processes (GPs) are used to model the dynamics of the safety-critical system, which is subsequently used in the GP-BaS model. We derive the barrier state dynamics utilizing the GP posterior, which is used to construct a safe…

    Submitted 30 November, 2022; originally announced December 2022.

  28. arXiv:2210.10814  [pdf, other]

    cs.GT cs.RO math.OC

    MPOGames: Efficient Multimodal Partially Observable Dynamic Games

    Authors: Oswin So, Paul Drews, Thomas Balch, Velin Dimitrov, Guy Rosman, Evangelos A. Theodorou

    Abstract: Game theoretic methods have become popular for planning and prediction in situations involving rich multi-agent interactions. However, these methods often assume the existence of a single local Nash equilibrium and are hence unable to handle uncertainty in the intentions of different agents. While maximum entropy (MaxEnt) dynamic games try to address this issue, practical approaches solve for MaxEn…

    Submitted 23 May, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to ICRA 2023

  29. arXiv:2210.00090  [pdf, other]

    cs.LG

    Data-driven discovery of non-Newtonian astronomy via learning non-Euclidean Hamiltonian

    Authors: Oswin So, Gongjie Li, Evangelos A. Theodorou, Molei Tao

    Abstract: Incorporating the Hamiltonian structure of physical dynamics into deep learning models provides a powerful way to improve the interpretability and prediction accuracy. While previous works are mostly limited to the Euclidean spaces, their extension to the Lie group manifold is needed when rotations form a key component of the dynamics, such as the higher-order physics beyond simple point-mass dyna…

    Submitted 30 September, 2022; originally announced October 2022.

  30. arXiv:2209.09893  [pdf, other]

    stat.ML cs.GT cs.LG math.OC

    Deep Generalized Schrödinger Bridge

    Authors: Guan-Horng Liu, Tianrong Chen, Oswin So, Evangelos A. Theodorou

    Abstract: Mean-Field Game (MFG) serves as a crucial mathematical framework in modeling the collective behavior of individual agents interacting stochastically with a large population. In this work, we aim at solving a challenging class of MFGs in which the differentiability of these interacting preferences may not be available to the solver, and the population is urged to converge exactly to some desired di…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  31. arXiv:2204.03727  [pdf, other]

    math.OC cs.RO

    Parameterized Differential Dynamic Programming

    Authors: Alex Oshin, Matthew D. Houghton, Michael J. Acheson, Irene M. Gregory, Evangelos A. Theodorou

    Abstract: Differential Dynamic Programming (DDP) is an efficient trajectory optimization algorithm relying on second-order approximations of a system's dynamics and cost function, and has recently been applied to optimize systems with time-invariant parameters. Prior works include system parameter estimation and identifying the optimal switching time between modes of hybrid dynamical systems. This paper gen…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: Submitted to RSS 2022

  32. arXiv:2204.02506  [pdf, other]

    cs.MA cs.LG

    Deep Graphic FBSDEs for Opinion Dynamics Stochastic Control

    Authors: Tianrong Chen, Ziyi Wang, Evangelos A. Theodorou

    Abstract: In this paper, we present a scalable deep learning approach to solve opinion dynamics stochastic optimal control problems with mean field term coupling in the dynamics and cost function. Our approach relies on the probabilistic representation of the solution of the Hamilton-Jacobi-Bellman partial differential equation. Grounded on the nonlinear version of the Feynman-Kac lemma, the solutions of th…

    Submitted 17 April, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  33. arXiv:2202.10658  [pdf, other]

    cs.MA cs.LG cs.RO eess.SY

    Decentralized Safe Multi-agent Stochastic Optimal Control using Deep FBSDEs and ADMM

    Authors: Marcus A. Pereira, Augustinos D. Saravanos, Oswin So, Evangelos A. Theodorou

    Abstract: In this work, we propose a novel safe and scalable decentralized solution for multi-agent control in the presence of stochastic disturbances. Safety is mathematically encoded using stochastic control barrier functions and safe controls are computed by solving quadratic programs. Decentralization is achieved by augmenting to each agent's optimization variables, copy variables, for its neighbors. Th…

    Submitted 7 June, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Journal ref: Robotics: Science and Systems (RSS), 2022

  34. arXiv:2201.12925  [pdf, other]

    math.OC cs.RO

    Multimodal Maximum Entropy Dynamic Games

    Authors: Oswin So, Kyle Stachowicz, Evangelos A. Theodorou

    Abstract: Environments with multi-agent interactions often result in a rich set of modalities of behavior between agents due to the inherent suboptimality of decision making processes when agents settle for satisfactory decisions. However, existing algorithms for solving these dynamic games are strictly unimodal and fail to capture the intricate multimodal behaviors of the agents. In this paper, we propose MME…

    Submitted 2 February, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: Under review for RSS 2022. Supplementary Video: https://youtu.be/7molN_Q38dk

  35. arXiv:2201.06539  [pdf, other]

    cs.RO cs.AI

    Spatiotemporal Costmap Inference for MPC via Deep Inverse Reinforcement Learning

    Authors: Keuntaek Lee, David Isele, Evangelos A. Theodorou, Sangjae Bae

    Abstract: It can be difficult to autonomously produce driver behavior so that it appears natural to other traffic participants. Through Inverse Reinforcement Learning (IRL), we can automate this process by learning the underlying reward function from human demonstrations. We propose a new IRL algorithm that learns a goal-conditioned spatiotemporal reward function. The resulting costmap is used by Model Pred…

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: IEEE Robotics and Automation Letters (RA-L)

  36. arXiv:2111.09207  [pdf, other]

    cs.RO eess.SY

    Optimal-Horizon Model-Predictive Control with Differential Dynamic Programming

    Authors: Kyle Stachowicz, Evangelos A. Theodorou

    Abstract: We present an algorithm, based on the Differential Dynamic Programming framework, to handle trajectory optimization problems in which the horizon is determined online rather than fixed a priori. This algorithm exhibits exact one-step convergence for linear, quadratic, time-invariant problems and is fast enough for real-time nonlinear model-predictive control. We show derivations for the nonlinear…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Submitted to ICRA 2022

  37. arXiv:2110.11291  [pdf, other]

    stat.ML cs.LG math.AP math.OC

    Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs Theory

    Authors: Tianrong Chen, Guan-Horng Liu, Evangelos A. Theodorou

    Abstract: Schrödinger Bridge (SB) is an entropy-regularized optimal transport problem that has received increasing attention in deep generative modeling for its mathematical flexibility compared to the Score-based Generative Model (SGM). However, it remains unclear whether the optimization principle of SB relates to the modern training of deep generative models, which often rely on constructing log-likelih…

    Submitted 3 April, 2023; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: fix appendix net arh error

  38. arXiv:2110.06451  [pdf, other]

    math.OC cs.RO

    Maximum Entropy Differential Dynamic Programming

    Authors: Oswin So, Ziyi Wang, Evangelos A. Theodorou

    Abstract: In this paper, we present a novel maximum entropy formulation of the Differential Dynamic Programming algorithm and derive two variants using unimodal and multimodal value function parameterizations. By combining the maximum entropy Bellman equations with a particular approximation of the cost function, we are able to obtain a new formulation of Differential Dynamic Programming which is able to e…

    Submitted 28 February, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted to ICRA 2022. Supplementary video available at https://youtu.be/NHr9Kj_jnAI

  39. arXiv:2109.14158  [pdf, other]

    cs.LG eess.SY math.OC

    Second-Order Neural ODE Optimizer

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou

    Abstract: We propose a novel second-order optimization framework for training the emerging deep continuous-time models, specifically the Neural Ordinary Differential Equations (Neural ODEs). Since their training already involves expensive gradient computation by solving a backward ODE, deriving efficient second-order methods becomes highly nontrivial. Nevertheless, inspired by the recent Optimal Control (OC…

    Submitted 5 November, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: Accepted to Advances in Neural Information Processing Systems (NeurIPS) 2021 as Spotlight

  40. arXiv:2109.00183  [pdf, other]

    eess.SY cs.AI cs.LG

    Deep $\mathcal{L}^1$ Stochastic Optimal Control Policies for Planetary Soft-landing

    Authors: Marcus A. Pereira, Camilo A. Duarte, Ioannis Exarchos, Evangelos A. Theodorou

    Abstract: In this paper, we introduce a novel deep learning based solution to the Powered-Descent Guidance (PDG) problem, grounded in principles of nonlinear Stochastic Optimal Control (SOC) and Feynman-Kac theory. Our algorithm solves the PDG problem by framing it as an $\mathcal{L}^1$ SOC problem for minimum fuel consumption. Additionally, it can handle practically useful control constraints, nonlinear dy…

    Submitted 1 September, 2021; originally announced September 2021.

  41. arXiv:2107.11722  [pdf, other]

    cs.RO cs.AI cs.LG

    Learning Risk-aware Costmaps for Traversability in Challenging Environments

    Authors: David D. Fan, Sharmita Dey, Ali-akbar Agha-mohammadi, Evangelos A. Theodorou

    Abstract: One of the main challenges in autonomous robotic exploration and navigation in unknown and unstructured environments is determining where the robot can or cannot safely move. A significant source of difficulty in this determination arises from stochasticity and uncertainty, coming from localization error, sensor sparsity and noise, difficult-to-model robot-ground interactions, and disturbances to…

    Submitted 4 September, 2022; v1 submitted 25 July, 2021; originally announced July 2021.

    Comments: Published in RA-L with ICRA presentation option (IEEE International Conference on Robotics and Automation, 2022)

    Journal ref: IEEE Robotics and Automation Letters (Volume: 7, Issue: 1, January 2022)

  42. Safety Embedded Differential Dynamic Programming Using Discrete Barrier States

    Authors: Hassan Almubarak, Kyle Stachowicz, Nader Sadegh, Evangelos A. Theodorou

    Abstract: Certified safe control is a growing challenge in robotics, especially when performance and safety objectives must be concurrently achieved. In this work, we extend the barrier state (BaS) concept, recently proposed for safe stabilization of continuous time systems, to safety embedded trajectory optimization for discrete time systems using discrete barrier states (DBaS). The constructed DBaS is emb…

    Submitted 2 February, 2022; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: Added extensive quantitative comparisons and analysis in the implementation examples, and revised discussions and illustrations

    Journal ref: IEEE ROBOTICS AND AUTOMATION LETTERS, VOL. 7, NO. 2, APRIL 2022

  43. arXiv:2105.03788  [pdf, other]

    cs.LG cs.GT math.OC

    Dynamic Game Theoretic Neural Optimizer

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou

    Abstract: The connection between training deep neural networks (DNNs) and optimal control theory (OCT) has attracted considerable attention as a principled tool of algorithmic design. Despite few attempts being made, they have been limited to architectures where the layer propagation resembles a Markovian dynamical system. This casts doubts on their flexibility to modern networks that heavily rely on non-Ma…

    Submitted 11 June, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted in International Conference on Machine Learning (ICML) 2021 as Oral

  44. arXiv:2104.00241  [pdf, other]

    cs.LG

    Variational Inference MPC using Tsallis Divergence

    Authors: Ziyi Wang, Oswin So, Jason Gibson, Bogdan Vlahov, Manan S. Gandhi, Guan-Horng Liu, Evangelos A. Theodorou

    Abstract: In this paper, we provide a generalized framework for Variational Inference-Stochastic Optimal Control by using the non-extensive Tsallis divergence. By incorporating the deformed exponential function into the optimality likelihood function, a novel Tsallis Variational Inference-Model Predictive Control algorithm is derived, which includes prior works such as Variational Inference-Model Predictive…

    Submitted 1 April, 2021; originally announced April 2021.

  45. arXiv:2102.09144  [pdf, other]

    cs.RO math.OC physics.app-ph

    Stochastic Spatio-Temporal Optimization for Control and Co-Design of Systems in Robotics and Applied Physics

    Authors: Ethan N. Evans, Andrew P. Kendall, Evangelos A. Theodorou

    Abstract: Correlated with the trend of increasing degrees of freedom in robotic systems is a similar trend of rising interest in Spatio-Temporal systems described by Partial Differential Equations (PDEs) among the robotics and control communities. These systems often exhibit dramatic under-actuation, high dimensionality, bifurcations, and multimodal instabilities. Their control represents many of the curren…

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: 34 pages, 10 figures. Submitted to Autonomous Robots special issue of RSS 2020. arXiv admin note: text overlap with arXiv:2002.01397

  46. arXiv:2102.09104  [pdf, other]

    cs.LG cs.MA cs.RO eess.SY math.OC

    Distributed Algorithms for Linearly-Solvable Optimal Control in Networked Multi-Agent Systems

    Authors: Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou, Petros G. Voulgaris

    Abstract: Distributed algorithms for both discrete-time and continuous-time linearly solvable optimal control (LSOC) problems of networked multi-agent systems (MASs) are investigated in this paper. A distributed framework is proposed to partition the optimal control problem of a networked MAS into several local optimal control problems in factorial subsystems, such that each (central) agent behaves optimall…

    Submitted 17 February, 2021; originally announced February 2021.

  47. arXiv:2011.10890  [pdf, other]

    cs.AI

    Large-Scale Multi-Agent Deep FBSDEs

    Authors: Tianrong Chen, Ziyi Wang, Ioannis Exarchos, Evangelos A. Theodorou

    Abstract: In this paper we present a scalable deep learning framework for finding Markovian Nash Equilibria in multi-agent stochastic games using fictitious play. The motivation is inspired by theoretical analysis of Forward Backward Stochastic Differential Equations (FBSDE) and their implementation in a deep learning setting, which is the source of our algorithm's sample efficiency improvement. By taking a…

    Submitted 21 May, 2021; v1 submitted 21 November, 2020; originally announced November 2020.

  48. arXiv:2009.14775  [pdf, other]

    eess.SY cs.LG cs.MA cs.RO math.OC

    Cooperative Path Integral Control for Stochastic Multi-Agent Systems

    Authors: Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou, Petros G. Voulgaris

    Abstract: A distributed stochastic optimal control solution is presented for cooperative multi-agent systems. The network of agents is partitioned into multiple factorial subsystems, each of which consists of a central agent and neighboring agents. Local control actions that rely only on agents' local observations are designed to optimize the joint cost functions of subsystems. When solving for the local co…

    Submitted 20 March, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: To appear in American Control Conference 2021, New Orleans, LA, USA

  49. arXiv:2009.13609  [pdf, other]

    eess.SY cs.LG cs.MA math.OC

    Compositionality of Linearly Solvable Optimal Control in Networked Multi-Agent Systems

    Authors: Lin Song, Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou

    Abstract: In this paper, we discuss the methodology of generalizing the optimal control law from learned component tasks to unlearned composite tasks on Multi-Agent Systems (MASs), by using the linearity composition principle of linearly solvable optimal control (LSOC) problems. The proposed approach achieves both the compositionality and optimality of control actions simultaneously within the cooperative M…

    Submitted 22 March, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: Accepted to the 2021 American Control Conference (ACC)

  50. arXiv:2009.01196  [pdf, other]

    eess.SY cs.AI cs.RO

    Safe Optimal Control Using Stochastic Barrier Functions and Deep Forward-Backward SDEs

    Authors: Marcus Aloysius Pereira, Ziyi Wang, Ioannis Exarchos, Evangelos A. Theodorou

    Abstract: This paper introduces a new formulation for stochastic optimal control and stochastic dynamic optimization that ensures safety with respect to state and control constraints. The proposed methodology brings together concepts such as Forward-Backward Stochastic Differential Equations, Stochastic Barrier Functions, Differentiable Convex Optimization and Deep Learning. Using the aforementioned concept…

    Submitted 2 September, 2020; originally announced September 2020.

    Journal ref: Conference on Robot Learning 2020