
Showing 1–20 of 20 results for author: Hirche, S

  1. arXiv:2412.01591  [pdf, other]

    math.OC cs.LG cs.RO eess.SY stat.ML

    Kernel-Based Optimal Control: An Infinitesimal Generator Approach

    Authors: Petar Bevanda, Nicolas Hoischen, Tobias Wittmann, Jan Brüdigam, Sandra Hirche, Boris Houska

    Abstract: This paper presents a novel operator-theoretic approach for optimal control of nonlinear stochastic systems within reproducing kernel Hilbert spaces. Our learning framework leverages data samples of system dynamics and stage cost functions, with only control penalties and constraints provided. The proposed method directly learns the infinitesimal generator of a controlled stochastic diffusion in a…

    Submitted 25 April, 2025; v1 submitted 2 December, 2024; originally announced December 2024.

    Comments: Accepted for presentation at 7th Annual Learning for Dynamics & Control Conference (L4DC 2025)

  2. arXiv:2409.16866  [pdf, other]

    cs.LG math.OC

    Risk-averse learning with delayed feedback

    Authors: Siyi Wang, Zifan Wang, Karl Henrik Johansson, Sandra Hirche

    Abstract: In real-world scenarios, the impacts of decisions may not manifest immediately. Taking these delays into account facilitates accurate assessment and management of risk in real-world environments, thereby ensuring the efficacy of strategies. In this paper, we investigate risk-averse learning using Conditional Value at Risk (CVaR) as risk measure, while incorporating delayed feedback with unknown bu…

    Submitted 25 September, 2024; originally announced September 2024.

  3. arXiv:2407.16407  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings

    Authors: Petar Bevanda, Nicolas Hoischen, Stefan Sosnowski, Sandra Hirche, Boris Houska

    Abstract: This paper proposes a fully data-driven approach for optimal control of nonlinear control-affine systems represented by a stochastic diffusion. The focus is on the scenario where both the nonlinear dynamics and stage cost functions are unknown, while only control penalty function and constraints are provided. Leveraging the theory of reproducing kernel Hilbert spaces, we introduce novel kernel mea…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: author-submitted electronic preprint version: 16 pages, 3 figures, 4 tables

  4. arXiv:2403.11932  [pdf, ps, other]

    cs.IT math.OC

    Consistency of Value of Information: Effects of Packet Loss and Time Delay in Networked Control Systems Tasks

    Authors: Touraj Soleymani, John S. Baras, Siyi Wang, Sandra Hirche, Karl H. Johansson

    Abstract: In this chapter, we study the consistency of the value of information, a semantic metric that claims to determine the right piece of information in networked control systems tasks, in a lossy and delayed communication regime. Our analysis begins with a focus on state estimation, and subsequently extends to feedback control. To that end, we make a causal tradeoff betwe…

    Submitted 18 March, 2024; originally announced March 2024.

  5. arXiv:2403.11927  [pdf, ps, other]

    cs.IT math.OC

    Foundations of Value of Information: A Semantic Metric for Networked Control Systems Tasks

    Authors: Touraj Soleymani, John S. Baras, Sandra Hirche, Karl H. Johansson

    Abstract: In this chapter, we present our recent invention, i.e., the notion of the value of information, a semantic metric that is fundamental for networked control systems tasks. We begin our analysis by formulating a causal tradeoff between the packet rate and the regulation cost, with an encoder and a decoder as two distributed decision makers, and show that the valuation of information i…

    Submitted 18 March, 2024; originally announced March 2024.

  6. arXiv:2402.09575  [pdf, other]

    math.OC eess.SY

    Analyzing the Impact of Computation in Adaptive Dynamic Programming for Stochastic LQR Problem

    Authors: Wenhan Cao, Alexandre Capone, Sandra Hirche, Wei Pan

    Abstract: Adaptive dynamic programming (ADP) for stochastic linear quadratic regulation (LQR) demands the precise computation of stochastic integrals during policy iteration (PI). In a fully model-free problem setting, this computation can only be approximated by state samples collected at discrete time points using computational methods such as the canonical Euler-Maruyama method. Our research reveals a cr…

    Submitted 14 February, 2024; originally announced February 2024.

  7. arXiv:2311.11337  [pdf, other]

    eess.SY math.OC

    H2 suboptimal containment control of homogeneous and heterogeneous multi-agent systems

    Authors: Yuan Gao, Junjie Jiao, Zhongkui Li, Sandra Hirche

    Abstract: This paper deals with the H2 suboptimal state containment control problem for homogeneous linear multi-agent systems and the H2 suboptimal output containment control problem for heterogeneous linear multi-agent systems. For both problems, given multiple autonomous leaders and a number of followers, we introduce suitable performance outputs and an associated H2 cost functional, respectively. The ai…

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: 15 pages, 7 figures

  8. arXiv:2305.16215  [pdf, other]

    cs.LG eess.SY math.DS stat.ML

    Koopman Kernel Regression

    Authors: Petar Bevanda, Max Beier, Armin Lederer, Stefan Sosnowski, Eyke Hüllermeier, Sandra Hirche

    Abstract: Many machine learning approaches for decision making, such as reinforcement learning, rely on simulators or predictive models to forecast the time-evolution of quantities of interest, e.g., the state of an agent or the reward of a policy. Forecasts of such complex phenomena are commonly described by highly nonlinear dynamical systems, making their use in optimization-based decision-making challeng…

    Submitted 16 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to the thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  9. arXiv:2303.17963  [pdf, other]

    eess.SY cs.LG math.OC stat.ML

    Learning-Based Optimal Control with Performance Guarantees for Unknown Systems with Latent States

    Authors: Robert Lefringhausen, Supitsana Srithasan, Armin Lederer, Sandra Hirche

    Abstract: As control engineering methods are applied to increasingly complex systems, data-driven approaches for system identification appear as a promising alternative to physics-based modeling. While the Bayesian approaches prevalent for safety-critical applications usually rely on the availability of state measurements, the states of a complex system are often not directly measurable. It may then be nece…

    Submitted 6 August, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Accepted version submitted to the 2024 European Control Conference (ECC)

    Journal ref: 2024 European Control Conference (ECC), pp. 90-97

  10. arXiv:2203.02321  [pdf, ps, other]

    math.OC eess.SY

    Actuator Scheduling for Linear Systems: A Convex Relaxation Approach

    Authors: Junjie Jiao, Dipankar Maity, John S. Baras, Sandra Hirche

    Abstract: In this letter, we investigate the problem of actuator scheduling for networked control systems. Given a stochastic linear system with a number of actuators, we consider the case that one actuator is activated at each time. This problem is combinatorial in nature and NP-hard to solve. We propose a convex relaxation to the actuator scheduling problem, and use its solution as a reference to design a…

    Submitted 20 May, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: 8 pages, 4 figures

  11. arXiv:2201.11640  [pdf, ps, other]

    eess.SY cs.LG math.DS math.OC

    Towards Data-driven LQR with Koopmanizing Flows

    Authors: Petar Bevanda, Max Beier, Shahab Heshmati-Alamdari, Stefan Sosnowski, Sandra Hirche

    Abstract: We propose a novel framework for learning linear time-invariant (LTI) models for a class of continuous-time non-autonomous nonlinear dynamics based on a representation of Koopman operators. In general, the operator is infinite-dimensional but, crucially, linear. To utilize it for efficient LTI control design, we learn a finite representation of the Koopman operator that is linear in controls while…

    Submitted 23 May, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Final version, accepted for presentation at the 6th IFAC Conference on Intelligent Control and Automation Sciences (ICONS), 2022. arXiv admin note: text overlap with arXiv:2112.04085

  12. arXiv:2110.07786  [pdf, other]

    cs.LG eess.SY math.DS

    Learning the Koopman Eigendecomposition: A Diffeomorphic Approach

    Authors: Petar Bevanda, Johannes Kirmayr, Stefan Sosnowski, Sandra Hirche

    Abstract: We present a novel data-driven approach for learning linear representations of a class of stable nonlinear systems using Koopman eigenfunctions. By learning the conjugacy map between a nonlinear system and its Jacobian linearization through a Normalizing Flow one can guarantee the learned function is a diffeomorphism. Using this diffeomorphism, we construct eigenfunctions of the nonlinear system v…

    Submitted 30 May, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted for presentation at the 2022 American Control Conference (ACC)

  13. arXiv:2107.07822  [pdf, other]

    eess.SY math.OC

    Distributed Value of Information in Feedback Control over Multi-hop Networks

    Authors: Precious Ugo Abara, Sandra Hirche

    Abstract: Recent works in the domain of networked control systems have demonstrated that the joint design of medium access control strategies and control strategies for the closed-loop system is beneficial. However, several metrics introduced so far fail in either appropriately representing the network requirements or in capturing how valuable the data is. In this paper we propose a distributed value of inf…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 19 pages, 10 figures

  14. Value of Information in Feedback Control: Global Optimality

    Authors: Touraj Soleymani, John S. Baras, Sandra Hirche, Karl H. Johansson

    Abstract: The rate-regulation tradeoff, defined between two objective functions, one penalizing the packet rate and one the regulation cost, can express the fundamental performance bound of networked control systems. However, the characterization of the set of globally optimal solutions in this tradeoff for multi-dimensional Gauss-Markov processes has been an open problem. In the present article, we charact…

    Submitted 4 May, 2022; v1 submitted 25 March, 2021; originally announced March 2021.

    Comments: arXiv admin note: text overlap with arXiv:1812.07534

  15. arXiv:2103.11851  [pdf, ps, other]

    math.OC eess.SY

    Data-driven output synchronization of heterogeneous leader-follower multi-agent systems

    Authors: Junjie Jiao, Henk J. van Waarde, Harry L. Trentelman, M. Kanat Camlibel, Sandra Hirche

    Abstract: This paper deals with data-driven output synchronization for heterogeneous leader-follower linear multi-agent systems. Given a multi-agent system that consists of one autonomous leader and a number of heterogeneous followers with external disturbances, we provide necessary and sufficient data-based conditions for output synchronization. We also provide a design method for obtaining such output syn…

    Submitted 23 September, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: 6 pages, 2 figures. This paper has been accepted by IEEE CDC 2021

  16. Koopman Operator Dynamical Models: Learning, Analysis and Control

    Authors: Petar Bevanda, Stefan Sosnowski, Sandra Hirche

    Abstract: The Koopman operator allows for handling nonlinear systems through a (globally) linear representation. In general, the operator is infinite-dimensional - necessitating finite approximations - for which there is no overarching framework. Although there are principled ways of learning such finite approximations, they are in many instances overlooked in favor of, often ill-posed and unstructured meth…

    Submitted 22 December, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: This is an authors' version of the work that is published in Annual Reviews in Control journal. Changes were made to this version by the publisher prior to publication

    Journal ref: Annual Reviews in Control - Volume 52, 2021, Pages 197-212

  17. Value of Information in Feedback Control: Quantification

    Authors: Touraj Soleymani, John S. Baras, Sandra Hirche

    Abstract: Although transmission of a data packet containing sensory information in a networked control system improves the quality of regulation, it has indeed a price from the communication perspective. It is, therefore, rational that such a data packet be transmitted only if it is valuable in the sense of a cost-benefit analysis. Yet, the fact is that little is known so far about this valuation of informa…

    Submitted 2 May, 2022; v1 submitted 18 December, 2018; originally announced December 2018.

  18. Optimal LQG Control under Delay-dependent Costly Information

    Authors: Dipankar Maity, Mohammad H. Mamduhi, Sandra Hirche, Karl Henrik Johansson, John S. Baras

    Abstract: In the design of closed-loop networked control systems (NCSs), induced transmission delay between sensors and the control station is an often-present issue which compromises control performance and may even cause instability. A very relevant scenario in which network-induced delay needs to be investigated is costly usage of communication resources. More precisely, advanced communication technologi…

    Submitted 28 June, 2018; originally announced June 2018.

    Journal ref: IEEE Control Systems Letters (Volume: 3, Issue: 1, Jan. 2019)

  19. arXiv:1511.02604  [pdf, ps, other]

    math.DS math.OC

    Consensus Driven by the Geometric Mean

    Authors: Herbert Mangesius, Dong Xue, Sandra Hirche

    Abstract: Consensus networks are usually understood as arithmetic mean driven dynamical averaging systems. In applications, however, network dynamics often describe inherently non-arithmetic and non-linear consensus processes. In this paper, we propose and study three novel consensus protocols driven by geometric mean averaging: a polynomial, an entropic, and a scaling-invariant protocol, where terminology…

    Submitted 5 August, 2016; v1 submitted 9 November, 2015; originally announced November 2015.

  20. arXiv:1203.4980  [pdf, ps, other]

    math.OC

    Event-Triggered Estimation of Linear Systems: An Iterative Algorithm and Optimality Properties

    Authors: Adam Molin, Sandra Hirche

    Abstract: This report investigates the optimal design of event-triggered estimation for first-order linear stochastic systems. The problem is posed as a two-player team problem with a partially nested information pattern. The two players are given by an estimator and an event-trigger. The event-trigger has full state information and decides whether the estimator shall obtain the current state information b…

    Submitted 22 March, 2012; originally announced March 2012.