Skip to main content

Showing 1–50 of 87 results for author: Trimpe, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.03312  [pdf, ps, other

    cs.LG

    MPX: Mixed Precision Training for JAX

    Authors: Alexander Gräfe, Sebastian Trimpe

    Abstract: Mixed-precision training has emerged as an indispensable tool for enhancing the efficiency of neural network training in recent years. Concurrently, JAX has grown in popularity as a versatile machine learning toolbox. However, it currently lacks robust support for mixed-precision training. We propose MPX, a mixed-precision training toolbox for JAX that simplifies and accelerates the training of la… ▽ More

    Submitted 8 July, 2025; v1 submitted 4 July, 2025; originally announced July 2025.

  2. arXiv:2506.17994  [pdf, ps, other

    cs.RO cs.LG

    Newtonian and Lagrangian Neural Networks: A Comparison Towards Efficient Inverse Dynamics Identification

    Authors: Minh Trinh, Andreas René Geist, Josefine Monnet, Stefan Vilceanu, Sebastian Trimpe, Christian Brecher

    Abstract: Accurate inverse dynamics models are essential tools for controlling industrial robots. Recent research combines neural network regression with inverse dynamics formulations of the Newton-Euler and the Euler-Lagrange equations of motion, resulting in so-called Newtonian neural networks and Lagrangian neural networks, respectively. These physics-informed models seek to identify unknowns in the anal… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: Paper accepted for publication in 14th IFAC Symposium on Robotics

    ACM Class: I.2.9; I.2.6; I.6.4

  3. arXiv:2506.10871  [pdf, ps, other

    cs.LG

    Viability of Future Actions: Robust Safety in Reinforcement Learning via Entropy Regularization

    Authors: Pierre-François Massiani, Alexander von Rohr, Lukas Haverbeck, Sebastian Trimpe

    Abstract: Despite the many recent advances in reinforcement learning (RL), the question of learning policies that robustly satisfy state constraints under unknown disturbances remains open. In this paper, we offer a new perspective on achieving robust safety by analyzing the interplay between two well-established techniques in model-free RL: entropy regularization, and constraints penalization. We reveal em… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 24 pages, 11 figures, 2 tables. Accepted for publication at ECML-PKDD 2025

  4. arXiv:2506.03898  [pdf, ps, other

    cs.LG stat.ML

    A kernel conditional two-sample test

    Authors: Pierre-François Massiani, Christian Fiedler, Lukas Haverbeck, Friedrich Solowjow, Sebastian Trimpe

    Abstract: We propose a framework for hypothesis testing on conditional probability distributions, which we then use to construct conditional two-sample statistical tests. These tests identify the inputs -- called covariates in this context -- where two conditional expectations differ with high probability. Our key idea is to transform confidence bounds of a learning method into a conditional two-sample test… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 40 pages, 8 figures, 8 tables. Under review

  5. arXiv:2504.05024  [pdf, other

    cs.LG

    Concept Extraction for Time Series with ECLAD-ts

    Authors: Antonia Holzapfel, Andres Felipe Posada-Moreno, Sebastian Trimpe

    Abstract: Convolutional neural networks (CNNs) for time series classification (TSC) are being increasingly used in applications ranging from quality prediction to medical diagnosis. The black box nature of these models makes understanding their prediction process difficult. This issue is crucial because CNNs are prone to learning shortcuts and biases, compromising their robustness and alignment with human e… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  6. arXiv:2504.04603  [pdf, other

    cs.RO

    Diffusion-Based Approximate MPC: Fast and Consistent Imitation of Multi-Modal Action Distributions

    Authors: Pau Marquez Julbe, Julian Nubert, Henrik Hose, Sebastian Trimpe, Katherine J. Kuchenbecker

    Abstract: Approximating model predictive control (MPC) using imitation learning (IL) allows for fast control without solving expensive optimization problems online. However, methods that use neural networks in a simple L2-regression setup fail to approximate multi-modal (set-valued) solution distributions caused by local optima found by the numerical solver or non-convex constraints, such as obstacles, sign… ▽ More

    Submitted 13 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

  7. arXiv:2502.04582  [pdf, other

    cs.RO eess.SY math.OC

    The Mini Wheelbot: A Testbed for Learning-based Balancing, Flips, and Articulated Driving

    Authors: Henrik Hose, Jan Weisgerber, Sebastian Trimpe

    Abstract: The Mini Wheelbot is a balancing, reaction wheel unicycle robot designed as a testbed for learning-based control. It is an unstable system with highly nonlinear yaw dynamics, non-holonomic driving, and discrete contact switches in a small, powerful, and rugged form factor. The Mini Wheelbot can use its wheels to stand up from any initial orientation - enabling automatic environment resets in repet… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  8. arXiv:2501.16918  [pdf, other

    cs.LG

    On Rollouts in Model-Based Reinforcement Learning

    Authors: Bernd Frauenknecht, Devdutt Subhasish, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Model-based reinforcement learning (MBRL) seeks to enhance data efficiency by learning a model of the environment and generating synthetic rollouts from it. However, accumulated model errors during these rollouts can distort the data distribution, negatively impacting policy learning and hindering long-term planning. Thus, the accumulation of model errors is a key bottleneck in current MBRL method… ▽ More

    Submitted 8 April, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

  9. arXiv:2501.07985  [pdf, ps, other

    cs.RO cs.LG

    CHEQ-ing the Box: Safe Variable Impedance Learning for Robotic Polishing

    Authors: Emma Cramer, Lukas Jäschke, Sebastian Trimpe

    Abstract: Robotic systems are increasingly employed for industrial automation, with contact-rich tasks like polishing requiring dexterity and compliant behaviour. These tasks are difficult to model, making classical control challenging. Deep reinforcement learning (RL) offers a promising solution by enabling the learning of models and control policies directly from data. However, its application to real-wor… ▽ More

    Submitted 2 June, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

  10. arXiv:2412.09477  [pdf, other

    cs.LG stat.ML

    Bayesian Optimization via Continual Variational Last Layer Training

    Authors: Paul Brunzema, Mikkel Jordahn, John Willes, Sebastian Trimpe, Jasper Snoek, James Harrison

    Abstract: Gaussian Processes (GPs) are widely seen as the state-of-the-art surrogate models for Bayesian optimization (BO) due to their ability to model uncertainty and their performance on tasks where correlations are easily captured (such as those defined by Euclidean metrics) and their ability to be efficiently updated online. However, the performance of GPs depends on the choice of kernel, and kernel se… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

  11. arXiv:2412.00395  [pdf, other

    cs.LG cs.RO stat.ML

    On Foundation Models for Dynamical Systems from Purely Synthetic Data

    Authors: Martin Ziegler, Andres Felipe Posada-Moreno, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Foundation models have demonstrated remarkable generalization, data efficiency, and robustness properties across various domains. In this paper, we explore the feasibility of foundation models for applications in the control domain. The success of these models is enabled by large-scale pretaining on Internet-scale datasets. These are available in fields like natural language processing and compute… ▽ More

    Submitted 17 December, 2024; v1 submitted 30 November, 2024; originally announced December 2024.

    Comments: 10 pages

  12. Local Bayesian Optimization for Controller Tuning with Crash Constraints

    Authors: Alexander von Rohr, David Stenger, Dominik Scheurenberg, Sebastian Trimpe

    Abstract: Controller tuning is crucial for closed-loop performance but often involves manual adjustments. Although Bayesian optimization (BO) has been established as a data-efficient method for automated tuning, applying it to large and high-dimensional search spaces remains challenging. We extend a recently proposed local variant of BO to include crash constraints, where the controller can only be successf… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: Published in at-Automatisierungstechnik

    Journal ref: von Rohr, Alexander, Stenger, David, Scheurenberg, Dominik and Trimpe, Sebastian. "Local Bayesian optimization for controller tuning with crash constraints" at - Automatisierungstechnik, vol. 72, no. 4, 2024, pp. 281-292

  13. arXiv:2411.14246  [pdf, other

    cs.RO cs.LG eess.SY

    Simulation-Aided Policy Tuning for Black-Box Robot Learning

    Authors: Shiming He, Alexander von Rohr, Dominik Baumann, Ji Xiang, Sebastian Trimpe

    Abstract: How can robots learn and adapt to new tasks and situations with little data? Systematic exploration and simulation are crucial tools for efficient robot learning. We present a novel black-box policy search algorithm focused on data-efficient policy improvements. The algorithm learns directly on the robot and treats simulation as an additional information source to speed up the learning process. At… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  14. arXiv:2409.16875  [pdf, other

    eess.SY cs.LG

    Feedforward Controllers from Learned Dynamic Local Model Networks with Application to Excavator Assistance Functions

    Authors: Leon Greiser, Ozan Demir, Benjamin Hartmann, Henrik Hose, Sebastian Trimpe

    Abstract: Complicated first principles modelling and controller synthesis can be prohibitively slow and expensive for high-mix, low-volume products such as hydraulic excavators. Instead, in a data-driven approach, recorded trajectories from the real system can be used to train local model networks (LMNs), for which feedforward controllers are derived via feedback linearization. However, previous works requi… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

  15. arXiv:2407.03476  [pdf, other

    cs.RO

    Learning deformable linear object dynamics from a single trajectory

    Authors: Shamil Mamedov, A. René Geist, Ruan Viljoen, Sebastian Trimpe, Jan Swevers

    Abstract: The manipulation of deformable linear objects (DLOs) via model-based control requires an accurate and computationally efficient dynamics model. Yet, data-driven DLO dynamics models require large training data sets while their predictions often do not generalize, whereas physics-based models rely on good approximations of physical phenomena and often lack accuracy. To address these challenges, we p… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  16. arXiv:2406.19768  [pdf, other

    cs.LG

    Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors

    Authors: Emma Cramer, Bernd Frauenknecht, Ramil Sabirov, Sebastian Trimpe

    Abstract: Combining Reinforcement Learning (RL) with a prior controller can yield the best out of two worlds: RL can solve complex nonlinear problems, while the control prior ensures safer exploration and speeds up training. Prior work largely blends both components with a fixed weight, neglecting that the RL agent's performance varies with the training progress and across regions in the state space. Theref… ▽ More

    Submitted 1 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

  17. arXiv:2406.18293  [pdf, other

    cs.LG cs.AI

    Combining Automated Optimisation of Hyperparameters and Reward Shape

    Authors: Julian Dierkes, Emma Cramer, Holger H. Hoos, Sebastian Trimpe

    Abstract: There has been significant progress in deep reinforcement learning (RL) in recent years. Nevertheless, finding suitable hyperparameter configurations and reward functions remains challenging even for experts, and performance heavily relies on these design choices. Also, most RL research is conducted on known benchmarks where knowledge about these choices already exists. However, novel practical ap… ▽ More

    Submitted 9 October, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: Published in the Reinforcement Learning Journal 2024

  18. arXiv:2406.06101  [pdf, ps, other

    cs.LG stat.ML

    On the Consistency of Kernel Methods with Dependent Observations

    Authors: Pierre-François Massiani, Sebastian Trimpe, Friedrich Solowjow

    Abstract: The consistency of a learning method is usually established under the assumption that the observations are a realization of an independent and identically distributed (i.i.d.) or mixing process. Yet, kernel methods such as support vector machines (SVMs), Gaussian processes, or conditional kernel mean embeddings (CKMEs) all give excellent performance under sampling schemes that are obviously non-i.… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 26 pages, 1 figure

  19. arXiv:2405.19014  [pdf, other

    cs.LG

    Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption

    Authors: Bernd Frauenknecht, Artur Eisele, Devdutt Subhasish, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Dyna-style model-based reinforcement learning (MBRL) combines model-free agents with predictive transition models through model-based rollouts. This combination raises a critical question: 'When to trust your model?'; i.e., which rollout length results in the model providing useful data? Janner et al. (2019) address this question by gradually increasing rollout lengths throughout the training. Whi… ▽ More

    Submitted 21 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  20. arXiv:2405.10618  [pdf, other

    cs.LG math.OC stat.ML

    Distributed Event-Based Learning via ADMM

    Authors: Guner Dilsad Er, Sebastian Trimpe, Michael Muehlebach

    Abstract: We consider a distributed learning problem, where agents minimize a global objective function by exchanging information over a network. Our approach has two distinct features: (i) It substantially reduces communication by triggering communication only when necessary, and (ii) it is agnostic to the data-distribution among the different agents. We therefore guarantee convergence even if the local da… ▽ More

    Submitted 6 February, 2025; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 35 pages, 12 figures

  21. arXiv:2404.05835  [pdf, other

    eess.SY cs.LG math.OC

    Parameter-Adaptive Approximate MPC: Tuning Neural-Network Controllers without Retraining

    Authors: Henrik Hose, Alexander Gräfe, Sebastian Trimpe

    Abstract: Model Predictive Control (MPC) is a method to control nonlinear systems with guaranteed stability and constraint satisfaction but suffers from high computation times. Approximate MPC (AMPC) with neural networks (NNs) has emerged to address this limitation, enabling deployment on resource-constrained embedded systems. However, when tuning AMPCs for real-world systems, large datasets need to be rege… ▽ More

    Submitted 6 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted to L4DC 2024

    Journal ref: PMLR 242:349-360, 2024

  22. arXiv:2403.12948  [pdf, other

    cs.LG stat.ML

    On Safety in Safe Bayesian Optimization

    Authors: Christian Fiedler, Johanna Menn, Lukas Kreisköther, Sebastian Trimpe

    Abstract: Optimizing an unknown function under safety constraints is a central task in robotics, biomedical engineering, and many other disciplines, and increasingly safe Bayesian Optimization (BO) is used for this. Due to the safety critical nature of these applications, it is of utmost importance that theoretical safety guarantees for these algorithms translate into the real world. In this work, we invest… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  23. arXiv:2312.10199  [pdf, other

    eess.SY cs.LG math.OC

    Automatic nonlinear MPC approximation with closed-loop guarantees

    Authors: Abdullah Tokmak, Christian Fiedler, Melanie N. Zeilinger, Sebastian Trimpe, Johannes Köhler

    Abstract: Safety guarantees are vital in many control applications, such as robotics. Model predictive control (MPC) provides a constructive framework for controlling safety-critical systems, but is limited by its computational complexity. We address this problem by presenting a novel algorithm that automatically computes an explicit approximation to nonlinear MPC schemes while retaining closed-loop guarant… ▽ More

    Submitted 11 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Submitted to IEEE Transactions on Automatic Control. Compared to the previously uploaded version, this version contains an additional numerical example

  24. arXiv:2312.00592  [pdf, other

    cs.LG cs.CV cs.RO

    Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version)

    Authors: Emma Cramer, Jonas Reiher, Sebastian Trimpe

    Abstract: Reinforcement learning (RL) for robot control typically requires a detailed representation of the environment state, including information about task-relevant objects not directly measurable. Keypoint detectors, such as spatial autoencoders (SAEs), are a common approach to extracting a low-dimensional representation from high-dimensional image data. SAEs aim at spatial features such as object posi… ▽ More

    Submitted 2 July, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 19 pages, 12 figures

  25. arXiv:2311.18393  [pdf, other

    cs.LG cs.RO

    Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control

    Authors: Bernd Frauenknecht, Tobias Ehlgen, Sebastian Trimpe

    Abstract: Advanced vehicle control is a fundamental building block in the development of autonomous driving systems. Reinforcement learning (RL) promises to achieve control performance superior to classical approaches while keeping computational demands low during deployment. However, standard RL approaches like soft-actor critic (SAC) require extensive amounts of training data to be collected and are thus… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  26. arXiv:2310.18074  [pdf, ps, other

    cs.LG math.ST stat.ML

    On kernel-based statistical learning in the mean field limit

    Authors: Christian Fiedler, Michael Herty, Sebastian Trimpe

    Abstract: In many applications of machine learning, a large number of variables are considered. Motivated by machine learning of interacting particle systems, we consider the situation when the number of input variables goes to infinity. First, we continue the recent investigation of the mean field limit of kernels and their reproducing kernel Hilbert spaces, completing the existing theory. Next, we provide… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  27. arXiv:2309.02873  [pdf, other

    cs.LG

    Learning Hybrid Dynamics Models With Simulator-Informed Latent States

    Authors: Katharina Ensinger, Sebastian Ziesche, Sebastian Trimpe

    Abstract: Dynamics model learning deals with the task of inferring unknown dynamics from measurement data and predicting the future behavior of the system. A typical approach to address this problem is to train recurrent models. However, predictions with these models are often not physically meaningful. Further, they suffer from deteriorated behavior over time due to accumulating errors. Often, simulators b… ▽ More

    Submitted 29 January, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at The 38th Annual AAAI Conference on Artificial Intelligence, 2024

  28. arXiv:2309.02351  [pdf, other

    cs.LG stat.ML

    Exact Inference for Continuous-Time Gaussian Process Dynamics

    Authors: Katharina Ensinger, Nicholas Tagliapietra, Sebastian Ziesche, Sebastian Trimpe

    Abstract: Physical systems can often be described via a continuous-time dynamical system. In practice, the true system is often unknown and has to be learned from measurement data. Since data is typically collected in discrete time, e.g. by sensors, most methods in Gaussian process (GP) dynamics model learning are trained on one-step ahead predictions. This can become problematic in several scenarios, e.g.… ▽ More

    Submitted 29 January, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted at The 38th Annual AAAI Conference on Artificial Intelligence. 2024

  29. arXiv:2308.06022  [pdf, other

    cs.CV

    Scale-Preserving Automatic Concept Extraction (SPACE)

    Authors: Andrés Felipe Posada-Moreno, Lukas Kreisköther, Tassilo Glander, Sebastian Trimpe

    Abstract: Convolutional Neural Networks (CNN) have become a common choice for industrial quality control, as well as other critical applications in the Industry 4.0. When these CNNs behave in ways unexpected to human users or developers, severe consequences can arise, such as economic losses or an increased risk to human life. Concept extraction techniques can be applied to increase the reliability and tran… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 22 pages, 7 figures

  30. arXiv:2307.07975  [pdf, other

    cs.RO cs.LG

    Pseudo-rigid body networks: learning interpretable deformable object dynamics from partial observations

    Authors: Shamil Mamedov, A. René Geist, Jan Swevers, Sebastian Trimpe

    Abstract: Accurately predicting deformable linear object (DLO) dynamics is challenging, especially when the task requires a model that is both human-interpretable and computationally efficient. In this work, we draw inspiration from the pseudo-rigid body method (PRB) and model a DLO as a serial chain of rigid bodies whose internal state is unrolled through time by a dynamics network. This dynamics network i… ▽ More

    Submitted 10 September, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: This work has been submitted to the IEEE for possible publication

  31. arXiv:2306.16973  [pdf, other

    cs.RO eess.SY

    Robust Direct Data-Driven Control for Probabilistic Systems

    Authors: Alexander von Rohr, Dmitrii Likhachev, Sebastian Trimpe

    Abstract: We propose a data-driven control method for systems with aleatoric uncertainty, for example, robot fleets with variations between agents. Our method leverages shared trajectory data to increase the robustness of the designed controller and thus facilitate transfer to new variations without the need for prior parameter and uncertainty estimations. In contrast to existing work on experience transfer… ▽ More

    Submitted 22 March, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

  32. arXiv:2306.03551  [pdf, other

    cs.AI cs.CV cs.LG

    Scalable Concept Extraction in Industry 4.0

    Authors: Andrés Felipe Posada-Moreno, Kai Müller, Florian Brillowski, Friedrich Solowjow, Thomas Gries, Sebastian Trimpe

    Abstract: The industry 4.0 is leveraging digital technologies and machine learning techniques to connect and optimize manufacturing processes. Central to this idea is the ability to transform raw data into human understandable knowledge for reliable data-driven decision-making. Convolutional Neural Networks (CNNs) have been instrumental in processing image data, yet, their ``black box'' nature complicates t… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  33. arXiv:2304.09575  [pdf, ps, other

    eess.SY cs.LG math.OC

    Approximate non-linear model predictive control with safety-augmented neural networks

    Authors: Henrik Hose, Johannes Köhler, Melanie N. Zeilinger, Sebastian Trimpe

    Abstract: Model predictive control (MPC) achieves stability and constraint satisfaction for general nonlinear systems, but requires computationally expensive online optimization. This paper studies approximations of such MPC controllers via neural networks (NNs) to achieve fast online evaluation. We propose safety augmentation that yields deterministic guarantees for convergence and constraint satisfaction… ▽ More

    Submitted 8 October, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

  34. Multimodal Multi-User Surface Recognition with the Kernel Two-Sample Test

    Authors: Behnam Khojasteh, Friedrich Solowjow, Sebastian Trimpe, Katherine J. Kuchenbecker

    Abstract: Machine learning and deep learning have been used extensively to classify physical surfaces through images and time-series contact data. However, these methods rely on human expertise and entail the time-consuming processes of data and parameter tuning. To overcome these challenges, we propose an easily implemented framework that can directly handle heterogeneous data sources for classification ta… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  35. arXiv:2302.14446  [pdf, ps, other

    stat.ML cs.LG math.FA math.NA

    Reproducing kernel Hilbert spaces in the mean field limit

    Authors: Christian Fiedler, Michael Herty, Michael Rom, Chiara Segala, Sebastian Trimpe

    Abstract: Kernel methods, being supported by a well-developed theory and coming with efficient algorithms, are among the most popular and successful machine learning techniques. From a mathematical point of view, these methods rest on the concept of kernels and function spaces generated by kernels, so called reproducing kernel Hilbert spaces. Motivated by recent developments of learning approaches in the co… ▽ More

    Submitted 17 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Updated author email addresses

    MSC Class: 46E22; 82B40; 74A25; 82C40

  36. arXiv:2302.13754  [pdf, other

    cs.LG

    Combining Slow and Fast: Complementary Filtering for Dynamics Learning

    Authors: Katharina Ensinger, Sebastian Ziesche, Barbara Rakitsch, Michael Tiemann, Sebastian Trimpe

    Abstract: Modeling an unknown dynamical system is crucial in order to predict the future behavior of the system. A standard approach is training recurrent models on measurement data. While these models typically provide exact short-term predictions, accumulating errors yield deteriorated long-term behavior. In contrast, models with reliable long-term predictions can often be obtained, either by training a r… ▽ More

    Submitted 1 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  37. Data-Driven Observability Analysis for Nonlinear Stochastic Systems

    Authors: Pierre-François Massiani, Mona Buisson-Fenet, Friedrich Solowjow, Florent Di Meglio, Sebastian Trimpe

    Abstract: Distinguishability and, by extension, observability are key properties of dynamical systems. Establishing these properties is challenging, especially when no analytical model is available and they are to be inferred directly from measurement data. The presence of noise further complicates this analysis, as standard notions of distinguishability are tailored to deterministic systems. We build on di… ▽ More

    Submitted 7 June, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 9 pages, 3 figures

    Journal ref: IEEE Transactions of Automatic Control 69 (2023) 4042 -- 4049

  38. arXiv:2208.10790  [pdf, other

    cs.LG stat.ML

    Event-Triggered Time-Varying Bayesian Optimization

    Authors: Paul Brunzema, Alexander von Rohr, Friedrich Solowjow, Sebastian Trimpe

    Abstract: We consider the problem of sequentially optimizing a time-varying objective function using time-varying Bayesian optimization (TVBO). Current approaches to TVBO require prior knowledge of a constant rate of change to cope with stale data arising from time variations. However, in practice, the rate of change is usually unknown. We propose an event-triggered algorithm, ET-GP-UCB, that treats the opt… ▽ More

    Submitted 4 February, 2025; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  39. Learning Fast and Precise Pixel-to-Torque Control

    Authors: Steffen Bleher, Steve Heim, Sebastian Trimpe

    Abstract: In the field, robots often need to operate in unknown and unstructured environments, where accurate sensing and state estimation (SE) becomes a major challenge. Cameras have been used to great success in mapping and planning in such environments, as well as complex but quasi-static tasks such as grasping, but are rarely integrated into the control loop for unstable systems. Learning pixel-to-torqu… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: video: https://www.youtube.com/watch?v=S2llScfG-8E 9 pages. Published in Robotics and Automation Magazine

  40. Improving the Performance of Robust Control through Event-Triggered Learning

    Authors: Alexander von Rohr, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Robust controllers ensure stability in feedback loops designed under uncertainty but at the cost of performance. Model uncertainty in time-invariant systems can be reduced by recently proposed learning-based methods, which improve the performance of robust controllers using data. However, in practice, many systems also exhibit uncertainty in the form of changes over time, e.g., due to weight shift… ▽ More

    Submitted 21 September, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: To appear in the proceedings of the 61st IEEE Conference on Decision and Control

  41. On Controller Tuning with Time-Varying Bayesian Optimization

    Authors: Paul Brunzema, Alexander von Rohr, Sebastian Trimpe

    Abstract: Changing conditions or environments can cause system dynamics to vary over time. To ensure optimal control performance, controllers should adapt to these changes. When the underlying cause and time of change is unknown, we need to rely on online data for this adaptation. In this paper, we will use time-varying Bayesian optimization (TVBO) to tune controllers online in changing environments using a… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: To appear in the proceedings of the 61st IEEE Conference on Decision and Control

    Journal ref: IEEE 61st Conference on Decision and Control (2022), p. 4046-4052

  42. arXiv:2207.06988  [pdf, other

    cs.RO

    The Wheelbot: A Jumping Reaction Wheel Unicycle

    Authors: A. René Geist, Jonathan Fiene, Naomi Tashiro, Zheng Jia, Sebastian Trimpe

    Abstract: Combining off-the-shelf components with 3D-printing, the Wheelbot is a symmetric reaction wheel unicycle that can jump onto its wheels from any initial position. With non-holonomic and under-actuated dynamics, as well as two coupled unstable degrees of freedom, the Wheelbot provides a challenging platform for nonlinear and data-driven control research. This paper presents the Wheelbot's mechanical… ▽ More

    Submitted 23 July, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Erratum: In the initial publication, Equation (3) was wrong and has been corrected in this version. Equation (3) relates to the transform from averaged body rates ${}^{\text{B}}ω_i$ to Euler rates. Importantly, the results in this papers are not affected by the wrong transform. More details are found in the projects github repo: https://github.com/AndReGeist/wheelbot-v2.5

  43. arXiv:2206.04531  [pdf, other

    cs.CV cs.AI cs.LG

    ECLAD: Extracting Concepts with Local Aggregated Descriptors

    Authors: Andres Felipe Posada-Moreno, Nikita Surya, Sebastian Trimpe

    Abstract: Convolutional neural networks (CNNs) are increasingly being used in critical systems, where robustness and alignment are crucial. In this context, the field of explainable artificial intelligence has proposed the generation of high-level explanations of the prediction process of CNNs through concept extraction. While these methods can detect whether or not a concept is present in an image, they ar… ▽ More

    Submitted 11 August, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 34 pages, under review

    MSC Class: 68T01 ACM Class: I.2.10; I.2.m

  44. arXiv:2205.12550  [pdf, other

    eess.SY cs.LG

    Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs

    Authors: Mona Buisson-Fenet, Valery Morgenthaler, Sebastian Trimpe, Florent Di Meglio

    Abstract: Identifying dynamical systems from experimental data is a notably difficult task. Prior knowledge generally helps, but the extent of this knowledge varies with the application, and customized models are often needed. Neural ordinary differential equations can be written as a flexible framework for system identification and can incorporate a broad spectrum of physical insight, giving physical inter… ▽ More

    Submitted 12 January, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  45. arXiv:2202.06052  [pdf, other

    cs.LG cs.RO eess.SY stat.ME stat.ML

    Learning by Doing: Controlling a Dynamical System using Causality, Control, and Reinforcement Learning

    Authors: Sebastian Weichwald, Søren Wengel Mogensen, Tabitha Edith Lee, Dominik Baumann, Oliver Kroemer, Isabelle Guyon, Sebastian Trimpe, Jonas Peters, Niklas Pfister

    Abstract: Questions in causality, control, and reinforcement learning go beyond the classical machine learning task of prediction under i.i.d. observations. Instead, these fields consider the problem of learning how to actively perturb a system to achieve a certain effect on a response variable. Arguably, they have complementary views on the problem: In control, one usually aims to first identify the system… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: https://learningbydoingcompetition.github.io/

  46. GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

    Authors: Bhavya Sukhija, Matteo Turchetta, David Lindner, Andreas Krause, Sebastian Trimpe, Dominik Baumann

    Abstract: Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage. Most existing model-free learning methods that guarantee safety, i.e., no failures, during exploration are limited to local optima. A notable exception is the GoSafe algorithm, which, unfortunately, cannot handle high-dimensional systems and hence cannot be… ▽ More

    Submitted 12 June, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Journal ref: Artificial Intelligence, Volume 320, Year 2023

  47. arXiv:2106.11899  [pdf, other

    cs.LG stat.ML

    Local policy search with Bayesian optimization

    Authors: Sarah Müller, Alexander von Rohr, Sebastian Trimpe

    Abstract: Reinforcement learning (RL) aims to find an optimal policy by interaction with an environment. Consequently, learning complex behavior requires a vast number of samples, which can be prohibitive in practice. Nevertheless, instead of systematically reasoning and actively choosing informative samples, policy gradients for local search are often obtained from random perturbations. These random sample… ▽ More

    Submitted 22 November, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Presented at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021

  48. arXiv:2105.13281  [pdf, other

    cs.RO cs.LG eess.SY

    GoSafe: Globally Optimal Safe Robot Learning

    Authors: Dominik Baumann, Alonso Marco, Matteo Turchetta, Sebastian Trimpe

    Abstract: When learning policies for robotic systems from data, safety is a major concern, as violation of safety constraints may cause hardware damage. SafeOpt is an efficient Bayesian optimization (BO) algorithm that can learn policies while guaranteeing safety with high probability. However, its search space is limited to an initially given safe region. We extend this method by exploring outside the init… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  49. arXiv:2105.12204  [pdf, other

    eess.SY cs.LG cs.RO

    Safe Value Functions

    Authors: Pierre-François Massiani, Steve Heim, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Safety constraints and optimality are important, but sometimes conflicting criteria for controllers. Although these criteria are often solved separately with different tools to maintain formal guarantees, it is also common practice in reinforcement learning to simply modify reward functions by penalizing failures, with the penalty treated as a mere heuristic. We rigorously examine the relationship… ▽ More

    Submitted 1 December, 2022; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: 16 pages, 6 figures. Accepted for publication in: Transactions of Automatic Control, special issue on Learning and Control

    Journal ref: IEEE Transactions of Automatic Control 68, Issue 5 (2023) 2743 -- 2757

  50. arXiv:2105.07668  [pdf, other

    eess.SY cs.LG

    Probabilistic Robust Linear Quadratic Regulators with Gaussian Processes

    Authors: Alexander von Rohr, Matthias Neumann-Brosig, Sebastian Trimpe

    Abstract: Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design. While learning-based control has the potential to yield superior performance in demanding applications, robustness to uncertainty remains an important challenge. Since Bayesian methods quantify uncertainty of the learning results, it is natural… ▽ More

    Submitted 21 September, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Published in the proceedings of the 3rd Conference on Learning for Dynamics and Control, this version fixes a typo in Algorithm 1