Skip to main content

Showing 1–2 of 2 results for author: Julbe, P M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.04603  [pdf, other

    cs.RO

    Diffusion-Based Approximate MPC: Fast and Consistent Imitation of Multi-Modal Action Distributions

    Authors: Pau Marquez Julbe, Julian Nubert, Henrik Hose, Sebastian Trimpe, Katherine J. Kuchenbecker

    Abstract: Approximating model predictive control (MPC) using imitation learning (IL) allows for fast control without solving expensive optimization problems online. However, methods that use neural networks in a simple L2-regression setup fail to approximate multi-modal (set-valued) solution distributions caused by local optima found by the numerical solver or non-convex constraints, such as obstacles, sign… ▽ More

    Submitted 13 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

  2. arXiv:2312.02665  [pdf, other

    cs.AI cs.LG

    Lights out: training RL agents robust to temporary blindness

    Authors: N. Ordonez, M. Tromp, P. M. Julbe, W. Böhmer

    Abstract: Agents trained with DQN rely on an observation at each timestep to decide what action to take next. However, in real world applications observations can change or be missing entirely. Examples of this could be a light bulb breaking down, or the wallpaper in a certain room changing. While these situations change the actual observation, the underlying optimal policy does not change. Because of this… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.