Skip to main content

Showing 1–3 of 3 results for author: Ingelhag, N

.
  1. arXiv:2502.02308  [pdf, other

    cs.RO cs.LG

    Real-Time Operator Takeover for Visuomotor Diffusion Policy Training

    Authors: Nils Ingelhag, Jesper Munkeby, Michael C. Welle, Marco Moletta, Danica Kragic

    Abstract: We present a Real-Time Operator Takeover (RTOT) paradigm enabling operators to seamlessly take control of a live visuomotor diffusion policy, guiding the system back into desirable states or reinforcing specific demonstrations. We present new insights in using the Mahalonobis distance to automatically identify undesirable states. Once the operator has intervened and redirected the system, the cont… ▽ More

    Submitted 13 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

  2. arXiv:2409.20248  [pdf, other

    cs.RO

    Feature Extractor or Decision Maker: Rethinking the Role of Visual Encoders in Visuomotor Policies

    Authors: Ruiyu Wang, Zheyu Zhuang, Shutong Jin, Nils Ingelhag, Danica Kragic, Florian T. Pokorny

    Abstract: An end-to-end (E2E) visuomotor policy is typically treated as a unified whole, but recent approaches using out-of-domain (OOD) data to pretrain the visual encoder have cleanly separated the visual encoder from the network, with the remainder referred to as the policy. We propose Visual Alignment Testing, an experimental framework designed to evaluate the validity of this functional separation. Our… ▽ More

    Submitted 14 May, 2025; v1 submitted 30 September, 2024; originally announced September 2024.

  3. arXiv:2403.16730  [pdf, other

    cs.RO

    A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models

    Authors: Nils Ingelhag, Jesper Munkeby, Jonne van Haastregt, Anastasia Varava, Michael C. Welle, Danica Kragic

    Abstract: In this paper, we build upon two major recent developments in the field, Diffusion Policies for visuomotor manipulation and large pre-trained multimodal foundational models to obtain a robotic skill learning system. The system can obtain new skills via the behavioral cloning approach of visuomotor diffusion policies given teleoperated demonstrations. Foundational models are being used to perform s… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: https://roboskillframework.github.io