Sim-Anchored Learning for On-the-Fly Adaptation

Mabsout, Bassel El; Roozkhosh, Shahin; Mysore, Siddharth; Saenko, Kate; Mancuso, Renato

Computer Science > Robotics

arXiv:2301.06987 (cs)

[Submitted on 17 Jan 2023 (v1), last revised 1 May 2025 (this version, v3)]

Abstract: Fine-tuning simulation-trained RL agents with real-world data often degrades crucial behaviors due to limited or skewed data distributions. We argue that designer priorities exist not just in reward functions, but also in simulation design choices like task selection and state initialization. When adapting to real-world data, agents can experience catastrophic forgetting in important but underrepresented scenarios. We propose framing live adaptation as a multi-objective optimization problem, where policy objectives must be satisfied both in simulation and reality. Our approach leverages critics from simulation as "anchors for design intent" (anchor critics). By jointly optimizing policies against both anchor critics and critics trained on real-world experience, our method enables adaptation while preserving prioritized behaviors from simulation. Evaluations demonstrate robust behavior retention in sim-to-sim benchmarks and a sim-to-real scenario with a racing quadrotor, allowing for power consumption reductions of up to 50% without control loss. We also contribute SwaNNFlight, an open-source firmware for enabling live adaptation on similar robotic platforms.
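The idea of optimizing a policy jointly against a simulation-trained "anchor" critic and a critic trained on real-world experience can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the one-parameter linear policy, the two hand-written quadratic critics, the state samples, and the weighted-sum combination. The abstract does not say how the two objectives are actually composed.

```python
from statistics import fmean

# Hypothetical state samples (assumption): simulation covers a wide range,
# while the real-world data is narrower and skewed.
SIM_STATES = [-1.0, 1.0, 2.0]
REAL_STATES = [0.5, 1.0]


def anchor_critic(state, action):
    """Stand-in for a critic learned in simulation; peaks at action = -1.0 * state."""
    return -((action + 1.0 * state) ** 2)


def real_critic(state, action):
    """Stand-in for a critic learned from real data; peaks at action = -0.5 * state."""
    return -((action + 0.5 * state) ** 2)


def joint_objective(gain, sim_states, real_states, weight):
    """Weighted sum of both critics for the linear policy action = gain * state.

    The weighted sum is an illustrative choice, not the paper's composition.
    """
    sim_value = fmean(anchor_critic(s, gain * s) for s in sim_states)
    real_value = fmean(real_critic(s, gain * s) for s in real_states)
    return weight * sim_value + (1.0 - weight) * real_value


def adapt_gain(gain, sim_states, real_states, weight, lr=0.05, steps=200, eps=1e-4):
    """Gradient ascent on the joint objective using a central finite difference."""
    for _ in range(steps):
        upper = joint_objective(gain + eps, sim_states, real_states, weight)
        lower = joint_objective(gain - eps, sim_states, real_states, weight)
        gain += lr * (upper - lower) / (2.0 * eps)
    return gain


if __name__ == "__main__":
    for weight in (0.0, 0.5, 1.0):
        gain = adapt_gain(0.0, SIM_STATES, REAL_STATES, weight)
        print(f"anchor weight {weight:.1f} -> policy gain {gain:.3f}")
```

In this toy setting, an anchor weight of 0 lets the policy drift entirely to what the real-world critic prefers (a gain near -0.5), an anchor weight of 1 keeps the simulation-preferred gain (near -1.0), and intermediate weights settle between the two, which is the trade-off between adaptation and behavior retention that the abstract describes.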

Subjects:	Robotics (cs.RO); Machine Learning (cs.LG)
Cite as:	arXiv:2301.06987 [cs.RO]
	(or arXiv:2301.06987v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2301.06987

Submission history

From: Bassel El Mabsout
[v1] Tue, 17 Jan 2023 16:16:53 UTC (2,404 KB)
[v2] Fri, 25 Oct 2024 20:59:49 UTC (2,727 KB)
[v3] Thu, 1 May 2025 15:26:45 UTC (5,794 KB)
