Overcoming Knowledge Barriers: Online Imitation Learning from Visual Observation with Pretrained World Models

Zhang, Xingyuan; Becker-Ehmck, Philip; van der Smagt, Patrick; Karl, Maximilian

Computer Science > Machine Learning

arXiv:2404.18896 (cs)

[Submitted on 29 Apr 2024 (v1), last revised 23 Apr 2025 (this version, v2)]

Title:Overcoming Knowledge Barriers: Online Imitation Learning from Visual Observation with Pretrained World Models

Authors:Xingyuan Zhang, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl

View PDF HTML (experimental)

Abstract:Pretraining and finetuning models has become increasingly popular in decision-making. But there are still serious impediments in Imitation Learning from Observation (ILfO) with pretrained models. This study identifies two primary obstacles: the Embodiment Knowledge Barrier (EKB) and the Demonstration Knowledge Barrier (DKB). The EKB emerges due to the pretrained models' limitations in handling novel observations, which leads to inaccurate action inference. Conversely, the DKB stems from the reliance on limited demonstration datasets, restricting the model's adaptability across diverse scenarios. We propose separate solutions to overcome each barrier and apply them to Action Inference by Maximising Evidence (AIME), a state-of-the-art algorithm. This new algorithm, AIME-NoB, integrates online interactions and a data-driven regulariser to mitigate the EKB. Additionally, it uses a surrogate reward function to broaden the policy's supported states, addressing the DKB. Our experiments on vision-based control tasks from the DeepMind Control Suite and MetaWorld benchmarks show that AIME-NoB significantly improves sample efficiency and converged performance, presenting a robust framework for overcoming the challenges in ILfO with pretrained models. Code available at this https URL.

Comments:	Accepted at TMLR
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2404.18896 [cs.LG]
	(or arXiv:2404.18896v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2404.18896

Submission history

From: Xingyuan Zhang [view email]
[v1] Mon, 29 Apr 2024 17:33:52 UTC (555 KB)
[v2] Wed, 23 Apr 2025 19:11:30 UTC (2,055 KB)

Computer Science > Machine Learning

Title:Overcoming Knowledge Barriers: Online Imitation Learning from Visual Observation with Pretrained World Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Overcoming Knowledge Barriers: Online Imitation Learning from Visual Observation with Pretrained World Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators