Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics

Kim, Dongyoung; Park, Sumin; Jang, Huiwon; Shin, Jinwoo; Kim, Jaehyung; Seo, Younggyo

Computer Science > Robotics

arXiv:2506.00070 (cs)

[Submitted on 29 May 2025]

Title:Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics

Authors:Dongyoung Kim, Sumin Park, Huiwon Jang, Jinwoo Shin, Jaehyung Kim, Younggyo Seo

View PDF HTML (experimental)

Abstract:Large Vision-Language Models (LVLMs) have recently shown great promise in advancing robotics by combining embodied reasoning with robot control. A common approach involves training on embodied reasoning tasks related to robot control using Supervised Fine-Tuning (SFT). However, SFT datasets are often heuristically constructed and not explicitly optimized for improving robot control. Furthermore, SFT often leads to issues such as catastrophic forgetting and reduced generalization performance. To address these limitations, we introduce Robot-R1, a novel framework that leverages reinforcement learning to enhance embodied reasoning specifically for robot control. Robot-R1 learns to predict the next keypoint state required for task completion, conditioned on the current scene image and environment metadata derived from expert demonstrations. Inspired by the DeepSeek-R1 learning approach, Robot-R1 samples reasoning-based responses and reinforces those that lead to more accurate predictions. Our experiments show that models trained with Robot-R1 outperform SFT methods on embodied reasoning tasks. Despite having only 7B parameters, Robot-R1 even surpasses GPT-4o on reasoning tasks related to low-level action control, such as spatial and primitive movement reasoning.

Comments:	26 pages, 14 figures
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2506.00070 [cs.RO]
	(or arXiv:2506.00070v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2506.00070

Submission history

From: Dongyoung Kim [view email]
[v1] Thu, 29 May 2025 16:41:12 UTC (323 KB)

Computer Science > Robotics

Title:Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators