PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Zeng, Kuo-Hao; Zhang, Zichen; Ehsani, Kiana; Hendrix, Rose; Salvador, Jordi; Herrasti, Alvaro; Girshick, Ross; Kembhavi, Aniruddha; Weihs, Luca

Computer Science > Robotics

arXiv:2406.20083 (cs)

[Submitted on 28 Jun 2024]

Title:PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Authors:Kuo-Hao Zeng, Zichen Zhang, Kiana Ehsani, Rose Hendrix, Jordi Salvador, Alvaro Herrasti, Ross Girshick, Aniruddha Kembhavi, Luca Weihs

View PDF HTML (experimental)

Abstract:We present PoliFormer (Policy Transformer), an RGB-only indoor navigation agent trained end-to-end with reinforcement learning at scale that generalizes to the real-world without adaptation despite being trained purely in simulation. PoliFormer uses a foundational vision transformer encoder with a causal transformer decoder enabling long-term memory and reasoning. It is trained for hundreds of millions of interactions across diverse environments, leveraging parallelized, multi-machine rollouts for efficient training with high throughput. PoliFormer is a masterful navigator, producing state-of-the-art results across two distinct embodiments, the LoCoBot and Stretch RE-1 robots, and four navigation benchmarks. It breaks through the plateaus of previous work, achieving an unprecedented 85.5% success rate in object goal navigation on the CHORES-S benchmark, a 28.5% absolute improvement. PoliFormer can also be trivially extended to a variety of downstream applications such as object tracking, multi-object navigation, and open-vocabulary navigation with no finetuning.

Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.20083 [cs.RO]
	(or arXiv:2406.20083v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2406.20083

Submission history

From: Kuo-Hao Zeng [view email]
[v1] Fri, 28 Jun 2024 17:51:10 UTC (4,736 KB)

Computer Science > Robotics

Title:PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators