EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine

Weng, Jiayi; Lin, Min; Huang, Shengyi; Liu, Bo; Makoviichuk, Denys; Makoviychuk, Viktor; Liu, Zichen; Song, Yufan; Luo, Ting; Jiang, Yukun; Xu, Zhongwen; Yan, Shuicheng

arXiv:2206.10558 (cs)

[Submitted on 21 Jun 2022 (v1), last revised 12 Oct 2022 (this version, v2)]

Abstract: There has been significant progress in developing reinforcement learning (RL) training systems. Prior work such as IMPALA, Apex, Seed RL, and Sample Factory aims to improve the system's overall throughput. In this paper, we address a common bottleneck in RL training systems, namely parallel environment execution, which is often the slowest part of the whole system but receives little attention. With a curated design for parallelizing RL environments, we have improved the RL environment simulation speed across different hardware setups, ranging from a laptop and a modest workstation to a high-end machine such as the NVIDIA DGX-A100. On a high-end machine, EnvPool achieves one million frames per second for environment execution on Atari environments and three million frames per second on MuJoCo environments. When running on a laptop, EnvPool is 2.8 times as fast as Python subprocess-based execution. Moreover, great compatibility with existing RL training libraries has been demonstrated in the open-source community, including CleanRL, rl_games, DeepMind Acme, etc. Finally, EnvPool allows researchers to iterate on their ideas at a much faster pace and has great potential to become the de facto RL environment execution engine. Example runs show that it takes only five minutes to train agents to play Atari Pong and MuJoCo Ant on a laptop. EnvPool is open-sourced at this https URL.
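
To make the idea of parallel environment execution concrete, the following is a minimal sketch of a batched step interface backed by a pool of worker threads. It is not EnvPool's implementation or API: `CounterEnv` and `ToyVectorEnv` are hypothetical names invented for this illustration, the environment is a trivial counter, and plain Python threads will not reproduce the throughput figures quoted in the abstract. It only illustrates the pattern of one call that steps many environments and gathers their results in order.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass
class CounterEnv:
    """Hypothetical toy environment: the observation is a step counter."""
    horizon: int = 5
    t: int = 0

    def reset(self) -> int:
        self.t = 0
        return self.t

    def step(self, action: int):
        # Advance one step; the reward equals the action and the
        # episode ends once the counter reaches the horizon.
        self.t += 1
        done = self.t >= self.horizon
        return self.t, float(action), done


class ToyVectorEnv:
    """Hypothetical wrapper that steps a batch of environments on worker threads."""

    def __init__(self, num_envs: int, num_workers: int = 4):
        self.envs = [CounterEnv() for _ in range(num_envs)]
        self.pool = ThreadPoolExecutor(max_workers=num_workers)

    def reset(self):
        return list(self.pool.map(lambda env: env.reset(), self.envs))

    def step(self, actions):
        # Dispatch every env.step to the pool; map() returns the
        # results in the same order as the environments.
        results = list(
            self.pool.map(lambda pair: pair[0].step(pair[1]), zip(self.envs, actions))
        )
        obs, rewards, dones = (list(column) for column in zip(*results))
        return obs, rewards, dones

    def close(self):
        self.pool.shutdown()


venv = ToyVectorEnv(num_envs=8)
first_obs = venv.reset()
obs, rewards, dones = venv.step([1] * 8)
venv.close()
```

In this sketch the caller sees a single `step` call per batch, so the training loop does not need to manage individual environments or workers.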

Comments:	NeurIPS'22 camera-ready version
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF); Robotics (cs.RO)
Cite as:	arXiv:2206.10558 [cs.LG]
	(or arXiv:2206.10558v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.10558

Submission history

From: Jiayi Weng
[v1] Tue, 21 Jun 2022 17:36:15 UTC (18,970 KB)
[v2] Wed, 12 Oct 2022 16:53:29 UTC (20,874 KB)
