Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

Shang, Wenling; Trott, Alex; Zheng, Stephan; Xiong, Caiming; Socher, Richard

arXiv:1907.00664 (cs)

[Submitted on 1 Jul 2019]

Abstract: In many real-world scenarios, an autonomous agent often encounters various tasks within a single complex environment. We propose to build a graph abstraction over the environment structure to accelerate the learning of these tasks. Here, nodes are important points of interest (pivotal states) and edges represent feasible traversals between them. Our approach has two stages. First, we jointly train a latent pivotal state model and a curiosity-driven goal-conditioned policy in a task-agnostic manner. Second, provided with the information from the world graph, a high-level Manager quickly finds solutions to new tasks and expresses subgoals in reference to pivotal states to a low-level Worker. The Worker can then also leverage the graph to easily traverse to the pivotal states of interest, even across long distances, and explore non-locally. We perform a thorough ablation study to evaluate our approach on a suite of challenging maze tasks, demonstrating significant advantages from the proposed framework over baselines that lack world graph knowledge in terms of performance and efficiency.
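
The graph abstraction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names `build_world_graph` and `plan_traversal`, the `(row, col)` maze-cell encoding of pivotal states, the choice of undirected edges, and the use of breadth-first search are all assumptions made for illustration. It shows only how a Worker could, in principle, look up a route between two pivotal states once a world graph is available.

```python
from collections import deque


def build_world_graph(edges):
    """Build an adjacency map from pairs of pivotal states.

    Assumption: traversals are treated as undirected here for simplicity.
    """
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def plan_traversal(graph, start, goal):
    """Return a shortest (fewest-edge) path of pivotal states, or None.

    Breadth-first search is a stand-in chosen for this sketch.
    """
    if start not in graph or goal not in graph:
        return None
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            # Walk the parent links back to the start, then reverse.
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbor in sorted(graph[node]):
            if neighbor not in parents:
                parents[neighbor] = node
                queue.append(neighbor)
    return None


# Hypothetical pivotal states, written as (row, col) maze cells.
edges = [
    ((0, 0), (0, 3)),
    ((0, 3), (4, 3)),
    ((4, 3), (4, 7)),
    ((0, 0), (4, 0)),
]
graph = build_world_graph(edges)
path = plan_traversal(graph, (0, 0), (4, 7))
print(path)
```

In this toy setting, a Manager would name a pivotal state such as `(4, 7)` as a subgoal, and the Worker would follow the returned sequence of pivotal states instead of exploring locally.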

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1907.00664 [cs.LG]
	(or arXiv:1907.00664v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1907.00664

Submission history

From: Wenling Shang
[v1] Mon, 1 Jul 2019 11:22:52 UTC (592 KB)
