Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning

Nayyar, Rashmeet Kaur; Srivastava, Siddharth

Abstract:Abstraction is key to scaling up reinforcement learning (RL). However, autonomously learning abstract state and action representations to enable transfer and generalization remains a challenging open problem. This paper presents a novel approach for inventing, representing, and utilizing options, which represent temporally extended behaviors, in continual RL settings. Our approach addresses streams of stochastic problems characterized by long horizons, sparse rewards, and unknown transition and reward functions.
Our approach continually learns and maintains an interpretable state abstraction, and uses it to invent high-level options with abstract symbolic representations. These options meet three key desiderata: (1) composability for solving tasks effectively with lookahead planning, (2) reusability across problem instances for minimizing the need for relearning, and (3) mutual independence for reducing interference among options. Our main contributions are approaches for continually learning transferable, generalizable options with symbolic representations, and for integrating search techniques with RL to efficiently plan over these learned options to solve new problems. Empirical results demonstrate that the resulting approach effectively learns and transfers abstract knowledge across problem instances, achieving superior sample efficiency compared to state-of-the-art methods.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.16395 [cs.AI]
	(or arXiv:2412.16395v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2412.16395

Computer Science > Artificial Intelligence

Title:Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators