Off-Policy Meta-Reinforcement Learning Based on Feature Embedding Spaces

Imagawa, Takahisa; Hiraoka, Takuya; Tsuruoka, Yoshimasa

Computer Science > Artificial Intelligence

arXiv:2101.01883 (cs)

[Submitted on 6 Jan 2021]

Title:Off-Policy Meta-Reinforcement Learning Based on Feature Embedding Spaces

Authors:Takahisa Imagawa, Takuya Hiraoka, Yoshimasa Tsuruoka

View PDF

Abstract:Meta-reinforcement learning (RL) addresses the problem of sample inefficiency in deep RL by using experience obtained in past tasks for a new task to be solved.
However, most meta-RL methods require partially or fully on-policy data, i.e., they cannot reuse the data collected by past policies, which hinders the improvement of sample efficiency.
To alleviate this problem, we propose a novel off-policy meta-RL method, embedding learning and evaluation of uncertainty (ELUE).
An ELUE agent is characterized by the learning of a feature embedding space shared among tasks.
It learns a belief model over the embedding space and a belief-conditional policy and Q-function.
Then, for a new task, it collects data by the pretrained policy, and updates its belief based on the belief model.
Thanks to the belief update, the performance can be improved with a small amount of data.
In addition, it updates the parameters of the neural networks to adjust the pretrained relationships when there are enough data.
We demonstrate that ELUE outperforms state-of-the-art meta RL methods through experiments on meta-RL benchmarks.

Comments:	14pages
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2101.01883 [cs.AI]
	(or arXiv:2101.01883v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2101.01883

Submission history

From: Takahisa Imagawa [view email]
[v1] Wed, 6 Jan 2021 05:51:38 UTC (5,475 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2021-01

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Off-Policy Meta-Reinforcement Learning Based on Feature Embedding Spaces

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Off-Policy Meta-Reinforcement Learning Based on Feature Embedding Spaces

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators