Context-aware Active Multi-Step Reinforcement Learning

Chen, Gang; Li, Dingcheng; Xu, Ran

Computer Science > Machine Learning

arXiv:1911.04107 (cs)

[Submitted on 11 Nov 2019 (v1), last revised 27 Nov 2019 (this version, v2)]

Title:Context-aware Active Multi-Step Reinforcement Learning

Authors:Gang Chen, Dingcheng Li, Ran Xu

View PDF

Abstract:Reinforcement learning has attracted great attention recently, especially policy gradient algorithms, which have been demonstrated on challenging decision making and control tasks. In this paper, we propose an active multi-step TD algorithm with adaptive stepsizes to learn actor and critic. Specifically, our model consists of two components: active stepsize learning and adaptive multi-step TD algorithm. Firstly, we divide the time horizon into chunks and actively select state and action inside each chunk. Then given the selected samples, we propose the adaptive multi-step TD, which generalizes TD($\lambda$), but adaptively switch on/off the backups from future returns of different steps. Particularly, the adaptive multi-step TD introduces a context-aware mechanism, here a binary classifier, which decides whether or not to turn on its future backups based on the context changes. Thus, our model is kind of combination of active learning and multi-step TD algorithm, which has the capacity for learning off-policy without the need of importance sampling. We evaluate our approach on both discrete and continuous space tasks in an off-policy setting respectively, and demonstrate competitive results compared to other reinforcement learning baselines.

Comments:	9 pages
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
MSC classes:	I.2.6
ACM classes:	I.2.6
Cite as:	arXiv:1911.04107 [cs.LG]
	(or arXiv:1911.04107v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.04107

Submission history

From: Gang Chen [view email]
[v1] Mon, 11 Nov 2019 06:37:47 UTC (1,607 KB)
[v2] Wed, 27 Nov 2019 06:47:54 UTC (1,925 KB)

Computer Science > Machine Learning

Title:Context-aware Active Multi-Step Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Context-aware Active Multi-Step Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators