Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks

Pardo, Fabio; Levdik, Vitaly; Kormushev, Petar

Computer Science > Machine Learning

arXiv:1810.02927 (cs)

[Submitted on 6 Oct 2018 (v1), last revised 4 Feb 2020 (this version, v2)]

Title:Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks

Authors:Fabio Pardo, Vitaly Levdik, Petar Kormushev

View PDF

Abstract:Being able to reach any desired location in the environment can be a valuable asset for an agent. Learning a policy to navigate between all pairs of states individually is often not feasible. An all-goals updating algorithm uses each transition to learn Q-values towards all goals simultaneously and off-policy. However the expensive numerous updates in parallel limited the approach to small tabular cases so far. To tackle this problem we propose to use convolutional network architectures to generate Q-values and updates for a large number of goals at once. We demonstrate the accuracy and generalization qualities of the proposed method on randomly generated mazes and Sokoban puzzles. In the case of on-screen goal coordinates the resulting mapping from frames to distance-maps directly informs the agent about which places are reachable and in how many steps. As an example of application we show that replacing the random actions in epsilon-greedy exploration by several actions towards feasible goals generates better exploratory trajectories on Montezuma's Revenge and Super Mario All-Stars games.

Comments:	AAAI 2020, this https URL
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1810.02927 [cs.LG]
	(or arXiv:1810.02927v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.02927

Submission history

From: Fabio Pardo [view email]
[v1] Sat, 6 Oct 2018 03:26:43 UTC (416 KB)
[v2] Tue, 4 Feb 2020 19:54:40 UTC (449 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-10

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Fabio Pardo
Vitaly Levdik
Petar Kormushev

export BibTeX citation

Computer Science > Machine Learning

Title:Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators