Reinforcement Learning via Recurrent Convolutional Neural Networks

Shankar, Tanmay; Dwivedy, Santosha K.; Guha, Prithwijit

Computer Science > Machine Learning

arXiv:1701.02392 (cs)

[Submitted on 9 Jan 2017]

Title:Reinforcement Learning via Recurrent Convolutional Neural Networks

Authors:Tanmay Shankar, Santosha K. Dwivedy, Prithwijit Guha

View PDF

Abstract:Deep Reinforcement Learning has enabled the learning of policies for complex tasks in partially observable environments, without explicitly learning the underlying model of the tasks. While such model-free methods achieve considerable performance, they often ignore the structure of task. We present a natural representation of to Reinforcement Learning (RL) problems using Recurrent Convolutional Neural Networks (RCNNs), to better exploit this inherent structure. We define 3 such RCNNs, whose forward passes execute an efficient Value Iteration, propagate beliefs of state in partially observable environments, and choose optimal actions respectively. Backpropagating gradients through these RCNNs allows the system to explicitly learn the Transition Model and Reward Function associated with the underlying MDP, serving as an elegant alternative to classical model-based RL. We evaluate the proposed algorithms in simulation, considering a robot planning problem. We demonstrate the capability of our framework to reduce the cost of replanning, learn accurate MDP models, and finally re-plan with learnt models to achieve near-optimal policies.

Comments:	Accepted at the International Conference on Pattern Recognition, ICPR 2016
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
MSC classes:	68T05
Cite as:	arXiv:1701.02392 [cs.LG]
	(or arXiv:1701.02392v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1701.02392

Submission history

From: Tanmay Shankar [view email]
[v1] Mon, 9 Jan 2017 23:36:05 UTC (1,448 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2017-01

Change to browse by:

cs
cs.AI

References & Citations

DBLP - CS Bibliography

listing | bibtex

Tanmay Shankar
Santosha K. Dwivedy
Prithwijit Guha

export BibTeX citation

Computer Science > Machine Learning

Title:Reinforcement Learning via Recurrent Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Reinforcement Learning via Recurrent Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators