Reinforcement Learning in a Physics-Inspired Semi-Markov Environment

Bellinger, Colin; Coles, Rory; Crowley, Mark; Tamblyn, Isaac

arXiv:2004.07333 (cs)

[Submitted on 15 Apr 2020]

Abstract: Reinforcement learning (RL) has been demonstrated to have great potential in many applications of scientific discovery and design. Recent work includes, for example, the design of new structures and compositions of molecules for therapeutic drugs. Much of the existing work related to the application of RL to scientific domains, however, assumes that the available state representation obeys the Markov property. For reasons associated with time, cost, sensor accuracy, and gaps in scientific knowledge, many scientific design and discovery problems do not satisfy the Markov property. Thus, something other than a Markov decision process (MDP) should be used to plan or find the optimal policy. In this paper, we present a physics-inspired semi-Markov RL environment, namely the phase change environment. In addition, we evaluate the performance of value-based RL algorithms for both MDPs and partially observable MDPs (POMDPs) on the proposed environment. Our results demonstrate that deep recurrent Q-networks (DRQNs) significantly outperform deep Q-networks (DQNs), and that DRQNs benefit from training with hindsight experience replay. Implications for the use of semi-Markovian RL and POMDPs for scientific laboratories are also discussed.

Comments:	To appear in the Canadian Conference on Artificial Intelligence, 2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
ACM classes:	I.2; J.2
Cite as:	arXiv:2004.07333 [cs.LG]
	(or arXiv:2004.07333v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2004.07333

Submission history

From: Colin Bellinger
[v1] Wed, 15 Apr 2020 20:43:29 UTC (1,078 KB)
