Formalising the Foundations of Discrete Reinforcement Learning in Isabelle/HOL

Chevallier, Mark; Fleuriot, Jacques

Computer Science > Logic in Computer Science

arXiv:2112.05996 (cs)

[Submitted on 11 Dec 2021]

Title:Formalising the Foundations of Discrete Reinforcement Learning in Isabelle/HOL

Authors:Mark Chevallier, Jacques Fleuriot

View PDF

Abstract:We present a formalisation of finite Markov decision processes with rewards in the Isabelle theorem prover. We focus on the foundations required for dynamic programming and the use of reinforcement learning agents over such processes. In particular, we derive the Bellman equation from first principles (in both scalar and vector form), derive a vector calculation that produces the expected value of any policy p, and go on to prove the existence of a universally optimal policy where there is a discounting factor less than one. Lastly, we prove that the value iteration and the policy iteration algorithms work in finite time, producing an epsilon-optimal and a fully optimal policy respectively.

Subjects:	Logic in Computer Science (cs.LO); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
Cite as:	arXiv:2112.05996 [cs.LO]
	(or arXiv:2112.05996v1 [cs.LO] for this version)
	https://doi.org/10.48550/arXiv.2112.05996

Submission history

From: Mark Chevallier [view email]
[v1] Sat, 11 Dec 2021 14:38:36 UTC (1,329 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LO

< prev | next >

new | recent | 2021-12

Change to browse by:

cs
cs.AI
math
math.OC

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jacques D. Fleuriot

export BibTeX citation

Computer Science > Logic in Computer Science

Title:Formalising the Foundations of Discrete Reinforcement Learning in Isabelle/HOL

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Logic in Computer Science

Title:Formalising the Foundations of Discrete Reinforcement Learning in Isabelle/HOL

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators