On the function approximation error for risk-sensitive reinforcement learning

Karmakar, Prasenjit; Bhatnagar, Shalabh

Computer Science > Machine Learning

arXiv:1612.07562 (cs)

[Submitted on 22 Dec 2016 (v1), last revised 22 Oct 2019 (this version, v15)]

Title:On the function approximation error for risk-sensitive reinforcement learning

Authors:Prasenjit Karmakar, Shalabh Bhatnagar

View PDF

Abstract:In this paper we obtain several informative error bounds on function approximation for the policy evaluation algorithm proposed by Basu et al. when the aim is to find the risk-sensitive cost represented using exponential utility. The main idea is to use classical Bapat's inequality and to use Perron-Frobenius eigenvectors (exists if we assume irreducible Markov chain) to get the new bounds. The novelty of our approach is that we use the irreduciblity of Markov chain to get the new bounds whereas the earlier work by Basu et al. used spectral variation bound which is true for any matrix. We also give examples where all our bounds achieve the "actual error" whereas the earlier bound given by Basu et al. is much weaker in comparison. We show that this happens due to the absence of difference term in the earlier bound which is always present in all our bounds when the state space is large. Additionally, we discuss how all our bounds compare with each other. As a corollary of our main result we provide a bound between largest eigenvalues of two irreducibile matrices in terms of the matrix entries.

Comments:	Improved the bound in Theorem V.4
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1612.07562 [cs.LG]
	(or arXiv:1612.07562v15 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1612.07562

Submission history

From: Prasenjit Karmakar [view email]
[v1] Thu, 22 Dec 2016 12:05:29 UTC (4 KB)
[v2] Tue, 27 Dec 2016 03:22:28 UTC (8 KB)
[v3] Fri, 24 Feb 2017 15:35:03 UTC (10 KB)
[v4] Thu, 2 Mar 2017 11:44:54 UTC (27 KB)
[v5] Thu, 29 Jun 2017 11:19:13 UTC (13 KB)
[v6] Sun, 2 Jul 2017 15:47:10 UTC (14 KB)
[v7] Wed, 5 Jul 2017 14:01:42 UTC (15 KB)
[v8] Tue, 11 Jul 2017 12:14:48 UTC (15 KB)
[v9] Sun, 7 Oct 2018 10:13:02 UTC (81 KB)
[v10] Sun, 14 Oct 2018 06:01:20 UTC (82 KB)
[v11] Tue, 16 Oct 2018 16:48:38 UTC (82 KB)
[v12] Tue, 15 Jan 2019 19:43:06 UTC (82 KB)
[v13] Fri, 15 Feb 2019 12:59:43 UTC (82 KB)
[v14] Thu, 28 Mar 2019 15:10:52 UTC (83 KB)
[v15] Tue, 22 Oct 2019 14:48:35 UTC (88 KB)

Computer Science > Machine Learning

Title:On the function approximation error for risk-sensitive reinforcement learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On the function approximation error for risk-sensitive reinforcement learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators