A note on the function approximation error bound for risk-sensitive reinforcement learning

Karmakar, Prasenjit; Bhatnagar, Shalabh

Computer Science > Machine Learning

arXiv:1612.07562v4 (cs)

[Submitted on 22 Dec 2016 (v1), revised 2 Mar 2017 (this version, v4), latest version 22 Oct 2019 (v15)]

Title:A note on the function approximation error bound for risk-sensitive reinforcement learning

Authors:Prasenjit Karmakar, Shalabh Bhatnagar

View PDF

Abstract:In this paper we obtain new error bounds of function approximation for the policy evaluation algorithm when the aim is to find the risk-sensitive cost represented using exponential utility. Additionally, we obtain conditions involving the feature matrix so that the error is zero.

Comments:	6 pages (2 column) technical short note
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1612.07562 [cs.LG]
	(or arXiv:1612.07562v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1612.07562

Submission history

From: Prasenjit Karmakar [view email]
[v1] Thu, 22 Dec 2016 12:05:29 UTC (4 KB)
[v2] Tue, 27 Dec 2016 03:22:28 UTC (8 KB)
[v3] Fri, 24 Feb 2017 15:35:03 UTC (10 KB)
[v4] Thu, 2 Mar 2017 11:44:54 UTC (27 KB)
[v5] Thu, 29 Jun 2017 11:19:13 UTC (13 KB)
[v6] Sun, 2 Jul 2017 15:47:10 UTC (14 KB)
[v7] Wed, 5 Jul 2017 14:01:42 UTC (15 KB)
[v8] Tue, 11 Jul 2017 12:14:48 UTC (15 KB)
[v9] Sun, 7 Oct 2018 10:13:02 UTC (81 KB)
[v10] Sun, 14 Oct 2018 06:01:20 UTC (82 KB)
[v11] Tue, 16 Oct 2018 16:48:38 UTC (82 KB)
[v12] Tue, 15 Jan 2019 19:43:06 UTC (82 KB)
[v13] Fri, 15 Feb 2019 12:59:43 UTC (82 KB)
[v14] Thu, 28 Mar 2019 15:10:52 UTC (83 KB)
[v15] Tue, 22 Oct 2019 14:48:35 UTC (88 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2016-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Prasenjit Karmakar
Shalabh Bhatnagar

export BibTeX citation

Computer Science > Machine Learning

Title:A note on the function approximation error bound for risk-sensitive reinforcement learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A note on the function approximation error bound for risk-sensitive reinforcement learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators