Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

Ni, Xinyi; Lai, Lifeng

Computer Science > Machine Learning

arXiv:2405.01718 (cs)

[Submitted on 2 May 2024]

Title:Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

Authors:Xinyi Ni, Lifeng Lai

View PDF HTML (experimental)

Abstract:Robust Markov Decision Processes (RMDPs) have received significant research interest, offering an alternative to standard Markov Decision Processes (MDPs) that often assume fixed transition probabilities. RMDPs address this by optimizing for the worst-case scenarios within ambiguity sets. While earlier studies on RMDPs have largely centered on risk-neutral reinforcement learning (RL), with the goal of minimizing expected total discounted costs, in this paper, we analyze the robustness of CVaR-based risk-sensitive RL under RMDP. Firstly, we consider predetermined ambiguity sets. Based on the coherency of CVaR, we establish a connection between robustness and risk sensitivity, thus, techniques in risk-sensitive RL can be adopted to solve the proposed problem. Furthermore, motivated by the existence of decision-dependent uncertainty in real-world problems, we study problems with state-action-dependent ambiguity sets. To solve this, we define a new risk measure named NCVaR and build the equivalence of NCVaR optimization and robust CVaR optimization. We further propose value iteration algorithms and validate our approach in simulation experiments.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2405.01718 [cs.LG]
	(or arXiv:2405.01718v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.01718

Submission history

From: Xinyi Ni [view email]
[v1] Thu, 2 May 2024 20:28:49 UTC (188 KB)

Computer Science > Machine Learning

Title:Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators