Information Relaxation and Dual Formulation of Controlled Markov Diffusions

Ye, Fan; Zhou, Enlu

Mathematics > Optimization and Control

arXiv:1303.2388 (math)

[Submitted on 10 Mar 2013 (v1), last revised 22 Oct 2014 (this version, v3)]

Title:Information Relaxation and Dual Formulation of Controlled Markov Diffusions

Authors:Fan Ye, Enlu Zhou

View PDF

Abstract:Information relaxation and duality in Markov decision processes have been studied recently by several researchers with the goal to derive dual bounds on the value function. In this paper we extend this dual formulation to controlled Markov diffusions: in a similar way we relax the constraint that the decision should be made based on the current information and impose penalty to punish the access to the information in advance. We establish the weak duality, strong duality and complementary slackness results in a parallel way as those in Markov decision processes. We explore the structure of the optimal penalties and expose the connection between Markov decision processes and controlled Markov diffusions. We demonstrate the use of the dual representation for controlled Markov diffusions in a classic dynamic portfolio choice problem. We evaluate the lower bounds on the expected utility by Monte Carlo simulation under a sub-optimal policy, and we propose a new class of penalties to derive upper bounds with little extra computation. The small gaps between the lower bounds and upper bounds indicate that the available policy is near optimal as well as the effectiveness of our proposed penalty in the dual method.

Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:1303.2388 [math.OC]
	(or arXiv:1303.2388v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1303.2388

Submission history

From: Fan Ye [view email]
[v1] Sun, 10 Mar 2013 22:51:57 UTC (96 KB)
[v2] Mon, 18 Mar 2013 14:25:40 UTC (96 KB)
[v3] Wed, 22 Oct 2014 17:42:17 UTC (99 KB)

Mathematics > Optimization and Control

Title:Information Relaxation and Dual Formulation of Controlled Markov Diffusions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Information Relaxation and Dual Formulation of Controlled Markov Diffusions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators