Skip to main content

Showing 1–5 of 5 results for author: Obermayer, K

Searching in archive math. Search in all archives.
.
  1. arXiv:1808.04478  [pdf, other

    math.OC

    Risk-Sensitive Partially Observable Markov Decision Processes as Fully Observable Multivariate Utility Optimization problems

    Authors: Arsham Afsardeir, Andreas Kapetanis, Vaios Laschos, Klaus Obermayer

    Abstract: We provide a new algorithm for solving Risk Sensitive Partially Observable Markov Decisions Processes, when the risk is modeled by a utility function, and both the state space and the space of observations is finite. This algorithm is based on an observation that the change of measure and the subsequent introduction of the information space that is used for exponential utility functions, can be ac… ▽ More

    Submitted 17 July, 2022; v1 submitted 13 August, 2018; originally announced August 2018.

    MSC Class: 93E20

  2. A Fenchel-Moreau-Rockafellar type theorem on the Kantorovich-Wasserstein space with Applications in Partially Observable Markov Decision Processes

    Authors: Vaios Laschos, Klaus Obermayer, Yun Shen, Wilhelm Stannat

    Abstract: By using the fact that the space of all probability measures with finite support can be somehow completed in two different fashions, one generating the Arens-Eells space and another generating the Kantorovich-Wasserstein (Wasserstein-1) space, and by exploiting the duality relationship between the Arens-Eells space with the space of Lipschitz functions, we provide a dual representation of Fenchel-… ▽ More

    Submitted 5 May, 2019; v1 submitted 9 March, 2016; originally announced March 2016.

    Comments: 20 pages

    MSC Class: 46E15; 46E27; 90C40; 90C46; 90C25; 46B10

    Journal ref: Journal of Mathematical Analysis and Applications Volume 477, Issue 2, 15 September 2019, Pages 1133-1156

  3. arXiv:1403.3321  [pdf, ps, other

    math.OC

    On Average Risk-sensitive Markov Control Processes

    Authors: Yun Shen, Klaus Obermayer, Wilhelm Stannat

    Abstract: We introduce the Lyapunov approach to optimal control problems of average risk-sensitive Markov control processes with general risk maps. Motivated by applications in particular to behavioral economics, we consider possibly non-convex risk maps, modeling behavior with mixed risk preference. We introduce classical objective functions to the risk-sensitive setting and we are in particular interested… ▽ More

    Submitted 22 July, 2015; v1 submitted 13 March, 2014; originally announced March 2014.

    Comments: 37 pages, submitted to SIAM J. on Control and Optimization

    MSC Class: 60J05; 93E20; 93C55; 47H07; 91B06

  4. arXiv:1110.6317  [pdf, ps, other

    math.OC cs.CE math.DS stat.ML

    Risk-sensitive Markov control processes

    Authors: Yun Shen, Wilhelm Stannat, Klaus Obermayer

    Abstract: We introduce a general framework for measuring risk in the context of Markov control processes with risk maps on general Borel spaces that generalize known concepts of risk measures in mathematical finance, operations research and behavioral economics. Within the framework, applying weighted norm spaces to incorporate also unbounded costs, we study two types of infinite-horizon risk-sensitive crit… ▽ More

    Submitted 23 January, 2014; v1 submitted 28 October, 2011; originally announced October 2011.

    Comments: 21 pages

    MSC Class: 60J05; 93E20; 93C55; 47H07; 91B06

    Journal ref: SIAM J. Control Optim., 51(5), 3652-3672, 2013

  5. arXiv:0908.3458  [pdf, ps, other

    stat.ML math.ST

    The Optimal Unbiased Value Estimator and its Relation to LSTD, TD and MC

    Authors: Steffen Grünewälder, Klaus Obermayer

    Abstract: In this analytical study we derive the optimal unbiased value estimator (MVU) and compare its statistical risk to three well known value estimators: Temporal Difference learning (TD), Monte Carlo estimation (MC) and Least-Squares Temporal Difference Learning (LSTD). We demonstrate that LSTD is equivalent to the MVU if the Markov Reward Process (MRP) is acyclic and show that both differ for most… ▽ More

    Submitted 24 August, 2009; originally announced August 2009.

    Comments: Final version is under review. 38 pages, 8 figures