Skip to main content

Showing 1–3 of 3 results for author: Szehr, O

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.05672  [pdf, other

    stat.ML cs.AI cs.LG cs.NE eess.SY

    On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning, and Online Decision Transformers

    Authors: Miroslav Štrupl, Oleg Szehr, Francesco Faccio, Dylan R. Ashley, Rupesh Kumar Srivastava, Jürgen Schmidhuber

    Abstract: This article provides a rigorous analysis of convergence and stability of Episodic Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning and Online Decision Transformers. These algorithms performed competitively across various benchmarks, from games to robotic tasks, but their theoretical understanding is limited to specific environmental conditions. This work initiates a theore… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 85 pages in main text + 4 pages of references + 26 pages of appendices, 12 figures in main text + 2 figures in appendices; source code available at https://github.com/struplm/eUDRL-GCSL-ODT-Convergence-public

    MSC Class: 68T07 ACM Class: I.2.6; I.5.1

  2. arXiv:2011.06848  [pdf, other

    math.ST stat.ML

    An exact kernel framework for spatio-temporal dynamics

    Authors: Oleg Szehr, Dario Azzimonti, Laura Azzimonti

    Abstract: A kernel-based framework for spatio-temporal data analysis is introduced that applies in situations when the underlying system dynamics are governed by a dynamic equation. The key ingredient is a representer theorem that involves time-dependent kernels. Such kernels occur commonly in the expansion of solutions of partial differential equations. The representer theorem is applied to find among all… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

  3. arXiv:2007.01623  [pdf, other

    cs.LG q-fin.CP stat.ML

    Hedging using reinforcement learning: Contextual $k$-Armed Bandit versus $Q$-learning

    Authors: Loris Cannelli, Giuseppe Nuti, Marzio Sala, Oleg Szehr

    Abstract: The construction of replication strategies for contingent claims in the presence of risk and market friction is a key problem of financial engineering. In real markets, continuous replication, such as in the model of Black, Scholes and Merton (BSM), is not only unrealistic but it is also undesirable due to high transaction costs. A variety of methods have been proposed to balance between effective… ▽ More

    Submitted 6 February, 2022; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 30 pages, 11 figures

    Journal ref: The Journal of Finance and Data Science Volume 9, November 2023, 100101