Skip to main content

Showing 1–2 of 2 results for author: Berthier, E

Searching in archive math. Search in all archives.
.
  1. arXiv:2205.11831  [pdf, other

    math.OC

    A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning

    Authors: Eloïse Berthier, Ziad Kobeissi, Francis Bach

    Abstract: Temporal-difference learning is a popular algorithm for policy evaluation. In this paper, we study the convergence of the regularized non-parametric TD(0) algorithm, in both the independent and Markovian observation settings. In particular, when TD is performed in a universal reproducing kernel Hilbert space (RKHS), we prove convergence of the averaged iterates to the optimal value function, even… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  2. arXiv:2110.07396  [pdf, other

    math.OC

    Infinite-Dimensional Sums-of-Squares for Optimal Control

    Authors: Eloïse Berthier, Justin Carpentier, Alessandro Rudi, Francis Bach

    Abstract: We introduce an approximation method to solve an optimal control problem via the Lagrange dual of its weak formulation. It is based on a sum-of-squares representation of the Hamiltonian, and extends a previous method from polynomial optimization to the generic case of smooth problems. Such a representation is infinite-dimensional and relies on a particular space of functions-a reproducing kernel H… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.