Skip to main content

Showing 1–3 of 3 results for author: Charvet, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.10006  [pdf, other

    cs.LG

    Improving Controller Generalization with Dimensionless Markov Decision Processes

    Authors: Valentin Charvet, Sebastian Stein, Roderick Murray-Smith

    Abstract: Controllers trained with Reinforcement Learning tend to be very specialized and thus generalize poorly when their testing environment differs from their training one. We propose a Model-Based approach to increase generalization where both world model and policy are trained in a dimensionless state-action space. To do so, we introduce the Dimensionless Markov Decision Process ($Π$-MDP): an extensio… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 11 pages, 5 figures

  2. arXiv:2110.13576  [pdf, other

    cs.LG

    Learning Robust Controllers Via Probabilistic Model-Based Policy Search

    Authors: Valentin Charvet, Bjørn Sand Jensen, Roderick Murray-Smith

    Abstract: Model-based Reinforcement Learning estimates the true environment through a world model in order to approximate the optimal policy. This family of algorithms usually benefits from better sample efficiency than their model-free counterparts. We investigate whether controllers learned in such a way are robust and able to generalize under small perturbations of the environment. Our work is inspired b… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at RobustML Workshop - ICLR 2021

  3. arXiv:2010.09370  [pdf, other

    cs.LG stat.ML

    Probabilistic selection of inducing points in sparse Gaussian processes

    Authors: Anders Kirk Uhrenholt, Valentin Charvet, Bjørn Sand Jensen

    Abstract: Sparse Gaussian processes and various extensions thereof are enabled through inducing points, that simultaneously bottleneck the predictive capacity and act as the main contributor towards model complexity. However, the number of inducing points is generally not associated with uncertainty which prevents us from applying the apparatus of Bayesian reasoning for identifying an appropriate trade-off.… ▽ More

    Submitted 25 July, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021)