Showing 1–4 of 4 results for author: Daoudi, P

  1. Enhancing Reinforcement Learning Agents with Local Guides

    Authors: Paul Daoudi, Bogdan Robu, Christophe Prieur, Ludovic Dos Santos, Merwan Barlier

    Abstract: This paper addresses the problem of integrating local guide policies into a Reinforcement Learning agent. For this, we show how to adapt existing algorithms to this setting before introducing a novel algorithm based on a noisy policy-switching procedure. This approach builds on a proper Approximate Policy Evaluation (APE) scheme to provide a perturbation that carefully leads the local guides towar…

    Submitted 21 February, 2024; originally announced February 2024.

    Journal ref: AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems

  2. arXiv:2402.13654  [pdf, other]

    eess.SY cs.LG

    Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark

    Authors: Paul Daoudi, Bojan Mavkov, Bogdan Robu, Christophe Prieur, Emmanuel Witrant, Merwan Barlier, Ludovic Dos Santos

    Abstract: This paper presents a learning-based control strategy for non-linear throttle valves with an asymmetric hysteresis, leading to a near-optimal controller without requiring any prior knowledge about the environment. We start with a carefully tuned Proportional Integrator (PI) controller and exploit the recent advances in Reinforcement Learning (RL) with Guides to improve the closed-loop behavior by…

    Submitted 15 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Journal ref: 2024 IEEE Conference on Control Technology and Applications (CCTA)

  3. arXiv:2312.15474  [pdf, other]

    cs.LG stat.ML

    A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning

    Authors: Paul Daoudi, Christophe Prieur, Bogdan Robu, Merwan Barlier, Ludovic Dos Santos

    Abstract: Off-dynamics Reinforcement Learning (ODRL) seeks to transfer a policy from a source environment to a target environment characterized by distinct yet similar dynamics. In this context, traditional RL agents depend excessively on the dynamics of the source environment, resulting in the discovery of policies that excel in this environment but fail to provide reasonable performance in the target one.…

    Submitted 15 July, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Journal ref: Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

  4. arXiv:2312.15458  [pdf]

    stat.ML cs.LG

    Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation

    Authors: Paul Daoudi, Mathias Formoso, Othman Gaizi, Achraf Azize, Evrard Garcelon

    Abstract: A precondition for the deployment of a Reinforcement Learning agent to a real-world system is to provide guarantees on the learning process. While a learning algorithm will eventually converge to a good policy, there are no guarantees on the performance of the exploratory policies. We study the problem of conservative exploration, where the learner must at least be able to guarantee its performanc…

    Submitted 24 December, 2023; originally announced December 2023.