Skip to main content

Showing 1–8 of 8 results for author: Robu, B

Searching in archive cs. Search in all archives.
.
  1. Enhancing Reinforcement Learning Agents with Local Guides

    Authors: Paul Daoudi, Bogdan Robu, Christophe Prieur, Ludovic Dos Santos, Merwan Barlier

    Abstract: This paper addresses the problem of integrating local guide policies into a Reinforcement Learning agent. For this, we show how to adapt existing algorithms to this setting before introducing a novel algorithm based on a noisy policy-switching procedure. This approach builds on a proper Approximate Policy Evaluation (APE) scheme to provide a perturbation that carefully leads the local guides towar… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Journal ref: AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems

  2. arXiv:2402.13654  [pdf, other

    eess.SY cs.LG

    Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark

    Authors: Paul Daoudi, Bojan Mavkov, Bogdan Robu, Christophe Prieur, Emmanuel Witrant, Merwan Barlier, Ludovic Dos Santos

    Abstract: This paper presents a learning-based control strategy for non-linear throttle valves with an asymmetric hysteresis, leading to a near-optimal controller without requiring any prior knowledge about the environment. We start with a carefully tuned Proportional Integrator (PI) controller and exploit the recent advances in Reinforcement Learning (RL) with Guides to improve the closed-loop behavior by… ▽ More

    Submitted 15 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Journal ref: 2024 IEEE Conference on Control Technology and Applications (CCTA)

  3. arXiv:2312.15474  [pdf, other

    cs.LG stat.ML

    A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning

    Authors: Paul Daoudi, Christophe Prieur, Bogdan Robu, Merwan Barlier, Ludovic Dos Santos

    Abstract: Off-dynamics Reinforcement Learning (ODRL) seeks to transfer a policy from a source environment to a target environment characterized by distinct yet similar dynamics. In this context, traditional RL agents depend excessively on the dynamics of the source environment, resulting in the discovery of policies that excel in this environment but fail to provide reasonable performance in the target one.… ▽ More

    Submitted 15 July, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Journal ref: Proceedings of the the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

  4. arXiv:2210.11624  [pdf, other

    eess.SY cs.LG

    Sparse Dynamical Features generation, application to Parkinson's Disease diagnosis

    Authors: Houssem Meghnoudj, Bogdan Robu, Mazen Alamir

    Abstract: In this study we focus on the diagnosis of Parkinson's Disease (PD) based on electroencephalogram (EEG) signals. We propose a new approach inspired by the functioning of the brain that uses the dynamics, frequency and temporal content of EEGs to extract new demarcating features of the disease. The method was evaluated on a publicly available dataset containing EEG signals recorded during a 3-oddba… ▽ More

    Submitted 29 March, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 18 pages, 13 figures

  5. Enhancing Robustness of On-line Learning Models on Highly Noisy Data

    Authors: Zilong Zhao, Robert Birke, Rui Han, Bogdan Robu, Sara Bouchenak, Sonia Ben Mokhtar, Lydia Y. Chen

    Abstract: Classification algorithms have been widely adopted to detect anomalies for various systems, e.g., IoT, cloud and face recognition, under the common assumption that the data source is clean, i.e., features and labels are correctly set. However, data collected from the wild can be unreliable due to careless annotations or malicious data transformation for incorrect anomaly detection. In this paper,… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: Published in IEEE Transactions on Dependable and Secure Computing. arXiv admin note: substantial text overlap with arXiv:1911.04383

  6. Event-Based Control for Online Training of Neural Networks

    Authors: Zilong Zhao, Sophie Cerf, Bogdan Robu, Nicolas Marchand

    Abstract: Convolutional Neural Network (CNN) has become the most used method for image classification tasks. During its training the learning rate and the gradient are two key factors to tune for influencing the convergence speed of the model. Usual learning rate strategies are time-based i.e. monotonous decay over time. Recent state-of-the-art techniques focus on adaptive gradient algorithms i.e. Adam and… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

  7. arXiv:1911.07710  [pdf, other

    cs.LG stat.ML

    Feedback Control for Online Training of Neural Networks

    Authors: Zilong Zhao, Sophie Cerf, Bogdan Robu, Nicolas Marchand

    Abstract: Convolutional neural networks (CNNs) are commonly used for image classification tasks, raising the challenge of their application on data flows. During their training, adaptation is often performed by tuning the learning rate. Usual learning rate strategies are time-based i.e. monotonously decreasing. In this paper, we advocate switching to a performance-based adaptation, in order to improve the l… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

  8. arXiv:1911.04383  [pdf, other

    cs.LG stat.ML

    RAD: On-line Anomaly Detection for Highly Unreliable Data

    Authors: Zilong Zhao, Robert Birke, Rui Han, Bogdan Robu, Sara Bouchenak, Sonia Ben Mokhtar, Lydia Y. Chen

    Abstract: Classification algorithms have been widely adopted to detect anomalies for various systems, e.g., IoT, cloud and face recognition, under the common assumption that the data source is clean, i.e., features and labels are correctly set. However, data collected from the wild can be unreliable due to careless annotations or malicious data transformation for incorrect anomaly detection. In this paper,… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.