Skip to main content

Showing 1–4 of 4 results for author: Labbi, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.23459  [pdf, ps, other

    cs.LG

    On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment

    Authors: Safwan Labbi, Paul Mangold, Daniil Tiapkin, Eric Moulines

    Abstract: Ensuring convergence of policy gradient methods in federated reinforcement learning (FRL) under environment heterogeneity remains a major challenge. In this work, we first establish that heterogeneity, perhaps counter-intuitively, can necessitate optimal policies to be non-deterministic or even time-varying, even in tabular environments. Subsequently, we prove global convergence results for federa… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Preprint

  2. arXiv:2410.22908  [pdf, other

    cs.LG stat.ML

    Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents

    Authors: Safwan Labbi, Daniil Tiapkin, Lorenzo Mancini, Paul Mangold, Eric Moulines

    Abstract: In this paper, we present the Federated Upper Confidence Bound Value Iteration algorithm ($\texttt{Fed-UCBVI}$), a novel extension of the $\texttt{UCBVI}$ algorithm (Azar et al., 2017) tailored for the federated learning framework. We prove that the regret of $\texttt{Fed-UCBVI}$ scales as $\tilde{\mathcal{O}}(\sqrt{H^3 |\mathcal{S}| |\mathcal{A}| T / M})$, with a small additional term due to hete… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

  3. arXiv:2410.20687  [pdf, other

    eess.SP cs.LG

    Joint Channel Selection using FedDRL in V2X

    Authors: Lorenzo Mancini, Safwan Labbi, Karim Abed Meraim, Fouzi Boukhalfa, Alain Durmus, Paul Mangold, Eric Moulines

    Abstract: Vehicle-to-everything (V2X) communication technology is revolutionizing transportation by enabling interactions between vehicles, devices, and infrastructures. This connectivity enhances road safety, transportation efficiency, and driver assistance systems. V2X benefits from Machine Learning, enabling real-time data analysis, better decision-making, and improved traffic predictions, making transpo… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  4. arXiv:2402.04114  [pdf, other

    stat.ML cs.LG math.OC

    SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning

    Authors: Paul Mangold, Sergey Samsonov, Safwan Labbi, Ilya Levin, Reda Alami, Alexey Naumov, Eric Moulines

    Abstract: In this paper, we analyze the sample and communication complexity of the federated linear stochastic approximation (FedLSA) algorithm. We explicitly quantify the effects of local training with agent heterogeneity. We show that the communication complexity of FedLSA scales polynomially with the inverse of the desired accuracy $ε$. To overcome this, we propose SCAFFLSA a new variant of FedLSA that u… ▽ More

    Submitted 27 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: now with linear speed-up!