Showing 1–11 of 11 results for author: Mangold, P

  1. arXiv:2505.23459  [pdf, ps, other]

    cs.LG

    On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment

    Authors: Safwan Labbi, Paul Mangold, Daniil Tiapkin, Eric Moulines

    Abstract: Ensuring convergence of policy gradient methods in federated reinforcement learning (FRL) under environment heterogeneity remains a major challenge. In this work, we first establish that heterogeneity, perhaps counter-intuitively, can necessitate optimal policies to be non-deterministic or even time-varying, even in tabular environments. Subsequently, we prove global convergence results for federa…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Preprint

  2. arXiv:2503.07594  [pdf, other]

    stat.ML cs.LG math.OC

    Scaffold with Stochastic Gradients: New Analysis with Linear Speed-Up

    Authors: Paul Mangold, Alain Durmus, Aymeric Dieuleveut, Eric Moulines

    Abstract: This paper proposes a novel analysis for the Scaffold algorithm, a popular method for dealing with data heterogeneity in federated learning. While its convergence in deterministic settings--where local control variates mitigate client drift--is well established, the impact of stochastic gradient updates on its performance is less understood. To address this problem, we first show that its global p…

    Submitted 10 March, 2025; originally announced March 2025.

  3. arXiv:2412.01389  [pdf, other]

    stat.ML cs.LG math.OC

    Refined Analysis of Federated Averaging's Bias and Federated Richardson-Romberg Extrapolation

    Authors: Paul Mangold, Alain Durmus, Aymeric Dieuleveut, Sergey Samsonov, Eric Moulines

    Abstract: In this paper, we present a novel analysis of FedAvg with constant step size, relying on the Markov property of the underlying process. We demonstrate that the global iterates of the algorithm converge to a stationary distribution and analyze its resulting bias and variance relative to the problem's solution. We provide a first-order expansion of the bias in both homogeneous and heterogeneous sett…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 37 pages

  4. arXiv:2410.22908  [pdf, other]

    cs.LG stat.ML

    Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents

    Authors: Safwan Labbi, Daniil Tiapkin, Lorenzo Mancini, Paul Mangold, Eric Moulines

    Abstract: In this paper, we present the Federated Upper Confidence Bound Value Iteration algorithm ($\texttt{Fed-UCBVI}$), a novel extension of the $\texttt{UCBVI}$ algorithm (Azar et al., 2017) tailored for the federated learning framework. We prove that the regret of $\texttt{Fed-UCBVI}$ scales as $\tilde{\mathcal{O}}(\sqrt{H^3 |\mathcal{S}| |\mathcal{A}| T / M})$, with a small additional term due to hete…

    Submitted 30 October, 2024; originally announced October 2024.

  5. arXiv:2410.20687  [pdf, other]

    eess.SP cs.LG

    Joint Channel Selection using FedDRL in V2X

    Authors: Lorenzo Mancini, Safwan Labbi, Karim Abed Meraim, Fouzi Boukhalfa, Alain Durmus, Paul Mangold, Eric Moulines

    Abstract: Vehicle-to-everything (V2X) communication technology is revolutionizing transportation by enabling interactions between vehicles, devices, and infrastructures. This connectivity enhances road safety, transportation efficiency, and driver assistance systems. V2X benefits from Machine Learning, enabling real-time data analysis, better decision-making, and improved traffic predictions, making transpo…

    Submitted 3 October, 2024; originally announced October 2024.

  6. arXiv:2402.04114  [pdf, other]

    stat.ML cs.LG math.OC

    SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning

    Authors: Paul Mangold, Sergey Samsonov, Safwan Labbi, Ilya Levin, Reda Alami, Alexey Naumov, Eric Moulines

    Abstract: In this paper, we analyze the sample and communication complexity of the federated linear stochastic approximation (FedLSA) algorithm. We explicitly quantify the effects of local training with agent heterogeneity. We show that the communication complexity of FedLSA scales polynomially with the inverse of the desired accuracy $ε$. To overcome this, we propose SCAFFLSA, a new variant of FedLSA that u…

    Submitted 27 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: now with linear speed-up!

  7. arXiv:2308.15250  [pdf, other]

    cs.LG cs.CR math.OC

    The Relative Gaussian Mechanism and its Application to Private Gradient Descent

    Authors: Hadrien Hendrikx, Paul Mangold, Aurélien Bellet

    Abstract: The Gaussian Mechanism (GM), which consists in adding Gaussian noise to a vector-valued query before releasing it, is a standard privacy protection mechanism. In particular, given that the query respects some L2 sensitivity property (the L2 distance between outputs on any two neighboring inputs is bounded), GM guarantees Rényi Differential Privacy (RDP). Unfortunately, precisely bounding the L2 se…

    Submitted 19 March, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

  8. arXiv:2210.16242  [pdf, other]

    cs.LG cs.CR stat.ML

    Differential Privacy has Bounded Impact on Fairness in Classification

    Authors: Paul Mangold, Michaël Perrot, Aurélien Bellet, Marc Tommasi

    Abstract: We theoretically study the impact of differential privacy on fairness in classification. We prove that, given a class of models, popular group fairness measures are pointwise Lipschitz-continuous with respect to the parameters of the model. This result is a consequence of a more general statement on accuracy conditioned on an arbitrary event (such as membership to a sensitive group), which may be…

    Submitted 18 September, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

  9. arXiv:2210.04620  [pdf, other]

    cs.LG cs.CV

    FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Healthcare Settings

    Authors: Jean Ogier du Terrail, Samy-Safwan Ayed, Edwige Cyffers, Felix Grimberg, Chaoyang He, Regis Loeb, Paul Mangold, Tanguy Marchand, Othmane Marfoq, Erum Mushtaq, Boris Muzellec, Constantin Philippenko, Santiago Silva, Maria Teleńczuk, Shadi Albarqouni, Salman Avestimehr, Aurélien Bellet, Aymeric Dieuleveut, Martin Jaggi, Sai Praneeth Karimireddy, Marco Lorenzi, Giovanni Neglia, Marc Tommasi, Mathieu Andreux

    Abstract: Federated Learning (FL) is a novel approach enabling several clients holding sensitive data to collaboratively train machine learning models, without centralizing data. The cross-silo FL setting corresponds to the case of few ($2$--$50$) reliable clients, each holding medium to large datasets, and is typically found in applications such as healthcare, finance, or industry. While previous works hav…

    Submitted 5 May, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS, Datasets and Benchmarks Track, this version fixes typos in the datasets' table and the appendix

  10. arXiv:2207.01560  [pdf, other]

    cs.LG cs.CR stat.ML

    High-Dimensional Private Empirical Risk Minimization by Greedy Coordinate Descent

    Authors: Paul Mangold, Aurélien Bellet, Joseph Salmon, Marc Tommasi

    Abstract: In this paper, we study differentially private empirical risk minimization (DP-ERM). It has been shown that the worst-case utility of DP-ERM reduces polynomially as the dimension increases. This is a major obstacle to privately learning large machine learning models. In high dimension, it is common for some model's parameters to carry more information than others. To exploit this, we propose a dif…

    Submitted 9 April, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

  11. arXiv:2110.11688  [pdf, other]

    cs.LG cs.CR stat.ML

    Differentially Private Coordinate Descent for Composite Empirical Risk Minimization

    Authors: Paul Mangold, Aurélien Bellet, Joseph Salmon, Marc Tommasi

    Abstract: Machine learning models can leak information about the data used to train them. To mitigate this issue, Differentially Private (DP) variants of optimization algorithms like Stochastic Gradient Descent (DP-SGD) have been designed to trade off utility for privacy in Empirical Risk Minimization (ERM) problems. In this paper, we propose Differentially Private proximal Coordinate Descent (DP-CD), a new…

    Submitted 21 October, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 36 pages, 3 figures