Skip to main content

Showing 1–13 of 13 results for author: Allouah, Y

.
  1. arXiv:2506.06985  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Certified Unlearning for Neural Networks

    Authors: Anastasia Koloskova, Youssef Allouah, Animesh Jha, Rachid Guerraoui, Sanmi Koyejo

    Abstract: We address the problem of machine unlearning, where the goal is to remove the influence of specific training data from a model upon request, motivated by privacy concerns and regulatory requirements such as the "right to be forgotten." Unfortunately, existing methods rely on restrictive assumptions or lack formal guarantees. To this end, we propose a novel method for certified machine unlearning,… ▽ More

    Submitted 10 June, 2025; v1 submitted 7 June, 2025; originally announced June 2025.

  2. arXiv:2505.01874  [pdf, ps, other

    cs.LG cs.CR cs.DC

    Towards Trustworthy Federated Learning with Untrusted Participants

    Authors: Youssef Allouah, Rachid Guerraoui, John Stephan

    Abstract: Resilience against malicious participants and data privacy are essential for trustworthy federated learning, yet achieving both with good utility typically requires the strong assumption of a trusted central server. This paper shows that a significantly weaker assumption suffices: each pair of participants shares a randomness seed unknown to others. In a setting where malicious participants may co… ▽ More

    Submitted 4 June, 2025; v1 submitted 3 May, 2025; originally announced May 2025.

    Comments: ICML 2025 conference paper

  3. arXiv:2412.09119  [pdf, other

    cs.LG cs.CR math.OC

    The Utility and Complexity of in- and out-of-Distribution Machine Unlearning

    Authors: Youssef Allouah, Joshua Kazdan, Rachid Guerraoui, Sanmi Koyejo

    Abstract: Machine unlearning, the process of selectively removing data from trained models, is increasingly crucial for addressing privacy concerns and knowledge gaps post-deployment. Despite this importance, existing approaches are often heuristic and lack formal guarantees. In this paper, we analyze the fundamental utility, time, and space complexity trade-offs of approximate unlearning, providing rigorou… ▽ More

    Submitted 12 February, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

  4. arXiv:2411.07182  [pdf, other

    cs.LG cs.DC

    Revisiting Ensembling in One-Shot Federated Learning

    Authors: Youssef Allouah, Akash Dhasade, Rachid Guerraoui, Nirupam Gupta, Anne-Marie Kermarrec, Rafael Pinot, Rafael Pires, Rishi Sharma

    Abstract: Federated learning (FL) is an appealing approach to training machine learning models without sharing raw data. However, standard FL algorithms are iterative and thus induce a significant communication cost. One-shot federated learning (OFL) trades the iterative exchange of models between clients and the server with a single round of communication, thereby saving substantially on communication cost… ▽ More

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: Accepted at NeurIPS 2024

  5. arXiv:2409.20329  [pdf, other

    cs.LG cs.CR

    Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients

    Authors: Youssef Allouah, Abdellah El Mrini, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot

    Abstract: Federated learning (FL) is an appealing paradigm that allows a group of machines (a.k.a. clients) to learn collectively while keeping their data local. However, due to the heterogeneity between the clients' data distributions, the model obtained through the use of FL algorithms may perform poorly on some client's data. Personalization addresses this issue by enabling each client to have a differen… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

  6. arXiv:2405.14432  [pdf, other

    cs.LG

    Adaptive Gradient Clipping for Robust Federated Learning

    Authors: Youssef Allouah, Rachid Guerraoui, Nirupam Gupta, Ahmed Jellouli, Geovani Rizk, John Stephan

    Abstract: Robust federated learning aims to maintain reliable performance despite the presence of adversarial or misbehaving workers. While state-of-the-art (SOTA) robust distributed gradient descent (Robust-DGD) methods were proven theoretically optimal, their empirical success has often relied on pre-aggregation gradient clipping. However, existing static clipping strategies yield inconsistent results: en… ▽ More

    Submitted 9 May, 2025; v1 submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.01031  [pdf, other

    cs.LG cs.CR cs.DC math.OC stat.ML

    The Privacy Power of Correlated Noise in Decentralized Learning

    Authors: Youssef Allouah, Anastasia Koloskova, Aymane El Firdoussi, Martin Jaggi, Rachid Guerraoui

    Abstract: Decentralized learning is appealing as it enables the scalable usage of large amounts of distributed data and resources (without resorting to any central entity), while promoting privacy since every user minimizes the direct exposure of their data. Yet, without additional precautions, curious users can still leverage models obtained from their peers to violate privacy. In this paper, we propose De… ▽ More

    Submitted 3 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted as conference paper at ICML 2024

  8. arXiv:2402.12780  [pdf, other

    cs.LG

    Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local Updates

    Authors: Youssef Allouah, Sadegh Farhadkhani, Rachid GuerraouI, Nirupam Gupta, Rafael Pinot, Geovani Rizk, Sasha Voitovych

    Abstract: The possibility of adversarial (a.k.a., {\em Byzantine}) clients makes federated learning (FL) prone to arbitrary manipulation. The natural approach to robustify FL against adversarial clients is to replace the simple averaging operation at the server in the standard $\mathsf{FedAvg}$ algorithm by a \emph{robust averaging rule}. While a significant amount of work has been devoted to studying the c… ▽ More

    Submitted 10 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  9. arXiv:2312.14712  [pdf, other

    cs.LG cs.CR cs.DC

    Robustness, Efficiency, or Privacy: Pick Two in Machine Learning

    Authors: Youssef Allouah, Rachid Guerraoui, John Stephan

    Abstract: The success of machine learning (ML) applications relies on vast datasets and distributed architectures which, as they grow, present major challenges. In real-world scenarios, where data often contains sensitive information, issues like data poisoning and hardware failures are common. Ensuring privacy and robustness is vital for the broad adoption of ML in public life. This paper examines the cost… ▽ More

    Submitted 11 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  10. arXiv:2309.13591  [pdf, other

    cs.LG cs.DC math.OC

    Robust Distributed Learning: Tight Error Bounds and Breakdown Point under Data Heterogeneity

    Authors: Youssef Allouah, Rachid Guerraoui, Nirupam Gupta, Rafaël Pinot, Geovani Rizk

    Abstract: The theory underlying robust distributed learning algorithms, designed to resist adversarial machines, matches empirical observations when data is homogeneous. Under data heterogeneity however, which is the norm in practical scenarios, established lower bounds on the learning error are essentially vacuous and greatly mismatch empirical observations. This is because the heterogeneity model consider… ▽ More

    Submitted 28 October, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted to NeurIPS 2023

  11. arXiv:2302.04787  [pdf, other

    cs.LG cs.CR cs.DC

    On the Privacy-Robustness-Utility Trilemma in Distributed Learning

    Authors: Youssef Allouah, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, John Stephan

    Abstract: The ubiquity of distributed machine learning (ML) in sensitive public domain applications calls for algorithms that protect data privacy, while being robust to faults and adversarial behaviors. Although privacy and robustness have been extensively studied independently in distributed ML, their synthesis remains poorly understood. We present the first tight analysis of the error incurred by any alg… ▽ More

    Submitted 29 May, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted paper at ICML

  12. arXiv:2302.01772  [pdf, other

    cs.LG cs.DC

    Fixing by Mixing: A Recipe for Optimal Byzantine ML under Heterogeneity

    Authors: Youssef Allouah, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, John Stephan

    Abstract: Byzantine machine learning (ML) aims to ensure the resilience of distributed learning algorithms to misbehaving (or Byzantine) machines. Although this problem received significant attention, prior works often assume the data held by the machines to be homogeneous, which is seldom true in practical settings. Data heterogeneity makes Byzantine ML considerably more challenging, since a Byzantine mach… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted paper at AISTATS 2023

  13. arXiv:2202.08656  [pdf, other

    cs.GT econ.TH

    Robust Sparse Voting

    Authors: Youssef Allouah, Rachid Guerraoui, Lê-Nguyên Hoang, Oscar Villemaud

    Abstract: Many applications, such as content moderation and recommendation, require reviewing and scoring a large number of alternatives. Doing so robustly is however very challenging. Indeed, voters' inputs are inevitably sparse: most alternatives are only scored by a small fraction of voters. This sparsity amplifies the effects of biased voters introducing unfairness, and of malicious voters seeking to ha… ▽ More

    Submitted 25 January, 2024; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: Accepted at AISTATS 2024