Skip to main content

Showing 1–13 of 13 results for author: Khirirat, S

.
  1. arXiv:2503.17454  [pdf, ps, other

    cs.LG

    Collaborative Value Function Estimation Under Model Mismatch: A Federated Temporal Difference Analysis

    Authors: Ali Beikmohammadi, Sarit Khirirat, Peter Richtárik, Sindri Magnússon

    Abstract: Federated reinforcement learning (FedRL) enables collaborative learning while preserving data privacy by preventing direct data exchange between agents. However, many existing FedRL algorithms assume that all agents operate in identical environments, which is often unrealistic. In real-world applications, such as multi-robot teams, crowdsourced systems, and large-scale sensor networks, each agent… ▽ More

    Submitted 14 June, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2502.13482  [pdf, other

    cs.LG cs.CR cs.DC math.OC stat.ML

    Smoothed Normalization for Efficient Distributed Private Optimization

    Authors: Egor Shulgin, Sarit Khirirat, Peter Richtárik

    Abstract: Federated learning enables training machine learning models while preserving the privacy of participants. Surprisingly, there is no differentially private distributed method for smooth, non-convex optimization problems. The reason is that standard privacy techniques require bounding the participants' contributions, usually enforced via $\textit{clipping}$ of the updates. Existing literature typica… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: 36 pages

  3. arXiv:2410.16871  [pdf, other

    cs.LG

    Error Feedback under $(L_0,L_1)$-Smoothness: Normalization and Momentum

    Authors: Sarit Khirirat, Abdurakhmon Sadiev, Artem Riabinin, Eduard Gorbunov, Peter Richtárik

    Abstract: We provide the first proof of convergence for normalized error feedback algorithms across a wide range of machine learning problems. Despite their popularity and efficiency in training deep neural networks, traditional analyses of error feedback algorithms rely on the smoothness assumption that does not capture the properties of objective functions in these problems. Rather, these problems have re… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

  4. Compressed Federated Reinforcement Learning with a Generative Model

    Authors: Ali Beikmohammadi, Sarit Khirirat, Sindri Magnússon

    Abstract: Reinforcement learning has recently gained unprecedented popularity, yet it still grapples with sample inefficiency. Addressing this challenge, federated reinforcement learning (FedRL) has emerged, wherein agents collaboratively learn a single policy by aggregating local estimations. However, this aggregation step incurs significant communication costs. In this paper, we propose CompFedRL, a commu… ▽ More

    Submitted 14 October, 2024; v1 submitted 26 March, 2024; originally announced April 2024.

    Comments: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2024)

  5. On the Convergence of Federated Learning Algorithms without Data Similarity

    Authors: Ali Beikmohammadi, Sarit Khirirat, Sindri Magnússon

    Abstract: Data similarity assumptions have traditionally been relied upon to understand the convergence behaviors of federated learning methods. Unfortunately, this approach often demands fine-tuning step sizes based on the level of data similarity. When data similarity is low, these small step sizes result in an unacceptably slow convergence speed for federated methods. In this paper, we present a novel an… ▽ More

    Submitted 19 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted by the IEEE Transactions on Big Data Journal

    Journal ref: IEEE Transactions on Big Data (2024)

  6. Parallel Momentum Methods Under Biased Gradient Estimations

    Authors: Ali Beikmohammadi, Sarit Khirirat, Sindri Magnússon

    Abstract: Parallel stochastic gradient methods are gaining prominence in solving large-scale machine learning problems that involve data distributed across multiple nodes. However, obtaining unbiased stochastic gradients, which have been the focus of most theoretical research, is challenging in many distributed machine learning applications. The gradient estimations easily become biased, for example, when g… ▽ More

    Submitted 12 January, 2025; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: 12 pages

    Journal ref: IEEE Transactions on Control of Network Systems (2025)

  7. arXiv:2305.18929  [pdf, other

    cs.LG math.OC stat.ML

    Clip21: Error Feedback for Gradient Clipping

    Authors: Sarit Khirirat, Eduard Gorbunov, Samuel Horváth, Rustem Islamov, Fakhri Karray, Peter Richtárik

    Abstract: Motivated by the increasing popularity and importance of large-scale training under differential privacy (DP) constraints, we study distributed gradient methods with gradient clipping, i.e., clipping applied to the gradients computed from local information at the nodes. While gradient clipping is an essential tool for injecting formal DP guarantees into gradient-based methods [1], it also induces… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  8. arXiv:2304.05127  [pdf, other

    cs.CR cs.CV cs.LG eess.IV

    Balancing Privacy and Performance for Private Federated Learning Algorithms

    Authors: Xiangjian Hou, Sarit Khirirat, Mohammad Yaqub, Samuel Horvath

    Abstract: Federated learning (FL) is a distributed machine learning (ML) framework where multiple clients collaborate to train a model without exposing their private data. FL involves cycles of local computations and bi-directional communications between the clients and server. To bolster data security during this process, FL algorithms frequently employ a differential privacy (DP) mechanism that introduces… ▽ More

    Submitted 18 August, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

  9. arXiv:2202.04612  [pdf, other

    math.OC

    Zeroth-Order Randomized Subspace Newton Methods

    Authors: Erik Berglund, Sarit Khirirat, Xiaoyu Wang

    Abstract: Zeroth-order methods have become important tools for solving problems where we have access only to function evaluations. However, the zeroth-order methods only using gradient approximations are $n$ times slower than classical first-order methods for solving n-dimensional problems. To accelerate the convergence rate, this paper proposes the zeroth order randomized subspace Newton (ZO-RSN) method, w… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: Submitted to the 2022 IEEE International Conference on Acoustics, Speech and Signal Processing

  10. arXiv:2003.06377  [pdf, other

    math.OC stat.ML

    A flexible framework for communication-efficient machine learning: from HPC to IoT

    Authors: Sarit Khirirat, Sindri Magnússon, Arda Aytekin, Mikael Johansson

    Abstract: With the increasing scale of machine learning tasks, it has become essential to reduce the communication between computing nodes. Early work on gradient compression focused on the bottleneck between CPUs and GPUs, but communication-efficiency is now needed in a variety of different system architectures, from high-performance clusters to energy-constrained IoT devices. In the current practice, comp… ▽ More

    Submitted 17 June, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

    Comments: 27 pages, 11 figures, 1 table

  11. arXiv:1909.10327  [pdf, other

    eess.SP math.OC

    Compressed Gradient Methods with Hessian-Aided Error Compensation

    Authors: Sarit Khirirat, Sindri Magnússon, Mikael Johansson

    Abstract: The emergence of big data has caused a dramatic shift in the operating regime for optimization algorithms. The performance bottleneck, which used to be computations, is now often communications. Several gradient compression techniques have been proposed to reduce the communication load at the price of a loss in solution accuracy. Recently, it has been shown how compression errors can be compensate… ▽ More

    Submitted 18 June, 2020; v1 submitted 23 September, 2019; originally announced September 2019.

    Comments: 15 pages, 7 figures

  12. arXiv:1809.10505  [pdf, other

    cs.LG cs.DC stat.ML

    The Convergence of Sparsified Gradient Methods

    Authors: Dan Alistarh, Torsten Hoefler, Mikael Johansson, Sarit Khirirat, Nikola Konstantinov, Cédric Renggli

    Abstract: Distributed training of massive machine learning models, in particular deep neural networks, via Stochastic Gradient Descent (SGD) is becoming commonplace. Several families of communication-reduction methods, such as quantization, large-batch methods, and gradient sparsification, have been proposed. To date, gradient sparsification methods - where each node sorts gradients by magnitude, and only c… ▽ More

    Submitted 27 September, 2018; originally announced September 2018.

    Comments: NIPS 2018 - Advances in Neural Information Processing Systems; Authors in alphabetic order

  13. arXiv:1806.06573  [pdf, other

    math.OC stat.ML

    Distributed learning with compressed gradients

    Authors: Sarit Khirirat, Hamid Reza Feyzmahdavian, Mikael Johansson

    Abstract: Asynchronous computation and gradient compression have emerged as two key techniques for achieving scalability in distributed optimization for large-scale machine learning. This paper presents a unified analysis framework for distributed gradient methods operating with staled and compressed gradients. Non-asymptotic bounds on convergence rates and information exchange are derived for several optim… ▽ More

    Submitted 29 November, 2018; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: 33 pages, 4 figures, 2 tables