Skip to main content

Showing 1–5 of 5 results for author: Khirirat, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.13482  [pdf, other

    cs.LG cs.CR cs.DC math.OC stat.ML

    Smoothed Normalization for Efficient Distributed Private Optimization

    Authors: Egor Shulgin, Sarit Khirirat, Peter Richtárik

    Abstract: Federated learning enables training machine learning models while preserving the privacy of participants. Surprisingly, there is no differentially private distributed method for smooth, non-convex optimization problems. The reason is that standard privacy techniques require bounding the participants' contributions, usually enforced via $\textit{clipping}$ of the updates. Existing literature typica… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: 36 pages

  2. arXiv:2305.18929  [pdf, other

    cs.LG math.OC stat.ML

    Clip21: Error Feedback for Gradient Clipping

    Authors: Sarit Khirirat, Eduard Gorbunov, Samuel Horváth, Rustem Islamov, Fakhri Karray, Peter Richtárik

    Abstract: Motivated by the increasing popularity and importance of large-scale training under differential privacy (DP) constraints, we study distributed gradient methods with gradient clipping, i.e., clipping applied to the gradients computed from local information at the nodes. While gradient clipping is an essential tool for injecting formal DP guarantees into gradient-based methods [1], it also induces… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  3. arXiv:2003.06377  [pdf, other

    math.OC stat.ML

    A flexible framework for communication-efficient machine learning: from HPC to IoT

    Authors: Sarit Khirirat, Sindri Magnússon, Arda Aytekin, Mikael Johansson

    Abstract: With the increasing scale of machine learning tasks, it has become essential to reduce the communication between computing nodes. Early work on gradient compression focused on the bottleneck between CPUs and GPUs, but communication-efficiency is now needed in a variety of different system architectures, from high-performance clusters to energy-constrained IoT devices. In the current practice, comp… ▽ More

    Submitted 17 June, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

    Comments: 27 pages, 11 figures, 1 table

  4. arXiv:1809.10505  [pdf, other

    cs.LG cs.DC stat.ML

    The Convergence of Sparsified Gradient Methods

    Authors: Dan Alistarh, Torsten Hoefler, Mikael Johansson, Sarit Khirirat, Nikola Konstantinov, Cédric Renggli

    Abstract: Distributed training of massive machine learning models, in particular deep neural networks, via Stochastic Gradient Descent (SGD) is becoming commonplace. Several families of communication-reduction methods, such as quantization, large-batch methods, and gradient sparsification, have been proposed. To date, gradient sparsification methods - where each node sorts gradients by magnitude, and only c… ▽ More

    Submitted 27 September, 2018; originally announced September 2018.

    Comments: NIPS 2018 - Advances in Neural Information Processing Systems; Authors in alphabetic order

  5. arXiv:1806.06573  [pdf, other

    math.OC stat.ML

    Distributed learning with compressed gradients

    Authors: Sarit Khirirat, Hamid Reza Feyzmahdavian, Mikael Johansson

    Abstract: Asynchronous computation and gradient compression have emerged as two key techniques for achieving scalability in distributed optimization for large-scale machine learning. This paper presents a unified analysis framework for distributed gradient methods operating with staled and compressed gradients. Non-asymptotic bounds on convergence rates and information exchange are derived for several optim… ▽ More

    Submitted 29 November, 2018; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: 33 pages, 4 figures, 2 tables