Showing 1–17 of 17 results for author: Konečný, J

Searching in archive stat. Search in all archives.
  1. arXiv:2201.02664  [pdf, other]

    cs.LG cs.DC cs.IT stat.ML

    Optimizing the Communication-Accuracy Trade-off in Federated Learning with Rate-Distortion Theory

    Authors: Nicole Mitchell, Johannes Ballé, Zachary Charles, Jakub Konečný

    Abstract: A significant bottleneck in federated learning (FL) is the network communication cost of sending model updates from client devices to the central server. We present a comprehensive empirical study of the statistics of model updates in FL, as well as the role and benefits of various compression techniques. Motivated by these observations, we propose a novel method to reduce the average communicatio…

    Submitted 19 May, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  2. arXiv:2103.05032  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Convergence and Accuracy Trade-Offs in Federated Learning and Meta-Learning

    Authors: Zachary Charles, Jakub Konečný

    Abstract: We study a family of algorithms, which we refer to as local update methods, generalizing many federated and meta-learning algorithms. We prove that for quadratic models, local update methods are equivalent to first-order optimization on a surrogate loss we exactly characterize. Moreover, fundamental algorithmic choices (such as learning rates) explicitly govern a trade-off between the condition nu…

    Submitted 8 March, 2021; originally announced March 2021.

    Journal ref: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021. PMLR: Volume 130

  3. arXiv:2007.00878  [pdf, other]

    cs.LG math.OC stat.ML

    On the Outsized Importance of Learning Rates in Local Update Methods

    Authors: Zachary Charles, Jakub Konečný

    Abstract: We study a family of algorithms, which we refer to as local update methods, that generalize many federated learning and meta-learning algorithms. We prove that for quadratic objectives, local update methods perform stochastic gradient descent on a surrogate loss function which we exactly characterize. We show that the choice of client learning rate controls the condition number of that surrogate l…

    Submitted 2 July, 2020; originally announced July 2020.

  4. arXiv:2003.00295  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Adaptive Federated Optimization

    Authors: Sashank Reddi, Zachary Charles, Manzil Zaheer, Zachary Garrett, Keith Rush, Jakub Konečný, Sanjiv Kumar, H. Brendan McMahan

    Abstract: Federated learning is a distributed machine learning paradigm in which a large number of clients coordinate with a central server to learn a model without sharing their own training data. Standard federated optimization methods such as Federated Averaging (FedAvg) are often difficult to tune and exhibit unfavorable convergence behavior. In non-federated settings, adaptive optimization methods have…

    Submitted 8 September, 2021; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: Published as a conference paper at ICLR 2021

  5. arXiv:1912.04977  [pdf, other]

    cs.LG cs.CR stat.ML

    Advances and Open Problems in Federated Learning

    Authors: Peter Kairouz, H. Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, Rafael G. L. D'Oliveira, Hubert Eichner, Salim El Rouayheb, David Evans, Josh Gardner, Zachary Garrett, Adrià Gascón, Badih Ghazi, Phillip B. Gibbons, Marco Gruteser, Zaid Harchaoui, Chaoyang He, Lie He, Zhouyuan Huo, Ben Hutchinson, et al. (34 additional authors not shown)

    Abstract: Federated learning (FL) is a machine learning setting where many clients (e.g. mobile devices or whole organizations) collaboratively train a model under the orchestration of a central server (e.g. service provider), while keeping the training data decentralized. FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs re…

    Submitted 8 March, 2021; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Published in Foundations and Trends in Machine Learning Vol 4 Issue 1. See: https://www.nowpublishers.com/article/Details/MAL-083

  6. arXiv:1912.00131  [pdf, other]

    cs.DC cs.CR cs.LG stat.ML

    Federated Learning with Autotuned Communication-Efficient Secure Aggregation

    Authors: Keith Bonawitz, Fariborz Salehi, Jakub Konečný, Brendan McMahan, Marco Gruteser

    Abstract: Federated Learning enables mobile devices to collaboratively learn a shared inference model while keeping all the training data on a user's device, decoupling the ability to do machine learning from the need to store the data in the cloud. Existing work on federated learning with limited communication demonstrates how random rotation can enable users' model updates to be quantized much more effici…

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: 5 pages, 3 figures. To appear at the IEEE Asilomar Conference on Signals, Systems, and Computers 2019

  7. arXiv:1909.12488  [pdf, other]

    cs.LG stat.ML

    Improving Federated Learning Personalization via Model Agnostic Meta Learning

    Authors: Yihan Jiang, Jakub Konečný, Keith Rush, Sreeram Kannan

    Abstract: Federated Learning (FL) refers to learning a high quality global model based on decentralized data storage, without ever copying the raw data. A natural scenario arises with data created on mobile phones by the activity of their users. Given the typical data heterogeneity in such situations, it is natural to ask how can the global model be personalized for every such device, individually. In this…

    Submitted 18 January, 2023; v1 submitted 27 September, 2019; originally announced September 2019.

  8. arXiv:1904.03257  [pdf, ps, other]

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood, et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne…

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  9. arXiv:1902.01046  [pdf, other]

    cs.LG cs.DC stat.ML

    Towards Federated Learning at Scale: System Design

    Authors: Keith Bonawitz, Hubert Eichner, Wolfgang Grieskamp, Dzmitry Huba, Alex Ingerman, Vladimir Ivanov, Chloe Kiddon, Jakub Konečný, Stefano Mazzocchi, H. Brendan McMahan, Timon Van Overveldt, David Petrou, Daniel Ramage, Jason Roselander

    Abstract: Federated Learning is a distributed machine learning approach which enables model training on a large corpus of decentralized data. We have built a scalable production system for Federated Learning in the domain of mobile devices, based on TensorFlow. In this paper, we describe the resulting high-level design, sketch some of the challenges and their solutions, and touch upon the open problems and…

    Submitted 22 March, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

  10. arXiv:1812.07210  [pdf, other]

    cs.LG cs.DC stat.ML

    Expanding the Reach of Federated Learning by Reducing Client Resource Requirements

    Authors: Sebastian Caldas, Jakub Konečný, H. Brendan McMahan, Ameet Talwalkar

    Abstract: Communication on heterogeneous edge networks is a fundamental bottleneck in Federated Learning (FL), restricting both model capacity and user participation. To address this issue, we introduce two novel strategies to reduce communication costs: (1) the use of lossy compression on the global model sent server-to-client; and (2) Federated Dropout, which allows users to efficiently train locally on s…

    Submitted 8 January, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

  11. arXiv:1812.01097  [pdf, other]

    cs.LG stat.ML

    LEAF: A Benchmark for Federated Settings

    Authors: Sebastian Caldas, Sai Meher Karthik Duddu, Peter Wu, Tian Li, Jakub Konečný, H. Brendan McMahan, Virginia Smith, Ameet Talwalkar

    Abstract: Modern federated networks, such as those comprised of wearable devices, mobile phones, or autonomous vehicles, generate massive amounts of data each day. This wealth of data can help to learn models that can improve the user experience on each device. However, the scale and heterogeneity of federated data presents new challenges in research areas such as federated learning, meta-learning, and mult…

    Submitted 9 December, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

  12. arXiv:1611.07555  [pdf, other]

    cs.DC math.NA stat.ML

    Randomized Distributed Mean Estimation: Accuracy vs Communication

    Authors: Jakub Konečný, Peter Richtárik

    Abstract: We consider the problem of estimating the arithmetic average of a finite collection of real vectors stored in a distributed fashion across several compute nodes subject to a communication budget constraint. Our analysis does not rely on any statistical assumptions about the source of the vectors. This problem arises as a subproblem in many applications, including reduce-all operations within algor…

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: 19 pages, 1 figure

  13. arXiv:1608.06879  [pdf, other]

    math.OC cs.LG stat.ML

    AIDE: Fast and Communication Efficient Distributed Optimization

    Authors: Sashank J. Reddi, Jakub Konečný, Peter Richtárik, Barnabás Póczós, Alex Smola

    Abstract: In this paper, we present two new communication-efficient methods for distributed minimization of an average of functions. The first algorithm is an inexact variant of the DANE algorithm that allows any local algorithm to return an approximate solution to a local subproblem. We show that such a strategy does not affect the theoretical guarantees of DANE significantly. In fact, our approach can be…

    Submitted 24 August, 2016; originally announced August 2016.

  14. arXiv:1511.01942  [pdf, other]

    cs.LG math.OC stat.CO stat.ML

    Stop Wasting My Gradients: Practical SVRG

    Authors: Reza Babanezhad, Mohamed Osama Ahmed, Alim Virani, Mark Schmidt, Jakub Konečný, Scott Sallinen

    Abstract: We present and analyze several strategies for improving the performance of stochastic variance-reduced gradient (SVRG) methods. We first show that the convergence rate of these methods can be preserved under a decreasing sequence of errors in the control variate, and use this to derive variants of SVRG that use growing-batch strategies to reduce the number of gradient calculations required in the…

    Submitted 5 November, 2015; originally announced November 2015.

  15. Mini-Batch Semi-Stochastic Gradient Descent in the Proximal Setting

    Authors: Jakub Konečný, Jie Liu, Peter Richtárik, Martin Takáč

    Abstract: We propose mS2GD: a method incorporating a mini-batching scheme for improving the theoretical complexity and practical performance of semi-stochastic gradient descent (S2GD). We consider the problem of minimizing a strongly convex function represented as the sum of an average of a large number of smooth convex functions, and a simple nonsmooth convex regularizer. Our method first performs a determ…

    Submitted 16 November, 2015; v1 submitted 16 April, 2015; originally announced April 2015.

  16. arXiv:1410.4744  [pdf, other]

    cs.LG stat.ML

    mS2GD: Mini-Batch Semi-Stochastic Gradient Descent in the Proximal Setting

    Authors: Jakub Konečný, Jie Liu, Peter Richtárik, Martin Takáč

    Abstract: We propose a mini-batching scheme for improving the theoretical complexity and practical performance of semi-stochastic gradient descent applied to the problem of minimizing a strongly convex composite function represented as the sum of an average of a large number of smooth convex functions, and simple nonsmooth convex function. Our method first performs a deterministic step (computation of the g…

    Submitted 17 October, 2014; originally announced October 2014.

  17. arXiv:1312.1666  [pdf, other]

    stat.ML cs.LG math.NA math.OC

    Semi-Stochastic Gradient Descent Methods

    Authors: Jakub Konečný, Peter Richtárik

    Abstract: In this paper we study the problem of minimizing the average of a large number ($n$) of smooth convex loss functions. We propose a new method, S2GD (Semi-Stochastic Gradient Descent), which runs for one or several epochs in each of which a single full gradient and a random number of stochastic gradients is computed, following a geometric law. The total work needed for the method to output an…

    Submitted 16 June, 2015; v1 submitted 5 December, 2013; originally announced December 2013.

    Comments: 19 pages, 3 figures, 2 algorithms, 3 tables