Skip to main content

Showing 1–8 of 8 results for author: Haddadpour, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2204.12446  [pdf, ps, other

    stat.ML cs.LG

    Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GD

    Authors: Konstantinos E. Nikolakakis, Farzin Haddadpour, Amin Karbasi, Dionysios S. Kalogerias

    Abstract: We provide sharp path-dependent generalization and excess risk guarantees for the full-batch Gradient Descent (GD) algorithm on smooth losses (possibly non-Lipschitz, possibly nonconvex). At the heart of our analysis is an upper bound on the generalization error, which implies that average output stability and a bounded expected optimization error at termination lead to generalization. This result… ▽ More

    Submitted 9 February, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: 35 pages

  2. arXiv:2203.09607  [pdf, other

    cs.LG stat.ML

    Learning Distributionally Robust Models at Scale via Composite Optimization

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Amin Karbasi

    Abstract: To train machine learning models that are robust to distribution shifts in the data, distributionally robust optimization (DRO) has been proven very effective. However, the existing approaches to learning a distributionally robust model either require solving complex optimization problems such as semidefinite programming or a first-order method whose convergence scales linearly with the number of… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted to ICLR2022 as a conference paper. International Conference on Learning Representations (2022)

  3. arXiv:2202.06880  [pdf, ps, other

    cs.LG math.OC stat.ML

    Black-Box Generalization: Stability of Zeroth-Order Learning

    Authors: Konstantinos E. Nikolakakis, Farzin Haddadpour, Dionysios S. Kalogerias, Amin Karbasi

    Abstract: We provide the first generalization error analysis for black-box learning through derivative-free optimization. Under the assumption of a Lipschitz and smooth unknown loss, we consider the Zeroth-order Stochastic Search (ZoSS) algorithm, that updates a $d$-dimensional model by replacing stochastic gradient directions with stochastic differences of $K+1$ perturbed loss evaluations per dataset (exam… ▽ More

    Submitted 9 February, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 32 pages

  4. arXiv:2008.04975  [pdf, ps, other

    stat.ML cs.DS cs.LG

    FedSKETCH: Communication-Efficient and Private Federated Learning via Sketching

    Authors: Farzin Haddadpour, Belhal Karimi, Ping Li, Xiaoyun Li

    Abstract: Communication complexity and privacy are the two key challenges in Federated Learning where the goal is to perform a distributed learning through a large volume of devices. In this work, we introduce FedSKETCH and FedSKETCHGATE algorithms to address both challenges in Federated learning jointly, where these algorithms are intended to be used for homogeneous and heterogeneous data distribution sett… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

  5. arXiv:2007.01154  [pdf, other

    cs.LG cs.DC stat.ML

    Federated Learning with Compression: Unified Analysis and Sharp Guarantees

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Aryan Mokhtari, Mehrdad Mahdavi

    Abstract: In federated learning, communication cost is often a critical bottleneck to scale up distributed optimization algorithms to collaboratively learn a model from millions of devices with potentially unreliable or limited communication and heterogeneous data distributions. Two notable trends to deal with the communication overhead of federated algorithms are gradient compression and local computation… ▽ More

    Submitted 20 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: version 2. more experiments and comparisons

  6. arXiv:1911.04931  [pdf, other

    cs.LG cs.DS math.OC stat.ML

    Efficient Fair Principal Component Analysis

    Authors: Mohammad Mahdi Kamani, Farzin Haddadpour, Rana Forsati, Mehrdad Mahdavi

    Abstract: It has been shown that dimension reduction methods such as PCA may be inherently prone to unfairness and treat data from different sensitive groups such as race, color, sex, etc., unfairly. In pursuit of fairness-enhancing dimensionality reduction, using the notion of Pareto optimality, we propose an adaptive first-order algorithm to learn a subspace that preserves fairness, while slightly comprom… ▽ More

    Submitted 7 March, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

  7. arXiv:1910.14425  [pdf, other

    cs.LG cs.DC stat.ML

    On the Convergence of Local Descent Methods in Federated Learning

    Authors: Farzin Haddadpour, Mehrdad Mahdavi

    Abstract: In federated distributed learning, the goal is to optimize a global training objective defined over distributed devices, where the data shard at each device is sampled from a possibly different distribution (a.k.a., heterogeneous or non i.i.d. data samples). In this paper, we generalize the local stochastic and full gradient descent with periodic averaging-- originally designed for homogeneous dis… ▽ More

    Submitted 6 December, 2019; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: 47 pages, "Updates from v1: A technical error in Lemma B3 is corrected"

  8. arXiv:1910.13598  [pdf, other

    cs.LG cs.DC stat.ML

    Local SGD with Periodic Averaging: Tighter Analysis and Adaptive Synchronization

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Viveck R. Cadambe

    Abstract: Communication overhead is one of the key challenges that hinders the scalability of distributed optimization algorithms. In this paper, we study local distributed SGD, where data is partitioned among computation nodes, and the computation nodes perform local updates with periodically exchanging the model among the workers to perform averaging. While local SGD is empirically shown to provide promis… ▽ More

    Submitted 14 May, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Paper accepted to NeurIPS 2019 - We fixed a flaw in the earlier version regarding the dependency on constants but this change does not affect the communication complexity