Showing 1–8 of 8 results for author: Data, D

  1. arXiv:2207.01771  [pdf, other]

    cs.LG cs.CR stat.ML

    A Generative Framework for Personalized Learning and Estimation: Theory, Algorithms, and Privacy

    Authors: Kaan Ozkara, Antonious M. Girgis, Deepesh Data, Suhas Diggavi

    Abstract: A distinguishing characteristic of federated learning is that the (local) client data could have statistical heterogeneity. This heterogeneity has motivated the design of personalized learning, where individual (personalized) models are trained, through collaboration. There have been various personalization methods proposed in literature, with seemingly very different forms and methods ranging fro…

    Submitted 4 July, 2022; originally announced July 2022.

  2. arXiv:2008.07180  [pdf, ps, other]

    cs.LG cs.IT stat.ML

    Shuffled Model of Federated Learning: Privacy, Communication and Accuracy Trade-offs

    Authors: Antonious M. Girgis, Deepesh Data, Suhas Diggavi, Peter Kairouz, Ananda Theertha Suresh

    Abstract: We consider a distributed empirical risk minimization (ERM) optimization problem with communication efficiency and privacy requirements, motivated by the federated learning (FL) framework. Unique challenges to the traditional ERM problem in the context of FL include (i) need to provide privacy guarantees on clients' data, (ii) compress the communication between clients and the server, since client…

    Submitted 23 September, 2020; v1 submitted 17 August, 2020; originally announced August 2020.

  3. arXiv:2006.13041  [pdf, other]

    stat.ML cs.CR cs.DC cs.LG

    Byzantine-Resilient High-Dimensional Federated Learning

    Authors: Deepesh Data, Suhas Diggavi

    Abstract: We study stochastic gradient descent (SGD) with local iterations in the presence of malicious/Byzantine clients, motivated by federated learning. The clients, instead of communicating with the central server in every iteration, maintain their local models, which they update by taking several SGD iterations based on their own datasets and then communicate the net update with the server, thereby…

    Submitted 16 August, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

    Comments: 33 pages; title change; improved bound on the approximation error by the factor of H

  4. arXiv:2005.07866  [pdf, other]

    stat.ML cs.CR cs.DC cs.LG

    Byzantine-Resilient SGD in High Dimensions on Heterogeneous Data

    Authors: Deepesh Data, Suhas Diggavi

    Abstract: We study distributed stochastic gradient descent (SGD) in the master-worker architecture under Byzantine attacks. We consider the heterogeneous data model, where different workers may have different local datasets, and we do not make any probabilistic assumptions on data generation. At the core of our algorithm, we use the polynomial-time outlier-filtering procedure for robust mean estimation prop…

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: 57 pages, 2 figures

  5. arXiv:2005.07041  [pdf, other]

    cs.LG cs.DC stat.ML

    SQuARM-SGD: Communication-Efficient Momentum SGD for Decentralized Optimization

    Authors: Navjot Singh, Deepesh Data, Jemin George, Suhas Diggavi

    Abstract: In this paper, we propose and analyze SQuARM-SGD, a communication-efficient algorithm for decentralized training of large-scale machine learning models over a network. In SQuARM-SGD, each node performs a fixed number of local SGD steps using Nesterov's momentum and then sends sparsified and quantized updates to its neighbors regulated by a locally computable triggering criterion. We provide conver…

    Submitted 11 October, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: 58 pages, 8 figures

  6. arXiv:1910.14280  [pdf, other]

    stat.ML cs.DC cs.LG math.OC

    SPARQ-SGD: Event-Triggered and Compressed Communication in Decentralized Stochastic Optimization

    Authors: Navjot Singh, Deepesh Data, Jemin George, Suhas Diggavi

    Abstract: In this paper, we propose and analyze SPARQ-SGD, which is an event-triggered and compressed algorithm for decentralized training of large-scale machine learning models. Each node can locally compute a condition (event) which triggers a communication where quantized and sparsified local model parameters are sent. In SPARQ-SGD each node takes at least a fixed number ($H$) of local gradient steps and…

    Submitted 24 February, 2020; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: 41 pages, 4 figures

  7. arXiv:1906.02367  [pdf, other]

    stat.ML cs.DC cs.LG math.OC

    Qsparse-local-SGD: Distributed SGD with Quantization, Sparsification, and Local Computations

    Authors: Debraj Basu, Deepesh Data, Can Karakus, Suhas Diggavi

    Abstract: Communication bottleneck has been identified as a significant issue in distributed optimization of large-scale learning models. Recently, several approaches to mitigate this problem have been proposed, including different forms of gradient compression or computing local models and mixing them iteratively. In this paper, we propose \emph{Qsparse-local-SGD} algorithm, which combines aggressive spars…

    Submitted 2 November, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: 50 pages; 8 figures; full version of a paper in NeurIPS 2019 with the same title

  8. Estimating the sample mean and standard deviation from commonly reported quantiles in meta-analysis

    Authors: Sean McGrath, XiaoFei Zhao, Russell Steele, Brett D. Thombs, Andrea Benedetti, the DEPRESsion Screening Data Collaboration

    Abstract: Researchers increasingly use meta-analysis to synthesize the results of several studies in order to estimate a common effect. When the outcome variable is continuous, standard meta-analytic approaches assume that the primary studies report the sample mean and standard deviation of the outcome. However, when the outcome is skewed, authors sometimes summarize the data by reporting the sample median…

    Submitted 25 March, 2019; originally announced March 2019.

    Journal ref: Stat. Methods Med. Res. 29 (2020) 2520-2537