Showing 1–13 of 13 results for author: Suresh, A T

Searching in archive math.
  1. arXiv:2302.06869  [pdf, other]

    stat.ML cs.DM cs.IT cs.LG math.PR

    Concentration Bounds for Discrete Distribution Estimation in KL Divergence

    Authors: Clément L. Canonne, Ziteng Sun, Ananda Theertha Suresh

    Abstract: We study the problem of discrete distribution estimation in KL divergence and provide concentration bounds for the Laplace estimator. We show that the deviation from mean scales as $\sqrt{k}/n$ when $n \ge k$, improving upon the best prior result of $k/n$. We also establish a matching lower bound that shows that our bounds are tight up to polylogarithmic factors.

    Submitted 12 June, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: Updated discussion of previous work

  2. arXiv:2111.05320  [pdf, ps, other]

    cs.DS cs.IT math.ST stat.ML

    Robust Estimation for Random Graphs

    Authors: Jayadev Acharya, Ayush Jain, Gautam Kamath, Ananda Theertha Suresh, Huanyu Zhang

    Abstract: We study the problem of robustly estimating the parameter $p$ of an Erdős-Rényi random graph on $n$ nodes, where a $γ$ fraction of nodes may be adversarially corrupted. After showing the deficiencies of canonical estimators, we design a computationally-efficient spectral algorithm which estimates $p$ up to accuracy $\tilde O(\sqrt{p(1-p)}/n + γ\sqrt{p(1-p)} /\sqrt{n}+ γ/n)$ for $γ< 1/60$. Furtherm…

    Submitted 15 February, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

  3. arXiv:2102.11845  [pdf, other]

    cs.LG cs.CR math.OC stat.ML

    Learning with User-Level Privacy

    Authors: Daniel Levy, Ziteng Sun, Kareem Amin, Satyen Kale, Alex Kulesza, Mehryar Mohri, Ananda Theertha Suresh

    Abstract: We propose and analyze algorithms to solve a range of learning tasks under user-level differential privacy constraints. Rather than guaranteeing only the privacy of individual samples, user-level DP protects a user's entire contribution ($m \ge 1$ samples), providing more stringent but more realistic protection against information leaks. We show that for high-dimensional mean estimation, empirical…

    Submitted 3 December, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021. 43 pages, 0 figures

  4. arXiv:2011.01848  [pdf, other]

    math.ST cs.IT cs.LG stat.ML

    Robust hypothesis testing and distribution estimation in Hellinger distance

    Authors: Ananda Theertha Suresh

    Abstract: We propose a simple robust hypothesis test that has the same sample complexity as that of the optimal Neyman-Pearson test up to constants, but robust to distribution perturbations under Hellinger distance. We discuss the applicability of such a robust test for estimating distributions in Hellinger distance. We empirically demonstrate the power of the test on canonical distributions.

    Submitted 3 November, 2020; originally announced November 2020.

  5. arXiv:2008.03606  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Mime: Mimicking Centralized Stochastic Algorithms in Federated Learning

    Authors: Sai Praneeth Karimireddy, Martin Jaggi, Satyen Kale, Mehryar Mohri, Sashank J. Reddi, Sebastian U. Stich, Ananda Theertha Suresh

    Abstract: Federated learning (FL) is a challenging setting for optimization due to the heterogeneity of the data across different clients which gives rise to the client drift phenomenon. In fact, obtaining an algorithm for FL which is uniformly better than simple centralized training has been a major open problem thus far. In this work, we propose a general algorithmic framework, Mime, which i) mitigates cl…

    Submitted 8 June, 2021; v1 submitted 8 August, 2020; originally announced August 2020.

    Comments: Version 2 provides stronger theoretical results and more thorough experiments

    MSC Class: 68W40; 68W15; 90C25; 90C06 ACM Class: G.1.6; F.2.1; E.4

  6. arXiv:2001.04130  [pdf, ps, other]

    math.ST

    Convergence of Chao Unseen Species Estimator

    Authors: Nived Rajaraman, Prafulla Chandra, Andrew Thangaraj, Ananda Theertha Suresh

    Abstract: Support size estimation and the related problem of unseen species estimation have wide applications in ecology and database analysis. Perhaps the most used support size estimator is the Chao estimator. Despite its widespread use, little is known about its theoretical properties. We analyze the Chao estimator and show that its worst case mean squared error (MSE) is smaller than the MSE of the plug…

    Submitted 13 January, 2020; originally announced January 2020.

    Comments: 20 pages, 1 figure, short version presented at International Symposium on Information Theory (ISIT) 2019

  7. arXiv:1910.06378  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    SCAFFOLD: Stochastic Controlled Averaging for Federated Learning

    Authors: Sai Praneeth Karimireddy, Satyen Kale, Mehryar Mohri, Sashank J. Reddi, Sebastian U. Stich, Ananda Theertha Suresh

    Abstract: Federated Averaging (FedAvg) has emerged as the algorithm of choice for federated learning due to its simplicity and low communication cost. However, in spite of recent research efforts, its performance is not fully understood. We obtain tight convergence rates for FedAvg and prove that it suffers from 'client-drift' when the data is heterogeneous (non-iid), resulting in unstable and slow converge…

    Submitted 9 April, 2021; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: v2 contains analysis of FedAvg, non-convex rates of Scaffold, and experimental evaluation. v3 fixes typos, ICML version. v4 slightly improves rate of SCAFFOLD for general convex functions

    MSC Class: 68W40; 68W15; 90C25; 90C06 ACM Class: G.1.6; F.2.1; E.4

  8. arXiv:1904.00070  [pdf, other]

    stat.ML cs.LG math.ST

    Data Amplification: A Unified and Competitive Approach to Property Estimation

    Authors: Yi Hao, Alon Orlitsky, Ananda T. Suresh, Yihong Wu

    Abstract: Estimating properties of discrete distributions is a fundamental problem in statistical learning. We design the first unified, linear-time, competitive, property estimator that for a wide class of properties and for all underlying distributions uses just $2n$ samples to achieve the performance attained by the empirical estimator with $n\sqrt{\log n}$ samples. This provides off-the-shelf, distribut…

    Submitted 29 March, 2019; originally announced April 2019.

    Comments: In NeurIPS 2018

  9. arXiv:1702.05574  [pdf, ps, other]

    math.ST cs.IT stat.ML

    Sample complexity of population recovery

    Authors: Yury Polyanskiy, Ananda Theertha Suresh, Yihong Wu

    Abstract: The problem of population recovery refers to estimating a distribution based on incomplete or corrupted samples. Consider a random poll of sample size $n$ conducted on a population of individuals, where each pollee is asked to answer $d$ binary questions. We consider one of the two polling impediments: (a) in lossy population recovery, a pollee may skip each question with probability $ε$, (b) in n…

    Submitted 29 April, 2020; v1 submitted 18 February, 2017; originally announced February 2017.

    Comments: Earlier versions (incl. the one in proceedings) had a mistake in Prop. 9 that propagated to Theorem 1 (lower bound) and Lemma 12. This version (v3) fixes those

  10. arXiv:1511.07428  [pdf, other]

    math.ST stat.ML

    Estimating the number of unseen species: A bird in the hand is worth $\log n $ in the bush

    Authors: Alon Orlitsky, Ananda Theertha Suresh, Yihong Wu

    Abstract: Estimating the number of unseen species is an important problem in many scientific endeavors. Its most popular formulation, introduced by Fisher, uses $n$ samples to predict the number $U$ of hitherto unseen species that would be observed if $t\cdot n$ new samples were collected. Of considerable interest is the largest ratio $t$ between the number of new and existing samples for which $U$ can be a…

    Submitted 2 March, 2016; v1 submitted 23 November, 2015; originally announced November 2015.

  11. arXiv:1504.08070  [pdf, ps, other]

    cs.IT math.ST

    Universal Compression of Power-Law Distributions

    Authors: Moein Falahatgar, Ashkan Jafarpour, Alon Orlitsky, Venkatadheeraj Pichapati, Ananda Theertha Suresh

    Abstract: English words and the outputs of many other natural processes are well-known to follow a Zipf distribution. Yet this thoroughly-established property has never been shown to help compress or predict these important processes. We show that the expected redundancy of Zipf distributions of order $α>1$ is roughly the $1/α$ power of the expected redundancy of unrestricted distributions. Hence for these…

    Submitted 30 April, 2015; v1 submitted 29 April, 2015; originally announced April 2015.

    Comments: 20 pages

  12. arXiv:1504.04103  [pdf, ps, other]

    cs.DS cs.CC cs.LG math.ST

    Faster Algorithms for Testing under Conditional Sampling

    Authors: Moein Falahatgar, Ashkan Jafarpour, Alon Orlitsky, Venkatadheeraj Pichapathi, Ananda Theertha Suresh

    Abstract: There has been considerable recent interest in distribution-tests whose run-time and sample requirements are sublinear in the domain-size $k$. We study two of the most important tests under the conditional-sampling model where each query specifies a subset $S$ of the domain, and the response is a sample drawn from $S$ according to the underlying distribution. For identity testing, which asks whe…

    Submitted 16 April, 2015; originally announced April 2015.

    Comments: 31 pages

  13. arXiv:1503.07940  [pdf, other]

    cs.IT cs.DS cs.LG math.ST

    Competitive Distribution Estimation

    Authors: Alon Orlitsky, Ananda Theertha Suresh

    Abstract: Estimating an unknown distribution from its samples is a fundamental problem in statistics. The common, min-max, formulation of this goal considers the performance of the best estimator over all distributions in a class. It shows that with $n$ samples, distributions over $k$ symbols can be learned to a KL divergence that decreases to zero with the sample size $n$, but grows unboundedly with the al…

    Submitted 26 March, 2015; originally announced March 2015.

    Comments: 15 pages