Skip to main content

Showing 1–12 of 12 results for author: Manurangsi, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.08889  [pdf, ps, other

    cs.LG cs.CR cs.DS stat.ML

    Linear-Time User-Level DP-SCO via Robust Statistics

    Authors: Badih Ghazi, Ravi Kumar, Daogao Liu, Pasin Manurangsi

    Abstract: User-level differentially private stochastic convex optimization (DP-SCO) has garnered significant attention due to the paramount importance of safeguarding user privacy in modern large-scale machine learning applications. Current methods, such as those based on differentially private stochastic gradient descent (DP-SGD), often struggle with high noise accumulation and suboptimal utility due to th… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  2. arXiv:2412.16802  [pdf, other

    cs.LG cs.CR cs.DS stat.ML

    Balls-and-Bins Sampling for DP-SGD

    Authors: Lynn Chua, Badih Ghazi, Charlie Harrison, Ethan Leeman, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

    Abstract: We introduce the Balls-and-Bins sampling for differentially private (DP) optimization methods such as DP-SGD. While it has been common practice to use some form of shuffling in DP-SGD implementations, privacy accounting algorithms have typically assumed that Poisson subsampling is used instead. Recent work by Chua et al. (ICML 2024), however, pointed out that shuffling based DP-SGD can have a much… ▽ More

    Submitted 31 March, 2025; v1 submitted 21 December, 2024; originally announced December 2024.

    Comments: Conference Proceedings version for AISTATS 2025

  3. arXiv:2404.10881  [pdf, ps, other

    cs.LG math.OC stat.ML

    Differentially Private Optimization with Sparse Gradients

    Authors: Badih Ghazi, Cristóbal Guzmán, Pritish Kamath, Ravi Kumar, Pasin Manurangsi

    Abstract: Motivated by applications of large embedding models, we study differentially private (DP) optimization problems under sparsity of individual gradients. We start with new near-optimal bounds for the classic mean estimation problem but with sparse data, improving upon existing algorithms particularly for the high-dimensional regime. Building on this, we obtain pure- and approximate-DP algorithms wit… ▽ More

    Submitted 31 October, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Minor corrections and re-structuring of the presentation

  4. arXiv:2306.15744  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Ticketed Learning-Unlearning Schemes

    Authors: Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Ayush Sekhari, Chiyuan Zhang

    Abstract: We consider the learning--unlearning paradigm defined as follows. First given a dataset, the goal is to learn a good predictor, such as one minimizing a certain loss. Subsequently, given any subset of examples that wish to be unlearnt, the goal is to learn, without the knowledge of the original training dataset, a good predictor that is identical to the predictor that would have been produced when… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Conference on Learning Theory (COLT) 2023

  5. arXiv:2210.15175  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Private Isotonic Regression

    Authors: Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi

    Abstract: In this paper, we consider the problem of differentially private (DP) algorithms for isotonic regression. For the most general problem of isotonic regression over a partially ordered set (poset) $\mathcal{X}$ and for any Lipschitz loss function, we obtain a pure-DP algorithm that, given $n$ input points, has an expected excess empirical risk of roughly… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Neural Information Processing Systems (NeurIPS), 2022

  6. arXiv:2112.03548  [pdf, ps, other

    stat.ML cs.CR cs.DS cs.IT cs.LG

    Private Robust Estimation by Stabilizing Convex Relaxations

    Authors: Pravesh K. Kothari, Pasin Manurangsi, Ameya Velingker

    Abstract: We give the first polynomial time and sample $(ε, δ)$-differentially private (DP) algorithm to estimate the mean, covariance and higher moments in the presence of a constant fraction of adversarial outliers. Our algorithm succeeds for families of distributions that satisfy two well-studied properties in prior works on robust estimation: certifiable subgaussianity of directional moments and certifi… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

  7. arXiv:2011.14580  [pdf, other

    cs.LG cs.CR cs.DS stat.ML

    Robust and Private Learning of Halfspaces

    Authors: Badih Ghazi, Ravi Kumar, Pasin Manurangsi, Thao Nguyen

    Abstract: In this work, we study the trade-off between differential privacy and adversarial robustness under L2-perturbations in the context of learning halfspaces. We prove nearly tight bounds on the sample complexity of robust private learning of halfspaces for a large regime of parameters. A highlight of our results is that robust and private learning is harder than robust or private learning alone. We c… ▽ More

    Submitted 25 March, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: AISTATS 2021

  8. arXiv:2009.09604  [pdf, ps, other

    cs.CR cs.DS cs.LG stat.ML

    On Distributed Differential Privacy and Counting Distinct Elements

    Authors: Lijie Chen, Badih Ghazi, Ravi Kumar, Pasin Manurangsi

    Abstract: We study the setup where each of $n$ users holds an element from a discrete set, and the goal is to count the number of distinct elements across all users, under the constraint of $(ε, δ)$-differentially privacy: - In the non-interactive local setting, we prove that the additive error of any protocol is $Ω(n)$ for any constant $ε$ and for any $δ$ inverse polynomial in $n$. - In the single-mess… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: 68 pages, 4 algorithms

  9. arXiv:2008.08007  [pdf, ps, other

    cs.LG cs.CR cs.DS stat.ML

    Differentially Private Clustering: Tight Approximation Ratios

    Authors: Badih Ghazi, Ravi Kumar, Pasin Manurangsi

    Abstract: We study the task of differentially private clustering. For several basic clustering problems, including Euclidean DensestBall, 1-Cluster, k-means, and k-median, we give efficient differentially private algorithms that achieve essentially the same approximation ratios as those that can be obtained by any non-private algorithm, while incurring only small additive errors. This improves upon existing… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: 60 pages, 1 table

  10. arXiv:2007.15220  [pdf, ps, other

    cs.LG cs.CC cs.DS stat.ML

    The Complexity of Adversarially Robust Proper Learning of Halfspaces with Agnostic Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Pasin Manurangsi

    Abstract: We study the computational complexity of adversarially robust proper learning of halfspaces in the distribution-independent agnostic PAC model, with a focus on $L_p$ perturbations. We give a computationally efficient learning algorithm and a nearly matching computational hardness result for this problem. An interesting implication of our findings is that the $L_{\infty}$ perturbations case is prov… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

  11. arXiv:2007.03668  [pdf, ps, other

    cs.LG math.CO stat.ML

    Near-tight closure bounds for Littlestone and threshold dimensions

    Authors: Badih Ghazi, Noah Golowich, Ravi Kumar, Pasin Manurangsi

    Abstract: We study closure properties for the Littlestone and threshold dimensions of binary hypothesis classes. Given classes $\mathcal{H}_1, \ldots, \mathcal{H}_k$ of Boolean functions with bounded Littlestone (respectively, threshold) dimension, we establish an upper bound on the Littlestone (respectively, threshold) dimension of the class defined by applying an arbitrary binary aggregation rule to… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 7 pages

  12. arXiv:1908.11335  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Nearly Tight Bounds for Robust Proper Learning of Halfspaces with a Margin

    Authors: Ilias Diakonikolas, Daniel M. Kane, Pasin Manurangsi

    Abstract: We study the problem of {\em properly} learning large margin halfspaces in the agnostic PAC model. In more detail, we study the complexity of properly learning $d$-dimensional halfspaces on the unit ball within misclassification error $α\cdot \mathrm{OPT}_γ + ε$, where $\mathrm{OPT}_γ$ is the optimal $γ$-margin error rate and $α\geq 1$ is the approximation ratio. We give learning algorithms and co… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.