Skip to main content

Showing 1–14 of 14 results for author: Singhal, V

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.12553  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Beyond Laplace and Gaussian: Exploring the Generalized Gaussian Mechanism for Private Machine Learning

    Authors: Roy Rinberg, Ilia Shumailov, Vikrant Singhal, Rachel Cummings, Nicolas Papernot

    Abstract: Differential privacy (DP) is obtained by randomizing a data analysis algorithm, which necessarily introduces a tradeoff between its utility and privacy. Many DP mechanisms are built upon one of two underlying tools: Laplace and Gaussian additive noise mechanisms. We expand the search space of algorithms by investigating the Generalized Gaussian mechanism, which samples the additive noise term $x$… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  2. arXiv:2402.00267  [pdf, ps, other

    cs.DS cs.CR stat.ML

    Not All Learnable Distribution Classes are Privately Learnable

    Authors: Mark Bun, Gautam Kamath, Argyris Mouzakis, Vikrant Singhal

    Abstract: We give an example of a class of distributions that is learnable up to constant error in total variation distance with a finite number of samples, but not learnable under $(\varepsilon, δ)$-differential privacy with the same target error. This weakly refutes a conjecture of Ashtiani.

    Submitted 25 March, 2025; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: Appeared in ALT 2024. Added clarification about result, and updated affiliation and funding for VS

  3. arXiv:2308.06239  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Private Distribution Learning with Public Data: The View from Sample Compression

    Authors: Shai Ben-David, Alex Bie, Clément L. Canonne, Gautam Kamath, Vikrant Singhal

    Abstract: We study the problem of private distribution learning with access to public data. In this setup, which we refer to as public-private learning, the learner is given public and private samples drawn from an unknown distribution $p$ belonging to a class $\mathcal Q$, with the goal of outputting an estimate of $p$ while adhering to privacy constraints (here, pure differential privacy) only with respec… ▽ More

    Submitted 14 August, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: 31 pages

  4. arXiv:2304.06787  [pdf, ps, other

    cs.DS cs.CR cs.LG stat.ML

    A Polynomial Time, Pure Differentially Private Estimator for Binary Product Distributions

    Authors: Vikrant Singhal

    Abstract: We present the first $\varepsilon$-differentially private, computationally efficient algorithm that estimates the means of product distributions over $\{0,1\}^d$ accurately in total-variation distance, whilst attaining the optimal sample complexity to within polylogarithmic factors. The prior work had either solved this problem efficiently and optimally under weaker notions of privacy, or had solv… ▽ More

    Submitted 25 January, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

  5. arXiv:2301.13334  [pdf, other

    math.ST cs.CR cs.DS stat.ML

    A Bias-Accuracy-Privacy Trilemma for Statistical Estimation

    Authors: Gautam Kamath, Argyris Mouzakis, Matthew Regehr, Vikrant Singhal, Thomas Steinke, Jonathan Ullman

    Abstract: Differential privacy (DP) is a rigorous notion of data privacy, used for private statistics. The canonical algorithm for differentially private mean estimation is to first clip the samples to a bounded range and then add noise to their empirical mean. Clipping controls the sensitivity and, hence, the variance of the noise that we add for privacy. But clipping also introduces statistical bias. This… ▽ More

    Submitted 7 October, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  6. arXiv:2208.07984  [pdf, other

    cs.LG cs.CR stat.ML

    Private Estimation with Public Data

    Authors: Alex Bie, Gautam Kamath, Vikrant Singhal

    Abstract: We initiate the study of differentially private (DP) estimation with access to a small amount of public data. For private estimation of d-dimensional Gaussians, we assume that the public data comes from a Gaussian that may have vanishing similarity in total variation distance with the underlying Gaussian of the private data. We show that under the constraints of pure or concentrated DP, d+1 public… ▽ More

    Submitted 5 April, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 55 pages; updated funding acknowledgement + simulation results from NeurIPS 2022 camera-ready

  7. arXiv:2205.08532  [pdf, ps, other

    cs.DS cs.CR stat.ML

    New Lower Bounds for Private Estimation and a Generalized Fingerprinting Lemma

    Authors: Gautam Kamath, Argyris Mouzakis, Vikrant Singhal

    Abstract: We prove new lower bounds for statistical estimation tasks under the constraint of $(\varepsilon, δ)$-differential privacy. First, we provide tight lower bounds for private covariance estimation of Gaussian distributions. We show that estimating the covariance matrix in Frobenius norm requires $Ω(d^2)$ samples, and in spectral norm requires $Ω(d^{3/2})$ samples, both matching upper bounds up to lo… ▽ More

    Submitted 28 March, 2023; v1 submitted 17 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022. Minor correction to the discussion of independent work

  8. arXiv:2111.04609  [pdf, ps, other

    stat.ML cs.CR cs.DS cs.IT cs.LG

    A Private and Computationally-Efficient Estimator for Unbounded Gaussians

    Authors: Gautam Kamath, Argyris Mouzakis, Vikrant Singhal, Thomas Steinke, Jonathan Ullman

    Abstract: We give the first polynomial-time, polynomial-sample, differentially private estimator for the mean and covariance of an arbitrary Gaussian distribution $\mathcal{N}(μ,Σ)$ in $\mathbb{R}^d$. All previous estimators are either nonconstructive, with unbounded running time, or require the user to specify a priori bounds on the parameters $μ$ and $Σ$. The primary new technical tool in our algorithm is… ▽ More

    Submitted 11 February, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

  9. arXiv:2106.00001  [pdf, other

    cs.CR cs.DS cs.LG stat.CO

    Privately Learning Subspaces

    Authors: Vikrant Singhal, Thomas Steinke

    Abstract: Private data analysis suffers a costly curse of dimensionality. However, the data often has an underlying low-dimensional structure. For example, when optimizing via gradient descent, the gradients often lie in or near a low-dimensional subspace. If that low-dimensional structure can be identified, then we can avoid paying (in terms of privacy or accuracy) for the high ambient dimension. We pres… ▽ More

    Submitted 10 August, 2021; v1 submitted 28 May, 2021; originally announced June 2021.

  10. arXiv:2002.09464  [pdf, other

    cs.DS cs.CR cs.IT cs.LG stat.ML

    Private Mean Estimation of Heavy-Tailed Distributions

    Authors: Gautam Kamath, Vikrant Singhal, Jonathan Ullman

    Abstract: We give new upper and lower bounds on the minimax sample complexity of differentially private mean estimation of distributions with bounded $k$-th moments. Roughly speaking, in the univariate case, we show that $n = Θ\left(\frac{1}{α^2} + \frac{1}{α^{\frac{k}{k-1}}\varepsilon}\right)$ samples are necessary and sufficient to estimate the mean to $α$-accuracy under $\varepsilon$-differential privacy… ▽ More

    Submitted 16 February, 2021; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: Appeared in COLT 2020

  11. arXiv:1912.04690  [pdf

    cs.LG eess.IV stat.ML

    Reconstructing Multi-echo Magnetic Resonance Images via Structured Deep Dictionary Learning

    Authors: Vanika Singhal, Angshul Majumdar

    Abstract: Multi-echo magnetic resonance (MR) images are acquired by changing the echo times (for T2 weighted) or relaxation times (for T1 weighted) of scans. The resulting (multi-echo) images are usually used for quantitative MR imaging. Acquiring MR images is a slow process and acquiring multi scans of the same cross section for multi-echo imaging is even slower. In order to accelerate the scan, compressed… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: Final version accepted at Neurocomputing

  12. arXiv:1909.03951  [pdf, other

    cs.DS cs.CR cs.IT cs.LG stat.ML

    Differentially Private Algorithms for Learning Mixtures of Separated Gaussians

    Authors: Gautam Kamath, Or Sheffet, Vikrant Singhal, Jonathan Ullman

    Abstract: Learning the parameters of Gaussian mixture models is a fundamental and widely studied problem with numerous applications. In this work, we give new algorithms for learning the parameters of a high-dimensional, well separated, Gaussian mixture model subject to the strong constraint of differential privacy. In particular, we give a differentially private analogue of the algorithm of Achlioptas and… ▽ More

    Submitted 15 October, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: To appear in NeurIPS 2019

  13. arXiv:1805.00216  [pdf, other

    cs.DS cs.CR cs.LG stat.ML

    Privately Learning High-Dimensional Distributions

    Authors: Gautam Kamath, Jerry Li, Vikrant Singhal, Jonathan Ullman

    Abstract: We present novel, computationally efficient, and differentially private algorithms for two fundamental high-dimensional learning problems: learning a multivariate Gaussian and learning a product distribution over the Boolean hypercube in total variation distance. The sample complexity of our algorithms nearly matches the sample complexity of the optimal non-private learners for these tasks in a wi… ▽ More

    Submitted 30 May, 2019; v1 submitted 1 May, 2018; originally announced May 2018.

    Comments: To appear in COLT 2019

  14. arXiv:1612.07454  [pdf

    cs.LG stat.ML

    How to Train Your Deep Neural Network with Dictionary Learning

    Authors: Vanika Singhal, Shikha Singh, Angshul Majumdar

    Abstract: Currently there are two predominant ways to train deep neural networks. The first one uses restricted Boltzmann machine (RBM) and the second one autoencoders. RBMs are stacked in layers to form deep belief network (DBN); the final representation layer is attached to the target to complete the deep neural network. Autoencoders are nested one inside the other to form stacked autoencoders; once the s… ▽ More

    Submitted 22 December, 2016; originally announced December 2016.

    Comments: DCC 2017 poster