Showing 1–5 of 5 results for author: Valiant, P

Searching in archive math.

  1. arXiv:2311.12784  [pdf, ps, other]

    math.ST cs.IT cs.LG stat.ML

    Optimality in Mean Estimation: Beyond Worst-Case, Beyond Sub-Gaussian, and Beyond $1+α$ Moments

    Authors: Trung Dang, Jasper C. H. Lee, Maoyuan Song, Paul Valiant

    Abstract: There is growing interest in improving our algorithmic understanding of fundamental statistical problems such as mean estimation, driven by the goal of understanding the limits of what we can extract from valuable data. The state-of-the-art results for mean estimation in $\mathbb{R}$ are 1) the optimal sub-Gaussian mean estimator by [LV22], with the tight sub-Gaussian constant for all distribution…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 27 pages, to appear in NeurIPS 2023. Abstract shortened to fit arXiv limit

  2. arXiv:2310.09408  [pdf, other]

    math.ST

    Improving Pearson's chi-squared test: hypothesis testing of distributions -- optimally

    Authors: Trung Dang, Walter McKelvie, Paul Valiant, Hongao Wang

    Abstract: Pearson's chi-squared test, from 1900, is the standard statistical tool for "hypothesis testing on distributions": namely, given samples from an unknown distribution $Q$ that may or may not equal a hypothesis distribution $P$, we want to return "yes" if $P=Q$ and "no" if $P$ is far from $Q$. While the chi-squared test is easy to use, it has been known for a while that it is not "data efficient", i…

    Submitted 13 October, 2023; originally announced October 2023.

  3. arXiv:2206.02348  [pdf, other]

    math.ST cs.DS cs.IT cs.LG stat.ML

    Finite-Sample Maximum Likelihood Estimation of Location

    Authors: Shivam Gupta, Jasper C. H. Lee, Eric Price, Paul Valiant

    Abstract: We consider 1-dimensional location estimation, where we estimate a parameter $λ$ from $n$ samples $λ+ η_i$, with each $η_i$ drawn i.i.d. from a known distribution $f$. For fixed $f$ the maximum-likelihood estimate (MLE) is well-known to be optimal in the limit as $n \to \infty$: it is asymptotically normal with variance matching the Cramér-Rao lower bound of $\frac{1}{n\mathcal{I}}$, where…

    Submitted 18 July, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Corrected an inaccuracy in the description of the experimental setup. Also updated funding acknowledgements

  4. arXiv:2011.08384  [pdf, ps, other]

    math.ST cs.DS cs.IT cs.LG stat.ML

    Optimal Sub-Gaussian Mean Estimation in $\mathbb{R}$

    Authors: Jasper C. H. Lee, Paul Valiant

    Abstract: We revisit the problem of estimating the mean of a real-valued distribution, presenting a novel estimator with sub-Gaussian convergence: intuitively, "our estimator, on any distribution, is as accurate as the sample mean is for the Gaussian distribution of matching variance." Crucially, in contrast to prior works, our estimator does not require prior knowledge of the variance, and works across the…

    Submitted 16 November, 2020; originally announced November 2020.

  5. arXiv:1112.5659  [pdf, ps, other]

    cs.DS math.PR math.ST

    Testing $k$-Modal Distributions: Optimal Algorithms via Reductions

    Authors: Constantinos Daskalakis, Ilias Diakonikolas, Rocco A. Servedio, Gregory Valiant, Paul Valiant

    Abstract: We give highly efficient algorithms, and almost matching lower bounds, for a range of basic statistical problems that involve testing and estimating the $L_1$ distance between two $k$-modal distributions $p$ and $q$ over the discrete domain $\{1,\dots,n\}$. More precisely, we consider the following four problems: given sample access to an unknown $k$-modal distribution $p$, Testing identity to a known…

    Submitted 23 December, 2011; originally announced December 2011.