Skip to main content

Showing 1–11 of 11 results for author: Cherapanamjeri, Y

Searching in archive math. Search in all archives.
.
  1. arXiv:2411.15306  [pdf, ps, other

    math.ST cs.DS cs.LG stat.ME stat.ML

    Heavy-tailed Contamination is Easier than Adversarial Contamination

    Authors: Yeshwanth Cherapanamjeri, Daniel Lee

    Abstract: A large body of work in the statistics and computer science communities dating back to Huber (Huber, 1960) has led to statistically and computationally efficient outlier-robust estimators. Two particular outlier models have received significant attention: the adversarial and heavy-tailed models. While the former models outliers as the result of a malicious adversary manipulating the data, the latt… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

  2. arXiv:2310.10758  [pdf, ps, other

    math.ST cs.DS cs.LG

    Statistical Barriers to Affine-equivariant Estimation

    Authors: Zihao Chen, Yeshwanth Cherapanamjeri

    Abstract: We investigate the quantitative performance of affine-equivariant estimators for robust mean estimation. As a natural stability requirement, the construction of such affine-equivariant estimators has been extensively studied in the statistics literature. We quantitatively evaluate these estimators under two outlier models which have been the subject of much recent work: the heavy-tailed and advers… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  3. arXiv:2304.09167  [pdf, ps, other

    cs.LG cs.DS math.ST

    Optimal PAC Bounds Without Uniform Convergence

    Authors: Ishaq Aden-Ali, Yeshwanth Cherapanamjeri, Abhishek Shetty, Nikita Zhivotovskiy

    Abstract: In statistical learning theory, determining the sample complexity of realizable binary classification for VC classes was a long-standing open problem. The results of Simon and Hanneke established sharp upper bounds in this setting. However, the reliance of their argument on the uniform convergence principle limits its applicability to more general learning settings such as multiclass classificatio… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 27 pages

  4. arXiv:2212.09270  [pdf, ps, other

    cs.LG cs.DS math.ST

    The One-Inclusion Graph Algorithm is not Always Optimal

    Authors: Ishaq Aden-Ali, Yeshwanth Cherapanamjeri, Abhishek Shetty, Nikita Zhivotovskiy

    Abstract: The one-inclusion graph algorithm of Haussler, Littlestone, and Warmuth achieves an optimal in-expectation risk bound in the standard PAC classification setup. In one of the first COLT open problems, Warmuth conjectured that this prediction strategy always implies an optimal high probability bound on the risk, and hence is also an optimal PAC algorithm. We refute this conjecture in the strongest s… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 16 pages

  5. arXiv:2205.03246  [pdf, other

    math.ST cs.DS cs.LG stat.ML

    What Makes A Good Fisherman? Linear Regression under Self-Selection Bias

    Authors: Yeshwanth Cherapanamjeri, Constantinos Daskalakis, Andrew Ilyas, Manolis Zampetakis

    Abstract: In the classical setting of self-selection, the goal is to learn $k$ models, simultaneously from observations $(x^{(i)}, y^{(i)})$ where $y^{(i)}$ is the output of one of $k$ underlying models on input $x^{(i)}$. In contrast to mixture models, where we observe the output of a randomly selected model, here the observed model depends on the outputs themselves, and is determined by some known selecti… ▽ More

    Submitted 10 December, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

  6. arXiv:2205.02060  [pdf, ps, other

    cs.GT cs.DS math.ST stat.ML

    Estimation of Standard Auction Models

    Authors: Yeshwanth Cherapanamjeri, Constantinos Daskalakis, Andrew Ilyas, Manolis Zampetakis

    Abstract: We provide efficient estimation methods for first- and second-price auctions under independent (asymmetric) private values and partial observability. Given a finite set of observations, each comprising the identity of the winner and the price they paid in a sequence of identical auctions, we provide algorithms for non-parametrically estimating the bid distribution of each bidder, as well as their… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

  7. arXiv:2011.12433  [pdf, ps, other

    math.ST cs.DS cs.LG stat.ML

    Optimal Mean Estimation without a Variance

    Authors: Yeshwanth Cherapanamjeri, Nilesh Tripuraneni, Peter L. Bartlett, Michael I. Jordan

    Abstract: We study the problem of heavy-tailed mean estimation in settings where the variance of the data-generating distribution does not exist. Concretely, given a sample $\mathbf{X} = \{X_i\}_{i = 1}^n$ from a distribution $\mathcal{D}$ over $\mathbb{R}^d$ with mean $μ$ which satisfies the following \emph{weak-moment} assumption for some ${α\in [0, 1]}$: \begin{equation*} \forall \|v\| = 1: \mathbb{E}_{X… ▽ More

    Submitted 8 December, 2020; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Fixed typographical errors in Theorem 1.2, Lemmas 4.3 and C.8

  8. arXiv:1912.11071  [pdf, ps, other

    math.ST cs.DS

    Algorithms for Heavy-Tailed Statistics: Regression, Covariance Estimation, and Beyond

    Authors: Yeshwanth Cherapanamjeri, Samuel B. Hopkins, Tarun Kathuria, Prasad Raghavendra, Nilesh Tripuraneni

    Abstract: We study efficient algorithms for linear regression and covariance estimation in the absence of Gaussian assumptions on the underlying distributions of samples, making assumptions instead about only finitely-many moments. We focus on how many samples are needed to do estimation and regression with high accuracy and exponentially-good success probability. For covariance estimation, linear regress… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

  9. arXiv:1902.01999  [pdf, ps, other

    math.ST cs.DS cs.LG stat.ML

    Testing Markov Chains without Hitting

    Authors: Yeshwanth Cherapanamjeri, Peter L. Bartlett

    Abstract: We study the problem of identity testing of markov chains. In this setting, we are given access to a single trajectory from a markov chain with unknown transition matrix $Q$ and the goal is to determine whether $Q = P$ for some known matrix $P$ or $\text{Dist}(P, Q) \geq ε$ where $\text{Dist}$ is suitably defined. In recent work by Daskalakis, Dikkala and Gravin, 2018, it was shown that it is poss… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

  10. arXiv:1902.01998  [pdf, ps, other

    math.ST cs.DS cs.LG stat.ML

    Fast Mean Estimation with Sub-Gaussian Rates

    Authors: Yeshwanth Cherapanamjeri, Nicolas Flammarion, Peter L. Bartlett

    Abstract: We propose an estimator for the mean of a random vector in $\mathbb{R}^d$ that can be computed in time $O(n^4+n^2d)$ for $n$ i.i.d.~samples and that has error bounds matching the sub-Gaussian case. The only assumptions we make about the data distribution are that it has finite mean and covariance; in particular, we make no assumptions about higher-order moments. Like the polynomial time estimator… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

  11. arXiv:1606.07315  [pdf, ps, other

    cs.LG math.NA

    Nearly-optimal Robust Matrix Completion

    Authors: Yeshwanth Cherapanamjeri, Kartik Gupta, Prateek Jain

    Abstract: In this paper, we consider the problem of Robust Matrix Completion (RMC) where the goal is to recover a low-rank matrix by observing a small number of its entries out of which a few can be arbitrarily corrupted. We propose a simple projected gradient descent method to estimate the low-rank matrix that alternately performs a projected gradient descent step and cleans up a few of the corrupted entri… ▽ More

    Submitted 8 December, 2016; v1 submitted 23 June, 2016; originally announced June 2016.