Skip to main content

Showing 1–50 of 88 results for author: Wasserman, L

Searching in archive math. Search in all archives.
.
  1. arXiv:2507.00260  [pdf, ps, other

    stat.ML cs.LG math.ST stat.ME

    Disentangled Feature Importance

    Authors: Jin-Hong Du, Kathryn Roeder, Larry Wasserman

    Abstract: Feature importance quantification faces a fundamental challenge: when predictors are correlated, standard methods systematically underestimate their contributions. We prove that major existing approaches target identical population functionals under squared-error loss, revealing why they share this correlation-induced bias. To address this limitation, we introduce \emph{Disentangled Feature Impo… ▽ More

    Submitted 30 June, 2025; originally announced July 2025.

    Comments: 26 main and 29 supplementary pages

  2. arXiv:2506.19025  [pdf, ps, other

    math.ST cs.AI cs.LG stat.ME stat.ML

    Statistical Inference for Optimal Transport Maps: Recent Advances and Perspectives

    Authors: Sivaraman Balakrishnan, Tudor Manole, Larry Wasserman

    Abstract: In many applications of optimal transport (OT), the object of primary interest is the optimal transport map. This map rearranges mass from one probability distribution to another in the most efficient way possible by minimizing a specified cost. In this paper we review recent advances in estimating and developing limit theorems for the OT map, using samples from the underlying distributions. We al… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 36 pages, 1 figure

  3. arXiv:2504.13977  [pdf, other

    math.ST stat.ME

    Testing Random Effects for Binomial Data

    Authors: Lucas Kania, Larry Wasserman, Sivaraman Balakrishnan

    Abstract: In modern scientific research, small-scale studies with limited participants are increasingly common. However, interpreting individual outcomes can be challenging, making it standard practice to combine data across studies using random effects to draw broader scientific conclusions. In this work, we introduce an optimal methodology for assessing the goodness of fit between a given reference distri… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  4. arXiv:2411.14285  [pdf, other

    stat.ME math.ST

    Stochastic interventions, sensitivity analysis, and optimal transport

    Authors: Alexander W. Levis, Edward H. Kennedy, Alec McClean, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: Recent methodological research in causal inference has focused on effects of stochastic interventions, which assign treatment randomly, often according to subject-specific covariates. In this work, we demonstrate that the usual notion of stochastic interventions have a surprising property: when there is unmeasured confounding, bounds on their effects do not collapse when the policy approaches the… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 37 pages, 1 figure

  5. arXiv:2403.15175  [pdf, other

    math.ST stat.ME stat.ML

    Double Cross-fit Doubly Robust Estimators: Beyond Series Regression

    Authors: Alec McClean, Sivaraman Balakrishnan, Edward H. Kennedy, Larry Wasserman

    Abstract: Doubly robust estimators with cross-fitting have gained popularity in causal inference due to their favorable structure-agnostic error guarantees. However, when additional structure, such as Hölder smoothness, is available then more accurate "double cross-fit doubly robust" (DCDR) estimators can be constructed by splitting the training data and undersmoothing nuisance function estimators on indepe… ▽ More

    Submitted 7 May, 2025; v1 submitted 22 March, 2024; originally announced March 2024.

  6. arXiv:2402.18921  [pdf, other

    math.ST stat.ME stat.ML

    Semi-Supervised U-statistics

    Authors: Ilmun Kim, Larry Wasserman, Sivaraman Balakrishnan, Matey Neykov

    Abstract: Semi-supervised datasets are ubiquitous across diverse domains where obtaining fully labeled data is costly or time-consuming. The prevalence of such datasets has consistently driven the demand for new tools and methods that exploit the potential of unlabeled data. Responding to this demand, we introduce semi-supervised U-statistics enhanced by the abundance of unlabeled data, and investigate thei… ▽ More

    Submitted 9 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  7. arXiv:2312.12407  [pdf, other

    math.PR math.AP math.ST

    Central Limit Theorems for Smooth Optimal Transport Maps

    Authors: Tudor Manole, Sivaraman Balakrishnan, Jonathan Niles-Weed, Larry Wasserman

    Abstract: One of the central objects in the theory of optimal transport is the Brenier map: the unique monotone transformation which pushes forward an absolutely continuous probability law onto any other given law. A line of recent work has analyzed $L^2$ convergence rates of plugin estimators of Brenier maps, which are defined as the Brenier map between density estimators of the underlying distributions. I… ▽ More

    Submitted 16 September, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  8. arXiv:2310.12757  [pdf, other

    stat.ME math.ST

    Conservative Inference for Counterfactuals

    Authors: Sivaraman Balakrishnan, Edward Kennedy, Larry Wasserman

    Abstract: In causal inference, the joint law of a set of counterfactual random variables is generally not identified. We show that a conservative version of the joint law - corresponding to the smallest treatment effect - is identified. Finding this law uses recent results from optimal transport theory. Under this conservative law we can bound causal effects and we may construct inferences for each individu… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  9. arXiv:2309.00706  [pdf, other

    stat.ME math.ST

    Causal Effect Estimation after Propensity Score Trimming with Continuous Treatments

    Authors: Zach Branson, Edward H. Kennedy, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: Propensity score trimming, which discards subjects with propensity scores below a threshold, is a common way to address positivity violations that complicate causal effect estimation. However, most works on trimming assume treatment is discrete and models for the outcome regression and propensity score are parametric. This work proposes nonparametric estimators for trimmed average causal effects i… ▽ More

    Submitted 29 July, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

  10. arXiv:2308.08672  [pdf, other

    math.ST

    Nearly Minimax Optimal Wasserstein Conditional Independence Testing

    Authors: Matey Neykov, Larry Wasserman, Ilmun Kim, Sivaraman Balakrishnan

    Abstract: This paper is concerned with minimax conditional independence testing. In contrast to some previous works on the topic, which use the total variation distance to separate the null from the alternative, here we use the Wasserstein distance. In addition, we impose Wasserstein smoothness conditions which on bounded domains are weaker than the corresponding total variation smoothness imposed, for inst… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 24 pages, 1 figure, ordering of the last three authors is random

  11. arXiv:2308.05373  [pdf, other

    math.ST stat.CO stat.ME

    Conditional Independence Testing for Discrete Distributions: Beyond $χ^2$- and $G$-tests

    Authors: Ilmun Kim, Matey Neykov, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: This paper is concerned with the problem of conditional independence testing for discrete data. In recent years, researchers have shed new light on this fundamental problem, emphasizing finite-sample optimality. The non-asymptotic viewpoint adapted in these works has led to novel conditional independence tests that enjoy certain optimality under various regimes. Despite their attractive theoretica… ▽ More

    Submitted 28 October, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

  12. arXiv:2305.04116  [pdf, ps, other

    math.ST stat.ME stat.ML

    The Fundamental Limits of Structure-Agnostic Functional Estimation

    Authors: Sivaraman Balakrishnan, Edward H. Kennedy, Larry Wasserman

    Abstract: Many recent developments in causal inference, and functional estimation problems more generally, have been motivated by the fact that classical one-step (first-order) debiasing methods, or their more recent sample-split double machine-learning avatars, can outperform plugin estimators under surprisingly weak conditions. These first-order corrections improve on plugin estimators in a black-box fash… ▽ More

    Submitted 7 June, 2025; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 34 pages, to appear in Statistical Science

  13. arXiv:2210.04681  [pdf, other

    stat.ME math.ST

    Sensitivity Analysis for Marginal Structural Models

    Authors: Matteo Bonvini, Edward Kennedy, Valerie Ventura, Larry Wasserman

    Abstract: We introduce several methods for assessing sensitivity to unmeasured confounding in marginal structural models; importantly we allow treatments to be discrete or continuous, static or time-varying. We consider three sensitivity models: a propensity-based model, an outcome-based model, and a subset confounding model, in which only a fraction of the population is subject to unmeasured confounding. I… ▽ More

    Submitted 11 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

  14. arXiv:2206.02954  [pdf, ps, other

    math.ST stat.ME

    Median Regularity and Honest Inference

    Authors: Arun Kumar Kuchibhotla, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: We introduce a new notion of regularity of an estimator called median regularity. We prove that uniformly valid (honest) inference for a functional is possible if and only if there exists a median regular estimator of that functional. To our knowledge, such a notion of regularity that is necessary for uniformly valid inference is unavailable in the literature.

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 10 pages

  15. arXiv:2203.00837  [pdf, other

    math.ST

    Minimax rates for heterogeneous causal effect estimation

    Authors: Edward H. Kennedy, Sivaraman Balakrishnan, James M. Robins, Larry Wasserman

    Abstract: Estimation of heterogeneous causal effects - i.e., how effects of policies and treatments vary across subjects - is a fundamental task in causal inference. Many methods for estimating conditional average treatment effects (CATEs) have been proposed in recent years, but questions surrounding optimality have remained largely unanswered. In particular, a minimax theory of optimality has yet to be dev… ▽ More

    Submitted 22 December, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

  16. arXiv:2112.11666  [pdf, other

    math.ST stat.ME

    Local permutation tests for conditional independence

    Authors: Ilmun Kim, Matey Neykov, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: In this paper, we investigate local permutation tests for testing conditional independence between two random vectors $X$ and $Y$ given $Z$. The local permutation test determines the significance of a test statistic by locally shuffling samples which share similar values of the conditioning variables $Z$, and it forms a natural extension of the usual permutation approach for unconditional independ… ▽ More

    Submitted 6 January, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: A few important references (missed before) added

  17. arXiv:2112.11079  [pdf, other

    stat.ME math.ST stat.ML stat.OT

    Data fission: splitting a single data point

    Authors: James Leiner, Boyan Duan, Larry Wasserman, Aaditya Ramdas

    Abstract: Suppose we observe a random vector $X$ from some distribution $P$ in a known family with unknown parameters. We ask the following question: when is it possible to split $X$ into two parts $f(X)$ and $g(X)$ such that neither part is sufficient to reconstruct $X$ by itself, but both together can recover $X$ fully, and the joint distribution of $(f(X),g(X))$ is tractable? As one example, if… ▽ More

    Submitted 10 December, 2023; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: 57 pages, 35 figures

  18. arXiv:2111.09254  [pdf, other

    stat.ME cs.LG math.ST

    Universal Inference Meets Random Projections: A Scalable Test for Log-concavity

    Authors: Robin Dunn, Aditya Gangrade, Larry Wasserman, Aaditya Ramdas

    Abstract: Shape constraints yield flexible middle grounds between fully nonparametric and fully parametric approaches to modeling distributions of data. The specific assumption of log-concavity is motivated by applications across economics, survival modeling, and reliability theory. However, there do not currently exist valid tests for whether the underlying density of given data is log-concave. The recent… ▽ More

    Submitted 14 April, 2024; v1 submitted 17 November, 2021; originally announced November 2021.

  19. arXiv:2107.12364  [pdf, other

    math.ST stat.ML

    Plugin Estimation of Smooth Optimal Transport Maps

    Authors: Tudor Manole, Sivaraman Balakrishnan, Jonathan Niles-Weed, Larry Wasserman

    Abstract: We analyze a number of natural estimators for the optimal transport map between two distributions and show that they are minimax optimal. We adopt the plugin approach: our estimators are simply optimal couplings between measures derived from our observations, appropriately extended so that they define functions on $\mathbb{R}^d$. When the underlying map is assumed to be Lipschitz, we show that com… ▽ More

    Submitted 16 June, 2024; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: To appear in the Annals of Statistics

  20. arXiv:2105.14577  [pdf, other

    math.ST stat.CO stat.ME

    The HulC: Confidence Regions from Convex Hulls

    Authors: Arun Kumar Kuchibhotla, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: We develop and analyze the HulC, an intuitive and general method for constructing confidence sets using the convex hull of estimates constructed from subsets of the data. Unlike classical methods which are based on estimating the (limiting) distribution of an estimator, the HulC is often simpler to use and effectively bypasses this step. In comparison to the bootstrap, the HulC requires fewer regu… ▽ More

    Submitted 8 September, 2023; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: Latest version. Fixed a gap in Proposition and Theorem 1 pointed out by Prof. Hannes Leeb. Now all the simulations include a comparison with subsampling. Also, added several new simulation settings including quantile regression, isotonic regression both under non-standard assumptions

  21. arXiv:2102.12034  [pdf, other

    stat.ME math.ST

    Semiparametric counterfactual density estimation

    Authors: Edward H. Kennedy, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: Causal effects are often characterized with averages, which can give an incomplete picture of the underlying counterfactual distributions. Here we consider estimating the entire counterfactual density and generic functionals thereof. We focus on two kinds of target parameters. The first is a density approximation, defined by a projection onto a finite-dimensional model using a generalized distance… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

  22. arXiv:2007.09751  [pdf, ps, other

    math.ST stat.ME

    Berry-Esseen Bounds for Projection Parameters and Partial Correlations with Increasing Dimension

    Authors: Arun Kumar Kuchibhotla, Alessandro Rinaldo, Larry Wasserman

    Abstract: We provide finite sample bounds on the Normal approximation to the law of the least squares estimator of the projection parameters normalized by the sandwich-based standard errors. Our results hold in the increasing dimension setting and under minimal assumptions on the data generating distribution. In particular, we do not assume a linear regression function and only require the existence of fini… ▽ More

    Submitted 22 October, 2021; v1 submitted 19 July, 2020; originally announced July 2020.

    Comments: 58 pages, 0 figures

  23. arXiv:2006.14781  [pdf, other

    stat.ML cs.LG math.OC

    The huge Package for High-dimensional Undirected Graph Estimation in R

    Authors: Tuo Zhao, Han Liu, Kathryn Roeder, John Lafferty, Larry Wasserman

    Abstract: We describe an R package named huge which provides easy-to-use functions for estimating high dimensional undirected graphs from data. This package implements recent results in the literature, including Friedman et al. (2007), Liu et al. (2009, 2012) and Liu et al. (2010). Compared with the existing graph estimation package glasso, the huge package provides extra features: (1) instead of using Fort… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: Published on JMLR in 2012

  24. arXiv:2003.13208  [pdf, other

    math.ST

    Minimax optimality of permutation tests

    Authors: Ilmun Kim, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: Permutation tests are widely used in statistics, providing a finite-sample guarantee on the type I error rate whenever the distribution of the samples under the null hypothesis is invariant to some rearrangement. Despite its increasing popularity and empirical success, theoretical properties of the permutation test, especially its power, have not been fully explored beyond simple cases. In this pa… ▽ More

    Submitted 25 May, 2022; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Typo in Eq.(38) is fixed

  25. arXiv:2001.03039  [pdf, other

    math.ST

    Minimax Optimal Conditional Independence Testing

    Authors: Matey Neykov, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: We consider the problem of conditional independence testing of $X$ and $Y$ given $Z$ where $X,Y$ and $Z$ are three real random variables and $Z$ is continuous. We focus on two main cases - when $X$ and $Y$ are both discrete, and when $X$ and $Y$ are both continuous. In view of recent results on conditional independence testing (Shah and Peters, 2018), one cannot hope to design non-trivial tests, w… ▽ More

    Submitted 1 July, 2021; v1 submitted 9 January, 2020; originally announced January 2020.

    Comments: 92 pages, 1 table, 6 figures. v4 major updates: fixed and error in appendix G -- multivariate Z case

  26. arXiv:1912.11436  [pdf, other

    math.ST stat.ME stat.ML

    Universal Inference

    Authors: Larry Wasserman, Aaditya Ramdas, Sivaraman Balakrishnan

    Abstract: We propose a general method for constructing hypothesis tests and confidence sets that have finite sample guarantees without regularity conditions. We refer to such procedures as "universal." The method is very simple and is based on a modified version of the usual likelihood ratio statistic, that we call "the split likelihood ratio test" (split LRT). The method is especially appealing for irregul… ▽ More

    Submitted 19 October, 2022; v1 submitted 24 December, 2019; originally announced December 2019.

    Comments: To appear in the Proceedings of the National Academy of Sciences

  27. arXiv:1909.07862  [pdf, other

    math.ST stat.ML

    Minimax Confidence Intervals for the Sliced Wasserstein Distance

    Authors: Tudor Manole, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: Motivated by the growing popularity of variants of the Wasserstein distance in statistics and machine learning, we study statistical inference for the Sliced Wasserstein distance--an easily computable variant of the Wasserstein distance. Specifically, we construct confidence intervals for the Sliced Wasserstein distance which have finite-sample validity under no assumptions or under mild moment as… ▽ More

    Submitted 3 April, 2022; v1 submitted 17 September, 2019; originally announced September 2019.

    Comments: Published at https://doi.org/10.1214/22-EJS2001 in the Electronic Journal of Statistics

    Journal ref: Electronic Journal of Statistics 2022, Vol 16, No. 1, 2252-2345

  28. arXiv:1903.06955  [pdf, other

    math.AT cs.CG

    Homotopy Reconstruction via the Cech Complex and the Vietoris-Rips Complex

    Authors: Jisu Kim, Jaehyeok Shin, Frédéric Chazal, Alessandro Rinaldo, Larry Wasserman

    Abstract: We derive conditions under which the reconstruction of a target space is topologically correct via the Čech complex or the Vietoris-Rips complex obtained from possibly noisy point cloud data. We provide two novel theoretical results. First, we describe sufficient conditions under which any non-empty intersection of finitely many Euclidean balls intersected with a positive reach set is contractible… ▽ More

    Submitted 12 May, 2020; v1 submitted 16 March, 2019; originally announced March 2019.

    Comments: 60 pages, 7 figures, to appear in the 36th International Symposium on Computational Geometry (SoCG 2020), the code is available at https://github.com/jisuk1/nerveshape

  29. arXiv:1810.05935  [pdf, ps, other

    math.ST

    Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Volume Dimension

    Authors: Jisu Kim, Jaehyeok Shin, Alessandro Rinaldo, Larry Wasserman

    Abstract: We derive concentration inequalities for the supremum norm of the difference between a kernel density estimator (KDE) and its point-wise expectation that hold uniformly over the selection of the bandwidth and under weaker conditions on the kernel and the data generating distribution than previously used in the literature. We first propose a novel concept, called the volume dimension, to measure th… ▽ More

    Submitted 31 December, 2019; v1 submitted 13 October, 2018; originally announced October 2018.

    Comments: 51 pages, to be published in Proceedings of Thirty-sixth International Conference on Machine Learning (ICML 2019), Volume 97, 2019

  30. arXiv:1809.07441  [pdf, other

    stat.ME math.ST

    Distribution-Free Prediction Sets for Two-Layer Hierarchical Models

    Authors: Robin Dunn, Larry Wasserman, Aaditya Ramdas

    Abstract: We consider the problem of constructing distribution-free prediction sets for data from two-layer hierarchical distributions. For iid data, prediction sets can be constructed using the method of conformal prediction. The validity of conformal prediction hinges on the exchangeability of the data, which does not hold when groups of observations come from distinct distributions, such as multiple obse… ▽ More

    Submitted 23 February, 2022; v1 submitted 19 September, 2018; originally announced September 2018.

    Comments: Minor revisions for journal. Accepted to the Journal of the American Statistical Association

  31. arXiv:1803.00715  [pdf, other

    math.ST stat.ME

    Robust Multivariate Nonparametric Tests via Projection-Averaging

    Authors: Ilmun Kim, Sivaraman Balakrishnan, Larry Wasserman

    Abstract: In this work, we generalize the Cramér-von Mises statistic via projection-averaging to obtain a robust test for the multivariate two-sample problem. The proposed test is consistent against all fixed alternatives, robust to heavy-tailed data and minimax rate optimal against a certain class of alternatives. Our test statistic is completely free of tuning parameters and is computationally efficient e… ▽ More

    Submitted 21 May, 2019; v1 submitted 1 March, 2018; originally announced March 2018.

  32. arXiv:1706.10003  [pdf, other

    math.ST cs.IT cs.LG stat.ML

    Hypothesis Testing For Densities and High-Dimensional Multinomials: Sharp Local Minimax Rates

    Authors: Sivaraman Balakrishnan, Larry Wasserman

    Abstract: We consider the goodness-of-fit testing problem of distinguishing whether the data are drawn from a specified distribution, versus a composite alternative separated from the null in the total variation metric. In the discrete case, we consider goodness-of-fit testing when the null distribution has a possibly growing or unbounded number of categories. In the continuous case, we consider testing a L… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

    Comments: 60 pages, 6 figures

  33. arXiv:1705.04565  [pdf, other

    math.ST math.DG

    Estimating the Reach of a Manifold

    Authors: Eddie Aamari, Jisu Kim, Frédéric Chazal, Bertrand Michel, Alessandro Rinaldo, Larry Wasserman

    Abstract: Various problems in manifold estimation make use of a quantity called the reach, denoted by $τ\_M$, which is a measure of the regularity of the manifold. This paper is the first investigation into the problem of how to estimate the reach. First, we study the geometry of the reach through an approximation perspective. We derive new geometric results on the reach for submanifolds without boundary. A… ▽ More

    Submitted 8 April, 2019; v1 submitted 12 May, 2017; originally announced May 2017.

  34. arXiv:1611.05401  [pdf, other

    math.ST

    Bootstrapping and Sample Splitting For High-Dimensional, Assumption-Free Inference

    Authors: Alessandro Rinaldo, Larry Wasserman, Max G'Sell, Jing Lei

    Abstract: Several new methods have been proposed for performing valid inference after model selection. An older method is sampling splitting: use part of the data for model selection and part for inference. In this paper we revisit sample splitting combined with the bootstrap (or the Normal approximation). We show that this leads to a simple, assumption-free approach to inference and we establish results on… ▽ More

    Submitted 2 April, 2018; v1 submitted 16 November, 2016; originally announced November 2016.

    MSC Class: 62G05

  35. arXiv:1605.06416  [pdf, other

    math.ST stat.ME stat.ML

    Statistical Inference for Cluster Trees

    Authors: Jisu Kim, Yen-Chi Chen, Sivaraman Balakrishnan, Alessandro Rinaldo, Larry Wasserman

    Abstract: A cluster tree provides a highly-interpretable summary of a density function by representing the hierarchy of its high-density clusters. It is estimated using the empirical tree, which is the cluster tree constructed from a density estimator. This paper addresses the basic question of quantifying our uncertainty by assessing the statistical significance of topological features of an empirical clus… ▽ More

    Submitted 12 February, 2017; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: 20 pages, 6 figures, accepted in Neural Information Processing Systems (NIPS) 2016

  36. Minimax Rates for Estimating the Dimension of a Manifold

    Authors: Jisu Kim, Alessandro Rinaldo, Larry Wasserman

    Abstract: Many algorithms in machine learning and computational geometry require, as input, the intrinsic dimension of the manifold that supports the probability distribution of the data. This parameter is rarely known and therefore has to be estimated. We characterize the statistical difficulty of this problem by deriving upper and lower bounds on the minimax rate for estimating the dimension. First, we co… ▽ More

    Submitted 30 December, 2019; v1 submitted 3 May, 2016; originally announced May 2016.

    Comments: 54 pages, 11 figures, to be published in Journal of Computational Geometry, Volume 10, Number 1

  37. arXiv:1604.04173  [pdf, other

    stat.ME math.ST stat.ML

    Distribution-Free Predictive Inference For Regression

    Authors: Jing Lei, Max G'Sell, Alessandro Rinaldo, Ryan J. Tibshirani, Larry Wasserman

    Abstract: We develop a general framework for distribution-free predictive inference in regression, using conformal inference. The proposed methodology allows for the construction of a prediction band for the response variable using any estimator of the regression function. The resulting prediction band preserves the consistency properties of the original estimator under standard assumptions, while guarantee… ▽ More

    Submitted 8 March, 2017; v1 submitted 14 April, 2016; originally announced April 2016.

    Comments: 50 pages, 7 figures, 3 tables

  38. arXiv:1602.02210  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Classification accuracy as a proxy for two sample testing

    Authors: Ilmun Kim, Aaditya Ramdas, Aarti Singh, Larry Wasserman

    Abstract: When data analysts train a classifier and check if its accuracy is significantly different from chance, they are implicitly performing a two-sample test. We investigate the statistical properties of this flexible approach in the high-dimensional setting. We prove two results that hold for all classifiers in any dimensions: if its true error remains $ε$-better than chance for some $ε>0$ as… ▽ More

    Submitted 17 February, 2020; v1 submitted 5 February, 2016; originally announced February 2016.

    Comments: 71 pages, 4 figures. Accepted for publication at the Annals of Statistics (2020)

  39. arXiv:1601.06259  [pdf, ps, other

    stat.ML cs.IT cs.LG math.ST

    Minimax Lower Bounds for Linear Independence Testing

    Authors: Aaditya Ramdas, David Isenberg, Aarti Singh, Larry Wasserman

    Abstract: Linear independence testing is a fundamental information-theoretic and statistical problem that can be posed as follows: given $n$ points $\{(X_i,Y_i)\}^n_{i=1}$ from a $p+q$ dimensional multivariate distribution where $X_i \in \mathbb{R}^p$ and $Y_i \in\mathbb{R}^q$, determine whether $a^T X$ and $b^T Y$ are uncorrelated for every $a \in \mathbb{R}^p, b\in \mathbb{R}^q$ or not. We give minimax lo… ▽ More

    Submitted 23 January, 2016; originally announced January 2016.

    Comments: 9 pages

  40. arXiv:1508.00655  [pdf, other

    math.ST cs.AI cs.IT cs.LG stat.ML

    Adaptivity and Computation-Statistics Tradeoffs for Kernel and Distance based High Dimensional Two Sample Testing

    Authors: Aaditya Ramdas, Sashank J. Reddi, Barnabas Poczos, Aarti Singh, Larry Wasserman

    Abstract: Nonparametric two sample testing is a decision theoretic problem that involves identifying differences between two random variables without making parametric assumptions about their underlying distributions. We refer to the most common settings as mean difference alternatives (MDA), for testing differences only in first moments, and general difference alternatives (GDA), which is about testing for… ▽ More

    Submitted 4 August, 2015; originally announced August 2015.

    Comments: 35 pages, 4 figures

  41. arXiv:1506.08826  [pdf, other

    math.ST stat.ME stat.ML

    Statistical Inference using the Morse-Smale Complex

    Authors: Yen-Chi Chen, Christopher R. Genovese, Larry Wasserman

    Abstract: The Morse-Smale complex of a function $f$ decomposes the sample space into cells where $f$ is increasing or decreasing. When applied to nonparametric density estimation and regression, it provides a way to represent, visualize, and compare multivariate functions. In this paper, we present some statistical results on estimating Morse-Smale complexes. This allows us to derive new results for two exi… ▽ More

    Submitted 3 April, 2017; v1 submitted 29 June, 2015; originally announced June 2015.

    Comments: 45 pages, 13 figures. Accepted to Electronic Journal of Statistics

    MSC Class: 62G20 (Primary); 62G05; 62G08 (Secondary)

  42. arXiv:1506.06266  [pdf, other

    math.ST

    Uniform Asymptotic Inference and the Bootstrap After Model Selection

    Authors: Ryan J. Tibshirani, Alessandro Rinaldo, Robert Tibshirani, Larry Wasserman

    Abstract: Recently, Tibshirani et al. (2016) proposed a method for making inferences about parameters defined by model selection, in a typical regression setting with normally distributed errors. Here, we study the large sample properties of this method, without assuming normality. We prove that the test statistic of Tibshirani et al. (2016) is asymptotically valid, as the number of samples n grows and the… ▽ More

    Submitted 9 August, 2017; v1 submitted 20 June, 2015; originally announced June 2015.

    Comments: 47 pages, 13 figures

  43. arXiv:1505.04215  [pdf, other

    stat.ML cs.AI cs.LG math.ST

    An Analysis of Active Learning With Uniform Feature Noise

    Authors: Aaditya Ramdas, Barnabas Poczos, Aarti Singh, Larry Wasserman

    Abstract: In active learning, the user sequentially chooses values for feature $X$ and an oracle returns the corresponding label $Y$. In this paper, we consider the effect of feature noise in active learning, which could arise either because $X$ itself is being measured, or it is corrupted in transmission to the oracle, or the oracle returns the label of a noisy version of the query point. In statistics, fe… ▽ More

    Submitted 15 May, 2015; originally announced May 2015.

    Comments: 24 pages, 2 figures, published in the proceedings of the 17th International Conference on Artificial Intelligence and Statistics (AISTATS), 2014

  44. arXiv:1505.00482  [pdf, other

    math.ST cs.LG stat.ML

    Risk Bounds For Mode Clustering

    Authors: Martin Azizyan, Yen-Chi Chen, Aarti Singh, Larry Wasserman

    Abstract: Density mode clustering is a nonparametric clustering method. The clusters are the basins of attraction of the modes of a density estimator. We study the risk of mode-based clustering. We show that the clustering risk over the cluster cores --- the regions where the density is high --- is very small even in high dimensions. And under a low noise condition, the overall cluster risk is small even be… ▽ More

    Submitted 3 May, 2015; originally announced May 2015.

  45. arXiv:1504.05438  [pdf, other

    stat.ME math.ST

    Density Level Sets: Asymptotics, Inference, and Visualization

    Authors: Yen-Chi Chen, Christopher R. Genovese, Larry Wasserman

    Abstract: We derive asymptotic theory for the plug-in estimate for density level sets under Hausdoff loss. Based on the asymptotic theory, we propose two bootstrap confidence regions for level sets. The confidence regions can be used to perform tests for anomaly detection and clustering. We also introduce a technique to visualize high dimensional density level sets by combining mode clustering and multidime… ▽ More

    Submitted 5 September, 2016; v1 submitted 21 April, 2015; originally announced April 2015.

    Comments: Accepted to JASA-T&M. 40 pages, 11 figures

    MSC Class: Primary: 62G20; Secondary 62G10; 62G15

  46. arXiv:1412.7197  [pdf, other

    math.ST cs.CG math.AT

    Robust Topological Inference: Distance To a Measure and Kernel Distance

    Authors: Frédéric Chazal, Brittany T. Fasy, Fabrizio Lecci, Bertrand Michel, Alessandro Rinaldo, Larry Wasserman

    Abstract: Let P be a distribution with support S. The salient features of S can be quantified with persistent homology, which summarizes topological features of the sublevel sets of the distance function (the distance of any point x to S). Given a sample from P we can infer the persistent homology using an empirical version of the distance function. However, the empirical distance function is highly non-rob… ▽ More

    Submitted 22 December, 2014; originally announced December 2014.

  47. arXiv:1412.1716  [pdf, ps, other

    stat.ME math.ST stat.ML

    Nonparametric modal regression

    Authors: Yen-Chi Chen, Christopher R. Genovese, Ryan J. Tibshirani, Larry Wasserman

    Abstract: Modal regression estimates the local modes of the distribution of $Y$ given $X=x$, instead of the mean, as in the usual regression sense, and can hence reveal important structure missed by usual regression methods. We study a simple nonparametric method for modal regression, based on a kernel density estimate (KDE) of the joint distribution of $Y$ and $X$. We derive asymptotic error bounds for thi… ▽ More

    Submitted 30 March, 2016; v1 submitted 4 December, 2014; originally announced December 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1373 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1373

    Journal ref: Annals of Statistics 2016, Vol. 44, No. 2, 489-514

  48. arXiv:1411.6314  [pdf, other

    math.ST cs.AI cs.IT cs.LG stat.ML

    On the High-dimensional Power of Linear-time Kernel Two-Sample Testing under Mean-difference Alternatives

    Authors: Aaditya Ramdas, Sashank J. Reddi, Barnabas Poczos, Aarti Singh, Larry Wasserman

    Abstract: Nonparametric two sample testing deals with the question of consistently deciding if two distributions are different, given samples from both, without making any parametric assumptions about the form of the distributions. The current literature is split into two kinds of tests - those which are consistent without any assumptions about how the distributions may differ (\textit{general} alternatives… ▽ More

    Submitted 23 November, 2014; originally announced November 2014.

    Comments: 25 pages, 5 figures

  49. arXiv:1406.5663  [pdf, ps, other

    stat.ME math.ST

    Asymptotic theory for density ridges

    Authors: Yen-Chi Chen, Christopher R. Genovese, Larry Wasserman

    Abstract: The large sample theory of estimators for density modes is well understood. In this paper we consider density ridges, which are a higher-dimensional extension of modes. Modes correspond to zero-dimensional, local high-density regions in point clouds. Density ridges correspond to $s$-dimensional, local high-density regions in point clouds. We establish three main results. First we show that under a… ▽ More

    Submitted 13 October, 2015; v1 submitted 21 June, 2014; originally announced June 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1329 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1329

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 5, 1896-1928

  50. arXiv:1406.2240  [pdf, other

    math.ST stat.ML

    Feature Selection For High-Dimensional Clustering

    Authors: Larry Wasserman, Martin Azizyan, Aarti Singh

    Abstract: We present a nonparametric method for selecting informative features in high-dimensional clustering problems. We start with a screening step that uses a test for multimodality. Then we apply kernel density estimation and mode clustering to the selected features. The output of the method consists of a list of relevant features, and cluster assignments. We provide explicit bounds on the error rate o… ▽ More

    Submitted 9 June, 2014; originally announced June 2014.

    Comments: 11 pages, 2 figures