Skip to main content

Showing 1–50 of 110 results for author: Ramdas, A

Searching in archive math. Search in all archives.
.
  1. arXiv:2505.01987  [pdf, other

    math.ST

    Sharp empirical Bernstein bounds for the variance of bounded random variables

    Authors: Diego Martinez-Taboada, Aaditya Ramdas

    Abstract: We develop novel empirical Bernstein inequalities for the variance of bounded random variables. Our inequalities hold under constant conditional variance and mean, without further assumptions like independence or identical distribution of the random variables, making them suitable for sequential decision making contexts. The results are instantiated for both the batch setting (where the sample siz… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  2. arXiv:2505.00292  [pdf, other

    math.ST eess.SP stat.ME

    Conformal changepoint localization

    Authors: Sanjit Dandapanthula, Aaditya Ramdas

    Abstract: Changepoint localization is the problem of estimating the index at which a change occurred in the data generating distribution of an ordered list of data, or declaring that no change occurred. We present the broadly applicable CONCH (CONformal CHangepoint localization) algorithm, which uses a matrix of conformal p-values to produce a confidence interval for a (single) changepoint under the mild as… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  3. arXiv:2504.21647  [pdf, other

    stat.ME math.ST stat.ML

    Conditional independence testing with a single realization of a multivariate nonstationary nonlinear time series

    Authors: Michael Wieck-Sosa, Michel F. C. Haddad, Aaditya Ramdas

    Abstract: Identifying relationships among stochastic processes is a key goal in disciplines that deal with complex temporal systems, such as economics. While the standard toolkit for multivariate time series analysis has many advantages, it can be difficult to capture nonlinear dynamics using linear vector autoregressive models. This difficulty has motivated the development of methods for variable selection… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

  4. arXiv:2504.19952  [pdf, ps, other

    math.ST cs.LG stat.ML

    On Stopping Times of Power-one Sequential Tests: Tight Lower and Upper Bounds

    Authors: Shubhada Agrawal, Aaditya Ramdas

    Abstract: We prove two lower bounds for stopping times of sequential tests between general composite nulls and alternatives. The first lower bound is for the setting where the type-1 error level $α$ approaches zero, and equals $\log(1/α)$ divided by a certain infimum KL divergence, termed $\operatorname{KL_{inf}}$. The second lower bound applies to the setting where $α$ is fixed and… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 36 pages

  5. arXiv:2504.11759  [pdf, other

    stat.ME math.ST

    Bringing closure to FDR control: beating the e-Benjamini-Hochberg procedure

    Authors: Ziyu Xu, Lasse Fischer, Aaditya Ramdas

    Abstract: False discovery rate (FDR) has been a key metric for error control in multiple hypothesis testing, and many methods have developed for FDR control across a diverse cross-section of settings and applications. We develop a closure principle for all FDR controlling procedures, i.e., we provide a characterization based on e-values for all admissible FDR controlling procedures. A general version of thi… ▽ More

    Submitted 22 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: 18 pages, 1 figure

  6. arXiv:2504.02974  [pdf, ps, other

    math.ST

    E-variables for hypotheses generated by constraints

    Authors: Martin Larsson, Aaditya Ramdas, Johannes Ruf

    Abstract: An e-variable for a family of distributions $\mathcal{P}$ is a nonnegative random variable whose expected value under every distribution in $\mathcal{P}$ is at most one. E-variables have recently been recognized as fundamental objects in hypothesis testing, and a rapidly growing body of work has attempted to derive admissible or optimal e-variables for various families $\mathcal{P}$. In this paper… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

  7. arXiv:2503.21639  [pdf, other

    math.ST stat.ME stat.ML

    Locally minimax optimal and dimension-agnostic discrete argmin inference

    Authors: Ilmun Kim, Aaditya Ramdas

    Abstract: This paper tackles a fundamental inference problem: given $n$ observations from a $d$ dimensional vector with unknown mean $\boldsymbolμ$, we must form a confidence set for the index (or indices) corresponding to the smallest component of $\boldsymbolμ$. By duality, we reduce this to testing, for each $r$ in $1,\ldots,d$, whether $μ_r$ is the smallest. Based on the sample splitting and self-normal… ▽ More

    Submitted 1 May, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

  8. arXiv:2502.08539  [pdf, ps, other

    stat.ME math.ST

    Anytime-valid FDR control with the stopped e-BH procedure

    Authors: Hongjian Wang, Sanjit Dandapanthula, Aaditya Ramdas

    Abstract: The recent e-Benjamini-Hochberg (e-BH) procedure for multiple hypothesis testing is known to control the false discovery rate (FDR) under arbitrary dependence between the input e-values. This paper points out an important subtlety when applying the e-BH procedure with e-processes, which are sequential generalizations of e-values (where the data are observed sequentially). Since adaptively stopped… ▽ More

    Submitted 30 April, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  9. arXiv:2502.06188  [pdf, ps, other

    math.PR math.ST

    Nonasymptotic and distribution-uniform Komlós-Major-Tusnády approximation

    Authors: Ian Waudby-Smith, Martin Larsson, Aaditya Ramdas

    Abstract: We present nonasymptotic concentration inequalities for sums of independent and identically distributed random variables that yield asymptotic strong Gaussian approximations of Komlós, Major, and Tusnády (KMT) [1975,1976]. The constants appearing in our inequalities are either universal or explicit, and thus as corollaries, they imply distribution-uniform generalizations of the aforementioned KMT… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: 27 pages

  10. arXiv:2501.04130  [pdf, other

    math.ST eess.SP stat.ME

    Multiple testing in multi-stream sequential change detection

    Authors: Sanjit Dandapanthula, Aaditya Ramdas

    Abstract: Multi-stream sequential change detection involves simultaneously monitoring many streams of data and trying to detect when their distributions change, if at all. Here, we theoretically study multiple testing issues that arise from detecting changes in many streams. We point out that any algorithm with finite average run length (ARL) must have a trivial worst-case false detection rate (FDR), family… ▽ More

    Submitted 3 February, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

  11. arXiv:2411.11271  [pdf, other

    math.ST math.PR stat.ME

    Mean Estimation in Banach Spaces Under Infinite Variance and Martingale Dependence

    Authors: Justin Whitehouse, Ben Chugg, Diego Martinez-Taboada, Aaditya Ramdas

    Abstract: We consider estimating the shared mean of a sequence of heavy-tailed random variables taking values in a Banach space. In particular, we revisit and extend a simple truncation-based mean estimator first proposed by Catoni and Giulini. While existing truncation-based approaches require a bound on the raw (non-central) second moment of observations, our results hold under a bound on either the centr… ▽ More

    Submitted 24 March, 2025; v1 submitted 17 November, 2024; originally announced November 2024.

    Comments: 31 pages, 2 figures

  12. arXiv:2411.09516  [pdf, ps, other

    math.PR math.FA math.ST stat.ML

    Sharp Matrix Empirical Bernstein Inequalities

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: We present two sharp, closed-form empirical Bernstein inequalities for symmetric random matrices with bounded eigenvalues. By sharp, we mean that both inequalities adapt to the unknown variance in a tight manner: the deviation captured by the first-order $1/\sqrt{n}$ term asymptotically matches the matrix Bernstein inequality exactly, including constants, the latter requiring knowledge of the vari… ▽ More

    Submitted 2 April, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

  13. arXiv:2410.23614  [pdf, other

    math.ST stat.ME

    Hypothesis testing with e-values

    Authors: Aaditya Ramdas, Ruodu Wang

    Abstract: This book is written to offer a humble, but unified, treatment of e-values in hypothesis testing. It is organized into three parts: Fundamental Concepts, Core Ideas, and Advanced Topics. The first part includes four chapters that introduce the basic concepts. The second part includes five chapters of core ideas such as universal inference, log-optimality, e-processes, operations on e-values, and e… ▽ More

    Submitted 4 May, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

  14. arXiv:2409.06060  [pdf, other

    math.ST

    Empirical Bernstein in smooth Banach spaces

    Authors: Diego Martinez-Taboada, Aaditya Ramdas

    Abstract: Existing concentration bounds for bounded vector-valued random variables include extensions of the scalar Hoeffding and Bernstein inequalities. While the latter is typically tighter, it requires knowing a bound on the variance of the random variables. We derive a new vector-valued empirical Bernstein inequality, which makes use of an empirical estimator of the variance instead of the true variance… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  15. arXiv:2408.09598  [pdf, other

    stat.ME econ.EM math.ST stat.ML

    Anytime-Valid Inference for Double/Debiased Machine Learning of Causal Parameters

    Authors: Abhinandan Dalal, Patrick Blöbaum, Shiva Kasiviswanathan, Aaditya Ramdas

    Abstract: Double (debiased) machine learning (DML) has seen widespread use in recent years for learning causal/structural parameters, in part due to its flexibility and adaptability to high-dimensional nuisance functions as well as its ability to avoid bias from regularization or overfitting. However, the classic double-debiased framework is only valid asymptotically for a predetermined sample size, thus la… ▽ More

    Submitted 10 September, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

  16. arXiv:2408.05998  [pdf, ps, other

    math.PR

    Matrix Concentration: Order versus Anti-order

    Authors: Reihaneh Malekian, Aaditya Ramdas

    Abstract: The matrix Markov inequality by Ahlswede was stated using the Loewner anti-order between positive definite matrices. Wang use this to derive several other Chebyshev and Chernoff-type inequalities (Hoeffding, Bernstein, empirical Bernstein) in the Loewner anti-order, including self-normalized matrix martingale inequalities. These imply upper tail bounds on the maximum eigenvalue, such as those deve… ▽ More

    Submitted 13 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

  17. arXiv:2407.11465  [pdf, ps, other

    math.ST math.PR q-fin.MF stat.ME

    Testing by Betting while Borrowing and Bargaining

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: Testing by betting has been a cornerstone of the game-theoretic statistics literature. In this framework, a betting score (or more generally an e-process), as opposed to a traditional p-value, is used to quantify the evidence against a null hypothesis: the higher the betting score, the more money one has made betting against the null, and thus the larger the evidence that the null is false. A key… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  18. Combining exchangeable p-values

    Authors: Matteo Gasparin, Ruodu Wang, Aaditya Ramdas

    Abstract: The problem of combining p-values is an old and fundamental one, and the classic assumption of independence is often violated or unverifiable in many applications. There are many well-known rules that can combine a set of arbitrarily dependent p-values (for the same hypothesis) into a single p-value. We show that essentially all these existing rules can be strictly improved when the p-values are e… ▽ More

    Submitted 20 March, 2025; v1 submitted 4 April, 2024; originally announced April 2024.

  19. arXiv:2402.18810  [pdf, ps, other

    math.ST stat.ME

    The numeraire e-variable and reverse information projection

    Authors: Martin Larsson, Aaditya Ramdas, Johannes Ruf

    Abstract: We consider testing a composite null hypothesis $\mathcal{P}$ against a point alternative $\mathsf{Q}$ using e-variables, which are nonnegative random variables $X$ such that $\mathbb{E}_\mathsf{P}[X] \leq 1$ for every $\mathsf{P} \in \mathcal{P}$. This paper establishes a fundamental result: under no conditions whatsoever on $\mathcal{P}$ or $\mathsf{Q}$, there exists a special e-variable $X^*$ t… ▽ More

    Submitted 3 February, 2025; v1 submitted 28 February, 2024; originally announced February 2024.

  20. arXiv:2402.09698  [pdf, other

    stat.ME cs.LG math.PR math.ST stat.ML

    Combining Evidence Across Filtrations

    Authors: Yo Joong Choe, Aaditya Ramdas

    Abstract: In sequential anytime-valid inference, any admissible procedure must be based on e-processes: generalizations of test martingales that quantify the accumulated evidence against a composite null hypothesis at any stopping time. This paper proposes a method for combining e-processes constructed in different filtrations but for the same null. Although e-processes in the same filtration can be combine… ▽ More

    Submitted 15 February, 2025; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Under review. Previous title was "Combining Evidence Across Filtrations Using Adjusters". Code is available at https://github.com/yjchoe/CombiningEvidenceAcrossFiltrations

  21. arXiv:2402.00713  [pdf, ps, other

    math.PR math.ST

    Distribution-uniform strong laws of large numbers

    Authors: Ian Waudby-Smith, Martin Larsson, Aaditya Ramdas

    Abstract: We revisit the question of whether the strong law of large numbers (SLLN) holds uniformly in a rich family of distributions, culminating in a distribution-uniform generalization of the Marcinkiewicz-Zygmund SLLN. These results can be viewed as extensions of Chung's distribution-uniform SLLN to random variables with uniformly integrable $q^\text{th}$ absolute central moments for $0 < q < 2$. Furthe… ▽ More

    Submitted 21 October, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 32 pages

  22. arXiv:2401.15567  [pdf, other

    math.PR math.FA math.ST stat.ME stat.ML

    Positive Semidefinite Matrix Supermartingales

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: We explore the asymptotic convergence and nonasymptotic maximal inequalities of supermartingales and backward submartingales in the space of positive semidefinite matrices. These are natural matrix analogs of scalar nonnegative supermartingales and backward nonnegative submartingales, whose convergence and maximal inequalities are the theoretical foundations for a wide and ever-growing body of res… ▽ More

    Submitted 28 January, 2025; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: substantial revision v4 (including a title change)

    MSC Class: 60B20; 60G48; 62L10

  23. arXiv:2401.15063  [pdf, other

    stat.ME math.ST stat.OT

    Graph fission and cross-validation

    Authors: James Leiner, Aaditya Ramdas

    Abstract: We introduce a technique called graph fission which takes in a graph which potentially contains only one observation per node (whose distribution lies in a known class) and produces two (or more) independent graphs with the same node/edge set in a way that splits the original graph's information amongst them in any desired proportion. Our proposal builds on data fission/thinning, a method that use… ▽ More

    Submitted 29 January, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 19 pages, 9 figures

  24. arXiv:2311.08168  [pdf, other

    math.ST cs.IT stat.ME stat.ML

    Time-Uniform Confidence Spheres for Means of Random Vectors

    Authors: Ben Chugg, Hongjian Wang, Aaditya Ramdas

    Abstract: We study sequential mean estimation in $\mathbb{R}^d$. In particular, we derive time-uniform confidence spheres -- confidence sphere sequences (CSSs) -- which contain the mean of random vectors with high probability simultaneously across all sample sizes. Our results include a dimension-free CSS for log-concave random vectors, a dimension-free CSS for sub-Gaussian random vectors, and CSSs for sub-… ▽ More

    Submitted 14 May, 2025; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 52 pages; 3 figures. Published in Transactions on Machine Learning Research

  25. arXiv:2311.03343  [pdf, other

    math.ST stat.ME

    Distribution-uniform anytime-valid sequential inference

    Authors: Ian Waudby-Smith, Edward H. Kennedy, Aaditya Ramdas

    Abstract: Are asymptotic confidence sequences and anytime $p$-values uniformly valid for a nontrivial class of distributions $\mathcal{P}$? We give a positive answer to this question by deriving distribution-uniform anytime-valid inference procedures. Historically, anytime-valid methods -- including confidence sequences, anytime $p$-values, and sequential hypothesis tests that enable inference at stopping t… ▽ More

    Submitted 18 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  26. arXiv:2310.09100  [pdf, other

    math.PR math.ST stat.ME

    Time-Uniform Self-Normalized Concentration for Vector-Valued Processes

    Authors: Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas

    Abstract: Self-normalized processes arise naturally in many learning-related tasks. While self-normalized concentration has been extensively studied for scalar-valued processes, there are few results for multidimensional processes outside of the sub-Gaussian setting. In this work, we construct a general, self-normalized inequality for multivariate processes that satisfy a simple yet broad sub-$ψ$ tail condi… ▽ More

    Submitted 30 April, 2025; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 49 pages, 4 figures

  27. arXiv:2310.03722  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Anytime-valid t-tests and confidence sequences for Gaussian means with unknown variance

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: In 1976, Lai constructed a nontrivial confidence sequence for the mean $μ$ of a Gaussian distribution with unknown variance $σ^2$. Curiously, he employed both an improper (right Haar) mixture over $σ$ and an improper (flat) mixture over $μ$. Here, we elaborate carefully on the details of his construction, which use generalized nonintegrable martingales and an extended Ville's inequality. While thi… ▽ More

    Submitted 6 November, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Substantive revision in v3 (Apr 23 2024); Final revision in v4 (Nov 6 2024) accepted by the journal Sequential Analysis

  28. arXiv:2310.01547  [pdf, other

    math.ST cs.IT cs.LG stat.AP stat.ML

    On the near-optimality of betting confidence sets for bounded means

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: Constructing nonasymptotic confidence intervals (CIs) for the mean of a univariate distribution from independent and identically distributed (i.i.d.) observations is a fundamental task in statistics. For bounded observations, a classical nonparametric approach proceeds by inverting standard concentration bounds, such as Hoeffding's or Bernstein's inequalities. Recently, an alternative betting-base… ▽ More

    Submitted 24 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 53 pages, 2 figures

  29. arXiv:2309.09111  [pdf, ps, other

    math.ST cs.LG stat.ME stat.ML

    Reducing sequential change detection to sequential estimation

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We consider the problem of sequential change detection, where the goal is to design a scheme for detecting any changes in a parameter or functional $θ$ of the data stream distribution that has small detection delay, but guarantees control on the frequency of false alarms in the absence of changes. In this paper, we describe a simple reduction from sequential change detection to sequential estimati… ▽ More

    Submitted 24 November, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 11 pages

  30. arXiv:2307.07539  [pdf, ps, other

    cs.LG math.ST stat.ML

    On the Sublinear Regret of GP-UCB

    Authors: Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas

    Abstract: In the kernelized bandit problem, a learner aims to sequentially compute the optimum of a function lying in a reproducing kernel Hilbert space given only noisy evaluations at sequentially chosen points. In particular, the learner aims to minimize regret, which is a measure of the suboptimality of the choices made. Arguably the most popular algorithm is the Gaussian Process Upper Confidence Bound (… ▽ More

    Submitted 14 August, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 20 pages, 0 figures

  31. arXiv:2305.16539  [pdf, other

    math.ST cs.IT math.PR stat.ME

    On the existence of powerful p-values and e-values for composite hypotheses

    Authors: Zhenyuan Zhang, Aaditya Ramdas, Ruodu Wang

    Abstract: Given a composite null $ \mathcal P$ and composite alternative $ \mathcal Q$, when and how can we construct a p-value whose distribution is exactly uniform under the null, and stochastically smaller than uniform under the alternative? Similarly, when and how can we construct an e-value whose expectation exactly equals one under the null, but its expected logarithm under the alternative is positive… ▽ More

    Submitted 30 November, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 47 pages, 6 figures; The Annals of Statistics 52 (5), 2241--2267

  32. arXiv:2305.06884  [pdf, ps, other

    stat.ME cs.AI cs.LG math.ST stat.AP stat.ML

    Risk-limiting Financial Audits via Weighted Sampling without Replacement

    Authors: Shubhanshu Shekhar, Ziyu Xu, Zachary C. Lipton, Pierre J. Liang, Aaditya Ramdas

    Abstract: We introduce the notion of a risk-limiting financial auditing (RLFA): given $N$ transactions, the goal is to estimate the total misstated monetary fraction~($m^*$) to a given accuracy $ε$, with confidence $1-δ$. We do this by constructing new confidence sequences (CSs) for the weighted average of $N$ unknown values, based on samples drawn without replacement according to a (randomized) weighted sa… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 23 pages, 8 figures, to appear in the Proceedings of Uncertainty in Artificial Intelligence (UAI) 2023

  33. arXiv:2305.00143  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Sequential Predictive Two-Sample and Independence Testing

    Authors: Aleksandr Podkopaev, Aaditya Ramdas

    Abstract: We study the problems of sequential nonparametric two-sample and independence testing. Sequential tests process data online and allow using observed data to decide whether to stop and reject the null hypothesis or to collect more data, while maintaining type I error control. We build upon the principle of (nonparametric) testing by betting, where a gambler places bets on future observations and th… ▽ More

    Submitted 19 July, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

  34. arXiv:2305.00070  [pdf, other

    cs.LG cs.AI math.ST stat.ME stat.ML

    Online Platt Scaling with Calibeating

    Authors: Chirag Gupta, Aaditya Ramdas

    Abstract: We present an online post-hoc calibration method, called Online Platt Scaling (OPS), which combines the Platt scaling technique with online logistic regression. We demonstrate that OPS smoothly adapts between i.i.d. and non-i.i.d. settings with distribution drift. Further, in scenarios where the best Platt scaling model is itself miscalibrated, we enhance OPS by incorporating a recently developed… ▽ More

    Submitted 16 August, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: ICML 2023; 24 pages and 16 figures

  35. arXiv:2304.03927  [pdf, ps, other

    math.ST math.PR

    De Finetti's theorem and related results for infinite weighted exchangeable sequences

    Authors: Rina Foygel Barber, Emmanuel J. Candes, Aaditya Ramdas, Ryan J. Tibshirani

    Abstract: De Finetti's theorem, also called the de Finetti-Hewitt-Savage theorem, is a foundational result in probability and statistics. Roughly, it says that an infinite sequence of exchangeable random variables can always be written as a mixture of independent and identically distributed (i.i.d.) sequences of random variables. In this paper, we consider a weighted generalization of exchangeability that a… ▽ More

    Submitted 27 November, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

  36. arXiv:2304.02611  [pdf, other

    math.ST cs.IT math.PR stat.ME

    Randomized and Exchangeable Improvements of Markov's, Chebyshev's and Chernoff's Inequalities

    Authors: Aaditya Ramdas, Tudor Manole

    Abstract: We present simple randomized and exchangeable improvements of Markov's inequality, as well as Chebyshev's inequality and Chernoff bounds. Our variants are never worse and typically strictly more powerful than the original inequalities. The proofs are short and elementary, and can easily yield similarly randomized or exchangeable versions of a host of other inequalities that employ Markov's inequal… ▽ More

    Submitted 9 May, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  37. arXiv:2304.01163  [pdf, ps, other

    math.PR math.ST stat.ME stat.ML

    The extended Ville's inequality for nonintegrable nonnegative supermartingales

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: Following the initial work by Robbins, we rigorously present an extended theory of nonnegative supermartingales, requiring neither integrability nor finiteness. In particular, we derive a key maximal inequality foreshadowed by Robbins, which we call the extended Ville's inequality, that strengthens the classical Ville's inequality (for integrable nonnegative supermartingales), and also applies to… ▽ More

    Submitted 8 October, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

  38. arXiv:2302.03421  [pdf, ps, other

    stat.ML cs.IT cs.LG math.ST

    A unified recipe for deriving (time-uniform) PAC-Bayes bounds

    Authors: Ben Chugg, Hongjian Wang, Aaditya Ramdas

    Abstract: We present a unified framework for deriving PAC-Bayesian generalization bounds. Unlike most previous literature on this topic, our bounds are anytime-valid (i.e., time-uniform), meaning that they hold at all stopping times, not only for a fixed sample size. Our approach combines four tools in the following order: (a) nonnegative supermartingales or reverse submartingales, (b) the method of mixture… ▽ More

    Submitted 3 January, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 56 pages. Published in the Journal of Machine Learning Research, Volume 24 Issue 372

  39. arXiv:2302.02544  [pdf, other

    math.ST cs.IT cs.LG stat.ME stat.ML

    Sequential change detection via backward confidence sequences

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We present a simple reduction from sequential estimation to sequential changepoint detection (SCD). In short, suppose we are interested in detecting changepoints in some parameter or functional $θ$ of the underlying distribution. We demonstrate that if we can construct a confidence sequence (CS) for $θ$, then we can also successfully perform SCD for $θ$. This is accomplished by checking if two CSs… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

    Comments: 24 pages, 10 figures

  40. arXiv:2301.09573  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Huber-Robust Confidence Sequences

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: Confidence sequences are confidence intervals that can be sequentially tracked, and are valid at arbitrary data-dependent stopping times. This paper presents confidence sequences for a univariate mean of an unknown distribution with a known upper bound on the $p$-th central moment ($p$ > 1), but allowing for (at most) $ε$ fraction of arbitrary distribution corruption, as in Huber's contamination m… ▽ More

    Submitted 7 February, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted for publication at the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  41. arXiv:2301.03542  [pdf, ps, other

    math.ST stat.ME

    A Sequential Test for Log-Concavity

    Authors: Aditya Gangrade, Alessandro Rinaldo, Aaditya Ramdas

    Abstract: On observing a sequence of i.i.d.\ data with distribution $P$ on $\mathbb{R}^d$, we ask the question of how one can test the null hypothesis that $P$ has a log-concave density. This paper proves one interesting negative and positive result: the non-existence of test (super)martingales, and the consistency of universal inference. To elaborate, the set of log-concave distributions $\mathcal{L}$ is a… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

  42. arXiv:2212.09706  [pdf, ps, other

    math.ST math.PR stat.ME

    Multiple testing under negative dependence

    Authors: Ziyu Chi, Aaditya Ramdas, Ruodu Wang

    Abstract: The multiple testing literature has primarily dealt with three types of dependence assumptions between p-values: independence, positive regression dependence, and arbitrary dependence. In this paper, we provide what we believe are the first theoretical results under various notions of negative dependence (negative Gaussian dependence, negative regression dependence, negative association, negative… ▽ More

    Submitted 8 May, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 28 pages, 5 figures

  43. arXiv:2212.09108  [pdf, ps, other

    stat.ME cs.LG math.ST stat.ML

    A Permutation-Free Kernel Independence Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: In nonparametric independence testing, we observe i.i.d.\ data $\{(X_i,Y_i)\}_{i=1}^n$, where $X \in \mathcal{X}, Y \in \mathcal{Y}$ lie in any general spaces, and we wish to test the null that $X$ is independent of $Y$. Modern test statistics such as the kernel Hilbert-Schmidt Independence Criterion (HSIC) and Distance Covariance (dCov) have intractable null distributions due to the degeneracy of… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: 52 pages, 4 figures

  44. arXiv:2212.07383  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Sequential Kernelized Independence Testing

    Authors: Aleksandr Podkopaev, Patrick Blöbaum, Shiva Prasad Kasiviswanathan, Aaditya Ramdas

    Abstract: Independence testing is a classical statistical problem that has been extensively studied in the batch setting when one fixes the sample size before collecting data. However, practitioners often prefer procedures that adapt to the complexity of a problem at hand instead of setting sample size in advance. Ideally, such procedures should (a) stop earlier on easy tasks (and later on harder tasks), he… ▽ More

    Submitted 19 July, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: To appear at ICML 2023

  45. arXiv:2211.14908  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    A Permutation-free Kernel Two-Sample Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: The kernel Maximum Mean Discrepancy~(MMD) is a popular multivariate distance metric between distributions that has found utility in two-sample testing. The usual kernel-MMD test statistic is a degenerate U-statistic under the null, and thus it has an intractable limiting distribution. Hence, to design a level-$α$ test, one usually selects the rejection threshold as the $(1-α)$-quantile of the perm… ▽ More

    Submitted 4 February, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Published at the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), with an oral presentation

  46. arXiv:2210.10768  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Anytime-valid off-policy inference for contextual bandits

    Authors: Ian Waudby-Smith, Lili Wu, Aaditya Ramdas, Nikos Karampatziakis, Paul Mineiro

    Abstract: Contextual bandit algorithms are ubiquitous tools for active sequential experimentation in healthcare and the tech industry. They involve online learning algorithms that adaptively learn policies over time to map observed contexts $X_t$ to actions $A_t$ in an attempt to maximize stochastic rewards $R_t$. This adaptivity raises interesting but hard statistical inference questions, especially counte… ▽ More

    Submitted 15 August, 2024; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 43 pages, 6 figures

  47. arXiv:2210.01948  [pdf, ps, other

    math.ST cs.GT cs.IT stat.ME

    Game-theoretic statistics and safe anytime-valid inference

    Authors: Aaditya Ramdas, Peter Grünwald, Vladimir Vovk, Glenn Shafer

    Abstract: Safe anytime-valid inference (SAVI) provides measures of statistical evidence and certainty -- e-processes for testing and confidence sequences for estimation -- that remain valid at all stopping times, accommodating continuous monitoring and analysis of accumulating data and optional stopping or continuation for any reason. These measures crucially rely on test martingales, which are nonnegative… ▽ More

    Submitted 17 June, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 25 pages. Under review. ArXiv does not compile/space some references properly

  48. arXiv:2204.12447  [pdf, other

    stat.ME math.ST

    E-values as unnormalized weights in multiple testing

    Authors: Nikolaos Ignatiadis, Ruodu Wang, Aaditya Ramdas

    Abstract: We study how to combine p-values and e-values, and design multiple testing procedures where both p-values and e-values are available for every hypothesis. Our results provide a new perspective on multiple testing with data-driven weights: while standard weighted multiple testing methods require the weights to deterministically add up to the number of hypotheses being tested, we show that this norm… ▽ More

    Submitted 18 July, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

  49. arXiv:2203.12572  [pdf, other

    math.ST stat.ME

    Post-selection inference for e-value based confidence intervals

    Authors: Ziyu Xu, Ruodu Wang, Aaditya Ramdas

    Abstract: Suppose that one can construct a valid $(1-δ)$-confidence interval (CI) for each of $K$ parameters of potential interest. If a data analyst uses an arbitrary data-dependent criterion to select some subset $S$ of parameters, then the aforementioned CIs for the selected parameters are no longer valid due to selection bias. We design a new method to adjust the intervals in order to control the false… ▽ More

    Submitted 27 February, 2024; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: 46 pages, 6 figures

    Journal ref: Electronic Journal of Statistics 18(1): 2292-2338 (2024)

  50. arXiv:2203.04485  [pdf, ps, other

    math.PR cs.GT math.ST

    A composite generalization of Ville's martingale theorem

    Authors: Johannes Ruf, Martin Larsson, Wouter M. Koolen, Aaditya Ramdas

    Abstract: We provide a composite version of Ville's theorem that an event has zero measure if and only if there exists a nonnegative martingale which explodes to infinity when that event occurs. This is a classic result connecting measure-theoretic probability to the sequence-by-sequence game-theoretic probability, recently developed by Shafer and Vovk. Our extension of Ville's result involves appropriate c… ▽ More

    Submitted 3 May, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: 21 pages