
Showing 1–50 of 71 results for author: Rinaldo, A

  1. arXiv:2310.11982  [pdf, other]

    math.ST

    On the estimation of persistence intensity functions and linear representations of persistence diagrams

    Authors: Weichen Wu, Jisu Kim, Alessandro Rinaldo

    Abstract: The prevailing statistical approach to analyzing persistence diagrams is concerned with filtering out topological noise. In this paper, we adopt a different viewpoint and aim at estimating the actual distribution of a random persistence diagram, which captures both topological signal and noise. To that effect, Chazal and Divol (2019) proved that, under general conditions, the expected value of a r…

    Submitted 25 October, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  2. arXiv:2307.00795  [pdf, other]

    math.ST

    Inference for Projection Parameters in Linear Regression: beyond $d = o(n^{1/2})$

    Authors: Woonyoung Chang, Arun Kumar Kuchibhotla, Alessandro Rinaldo

    Abstract: We consider the problem of inference for projection parameters in linear regression with increasing dimensions. This problem has been studied under a variety of assumptions in the literature. The classical asymptotic normality result for the least squares estimator of the projection parameter only holds when the dimension $d$ of the covariates is of a smaller order than $n^{1/2}$, where $n$ is the…

    Submitted 11 January, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Updated Jan 11, 2024

  3. arXiv:2306.14299  [pdf, ps, other]

    math.PR math.ST

    Dual Induction CLT for High-dimensional m-dependent Data

    Authors: Heejong Bong, Arun Kumar Kuchibhotla, Alessandro Rinaldo

    Abstract: We derive novel and sharp high-dimensional Berry--Esseen bounds for the sum of $m$-dependent random vectors over the class of hyper-rectangles exhibiting only a poly-logarithmic dependence in the dimension. Our results hold under minimal assumptions, such as non-degenerate covariances and finite third moments, and yield a sample complexity of order $\sqrt{m/n}$, aside from logarithmic terms, match…

    Submitted 16 November, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: 25 pages

    MSC Class: 60B12; 60F05

  4. arXiv:2305.19001  [pdf, other]

    stat.ML cs.IT cs.LG math.OC math.ST

    High-probability sample complexities for policy evaluation with linear function approximation

    Authors: Gen Li, Weichen Wu, Yuejie Chi, Cong Ma, Alessandro Rinaldo, Yuting Wei

    Abstract: This paper is concerned with the problem of policy evaluation with linear function approximation in discounted infinite horizon Markov decision processes. We investigate the sample complexities required to guarantee a predefined estimation error of the best linear coefficients for two widely-used policy evaluation algorithms: the temporal difference (TD) learning algorithm and the two-timescale li…

    Submitted 2 May, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: The first two authors contributed equally; paper accepted to IEEE Transactions on Information Theory

  5. arXiv:2301.03542  [pdf, ps, other]

    math.ST stat.ME

    A Sequential Test for Log-Concavity

    Authors: Aditya Gangrade, Alessandro Rinaldo, Aaditya Ramdas

    Abstract: On observing a sequence of i.i.d. data with distribution $P$ on $\mathbb{R}^d$, we ask the question of how one can test the null hypothesis that $P$ has a log-concave density. This paper proves one interesting negative and positive result: the non-existence of test (super)martingales, and the consistency of universal inference. To elaborate, the set of log-concave distributions $\mathcal{L}$ is a…

    Submitted 9 January, 2023; originally announced January 2023.

  6. arXiv:2212.05355  [pdf, ps, other]

    math.PR math.ST

    High-dimensional Berry-Esseen Bound for $m$-Dependent Random Samples

    Authors: Heejong Bong, Arun Kumar Kuchibhotla, Alessandro Rinaldo

    Abstract: In this work, we provide a $(n/m)^{-1/2}$-rate finite sample Berry-Esseen bound for $m$-dependent high-dimensional random vectors over the class of hyper-rectangles. This bound imposes minimal assumptions on the random vectors such as nondegenerate covariances and finite third moments. The proof uses inductive relationships between anti-concentration inequalities and Berry--Esseen bounds, which ar…

    Submitted 10 December, 2022; originally announced December 2022.

  7. arXiv:2205.12937  [pdf, other]

    math.ST cs.LG stat.ML

    Mitigating multiple descents: A model-agnostic framework for risk monotonization

    Authors: Pratik Patil, Arun Kumar Kuchibhotla, Yuting Wei, Alessandro Rinaldo

    Abstract: Recent empirical and theoretical analyses of several commonly used prediction procedures reveal a peculiar risk behavior in high dimensions, referred to as double/multiple descent, in which the asymptotic risk is a non-monotonic function of the limiting aspect ratio of the number of features or parameters to the sample size. To mitigate this undesirable behavior, we develop a general framework for…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: 110 pages, 15 figures

  8. arXiv:2205.12431  [pdf, other]

    stat.ME math.ST

    Detecting Abrupt Changes in Sequential Pairwise Comparison Data

    Authors: Wanshan Li, Daren Wang, Alessandro Rinaldo

    Abstract: The Bradley-Terry-Luce (BTL) model is a classic and very popular statistical approach for eliciting a global ranking among a collection of items using pairwise comparison data. In applications in which the comparison outcomes are observed as a time series, it is often the case that data are non-stationary, in the sense that the true underlying ranking changes over time. In this paper we are concer…

    Submitted 29 November, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: 37 pages, 4 figures, 7 tables

  9. arXiv:2203.03532  [pdf, ps, other]

    stat.ME math.ST stat.ML

    E-detectors: a nonparametric framework for sequential change detection

    Authors: Jaehyeok Shin, Aaditya Ramdas, Alessandro Rinaldo

    Abstract: Sequential change detection is a classical problem with a variety of applications. However, the majority of prior work has been parametric, for example, focusing on exponential families. We develop a fundamentally new and general framework for sequential change detection when the pre- and post-change distributions are nonparametrically specified (and thus composite). Our procedures come with clean…

    Submitted 29 October, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 49 pages, 7 figures

  10. arXiv:2110.14298  [pdf, other]

    math.ST stat.ML

    Denoising and change point localisation in piecewise-constant high-dimensional regression coefficients

    Authors: Fan Wang, Oscar Hernan Madrid Padilla, Yi Yu, Alessandro Rinaldo

    Abstract: We study the theoretical properties of the fused lasso procedure originally proposed by Tibshirani et al. (2005) in the context of a linear regression model in which the regression coefficients are totally ordered and assumed to be sparse and piecewise constant. Despite its popularity, to the best of our knowledge, estimation error bounds in high-dimensional settings have only been obtained fo…

    Submitted 18 February, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

  11. arXiv:2110.11487  [pdf, other]

    math.ST

    Generalized Results for the Existence and Consistency of the MLE in the Bradley-Terry-Luce Model

    Authors: Heejong Bong, Alessandro Rinaldo

    Abstract: Ranking problems based on pairwise comparisons, such as those arising in online gaming, often involve a large pool of items to order. In these situations, the gap in performance between any two items can be significant, and the smallest and largest winning probabilities can be very close to zero or one. Furthermore, each item may be compared only to a subset of all the items, so that not all pairw…

    Submitted 15 June, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: To appear in ICML2022

    MSC Class: 62F07; ACM Class: G.3

  12. arXiv:2110.10989  [pdf, other]

    math.ST

    Optimal partition recovery in general graphs

    Authors: Yi Yu, Oscar Hernan Madrid Padilla, Alessandro Rinaldo

    Abstract: We consider a graph-structured change point problem in which we observe a random vector with piecewise constant but unknown mean and whose independent, sub-Gaussian coordinates correspond to the $n$ nodes of a fixed graph. We are interested in the localisation task of recovering the partition of the nodes associated to the constancy regions of the mean vector. When the partition $\mathcal{S}$ cons…

    Submitted 18 February, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

  13. arXiv:2110.10825  [pdf, other]

    math.ST stat.ML

    $\ell_{\infty}$-Bounds of the MLE in the BTL Model under General Comparison Graphs

    Authors: Wanshan Li, Shamindra Shrotriya, Alessandro Rinaldo

    Abstract: The Bradley-Terry-Luce (BTL) model is a popular statistical approach for estimating the global ranking of a collection of items using pairwise comparisons. To ensure accurate ranking, it is essential to obtain precise estimates of the model parameters in the $\ell_{\infty}$-loss. The difficulty of this task depends crucially on the topology of the pairwise comparison graph over the given items. Ho…

    Submitted 22 June, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: Accepted for the 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022), 43 pages, 7 figures

  14. arXiv:2105.13504  [pdf, other]

    math.ST cs.LG stat.ML

    Lattice partition recovery with dyadic CART

    Authors: Oscar Hernan Madrid Padilla, Yi Yu, Alessandro Rinaldo

    Abstract: We study piece-wise constant signals corrupted by additive Gaussian noise over a $d$-dimensional lattice. Data of this form naturally arise in a host of applications, and the tasks of signal detection or testing, de-noising and estimation have been studied extensively in the statistical and signal processing literature. In this paper we consider instead the problem of partition recovery, i.e. of e…

    Submitted 27 October, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

  15. arXiv:2101.05477  [pdf, other]

    math.ST cs.LG

    Optimal network online change point localisation

    Authors: Yi Yu, Oscar Hernan Madrid Padilla, Daren Wang, Alessandro Rinaldo

    Abstract: We study the problem of online network change point detection. In this setting, a collection of independent Bernoulli networks is collected sequentially, and the underlying distributions change when a change point occurs. The goal is to detect the change point as quickly as possible, if it exists, subject to a constraint on the number or probability of false alarms. In this paper, on the detection…

    Submitted 14 January, 2021; originally announced January 2021.

  16. arXiv:2010.08082  [pdf, other]

    math.ST stat.ME

    Nonparametric iterated-logarithm extensions of the sequential generalized likelihood ratio test

    Authors: Jaehyeok Shin, Aaditya Ramdas, Alessandro Rinaldo

    Abstract: We develop a nonparametric extension of the sequential generalized likelihood ratio (GLR) test and corresponding time-uniform confidence sequences for the mean of a univariate distribution. By utilizing a geometric interpretation of the GLR statistic, we derive a simple analytic upper bound on the probability that it exceeds any prespecified boundary; these are intractable to approximate via simul…

    Submitted 13 May, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 53 pages, 8 figures

  17. arXiv:2009.13673  [pdf, ps, other]

    math.ST

    High-dimensional CLT for Sums of Non-degenerate Random Vectors: $n^{-1/2}$-rate

    Authors: Arun Kumar Kuchibhotla, Alessandro Rinaldo

    Abstract: In this note, we provide Berry--Esseen bounds for rectangles in high dimensions when the random vectors have non-singular covariance matrices. Under this assumption of non-singularity, we prove an $n^{-1/2}$ scaling for the Berry--Esseen bound for sums of mean independent random vectors with a finite third moment. The proof is essentially the method of compositions proof of multivariate Berry--E…

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: 21 pages

  18. arXiv:2007.09751  [pdf, ps, other]

    math.ST stat.ME

    Berry-Esseen Bounds for Projection Parameters and Partial Correlations with Increasing Dimension

    Authors: Arun Kumar Kuchibhotla, Alessandro Rinaldo, Larry Wasserman

    Abstract: We provide finite sample bounds on the Normal approximation to the law of the least squares estimator of the projection parameters normalized by the sandwich-based standard errors. Our results hold in the increasing dimension setting and under minimal assumptions on the data generating distribution. In particular, we do not assume a linear regression function and only require the existence of fini…

    Submitted 22 October, 2021; v1 submitted 19 July, 2020; originally announced July 2020.

    Comments: 58 pages, 0 figures

  19. arXiv:2006.03283  [pdf, ps, other]

    math.ST

    A Note on Online Change Point Detection

    Authors: Yi Yu, Oscar Hernan Madrid Padilla, Daren Wang, Alessandro Rinaldo

    Abstract: We investigate sequential change point estimation and detection in univariate nonparametric settings, where a stream of independent observations from sub-Gaussian distributions with a common variance factor and piecewise-constant but otherwise unknown means are collected. We develop a simple CUSUM-based methodology that provably controls the probability of false alarms or the average run length whi…

    Submitted 13 November, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

  20. arXiv:2003.00083  [pdf, other]

    math.ST stat.ML

    Nonparametric Estimation in the Dynamic Bradley-Terry Model

    Authors: Heejong Bong, Wanshan Li, Shamindra Shrotriya, Alessandro Rinaldo

    Abstract: We propose a time-varying generalization of the Bradley-Terry model that allows for nonparametric modeling of dynamic global rankings of distinct teams. We develop a novel estimator that relies on kernel smoothing to pre-process the pairwise comparisons over time and is applicable in sparse settings where the Bradley-Terry model may not be fit. We obtain necessary and sufficient conditions for the exist…

    Submitted 28 February, 2020; originally announced March 2020.

    Comments: To appear in AISTATS 2020

  21. arXiv:2002.08422  [pdf, other]

    math.ST stat.ML

    On conditional versus marginal bias in multi-armed bandits

    Authors: Jaehyeok Shin, Aaditya Ramdas, Alessandro Rinaldo

    Abstract: The bias of the sample means of the arms in multi-armed bandits is an important issue in adaptive data analysis that has recently received considerable attention in the literature. Existing results relate in precise ways the sign and magnitude of the bias to various sources of data adaptivity, but do not apply to the conditional inference setting in which the sample means are computed only if some…

    Submitted 22 February, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: 18 pages

  22. arXiv:1910.13289  [pdf, other]

    math.ST stat.ME

    Optimal nonparametric multivariate change point detection and localization

    Authors: Oscar Hernan Madrid Padilla, Yi Yu, Daren Wang, Alessandro Rinaldo

    Abstract: We study the multivariate nonparametric change point detection problem, where the data are a sequence of independent $p$-dimensional random vectors whose distributions are piecewise-constant with Lipschitz densities changing at unknown times, called change points. We quantify the size of the distributional change at any change point with the supremum norm of the difference between the correspondin…

    Submitted 25 June, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

  23. arXiv:1909.06359  [pdf, other]

    math.ST stat.ME

    Localizing Changes in High-Dimensional Vector Autoregressive Processes

    Authors: Daren Wang, Yi Yu, Alessandro Rinaldo, Rebecca Willett

    Abstract: Autoregressive models capture stochastic processes in which past realizations determine the generative distribution of new data; they arise naturally in a variety of industrial, biomedical, and financial settings. A key challenge when working with such data is to determine when the underlying generative model has changed, as this can offer insights into distinct operating regimes of the underlying…

    Submitted 29 July, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: 53 pages; 4 figures

  24. arXiv:1907.03813  [pdf, other]

    stat.ML cs.LG math.ST

    Statistical Analysis of Nearest Neighbor Methods for Anomaly Detection

    Authors: Xiaoyi Gu, Leman Akoglu, Alessandro Rinaldo

    Abstract: Nearest-neighbor (NN) procedures are well studied and widely used in both supervised and unsupervised learning problems. In this paper we are concerned with investigating the performance of NN-based methods for anomaly detection. We first show through extensive simulations that NN methods compare favorably to some of the other state-of-the-art algorithms for anomaly detection based on a set of ben…

    Submitted 8 July, 2019; originally announced July 2019.

  25. arXiv:1905.11397  [pdf, other]

    math.ST stat.ML

    Are sample means in multi-armed bandits positively or negatively biased?

    Authors: Jaehyeok Shin, Aaditya Ramdas, Alessandro Rinaldo

    Abstract: It is well known that in stochastic multi-armed bandits (MAB), the sample mean of an arm is typically not an unbiased estimator of its true mean. In this paper, we decouple three different sources of this selection bias: adaptive sampling of arms, adaptive stopping of the experiment, and adaptively choosing which arm to study. Through a new notion called "optimism" that capt…

    Submitted 26 October, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: 21 pages. Advances in Neural Information Processing Systems 32 (NeurIPS 2019, Spotlight Presentation)

  26. arXiv:1905.10019  [pdf, other]

    stat.ME math.ST

    Optimal nonparametric change point detection and localization

    Authors: Oscar Hernan Madrid Padilla, Yi Yu, Daren Wang, Alessandro Rinaldo

    Abstract: We study change point detection and localization for univariate data in fully nonparametric settings in which, at each time point, we acquire an i.i.d. sample from an unknown distribution. We quantify the magnitude of the distributional changes at the change points using the Kolmogorov--Smirnov distance. We allow all the relevant parameters -- the minimal spacing between two consecutive change poi…

    Submitted 23 May, 2019; originally announced May 2019.

    MSC Class: Change point detection; Minimax optimality

  27. arXiv:1903.06955  [pdf, other]

    math.AT cs.CG

    Homotopy Reconstruction via the Cech Complex and the Vietoris-Rips Complex

    Authors: Jisu Kim, Jaehyeok Shin, Frédéric Chazal, Alessandro Rinaldo, Larry Wasserman

    Abstract: We derive conditions under which the reconstruction of a target space is topologically correct via the Čech complex or the Vietoris-Rips complex obtained from possibly noisy point cloud data. We provide two novel theoretical results. First, we describe sufficient conditions under which any non-empty intersection of finitely many Euclidean balls intersected with a positive reach set is contractible…

    Submitted 12 May, 2020; v1 submitted 16 March, 2019; originally announced March 2019.

    Comments: 60 pages, 7 figures, to appear in the 36th International Symposium on Computational Geometry (SoCG 2020), the code is available at https://github.com/jisuk1/nerveshape

  28. arXiv:1902.00746  [pdf, ps, other]

    math.ST cs.LG stat.ML

    On the bias, risk and consistency of sample means in multi-armed bandits

    Authors: Jaehyeok Shin, Aaditya Ramdas, Alessandro Rinaldo

    Abstract: The sample mean is among the most well studied estimators in statistics, having many desirable properties such as unbiasedness and consistency. However, when analyzing data collected using a multi-armed bandit (MAB) experiment, the sample mean is biased and much remains to be understood about its properties. For example, when is it consistent, how large is its bias, and can we bound its mean squar…

    Submitted 29 April, 2021; v1 submitted 2 February, 2019; originally announced February 2019.

    Comments: 48 pages

  29. arXiv:1810.09498  [pdf, ps, other]

    math.ST stat.ME

    Univariate Mean Change Point Detection: Penalization, CUSUM and Optimality

    Authors: Daren Wang, Yi Yu, Alessandro Rinaldo

    Abstract: The problem of univariate mean change point detection and localization based on a sequence of $n$ independent observations with piecewise constant means has been intensively studied for more than half a century, and serves as a blueprint for change point problems in more complex settings. We provide a complete characterization of this classical problem in a general framework in which the upper bound…

    Submitted 6 June, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

  30. arXiv:1810.05935  [pdf, ps, other]

    math.ST

    Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Volume Dimension

    Authors: Jisu Kim, Jaehyeok Shin, Alessandro Rinaldo, Larry Wasserman

    Abstract: We derive concentration inequalities for the supremum norm of the difference between a kernel density estimator (KDE) and its point-wise expectation that hold uniformly over the selection of the bandwidth and under weaker conditions on the kernel and the data generating distribution than previously used in the literature. We first propose a novel concept, called the volume dimension, to measure th…

    Submitted 31 December, 2019; v1 submitted 13 October, 2018; originally announced October 2018.

    Comments: 51 pages, to be published in Proceedings of Thirty-sixth International Conference on Machine Learning (ICML 2019), Volume 97, 2019

  31. arXiv:1810.02294  [pdf, ps, other]

    math.ST stat.ML stat.OT

    Markov Properties of Discrete Determinantal Point Processes

    Authors: Kayvan Sadeghi, Alessandro Rinaldo

    Abstract: Determinantal point processes (DPPs) are probabilistic models for repulsion. When used to represent the occurrence of random subsets of a finite base set, DPPs make it possible to model global negative associations in a mathematically elegant and direct way. Discrete DPPs have become popular and computationally tractable models for solving several machine learning tasks that require the selection of diverse…

    Submitted 27 January, 2019; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: 9 pages, 1 figure

  32. arXiv:1712.09912  [pdf, ps, other]

    math.ST

    Optimal Covariance Change Point Localization in High Dimension

    Authors: Daren Wang, Yi Yu, Alessandro Rinaldo

    Abstract: We study the problem of change point detection for covariance matrices in high dimensions. We assume that we observe a sequence $\{X_i\}_{i=1,\ldots,n}$ of independent and centered $p$-dimensional sub-Gaussian random vectors whose covariance matrices are piecewise constant. Our task is to recover with high accuracy the number and locations of the change points, which are assumed unknown. Our generic model…

    Submitted 21 August, 2018; v1 submitted 28 December, 2017; originally announced December 2017.

    Comments: 44 pages

    MSC Class: Change point detection; High dimensional covariance estimation; Binary segmentation; Wild binary segmentation; Minimax optimal

  33. arXiv:1709.03885  [pdf, other]

    math.ST

    On Exchangeability in Network Models

    Authors: Steffen L. Lauritzen, Alessandro Rinaldo, Kayvan Sadeghi

    Abstract: We derive representation theorems for exchangeable distributions on finite and infinite graphs using elementary arguments based on geometric and graph-theoretic concepts. Our results elucidate some of the key differences, and their implications, between statistical network models that are finitely exchangeable and models that define a consistent sequence of probability distributions on graphs of i…

    Submitted 14 September, 2018; v1 submitted 12 September, 2017; originally announced September 2017.

    Comments: Dedicated to the memory of Steve Fienberg

  34. arXiv:1706.03113  [pdf, other]

    math.ST

    DBSCAN: Optimal Rates For Density Based Clustering

    Authors: Daren Wang, Xinyang Lu, Alessandro Rinaldo

    Abstract: We study the problem of optimal estimation of the density cluster tree under various assumptions on the underlying density. Building up from the seminal work of Chaudhuri et al. [2014], we formulate a new notion of clustering consistency which is better suited to smooth densities, and derive minimax rates of consistency for cluster tree estimation for Hölder smooth densities of arbitrary degree α.…

    Submitted 4 December, 2019; v1 submitted 9 June, 2017; originally announced June 2017.

    Comments: 55 pages, 5 figures

  35. arXiv:1705.04565  [pdf, other]

    math.ST math.DG

    Estimating the Reach of a Manifold

    Authors: Eddie Aamari, Jisu Kim, Frédéric Chazal, Bertrand Michel, Alessandro Rinaldo, Larry Wasserman

    Abstract: Various problems in manifold estimation make use of a quantity called the reach, denoted by $\tau_M$, which is a measure of the regularity of the manifold. This paper is the first investigation into the problem of how to estimate the reach. First, we study the geometry of the reach through an approximation perspective. We derive new geometric results on the reach for submanifolds without boundary. A…

    Submitted 8 April, 2019; v1 submitted 12 May, 2017; originally announced May 2017.

  36. arXiv:1701.08420  [pdf, other]

    math.ST

    Random Networks, Graphical Models, and Exchangeability

    Authors: Steffen Lauritzen, Alessandro Rinaldo, Kayvan Sadeghi

    Abstract: We study conditional independence relationships for random networks and their interplay with exchangeability. We show that, for finitely exchangeable network models, the empirical subgraph densities are maximum likelihood estimates of their theoretical counterparts. We then characterize all possible Markov structures for finitely exchangeable random graphs, thereby identifying a new class of Marko…

    Submitted 21 November, 2017; v1 submitted 29 January, 2017; originally announced January 2017.

    Comments: To appear in JRSSB

  37. arXiv:1611.05401  [pdf, other]

    math.ST

    Bootstrapping and Sample Splitting For High-Dimensional, Assumption-Free Inference

    Authors: Alessandro Rinaldo, Larry Wasserman, Max G'Sell, Jing Lei

    Abstract: Several new methods have been proposed for performing valid inference after model selection. An older method is sample splitting: use part of the data for model selection and part for inference. In this paper we revisit sample splitting combined with the bootstrap (or the Normal approximation). We show that this leads to a simple, assumption-free approach to inference and we establish results on…

    Submitted 2 April, 2018; v1 submitted 16 November, 2016; originally announced November 2016.

    MSC Class: 62G05

  38. arXiv:1606.06746  [pdf, other]

    stat.ME math.ST

    Approximate Recovery in Changepoint Problems, from $\ell_2$ Estimation Error Rates

    Authors: Kevin Lin, James Sharpnack, Alessandro Rinaldo, Ryan J. Tibshirani

    Abstract: In the 1-dimensional multiple changepoint detection problem, we prove that any procedure with a fast enough $\ell_2$ error rate, in terms of its estimation of the underlying piecewise constant mean vector, automatically has an (approximate) changepoint screening property---specifically, each true jump in the underlying mean vector has an estimated jump nearby. We also show, again assuming only kno…

    Submitted 2 December, 2016; v1 submitted 21 June, 2016; originally announced June 2016.

    Comments: 43 pages, 8 figures

  39. arXiv:1605.06416  [pdf, other]

    math.ST stat.ME stat.ML

    Statistical Inference for Cluster Trees

    Authors: Jisu Kim, Yen-Chi Chen, Sivaraman Balakrishnan, Alessandro Rinaldo, Larry Wasserman

    Abstract: A cluster tree provides a highly-interpretable summary of a density function by representing the hierarchy of its high-density clusters. It is estimated using the empirical tree, which is the cluster tree constructed from a density estimator. This paper addresses the basic question of quantifying our uncertainty by assessing the statistical significance of topological features of an empirical clus…

    Submitted 12 February, 2017; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: 20 pages, 6 figures, accepted in Neural Information Processing Systems (NIPS) 2016

  40. arXiv:1605.04565  [pdf, other]

    stat.ME math.ST

    Hierarchical Models for Independence Structures of Networks

    Authors: Kayvan Sadeghi, Alessandro Rinaldo

    Abstract: We introduce a new family of network models, called hierarchical network models, that allow us to represent in an explicit manner the stochastic dependence among the dyads (random ties) of the network. In particular, each member of this family can be associated with a graphical model defining conditional independence clauses among the dyads of the network, called the dependency graph. Every networ…

    Submitted 25 November, 2019; v1 submitted 15 May, 2016; originally announced May 2016.

    Comments: 19 pages, 7 figures

  41. Minimax Rates for Estimating the Dimension of a Manifold

    Authors: Jisu Kim, Alessandro Rinaldo, Larry Wasserman

    Abstract: Many algorithms in machine learning and computational geometry require, as input, the intrinsic dimension of the manifold that supports the probability distribution of the data. This parameter is rarely known and therefore has to be estimated. We characterize the statistical difficulty of this problem by deriving upper and lower bounds on the minimax rate for estimating the dimension. First, we co…

    Submitted 30 December, 2019; v1 submitted 3 May, 2016; originally announced May 2016.

    Comments: 54 pages, 11 figures, to be published in Journal of Computational Geometry, Volume 10, Number 1

  42. arXiv:1604.04173  [pdf, other]

    stat.ME math.ST stat.ML

    Distribution-Free Predictive Inference For Regression

    Authors: Jing Lei, Max G'Sell, Alessandro Rinaldo, Ryan J. Tibshirani, Larry Wasserman

    Abstract: We develop a general framework for distribution-free predictive inference in regression, using conformal inference. The proposed methodology allows for the construction of a prediction band for the response variable using any estimator of the regression function. The resulting prediction band preserves the consistency properties of the original estimator under standard assumptions, while guarantee…

    Submitted 8 March, 2017; v1 submitted 14 April, 2016; originally announced April 2016.

    Comments: 50 pages, 7 figures, 3 tables

  43. arXiv:1602.00180  [pdf, other]

    math.ST cs.DM

    On the Geometry and Extremal Properties of the Edge-Degeneracy Model

    Authors: Nicolas Kim, Dane Wilburne, Sonja Petrović, Alessandro Rinaldo

    Abstract: The edge-degeneracy model is an exponential random graph model that uses the graph degeneracy, a measure of the graph's connection density, and number of edges in a graph as its sufficient statistics. We show this model is relatively well-behaved by studying the statistical degeneracy of this model through the geometry of the associated polytope.

    Submitted 16 September, 2016; v1 submitted 30 January, 2016; originally announced February 2016.

    Comments: 9 pages, 4 figures. This version differs ever so slightly from the published one; several typos have been fixed and clarifying comments by J. Rauh incorporated in the update

  44. arXiv:1506.06266  [pdf, other]

    math.ST

    Uniform Asymptotic Inference and the Bootstrap After Model Selection

    Authors: Ryan J. Tibshirani, Alessandro Rinaldo, Robert Tibshirani, Larry Wasserman

    Abstract: Recently, Tibshirani et al. (2016) proposed a method for making inferences about parameters defined by model selection, in a typical regression setting with normally distributed errors. Here, we study the large sample properties of this method, without assuming normality. We prove that the test statistic of Tibshirani et al. (2016) is asymptotically valid, as the number of samples n grows and the…

    Submitted 9 August, 2017; v1 submitted 20 June, 2015; originally announced June 2015.

    Comments: 47 pages, 13 figures

  45. arXiv:1412.7197  [pdf, other]

    math.ST cs.CG math.AT

    Robust Topological Inference: Distance To a Measure and Kernel Distance

    Authors: Frédéric Chazal, Brittany T. Fasy, Fabrizio Lecci, Bertrand Michel, Alessandro Rinaldo, Larry Wasserman

    Abstract: Let P be a distribution with support S. The salient features of S can be quantified with persistent homology, which summarizes topological features of the sublevel sets of the distance function (the distance of any point x to S). Given a sample from P we can infer the persistent homology using an empirical version of the distance function. However, the empirical distance function is highly non-rob…

    Submitted 22 December, 2014; originally announced December 2014.

  46. arXiv:1411.3825  [pdf, other]

    math.ST stat.ML

    Statistical Models for Degree Distributions of Networks

    Authors: Kayvan Sadeghi, Alessandro Rinaldo

    Abstract: We define and study the statistical models in exponential family form whose sufficient statistics are the degree distributions and the bi-degree distributions of undirected labelled simple graphs. Graphs that are constrained by the joint degree distributions are called $dK$-graphs in the computer science literature and this paper attempts to provide the first statistically grounded analysis of thi…

    Submitted 14 November, 2014; originally announced November 2014.

    Comments: 13 pages, 4 figures; a shorter version to be presented at NIPS workshop 2014

  47. arXiv:1407.1004  [pdf, other]

    math.ST cs.SI

    $β$ models for random hypergraphs with a given degree sequence

    Authors: Despina Stasi, Kayvan Sadeghi, Alessandro Rinaldo, Sonja Petrović, Stephen E. Fienberg

    Abstract: We introduce the beta model for random hypergraphs in order to represent the occurrence of multi-way interactions among agents in a social network. This model builds upon and generalizes the well-studied beta model for random graphs, which instead only considers pairwise interactions. We provide two algorithms for fitting the model parameters, IPS (iterative proportional scaling) and fixed point a…

    Submitted 3 July, 2014; originally announced July 2014.

    Comments: 9 pages, 2 figures, Proceedings of 21st International Conference on Computational Statistics (2014), to appear

  48. arXiv:1406.1901  [pdf, other]

    math.AT cs.CG stat.AP

    Subsampling Methods for Persistent Homology

    Authors: Frédéric Chazal, Brittany Terese Fasy, Fabrizio Lecci, Bertrand Michel, Alessandro Rinaldo, Larry Wasserman

    Abstract: Persistent homology is a multiscale method for analyzing the shape of sets and functions from point cloud data arising from an unknown distribution supported on those sets. When the size of the sample is large, direct computation of the persistent homology is prohibitive due to the combinatorial nature of the existing algorithms. We propose to compute the persistent homology of several subsamples…

    Submitted 7 June, 2014; originally announced June 2014.

  49. arXiv:1312.2050  [pdf, ps, other]

    math.ST stat.ML

    Consistency of spectral clustering in stochastic block models

    Authors: Jing Lei, Alessandro Rinaldo

    Abstract: We analyze the performance of spectral clustering for community extraction in stochastic block models. We show that, under mild conditions, spectral clustering applied to the adjacency matrix of the network can consistently recover hidden communities even when the order of the maximum expected degree is as small as $\log n$, with $n$ the number of nodes. This result applies to some popular polynom…

    Submitted 30 December, 2014; v1 submitted 6 December, 2013; originally announced December 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOS1274 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1274

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 1, 215-237

  50. arXiv:1312.0308  [pdf, other]

    math.ST cs.CG math.AT

    Stochastic Convergence of Persistence Landscapes and Silhouettes

    Authors: Frédéric Chazal, Brittany Terese Fasy, Fabrizio Lecci, Alessandro Rinaldo, Larry Wasserman

    Abstract: Persistent homology is a widely used tool in Topological Data Analysis that encodes multiscale topological information as a multi-set of points in the plane called a persistence diagram. It is difficult to apply statistical theory directly to a random sample of diagrams. Instead, we can summarize the persistent homology with the persistence landscape, introduced by Bubenik, which converts a diagra…

    Submitted 1 December, 2013; originally announced December 2013.