Showing 1–45 of 45 results for author: Barber, R F

  1. arXiv:2506.03599  [pdf, ps, other]

    stat.ME math.ST

    Mosaic inference on panel data

    Authors: Asher Spector, Rina Foygel Barber, Emmanuel Candès

    Abstract: Analysis of panel data via linear regression is widespread across disciplines. To perform statistical inference, such analyses typically assume that clusters of observations are jointly independent. For example, one might assume that observations in New York are independent of observations in New Jersey. Are such assumptions plausible? Might there be hidden dependencies between nearby clusters? Th…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 38 pages, 7 figures

  2. arXiv:2506.02257  [pdf, ps, other]

    stat.ML cs.LG math.ST stat.ME

    Assumption-free stability for ranking problems

    Authors: Ruiting Liang, Jake A. Soloff, Rina Foygel Barber, Rebecca Willett

    Abstract: In this work, we consider ranking problems among a finite set of candidates: for instance, selecting the top-$k$ items among a larger list of candidates or obtaining the full ranking of all items in the set. These problems are often unstable, in the sense that estimating a ranking from noisy data can exhibit high sensitivity to small perturbations. Concretely, if we use data to provide a score for…

    Submitted 2 June, 2025; originally announced June 2025.

  3. arXiv:2504.02292  [pdf, ps, other]

    math.ST

    Unifying Different Theories of Conformal Prediction

    Authors: Rina Foygel Barber, Ryan J. Tibshirani

    Abstract: This paper presents a unified framework for understanding the methodology and theory behind several different methods in the conformal prediction literature, which includes standard conformal prediction (CP), weighted conformal prediction (WCP), nonexchangeable conformal prediction (NexCP), and randomly-localized conformal prediction (RLCP), among others. At the crux of our framework is the idea t…

    Submitted 3 April, 2025; originally announced April 2025.

  4. arXiv:2502.06765  [pdf, ps, other]

    math.ST cs.LG stat.ML

    Are all models wrong? Fundamental limits in distribution-free empirical model falsification

    Authors: Manuel M. Müller, Yuetian Luo, Rina Foygel Barber

    Abstract: In statistics and machine learning, when we train a fitted model on available data, we typically want to ensure that we are searching within a model class that contains at least one accurate model -- that is, we would like to ensure an upper bound on the model class risk (the lowest possible risk that can be attained by any model in the class). However, it is also of interest to establish lower bo…

    Submitted 5 June, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: 39 pages, 1 figure

  5. arXiv:2501.06133  [pdf, other]

    stat.ME math.ST

    Testing conditional independence under isotonicity

    Authors: Rohan Hore, Jake A. Soloff, Rina Foygel Barber, Richard J. Samworth

    Abstract: We propose a test of the conditional independence of random variables $X$ and $Y$ given $Z$ under the additional assumption that $X$ is stochastically increasing in $Z$. The well-documented hardness of testing conditional independence means that some further restriction on the null hypothesis parameter space is required, but in contrast to existing approaches based on parametric models, smoothness…

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: 69 pages, 5 figures

  6. arXiv:2411.11824  [pdf, ps, other]

    math.ST stat.ME stat.ML

    Theoretical Foundations of Conformal Prediction

    Authors: Anastasios N. Angelopoulos, Rina Foygel Barber, Stephen Bates

    Abstract: This book is about conformal prediction and related inferential techniques that build on permutation tests and exchangeability. These techniques are useful in a diverse array of tasks, including hypothesis testing and providing uncertainty quantification guarantees for machine learning systems. Much of the current interest in conformal prediction is due to its ability to integrate into complex mac…

    Submitted 3 June, 2025; v1 submitted 18 November, 2024; originally announced November 2024.

    Comments: This material will be published by Cambridge University Press as Theoretical Foundations of Conformal Prediction by Anastasios N. Angelopoulos, Rina Foygel Barber, and Stephen Bates. This prepublication version is free to view/download for personal use only. Not for redistribution/resale/use in derivative works. Copyright Anastasios N. Angelopoulos, Rina Foygel Barber, and Stephen Bates, 2025

  7. arXiv:2405.15107  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Is Algorithmic Stability Testable? A Unified Framework under Computational Constraints

    Authors: Yuetian Luo, Rina Foygel Barber

    Abstract: Algorithmic stability is a central notion in learning theory that quantifies the sensitivity of an algorithm to small changes in the training data. If a learning algorithm satisfies certain stability properties, this leads to many important downstream implications, such as generalization, robustness, and reliable predictive inference. Verifying that stability holds for a particular algorithm is th…

    Submitted 30 March, 2025; v1 submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2405.14064  [pdf, other]

    stat.ML cs.LG math.ST

    Building a stable classifier with the inflated argmax

    Authors: Jake A. Soloff, Rina Foygel Barber, Rebecca Willett

    Abstract: We propose a new framework for algorithmic stability in the context of multiclass classification. In practice, classification algorithms often operate by first assigning a continuous score (for instance, an estimated probability) to each possible label, then taking the maximizer -- i.e., selecting the class that has the highest score. A drawback of this type of approach is that it is inherently un…

    Submitted 25 April, 2025; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024

  9. arXiv:2405.09511  [pdf, other]

    math.ST

    Stability via resampling: statistical problems beyond the real line

    Authors: Jake A. Soloff, Rina Foygel Barber, Rebecca Willett

    Abstract: Model averaging techniques based on resampling methods (such as bootstrapping or subsampling) have been utilized across many areas of statistics, often with the explicit goal of promoting stability in the resulting output. We provide a general, finite-sample theoretical result guaranteeing the stability of bagging when applied to algorithms that return outputs in a general space, so that the outpu…

    Submitted 24 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  10. arXiv:2404.06457  [pdf, other]

    math.ST math.PR

    Hoeffding and Bernstein inequalities for weighted sums of exchangeable random variables

    Authors: Rina Foygel Barber

    Abstract: The aim of this paper is to establish Hoeffding and Bernstein type concentration inequalities for weighted sums of exchangeable random variables. A special case is the i.i.d. setting, where random variables are sampled independently from some distribution (and are therefore exchangeable). In contrast to the existing literature on this problem, our results provide a natural unified view of both the…

    Submitted 14 August, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  11. arXiv:2402.07388  [pdf, ps, other]

    math.ST cs.LG stat.ML

    The Limits of Assumption-free Tests for Algorithm Performance

    Authors: Yuetian Luo, Rina Foygel Barber

    Abstract: Algorithm evaluation and comparison are fundamental questions in machine learning and statistics -- how well does an algorithm perform at a given modeling task, and which algorithm performs best? Many methods have been developed to assess algorithm performance, often based around cross-validation type strategies, retraining the algorithm of interest on different subsets of the data and assessing i…

    Submitted 22 March, 2025; v1 submitted 11 February, 2024; originally announced February 2024.

  12. arXiv:2311.04295  [pdf, ps, other]

    math.ST

    Algorithmic stability implies training-conditional coverage for distribution-free prediction methods

    Authors: Ruiting Liang, Rina Foygel Barber

    Abstract: In a supervised learning problem, given a predicted value that is the output of some trained model, how can we quantify our uncertainty around this prediction? Distribution-free predictive inference aims to construct prediction intervals around this output, with valid coverage that does not rely on assumptions on the distribution of the data or the nature of the model training algorithm. Existing…

    Submitted 27 June, 2025; v1 submitted 7 November, 2023; originally announced November 2023.

  13. arXiv:2306.06342  [pdf, other]

    math.ST stat.ME

    Distribution-free inference with hierarchical data

    Authors: Yonghoon Lee, Rina Foygel Barber, Rebecca Willett

    Abstract: This paper studies distribution-free inference in settings where the data set has a hierarchical structure -- for example, groups of observations, or repeated measurements. In such settings, standard notions of exchangeability may not hold. To address this challenge, a hierarchical form of exchangeability is derived, facilitating extensions of distribution-free methods, including conformal predict…

    Submitted 2 March, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

  14. arXiv:2304.03927  [pdf, ps, other]

    math.ST math.PR

    De Finetti's theorem and related results for infinite weighted exchangeable sequences

    Authors: Rina Foygel Barber, Emmanuel J. Candes, Aaditya Ramdas, Ryan J. Tibshirani

    Abstract: De Finetti's theorem, also called the de Finetti-Hewitt-Savage theorem, is a foundational result in probability and statistics. Roughly, it says that an infinite sequence of exchangeable random variables can always be written as a mixture of independent and identically distributed (i.i.d.) sequences of random variables. In this paper, we consider a weighted generalization of exchangeability that a…

    Submitted 27 November, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

  15. arXiv:2303.17042  [pdf, other]

    physics.med-ph math.OC

    Simultaneous activity and attenuation estimation in TOF-PET with TV-constrained nonconvex optimization

    Authors: Zhimei Ren, Emil Y. Sidky, Rina Foygel Barber, Chien-Min Kao, Xiaochuan Pan

    Abstract: An alternating direction method of multipliers (ADMM) framework is developed for nonsmooth biconvex optimization for inverse problems in imaging. In particular, the simultaneous estimation of activity and attenuation (SAA) problem in time-of-flight positron emission tomography (TOF-PET) has such a structure when maximum likelihood estimation (MLE) is employed. The ADMM framework is applied to MLE…

    Submitted 9 February, 2024; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Manuscript accepted at IEEE Transactions on Medical Imaging. This version contains the appendix for the ADMM-TVSAA pseudocode

  16. arXiv:2301.12600  [pdf, other]

    stat.ML cs.LG math.ST

    Bagging Provides Assumption-free Stability

    Authors: Jake A. Soloff, Rina Foygel Barber, Rebecca Willett

    Abstract: Bagging is an important technique for stabilizing machine learning models. In this paper, we derive a finite-sample guarantee on the stability of bagging for any model. Our result places no assumptions on the distribution of the data, on the properties of the base algorithm, or on the dimensionality of the covariates. Our guarantee applies to many variants of bagging and is optimal up to a constan…

    Submitted 25 April, 2024; v1 submitted 29 January, 2023; originally announced January 2023.

  17. arXiv:2205.03647  [pdf, other]

    math.ST

    Training-conditional coverage for distribution-free predictive inference

    Authors: Michael Bian, Rina Foygel Barber

    Abstract: The field of distribution-free predictive inference provides tools for provably valid prediction without any assumptions on the distribution of the data, which can be paired with any regression algorithm to provide accurate and reliable predictive intervals. The guarantees provided by these methods are typically marginal, meaning that predictive accuracy holds on average over both the training dat…

    Submitted 17 January, 2023; v1 submitted 7 May, 2022; originally announced May 2022.

  18. arXiv:2201.04457  [pdf, ps, other]

    math.ST

    Half-Trek Criterion for Identifiability of Latent Variable Models

    Authors: Rina Foygel Barber, Mathias Drton, Nils Sturma, Luca Weihs

    Abstract: We consider linear structural equation models with latent variables and develop a criterion to certify whether the direct causal effects between the observable variables are identifiable based on the observed covariance matrix. Linear structural equation models assume that both observed and latent variables solve a linear equation system featuring stochastic noise terms. Each model corresponds to…

    Submitted 12 August, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: to be published in Annals of Statistics

    MSC Class: 62H05; 62J05; 62R01

  19. arXiv:2111.15546  [pdf, ps, other]

    cs.LG math.ST

    Black-box tests for algorithmic stability

    Authors: Byol Kim, Rina Foygel Barber

    Abstract: Algorithmic stability is a concept from learning theory that expresses the degree to which changes to the input data (e.g., removal of a single data point) may affect the outputs of a regression algorithm. Knowing an algorithm's stability properties is often useful for many downstream applications -- for example, stability is known to lead to desirable generalization properties and predictive infe…

    Submitted 21 December, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: 37 pages. Minor edits to match the journal-submitted version

  20. arXiv:2106.09136  [pdf, other]

    math.ST stat.ML

    Binary classification with corrupted labels

    Authors: Yonghoon Lee, Rina Foygel Barber

    Abstract: In a binary classification problem where the goal is to fit an accurate predictor, the presence of corrupted labels in the training data set may create an additional challenge. However, in settings where likelihood maximization is poorly behaved -- for example, if positive and negative labels are perfectly separable -- then a small fraction of corrupted labels can improve performance by ensuring robustn…

    Submitted 16 June, 2021; originally announced June 2021.

  21. arXiv:2105.14075  [pdf, ps, other]

    math.ST

    Distribution-free inference for regression: discrete, continuous, and in between

    Authors: Yonghoon Lee, Rina Foygel Barber

    Abstract: In data analysis problems where we are not able to rely on distributional assumptions, what types of inference guarantees can still be obtained? Many popular methods, such as holdout methods, cross-validation methods, and conformal prediction, are able to provide distribution-free guarantees for predictive inference, but the problem of providing inference for the underlying regression function (fo…

    Submitted 28 May, 2022; v1 submitted 28 May, 2021; originally announced May 2021.

  22. arXiv:2105.07587  [pdf, other]

    math.ST stat.ME

    Convergence guarantee for the sparse monotone single index model

    Authors: Ran Dai, Hyebin Song, Rina Foygel Barber, Garvesh Raskutti

    Abstract: We consider a high-dimensional monotone single index model (hdSIM), which is a semiparametric extension of a high-dimensional generalized linear model (hdGLM), where the link function is unknown, but constrained with monotone and non-decreasing shape. We develop a scalable projection-based iterative approach, the "Sparse Orthogonal Descent Single-Index Model" (SOD-SIM), which alternates between spa…

    Submitted 16 May, 2021; originally announced May 2021.

    MSC Class: 62G08

  23. arXiv:2007.15346  [pdf, other]

    math.ST

    A Power Analysis for Model-X Knockoffs with $\ell_{p}$-Regularized Statistics

    Authors: Asaf Weinstein, Weijie J. Su, Małgorzata Bogdan, Rina F. Barber, Emmanuel J. Candès

    Abstract: Variable selection properties of procedures utilizing penalized-likelihood estimates are a central topic in the study of high dimensional linear regression problems. Existing literature emphasizes the quality of ranking of the variables by such procedures as reflected in the receiver operating characteristic curve or in prediction performance. Specifically, recent works have harnessed modern theory…

    Submitted 27 April, 2022; v1 submitted 30 July, 2020; originally announced July 2020.

  24. arXiv:2006.07278  [pdf, other]

    math.OC

    Convergence for nonconvex ADMM, with applications to CT imaging

    Authors: Rina Foygel Barber, Emil Y. Sidky

    Abstract: The alternating direction method of multipliers (ADMM) algorithm is a powerful and flexible tool for complex optimization problems of the form $\min\{f(x)+g(y) : Ax+By=c\}$. ADMM exhibits robust empirical performance across a range of challenging settings including nonsmoothness and nonconvexity of the objective functions $f$ and $g$, and provides a simple and natural approach to the inverse probl…

    Submitted 6 February, 2024; v1 submitted 12 June, 2020; originally announced June 2020.

  25. arXiv:2004.09477  [pdf, other]

    math.ST

    Is distribution-free inference possible for binary regression?

    Authors: Rina Foygel Barber

    Abstract: For a regression problem with a binary label response, we examine the problem of constructing confidence intervals for the label probability conditional on the features. In a setting where we do not have any information about the underlying distribution, we would ideally like to provide confidence intervals that are distribution-free -- that is, valid with no assumptions on the distribution of the…

    Submitted 7 October, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  26. arXiv:2002.06117  [pdf, ps, other]

    math.ST

    Local continuity of log-concave projection, with applications to estimation under model misspecification

    Authors: Rina Foygel Barber, Richard J. Samworth

    Abstract: The log-concave projection is an operator that maps a d-dimensional distribution P to an approximating log-concave density. Prior work by Dümbgen et al. (2011) establishes that, with suitable metrics on the underlying spaces, this projection is continuous, but not uniformly continuous. In this work we prove a local uniform continuity result for log-concave projection -- in particular, establishi…

    Submitted 18 December, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

  27. arXiv:1908.04462  [pdf, other]

    math.ST

    The bias of isotonic regression

    Authors: Ran Dai, Hyebin Song, Rina Foygel Barber, Garvesh Raskutti

    Abstract: We study the bias of the isotonic regression estimator. While there is extensive work characterizing the mean squared error of the isotonic regression estimator, relatively little is known about the bias. In this paper, we provide a sharp characterization, proving that the bias scales as $O(n^{-\beta/3})$ up to log factors, where $1 \leq \beta \leq 2$ is the exponent corresponding to Hölder smoothness of…

    Submitted 13 January, 2020; v1 submitted 12 August, 2019; originally announced August 2019.

  28. arXiv:1903.04684  [pdf, ps, other]

    math.ST

    The limits of distribution-free conditional predictive inference

    Authors: Rina Foygel Barber, Emmanuel J. Candès, Aaditya Ramdas, Ryan J. Tibshirani

    Abstract: We consider the problem of distribution-free predictive inference, with the goal of producing predictive coverage guarantees that hold conditionally rather than marginally. Existing methods such as conformal prediction offer marginal coverage guarantees, where predictive coverage holds on average over all possible test points, but this is not sufficient for many practical applications where we wou…

    Submitted 14 April, 2020; v1 submitted 11 March, 2019; originally announced March 2019.

  29. arXiv:1812.11433  [pdf, ps, other]

    stat.ME math.ST

    On the Construction of Knockoffs in Case-Control Studies

    Authors: Rina Foygel Barber, Emmanuel Candes

    Abstract: Consider a case-control study in which we have a random sample, constructed in such a way that the proportion of cases in our sample is different from that in the general population -- for instance, the sample is constructed to achieve a fixed ratio of cases to controls. Imagine that we wish to determine which of the potentially many covariates under study truly influence the response by applying t…

    Submitted 29 December, 2018; originally announced December 2018.

    Comments: 4 pages

  30. arXiv:1812.00404  [pdf, ps, other]

    math.OC

    An equivalence between critical points for rank constraints versus low-rank factorizations

    Authors: Wooseok Ha, Haoyang Liu, Rina Foygel Barber

    Abstract: Two common approaches in low-rank optimization problems are either working directly with a rank constraint on the matrix variable, or optimizing over a low-rank factorization so that the rank constraint is implicitly ensured. In this paper, we study the natural connection between the rank-constrained and factorized approaches. We show that all second-order stationary points of the factorized objec…

    Submitted 16 December, 2020; v1 submitted 2 December, 2018; originally announced December 2018.

  31. arXiv:1807.05405  [pdf, other]

    stat.ME math.ST

    The conditional permutation test for independence while controlling for confounders

    Authors: Thomas B. Berrett, Yi Wang, Rina Foygel Barber, Richard J. Samworth

    Abstract: We propose a general new method, the conditional permutation test, for testing the conditional independence of variables $X$ and $Y$ given a potentially high-dimensional random vector $Z$ that may contain confounding factors. The proposed test permutes entries of $X$ non-uniformly, so as to respect the existing dependence between $X$ and $Z$ and thus account for the presence of these confounders.…

    Submitted 7 May, 2019; v1 submitted 14 July, 2018; originally announced July 2018.

    Comments: 31 pages, 4 figures

  32. arXiv:1804.08841  [pdf, other]

    stat.ME math.ST stat.CO stat.ML

    Between hard and soft thresholding: optimal iterative thresholding algorithms

    Authors: Haoyang Liu, Rina Foygel Barber

    Abstract: Iterative thresholding algorithms seek to optimize a differentiable objective function over a sparsity or rank constraint by alternating between gradient steps that reduce the objective, and thresholding steps that enforce the constraint. This work examines the choice of the thresholding operator, and asks whether it is possible to achieve stronger guarantees than what is possible with hard thresh…

    Submitted 30 July, 2019; v1 submitted 24 April, 2018; originally announced April 2018.

  33. arXiv:1709.04451  [pdf, other]

    math.OC stat.ML

    Alternating minimization and alternating descent over nonconvex sets

    Authors: Wooseok Ha, Rina Foygel Barber

    Abstract: We analyze the performance of alternating minimization for loss functions optimized over two variables, where each variable may be restricted to lie in some potentially nonconvex constraint set. This type of setting arises naturally in high-dimensional statistics and signal processing, where the variables often reflect different structures or components within the signals being considered. Our ana…

    Submitted 25 February, 2019; v1 submitted 13 September, 2017; originally announced September 2017.

  34. arXiv:1706.01852  [pdf, other]

    math.ST

    Contraction and uniform convergence of isotonic regression

    Authors: Fan Yang, Rina Foygel Barber

    Abstract: We consider the problem of isotonic regression, where the underlying signal $x$ is assumed to satisfy a monotonicity constraint, that is, $x$ lies in the cone $\{ x\in\mathbb{R}^n : x_1 \leq \dots \leq x_n\}$. We study the isotonic projection operator (projection to this cone), and find a necessary and sufficient condition characterizing all norms with respect to which this projection is contracti…

    Submitted 31 October, 2018; v1 submitted 6 June, 2017; originally announced June 2017.

  35. arXiv:1703.07755  [pdf, other]

    math.OC

    Gradient descent with nonconvex constraints: local concavity determines convergence

    Authors: Rina Foygel Barber, Wooseok Ha

    Abstract: Many problems in high-dimensional statistics and optimization involve minimization over nonconvex constraints -- for instance, a rank constraint for a matrix estimation problem -- but little is known about the theoretical properties of such optimization problems for a general nonconvex constraint set. In this paper we study the interplay between the geometric properties of the constraint set and the con…

    Submitted 18 October, 2017; v1 submitted 22 March, 2017; originally announced March 2017.

  36. arXiv:1703.06222  [pdf]

    stat.ME math.ST stat.ML

    A unified treatment of multiple testing with prior knowledge using the p-filter

    Authors: Aaditya Ramdas, Rina Foygel Barber, Martin J. Wainwright, Michael I. Jordan

    Abstract: There is a significant literature on methods for incorporating knowledge into multiple testing procedures so as to improve their power and precision. Some common forms of prior knowledge include (a) beliefs about which hypotheses are null, modeled by non-uniform prior weights; (b) differing importances of hypotheses, modeled by differing penalties for false discoveries; (c) multiple arbitrary part…

    Submitted 6 August, 2019; v1 submitted 17 March, 2017; originally announced March 2017.

    Comments: 36 pages, 1 figure, accepted for publication at the Annals of Statistics

  37. arXiv:1611.09933  [pdf, ps, other]

    math.ST

    Trimmed Conformal Prediction for High-Dimensional Models

    Authors: Wenyu Chen, Zhaokai Wang, Wooseok Ha, Rina Foygel Barber

    Abstract: In regression, conformal prediction is a general methodology to construct prediction intervals in a distribution-free manner. Although conformal prediction guarantees strong statistical properties for predictive inference, its inherent computational challenge has attracted the attention of researchers in the community. In this paper, we propose a new framework, called Trimmed Conformal Prediction (T…

    Submitted 29 November, 2016; originally announced November 2016.

    Comments: 11 pages, 4 figures, Under review for AISTATS 2017

  38. arXiv:1610.07403  [pdf, other]

    math.ST

    The Function-on-Scalar LASSO with Applications to Longitudinal GWAS

    Authors: Rina Foygel Barber, Matthew Reimherr, Thomas Schill

    Abstract: We present a new methodology for simultaneous variable selection and parameter estimation in function-on-scalar regression with an ultra-high dimensional predictor vector. We extend the LASSO to functional data in both the $\textit{dense}$ functional setting and the $\textit{sparse}$ functional setting. We provide theoretical guarantees which allow for an exponential number of predictor variables.…

    Submitted 24 October, 2016; originally announced October 2016.

  39. arXiv:1602.03574  [pdf, other]

    stat.ME math.ST

    A knockoff filter for high-dimensional selective inference

    Authors: Rina Foygel Barber, Emmanuel J. Candes

    Abstract: This paper develops a framework for testing for associations in a possibly high-dimensional linear model where the number of features/variables may far exceed the number of observational units. In this framework, the observations are split into two groups, where the first group is used to screen for a set of potentially relevant variables, whereas the second is used for inference over this reduced…

    Submitted 3 May, 2018; v1 submitted 10 February, 2016; originally announced February 2016.

  40. arXiv:1510.08842  [pdf, other]

    math.OC

    MOCCA: mirrored convex/concave optimization for nonconvex composite functions

    Authors: Rina Foygel Barber, Emil Y. Sidky

    Abstract: Many optimization problems arising in high-dimensional statistics decompose naturally into a sum of several terms, where the individual terms are relatively simple but the composite objective function can only be optimized with iterative algorithms. In this paper, we are interested in optimization problems of the form F(Kx) + G(x), where K is a fixed linear transformation, while F and G are functi…

    Submitted 29 June, 2016; v1 submitted 29 October, 2015; originally announced October 2015.

  41. arXiv:1503.08337  [pdf, other]

    math.ST

    Laplace Approximation in High-dimensional Bayesian Regression

    Authors: Rina Foygel Barber, Mathias Drton, Kean Ming Tan

    Abstract: We consider Bayesian variable selection in sparse high-dimensional regression, where the number of covariates $p$ may be large relative to the sample size $n$, but at most a moderate number $q$ of covariates are active. Specifically, we treat generalized linear models. For a single fixed sparse model with well-behaved prior distribution, classical theory proves that the Laplace approximation to t…

    Submitted 28 March, 2015; originally announced March 2015.

    Comments: 17 pages, 1 figure

  42. arXiv:1502.07641  [pdf, other]

    math.ST cs.LG

    ROCKET: Robust Confidence Intervals via Kendall's Tau for Transelliptical Graphical Models

    Authors: Rina Foygel Barber, Mladen Kolar

    Abstract: Undirected graphical models are used extensively in the biological and social sciences to encode a pattern of conditional independences between variables, where the absence of an edge between two nodes $a$ and $b$ indicates that the corresponding two variables $X_a$ and $X_b$ are believed to be conditionally independent, after controlling for all other measured variables. In the Gaussian case, con…

    Submitted 1 September, 2017; v1 submitted 26 February, 2015; originally announced February 2015.

  43. arXiv:1412.4451  [pdf, ps, other]

    math.ST cs.IT

    Privacy and Statistical Risk: Formalisms and Minimax Bounds

    Authors: Rina Foygel Barber, John C. Duchi

    Abstract: We explore and compare a variety of definitions for privacy and disclosure limitation in statistical estimation and data analysis, including (approximate) differential privacy, testing-based definitions of privacy, and posterior guarantees on disclosure risk. We give equivalence results between the definitions, shedding light on the relationships between different formalisms for privacy. We also t…

    Submitted 14 December, 2014; originally announced December 2014.

    Comments: 29 pages

  44. arXiv:1404.5609  [pdf, ps, other]

    stat.ME math.ST

    Controlling the false discovery rate via knockoffs

    Authors: Rina Foygel Barber, Emmanuel J. Candès

    Abstract: In many fields of science, we observe a response variable together with a large number of potential explanatory variables, and would like to be able to discover which variables are truly associated with the response. At the same time, we need to know that the false discovery rate (FDR) - the expected fraction of false discoveries among all discoveries - is not too high, in order to assure the scie…

    Submitted 14 October, 2015; v1 submitted 22 April, 2014; originally announced April 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1337 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1337

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 5, 2055-2085

  45. arXiv:1403.3374  [pdf, other]

    math.ST

    High-dimensional Ising model selection with Bayesian information criteria

    Authors: Rina Foygel Barber, Mathias Drton

    Abstract: We consider the use of Bayesian information criteria for selection of the graph underlying an Ising model. In an Ising model, the full conditional distributions of each variable form logistic regression models, and variable selection techniques for regression allow one to identify the neighborhood of each node and, thus, the entire graph. We prove high-dimensional consistency results for this pseu…

    Submitted 5 March, 2015; v1 submitted 13 March, 2014; originally announced March 2014.