Skip to main content

Showing 1–22 of 22 results for author: Donoho, D L

Searching in archive math. Search in all archives.
.
  1. arXiv:2210.04488  [pdf, other

    math.ST

    Optimal Eigenvalue Shrinkage in the Semicircle Limit

    Authors: David L. Donoho, Michael J. Feldman

    Abstract: Modern datasets are trending towards ever higher dimension. In response, recent theoretical studies of covariance estimation often assume the proportional-growth asymptotic framework, where the sample size $n$ and dimension $p$ are comparable, with $n, p \rightarrow \infty $ and $γ_n = p/n \rightarrow γ> 0$. Yet, many datasets -- perhaps most -- have very different numbers of rows and columns. We… ▽ More

    Submitted 30 July, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

  2. arXiv:2106.02073  [pdf, other

    cs.LG cs.AI math.DG math.OC stat.ML

    Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path

    Authors: X. Y. Han, Vardan Papyan, David L. Donoho

    Abstract: The recently discovered Neural Collapse (NC) phenomenon occurs pervasively in today's deep net training paradigm of driving cross-entropy (CE) loss towards zero. During NC, last-layer features collapse to their class-means, both classifiers and class-means collapse to the same Simplex Equiangular Tight Frame, and classifier behavior collapses to the nearest-class-mean decision rule. Recent works d… ▽ More

    Submitted 9 May, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: ICLR 2022 Outstanding Paper Prize & Oral. Appendix contains [A] empirical experiments, [B-D] proofs of theoretical results, and [E] survey of related works examining Neural Collapse

  3. arXiv:2103.03218  [pdf, ps, other

    math.ST

    The Impossibility Region for Detecting Sparse Mixtures using the Higher Criticism

    Authors: David L. Donoho, Alon Kipnis

    Abstract: Consider a multiple hypothesis testing setting involving rare/weak effects: relatively few tests, out of possibly many, deviate from their null hypothesis behavior. Summarizing the significance of each test by a P-value, we construct a global test against the null using the Higher Criticism (HC) statistics of these P-values. We calibrate the rare/weak model using parameters controlling the asympto… ▽ More

    Submitted 19 October, 2021; v1 submitted 15 February, 2021; originally announced March 2021.

    MSC Class: 2010; Primary: 62H17; 62H15

  4. arXiv:2009.12297  [pdf, other

    math.ST stat.ME

    ScreeNOT: Exact MSE-Optimal Singular Value Thresholding in Correlated Noise

    Authors: David L. Donoho, Matan Gavish, Elad Romanov

    Abstract: We derive a formula for optimal hard thresholding of the singular value decomposition in the presence of correlated additive noise; although it nominally involves unobservables, we show how to apply it even where the noise covariance structure is not a-priori known or is not independently estimable. The proposed method, which we call ScreeNOT, is a mathematically solid alternative to Cattell's e… ▽ More

    Submitted 26 March, 2023; v1 submitted 25 September, 2020; originally announced September 2020.

    Journal ref: Annals of Statistics, 2023

  5. arXiv:2007.01958  [pdf, other

    math.ST stat.CO

    Higher Criticism to Compare Two Large Frequency Tables, with sensitivity to Possible Rare and Weak Differences

    Authors: David L. Donoho, Alon Kipnis

    Abstract: We adapt Higher Criticism (HC) to the comparison of two frequency tables which may -- or may not -- exhibit moderate differences between the tables in some unknown, relatively small subset out of a large number of categories. Our analysis of the power of the proposed HC test quantifies the rarity and size of assumed differences and applies moderate deviations-analysis to determine the asymptotic p… ▽ More

    Submitted 21 June, 2022; v1 submitted 3 July, 2020; originally announced July 2020.

    MSC Class: 62H17; 62H15; 62G10

    Journal ref: Annals of Statistics 2022, Vol. 50, No. 3, 1447-1472

  6. arXiv:1810.07403  [pdf, ps, other

    math.ST stat.ME

    Optimal Covariance Estimation for Condition Number Loss in the Spiked Model

    Authors: David L. Donoho, Behrooz Ghorbani

    Abstract: We study estimation of the covariance matrix under relative condition number loss $κ(Σ^{-1/2} \hatΣ Σ^{-1/2})$, where $κ(Δ)$ is the condition number of matrix $Δ$, and $\hatΣ$ and $Σ$ are the estimated and theoretical covariance matrices. Optimality in $κ$-loss provides optimal guarantees in two stylized applications: Multi-User Covariance Estimation and Multi-Task Linear Discriminant Analysis. We… ▽ More

    Submitted 17 October, 2018; originally announced October 2018.

    Comments: 85 pages, 4 figures

  7. arXiv:1503.02106  [pdf, other

    math.ST

    Variance Breakdown of Huber (M)-estimators: $n/p \rightarrow m \in (1,\infty)$

    Authors: David L. Donoho, Andrea Montanari

    Abstract: A half century ago, Huber evaluated the minimax asymptotic variance in scalar location estimation, $ \min_ψ\max_{F \in {\cal F}_ε} V(ψ, F) = \frac{1}{I(F_ε^*)} $, where $V(ψ,F)$ denotes the asymptotic variance of the $(M)$-estimator for location with score function $ψ$, and $I(F_ε^*)$ is the minimal Fisher information $ \min_{{\cal F}_ε} I(F)$ over the class of $ε$-Contaminated Normal distribution… ▽ More

    Submitted 6 March, 2015; originally announced March 2015.

    Comments: Based on a lecture delivered at a special colloquium honoring the 50th anniversary of the Seminar für Statistik (SfS) at ETH Zürich, November 25, 2014

    MSC Class: 62C20; 62J05; 62G35

  8. arXiv:1405.7511  [pdf, ps, other

    math.ST

    Optimal Shrinkage of Singular Values

    Authors: Matan Gavish, David L. Donoho

    Abstract: We consider recovery of low-rank matrices from noisy data by shrinkage of singular values, in which a single, univariate nonlinearity is applied to each of the empirical singular values. We adopt an asymptotic framework, in which the matrix size is much larger than the rank of the signal matrix to be recovered, and the signal-to-noise ratio of the low-rank piece stays constant. For a variety of lo… ▽ More

    Submitted 15 May, 2016; v1 submitted 29 May, 2014; originally announced May 2014.

  9. arXiv:1311.0851  [pdf, ps, other

    math.ST

    Optimal Shrinkage of Eigenvalues in the Spiked Covariance Model

    Authors: David L. Donoho, Matan Gavish, Iain M. Johnstone

    Abstract: We show that in a common high-dimensional covariance model, the choice of loss function has a profound effect on optimal estimation. In an asymptotic framework based on the Spiked Covariance model and use of orthogonally invariant estimators, we show that optimal estimation of the population covariance matrix boils down to design of an optimal shrinker $η$ that acts elementwise on the sample eigen… ▽ More

    Submitted 4 June, 2017; v1 submitted 4 November, 2013; originally announced November 2013.

  10. The Phase Transition of Matrix Recovery from Gaussian Measurements Matches the Minimax MSE of Matrix Denoising

    Authors: David L. Donoho, Matan Gavish, Andrea Montanari

    Abstract: Let $X_0$ be an unknown $M$ by $N$ matrix. In matrix recovery, one takes $n < MN$ linear measurements $y_1,..., y_n$ of $X_0$, where $y_i = \Tr(a_i^T X_0)$ and each $a_i$ is a $M$ by $N$ matrix. For measurement matrices with Gaussian i.i.d entries, it known that if $X_0$ is of low rank, it is recoverable from just a few measurements. A popular approach for matrix recovery is Nuclear Norm Minimizat… ▽ More

    Submitted 10 February, 2013; originally announced February 2013.

  11. arXiv:1112.0708  [pdf, other

    cs.IT cond-mat.stat-mech math.ST

    Information-Theoretically Optimal Compressed Sensing via Spatial Coupling and Approximate Message Passing

    Authors: David L. Donoho, Adel Javanmard, Andrea Montanari

    Abstract: We study the compressed sensing reconstruction problem for a broad class of random, band-diagonal sensing matrices. This construction is inspired by the idea of spatial coupling in coding theory. As demonstrated heuristically and numerically by Krzakala et al. \cite{KrzakalaEtAl}, message passing algorithms can effectively solve the reconstruction problem for spatially coupled measurements with un… ▽ More

    Submitted 18 January, 2013; v1 submitted 3 December, 2011; originally announced December 2011.

    Comments: 60 pages, 7 figures, Sections 3,5 and Appendices A,B are added. The stability constant is quantified (cf Theorem 1.7)

  12. arXiv:1004.3006  [pdf, ps, other

    math.FA cs.IT math.NA

    Microlocal Analysis of the Geometric Separation Problem

    Authors: David L. Donoho, Gitta Kutyniok

    Abstract: Image data are often composed of two or more geometrically distinct constituents; in galaxy catalogs, for instance, one sees a mixture of pointlike structures (galaxy superclusters) and curvelike structures (filaments). It would be ideal to process a single image and extract two geometrically `pure' images, each one containing features from only one of the two geometric constituents. This seems t… ▽ More

    Submitted 18 April, 2010; originally announced April 2010.

    Comments: 59 pages, 9 figures

    Report number: Technical Report No. 2010-01, Statistics Department, Stanford University

  13. arXiv:1004.1218  [pdf, other

    math.ST cs.IT

    The Noise-Sensitivity Phase Transition in Compressed Sensing

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: Consider the noisy underdetermined system of linear equations: y=Ax0 + z0, with n x N measurement matrix A, n < N, and Gaussian white noise z0 ~ N(0,σ^2 I). Both y and A are known, both x0 and z0 are unknown, and we seek an approximation to x0. When x0 has few nonzeros, useful approximations are obtained by l1-penalized l2 minimization, in which the reconstruction \hxl solves min || y - Ax||^2/2… ▽ More

    Submitted 7 April, 2010; originally announced April 2010.

    Comments: 40 pages, 13 pdf figures

  14. arXiv:0909.0777  [pdf, other

    math.NA cs.IT cs.MS

    Optimally Tuned Iterative Reconstruction Algorithms for Compressed Sensing

    Authors: Arian Maleki, David L. Donoho

    Abstract: We conducted an extensive computational experiment, lasting multiple CPU-years, to optimally select parameters for two important classes of algorithms for finding sparse solutions of underdetermined systems of linear equations. We make the optimally tuned implementations available at {\tt sparselab.stanford.edu}; they run `out of the box' with no user tuning: it is not necessary to select thresh… ▽ More

    Submitted 3 September, 2009; originally announced September 2009.

    Comments: 12 pages, 14 figures

  15. arXiv:0906.2530  [pdf, other

    math.ST cs.IT physics.data-an stat.CO

    Observed Universality of Phase Transitions in High-Dimensional Geometry, with Implications for Modern Data Analysis and Signal Processing

    Authors: David L. Donoho, Jared Tanner

    Abstract: We review connections between phase transitions in high-dimensional combinatorial geometry and phase transitions occurring in modern high-dimensional data analysis and signal processing. In data analysis, such transitions arise as abrupt breakdown of linear model selection, robust data fitting or compressed sensing reconstructions, when the complexity of the model or the number of outliers incre… ▽ More

    Submitted 14 June, 2009; originally announced June 2009.

    Comments: 47 pages, 24 figures, 10 tables

  16. arXiv:0807.3590  [pdf, ps, other

    math.MG cs.IT math.OC math.PR

    Counting the Faces of Randomly-Projected Hypercubes and Orthants, with Applications

    Authors: David L. Donoho, Jared Tanner

    Abstract: Let $A$ be an $n$ by $N$ real valued random matrix, and $\h$ denote the $N$-dimensional hypercube. For numerous random matrix ensembles, the expected number of $k$-dimensional faces of the random $n$-dimensional zonotope $A\h$ obeys the formula $E f_k(A\h) /f_k(\h) = 1-P_{N-n,N-k}$, where $P_{N-n,N-k}$ is a fair-coin-tossing probability. The formula applies, for example, where the columns of… ▽ More

    Submitted 22 July, 2008; originally announced July 2008.

    Comments: 21 pages, 3 figures

    MSC Class: 52A22; 52B05; 52B11; 52B12; 62E20; 68P30; 68P25; 68W20; 68W40; 94B20; 94B35; 94B65; 94B70

  17. Does median filtering truly preserve edges better than linear filtering?

    Authors: Ery Arias-Castro, David L. Donoho

    Abstract: Image processing researchers commonly assert that "median filtering is better than linear filtering for removing noise in the presence of edges." Using a straightforward large-$n$ decision-theory framework, this folk-theorem is seen to be false in general. We show that median filtering and linear filtering have similar asymptotic worst-case mean-squared error (MSE) when the signal-to-noise ratio… ▽ More

    Submitted 20 April, 2009; v1 submitted 14 December, 2006; originally announced December 2006.

    Comments: Published in at http://dx.doi.org/10.1214/08-AOS604 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS604 MSC Class: 62G08; 62G20 (Primary) 60G35 (Secondary)

    Journal ref: Annals of Statistics 2009, Vol. 37, No. 3, 1172-1206

  18. arXiv:math/0607364  [pdf, ps, other

    math.MG math.NA math.PR math.ST

    Counting faces of randomly-projected polytopes when the projection radically lowers dimension

    Authors: David L. Donoho, Jared Tanner

    Abstract: This paper develops asymptotic methods to count faces of random high-dimensional polytopes. Beyond its intrinsic interest, our conclusions have surprising implications - in statistics, probability, information theory, and signal processing - with potential impacts in practical subjects like medical imaging and digital communications. Three such implications concern: convex hulls of Gaussian poin… ▽ More

    Submitted 26 September, 2006; v1 submitted 15 July, 2006; originally announced July 2006.

    Comments: 56 pages

    MSC Class: 52A22; 52B05; 52B11; 52B12; 62E20; 68P30; 68P25; 68W20; 68W40; 94B20 94B35; 94B65; 94B70

  19. Adaptive multiscale detection of filamentary structures in a background of uniform random points

    Authors: Ery Arias-Castro, David L. Donoho, Xiaoming Huo

    Abstract: We are given a set of $n$ points that might be uniformly distributed in the unit square $[0,1]^2$. We wish to test whether the set, although mostly consisting of uniformly scattered points, also contains a small fraction of points sampled from some (a priori unknown) curve with $C^α$-norm bounded by $β$. An asymptotic detection threshold exists in this problem; for a constant $T_-(α,β)>0$, if th… ▽ More

    Submitted 18 May, 2006; originally announced May 2006.

    Comments: Published at http://dx.doi.org/10.1214/009053605000000787 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0097 MSC Class: 62M30 (Primary) 62G10; 62G20 (Secondary)

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 1, 326-349

  20. arXiv:math/0603673  [pdf, ps, other

    math.PR

    Correction. Connect The Dots: How Many Random Points Can A Regular Curve Pass Through?

    Authors: E. Arias-Castro, D. L. Donoho, X. Huo, C. A. Tovey

    Abstract: Correction for Adv. in Appl. Probab. 37, no. 3 (2005), 571-603

    Submitted 28 March, 2006; originally announced March 2006.

    Comments: 2 pages, 1 figure

    MSC Class: 60D05; 62M40

  21. arXiv:math/0505374  [pdf, ps, other

    math.ST

    Adapting to Unknown Sparsity by controlling the False Discovery Rate

    Authors: Felix Abramovich, Yoav Benjamini, David L. Donoho, Iain M. Johnstone

    Abstract: We attempt to recover an $n$-dimensional vector observed in white noise, where $n$ is large and the vector is known to be sparse, but the degree of sparsity is unknown. We consider three different ways of defining sparsity of a vector: using the fraction of nonzero terms; imposing power-law decay bounds on the ordered entries; and controlling the $\ell_p$ norm for $p$ small. We obtain a procedur… ▽ More

    Submitted 18 May, 2005; originally announced May 2005.

    Comments: This is a complete version of a paper to appear in Annals of Statitistics. The paper in AoS has certain proofs abbreviated that are given here in detail

    MSC Class: 62F10; 62G12

  22. arXiv:math/0212395  [pdf, ps, other

    math.ST

    Emerging applications of geometric multiscale analysis

    Authors: David L. Donoho

    Abstract: Classical multiscale analysis based on wavelets has a number of successful applications, e.g. in data compression, fast algorithms, and noise removal. Wavelets, however, are adapted to point singularities, and many phenomena in several variables exhibit intermediate-dimensional singularities, such as edges, filaments, and sheets. This suggests that in higher dimensions, wavelets ought to be repl… ▽ More

    Submitted 30 November, 2002; originally announced December 2002.

    Report number: ICM-2002 MSC Class: 41A30; 41A58; 41A63; 62G07; 62G08; 94A08; 94A11; 94A12; 94A29

    Journal ref: Proceedings of the ICM, Beijing 2002, vol. 1, 209--233