Showing 1–29 of 29 results for author: Donoho, D

Searching in archive cs.
  1. arXiv:2506.19882  [pdf, ps, other]

    cs.LG cs.AI cs.CL cs.CY

    Position: Machine Learning Conferences Should Establish a "Refutations and Critiques" Track

    Authors: Rylan Schaeffer, Joshua Kazdan, Yegor Denisov-Blanch, Brando Miranda, Matthias Gerstgrasser, Susan Zhang, Andreas Haupt, Isha Gupta, Elyas Obbad, Jesse Dodge, Jessica Zosa Forde, Koustuv Sinha, Francesco Orabona, Sanmi Koyejo, David Donoho

    Abstract: Science progresses by iteratively advancing and correcting humanity's understanding of the world. In machine learning (ML) research, rapid advancements have led to an explosion of publications, but have also led to misleading, incorrect, flawed or perhaps even fraudulent studies being accepted and sometimes highlighted at ML conferences due to the fallibility of peer review. While such mistakes ar…

    Submitted 30 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

  2. arXiv:2505.00326  [pdf, other]

    cs.LG eess.IV eess.SP stat.CO stat.ME

    Optimal Vector Compressed Sensing Using James Stein Shrinkage

    Authors: Apratim Dey, David Donoho

    Abstract: The trend in modern science and technology is to take vector measurements rather than scalars, ruthlessly scaling to ever higher dimensional vectors. For about two decades now, traditional scalar Compressed Sensing has been synonymous with a Convex Optimization based procedure called Basis Pursuit. In the vector recovery case, the natural tendency is to return to a straightforward vector extension…

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 69 pages

  3. arXiv:2410.22812  [pdf, other]

    cs.LG cs.AI cs.ET math.ST stat.ML

    Universality of the $π^2/6$ Pathway in Avoiding Model Collapse

    Authors: Apratim Dey, David Donoho

    Abstract: Researchers in empirical machine learning recently spotlighted their fears of so-called Model Collapse. They imagined a discard workflow, where an initial generative model is trained with real data, after which the real data are discarded, and subsequently, the model generates synthetic data on which a new model is trained. They came to the conclusion that models degenerate as model-fitting genera…

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 30 pages

  4. arXiv:2410.16713  [pdf, other]

    cs.LG cs.AI

    Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World

    Authors: Joshua Kazdan, Rylan Schaeffer, Apratim Dey, Matthias Gerstgrasser, Rafael Rafailov, David L. Donoho, Sanmi Koyejo

    Abstract: What happens when generative machine learning models are pretrained on web-scale datasets containing data generated by earlier models? Some prior work warns of "model collapse" as the web is overwhelmed by synthetic data; other work suggests the problem can be contained (i.e. collapse can be avoided) by managing how available data are used in pretraining. In this paper, we report experiments on th…

    Submitted 17 March, 2025; v1 submitted 22 October, 2024; originally announced October 2024.

    Comments: Accepted at NeurIPS 2024 Workshops: Mathematics of Modern Machine Learning (M3L) and Attributing Model Behavior at Scale (ATTRIB)

  5. arXiv:2404.01413  [pdf, other]

    cs.LG cs.AI cs.CL cs.ET stat.ML

    Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

    Authors: Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo

    Abstract: The proliferation of generative models, combined with pretraining on web-scale data, raises a timely question: what happens when these models are trained on their own generated outputs? Recent investigations into model-data feedback loops proposed that such loops would lead to a phenomenon termed model collapse, under which performance progressively degrades with each model-data feedback iteration…

    Submitted 29 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  6. arXiv:2308.01839  [pdf, other]

    q-bio.QM cs.CV q-bio.GN stat.AP stat.ML

    Is your data alignable? Principled and interpretable alignability testing and integration of single-cell data

    Authors: Rong Ma, Eric D. Sun, David Donoho, James Zou

    Abstract: Single-cell data integration can provide a comprehensive molecular view of cells, and many algorithms have been developed to remove unwanted technical or biological variations and integrate heterogeneous single-cell datasets. Despite their wide usage, existing methods suffer from several fundamental limitations. In particular, we lack a rigorous statistical test for whether two high-dimensional si…

    Submitted 29 February, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Journal ref: Proceedings of the National Academy of Sciences, 2024, 121(10) e2313719121

  7. arXiv:2106.07053  [pdf, other]

    cs.IT cs.AI eess.SY math.ST stat.OT

    Convex Sparse Blind Deconvolution

    Authors: Qingyun Sun, David Donoho

    Abstract: In the blind deconvolution problem, we observe the convolution of an unknown filter and unknown signal and attempt to reconstruct the filter and signal. The problem seems impossible in general, since there are seemingly many more unknowns than knowns. Nevertheless, this problem arises in many application fields; and empirically, some of these fields have had success using heuristic methods -- eve…

    Submitted 13 June, 2021; originally announced June 2021.

  8. arXiv:2106.02073  [pdf, other]

    cs.LG cs.AI math.DG math.OC stat.ML

    Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path

    Authors: X. Y. Han, Vardan Papyan, David L. Donoho

    Abstract: The recently discovered Neural Collapse (NC) phenomenon occurs pervasively in today's deep net training paradigm of driving cross-entropy (CE) loss towards zero. During NC, last-layer features collapse to their class-means, both classifiers and class-means collapse to the same Simplex Equiangular Tight Frame, and classifier behavior collapses to the nearest-class-mean decision rule. Recent works d…

    Submitted 9 May, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: ICLR 2022 Outstanding Paper Prize & Oral. Appendix contains [A] empirical experiments, [B-D] proofs of theoretical results, and [E] survey of related works examining Neural Collapse

  9. arXiv:2008.08186  [pdf, other]

    cs.LG cs.CV stat.ML

    Prevalence of Neural Collapse during the terminal phase of deep learning training

    Authors: Vardan Papyan, X. Y. Han, David L. Donoho

    Abstract: Modern practice for training classification deepnets involves a Terminal Phase of Training (TPT), which begins at the epoch where training error first vanishes; during TPT, the training error stays effectively zero while training loss is pushed towards zero. Direct measurements of TPT, for three prototypical deepnet architectures and across seven canonical classification datasets, expose a pervasi…

    Submitted 21 August, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

  10. arXiv:1906.03742  [pdf, other]

    cs.LG stat.ML

    Degrees of Freedom Analysis of Unrolled Neural Networks

    Authors: Morteza Mardani, Qingyun Sun, Vardan Papyan, Shreyas Vasanawala, John Pauly, David Donoho

    Abstract: Unrolled neural networks emerged recently as an effective model for learning inverse maps appearing in image restoration tasks. However, their generalization risk (i.e., test mean-squared-error) and its link to network design and train sample size remains mysterious. Leveraging the Stein's Unbiased Risk Estimator (SURE), this paper analyzes the generalization risk with its bias and variance compon…

    Submitted 9 June, 2019; originally announced June 2019.

  11. arXiv:1901.08705  [pdf, other]

    cs.DC

    Ambitious Data Science Can Be Painless

    Authors: Hatef Monajemi, Riccardo Murri, Eric Jonas, Percy Liang, Victoria Stodden, David L. Donoho

    Abstract: Modern data science research can involve massive computational experimentation; an ambitious PhD in computational fields may do experiments consuming several million CPU hours. Traditional computing practices, in which researchers use laptops or shared campus-resident resources, are inadequate for experiments at the massive scale and varied scope that we now see in data science. On the other hand,…

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: Submitted to Harvard Data Science Review

  12. arXiv:1806.03963  [pdf, other]

    cs.CV cs.LG

    Neural Proximal Gradient Descent for Compressive Imaging

    Authors: Morteza Mardani, Qingyun Sun, Shreyas Vasanawala, Vardan Papyan, Hatef Monajemi, John Pauly, David Donoho

    Abstract: Recovering high-resolution images from limited sensory data typically leads to a serious ill-posed inverse problem, demanding inversion algorithms that effectively capture the prior information. Learning a good inverse mapping from training data faces severe challenges, including: (i) scarcity of training data; (ii) need for plausible reconstructions that are physically feasible; (iii) need for fa…

    Submitted 1 June, 2018; originally announced June 2018.

    Comments: arXiv admin note: text overlap with arXiv:1711.10046

  13. arXiv:1711.10046  [pdf, other]

    cs.AI cs.IR cs.LG

    Recurrent Generative Adversarial Networks for Proximal Learning and Automated Compressive Image Recovery

    Authors: Morteza Mardani, Hatef Monajemi, Vardan Papyan, Shreyas Vasanawala, David Donoho, John Pauly

    Abstract: Recovering images from undersampled linear measurements typically leads to an ill-posed linear inverse problem, that asks for proper statistical priors. Building effective priors is however challenged by the low train and test overhead dictated by real-time tasks; and the need for retrieving visually "plausible" and physically "feasible" images with minimal hallucination. To cope with these challe…

    Submitted 27 November, 2017; originally announced November 2017.

    Comments: 11 pages, 11 figures

  14. arXiv:1702.03062  [pdf, other]

    cs.IT

    Sparsity/Undersampling Tradeoffs in Anisotropic Undersampling, with Applications in MR Imaging/Spectroscopy

    Authors: Hatef Monajemi, David L. Donoho

    Abstract: We study anisotropic undersampling schemes like those used in multi-dimensional NMR spectroscopy and MR imaging, which sample exhaustively in certain time dimensions and randomly in others. Our analysis shows that anisotropic undersampling schemes are equivalent to certain block-diagonal measurement systems. We develop novel exact formulas for the sparsity/undersampling tradeoffs in such measure…

    Submitted 16 March, 2018; v1 submitted 9 February, 2017; originally announced February 2017.

  15. arXiv:1606.00925  [pdf, other]

    cs.LG stat.ML

    Convolutional Imputation of Matrix Networks

    Authors: Qingyun Sun, Mengyuan Yan, David Donoho, Stephen Boyd

    Abstract: A matrix network is a family of matrices, with relatedness modeled by a weighted graph. We consider the task of completing a partially observed matrix network. We assume a novel sampling scheme where a fraction of matrices might be completely unobserved. How can we recover the entire matrix network from incomplete observations? This mathematical problem arises in many applications including medica…

    Submitted 7 June, 2018; v1 submitted 2 June, 2016; originally announced June 2016.

    Comments: Accepted by ICML 2018

  16. arXiv:1310.7320  [pdf, other]

    math.ST cs.IT

    High Dimensional Robust M-Estimation: Asymptotic Variance via Approximate Message Passing

    Authors: David Donoho, Andrea Montanari

    Abstract: In a recent article (Proc. Natl. Acad. Sci., 110(36), 14557-14562), El Karoui et al. study the distribution of robust regression estimators in the regime in which the number of parameters p is of the same order as the number of samples n. Using numerical simulations and 'highly plausible' heuristic arguments, they unveil a striking new phenomenon. Namely, the regression coefficients contain an ext…

    Submitted 15 November, 2013; v1 submitted 28 October, 2013; originally announced October 2013.

    Comments: 32 pages, 5 figures (v2 contains numerical simulations)

  17. The Phase Transition of Matrix Recovery from Gaussian Measurements Matches the Minimax MSE of Matrix Denoising

    Authors: David L. Donoho, Matan Gavish, Andrea Montanari

    Abstract: Let $X_0$ be an unknown $M$ by $N$ matrix. In matrix recovery, one takes $n < MN$ linear measurements $y_1, \ldots, y_n$ of $X_0$, where $y_i = \operatorname{Tr}(a_i^T X_0)$ and each $a_i$ is an $M$ by $N$ matrix. For measurement matrices with Gaussian i.i.d. entries, it is known that if $X_0$ is of low rank, it is recoverable from just a few measurements. A popular approach for matrix recovery is Nuclear Norm Minimizat…

    Submitted 10 February, 2013; originally announced February 2013.

  18. arXiv:1112.0708  [pdf, other]

    cs.IT cond-mat.stat-mech math.ST

    Information-Theoretically Optimal Compressed Sensing via Spatial Coupling and Approximate Message Passing

    Authors: David L. Donoho, Adel Javanmard, Andrea Montanari

    Abstract: We study the compressed sensing reconstruction problem for a broad class of random, band-diagonal sensing matrices. This construction is inspired by the idea of spatial coupling in coding theory. As demonstrated heuristically and numerically by Krzakala et al., message passing algorithms can effectively solve the reconstruction problem for spatially coupled measurements with un…

    Submitted 18 January, 2013; v1 submitted 3 December, 2011; originally announced December 2011.

    Comments: 60 pages, 7 figures, Sections 3,5 and Appendices A,B are added. The stability constant is quantified (cf Theorem 1.7)

  19. arXiv:1111.1041  [pdf, other]

    cs.IT math.ST

    Accurate Prediction of Phase Transitions in Compressed Sensing via a Connection to Minimax Denoising

    Authors: David Donoho, Iain Johnstone, Andrea Montanari

    Abstract: Compressed sensing posits that, within limits, one can undersample a sparse signal and yet reconstruct it accurately. Knowing the precise limits to such undersampling is important both for theory and practice. We present a formula that characterizes the allowed undersampling of generalized sparse objects. The formula applies to Approximate Message Passing (AMP) algorithms for compressed sensing, w…

    Submitted 7 January, 2013; v1 submitted 4 November, 2011; originally announced November 2011.

    Comments: 71 pages, 32 pdf figures

  20. arXiv:1103.1943  [pdf, other]

    cs.IT math.ST

    Compressed Sensing over $\ell_p$-balls: Minimax Mean Square Error

    Authors: David Donoho, Iain Johnstone, Arian Maleki, Andrea Montanari

    Abstract: We consider the compressed sensing problem, where the object $x_0 \in \mathbb{R}^N$ is to be recovered from incomplete measurements $y = Ax_0 + z$; here the sensing matrix $A$ is an $n \times N$ random matrix with iid Gaussian entries and $n < N$. A popular method of sparsity-promoting reconstruction is $\ell^1$-penalized least-squares reconstruction (aka LASSO, Basis Pursuit). It is currently popular…

    Submitted 23 March, 2011; v1 submitted 10 March, 2011; originally announced March 2011.

    Comments: 41 pages, 11 pdf figures

  21. arXiv:1004.3006  [pdf, ps, other]

    math.FA cs.IT math.NA

    Microlocal Analysis of the Geometric Separation Problem

    Authors: David L. Donoho, Gitta Kutyniok

    Abstract: Image data are often composed of two or more geometrically distinct constituents; in galaxy catalogs, for instance, one sees a mixture of pointlike structures (galaxy superclusters) and curvelike structures (filaments). It would be ideal to process a single image and extract two geometrically 'pure' images, each one containing features from only one of the two geometric constituents. This seems t…

    Submitted 18 April, 2010; originally announced April 2010.

    Comments: 59 pages, 9 figures

    Report number: Technical Report No. 2010-01, Statistics Department, Stanford University

  22. arXiv:1004.1218  [pdf, other]

    math.ST cs.IT

    The Noise-Sensitivity Phase Transition in Compressed Sensing

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: Consider the noisy underdetermined system of linear equations: y=Ax0 + z0, with n x N measurement matrix A, n < N, and Gaussian white noise z0 ~ N(0,σ^2 I). Both y and A are known, both x0 and z0 are unknown, and we seek an approximation to x0. When x0 has few nonzeros, useful approximations are obtained by l1-penalized l2 minimization, in which the reconstruction \hxl solves min || y - Ax||^2/2…

    Submitted 7 April, 2010; originally announced April 2010.

    Comments: 40 pages, 13 pdf figures

  23. arXiv:0911.4222  [pdf, other]

    cs.IT

    Message Passing Algorithms for Compressed Sensing: II. Analysis and Validation

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: In a recent paper, the authors proposed a new class of low-complexity iterative thresholding algorithms for reconstructing sparse signals from a small set of linear measurements. The new algorithms are broadly referred to as AMP, for approximate message passing. This is the second of two conference papers describing the derivation of these algorithms, connection with related literatur…

    Submitted 21 November, 2009; originally announced November 2009.

    Comments: 5 pages, 3 pdf figures, IEEE Information Theory Workshop, Cairo 2010

  24. arXiv:0911.4219  [pdf, ps, other]

    cs.IT

    Message Passing Algorithms for Compressed Sensing: I. Motivation and Construction

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: In a recent paper, the authors proposed a new class of low-complexity iterative thresholding algorithms for reconstructing sparse signals from a small set of linear measurements. The new algorithms are broadly referred to as AMP, for approximate message passing. This is the first of two conference papers describing the derivation of these algorithms, connection with the related litera…

    Submitted 21 November, 2009; originally announced November 2009.

    Comments: 5 pages, IEEE Information Theory Workshop, Cairo 2010

  25. arXiv:0909.0777  [pdf, other]

    math.NA cs.IT cs.MS

    Optimally Tuned Iterative Reconstruction Algorithms for Compressed Sensing

    Authors: Arian Maleki, David L. Donoho

    Abstract: We conducted an extensive computational experiment, lasting multiple CPU-years, to optimally select parameters for two important classes of algorithms for finding sparse solutions of underdetermined systems of linear equations. We make the optimally tuned implementations available at sparselab.stanford.edu; they run 'out of the box' with no user tuning: it is not necessary to select thresh…

    Submitted 3 September, 2009; originally announced September 2009.

    Comments: 12 pages, 14 figures

  26. arXiv:0907.3574  [pdf, ps, other]

    cs.IT cond-mat.dis-nn stat.CO

    Message Passing Algorithms for Compressed Sensing

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: Compressed sensing aims to undersample certain high-dimensional signals, yet accurately reconstruct them by exploiting signal characteristics. Accurate reconstruction is possible when the object to be recovered is sufficiently sparse in a known basis. Currently, the best known sparsity-undersampling tradeoff is achieved when reconstructing by convex optimization -- which is expensive in importan…

    Submitted 21 July, 2009; originally announced July 2009.

    Comments: 6 pages paper + 9 pages supplementary information, 13 eps figures. Submitted to Proc. Natl. Acad. Sci. USA

  27. arXiv:0906.2530  [pdf, other]

    math.ST cs.IT physics.data-an stat.CO

    Observed Universality of Phase Transitions in High-Dimensional Geometry, with Implications for Modern Data Analysis and Signal Processing

    Authors: David L. Donoho, Jared Tanner

    Abstract: We review connections between phase transitions in high-dimensional combinatorial geometry and phase transitions occurring in modern high-dimensional data analysis and signal processing. In data analysis, such transitions arise as abrupt breakdown of linear model selection, robust data fitting or compressed sensing reconstructions, when the complexity of the model or the number of outliers incre…

    Submitted 14 June, 2009; originally announced June 2009.

    Comments: 47 pages, 24 figures, 10 tables

  28. arXiv:0807.3590  [pdf, ps, other]

    math.MG cs.IT math.OC math.PR

    Counting the Faces of Randomly-Projected Hypercubes and Orthants, with Applications

    Authors: David L. Donoho, Jared Tanner

    Abstract: Let $A$ be an $n$ by $N$ real valued random matrix, and $\h$ denote the $N$-dimensional hypercube. For numerous random matrix ensembles, the expected number of $k$-dimensional faces of the random $n$-dimensional zonotope $A\h$ obeys the formula $E f_k(A\h) /f_k(\h) = 1-P_{N-n,N-k}$, where $P_{N-n,N-k}$ is a fair-coin-tossing probability. The formula applies, for example, where the columns of…

    Submitted 22 July, 2008; originally announced July 2008.

    Comments: 21 pages, 3 figures

    MSC Class: 52A22; 52B05; 52B11; 52B12; 62E20; 68P30; 68P25; 68W20; 68W40; 94B20; 94B35; 94B65; 94B70

  29. The Simplest Solution to an Underdetermined System of Linear Equations

    Authors: David Donoho, Hossein Kakavand, James Mammen

    Abstract: Consider a d*n matrix A, with d<n. The problem of solving for x in y=Ax is underdetermined, and has infinitely many solutions (if there are any). Given y, the minimum Kolmogorov complexity solution (MKCS) of the input x is defined to be an input z (out of many) with minimum Kolmogorov-complexity that satisfies y=Az. One expects that if the actual input is simple enough, then MKCS will recover th…

    Submitted 19 February, 2007; originally announced February 2007.

    Comments: Proceedings of the IEEE International Symposium on Information Theory, Seattle, Washington, July 9-14, 2006