Skip to main content

Showing 1–14 of 14 results for author: Lu, Y M

Searching in archive math. Search in all archives.
.
  1. arXiv:2504.15558  [pdf, other

    math.ST

    Dynamical mean-field analysis of adaptive Langevin diffusions: Replica-symmetric fixed point and empirical Bayes

    Authors: Zhou Fan, Justin Ko, Bruno Loureiro, Yue M. Lu, Yandi Shen

    Abstract: In many applications of statistical estimation via sampling, one may wish to sample from a high-dimensional target distribution that is adaptively evolving to the samples already seen. We study an example of such dynamics, given by a Langevin diffusion for posterior sampling in a Bayesian linear regression model with i.i.d. regression design, whose prior continuously adapts to the Langevin traject… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  2. arXiv:2504.15556  [pdf, ps, other

    math.ST math.PR

    Dynamical mean-field analysis of adaptive Langevin diffusions: Propagation-of-chaos and convergence of the linear response

    Authors: Zhou Fan, Justin Ko, Bruno Loureiro, Yue M. Lu, Yandi Shen

    Abstract: Motivated by an application to empirical Bayes learning in high-dimensional regression, we study a class of Langevin diffusions in a system with random disorder, where the drift coefficient is driven by a parameter that continuously adapts to the empirical distribution of the realized process up to the current time. The resulting dynamics take the form of a stochastic interacting particle system h… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  3. arXiv:2410.18938  [pdf, other

    stat.ML cs.LG math.ST

    A Random Matrix Theory Perspective on the Spectrum of Learned Features and Asymptotic Generalization Capabilities

    Authors: Yatin Dandi, Luca Pesce, Hugo Cui, Florent Krzakala, Yue M. Lu, Bruno Loureiro

    Abstract: A key property of neural networks is their capacity of adapting to data during training. Yet, our current mathematical understanding of feature learning and its relationship to generalization remain limited. In this work, we provide a random matrix analysis of how fully-connected two-layer neural networks adapt to the target function after a single, but aggressive, gradient descent step. We rigoro… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

  4. arXiv:2403.08160  [pdf, other

    stat.ML cs.LG math.ST

    Asymptotics of Random Feature Regression Beyond the Linear Scaling Regime

    Authors: Hong Hu, Yue M. Lu, Theodor Misiakiewicz

    Abstract: Recent advances in machine learning have been achieved by using overparametrized models trained until near interpolation of the training data. It was shown, e.g., through the double descent phenomenon, that the number of parameters is a poor proxy for the model complexity and generalization capabilities. This leaves open the question of understanding the impact of parametrization on the performanc… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 106 pages, 8 figures

  5. arXiv:2310.18280  [pdf, ps, other

    math.PR stat.ML

    Universality for the global spectrum of random inner-product kernel matrices in the polynomial regime

    Authors: Sofiia Dubova, Yue M. Lu, Benjamin McKenna, Horng-Tzer Yau

    Abstract: We consider certain large random matrices, called random inner-product kernel matrices, which are essentially given by a nonlinear function $f$ applied entrywise to a sample-covariance matrix, $f(X^TX)$, where $X \in \mathbb{R}^{d \times N}$ is random and normalized in such a way that $f$ typically has order-one arguments. We work in the polynomial regime, where $N \asymp d^\ell$ for some… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 43 pages, no figures

    MSC Class: 60B20; 15B52

  6. arXiv:2208.02753  [pdf, other

    cs.IT math.PR math.ST

    Spectral Universality of Regularized Linear Regression with Nearly Deterministic Sensing Matrices

    Authors: Rishabh Dudeja, Subhabrata Sen, Yue M. Lu

    Abstract: It has been observed that the performances of many high-dimensional estimation problems are universal with respect to underlying sensing (or design) matrices. Specifically, matrices with markedly different constructions seem to achieve identical performance if they share the same spectral distribution and have ``generic'' singular vectors. We prove this universality phenomenon for the case of conv… ▽ More

    Submitted 20 July, 2023; v1 submitted 4 August, 2022; originally announced August 2022.

  7. arXiv:2205.06308  [pdf, other

    math.PR stat.ML

    An Equivalence Principle for the Spectrum of Random Inner-Product Kernel Matrices with Polynomial Scalings

    Authors: Yue M. Lu, Horng-Tzer Yau

    Abstract: We investigate random matrices whose entries are obtained by applying a nonlinear kernel function to pairwise inner products between $n$ independent data vectors, drawn uniformly from the unit sphere in $\mathbb{R}^d$. This study is motivated by applications in machine learning and statistics, where these kernel random matrices and their spectral properties play significant roles. We establish the… ▽ More

    Submitted 5 May, 2023; v1 submitted 12 May, 2022; originally announced May 2022.

  8. arXiv:2204.04281  [pdf, other

    math.PR

    Universality of Approximate Message Passing with Semi-Random Matrices

    Authors: Rishabh Dudeja, Yue M. Lu, Subhabrata Sen

    Abstract: Approximate Message Passing (AMP) is a class of iterative algorithms that have found applications in many problems in high-dimensional statistics and machine learning. In its general form, AMP can be formulated as an iterative procedure driven by a matrix $\mathbf{M}$. Theoretical analyses of AMP typically assume strong distributional properties on $\mathbf{M}$ such as $\mathbf{M}$ has i.i.d. sub-… ▽ More

    Submitted 1 May, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

  9. arXiv:2006.06560  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization

    Authors: Benjamin Aubin, Florent Krzakala, Yue M. Lu, Lenka Zdeborová

    Abstract: We consider a commonly studied supervised classification of a synthetic dataset whose labels are generated by feeding a one-layer neural network with random iid inputs. We study the generalization performances of standard classifiers in the high-dimensional regime where $α=n/d$ is kept finite in the limit of a high dimension $d$ and number of samples $n$. Our contribution is three-fold: First, we… ▽ More

    Submitted 7 November, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 11 pages + 45 pages Supplementary Material / 5 figures, v2 revised and accepted at NeurIPS

    Journal ref: Advances in Neural Information Processing Systems, v33, pages 12199--12210, 2020

  10. arXiv:2002.11544  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    The role of regularization in classification of high-dimensional noisy Gaussian mixture

    Authors: Francesca Mignacco, Florent Krzakala, Yue M. Lu, Lenka Zdeborová

    Abstract: We consider a high-dimensional mixture of two Gaussians in the noisy regime where even an oracle knowing the centers of the clusters misclassifies a small but finite fraction of the points. We provide a rigorous analysis of the generalization error of regularized convex classifiers, including ridge, hinge and logistic regression, in the high-dimensional limit where the number $n$ of samples and th… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

    Comments: 8 pages + appendix, 6 figures

    Journal ref: International Conference on Machine Learning, ICML 2020

  11. arXiv:1903.11582  [pdf, ps, other

    cs.IT math.ST

    SLOPE for Sparse Linear Regression:Asymptotics and Optimal Regularization

    Authors: Hong Hu, Yue M. Lu

    Abstract: In sparse linear regression, the SLOPE estimator generalizes LASSO by penalizing different coordinates of the estimate according to their magnitudes. In this paper, we present a precise performance characterization of SLOPE in the asymptotic regime where the number of unknown parameters grows in proportion to the number of observations. Our asymptotic characterization enables us to derive the fund… ▽ More

    Submitted 4 June, 2021; v1 submitted 27 March, 2019; originally announced March 2019.

  12. arXiv:1809.09573  [pdf, other

    cs.LG cs.IT eess.SP math.OC math.ST stat.ML

    Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

    Authors: Yuejie Chi, Yue M. Lu, Yuxin Chen

    Abstract: Substantial progress has been made recently on developing provably accurate and efficient algorithms for low-rank matrix factorization via nonconvex optimization. While conventional wisdom often takes a dim view of nonconvex optimization algorithms due to their susceptibility to spurious local minima, simple iterative methods such as gradient descent have been remarkably successful in practice. Th… ▽ More

    Submitted 19 September, 2019; v1 submitted 25 September, 2018; originally announced September 2018.

    Comments: Invited overview article

    Journal ref: IEEE Transactions on Signal Processing, vol. 67, no. 20, pp. 5239-5269, October 2019

  13. arXiv:1712.04332  [pdf, other

    cs.LG cs.IT math.PR stat.ML

    Scaling Limit: Exact and Tractable Analysis of Online Learning Algorithms with Applications to Regularized Regression and PCA

    Authors: Chuang Wang, Jonathan Mattingly, Yue M. Lu

    Abstract: We present a framework for analyzing the exact dynamics of a class of online learning algorithms in the high-dimensional scaling limit. Our results are applied to two concrete examples: online regularized linear regression and principal component analysis. As the ambient dimension tends to infinity, and with proper time scaling, we show that the time-varying joint empirical measures of the target… ▽ More

    Submitted 7 December, 2017; originally announced December 2017.

  14. arXiv:1502.00190  [pdf, ps, other

    math.NA cs.IT math.OC

    Randomized Kaczmarz Algorithm for Inconsistent Linear Systems: An Exact MSE Analysis

    Authors: Chuang Wang, Ameya Agaskar, Yue M. Lu

    Abstract: We provide a complete characterization of the randomized Kaczmarz algorithm (RKA) for inconsistent linear systems. The Kaczmarz algorithm, known in some fields as the algebraic reconstruction technique, is a classical method for solving large-scale overdetermined linear systems through a sequence of projection operators; the randomized Kaczmarz algorithm is a recent proposal by Strohmer and Vershy… ▽ More

    Submitted 31 January, 2015; originally announced February 2015.

    Comments: 5 pages, 1 figure, 1 table