Skip to main content

Showing 1–16 of 16 results for author: Lu, Y M

Searching in archive cond-mat. Search in all archives.
.
  1. arXiv:2501.03937  [pdf, other

    cs.LG cond-mat.dis-nn

    A precise asymptotic analysis of learning diffusion models: theory and insights

    Authors: Hugo Cui, Cengiz Pehlevan, Yue M. Lu

    Abstract: In this manuscript, we consider the problem of learning a flow or diffusion-based generative model parametrized by a two-layer auto-encoder, trained with online stochastic gradient descent, on a high-dimensional target density with an underlying low-dimensional manifold structure. We derive a tight asymptotic characterization of low-dimensional projections of the distribution of samples generated… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  2. arXiv:2405.11751  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Asymptotic theory of in-context learning by linear attention

    Authors: Yue M. Lu, Mary I. Letey, Jacob A. Zavatone-Veth, Anindita Maiti, Cengiz Pehlevan

    Abstract: Transformers have a remarkable ability to learn and execute tasks based on examples provided within the input itself, without explicit prior training. It has been argued that this capability, known as in-context learning (ICL), is a cornerstone of Transformers' success, yet questions about the necessary sample complexity, pretraining task diversity, and context length for successful ICL remain unr… ▽ More

    Submitted 4 February, 2025; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: 17 pages (main doc), 6 figures, and supplementary information (23 pages)

  3. arXiv:2402.04980  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Asymptotics of feature learning in two-layer networks after one gradient-step

    Authors: Hugo Cui, Luca Pesce, Yatin Dandi, Florent Krzakala, Yue M. Lu, Lenka Zdeborová, Bruno Loureiro

    Abstract: In this manuscript, we investigate the problem of how two-layer neural networks learn features from data, and improve over the kernel regime, after being trained with a single gradient descent step. Leveraging the insight from (Ba et al., 2022), we model the trained network by a spiked Random Features (sRF) model. Further building on recent progress on Gaussian universality (Dandi et al., 2023), w… ▽ More

    Submitted 4 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:9662-9695, 2024

  4. arXiv:2212.05278  [pdf, other

    cond-mat.str-el cond-mat.other

    Ring-Exchange Interaction Effects on Magnons in Dirac Magnet CoTiO$_3$

    Authors: Yufei Li, Thuc T. Mai, M. Karaki, E. V. Jasper, K. F. Garrity, C. Lyon, D. Shaw, T. DeLazzer, A. J. Biacchi, R. L. Dally, D. M. Heligman, J. Gdanski, T. Adel, M. F. Muñoz, A. Giovannone, A. Pawbake, C. Faugeras, J. R. Simpson, K. Ross, N. Trivedi, Y. M. Lu, A. R. Hight Walker, R. Valdés Aguilar

    Abstract: The magnetic interactions that determine magnetic order and magnon energies typically involve only two spins. While rare, multi-spin interactions can also appear in quantum magnets and be the driving force in the ground state selection and in the nature of its excitations. By performing time-domain terahertz and magneto-Raman spectroscopy measurements combined with theoretical modeling, we determi… ▽ More

    Submitted 4 June, 2024; v1 submitted 10 December, 2022; originally announced December 2022.

    Comments: 8 pages, 4 figures in main text, 27 pages and 11 figures in supplement. Published May 21st, 2024 in PRB

    Journal ref: Phys. Rev. B. 109, 184436 (2024)

  5. arXiv:2012.04524  [pdf, other

    cs.IT cond-mat.dis-nn

    Construction of optimal spectral methods in phase retrieval

    Authors: Antoine Maillard, Florent Krzakala, Yue M. Lu, Lenka Zdeborová

    Abstract: We consider the phase retrieval problem, in which the observer wishes to recover a $n$-dimensional real or complex signal $\mathbf{X}^\star$ from the (possibly noisy) observation of $|\mathbfΦ \mathbf{X}^\star|$, in which $\mathbfΦ$ is a matrix of size $m \times n$. We consider a \emph{high-dimensional} setting where $n,m \to \infty$ with $m/n = \mathcal{O}(1)$, and a large class of (possibly corr… ▽ More

    Submitted 14 October, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 14 pages + references and appendix. v2: Version updated to match the one accepted at MSML 2021. v3: Adding a reference to a previous work mentioning marginal stability and its connection to Bayes-optimality

    Journal ref: Proceedings of Machine Learning Research vol 145:1-28, 2021 2nd Annual Conference on Mathematical and Scientific Machine Learning (MSML 21)

  6. arXiv:2006.06560  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization

    Authors: Benjamin Aubin, Florent Krzakala, Yue M. Lu, Lenka Zdeborová

    Abstract: We consider a commonly studied supervised classification of a synthetic dataset whose labels are generated by feeding a one-layer neural network with random iid inputs. We study the generalization performances of standard classifiers in the high-dimensional regime where $α=n/d$ is kept finite in the limit of a high dimension $d$ and number of samples $n$. Our contribution is three-fold: First, we… ▽ More

    Submitted 7 November, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 11 pages + 45 pages Supplementary Material / 5 figures, v2 revised and accepted at NeurIPS

    Journal ref: Advances in Neural Information Processing Systems, v33, pages 12199--12210, 2020

  7. arXiv:2002.11544  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    The role of regularization in classification of high-dimensional noisy Gaussian mixture

    Authors: Francesca Mignacco, Florent Krzakala, Yue M. Lu, Lenka Zdeborová

    Abstract: We consider a high-dimensional mixture of two Gaussians in the noisy regime where even an oracle knowing the centers of the clusters misclassifies a small but finite fraction of the points. We provide a rigorous analysis of the generalization error of regularized convex classifiers, including ridge, hinge and logistic regression, in the high-dimensional limit where the number $n$ of samples and th… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

    Comments: 8 pages + appendix, 6 figures

    Journal ref: International Conference on Machine Learning, ICML 2020

  8. arXiv:1905.05313  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.IT cs.LG

    Generalized Approximate Survey Propagation for High-Dimensional Estimation

    Authors: Luca Saglietti, Yue M. Lu, Carlo Lucibello

    Abstract: In Generalized Linear Estimation (GLE) problems, we seek to estimate a signal that is observed through a linear transform followed by a component-wise, possibly nonlinear and noisy, channel. In the Bayesian optimal setting, Generalized Approximate Message Passing (GAMP) is known to achieve optimal performance for GLE. However, its performance can significantly degrade whenever there is a mismatch… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

    Journal ref: ICML 2019

  9. arXiv:1805.08349  [pdf, other

    cs.LG cond-mat.dis-nn cs.IT stat.ML

    A Solvable High-Dimensional Model of GAN

    Authors: Chuang Wang, Hong Hu, Yue M. Lu

    Abstract: We present a theoretical analysis of the training process for a single-layer GAN fed by high-dimensional input data. The training dynamics of the proposed model at both microscopic and macroscopic scales can be exactly analyzed in the high-dimensional limit. In particular, we prove that the macroscopic quantities measuring the quality of the training process converge to a deterministic process cha… ▽ More

    Submitted 28 October, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: Accepted by 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  10. arXiv:1805.06834  [pdf, other

    cs.LG cond-mat.dis-nn cs.IT stat.ML

    Subspace Estimation from Incomplete Observations: A High-Dimensional Analysis

    Authors: Chuang Wang, Yonina C. Eldar, Yue M. Lu

    Abstract: We present a high-dimensional analysis of three popular algorithms, namely, Oja's method, GROUSE and PETRELS, for subspace estimation from streaming and highly incomplete observations. We show that, with proper time scaling, the time-varying principal angles between the true subspace and its estimates given by the algorithms converge weakly to deterministic processes when the ambient dimension… ▽ More

    Submitted 17 October, 2018; v1 submitted 17 May, 2018; originally announced May 2018.

    Comments: 26 pages, 6 figures

  11. arXiv:1710.05384  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    The Scaling Limit of High-Dimensional Online Independent Component Analysis

    Authors: Chuang Wang, Yue M. Lu

    Abstract: We analyze the dynamics of an online algorithm for independent component analysis in the high-dimensional scaling limit. As the ambient dimension tends to infinity, and with proper time scaling, we show that the time-varying joint empirical measure of the target feature vector and the estimates provided by the algorithm will converge weakly to a deterministic measured-valued process that can be ch… ▽ More

    Submitted 6 November, 2017; v1 submitted 15 October, 2017; originally announced October 2017.

    Comments: 10 pages, 3 figures, 31st Conference on Neural Information Processing Systems (NIPS 2017)

  12. arXiv:1609.02191  [pdf, other

    cs.IT cond-mat.dis-nn

    Online Learning for Sparse PCA in High Dimensions: Exact Dynamics and Phase Transitions

    Authors: Chuang Wang, Yue M. Lu

    Abstract: We study the dynamics of an online algorithm for learning a sparse leading eigenvector from samples generated from a spiked covariance model. This algorithm combines the classical Oja's method for online PCA with an element-wise nonlinearity at each iteration to promote sparsity. In the high-dimensional limit, the joint empirical measure of the underlying sparse eigenvector and its estimate provid… ▽ More

    Submitted 7 September, 2016; originally announced September 2016.

    Comments: 5 pages

  13. arXiv:1305.1361  [pdf, ps, other

    cond-mat.supr-con

    A new collective mode in YBCO observed by time-domain reflectometry

    Authors: J. P. Hinton, J. D. Koralek, Y. M. Lu, A. Vishwanath, J. Orenstein, D. A. Bonn, W. N. Hardy, Ruixing Liang

    Abstract: We report the observation of coherent oscillations associated with charge density wave (CDW) order in the underdoped cuprate superconductor YBa2Cu3O6+x by time-resolved optical reflectivity. Oscillations with frequency 1.87 THz onset at approximately 105 K and 130 K for dopings of x = 0.67 (ortho-VIII) and x = 0.75 (ortho-III), respectively. Upon cooling below the superconducting critical temperat… ▽ More

    Submitted 6 May, 2013; originally announced May 2013.

  14. arXiv:1210.6731  [pdf

    physics.chem-ph cond-mat.mtrl-sci physics.optics

    Phototransistor Behavior Based on Dye-Sensitized Solar Cell

    Authors: X. Q. Wang, C. B. Cai, Y. F. Wang, W. Q. Zhou, Y. M. Lu, Z. Y. Liu

    Abstract: In the present work, a light-controlled device cell is established based on the dye-sensitized solar cell using nanocrystalline TiO2 films. Voltage-current curves are characterized by three types of transport behaviors: linear increase, saturated plateau and breakdown-like increase, which are actually of the typical performances for a photo-gated transistor. Moreover, an asymmetric behavior is obs… ▽ More

    Submitted 24 October, 2012; originally announced October 2012.

    Comments: 5 figures

    Journal ref: Appl. Phys. Lett. 95, 011112 (2009)

  15. arXiv:1210.3103  [pdf

    cond-mat.mtrl-sci

    Unconventional Scaling of the Anomalous Hall Effect Accompanying Electron Localization Correction in the Dirty Regime

    Authors: Y. M. Lu, J. W. Cai, Zaibing Guo, X. X. Zhang

    Abstract: Scaling of the anomalous Hall conductivity to longitudinal conductivity, has been observed in the dirty regime of two-dimensional weak and strong localization regions in ultrathin, polycrystalline, chemically disordered, ferromagnetic FePt films. The relationship between electron transport and temperature reveals a quantitatively insignificant Coulomb interaction in these films while the temperatu… ▽ More

    Submitted 10 October, 2012; originally announced October 2012.

    Journal ref: Phys. Rev. B 87, 094405 (2013)

  16. arXiv:1208.0960  [pdf

    cond-mat.supr-con

    Dynamical Interplay between Coexisting Orders in the Electron-Doped Cuprate Superconductor Nd_{2-x}Ce_xCuO_4

    Authors: J. P. Hinton, J. D. Koralek, G. Yu, E. M. Motoyama, Y. M. Lu, A. Vishwanath, M. Greven, J. Orenstein

    Abstract: We use coherent pump-probe spectroscopy to measure the photoinduced reflectivity \DeltaR, and complex dielectric function, δ\in, of the electron-doped cuprate superconductor Nd_{2-x}Ce_xCuO_{4+δ} at a value of x near optimal doping, as a function of time, temperature, and laser fluence. We observe the onset of a negative \DeltaR at T=85 K, above the superconducting transition temperature, T_c, of… ▽ More

    Submitted 4 August, 2012; originally announced August 2012.