Skip to main content

Showing 1–4 of 4 results for author: Gheissari, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.15655  [pdf, other

    math.ST math.PR stat.ML

    Local geometry of high-dimensional mixture models: Effective spectral theory and dynamical transitions

    Authors: Gerard Ben Arous, Reza Gheissari, Jiaoyang Huang, Aukosh Jagannath

    Abstract: We study the local geometry of empirical risks in high dimensions via the spectral theory of their Hessian and information matrices. We focus on settings where the data, $(Y_\ell)_{\ell =1}^n\in \mathbb R^d$, are i.i.d. draws of a $k$-component Gaussian mixture model, and the loss depends on the projection of the data into a fixed number of vectors, namely $\mathbf{x}^\top Y$, where… ▽ More

    Submitted 15 May, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: Figures added. 59 pages, 7 figures

  2. arXiv:2310.03010  [pdf, other

    cs.LG math.PR stat.ML

    Spectral alignment of stochastic gradient descent for high-dimensional classification tasks

    Authors: Gerard Ben Arous, Reza Gheissari, Jiaoyang Huang, Aukosh Jagannath

    Abstract: We rigorously study the relation between the training dynamics via stochastic gradient descent (SGD) and the spectra of empirical Hessian and gradient matrices. We prove that in two canonical classification tasks for multi-class high-dimensional mixtures and either 1 or 2-layer neural networks, both the SGD trajectory and emergent outlier eigenspaces of the Hessian and gradient matrices align with… ▽ More

    Submitted 15 May, 2025; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Final version. 53 pages, 12 figures

  3. arXiv:2206.04030  [pdf, other

    stat.ML cs.LG math.PR math.ST

    High-dimensional limit theorems for SGD: Effective dynamics and critical scaling

    Authors: Gerard Ben Arous, Reza Gheissari, Aukosh Jagannath

    Abstract: We study the scaling limits of stochastic gradient descent (SGD) with constant step-size in the high-dimensional regime. We prove limit theorems for the trajectories of summary statistics (i.e., finite-dimensional functions) of SGD as the dimension goes to infinity. Our approach allows one to choose the summary statistics that are tracked, the initialization, and the step-size. It yields both ball… ▽ More

    Submitted 17 August, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: 43 pages, 11 figures

  4. arXiv:2003.10409  [pdf, other

    stat.ML cs.LG math.PR math.ST

    Online stochastic gradient descent on non-convex losses from high-dimensional inference

    Authors: Gerard Ben Arous, Reza Gheissari, Aukosh Jagannath

    Abstract: Stochastic gradient descent (SGD) is a popular algorithm for optimization problems arising in high-dimensional inference tasks. Here one produces an estimator of an unknown parameter from independent samples of data by iteratively optimizing a loss function. This loss function is random and often non-convex. We study the performance of the simplest version of SGD, namely online SGD, from a random… ▽ More

    Submitted 10 May, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: final version to appear at Jour. Mach. Learn. Res$.$

    Journal ref: J. Mach. Learn. Res., Vol 22, No.106,1-51(2021)