Spectral Estimators for Multi-Index Models: Precise Asymptotics and Optimal Weak Recovery

Kovačević, Filip; Zhang, Yihan; Mondelli, Marco

Statistics > Machine Learning

arXiv:2502.01583 (stat)

[Submitted on 3 Feb 2025 (v1), last revised 10 Jun 2025 (this version, v2)]

Title:Spectral Estimators for Multi-Index Models: Precise Asymptotics and Optimal Weak Recovery

Authors:Filip Kovačević, Yihan Zhang, Marco Mondelli

View PDF HTML (experimental)

Abstract:Multi-index models provide a popular framework to investigate the learnability of functions with low-dimensional structure and, also due to their connections with neural networks, they have been object of recent intensive study. In this paper, we focus on recovering the subspace spanned by the signals via spectral estimators -- a family of methods routinely used in practice, often as a warm-start for iterative algorithms. Our main technical contribution is a precise asymptotic characterization of the performance of spectral methods, when sample size and input dimension grow proportionally and the dimension $p$ of the space to recover is fixed. Specifically, we locate the top-$p$ eigenvalues of the spectral matrix and establish the overlaps between the corresponding eigenvectors (which give the spectral estimators) and a basis of the signal subspace. Our analysis unveils a phase transition phenomenon in which, as the sample complexity grows, eigenvalues escape from the bulk of the spectrum and, when that happens, eigenvectors recover directions of the desired subspace. The precise characterization we put forward enables the optimization of the data preprocessing, thus allowing to identify the spectral estimator that requires the minimal sample size for weak recovery.

Comments:	Accepted to COLT 2025
Subjects:	Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG); Probability (math.PR); Statistics Theory (math.ST)
Cite as:	arXiv:2502.01583 [stat.ML]
	(or arXiv:2502.01583v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2502.01583

Submission history

From: Yihan Zhang [view email]
[v1] Mon, 3 Feb 2025 18:08:30 UTC (447 KB)
[v2] Tue, 10 Jun 2025 17:54:02 UTC (419 KB)

Statistics > Machine Learning

Title:Spectral Estimators for Multi-Index Models: Precise Asymptotics and Optimal Weak Recovery

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Spectral Estimators for Multi-Index Models: Precise Asymptotics and Optimal Weak Recovery

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators