Spectral Bias Outside the Training Set for Deep Networks in the Kernel Regime

Bowman, Benjamin; Montufar, Guido

Statistics > Machine Learning

arXiv:2206.02927 (stat)

[Submitted on 6 Jun 2022 (v1), last revised 14 Oct 2022 (this version, v2)]

Title:Spectral Bias Outside the Training Set for Deep Networks in the Kernel Regime

Authors:Benjamin Bowman, Guido Montufar

View PDF

Abstract:We provide quantitative bounds measuring the $L^2$ difference in function space between the trajectory of a finite-width network trained on finitely many samples from the idealized kernel dynamics of infinite width and infinite data. An implication of the bounds is that the network is biased to learn the top eigenfunctions of the Neural Tangent Kernel not just on the training set but over the entire input space. This bias depends on the model architecture and input distribution alone and thus does not depend on the target function which does not need to be in the RKHS of the kernel. The result is valid for deep architectures with fully connected, convolutional, and residual layers. Furthermore the width does not need to grow polynomially with the number of samples in order to obtain high probability bounds up to a stopping time. The proof exploits the low-effective-rank property of the Fisher Information Matrix at initialization, which implies a low effective dimension of the model (far smaller than the number of parameters). We conclude that local capacity control from the low effective rank of the Fisher Information Matrix is still underexplored theoretically.

Comments:	38 pages, 1 figure, to be published in NeurIPS 2022
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2206.02927 [stat.ML]
	(or arXiv:2206.02927v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2206.02927

Submission history

From: Benjamin Bowman [view email]
[v1] Mon, 6 Jun 2022 22:09:15 UTC (136 KB)
[v2] Fri, 14 Oct 2022 23:00:42 UTC (140 KB)

Statistics > Machine Learning

Title:Spectral Bias Outside the Training Set for Deep Networks in the Kernel Regime

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Spectral Bias Outside the Training Set for Deep Networks in the Kernel Regime

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators