On Feature Learning in Neural Networks with Global Convergence Guarantees

Chen, Zhengdao; Vanden-Eijnden, Eric; Bruna, Joan

Computer Science > Machine Learning

arXiv:2204.10782 (cs)

[Submitted on 22 Apr 2022]

Title:On Feature Learning in Neural Networks with Global Convergence Guarantees

Authors:Zhengdao Chen, Eric Vanden-Eijnden, Joan Bruna

View PDF

Abstract:We study the optimization of wide neural networks (NNs) via gradient flow (GF) in setups that allow feature learning while admitting non-asymptotic global convergence guarantees. First, for wide shallow NNs under the mean-field scaling and with a general class of activation functions, we prove that when the input dimension is no less than the size of the training set, the training loss converges to zero at a linear rate under GF. Building upon this analysis, we study a model of wide multi-layer NNs whose second-to-last layer is trained via GF, for which we also prove a linear-rate convergence of the training loss to zero, but regardless of the input dimension. We also show empirically that, unlike in the Neural Tangent Kernel (NTK) regime, our multi-layer model exhibits feature learning and can achieve better generalization performance than its NTK counterpart.

Comments:	Accepted by the 10th International Conference on Learning Representations (ICLR 2022)
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Probability (math.PR); Machine Learning (stat.ML)
Cite as:	arXiv:2204.10782 [cs.LG]
	(or arXiv:2204.10782v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.10782

Submission history

From: Zhengdao Chen [view email]
[v1] Fri, 22 Apr 2022 15:56:43 UTC (1,964 KB)

Computer Science > Machine Learning

Title:On Feature Learning in Neural Networks with Global Convergence Guarantees

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On Feature Learning in Neural Networks with Global Convergence Guarantees

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators