The Implicit Bias of Gradient Descent on Generalized Gated Linear Networks

Lippl, Samuel; Abbott, L. F.; Chung, SueYeon

arXiv:2202.02649 (stat)

[Submitted on 5 Feb 2022]

Abstract: Understanding the asymptotic behavior of gradient-descent training of deep neural networks is essential for revealing inductive biases and improving network performance. We derive the infinite-time training limit of a mathematically tractable class of deep nonlinear neural networks, gated linear networks (GLNs), and generalize these results to gated networks described by general homogeneous polynomials. We study the implications of our results, focusing first on two-layer GLNs. We then apply our theoretical predictions to GLNs trained on MNIST and show how architectural constraints and the implicit bias of gradient descent affect performance. Finally, we show that our theory captures a substantial portion of the inductive bias of ReLU networks. By making the inductive bias explicit, our framework is poised to inform the development of more efficient, biologically plausible, and robust learning algorithms.

Comments: 23 pages, 5 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
Cite as: arXiv:2202.02649 [stat.ML] (or arXiv:2202.02649v1 [stat.ML] for this version)
DOI: https://doi.org/10.48550/arXiv.2202.02649

Submission history

From: Samuel Lippl
[v1] Sat, 5 Feb 2022 22:37:39 UTC (434 KB)
