Closed-Form Last Layer Optimization

Galashov, Alexandre; Da Costa, Nathaël; Xu, Liyuan; Hennig, Philipp; Gretton, Arthur

Computer Science > Machine Learning

arXiv:2510.04606 (cs)

[Submitted on 6 Oct 2025]

Title:Closed-Form Last Layer Optimization

Authors:Alexandre Galashov, Nathaël Da Costa, Liyuan Xu, Philipp Hennig, Arthur Gretton

View PDF

Abstract:Neural networks are typically optimized with variants of stochastic gradient descent. Under a squared loss, however, the optimal solution to the linear last layer weights is known in closed-form. We propose to leverage this during optimization, treating the last layer as a function of the backbone parameters, and optimizing solely for these parameters. We show this is equivalent to alternating between gradient descent steps on the backbone and closed-form updates on the last layer. We adapt the method for the setting of stochastic gradient descent, by trading off the loss on the current batch against the accumulated information from previous batches. Further, we prove that, in the Neural Tangent Kernel regime, convergence of this method to an optimal solution is guaranteed. Finally, we demonstrate the effectiveness of our approach compared with standard SGD on a squared loss in several supervised tasks -- both regression and classification -- including Fourier Neural Operators and Instrumental Variable Regression.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2510.04606 [cs.LG]
	(or arXiv:2510.04606v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2510.04606

Submission history

From: Alexandre Galashov [view email]
[v1] Mon, 6 Oct 2025 09:14:39 UTC (1,822 KB)

Computer Science > Machine Learning

Title:Closed-Form Last Layer Optimization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Closed-Form Last Layer Optimization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators