Deep learning as optimal control problems: models and numerical methods

Benning, Martin; Celledoni, Elena; Ehrhardt, Matthias J.; Owren, Brynjulf; Schönlieb, Carola-Bibiane

Mathematics > Optimization and Control

arXiv:1904.05657 (math)

[Submitted on 11 Apr 2019 (v1), last revised 30 Sep 2019 (this version, v3)]

Title:Deep learning as optimal control problems: models and numerical methods

Authors:Martin Benning, Elena Celledoni, Matthias J. Ehrhardt, Brynjulf Owren, Carola-Bibiane Schönlieb

View PDF

Abstract:We consider recent work of Haber and Ruthotto 2017 and Chang et al. 2018, where deep learning neural networks have been interpreted as discretisations of an optimal control problem subject to an ordinary differential equation constraint. We review the first order conditions for optimality, and the conditions ensuring optimality after discretisation. This leads to a class of algorithms for solving the discrete optimal control problem which guarantee that the corresponding discrete necessary conditions for optimality are fulfilled. The differential equation setting lends itself to learning additional parameters such as the time discretisation. We explore this extension alongside natural constraints (e.g. time steps lie in a simplex). We compare these deep learning algorithms numerically in terms of induced flow and generalisation ability.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Numerical Analysis (math.NA)
Cite as:	arXiv:1904.05657 [math.OC]
	(or arXiv:1904.05657v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1904.05657

Submission history

From: Brynjulf Owren [view email]
[v1] Thu, 11 Apr 2019 12:15:00 UTC (4,841 KB)
[v2] Sun, 22 Sep 2019 17:28:02 UTC (7,223 KB)
[v3] Mon, 30 Sep 2019 21:33:06 UTC (7,226 KB)

Mathematics > Optimization and Control

Title:Deep learning as optimal control problems: models and numerical methods

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Deep learning as optimal control problems: models and numerical methods

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators