Mean-field Langevin System, Optimal Control and Deep Neural Networks

Hu, Kaitong; Kazeykina, Anna; Ren, Zhenjie

Mathematics > Probability

arXiv:1909.07278 (math)

[Submitted on 16 Sep 2019 (v1), last revised 3 Oct 2019 (this version, v2)]

Title:Mean-field Langevin System, Optimal Control and Deep Neural Networks

Authors:Kaitong Hu, Anna Kazeykina, Zhenjie Ren

View PDF

Abstract:In this paper, we study a regularised relaxed optimal control problem and, in particular, we are concerned with the case where the control variable is of large dimension. We introduce a system of mean-field Langevin equations, the invariant measure of which is shown to be the optimal control of the initial problem under mild conditions. Therefore, this system of processes can be viewed as a continuous-time numerical algorithm for computing the optimal control. As an application, this result endorses the solvability of the stochastic gradient descent algorithm for a wide class of deep neural networks.

Comments:	25 pages
Subjects:	Probability (math.PR); Machine Learning (cs.LG); Optimization and Control (math.OC)
MSC classes:	60H30, 37M25
Cite as:	arXiv:1909.07278 [math.PR]
	(or arXiv:1909.07278v2 [math.PR] for this version)
	https://doi.org/10.48550/arXiv.1909.07278

Submission history

From: Zhenjie Ren [view email]
[v1] Mon, 16 Sep 2019 15:31:25 UTC (38 KB)
[v2] Thu, 3 Oct 2019 19:41:20 UTC (58 KB)

Full-text links:

Access Paper:

view license

Current browse context:

math.PR

< prev | next >

new | recent | 2019-09

Change to browse by:

cs
cs.LG
math
math.OC

References & Citations

export BibTeX citation

Mathematics > Probability

Title:Mean-field Langevin System, Optimal Control and Deep Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Probability

Title:Mean-field Langevin System, Optimal Control and Deep Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators