Learning to Stop: Deep Learning for Mean Field Optimal Stopping

Magnino, Lorenzo; Zhu, Yuchen; Laurière, Mathieu

Mathematics > Optimization and Control

arXiv:2410.08850 (math)

[Submitted on 11 Oct 2024 (v1), last revised 9 Jun 2025 (this version, v2)]

Abstract: Optimal stopping is a fundamental problem in optimization with applications in risk management, finance, robotics, and machine learning. We extend the standard framework to a multi-agent setting, named multi-agent optimal stopping (MAOS), where agents cooperate to make optimal stopping decisions in a finite-space, discrete-time environment. Since solving MAOS becomes computationally prohibitive when the number of agents is very large, we study the mean-field optimal stopping (MFOS) problem, obtained as the number of agents tends to infinity. We establish that MFOS provides a good approximation to MAOS and prove a dynamic programming principle (DPP) based on mean-field control theory. We then propose two deep learning approaches: one that learns optimal stopping decisions by simulating full trajectories and another that leverages the DPP to compute the value function and to learn the optimal stopping rule using backward induction. Both methods train neural networks to approximate optimal stopping policies. We demonstrate the effectiveness and the scalability of our work through numerical experiments on 6 different problems in spatial dimension up to 300. To the best of our knowledge, this is the first work to formalize and computationally solve MFOS in discrete time and finite space, opening new directions for scalable MAOS methods.
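
For readers unfamiliar with backward induction, the sketch below illustrates it on the classical single-agent problem only: a finite-state, discrete-time Markov chain with a reward collected on stopping. It is not the paper's mean-field DPP, which is formulated over population distributions and solved with neural networks. The function name `backward_induction`, the transition matrix `P`, the reward vector `g`, and the reward-maximization convention are all assumptions made for this illustration.

```python
import numpy as np

def backward_induction(P, g, T):
    """Classical single-agent optimal stopping on a finite Markov chain.

    P: (n, n) row-stochastic transition matrix (hypothetical toy input).
    g: (n,) reward collected when stopping in each state (assumed convention).
    T: time horizon; stopping is forced at time T.
    Returns the value table V and a boolean stopping rule, both of shape (T + 1, n).
    """
    P = np.asarray(P, dtype=float)
    g = np.asarray(g, dtype=float)
    n = g.shape[0]
    V = np.empty((T + 1, n))
    stop = np.ones((T + 1, n), dtype=bool)
    V[T] = g  # terminal condition: the agent must stop at the horizon
    for t in range(T - 1, -1, -1):
        cont = P @ V[t + 1]         # expected value of continuing one more step
        stop[t] = g >= cont         # stop wherever stopping is at least as good
        V[t] = np.maximum(g, cont)  # dynamic programming recursion
    return V, stop

# Toy example: lazy random walk on three states, higher states pay more.
P = np.array([[0.5, 0.5, 0.0],
              [0.5, 0.0, 0.5],
              [0.0, 0.5, 0.5]])
g = np.array([0.0, 1.0, 3.0])
V, stop = backward_induction(P, g, T=2)
```

In this toy instance the rule at time 0 is to stop only in the highest-reward state and to continue elsewhere. In the mean-field setting described in the abstract, the state of the recursion would be a distribution over states rather than a single state, which is why a table like `V` is replaced there by a learned approximation.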

Comments:	Accepted to ICML 2025
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG)
Cite as:	arXiv:2410.08850 [math.OC]
	(or arXiv:2410.08850v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2410.08850

Submission history

From: Yuchen Zhu
[v1] Fri, 11 Oct 2024 14:27:17 UTC (14,144 KB)
[v2] Mon, 9 Jun 2025 16:11:54 UTC (14,508 KB)
