-
Second Order Fully Nonlinear Mean Field Games with Degenerate Diffusions
Authors:
Alain Bensoussan,
Ziyu Huang,
Shanjian Tang,
Sheung Chi Phillip Yam
Abstract:
In this article, we study the global-in-time well-posedness of second order mean field games (MFGs) with both nonlinear drift functions simultaneously depending on the state, distribution and control variables, and the diffusion term depending on both state and distribution. Besides, the diffusion term is allowed to be degenerate, unbounded and even nonlinear in the distribution, but it does not depend on the control. First, we establish the global well-posedness of the corresponding forward-backward stochastic differential equations (FBSDEs), which arise from the maximum principle under a so-called $β$-monotonicity commonly used in the optimal control theory. The $β$-monotonicity admits more interesting cases, as representative examples including but not limited to the displacement monotonicity, the small mean field effect condition or the Lasry-Lions monotonicity; and ensures the well-posedness result in diverse non-convex examples. In our settings, we pose assumptions directly on the drift and diffusion coefficients and the cost functionals, rather than indirectly on the Hamiltonian, to make the conditions more visible. Our probabilistic method tackles the nonlinear dynamics with a linear but infinite dimensional version, and together with our recently proposed cone property for the adjoint processes, following in an almost straightforward way the conventional approach to the classical stochastic control problem, we derive a sufficiently good regularity of the value functional, and finally show that it is the unique classical solution to the MFG master equation. Our results require fairly few conditions on the functional coefficients for solution of the MFG, and a bit more conditions -- which are least stringent in the contemporary literature -- for classical solution of the MFG master equation.
Submitted 21 March, 2025;
originally announced March 2025.
-
On Mean Field Monotonicity Conditions from Control Theoretical Perspective
Authors:
Alain Bensoussan,
Ziyu Huang,
Shanjian Tang,
Sheung Chi Phillip Yam
Abstract:
In this article, from the viewpoint of control theory, we discuss the relationships among the commonly used monotonicity conditions that ensure the well-posedness of the solutions arising from problems of mean field games (MFGs) and mean field type control (MFTC). We first introduce the well-posedness of general forward-backward stochastic differential equations (FBSDEs) defined on some suitably chosen Hilbert spaces under the $β$-monotonicity. We then propose a monotonicity condition for the MFG, namely partitioning the running cost functional into two parts, so that both parts still depend on the control and the state distribution, yet one satisfies a strong convexity and a small mean field effect condition, while the other has a newly introduced displacement quasi-monotonicity. To the best of our knowledge, the latter quasi type condition has not yet been discussed in the contemporary literature, and it can be considered as a bit more general monotonicity condition than those commonly used. Besides, for the MFG, we show that convexity and small mean field effect condition for the first part of running cost functional and the quasi-monotonicity condition for the second part together imply the $β$-monotonicity and thus the well-posedness for the associated FBSDEs. For the MFTC problem, we show that the $β$-monotonicity for the corresponding FBSDEs is simply the convexity assumption on the cost functional. Finally, we consider a more general setting where the drift functional is allowed to be non-linear for both MFG and MFTC problems.
Submitted 6 December, 2024;
originally announced December 2024.
-
A Class of Degenerate Mean Field Games, Associated FBSDEs and Master Equations
Authors:
Alain Bensoussan,
Ziyu Huang,
Shanjian Tang,
Sheung Chi Phillip Yam
Abstract:
In this paper, we study a class of degenerate mean field games (MFGs) with state-distribution dependent and unbounded functional diffusion coefficients. With a probabilistic method, we study the well-posedness of the forward-backward stochastic differential equations (FBSDEs) associated with the MFG and arising from the maximum principle, and estimate the corresponding Jacobian and Hessian flows. We further establish the classical regularity of the value functional $V$; in particular, we show that when the cost function is $C^3$ in the spatial and control variables and $C^2$ in the distribution argument, then the value functional is $C^1$ in time and $C^2$ in the spatial and distribution variables. As a consequence, the value functional $V$ is the unique classical solution of the degenerate MFG master equation.
Submitted 20 March, 2025; v1 submitted 16 October, 2024;
originally announced October 2024.
-
Optimal Savings and Value of Population in A Stochastic Environment: Transient Behavior
Authors:
Hao Liu,
Suresh P. Sethi,
Tak Kwong Wong,
Sheung Chi Phillip Yam
Abstract:
We extend the work on optimal investment and consumption of a population considered in [2] to a general stochastic setting over a finite time horizon. We incorporate the Cobb-Douglas production function in the capital dynamics while the consumption utility function and the drift rate in the population dynamics can be general, in contrast with [2, 30, 31]. The dynamic programming formulation yields an unconventional nonlinear Hamilton-Jacobi-Bellman (HJB) equation, in which the Cobb-Douglas production function as the coefficient of the gradient of the value function induces the mismatching of power rates between capital and population. Moreover, the equation has a very singular term, essentially a very negative power of the partial derivative of the value function with respect to the capital, coming from the optimization of control, and their resolution turns out to be a complex problem not amenable to classical analysis. To show that this singular term, which has not been studied in any physical systems yet, does not actually blow up, we establish new pointwise generalized power laws for the partial derivative of the value function. Our contribution lies in providing a theoretical treatment that combines both the probabilistic approach and theory of partial differential equations to derive the pointwise upper and lower bounds as well as energy estimates in weighted Sobolev spaces. By then, we accomplish showing the well-posedness of classical solutions to a non-canonical parabolic equation arising from a long-lasting problem in macroeconomics.
Submitted 14 August, 2024; v1 submitted 16 February, 2024;
originally announced February 2024.
-
A Control Theoretical Approach to Mean Field Games and Associated Master Equations
Authors:
Alain Bensoussan,
Ho Man Tai,
Tak Kwong Wong,
Sheung Chi Phillip Yam
Abstract:
We prove the global-in-time well-posedness for a broad class of mean field game problems, which is beyond the special linear-quadratic setting, as long as the mean field sensitivity is not too large. Through the stochastic maximum principle, we adopt the FBSDE approach to investigate the unique existence of the corresponding equilibrium strategies. The corresponding FBSDEs are first solved locally in time, then by controlling the sensitivity of the backward solutions with respect to the initial condition via some suitable apriori estimates for the corresponding Jacobian flows, the global-in-time solution is warranted. Further analysis on these Jacobian flows will be discussed to establish the regularities, such as linear functional differentiability, of the respective value functions that leads to the ultimate classical well-posedness of the master equation on $\mathbb{R}^d$. To the best of our knowledge, it is the first article to deal with the mean field game problem, as well as its associated master equation, with general cost functionals having quadratic growth under the small mean field effect. In this current approach, we directly impose the structural conditions on the cost functionals, rather than conditions on the Hamiltonian. The advantages of this are threefold: (i) compared with imposing conditions on Hamiltonian, the structural conditions imposed in this work are easily verified, and less demanding on the regularity requirements of the cost functionals while solving the master equation; (ii) the displacement monotonicity is basically just a direct consequence of small mean field effect in the structural conditions; and (iii) when the mean field effect is not that small, we can still provide an accurate lifespan for the local existence. The method in this work can be readily extended to the case with nonlinear drift and non-separable cost functionals.
Submitted 21 January, 2025; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Global Well-Posedness of First-Order Mean Field Games and Master Equations with Nonlinear Dynamics
Authors:
Alain Bensoussan,
Tak Kwong Wong,
Sheung Chi Phillip Yam,
Hongwei Yuan
Abstract:
This article presents the variant of the approach introduced in the recent work of Bensoussan, Wong, Yam and Yuan [13] to the generic first-order mean field game problem. A major contribution here is the provision of new crucial a priori estimates, whose establishment is fundamentally different from the mentioned work since the associated forward-backward ordinary differential equation (FBODE) system is notably different. In addition, we require monotonicity conditions intimately on the coefficient functions but not on the Hamiltonians to handle their non-separable nature and nonlinear dynamics; as tackling Hamiltonians directly, it potentially dissolves much useful information. Compared with the assumptions used in [13], we introduce an additional requirement that the first-order derivative of the drift function in the measure variable cannot be too large relative to the convexity of the running cost function; this requirement only arises when the Hamiltonian is non-separable, and this phenomenon can also be seen in the existing literature. On the other hand, we require less here for the second-order differentiability of the coefficient functions in comparison to that in [13]. Our approach involves first demonstrating the local existence of a solution over small time interval, followed by the provision of new crucial a priori estimates for the sensitivity of the backward equation with respect to the initial condition of forward dynamics; and finally, smoothly gluing the local solutions together to form a global solution. In addition, we establish the local and global existence and uniqueness of classical solutions for the mean field game and its master equation.
Submitted 14 November, 2023;
originally announced November 2023.
-
Mean Field Analysis of Two-Party Governance: Competition versus Cooperation among Leaders
Authors:
Dantong Chu,
Kenneth Tsz Hin Ng,
Sheung Chi Phillip Yam,
Harry Zheng
Abstract:
This article studies linear-quadratic Stackelberg games between two dominating players (or equivalently, leaders) and a large group of followers, each of whom interacts under a mean field game (MFG) framework. Unlike the conventional major-minor player game, the mean field term herein is endogenously affected by the two leaders simultaneously. These homogeneous followers are non-cooperative, whereas the two leaders can either compete or cooperate with each other, which are respectively formulated as a Nash and a Pareto game. The complete solutions of the leader-follower game can be expressed in terms of the solutions of some non-symmetric Riccati equations. Notably, our analysis suggests that both modes of interaction between leaders have their own merits and neither of them is always more favourable to the community of followers. To our knowledge, a comparative study of the effect of different modes of governance on the society is relatively rare in the existing literature. We here provide its first preliminary quantitative analysis; under a broad class of practically relevant models, we provide sufficient conditions to decide whether cooperation or competition between leaders is more favourable to the followers. In line with modern folklore, the relative merits of the two Stackelberg games depend on whether the interests between the two leaders and the followers align among themselves. Representative numerical examples are also supplemented.
Submitted 1 July, 2024; v1 submitted 17 November, 2023;
originally announced November 2023.
-
Degenerate Mean Field Type Control with Linear and Unbounded Diffusion, and their Associated Equations
Authors:
Alain Bensoussan,
Ziyu Huang,
Shanjian Tang,
Sheung Chi Phillip Yam
Abstract:
We study the well-posedness of a system of forward-backward stochastic differential equations (FBSDEs) corresponding to a degenerate mean field type control problem, when the diffusion coefficient depends on the state together with its measure and also the control. Degenerate mean field type control problems are rarely studied in the literature. Our method is based on a lifting approach which embeds the control problem and the associated FBSDEs in Wasserstein spaces into certain Hilbert spaces. We use a continuation method to establish the solvability of the FBSDEs and that of the Gâteaux derivatives of this FBSDEs. We then explore the regularity of the value function in time and in measure argument, and we also show that it is the unique classical solution of the associated Bellman equation. We also study the higher regularity of the linear functional derivative of the value function, by then, we obtain the classical solution of the mean field type master equation.
Submitted 15 November, 2023;
originally announced November 2023.
-
Linear Quadratic Extended Mean Field Games and Control Problems
Authors:
Alain Bensoussan,
Bohan Li,
Sheung Chi Phillip Yam
Abstract:
We provide a thorough study of a general class of linear-quadratic extended mean field games and control problems in any dimensions where the mean field terms are allowed to be unbounded and there are also presence of cross terms in the objective functionals. Our investigation focuses on the unique existence of equilibrium strategies for the extended mean field problems by employing the stochastic maximum principle approach and the appropriate fixed point argument. We provide two distinct proofs, accompanied by two sufficient conditions, that establish the unique existence of the equilibrium strategy over a global time horizon. Both conditions emphasize the importance of sufficiently small coefficients of sensitivity for the cross term, of state and control, and mean field term. To determine the required magnitude of these coefficients, we utilize the singular values of appropriate matrices and Weyl's inequalities. The present proposed theory is consistent with the classical one, namely, our theoretical framework encompasses classical linear-quadratic stochastic control problems as particular cases. Additionally, we establish sufficient conditions for the unique existence of solutions to a particular class of non-symmetric Riccati equations, and we illustrate a counterexample to the existence of equilibrium strategies. Furthermore, we also apply the stochastic maximum principle approach to examine linear-quadratic extended mean field type stochastic control problems. Finally, we conduct a comparative analysis between our method and the alternative master equation approach, specifically addressing the efficacy of the present proposed approach in solving common practical problems, for which the explicit forms of the equilibrium strategies can be obtained directly, even over any global time horizon.
Submitted 9 November, 2023;
originally announced November 2023.
-
Maximum Principle for Mean Field Type Control Problems with General Volatility Functions
Authors:
Alain Bensoussan,
Ziyu Huang,
Sheung Chi Phillip Yam
Abstract:
In this paper, we study the maximum principle of mean field type control problems when the volatility function depends on the state and its measure and also the control, by using our recently developed method. Our method is to embed the mean field type control problem into a Hilbert space to bypass the evolution in the Wasserstein space. We here give a necessary condition and a sufficient condition for these control problems in Hilbert spaces, and we also derive a system of forward-backward stochastic differential equations.
Submitted 13 September, 2023;
originally announced September 2023.
-
A Theory of First Order Mean Field Type Control Problems and their Equations
Authors:
Alain Bensoussan,
Tak Kwong Wong,
Sheung Chi Phillip Yam,
Hongwei Yuan
Abstract:
In this article, by using several new crucial {\it a priori} estimates which are still absent in the literature, we provide a comprehensive resolution of the first order generic mean field type control problems and also establish the global-in-time classical solutions of their Bellman and master equations. Rather than developing the analytical approach via tackling the Bellman and master equation directly, we apply the maximum principle approach by considering the induced forward-backward ordinary differential equation (FBODE) system; indeed, we first show the local-in-time unique existence of the solution of the FBODE system for a variety of terminal data by Banach fixed point argument, and then provide crucial a priori estimates of bounding the sensitivity of the terminal data for the backward equation by utilizing a monotonicity condition that can be deduced from the positive definiteness of the Schur complement of the Hessian matrix of the Lagrangian in the lifted version and manipulating first order condition appropriately; this uniform bound over the whole planning horizon $[0,T]$ allows us to partition $[0,T]$ into a number of sub-intervals with a common small length and then glue the consecutive local-in-time solutions together to form the unique global-in-time solution of the FBODE system. The regularity of the global-in-time solution follows from that of the local ones due to the regularity assumptions on the coefficient functions. Moreover, the regularity of the value function will also be shown with the aid of the regularity of the solution couple of the FBODE system and the regularity assumptions on the coefficient functions, with which we can further deduce that this value function and its linear functional derivative satisfy the Bellman and master equations, respectively.
Submitted 15 September, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Mean Field Type Control Problems, Some Hilbert-space-valued FBSDEs, and Related Equations
Authors:
Alain Bensoussan,
Ho Man Tai,
Sheung Chi Phillip Yam
Abstract:
In this article, we provide an original systematic global-in-time analysis of mean field type control problems on $\mathbb{R}^n$ with generic cost functionals by the modified approach but not the same, firstly proposed in [7], as the ``lifting'' idea introduced by P. L. Lions. As an alternative to the recent popular analytical method by tackling the master equation, we resolve the control problem in a certain proper Hilbert subspace of the whole space of $L^2$ random variables, it can be regarded as tangent space attached at the initial probability measure. The present work also fills the gap of the global-in-time solvability and extends the previous works of [7,11] which only dealt with quadratic cost functionals in control; the problem is linked to the global solvability of the Hilbert-space-valued forward-backward stochastic differential equation (FBSDE), which is solved by variational techniques here. We also rely on the Jacobian flow of the solution to this FBSDE to establish the regularities of the value function, including its linearly functional differentiability, which leads to the classical well-posedness of the Bellman equation. Together with the linear functional derivatives and the gradient of the linear functional derivatives of the solution to the FBSDE, we also obtain the classical well-posedness of the master equation.
Submitted 6 May, 2023;
originally announced May 2023.
-
Likelihood-based Spacings Goodness-of-Fit Statistics for Univariate Shape-constrained Densities
Authors:
Kwun Chuen Gary Chan,
Hok Kan Ling,
Chuan-Fa Tang,
Sheung Chi Phillip Yam
Abstract:
A variety of statistics based on sample spacings has been studied in the literature for testing goodness-of-fit to parametric distributions. To test the goodness-of-fit to a nonparametric class of univariate shape-constrained densities, including widely studied classes such as k-monotone and log-concave densities, a likelihood ratio test with a working alternative density estimate based on the spacings of the observations is considered, and is shown to be asymptotically normal and distribution-free under the null, consistent under fixed alternatives, and admits bootstrap calibration. The distribution-freeness under the null comes from the fact that the asymptotic dominant term depends only on a function of the spacings of transformed outcomes that are uniformly distributed. Applications and extensions of theoretical results in the literature of shape-constrained estimation are required to show that the average log-density ratio converges to zero at a faster rate than the sample spacing term under the null, and diverges under the alternatives. Numerical studies are conducted to demonstrate that the test is applicable to various classes of shape-constrained densities and has a good balance between type-I error control under the null and power under alternative distributions.
Submitted 25 October, 2024; v1 submitted 23 November, 2022;
originally announced November 2022.
-
Control in Hilbert Space and First Order Mean Field Type Problem
Authors:
Alain Bensoussan,
Henry Hang Cheung,
Sheung Chi Phillip Yam
Abstract:
We extend the work \cite{bensoussan2019control} by two of the coauthors, which dealt with a deterministic control problem for which the Hilbert space could be generic and investigated a novel form of the `lifting' technique proposed by P. L. Lions. In \cite{bensoussan2019control}, we only showed the local existence and uniqueness of solutions to the FBODEs in the Hilbert space which were associated to the control problems with drift function consisting of the control only. In this article, we establish the global existence and uniqueness of the solutions to the FBODEs in Hilbert space corresponding to control problems with separable drift function which is nonlinear in state and linear in control. We shall also prove the sufficiency of the Pontryagin Maximum Principle and derive the corresponding Bellman equation. Besides, we shall show an analogue in the stationary case. Finally, by using the `lifting' idea as in \cite{stochasticv2,stochasticv1}, we shall apply the result to solve the linear quadratic mean field type control problems, and to show the global existence of the corresponding Bellman equations.
Submitted 25 August, 2021;
originally announced August 2021.
-
Value-Gradient based Formulation of Optimal Control Problem and Machine Learning Algorithm
Authors:
Alain Bensoussan,
Jiayue Han,
Sheung Chi Phillip Yam,
Xiang Zhou
Abstract:
Optimal control problem is typically solved by first finding the value function through Hamilton-Jacobi equation (HJE) and then taking the minimizer of the Hamiltonian to obtain the control. In this work, instead of focusing on the value function, we propose a new formulation for the gradient of the value function (value-gradient) as a decoupled system of partial differential equations in the context of continuous-time deterministic discounted optimal control problem. We develop an efficient iterative scheme for this system of equations in parallel by utilizing the properties that they share the same characteristic curves as the HJE for the value function. For the theoretical part, we prove that this iterative scheme converges linearly in $L_α^2$ sense for some suitable exponent $α$ in a weight function. For the numerical method, we combine characteristic line method with machine learning techniques. Specifically, we generate multiple characteristic curves at each policy iteration from an ensemble of initial states, and compute both the value function and its gradient simultaneously on each curve as the labelled data. Then supervised machine learning is applied to minimize the weighted squared loss for both the value function and its gradients. Experimental results demonstrate that this new method not only significantly increases the accuracy but also improves the efficiency and robustness of the numerical estimates, particularly with less amount of characteristics data or fewer training steps.
Submitted 9 September, 2021; v1 submitted 16 March, 2021;
originally announced March 2021.
-
Machine Learning and Control Theory
Authors:
Alain Bensoussan,
Yiqun Li,
Dinh Phan Cao Nguyen,
Minh-Binh Tran,
Sheung Chi Phillip Yam,
Xiang Zhou
Abstract:
We survey in this article the connections between Machine Learning and Control Theory. Control Theory provides useful concepts and tools for Machine Learning. Conversely, Machine Learning can be used to solve large control problems. In the first part of the paper, we develop the connections between reinforcement learning and Markov Decision Processes, which are discrete time control problems. In the second part, we review the concept of supervised learning and the relation with static optimization. Deep learning, which extends supervised learning, can be viewed as a control problem. In the third part, we present the links between stochastic gradient descent and mean-field theory. Conversely, in the fourth and fifth parts, we review machine learning approaches to stochastic control problems, and focus on the deterministic case, to explain, more easily, the numerical algorithms.
Submitted 9 June, 2020;
originally announced June 2020.
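As a small illustration of the link between reinforcement learning and Markov Decision Processes mentioned in the abstract above, the following is a minimal value-iteration sketch. It is not taken from the survey: the function name `value_iteration`, the two-state transition tensor `P`, the reward table `R`, and the discount factor are all assumptions chosen for illustration.

```python
import numpy as np


def value_iteration(P, R, gamma=0.9, tol=1e-10, max_iter=10_000):
    """Value iteration for a finite discounted MDP (illustrative sketch).

    P has shape (A, S, S): P[a, s, t] is the probability of moving s -> t under action a.
    R has shape (S, A): R[s, a] is the immediate reward for taking action a in state s.
    Returns the approximate optimal value function and a greedy policy.
    """
    V = np.zeros(R.shape[0])
    for _ in range(max_iter):
        # Bellman optimality backup: Q[s, a] = R[s, a] + gamma * sum_t P[a, s, t] * V[t]
        Q = R + gamma * np.einsum("ast,t->sa", P, V)
        V_new = Q.max(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    # Greedy policy with respect to the final value estimate
    Q = R + gamma * np.einsum("ast,t->sa", P, V)
    return V, Q.argmax(axis=1)


# Hypothetical toy problem: action 0 stays put, action 1 switches state;
# only "staying in state 1" earns a reward of 1.
P = np.array([[[1.0, 0.0], [0.0, 1.0]],
              [[0.0, 1.0], [1.0, 0.0]]])
R = np.array([[0.0, 0.0],
              [1.0, 0.0]])

V, policy = value_iteration(P, R, gamma=0.9)
print(V, policy)
```

In this toy problem the greedy policy switches out of state 0 and then stays in state 1, so the values can be checked by hand from the geometric series of discounted rewards.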
-
Control on Hilbert Spaces and Application to Some Mean Field Type Control Problems
Authors:
Alain Bensoussan,
P. Jameson Graber,
Sheung Chi Phillip Yam
Abstract:
We propose a new approach to studying classical solutions of the Bellman equation and Master equation for mean field type control problems, using a novel form of the "lifting" idea introduced by P.-L. Lions. Rather than studying the usual system of Hamilton-Jacobi/Fokker-Planck PDEs using analytic techniques, we instead study a stochastic control problem on a specially constructed Hilbert space, which is reminiscent of a tangent space on the Wasserstein space in optimal transport. On this Hilbert space we can use classical control theory techniques, despite the fact that it is infinite dimensional. A consequence of our construction is that the mean field type control problem appears as a special case. Thus we preserve the advantages of the lifting procedure, while removing some of the difficulties. Our approach extends previous work by two of the coauthors, which dealt with a deterministic control problem for which the Hilbert space could be generic.
Submitted 9 May, 2023; v1 submitted 21 May, 2020;
originally announced May 2020.
-
Mean Field approach to stochastic control with partial information
Authors:
Alain Bensoussan,
Sheung Chi Phillip Yam
Abstract:
The classical stochastic control problem under partial information can be formulated as a control problem for the Zakai equation, whose solution is the unnormalized conditional probability distribution of the state of the system. The Zakai equation is a stochastic Fokker-Planck equation. Therefore, the problem to be solved is similar to that met in Mean Field Control theory. Since Mean Field Control theory is much more recent than the development of Stochastic Control with partial information, the tools, techniques, and concepts obtained in the last decade for Mean Field Games and Mean Field type Control theory have not been used for the control of the Zakai equation. Our objective is to connect the two theories. We gain the power of new tools, and we obtain new insights into the problem of stochastic control with partial information. For mean field theory, we get interesting new applications, but also new problems. Indeed, Mean Field Control Theory leads to very complex equations, like the Master equation, which is a nonlinear infinite dimensional P.D.E. for which general theorems are hardly available, although this is an active research direction. Direct methods are useful to obtain regularity results. We will develop in detail the LQ regulator problem, but since we cannot just consider the Gaussian case, well-known results such as the separation principle are not available. An important result is available in the literature, due to A. Makowsky. It describes the solution of the Zakai equation for linear systems with a general (non-Gaussian) initial condition. We show that the separation principle can be extended for quadratic pay-off functionals, but the Kalman filter is much more complex than in the Gaussian case. Finally, we compare our work to the work of E. Bandini et al. and we show that the example E. Bandini et al. provided does not cover ours. Our system remains nonlinear in their setting.
Submitted 26 September, 2019; v1 submitted 23 September, 2019;
originally announced September 2019.
-
Stochastic Control on Space of Random Variables
Authors:
Alain Bensoussan,
P. Jameson Graber,
S. C. P. Yam
Abstract:
By extending \cite{bensoussan2015control}, we implement the proposal of Lions \cite{lions14} on studying mean field games and their master equations via certain control problems on the Hilbert space of square integrable random variables. In \cite{bensoussan2015control}, the Hilbert space could be quite general in the face of the "deterministic control problem" due to the absence of additional randomness; while the special case of $L^2$ space of square integrable random variables was brought in at the interpretation stage. The effectiveness of the approach was demonstrated by deriving Bellman equations and the first order master equations through control theory of dynamical systems valued in the Hilbert space. In our present problem for second order master equations, it connects with a stochastic control problem over the space of random variables, and it possesses an additional randomness generated by the Wiener process which cannot be detached from the randomness caused by the elements in the Hilbert space. Nevertheless, we demonstrate how to tackle this difficulty, while preserving most of the efficiency of the approach suggested by Lions \cite{lions14}.
Submitted 29 March, 2019;
originally announced March 2019.
-
Higher Order, Polar and Sz.-Nagy's Generalized Derivatives of Random Polynomials with Independent and Identically Distributed Zeros on the Unit Circle
Authors:
Pak-Leong Cheung,
Tuen Wai Ng,
Jonathan Tsai,
S. C. P. Yam
Abstract:
For random polynomials with i.i.d. (independent and identically distributed) zeros following any common probability distribution $μ$ with support contained in the unit circle, the empirical measures of the zeros of their first and higher order derivatives will be proved to converge weakly to $μ$ a.s. (almost sure(ly)). This, in particular, completes a recent work of Subramanian on the first order derivative case where $μ$ was assumed to be non-uniform. The same a.s. weak convergence will also be shown for polar and Sz.-Nagy's generalized derivatives, under some mild conditions.
Submitted 25 September, 2014;
originally announced September 2014.
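The convergence statement in the abstract above can be explored numerically. The sketch below is not from the paper: it assumes an illustrative choice of $μ$ (uniform on the upper half of the unit circle), a hypothetical helper `critical_points`, and an arbitrary sample size and seed. To avoid forming ill-conditioned polynomial coefficients, the helper computes the zeros of the first derivative as eigenvalues of a diagonal-plus-rank-one matrix whose characteristic polynomial equals $p'/n$; this is a design choice of the sketch.

```python
import numpy as np


def critical_points(zeros):
    """Zeros of p'(z) for p(z) = prod_k (z - zeros[k]), computed as eigenvalues.

    With z_1..z_n the zeros, the (n-1)x(n-1) matrix
        M[i, j] = z_i * delta_ij + (z_n - z_i) / n
    has characteristic polynomial p'(w) / n, so its eigenvalues are the critical points.
    """
    z = np.asarray(zeros, dtype=complex)
    n = z.size
    head, last = z[:-1], z[-1]
    M = np.diag(head) + ((last - head) / n)[:, None] * np.ones((1, n - 1))
    return np.linalg.eigvals(M)


def demo(n=200, seed=0):
    """Sample n i.i.d. zeros from the assumed mu and summarise the critical points."""
    rng = np.random.default_rng(seed)
    theta = rng.uniform(0.0, np.pi, size=n)  # assumed mu: uniform on the upper half-circle
    w = critical_points(np.exp(1j * theta))
    # Summary statistics to compare against mu (moduli near 1, arguments in [0, pi])
    return np.abs(w).mean(), np.mean(w.imag > 0)


mean_modulus, frac_upper = demo()
print(mean_modulus, frac_upper)
```

Increasing `n` and comparing histograms of the arguments of the critical points against the density of the chosen $μ$ gives a rough visual check of the weak convergence; it illustrates the statement and proves nothing.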
-
Conformal invariance of the exploration path in 2-d critical bond percolation in the square lattice
Authors:
Jonathan Tsai,
S. C. P. Yam,
Wang Zhou
Abstract:
In this paper we present the proof of the convergence of the critical bond percolation exploration process on the square lattice to the trace of SLE$_{6}$. This is an important conjecture in mathematical physics and probability. The case of critical site percolation on the hexagonal lattice was established in the seminal work of Smirnov via proving Cardy's formula. Our proof uses a series of transformations and conditioning to construct a pair of paths: the $+\partial$CBP and the $-\partial$CBP. The convergence in the site percolation case on the hexagonal lattice allows us to obtain certain estimates on the scaling limit of the $+\partial$CBP and the $-\partial$CBP. By considering a path which is the concatenation of $+\partial$CBPs and $-\partial$CBPs in an alternating manner, we can prove the convergence in the case of bond percolation on the square lattice.
Submitted 10 January, 2013; v1 submitted 8 December, 2011;
originally announced December 2011.