-
The Markov approximation of the periodic multivariate Poisson autoregression
Authors:
Mahmoud Khabou,
Edward A. K. Cohen,
Almut E. D. Veraart
Abstract:
This paper introduces a periodic multivariate Poisson autoregression with potentially infinite memory, with a special focus on the network setting. Using contraction techniques, we study the stability of such a process and provide upper bounds on how fast it reaches the periodically stationary regime. We then propose a computationally efficient Markov approximation using the properties of the exponential function and a density result. Furthermore, we prove the strong consistency of the maximum likelihood estimator for the Markov approximation and empirically test its robustness in the case of misspecification. Our model is applied to the prediction of weekly Rotavirus cases in Berlin, demonstrating superior performance compared to the existing PNAR model.
Submitted 3 April, 2025;
originally announced April 2025.
-
Uniform-in-Time Convergence Rates to a Nonlinear Markov Chain for Mean-Field Interacting Jump Processes
Authors:
Asaf Cohen,
Ethan Huffman
Abstract:
We consider a system of $N$ particles interacting through their empirical distribution on a finite state space in continuous time. In the formal limit as $N\to\infty$, the system takes the form of a nonlinear (McKean--Vlasov) Markov chain. This paper rigorously establishes this limit. Specifically, under the assumption that the mean field system has a unique, exponentially stable stationary distribution, we show that the weak error between the empirical measures of the $N$-particle system and the law of the mean field system is of order $1/N$ uniformly in time. Our analysis makes use of a master equation for test functions evaluated along the measure flow of the mean field system, and we demonstrate that the solutions of this master equation are sufficiently regular. We then show that exponential stability of the mean field system is implied by exponential stability for solutions of the linearized Kolmogorov equation with a source term. Finally, we show that our results can be applied to the study of mean field games and give a new condition for the existence of a unique stationary distribution for a nonlinear Markov chain.
Submitted 27 February, 2025;
originally announced February 2025.
-
Polynomial Expressions for the Dimensions of the Representations of Symmetric Groups and Restricted Standard Young Tableaux
Authors:
Avichai Cohen,
Shaul Zemel
Abstract:
Given a partition $λ$ of a number $k$, it is known that by adding a long line of length $n-k$, the dimension of the associated representation of $S_{n}$ is an integer-valued polynomial of degree $k$ in $n$. We show that its expansion in the binomial basis is bounded by the length of $λ$, and that the resulting coefficient of index $h$, with alternating signs, counts the standard Young tableaux of shape $λ$ in which a given collection of consecutive $h$ numbers lie in increasing rows. We also construct bijections in order to demonstrate explicitly that this number is indeed independent of the set of consecutive $h$ numbers used.
Submitted 22 October, 2024;
originally announced October 2024.
-
Lower bounds for incidences
Authors:
Alex Cohen,
Cosmin Pohoata,
Dmitrii Zakharov
Abstract:
Let $p_1,\ldots,p_n$ be a set of points in the unit square and let $T_1,\ldots,T_n$ be a set of $δ$-tubes such that $T_j$ passes through $p_j$. We prove a lower bound for the number of incidences between the points and tubes under a natural regularity condition (similar to Frostman regularity). As a consequence, we show that in any configuration of points $p_1,\ldots, p_n \in [0,1]^2$ along with a line $\ell_j$ through each point $p_j$, there exist $j\neq k$ for which $d(p_j, \ell_k) \lesssim n^{-2/3+o(1)}$.
It follows from the latter result that any set of $n$ points in the unit square contains three points forming a triangle of area at most $n^{-7/6+o(1)}$. This new upper bound for Heilbronn's triangle problem attains the high-low limit established in our previous work arXiv:2305.18253.
Submitted 17 March, 2025; v1 submitted 11 September, 2024;
originally announced September 2024.
-
Clustering in typical unit-distance avoiding sets
Authors:
Alex Cohen,
Nitya Mani
Abstract:
In the 1960s Moser asked how dense a subset of $\mathbb{R}^d$ can be if no pairs of points in the subset are exactly distance 1 apart. There has been a long line of work showing upper bounds on this density. One curious feature of dense unit distance avoiding sets is that they appear to be ``clumpy,'' i.e. forbidding unit distances comes hand in hand with having more than the expected number of distance $\approx 2$ pairs.
In this work we rigorously establish this phenomenon in $\mathbb{R}^2$. We show that dense unit distance avoiding sets have over-represented distance $\approx 2$ pairs, and that this clustering extends to typical unit distance avoiding sets. To do so, we build off of the linear programming approach used previously to prove upper bounds on the density of unit distance avoiding sets.
Submitted 6 July, 2024;
originally announced July 2024.
-
Convergence of the Deep Galerkin Method for Mean Field Control Problems
Authors:
William Hofgard,
Jingruo Sun,
Asaf Cohen
Abstract:
We establish the convergence of the deep Galerkin method (DGM), a deep learning-based scheme for solving high-dimensional nonlinear PDEs, for Hamilton-Jacobi-Bellman (HJB) equations that arise from the study of mean field control problems (MFCPs). Based on a recent characterization of the value function of the MFCP as the unique viscosity solution of an HJB equation on the simplex, we establish both an existence and convergence result for the DGM. First, we show that the loss functional of the DGM can be made arbitrarily small given that the value function of the MFCP possesses sufficient regularity. Then, we show that if the loss functional of the DGM converges to zero, the corresponding neural network approximators must converge uniformly to the true value function on the simplex. We also provide numerical experiments demonstrating the DGM's ability to generalize to high-dimensional HJB equations.
Submitted 22 May, 2024;
originally announced May 2024.
-
Constructions of bounded solutions of $div\, {\mathbf u}=f$ in critical spaces
Authors:
Albert Cohen,
Ronald DeVore,
Eitan Tadmor
Abstract:
We construct uniformly bounded solutions of the equation $div\, {\mathbf u}=f$ for arbitrary data $f$ in the critical spaces $L^d(Ω)$, where $Ω$ is a domain of ${\mathbb R}^d$. This question was addressed by Bourgain & Brezis, [On the equation ${\rm div}\, Y=f$ and application to control of phases, JAMS 16(2) (2003) 393-426], who proved that although the problem has a uniformly bounded solution, it is critical in the sense that there exists no linear solution operator for general $L^d$-data. We first discuss the validity of this existence result under weaker conditions than $f\in L^d(Ω)$, and then focus our work on constructive processes for such uniformly bounded solutions. In the $d=2$ case, we present a direct one-step explicit construction, which generalizes for $d>2$ to a $(d-1)$-step construction based on induction. An explicit construction is proposed for compactly supported data in $L^{2,\infty}(Ω)$ in the $d=2$ case. We also present constructive approaches based on optimization of a certain loss functional adapted to the problem. This approach provides a two-step construction in the $d=2$ case. This optimization is used as the building block of a hierarchical multistep process introduced in [E. Tadmor, Hierarchical construction of bounded solutions in critical regularity spaces, CPAM 69(6) (2016) 1087-1109] that converges to a solution in more general situations.
Submitted 21 May, 2024;
originally announced May 2024.
-
Asymptotic Nash Equilibria of Finite-State Ergodic Markovian Mean Field Games
Authors:
Asaf Cohen,
Ethan Zell
Abstract:
Mean field games (MFGs) model equilibria in games with a continuum of weakly interacting players as limiting systems of symmetric $n$-player games. We consider the finite-state, infinite-horizon problem with ergodic cost. Assuming Markovian strategies, we first prove that any solution to the MFG system gives rise to a $(C/\sqrt{n})$-Nash equilibrium in the $n$-player game. We follow this result by proving the same is true for the strategy profile derived from the master equation. We conclude the main theoretical portion of the paper by establishing a large deviation principle for empirical measures associated with the asymptotic Nash equilibria. Then, we contrast the asymptotic Nash equilibria using an example. We solve the MFG system directly and numerically solve the ergodic master equation by adapting the deep Galerkin method of Sirignano and Spiliopoulos. We use these results to derive the strategies of the asymptotic Nash equilibria and compare them. Finally, we derive an explicit form for the rate functions in dimension two.
Submitted 21 March, 2025; v1 submitted 17 April, 2024;
originally announced April 2024.
-
Existence of Optimal Stationary Singular Controls and Mean Field Game Equilibria
Authors:
Asaf Cohen,
Chuhao Sun
Abstract:
In this paper, we examine the stationary relaxed singular control problem within a multi-dimensional framework for a single agent, as well as its mean field game equivalent. We demonstrate that optimal relaxed controls exist for two problem classes: one driven by queueing control and the other by harvesting models. These relaxed controls are defined by random measures across the state and control spaces, with the state process described as a solution to the associated martingale problem. By leveraging findings from [Kurtz-Stockbridge 2001], we establish the equivalence between the martingale problem and the stationary forward equation. This allows us to reformulate the relaxed control problem into a linear programming problem within the measure space. We prove the sequential compactness of these measures, thereby confirming the feasibility of achieving an optimal solution. Subsequently, our focus shifts to mean field games. Drawing on insights from the single-agent problem and employing the Kakutani--Glicksberg--Fan fixed point theorem, we derive the existence of mean field game equilibria.
Submitted 2 June, 2025; v1 submitted 11 April, 2024;
originally announced April 2024.
-
Deep Backward and Galerkin Methods for the Finite State Master Equation
Authors:
Asaf Cohen,
Mathieu Laurière,
Ethan Zell
Abstract:
This paper proposes and analyzes two neural network methods to solve the master equation for finite-state mean field games (MFGs). Solving MFGs provides approximate Nash equilibria for stochastic, differential games with finite but large populations of agents. The master equation is a partial differential equation (PDE) whose solution characterizes MFG equilibria for any possible initial distribution. The first method we propose relies on backward induction in a time component while the second method directly tackles the PDE without discretizing time. For both approaches, we prove two types of results: there exist neural networks that make the algorithms' loss functions arbitrarily small, and conversely, if the losses are small, then the neural networks are good approximations of the master equation's solution. We conclude the paper with numerical experiments on benchmark problems from the literature up to dimension 15, and a comparison with solutions computed by a classical method for fixed initial distributions.
Submitted 23 December, 2024; v1 submitted 7 March, 2024;
originally announced March 2024.
-
Advancing Continuous Distribution Generation: An Exponentiated Odds Ratio Generator Approach
Authors:
Xinyu Chen,
Yuanqi Xie,
Achraf Cohen,
Shusen Pu
Abstract:
This paper presents a new methodology for generating continuous statistical distributions, integrating the exponentiated odds ratio within the framework of survival analysis. This new method enhances the flexibility and adaptability of distribution models to effectively address the complexities inherent in contemporary datasets. The core of this advancement is illustrated by introducing a particular subfamily, the "Type-2 Gumbel Weibull-G Family of Distributions." We provide a comprehensive analysis of the mathematical properties of these distributions, encompassing statistical properties such as density functions, moments, hazard rate and quantile functions, Rényi entropy, order statistics, and the concept of stochastic ordering. To establish the robustness of our approach, we apply five distinct methods for parameter estimation. The practical applicability of the Type-2 Gumbel Weibull-G distributions is further supported through the analysis of three real-world datasets. These empirical applications illustrate the exceptional statistical precision of our distributions compared to existing models, thereby reinforcing their significant value in both theoretical and practical statistical applications.
Submitted 27 February, 2024;
originally announced February 2024.
-
High order recovery of geometric interfaces from cell-average data
Authors:
Albert Cohen,
Olga Mula,
Agustín Somacal
Abstract:
We consider the problem of recovering characteristic functions $u:=χ_Ω$ from cell-average data on a coarse grid, and where $Ω$ is a compact set of $\mathbb{R}^d$. This task arises in very different contexts such as image processing, inverse problems, and the accurate treatment of interfaces in finite volume schemes. While linear recovery methods are known to perform poorly, nonlinear strategies based on local reconstructions of the jump interface $Γ:=\partialΩ$ by geometrically simpler interfaces may offer significant improvements. We study two main families of local reconstruction schemes, the first one based on nonlinear least-squares fitting, the second one based on the explicit computation of a polynomial-shaped curve fitting the data, which yields simpler numerical computations and high order geometric fitting. For each of them, we derive a general theoretical framework which allows us to control the recovery error by the error of best approximation up to a fixed multiplicative constant. Numerical tests in 2d illustrate the expected approximation order of these strategies. Several extensions are discussed, in particular the treatment of piecewise smooth interfaces with corners.
Submitted 21 October, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Locally Optimal Descent for Dynamic Stepsize Scheduling
Authors:
Gilad Yehudai,
Alon Cohen,
Amit Daniely,
Yoel Drori,
Tomer Koren,
Mariano Schain
Abstract:
We introduce a novel dynamic learning-rate scheduling scheme grounded in theory with the goal of simplifying the manual and time-consuming tuning of schedules in practice. Our approach is based on estimating the locally-optimal stepsize, guaranteeing maximal descent in the direction of the stochastic gradient of the current step. We first establish theoretical convergence bounds for our method within the context of smooth non-convex stochastic optimization, matching state-of-the-art bounds while only assuming knowledge of the smoothness parameter. We then present a practical implementation of our algorithm and conduct systematic experiments across diverse datasets and optimization algorithms, comparing our scheme with existing state-of-the-art learning-rate schedulers. Our findings indicate that our method needs minimal tuning when compared to existing approaches, removing the need for auxiliary manual schedules and warm-up phases and achieving comparable performance with drastically reduced parameter tuning.
Submitted 23 November, 2023;
originally announced November 2023.
-
Quantum-inspired nonlinear Galerkin ansatz for high-dimensional HJB equations
Authors:
Chuhao Sun,
Asaf Cohen,
James Stokes,
Shravan Veerapaneni
Abstract:
Neural networks are increasingly recognized as a powerful numerical solution technique for partial differential equations (PDEs) arising in diverse scientific computing domains, including quantum many-body physics. In the context of time-dependent PDEs, the dominant paradigm involves casting the approximate solution in terms of stochastic minimization of an objective function given by the norm of the PDE residual, viewed as a function of the neural network parameters. Recently, advancements have been made in the direction of an alternative approach which shares aspects of nonlinearly parametrized Galerkin methods and variational quantum Monte Carlo, especially for high-dimensional, time-dependent PDEs that extend beyond the usual scope of quantum physics. This paper is inspired by the potential of solving Hamilton-Jacobi-Bellman (HJB) PDEs using Neural Galerkin methods and commences the exploration of nonlinearly parametrized trial functions for which the evolution equations are analytically tractable. As a precursor to the Neural Galerkin scheme, we present trial functions with evolution equations that admit closed-form solutions, focusing on time-dependent HJB equations relevant to finance.
Submitted 20 November, 2023;
originally announced November 2023.
-
A new upper bound for the Heilbronn triangle problem
Authors:
Alex Cohen,
Cosmin Pohoata,
Dmitrii Zakharov
Abstract:
For sufficiently large $n$, we show that in every configuration of $n$ points chosen inside the unit square there exists a triangle of area less than $n^{-8/7-1/2000}$. This improves upon a result of Komlós, Pintz and Szemerédi from 1982. Our approach establishes new connections between the Heilbronn triangle problem and various themes in incidence geometry and projection theory which are closely related to the discretized sum-product phenomenon.
Submitted 29 May, 2023;
originally announced May 2023.
-
Fractal uncertainty in higher dimensions
Authors:
Alex Cohen
Abstract:
We prove that if a fractal set in $\mathbb{R}^d$ avoids lines in a certain quantitative sense, which we call line porosity, then it has a fractal uncertainty principle. The main ingredient is a new higher dimensional Beurling-Malliavin multiplier theorem.
Submitted 4 October, 2024; v1 submitted 8 May, 2023;
originally announced May 2023.
-
Reduced order modeling for elliptic problems with high contrast diffusion coefficients
Authors:
Albert Cohen,
Matthieu Dolbeault,
Agustin Somacal,
Wolfgang Dahmen
Abstract:
We consider the parametric elliptic PDE $-{\rm div} (a(y)\nabla u)=f$ on a spatial domain $Ω$, with $a(y)$ a scalar piecewise constant diffusion coefficient taking any positive values $y=(y_1, \dots, y_d)\in ]0,\infty[^d$ on fixed subdomains $Ω_1,\dots,Ω_d$. This problem is not uniformly elliptic as the contrast $κ(y)=\frac{\max y_j}{\min y_j}$ can be arbitrarily high, contrary to the Uniform Ellipticity Assumption (UEA) that is commonly made on parametric elliptic PDEs. Based on local polynomial approximations in the $y$ variable, we construct local and global reduced model spaces $V_n$ of moderate dimension $n$ that approximate uniformly well all solutions $u(y)$. Since the solution $u(y)$ blows up as $y\to 0$, the solution manifold is not a compact set and does not have finite $n$-width. Therefore, our results for approximation by such spaces are formulated in terms of relative $H^1_0$-projection error, that is, after normalization by $\|u(y)\|_{H^1_0}$. We prove that this relative error decays exponentially with $n$, yet exhibiting the curse of dimensionality as the number $d$ of subdomains grows. We also show similar rates for the Galerkin projection despite the fact that high contrast is well-known to deteriorate the multiplicative constant when applying Cea's lemma. We finally establish uniform estimates in relative error for the state estimation and parameter estimation inverse problems, when $y$ is unknown and a limited number of linear measurements $\ell_i(u)$ are observed. A key ingredient in our construction and analysis is the study of the convergence of $u(y)$ to limit solutions when some of the parameters $y_j$ tend to infinity.
Submitted 21 April, 2023;
originally announced April 2023.
-
Every real-rooted exponential polynomial is the restriction of a Lee-Yang polynomial
Authors:
Lior Alon,
Alex Cohen,
Cynthia Vinzant
Abstract:
A Lee-Yang polynomial $ p(z_{1},\ldots,z_{n}) $ is a polynomial that has no zeros in the polydisc $ \mathbb{D}^{n} $ and its inverse $ (\mathbb{C}\setminus\overline{\mathbb{D}})^{n} $. We show that any real-rooted exponential polynomial of the form $f(x) = \sum_{j=0}^s c_j e^{λ_j x}$ can be written as the restriction of a Lee-Yang polynomial to a positive line in the torus. Together with previous work by Olevskii and Ulanovskii, this implies that the Kurasov-Sarnak construction of $ \mathbb{N} $-valued Fourier quasicrystals from stable polynomials comprises every possible $ \mathbb{N} $-valued Fourier quasicrystal.
Submitted 7 October, 2024; v1 submitted 6 March, 2023;
originally announced March 2023.
-
Solving PDEs with Incomplete Information
Authors:
Peter Binev,
Andrea Bonito,
Albert Cohen,
Wolfgang Dahmen,
Ronald DeVore,
Guergana Petrova
Abstract:
We consider the problem of numerically approximating the solutions to a partial differential equation (PDE) when there is insufficient information to determine a unique solution. Our main example is the Poisson boundary value problem, when the boundary data is unknown and instead one observes finitely many linear measurements of the solution. We view this setting as an optimal recovery problem and develop theory and numerical algorithms for its solution. The main vehicle employed is the derivation and approximation of the Riesz representers of these functionals with respect to relevant Hilbert spaces of harmonic functions.
Submitted 20 December, 2023; v1 submitted 13 January, 2023;
originally announced January 2023.
-
Learning-based Optimal Admission Control in a Single Server Queuing System
Authors:
Asaf Cohen,
Vijay G. Subramanian,
Yili Zhang
Abstract:
We consider a long-term average profit maximizing admission control problem in an M/M/1 queuing system with unknown service and arrival rates. With a fixed reward collected upon service completion and a cost per unit of time enforced on customers waiting in the queue, a dispatcher decides upon arrivals whether to admit the arriving customer or not based on the full history of observations of the queue-length of the system. Naor (1969, Econometrica) showed that if all the parameters of the model are known, then it is optimal to use a static threshold policy -- admit if the queue-length is less than a predetermined threshold and otherwise not. We propose a learning-based dispatching algorithm and characterize its regret with respect to optimal dispatch policies for the full information model of Naor (1969). We show that the algorithm achieves an $O(1)$ regret when all optimal thresholds with full information are non-zero, and achieves an $O(\ln^{1+ε}(N))$ regret for any specified $ε>0$, in the case that an optimal threshold with full information is $0$ (i.e., an optimal policy is to reject all arrivals), where $N$ is the number of arrivals.
Submitted 23 November, 2023; v1 submitted 21 December, 2022;
originally announced December 2022.
-
A neural network approach to high-dimensional optimal switching problems with jumps in energy markets
Authors:
Erhan Bayraktar,
Asaf Cohen,
April Nellis
Abstract:
We develop a backward-in-time machine learning algorithm that uses a sequence of neural networks to solve optimal switching problems in energy production, where electricity and fossil fuel prices are subject to stochastic jumps. We then apply this algorithm to a variety of energy scheduling problems, including novel high-dimensional energy production problems. Our experimental results demonstrate that the algorithm performs with accuracy and experiences linear to sub-linear slowdowns as dimension increases, demonstrating the value of the algorithm for solving high-dimensional switching problems.
Submitted 16 September, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Nonlinear approximation spaces for inverse problems
Authors:
Albert Cohen,
Matthieu Dolbeault,
Olga Mula,
Agustin Somacal
Abstract:
This paper is concerned with the ubiquitous inverse problem of recovering an unknown function u from finitely many measurements possibly affected by noise. In recent years, inversion methods based on linear approximation spaces were introduced in [MPPY15, BCDDPW17] with certified recovery bounds. It is however known that linear spaces become ineffective for approximating simple and relevant families of functions, such as piecewise smooth functions that typically occur in hyperbolic PDEs (shocks) or images (edges). For such families, nonlinear spaces [Devore98] are known to significantly improve the approximation performance. The first contribution of this paper is to provide certified recovery bounds for inversion procedures based on nonlinear approximation spaces. The second contribution is the application of this framework to the recovery of general bidimensional shapes from cell-average data. We also discuss how the application of our results to n-term approximation relates to classical results in compressed sensing.
Submitted 5 October, 2022; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Quantum-inspired variational algorithms for partial differential equations: Application to financial derivative pricing
Authors:
Tianchen Zhao,
Chuhao Sun,
Asaf Cohen,
James Stokes,
Shravan Veerapaneni
Abstract:
Variational quantum Monte Carlo (VMC) combined with neural-network quantum states offers a novel angle of attack on the curse-of-dimensionality encountered in a particular class of partial differential equations (PDEs); namely, the real- and imaginary time-dependent Schrödinger equation. In this paper, we present a simple generalization of VMC applicable to arbitrary time-dependent PDEs, showcasing the technique in the multi-asset Black-Scholes PDE for pricing European options contingent on many correlated underlying assets.
Submitted 21 July, 2022;
originally announced July 2022.
-
Fractal uncertainty for discrete 2D Cantor sets
Authors:
Alex Cohen
Abstract:
We prove that a self-similar Cantor set in $\mathbb{Z}_N \times \mathbb{Z}_N$ has a fractal uncertainty principle if and only if it does not contain a pair of orthogonal lines. The key ingredient in our proof is a quantitative form of Lang's conjecture in number theory due to Ruppert and Beukers & Smyth. Our theorem answers a question of Dyatlov and has applications to open quantum maps.
Submitted 4 October, 2024; v1 submitted 28 June, 2022;
originally announced June 2022.
-
Rate-Optimal Online Convex Optimization in Adaptive Linear Control
Authors:
Asaf Cassel,
Alon Cohen,
Tomer Koren
Abstract:
We consider the problem of controlling an unknown linear dynamical system under adversarially changing convex costs and full feedback of both the state and cost function. We present the first computationally-efficient algorithm that attains an optimal $\smash{\sqrt{T}}$-regret rate compared to the best stabilizing linear controller in hindsight, while avoiding stringent assumptions on the costs such as strong convexity. Our approach is based on a careful design of non-convex lower confidence bounds for the online costs, and uses a novel technique for computationally-efficient regret minimization of these bounds that leverages their particular non-convex structure.
Submitted 3 June, 2022;
originally announced June 2022.
-
Analysis of the Finite-State Ergodic Master Equation
Authors:
Asaf Cohen,
Ethan Zell
Abstract:
Mean field games model equilibria in games with a continuum of players as limiting systems of symmetric $n$-player games with weak interaction between the players. We consider a finite-state, infinite-horizon problem with two cost criteria: discounted and ergodic. Under the Lasry--Lions monotonicity condition we characterize the stationary ergodic mean field game equilibrium by a mean field game system of two coupled equations: one for the value and the other for the stationary measure. This system is linked with the ergodic master equation. Several discounted mean field game systems are utilized in order to set up the relevant discounted master equations. We show that the discounted master equations are smooth, uniformly in the discount factor. Taking the discount factor to zero, we achieve the smoothness of the ergodic master equation.
Submitted 16 November, 2022; v1 submitted 11 April, 2022;
originally announced April 2022.
-
Efficient Online Linear Control with Stochastic Convex Costs and Unknown Dynamics
Authors:
Asaf Cassel,
Alon Cohen,
Tomer Koren
Abstract:
We consider the problem of controlling an unknown linear dynamical system under a stochastic convex cost and full feedback of both the state and cost function. We present a computationally efficient algorithm that attains an optimal $\sqrt{T}$ regret-rate compared to the best stabilizing linear controller in hindsight. In contrast to previous work, our algorithm is based on the Optimism in the Face of Uncertainty paradigm. This results in a substantially improved computational complexity and a simpler analysis.
Submitted 22 June, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Covertly Controlling a Linear System
Authors:
Barak Amihood,
Asaf Cohen
Abstract:
Consider the problem of covertly controlling a linear system. In this problem, Alice desires to control (stabilize or change the parameters of) a linear system, while keeping an observer, Willie, unable to decide if the system is indeed being controlled or not.
We formally define the problem under two different models: (i) when Willie can only observe the system's output, and (ii) when Willie can directly observe the control signal. Focusing on AR(1) systems, we show that when Willie observes the system's output through a clean channel, an inherently unstable linear system cannot be covertly stabilized. However, an inherently stable linear system can be covertly controlled, in the sense of covertly changing its parameter. Moreover, we give direct and converse results for two important controllers: a minimal-information controller, where Alice is allowed to use only $1$ bit per sample, and a maximal-information controller, where Alice is allowed to view the real-valued output. Unlike covert communication, where the trade-off is between rate and covertness, the results reveal an interesting \emph{three-fold} trade-off in covert control: the amount of information used by the controller, control performance and covertness. To the best of our knowledge, this is the first study formally defining covert control.
Submitted 6 February, 2022;
originally announced February 2022.
-
Optimal Dividends under Model Uncertainty
Authors:
Prakash Chakraborty,
Asaf Cohen,
Virginia R. Young
Abstract:
We consider a diffusive model for optimally distributing dividends, while allowing for Knightian model ambiguity concerning the drift of the surplus process. We show that the value function is the unique solution of a non-linear Hamilton-Jacobi-Bellman variational inequality. In addition, this value function embodies a unique optimal threshold strategy for the insurer's surplus, thereby making it the smooth pasting of a non-linear and linear part at the location of the threshold. Furthermore, we obtain continuity and monotonicity of the value function and the threshold strategy with respect to the parameter that measures ambiguity of our model.
Submitted 19 September, 2021;
originally announced September 2021.
-
Recursive Estimation of a Failure Probability for a Lipschitz Function
Authors:
Lucie Bernard,
Albert Cohen,
Arnaud Guyader,
Florent Malrieu
Abstract:
Let $g : Ω= [0,1]^d \rightarrow \mathbb{R}$ denote a Lipschitz function that can be evaluated at each point, but at the price of a heavy computational time. Let X stand for a random variable with values in $Ω$ such that one is able to simulate, at least approximately, according to the restriction of the law of X to any subset of $Ω$. For example, thanks to Markov chain Monte Carlo techniques, this is always possible when X admits a density that is known up to a normalizing constant. In this context, given a deterministic threshold T such that the failure probability p := P(g(X) > T) may be very low, our goal is to estimate the latter with a minimal number of calls to g. To this end, building on Cohen et al. [9], we propose a recursive and optimal algorithm that selects on the fly areas of interest and estimates their respective probabilities.
Submitted 28 July, 2021;
originally announced July 2021.
-
Asynchronous Stochastic Optimization Robust to Arbitrary Delays
Authors:
Alon Cohen,
Amit Daniely,
Yoel Drori,
Tomer Koren,
Mariano Schain
Abstract:
We consider stochastic optimization with delayed gradients where, at each time step $t$, the algorithm makes an update using a stale stochastic gradient from step $t - d_t$ for some arbitrary delay $d_t$. This setting abstracts asynchronous distributed optimization where a central server receives gradient updates computed by worker machines. These machines can experience computation and communication loads that might vary significantly over time. In the general non-convex smooth optimization setting, we give a simple and efficient algorithm that requires $O( σ^2/ε^4 + τ/ε^2 )$ steps for finding an $ε$-stationary point $x$, where $τ$ is the \emph{average} delay $\smash{\frac{1}{T}\sum_{t=1}^T d_t}$ and $σ^2$ is the variance of the stochastic gradients. This improves over previous work, which showed that stochastic gradient descent achieves the same rate but with respect to the \emph{maximal} delay $\max_{t} d_t$, which can be significantly larger than the average delay, especially in heterogeneous distributed systems. Our experiments demonstrate the efficacy and robustness of our algorithm in cases where the delay distribution is skewed or heavy-tailed.
Submitted 15 November, 2021; v1 submitted 22 June, 2021;
originally announced June 2021.
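The delayed-gradient setting described in the abstract above, where the update at step $t$ uses a gradient evaluated at the iterate from step $t - d_t$, can be sketched as below. This is plain gradient descent with stale gradients, shown only to make the setting and the difference between average and maximal delay concrete. It is not the algorithm proposed in the paper. The function names, the deterministic gradient, and the clamping of $t - d_t$ at zero are assumptions of this sketch.

```python
def delayed_sgd(grad, x0, delays, step_size):
    """Run x_{t+1} = x_t - step_size * grad(x_{t - d_t}).

    `delays[t]` plays the role of d_t. Indices before the start of the
    run are clamped to 0 (an assumption of this sketch).
    Returns the list of all iterates, starting with x0.
    """
    iterates = [x0]
    for t, d in enumerate(delays):
        stale_point = iterates[max(t - d, 0)]  # iterate the gradient was computed at
        iterates.append(iterates[t] - step_size * grad(stale_point))
    return iterates


def average_delay(delays):
    """Average delay, the quantity called tau in the abstract."""
    return sum(delays) / len(delays)


if __name__ == "__main__":
    # Skewed delays: mostly fresh gradients, one very stale one.
    delays = [0, 0, 0, 7, 0, 0, 0, 1]
    print("average delay:", average_delay(delays))
    print("maximal delay:", max(delays))
    # Quadratic objective f(x) = x^2 / 2, whose gradient is x.
    print(delayed_sgd(lambda x: x, 1.0, delays, 0.1)[-1])
```

For a skewed delay sequence such as the one in the usage example, the average delay is much smaller than the maximal delay, which is the regime the abstract highlights.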
-
Scaling Properties of Deep Residual Networks
Authors:
Alain-Sam Cohen,
Rama Cont,
Alain Rossier,
Renyuan Xu
Abstract:
Residual networks (ResNets) have displayed impressive results in pattern recognition and, recently, have garnered considerable theoretical interest due to a perceived link with neural ordinary differential equations (neural ODEs). This link relies on the convergence of network weights to a smooth function as the number of layers increases. We investigate the properties of weights trained by stochastic gradient descent and their scaling with network depth through detailed numerical experiments. We observe the existence of scaling regimes markedly different from those assumed in neural ODE literature. Depending on certain features of the network architecture, such as the smoothness of the activation function, one may obtain an alternative ODE limit, a stochastic differential equation or neither of these. These findings cast doubts on the validity of the neural ODE model as an adequate asymptotic description of deep ResNets and point to an alternative class of differential equations as a better description of the deep network limit.
Submitted 10 June, 2021; v1 submitted 25 May, 2021;
originally announced May 2021.
-
Optimal pointwise sampling for $L^2$ approximation
Authors:
Albert Cohen,
Matthieu Dolbeault
Abstract:
Given a function $u\in L^2=L^2(D,μ)$, where $D\subset \mathbb R^d$ and $μ$ is a measure on $D$, and a linear subspace $V_n\subset L^2$ of dimension $n$, we show that near-best approximation of $u$ in $V_n$ can be computed from a near-optimal budget of $Cn$ pointwise evaluations of $u$, with $C>1$ a universal constant. The sampling points are drawn according to some random distribution, the approximation is computed by a weighted least-squares method, and the error is assessed in expected $L^2$ norm. This result improves on the results in [6,8] which require a sampling budget that is sub-optimal by a logarithmic factor, thanks to a sparsification strategy introduced in [17,18]. As a consequence, we obtain for any compact class $\mathcal K\subset L^2$ that the sampling number $ρ_{Cn}^{\rm rand}(\mathcal K)_{L^2}$ in the randomized setting is dominated by the Kolmogorov $n$-width $d_n(\mathcal K)_{L^2}$. While our result shows the existence of a randomized sampling with such near-optimal properties, we discuss remaining issues concerning its generation by a computationally efficient algorithm.
Submitted 13 September, 2021; v1 submitted 12 May, 2021;
originally announced May 2021.
-
Optimal ergodic harvesting under ambiguity
Authors:
Asaf Cohen,
Alexandru Hening,
Chuhao Sun
Abstract:
We consider an ergodic harvesting problem with model ambiguity that arises from biology. To account for the ambiguity, the problem is constructed as a stochastic game with two players: the decision-maker (DM) chooses the `best' harvesting policy and an adverse player chooses the `worst' probability measure. The main result is establishing an optimal strategy (also referred to as a control) of the DM and showing that it is a threshold policy. The optimal threshold and the optimal payoff are obtained by solving a free-boundary problem emerging from the Hamilton--Jacobi--Bellman (HJB) equation. As part of the proof, we fix a gap that appeared in the HJB analysis of [Alvarez and Hening, {\em Stochastic Process. Appl.}, 2019], a paper that analyzed the risk-neutral version of the ergodic harvesting problem. Finally, we study the dependence of the optimal threshold and the optimal payoff on the ambiguity parameter and show that if the ambiguity goes to 0, the problem converges to the risk-neutral problem.
Submitted 21 April, 2021;
originally announced April 2021.
-
Near-optimal approximation methods for elliptic PDEs with lognormal coefficients
Authors:
Albert Cohen,
Giovanni Migliorati
Abstract:
This paper studies numerical methods for the approximation of elliptic PDEs with lognormal coefficients of the form $-{\rm div}(a\nabla u)=f$ where $a=\exp(b)$ and $b$ is a Gaussian random field. The approximant of the solution $u$ is an $n$-term polynomial expansion in the scalar Gaussian random variables that parametrize $b$. We present a general convergence analysis of weighted least-squares approximants for smooth and arbitrarily rough random fields, using a suitable random design, for which we prove optimality in the following sense: their convergence rate matches exactly or closely the rate that has been established in \cite{BCDM} for best $n$-term approximation by Hermite polynomials, under the same minimal assumptions on the Gaussian random field. This is in contrast with the current state-of-the-art results for the stochastic Galerkin method, which suffers from the lack of coercivity due to the lognormal nature of the diffusion field. Numerical tests with $b$ as the Brownian bridge confirm our theoretical findings.
Submitted 25 March, 2021;
originally announced March 2021.
-
A Scaling Limit for Utility Indifference Prices in the Discretized Bachelier Model
Authors:
Asaf Cohen,
Yan Dolinsky
Abstract:
We consider the discretized Bachelier model where hedging is done on an equidistant set of times. Exponential utility indifference prices are studied for path-dependent European options and we compute their non-trivial scaling limit for a large number of trading times $n$ and when risk aversion is scaled like $n\ell$ for some constant $\ell>0$. Our analysis is purely probabilistic. We first use a duality argument to transform the problem into an optimal drift control problem with a penalty term. We further use martingale techniques and strong invariance principles and get that the limiting problem takes the form of a volatility control problem.
Submitted 1 March, 2022; v1 submitted 23 February, 2021;
originally announced February 2021.
-
Partition and Analytic Rank are Equivalent over Large Fields
Authors:
Alex Cohen,
Guy Moshkovitz
Abstract:
We prove that the partition rank and the analytic rank of tensors are equal up to a constant, over finite fields of any characteristic and any large enough cardinality depending on the analytic rank. Moreover, we show that a plausible improvement of our field cardinality requirement would imply that the ranks are equal up to 1+o(1) in the exponent over every finite field. At the core of the proof is a technique for lifting decompositions of multilinear polynomials in an open subset of an algebraic variety, and a technique for finding a large subvariety that retains all rational points such that at least one of these points satisfies a finite-field analogue of genericity with respect to it. Proving the equivalence between these two ranks, ideally over fixed finite fields, is a central question in additive combinatorics, and was reiterated by multiple authors. As a corollary we prove, allowing the field to depend on the value of the norm, the Polynomial Gowers Inverse Conjecture in the d vs. d-1 case.
Submitted 27 November, 2023; v1 submitted 20 February, 2021;
originally announced February 2021.
-
Structure vs. Randomness for Bilinear Maps
Authors:
Alex Cohen,
Guy Moshkovitz
Abstract:
We prove that the slice rank of a 3-tensor (a combinatorial notion introduced by Tao in the context of the cap-set problem), the analytic rank (a Fourier-theoretic notion introduced by Gowers and Wolf), and the geometric rank (an algebro-geometric notion introduced by Kopparty, Moshkovitz, and Zuiddam) are all equal up to an absolute constant. As a corollary, we obtain strong trade-offs on the arithmetic complexity of a biased bilinear map, and on the separation between computing a bilinear map exactly and on average. Our result settles open questions of Haramaty and Shpilka [STOC 2010], and of Lovett [Discrete Anal. 2019] for 3-tensors.
Submitted 3 October, 2022; v1 submitted 9 February, 2021;
originally announced February 2021.
-
Uniqueness of excited states to $-Δu+u-u^3=0$ in three dimensions
Authors:
Alex Cohen,
Zhenhao Li,
Wilhelm Schlag
Abstract:
We prove the uniqueness of several excited states to the ODE $\ddot y(t) + \frac{2}{t} \dot y(t) + f(y(t)) = 0$, $y(0) = b$, and $\dot y(0) = 0$ for the model nonlinearity $f(y) = y^3 - y$. The $n$-th excited state is a solution with exactly $n$ zeros and which tends to $0$ as $t \to \infty$. These represent all smooth radial nonzero solutions to the PDE $Δu + f(u)= 0$ in $H^1$. We interpret the ODE as a damped oscillator governed by a double-well potential, and the result is proved via rigorous numerical analysis of the energy and variation of the solutions. More specifically, the problem of uniqueness can be formulated entirely in terms of inequalities on the solutions and their variation, and these inequalities can be verified numerically.
Submitted 4 October, 2024; v1 submitted 20 January, 2021;
originally announced January 2021.
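The initial value problem in the abstract above, $\ddot y + \frac{2}{t}\dot y + f(y) = 0$ with $y(0)=b$, $\dot y(0)=0$ and $f(y)=y^3-y$, can be explored with a simple shooting sketch that integrates from a given height $b$ and counts sign changes of $y$. This uses ordinary floating-point RK4, not the rigorous numerical analysis of the paper. The function names, step size, time horizon, and the series start near $t=0$ are assumptions of this sketch.

```python
def f(y):
    """Model nonlinearity from the abstract."""
    return y**3 - y


def rhs(t, y, v):
    """First-order form of the ODE: y' = v, v' = -(2/t) v - f(y)."""
    return v, -2.0 * v / t - f(y)


def count_zeros(b, t_end=20.0, h=1e-3):
    """Integrate from y(0)=b, y'(0)=0 and count sign changes of y on (0, t_end]."""
    # The coefficient 2/t is singular at t=0, so start at t=h using the
    # local expansion y(t) ~ b - f(b) t^2 / 6, y'(t) ~ -f(b) t / 3.
    t = h
    y = b - f(b) * t * t / 6.0
    v = -f(b) * t / 3.0
    zeros = 0
    while t < t_end:
        # One classical fourth-order Runge-Kutta step.
        k1y, k1v = rhs(t, y, v)
        k2y, k2v = rhs(t + h / 2, y + h / 2 * k1y, v + h / 2 * k1v)
        k3y, k3v = rhs(t + h / 2, y + h / 2 * k2y, v + h / 2 * k2v)
        k4y, k4v = rhs(t + h, y + h * k3y, v + h * k3v)
        y_new = y + h / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
        v_new = v + h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        if y * y_new < 0:
            zeros += 1
        y, v, t = y_new, v_new, t + h
    return zeros
```

A sanity check on the sketch comes from the damped-oscillator picture in the abstract: the energy $\dot y^2/2 + y^4/4 - y^2/2$ is non-increasing along solutions, so a trajectory started with $0 < b < \sqrt{2}$ has negative energy and can never reach $y=0$. Accordingly `count_zeros(0.5)` returns `0`.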
-
Finite state mean field games with Wright Fisher common noise as limits of $N$-player weighted games
Authors:
Erhan Bayraktar,
Alekos Cecchin,
Asaf Cohen,
François Delarue
Abstract:
Forcing finite state mean field games by a relevant form of common noise is a subtle issue, which has been addressed only recently. Among others, one possible way is to subject the simplex valued dynamics of an equilibrium by a so-called Wright-Fisher noise, very much in the spirit of stochastic models in population genetics. A key feature is that such a random forcing preserves the structure of the simplex, which is nothing but, in this setting, the probability space over the state space of the game. The purpose of this article is hence to elucidate the finite player version and, accordingly, to prove that $N$-player equilibria indeed converge towards the solution of such a kind of Wright-Fisher mean field game. Whilst part of the analysis is made easier by the fact that the corresponding master equation has already been proved to be uniquely solvable under the presence of the common noise, it becomes however more subtle than in the standard setting because the mean field interaction between the players now occurs through a weighted empirical measure. In other words, each player carries its own weight, which hence may differ from $1/N$ and which, most of all, evolves with the common noise.
Submitted 1 November, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Inner ideals in Lie algebras and spherical buildings
Authors:
Arjeh M. Cohen
Abstract:
The correspondence found by Faulkner between inner ideals of the Lie algebra of a simple algebraic group and shadows on long root groups of the building associated with the algebraic group is shown to hold in greater generality (in particular, over perfect fields of characteristic distinct from two).
Submitted 29 October, 2020;
originally announced October 2020.
-
Optimal sampling and Christoffel functions on general domains
Authors:
Albert Cohen,
Matthieu Dolbeault
Abstract:
We consider the problem of reconstructing an unknown function $u\in L^2(D,μ)$ from its evaluations at given sampling points $x^1,\dots,x^m\in D$, where $D\subset \mathbb R^d$ is a general domain and $μ$ a probability measure. The approximation is picked from a linear space $V_n$ of interest where $n=\dim(V_n)$. Recent results have revealed that certain weighted least-squares methods achieve near best approximation with a sampling budget $m$ that is proportional to $n$, up to a logarithmic factor $\ln(2n/\varepsilon)$, where $\varepsilon>0$ is a probability of failure. The sampling points should be picked at random according to a well-chosen probability measure $σ$ whose density is given by the inverse Christoffel function that depends both on $V_n$ and $μ$. While this approach is greatly facilitated when $D$ and $μ$ have tensor product structure, it becomes problematic for domains $D$ with arbitrary geometry since the optimal measure depends on an orthonormal basis of $V_n$ in $L^2(D,μ)$ which is not explicitly given, even for simple polynomial spaces. Therefore sampling according to this measure is not practically feasible. In this paper, we discuss practical sampling strategies, which amount to using a perturbed measure $\widetilde σ$ that can be computed in an offline stage, not involving the measurement of $u$. We show that near best approximation is attained by the resulting weighted least-squares method at near-optimal sampling budget and we discuss multilevel approaches that preserve optimality of the cumulated sampling budget when the spaces $V_n$ are iteratively enriched. These strategies rely on the knowledge of a-priori upper bounds on the inverse Christoffel function. We establish such bounds for spaces $V_n$ of multivariate algebraic polynomials, and for general domains $D$.
Submitted 27 October, 2020; v1 submitted 21 October, 2020;
originally announced October 2020.
-
A Sylvester-Gallai theorem for cubic curves
Authors:
Alex Cohen,
Frank de Zeeuw
Abstract:
We prove a variant of the Sylvester-Gallai theorem for cubics (algebraic curves of degree three): If a finite set of sufficiently many points in $\mathbb{R}^2$ is not contained in a cubic, then there is a cubic that contains exactly nine of the points. This resolves the first unknown case of a conjecture of Wiseman and Wilson from 1988, who proved a variant of Sylvester-Gallai for conics and conjectured that similar statements hold for curves of any degree.
Submitted 2 January, 2022; v1 submitted 4 October, 2020;
originally announced October 2020.
-
Optimal Stable Nonlinear Approximation
Authors:
Albert Cohen,
Ronald DeVore,
Guergana Petrova,
Przemyslaw Wojtaszczyk
Abstract:
While it is well known that nonlinear methods of approximation can often perform dramatically better than linear methods, there are still questions on how to measure the optimal performance possible for such methods. This paper studies nonlinear methods of approximation that are compatible with numerical implementation in that they are required to be numerically stable. A measure of optimal performance, called stable manifold widths, for approximating a model class $K$ in a Banach space $X$ by stable manifold methods is introduced. Fundamental inequalities between these stable manifold widths and the entropy of $K$ are established. The effects of requiring stability in the settings of deep learning and compressed sensing are discussed.
Submitted 21 September, 2020;
originally announced September 2020.
-
Nonlinear reduced models for state and parameter estimation
Authors:
Albert Cohen,
Wolfgang Dahmen,
Olga Mula,
James Nichols
Abstract:
State estimation aims at approximately reconstructing the solution $u$ to a parametrized partial differential equation from $m$ linear measurements, when the parameter vector $y$ is unknown. Fast numerical recovery methods have been proposed based on reduced models which are linear spaces of moderate dimension $n$ which are tailored to approximate the solution manifold $\mathcal{M}$ where the solution sits. These methods can be viewed as deterministic counterparts to Bayesian estimation approaches, and are proved to be optimal when the prior is expressed by approximability of the solution with respect to the reduced model. However, they are inherently limited by their linear nature, which bounds from below their best possible performance by the Kolmogorov width $d_m(\mathcal{M})$ of the solution manifold. In this paper we propose to break this barrier by using simple nonlinear reduced models that consist of a finite union of linear spaces $V_k$, each having dimension at most $m$ and leading to different estimators $u_k^*$. A model selection mechanism based on minimizing the PDE residual over the parameter space is used to select from this collection the final estimator $u^*$. Our analysis shows that $u^*$ meets optimal recovery benchmarks that are inherent to the solution manifold and not tied to its Kolmogorov width. The residual minimization procedure is computationally simple in the relevant case of affine parameter dependence in the PDE. In addition, it results in an estimator $y^*$ for the unknown parameter vector. In this setting, we also discuss an alternating minimization (coordinate descent) algorithm for joint state and parameter estimation, that potentially improves the quality of both estimators.
Submitted 24 November, 2020; v1 submitted 6 September, 2020;
originally announced September 2020.
-
Optimal Dividend Problem: Asymptotic Analysis
Authors:
Asaf Cohen,
Virginia R. Young
Abstract:
We re-visit the classical problem of optimal payment of dividends and determine the degree to which the diffusion approximation serves as a valid approximation of the classical risk model for this problem. Our results parallel some of those in Bäuerle (2004), but we obtain sharper results because we use a different technique for obtaining them. Specifically, Bäuerle (2004) uses probabilistic techniques and relies on convergence in distribution of the underlying processes. By contrast, we use comparison results from the theory of differential equations, and these methods allow us to determine the rate of convergence of the value functions in question.
Submitted 22 October, 2020; v1 submitted 21 July, 2020;
originally announced July 2020.
-
A Sylvester-Gallai result for concurrent lines in the complex plane
Authors:
Alex Cohen
Abstract:
We show that if a set of points in $\mathbb{C}^2$ lies on a family of $m$ concurrent lines, and if one of those lines contains more than $m-2$ points, then there is a line passing through exactly two points of the set. The bound $m-2$ in our result is optimal. Our main theorem resolves a conjecture of Frank de Zeeuw, and generalizes a result of Kelly and Nwankpa.
Submitted 28 September, 2020; v1 submitted 7 July, 2020;
originally announced July 2020.
-
A Macroeconomic SIR Model for COVID-19
Authors:
Erhan Bayraktar,
Asaf Cohen,
April Nellis
Abstract:
The current COVID-19 pandemic and subsequent lockdowns have highlighted the close and delicate relationship between a country's public health and economic health. Macroeconomic models that use preexisting epidemic models to calculate the impacts of a disease outbreak are therefore extremely useful for policymakers seeking to evaluate the best course of action in such a crisis. We develop an SIR model of the COVID-19 pandemic that explicitly considers herd immunity, behavior-dependent transmission rates, remote workers, and indirect externalities of lockdown. This model is presented as an exit time control problem where lockdown ends when the population achieves herd immunity, either naturally or via a vaccine. A social planner prescribes separate levels of lockdown for two separate sections of the adult population: low-risk (ages 20-64) and high-risk (ages 65 and over). These levels are determined via optimization of an objective function which assigns a macroeconomic cost to the level of lockdown and the number of deaths. We find that, by ending lockdowns once herd immunity is reached, high-risk individuals are able to leave lockdown significantly before the arrival of a vaccine without causing large increases in mortality. Moreover, if we incorporate a behavior-dependent transmission rate which represents increased personal caution in response to increased infection levels, both output loss and total mortality are lowered. Lockdown efficacy is further increased when there is less interaction between low- and high-risk individuals, and increased remote work decreases output losses. Overall, our model predicts that a lockdown which ends at the arrival of herd immunity, combined with individual actions to slow virus transmission, can reduce total mortality to one-third of the no-lockdown level, while allowing high-risk individuals to leave lockdown well before vaccine arrival.
Submitted 22 June, 2020;
originally announced June 2020.
-
Nonlinear Methods for Model Reduction
Authors:
Andrea Bonito,
Albert Cohen,
Ronald DeVore,
Diane Guignard,
Peter Jantsch,
Guergana Petrova
Abstract:
The usual approach to model reduction for parametric partial differential equations (PDEs) is to construct a linear space $V_n$ which approximates well the solution manifold $\mathcal{M}$ consisting of all solutions $u(y)$ with $y$ the vector of parameters. This linear reduced model $V_n$ is then used for various tasks such as building an online forward solver for the PDE or estimating parameters from data observations. It is well understood in other problems of numerical computation that nonlinear methods such as adaptive approximation, $n$-term approximation, and certain tree-based methods may provide improved numerical efficiency. For model reduction, a nonlinear method would replace the linear space $V_n$ by a nonlinear space $Σ_n$. This idea has already been suggested in recent papers on model reduction where the parameter domain is decomposed into a finite number of cells and a linear space of low dimension is assigned to each cell.
Up to this point, little is known in terms of performance guarantees for such a nonlinear strategy. Moreover, most numerical experiments for nonlinear model reduction use a parameter dimension of only one or two. In this work, a step is made towards a more cohesive theory for nonlinear model reduction. Framing these methods in the general setting of library approximation allows us to give a first comparison of their performance with that of standard linear approximation for any general compact set. We then turn to the study of these methods for solution manifolds of parametrized elliptic PDEs. We study a very specific example of library approximation where the parameter domain is split into a finite number $N$ of rectangular cells and where different reduced affine spaces of dimension $m$ are assigned to each cell. The performance of this nonlinear procedure is analyzed from the viewpoint of accuracy of approximation versus $m$ and $N$.
Submitted 5 May, 2020;
originally announced May 2020.
-
Asymptotic optimality of the generalized $cμ$ rule under model uncertainty
Authors:
Asaf Cohen,
Subhamay Saha
Abstract:
We consider a critically-loaded multiclass queueing control problem with model uncertainty. The model consists of $I$ types of customers and a single server. At any time instant, a decision-maker (DM) allocates the server's effort to the customers. The DM's goal is to minimize a convex holding cost that accounts for the ambiguity with respect to the model, i.e., the arrival and service rates. For this, we consider an adversary player whose role is to choose the worst-case scenario. Specifically, we assume that the DM has a reference probability model in mind and that the cost function is formulated as a supremum over admissible probability measures equivalent to the reference measure. The cost has two components: the first is the expected holding cost, and the second is a penalty for the adversary player for deviating from the reference model. The penalty term is formulated by a general divergence measure.
We show that, although the critical-load condition might be violated under the equivalent admissible measures, the generalized $cμ$ rule is asymptotically optimal for this problem.
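The abstract does not spell out the rule itself. As a rough sketch, assuming the standard formulation of the generalized $cμ$ rule, the server gives priority to the nonempty class $i$ with the largest index $μ_i C_i'(x_i)$, where $x_i$ is the current queue length, $μ_i$ the service rate, and $C_i$ the convex holding cost of class $i$. The function name, its arguments, and the linear marginal costs in the usage lines are hypothetical and chosen only for illustration; the paper's robust formulation is not modelled here.

```python
from typing import Callable, Optional, Sequence


def generalized_c_mu(
    queues: Sequence[float],
    service_rates: Sequence[float],
    marginal_costs: Sequence[Callable[[float], float]],
) -> Optional[int]:
    """Return the index of the class to serve, or None if all queues are empty.

    Assumed rule: among nonempty classes, pick the one that maximises
    mu_i * C_i'(x_i); ties go to the lowest class index.
    """
    best_index: Optional[int] = None
    best_value = float("-inf")
    for i, (x, mu, d_cost) in enumerate(zip(queues, service_rates, marginal_costs)):
        if x <= 0:
            continue  # nothing to serve in this class
        value = mu * d_cost(x)
        if value > best_value:
            best_index, best_value = i, value
    return best_index


# Quadratic holding costs C_i(x) = c_i * x^2 / 2, so C_i'(x) = c_i * x.
costs = [lambda x: 1.0 * x, lambda x: 4.0 * x]
choice = generalized_c_mu([3.0, 1.0], [1.0, 2.0], costs)
```

In the usage lines the indices are $1\cdot 3=3$ for the first class and $2\cdot 4=8$ for the second, so the second class is served.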
Submitted 29 March, 2021; v1 submitted 2 April, 2020;
originally announced April 2020.