-
Propagating Model Uncertainty through Filtering-based Probabilistic Numerical ODE Solvers
Authors:
Dingling Yao,
Filip Tronarp,
Nathanael Bosch
Abstract:
Filtering-based probabilistic numerical solvers for ordinary differential equations (ODEs), also known as ODE filters, have been established as efficient methods for quantifying numerical uncertainty in the solution of ODEs. In practical applications, however, the underlying dynamical system often contains uncertain parameters, requiring the propagation of this model uncertainty to the ODE solutio…
▽ More
Filtering-based probabilistic numerical solvers for ordinary differential equations (ODEs), also known as ODE filters, have been established as efficient methods for quantifying numerical uncertainty in the solution of ODEs. In practical applications, however, the underlying dynamical system often contains uncertain parameters, requiring the propagation of this model uncertainty to the ODE solution. In this paper, we demonstrate that ODE filters, despite their probabilistic nature, do not automatically solve this uncertainty propagation problem. To address this limitation, we present a novel approach that combines ODE filters with numerical quadrature to properly marginalize over uncertain parameters, while accounting for both parameter uncertainty and numerical solver uncertainty. Experiments across multiple dynamical systems demonstrate that the resulting uncertainty estimates closely match reference solutions. Notably, we show how the numerical uncertainty from the ODE solver can help prevent overconfidence in the propagated uncertainty estimates, especially when using larger step sizes. Our results illustrate that probabilistic numerical methods can effectively quantify both numerical and parametric uncertainty in dynamical systems.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Authors:
Hanyang Zhao,
Haoxian Chen,
Ji Zhang,
David D. Yao,
Wenpin Tang
Abstract:
Reinforcement learning from human feedback (RLHF), which aligns a diffusion model with input prompt, has become a crucial step in building reliable generative AI models. Most works in this area use a discrete-time formulation, which is prone to induced errors, and often not applicable to models with higher-order/black-box solvers. The objective of this study is to develop a disciplined approach to…
▽ More
Reinforcement learning from human feedback (RLHF), which aligns a diffusion model with input prompt, has become a crucial step in building reliable generative AI models. Most works in this area use a discrete-time formulation, which is prone to induced errors, and often not applicable to models with higher-order/black-box solvers. The objective of this study is to develop a disciplined approach to fine-tune diffusion models using continuous-time RL, formulated as a stochastic control problem with a reward function that aligns the end result (terminal state) with input prompt. The key idea is to treat score matching as controls or actions, and thereby making connections to policy optimization and regularization in continuous-time RL. To carry out this idea, we lay out a new policy optimization framework for continuous-time RL, and illustrate its potential in enhancing the value networks design space via leveraging the structural property of diffusion models. We validate the advantages of our method by experiments in downstream tasks of fine-tuning large-scale Text2Image models of Stable Diffusion v1.5.
△ Less
Submitted 16 April, 2025; v1 submitted 3 February, 2025;
originally announced February 2025.
-
Critical radii and suprema of random waves over Riemannian manifolds
Authors:
Renjie Feng,
Dong Yao,
Robert J. Adler
Abstract:
We study random waves on smooth, compact, Riemannian manifolds under the spherical ensemble. Our first main result shows that there is a positive universal limit for the critical radius of a specific deterministic embedding, defined via the eigenfunctions of the Laplace-Beltrami operator, of such manifolds into higher dimensional Euclidean spaces. This result enables the application of Weyl's tube…
▽ More
We study random waves on smooth, compact, Riemannian manifolds under the spherical ensemble. Our first main result shows that there is a positive universal limit for the critical radius of a specific deterministic embedding, defined via the eigenfunctions of the Laplace-Beltrami operator, of such manifolds into higher dimensional Euclidean spaces. This result enables the application of Weyl's tube formula to derive the tail probabilities for the suprema of random waves. Consequently, the estimate for the expectation of the Euler characteristic of the excursion set follows directly.
△ Less
Submitted 18 January, 2025;
originally announced January 2025.
-
KAM Theory for almost-periodic equilibria in one dimensional almost-periodic media
Authors:
Yujia An,
Rafael de la Llave,
Xifeng Su,
Donghua Wang,
Dongyu Yao
Abstract:
We consider one dimensional chains of interacting particles subjected to one dimensional almost-periodic media. We formulate and prove two KAM type theorems corresponding to both short-range and long-range interactions respectively. Both theorems presented have an a posteriori format and establish the existence of almost-periodic equilibria. The new part here is that the potential function is give…
▽ More
We consider one dimensional chains of interacting particles subjected to one dimensional almost-periodic media. We formulate and prove two KAM type theorems corresponding to both short-range and long-range interactions respectively. Both theorems presented have an a posteriori format and establish the existence of almost-periodic equilibria. The new part here is that the potential function is given by some almost-periodic function with infinitely many incommensurate frequencies.
In both cases, we do not need to assume that the system is close to integrable. We will show that if there exists an approximate solution for the functional equations, which satisfies some appropriate non-degeneracy conditions, then a true solution nearby is obtained. This procedure may be used to validate efficient numerical computations.
Moreover, to well understand the role of almost-periodic media which can be approximated by quasi-periodic ones, we present a different approach -- the step by step increase of complexity method -- to the study of the above results of the almost-periodic models.
△ Less
Submitted 8 November, 2024;
originally announced November 2024.
-
A Non-convex Optimization Approach of Searching Algebraic Degree Phase-type Representations for General Phase-type Distributions
Authors:
Yujie Liu,
Dacheng Yao,
Hanqin Zhang
Abstract:
For a continuous-time phase-type distribution, starting with its Laplace-Stieltjes transform, we obtain a necessary and sufficient condition for its minimal phase-type representation to have the same order as the algebraic degree of the Laplace-Stieltjes transform. To facilitate finding this minimal representation, we transform this condition equivalently into a quadratic nonconvex optimization pr…
▽ More
For a continuous-time phase-type distribution, starting with its Laplace-Stieltjes transform, we obtain a necessary and sufficient condition for its minimal phase-type representation to have the same order as the algebraic degree of the Laplace-Stieltjes transform. To facilitate finding this minimal representation, we transform this condition equivalently into a quadratic nonconvex optimization problem, which can be effectively addressed using an alternating minimization algorithm. The algorithm convergence is also proved. Moreover, the method we develop for the continuous-time phase-type distributions can be directly used to the discrete-time phase-type distributions after establishing an equivalence between the minimal representation problems for continuous-time and discrete-times phase-type distributions.
△ Less
Submitted 14 February, 2025; v1 submitted 19 September, 2024;
originally announced September 2024.
-
On Substochastic Inverse Eigenvalue Problems with the Corresponding Eigenvector Constraints
Authors:
Yujie Liu,
Dacheng Yao,
Hanqin Zhang
Abstract:
We consider the inverse eigenvalue problem of constructing a substochastic matrix from the given spectrum parameters with the corresponding eigenvector constraints. This substochastic inverse eigenvalue problem (SstIEP) with the specific eigenvector constraints is formulated into a nonconvex optimization problem (NcOP). The solvability for SstIEP with the specific eigenvector constraints is equiva…
▽ More
We consider the inverse eigenvalue problem of constructing a substochastic matrix from the given spectrum parameters with the corresponding eigenvector constraints. This substochastic inverse eigenvalue problem (SstIEP) with the specific eigenvector constraints is formulated into a nonconvex optimization problem (NcOP). The solvability for SstIEP with the specific eigenvector constraints is equivalent to identify the attainability of a zero optimal value for the formulated NcOP. When the optimal objective value is zero, the corresponding optimal solution to the formulated NcOP is just the substochastic matrix desired to be constructed. We develop the alternating minimization algorithm to solve the formulated NcOP, and its convergence is established by developing a novel method to obtain the boundedness of the optimal solution. Some numerical experiments are conducted to demonstrate the efficiency of the proposed method.
△ Less
Submitted 5 August, 2024;
originally announced September 2024.
-
Small gaps of GSE
Authors:
Renjie Feng,
Jiaming Li,
Dong Yao
Abstract:
In this paper, we study the smallest gaps for the Gaussian symplectic ensemble (GSE). We prove that the rescaled smallest gaps and their locations converge to a Poisson point process with an explicit rate. The approach provides an alternative proof for the GOE case and complements the results in \cite{FTW}. By combining the main results from \cite{BB, FTW, FW2}, the study of the smallest gaps for…
▽ More
In this paper, we study the smallest gaps for the Gaussian symplectic ensemble (GSE). We prove that the rescaled smallest gaps and their locations converge to a Poisson point process with an explicit rate. The approach provides an alternative proof for the GOE case and complements the results in \cite{FTW}. By combining the main results from \cite{BB, FTW, FW2}, the study of the smallest gaps for the classical random matrix ensembles C$β$E and G$β$E for $β= 1, 2,$ and $4$ is now complete.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
Quasi-stationary distributions for subcritical branching Markov chains
Authors:
Wenming Hong,
Dan Yao
Abstract:
Consider a subcritical branching Markov chain. Let $Z_n$ denote the counting measure of particles of generation $n$. Under some conditions, we give a probabilistic proof for the existence of the Yaglom limit of $(Z_n)_{n\in\mathbb{N}}$ by the moment method, based on the spinal decomposition and the many-to-few formula. As a result, we give explicit integral representations of all quasi-stationary…
▽ More
Consider a subcritical branching Markov chain. Let $Z_n$ denote the counting measure of particles of generation $n$. Under some conditions, we give a probabilistic proof for the existence of the Yaglom limit of $(Z_n)_{n\in\mathbb{N}}$ by the moment method, based on the spinal decomposition and the many-to-few formula. As a result, we give explicit integral representations of all quasi-stationary distributions of $(Z_n)_{n\in\mathbb{N}}$, whose proofs are direct and probabilistic, and don't rely on Martin boundary theory.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Biorthogonal polynomials related to quantum transport theory of disordered wires
Authors:
Dong Wang,
Dong Yao
Abstract:
We consider the Plancherel-Rotach type asymptotics of the biorthogonal polynomials associated to the biorthogonal ensemble with the joint probability density function \begin{equation*}
\frac{1}{C} \prod_{1 \leq i < j \leq n} (λ_j -λ_i)(f(λ_j) - f(λ_i)) \prod^n_{j = 1} W^{(n)}_α(λ_j) dλ_j, \end{equation*} where \begin{align*} f(x) = {}& \sinh^2(\sqrt{x}), & W^{(n)}_α(x) = {}& x^α h(x) e^{-nV(x)}.…
▽ More
We consider the Plancherel-Rotach type asymptotics of the biorthogonal polynomials associated to the biorthogonal ensemble with the joint probability density function \begin{equation*}
\frac{1}{C} \prod_{1 \leq i < j \leq n} (λ_j -λ_i)(f(λ_j) - f(λ_i)) \prod^n_{j = 1} W^{(n)}_α(λ_j) dλ_j, \end{equation*} where \begin{align*} f(x) = {}& \sinh^2(\sqrt{x}), & W^{(n)}_α(x) = {}& x^α h(x) e^{-nV(x)}. \end{align*} In the special case that the potential function $V$ is linear, this biorthogonal ensemble arises in the quantum transport theory of disordered wires. We analyze the asymptotic problem via $2$-component vector-valued Riemann-Hilbert problems, and solve it under the one-cut regular with a hard edge condition. We use the asymptotics of biorthogonal polynomials to establish sine universality for the correlation kernel in the bulk, and provide a central limit theorem with a specific variance for holomorphic linear statistics.
As an application of our theories, we establish the Ohm's law (1.12) and universal conductance fluctuation (1.13) for the disordered wire model, thereby rigorously confirming predictions from experimental physics [Washburn-Webb86].
△ Less
Submitted 31 December, 2024; v1 submitted 7 July, 2023;
originally announced July 2023.
-
Policy Optimization for Continuous Reinforcement Learning
Authors:
Hanyang Zhao,
Wenpin Tang,
David D. Yao
Abstract:
We study reinforcement learning (RL) in the setting of continuous time and space, for an infinite horizon with a discounted objective and the underlying dynamics driven by a stochastic differential equation. Built upon recent advances in the continuous approach to RL, we develop a notion of occupation time (specifically for a discounted objective), and show how it can be effectively used to derive…
▽ More
We study reinforcement learning (RL) in the setting of continuous time and space, for an infinite horizon with a discounted objective and the underlying dynamics driven by a stochastic differential equation. Built upon recent advances in the continuous approach to RL, we develop a notion of occupation time (specifically for a discounted objective), and show how it can be effectively used to derive performance-difference and local-approximation formulas. We further extend these results to illustrate their applications in the PG (policy gradient) and TRPO/PPO (trust region policy optimization/ proximal policy optimization) methods, which have been familiar and powerful tools in the discrete RL setting but under-developed in continuous RL. Through numerical experiments, we demonstrate the effectiveness and advantages of our approach.
△ Less
Submitted 18 October, 2023; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Determinantal point processes on spheres: multivariate linear statistics
Authors:
Renjie Feng,
Friedrich Götze,
Dong Yao
Abstract:
In this paper, we will derive the first and 2nd order Wiener chaos decomposition for the multivariate linear statistics of the determinantal point processes associated with the spectral projection kernels on the unit spheres $S^d$. We will first get a graphical representation for the cumulants of multivariate linear statistics for any determinantal point process. The main results then follow from…
▽ More
In this paper, we will derive the first and 2nd order Wiener chaos decomposition for the multivariate linear statistics of the determinantal point processes associated with the spectral projection kernels on the unit spheres $S^d$. We will first get a graphical representation for the cumulants of multivariate linear statistics for any determinantal point process. The main results then follow from the very precise estimates and identities regarding the spectral projection kernels and the symmetry of the spheres.
△ Less
Submitted 22 January, 2023;
originally announced January 2023.
-
SIR Epidemics on Evolving Erdős-Rényi Graphs
Authors:
Wenze Chen,
Yuewen Hou,
Dong Yao
Abstract:
In the standard SIR model, infected vertices infect their neighbors at rate $λ$ independently across each edge. They also recover at rate $γ$. In this work we consider the SIR-$ω$ model where the graph structure itself co-evolves with the SIR dynamics. Specifically, $S-I$ connections are broken at rate $ω$. Then, with probability $α$, $S$ rewires this edge to another uniformly chosen vertex; and w…
▽ More
In the standard SIR model, infected vertices infect their neighbors at rate $λ$ independently across each edge. They also recover at rate $γ$. In this work we consider the SIR-$ω$ model where the graph structure itself co-evolves with the SIR dynamics. Specifically, $S-I$ connections are broken at rate $ω$. Then, with probability $α$, $S$ rewires this edge to another uniformly chosen vertex; and with probability $1-α$, this edge is simply dropped. When $α=1$ the SIR-$ω$ model becomes the evoSIR model. Jiang et al. proved in \cite{DOMath} that the probability of an outbreak in the evoSIR model converges to 0 as $λ$ approaches the critical infection rate $λ_c$. On the other hand, numerical experiments in \cite{DOMath} revealed that, as $λ\to λ_c$, (conditionally on an outbreak) the fraction of infected vertices may not converge to 0, which is referred to as a discontinuous phase transition. In \cite{BB} Ball and Britton give two (non-matching) conditions for continuous and discontinuous phase transitions for the fraction of infected vertices in the SIR-$ω$ model. In this work, we obtain a necessary and sufficient condition for the emergence of a discontinuous phase transition of the final epidemic size of the SIR-$ω$ model on \ER\, graphs, thus closing the gap between these two conditions.
△ Less
Submitted 15 May, 2025; v1 submitted 25 August, 2022;
originally announced August 2022.
-
Trading under the Proof-of-Stake Protocol -- a Continuous-Time Control Approach
Authors:
Wenpin Tang,
David D. Yao
Abstract:
We develop a continuous-time control approach to optimal trading in a Proof-of-Stake (PoS) blockchain, formulated as a consumption-investment problem that aims to strike the optimal balance between a participant's (or agent's) utility from holding/trading stakes and utility from consumption. We present solutions via dynamic programming and the Hamilton-Jacobi-Bellman (HJB) equations. When the util…
▽ More
We develop a continuous-time control approach to optimal trading in a Proof-of-Stake (PoS) blockchain, formulated as a consumption-investment problem that aims to strike the optimal balance between a participant's (or agent's) utility from holding/trading stakes and utility from consumption. We present solutions via dynamic programming and the Hamilton-Jacobi-Bellman (HJB) equations. When the utility functions are linear or convex, we derive close-form solutions and show that the bang-bang strategy is optimal (i.e., always buy or sell at full capacity). Furthermore, we bring out the explicit connection between the rate of return in trading/holding stakes and the participant's risk-adjusted valuation of the stakes. In particular, we show when a participant is risk-neutral or risk-seeking, corresponding to the risk-adjusted valuation being a martingale or a sub-martingale, the optimal strategy must be to either buy all the time, sell all the time, or first buy then sell, and with both buying and selling executed at full capacity. We also propose a risk-control version of the consumption-investment problem; and for a special case, the ''stake-parity'' problem, we show a mean-reverting strategy is optimal.
△ Less
Submitted 11 June, 2023; v1 submitted 25 July, 2022;
originally announced July 2022.
-
Bayesian Sparse Gaussian Mixture Model in High Dimensions
Authors:
Dapeng Yao,
Fangzheng Xie,
Yanxun Xu
Abstract:
We study the sparse high-dimensional Gaussian mixture model when the number of clusters is allowed to grow with the sample size. A minimax lower bound for parameter estimation is established, and we show that a constrained maximum likelihood estimator achieves the minimax lower bound. However, this optimization-based estimator is computationally intractable because the objective function is highly…
▽ More
We study the sparse high-dimensional Gaussian mixture model when the number of clusters is allowed to grow with the sample size. A minimax lower bound for parameter estimation is established, and we show that a constrained maximum likelihood estimator achieves the minimax lower bound. However, this optimization-based estimator is computationally intractable because the objective function is highly nonconvex and the feasible set involves discrete structures. To address the computational challenge, we propose a Bayesian approach to estimate high-dimensional Gaussian mixtures whose cluster centers exhibit sparsity using a continuous spike-and-slab prior. Posterior inference can be efficiently computed using an easy-to-implement Gibbs sampler. We further prove that the posterior contraction rate of the proposed Bayesian method is minimax optimal. The mis-clustering rate is obtained as a by-product using tools from matrix perturbation theory. The proposed Bayesian sparse Gaussian mixture model does not require pre-specifying the number of clusters, which can be adaptively estimated via the Gibbs sampler. The validity and usefulness of the proposed method is demonstrated through simulation studies and the analysis of a real-world single-cell RNA sequencing dataset.
△ Less
Submitted 22 February, 2024; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Polynomial Voting Rules
Authors:
Wenpin Tang,
David D. Yao
Abstract:
We propose and study a new class of polynomial voting rules for a general decentralized decision/consensus system, and more specifically for the PoS (Proof of Stake) protocol. The main idea, inspired by the Penrose square-root law and the more recent quadratic voting rule, is to differentiate a voter's voting power and the voter's share (fraction of the total in the system). We show that while vot…
▽ More
We propose and study a new class of polynomial voting rules for a general decentralized decision/consensus system, and more specifically for the PoS (Proof of Stake) protocol. The main idea, inspired by the Penrose square-root law and the more recent quadratic voting rule, is to differentiate a voter's voting power and the voter's share (fraction of the total in the system). We show that while voter shares form a martingale process that converge to a Dirichlet distribution, their voting powers follow a super-martingale process that decays to zero over time. This prevents any voter from controlling the voting process, and thus enhances security. For both limiting results, we also provide explicit rates of convergence. When the initial total volume of votes (or stakes) is large, we show a phase transition in share stability (or the lack thereof), corresponding to the voter's initial share relative to the total. We also study the scenario in which trading (of votes/stakes) among the voters is allowed, and quantify the level of risk sensitivity (or risk averse) in three categories, corresponding to the voter's utility being a super-martingale, a sub-martingale, and a martingale. For each category, we identify the voter's best strategy in terms of participation and trading.
△ Less
Submitted 8 January, 2024; v1 submitted 21 June, 2022;
originally announced June 2022.
-
Principal minors of Gaussian orthogonal ensemble
Authors:
Renjie Feng,
Gang Tian,
Dongyi Wei,
Dong Yao
Abstract:
In this paper, we study the extremal process of the maxima of all the largest eigenvalues of principal minors of the classical Gaussian orthogonal ensemble (GOE). We prove that the fluctuation of the maxima is given by the Gumbel distribution in the limit. We also derive the limiting joint distribution of the maxima and the corresponding eigenvector, which implies that these two random variables a…
▽ More
In this paper, we study the extremal process of the maxima of all the largest eigenvalues of principal minors of the classical Gaussian orthogonal ensemble (GOE). We prove that the fluctuation of the maxima is given by the Gumbel distribution in the limit. We also derive the limiting joint distribution of the maxima and the corresponding eigenvector, which implies that these two random variables are asymptotically independent.
△ Less
Submitted 12 February, 2024; v1 submitted 11 May, 2022;
originally announced May 2022.
-
Mean Field Behavior during the Big Bang Regime for Coalescing Random Walks
Authors:
Jonathan Hermon,
Shuangping Li,
Dong Yao,
Lingfu Zhang
Abstract:
In this paper we consider coalescing random walks on a general connected graph $G=(V,E)$. We set up a unified framework to study the leading order of the decay rate of $P_t$, the expectation of the fraction of occupied sites at time $t$, particularly for the `Big Bang' regime where $t\ll t_{\text{coal}}:=\mathbb{E}[\inf\{s:\text{There is only one particle at time }s\}]$. Our results show that…
▽ More
In this paper we consider coalescing random walks on a general connected graph $G=(V,E)$. We set up a unified framework to study the leading order of the decay rate of $P_t$, the expectation of the fraction of occupied sites at time $t$, particularly for the `Big Bang' regime where $t\ll t_{\text{coal}}:=\mathbb{E}[\inf\{s:\text{There is only one particle at time }s\}]$. Our results show that $P_t$ satisfies certain mean field behavior, if the graphs satisfy certain transience-like conditions.
We apply this framework to two families of graphs: (1) graphs given by the configuration model with a degree distribution supported in $[3,\bar d]$ for some $\bar d\geq 3$, and (2) finite and infinite vertex-transitive graphs. In the first case, we show that for $1 \ll t \ll |V|$, $P_t$ decays in the order of $t^{-1}$, and $(tP_t)^{-1}$ is approximately the probability that two particles starting from the root of the corresponding unimodular Galton-Watson tree never collide after one of them leaves the root, which is also roughly $|V|/(2t_{\text{meet}})$, where $t_{\text{meet}}$ is the mean meeting time of two walkers. By taking the local weak limit, for the unimodular Galton-Watson tree we prove the convergence of $tP_t$ as $t\to\infty$. For the second family of graphs, if we take a sequence of finite graphs $G_n=(V_n, E_n)$, such that $t_{\text{meet}}=O(|V_n|)$ and the inverse of the spectral gap $t_{\text{rel}}$ is $o(|V_n|)$, then for $t_{\text{rel}}\ll t\ll t_{\text{coal}}$, $(tP_t)^{-1}$ is approximately the probability that two random walks never meet before time $t$, and also $|V|/(2t_{\text{meet}})$. In addition, we define a natural uniform transience condition, and show that it implies the above for all $1\ll t\ll t_{\text{coal}}$. Such estimates of $tP_t$ are also obtained for all infinite transient transitive unimodular graphs, in particular, all transient transitive amenable graphs.
△ Less
Submitted 12 September, 2022; v1 submitted 24 May, 2021;
originally announced May 2021.
-
Ergodic inventory control with diffusion demand and general ordering costs
Authors:
Bo Wei,
Dacheng Yao
Abstract:
In this work, we consider a continuous-time inventory system where the demand process follows an inventory-dependent diffusion process. The ordering cost of each order depends on the order quantity and is given by a general function, which is not even necessarily continuous and monotone. By applying a lower bound approach together with a comparison theorem, we show the global optimality of an…
▽ More
In this work, we consider a continuous-time inventory system where the demand process follows an inventory-dependent diffusion process. The ordering cost of each order depends on the order quantity and is given by a general function, which is not even necessarily continuous and monotone. By applying a lower bound approach together with a comparison theorem, we show the global optimality of an $(s,S)$ policy for this ergodic inventory control problem.
△ Less
Submitted 4 December, 2020;
originally announced December 2020.
-
Susceptible-Infected Epidemics on Evolving Graphs
Authors:
Rick Durrett,
Dong Yao
Abstract:
The evoSIR model is a modification of the usual SIR process on a graph $G$ in which $S-I$ connections are broken at rate $ρ$ and the $S$ connects to a randomly chosen vertex. The evoSI model is the same as evoSIR but recovery is impossible. In \cite{DOMath} the critical value for evoSIR was computed and simulations showed that when $G$ is an Erd\H os-Rényi graph with mean degree 5, the system has…
▽ More
The evoSIR model is a modification of the usual SIR process on a graph $G$ in which $S-I$ connections are broken at rate $ρ$ and the $S$ connects to a randomly chosen vertex. The evoSI model is the same as evoSIR but recovery is impossible. In \cite{DOMath} the critical value for evoSIR was computed and simulations showed that when $G$ is an Erd\H os-Rényi graph with mean degree 5, the system has a discontinuous phase transition, i.e., as the infection rate $λ$ decreases to $λ_c$, the fraction of individuals infected during the epidemic does not converge to 0. In this paper we study evoSI dynamics on graphs generated by the configuration model. We show that there is a quantity $Δ$ determined by the first three moments of the degree distribution, so that the phase transition is discontinuous if $Δ>0$ and continuous if $Δ<0$.
△ Less
Submitted 1 October, 2023; v1 submitted 18 March, 2020;
originally announced March 2020.
-
Impulse Control with Discontinuous Setup Costs: Discounted Cost Criterion
Authors:
Fen Xu,
Dacheng Yao,
Hanqin Zhang
Abstract:
This paper studies a continuous-review backlogged inventory model considered by Helmes et al. (2015) but with discontinuous quantity-dependent setup cost for each order. In particular, the setup cost is characterized by a two-step function and a higher cost would be charged once the order quantity exceeds a threshold $Q$. Unlike the optimality of $(s,S)$-type policy obtained by Helmes et al. (2015…
▽ More
This paper studies a continuous-review backlogged inventory model considered by Helmes et al. (2015) but with discontinuous quantity-dependent setup cost for each order. In particular, the setup cost is characterized by a two-step function and a higher cost would be charged once the order quantity exceeds a threshold $Q$. Unlike the optimality of $(s,S)$-type policy obtained by Helmes et al. (2015) for continuous setup cost with the discounted cost criterion, we find that, in our model, although some $(s,S)$-type policy is indeed optimal in some cases, the $(s,S)$-type policy can not always be optimal. In particular, we show that there exist cases in which an $(s,S)$ policy is optimal for some initial levels but it is strictly worse than a generalized $(s,\{S(x):x\leq s\})$ policy for the other initial levels. Under $(s,\{S(x):x\leq s\})$ policy, it orders nothing for $x>s$ and orders up to level $S(x)$ for $x\leq s$, where $S(x)$ is a non-constant function of $x$. We further prove the optimality of such $(s,\{S(x):x\leq s\})$ policy in a large subset of admissible policies for those initial levels. Moreover, the optimality is obtained through establishing a more general lower bound theorem which will also be applicable in solving some other optimization problems by the common lower bound approach.
△ Less
Submitted 2 September, 2020; v1 submitted 13 November, 2019;
originally announced November 2019.
-
Zeros of repeated derivatives of random polynomials
Authors:
Renjie Feng,
Dong Yao
Abstract:
It has been shown that zeros of Kac polynomials $K_n(z)$ of degree $n$ cluster asymptotically near the unit circle as $n\to\infty$ under some assumptions. This property remains unchanged for the $l$-th derivative of the Kac polynomials $K^{(l)}_n(z)$ for any fixed order $l$. So it's natural to study the situation when the number of the derivatives we take depends on $n$, i.e., $l=N_n$. We will sho…
▽ More
It has been shown that zeros of Kac polynomials $K_n(z)$ of degree $n$ cluster asymptotically near the unit circle as $n\to\infty$ under some assumptions. This property remains unchanged for the $l$-th derivative of the Kac polynomials $K^{(l)}_n(z)$ for any fixed order $l$. So it's natural to study the situation when the number of the derivatives we take depends on $n$, i.e., $l=N_n$. We will show that the limiting global behavior of zeros of $K_n^{(N_n)}(z)$ depends on the limit of the ratio $N_n/n$. In particular, we prove that when the limit of the ratio is strictly positive, the property of the uniform clustering around the unit circle fails; when the ratio is close to 1, the zeros have some rescaling phenomenon. Then we study such problem for random polynomials with more general coefficients. But things, especially the rescaling phenomenon, become very complicated for the general case when $N_n/n\to 1$, where we compute the case of the random elliptic polynomials to illustrate this.
△ Less
Submitted 2 August, 2019;
originally announced August 2019.
-
The Symbiotic Contact Process
Authors:
Rick Durrett,
Dong Yao
Abstract:
We consider a contact process on $Z^d$ with two species that interact in a symbiotic manner. Each site can either be vacant or occupied by individuals of species $A$ and/or $B$. Multiple occupancy by the same species at a single site is prohibited. The name symbiotic comes from the fact that if only one species is present at a site then that particle dies with rate 1 but if both species are presen…
▽ More
We consider a contact process on $Z^d$ with two species that interact in a symbiotic manner. Each site can either be vacant or occupied by individuals of species $A$ and/or $B$. Multiple occupancy by the same species at a single site is prohibited. The name symbiotic comes from the fact that if only one species is present at a site then that particle dies with rate 1 but if both species are present then the death rate is reduced to $μ\le 1$ for each particle at that site. We show the critical birth rate $λ_c(μ)$ for weak survival is of order $\sqrtμ$ as $μ\to 0$. Mean-field calculations predict that when $μ< 1/2$ there is a discontinuous transition as $λ$ is varied. In contrast, we show that, in any dimension, the phase transition is continuous. To be fair to physicists the paper that introduced the model, the authors say that the symbiotic contact process is in the directed percolation universality class and hence has a continuous transition. However, a 2018 paper asserts that the transition is discontinuous above the upper critical dimension, which is 4 for oriented percolation.
△ Less
Submitted 9 December, 2019; v1 submitted 3 April, 2019;
originally announced April 2019.
-
Average nearest neighbor degrees in scale-free networks
Authors:
Dong Yao,
Pim van der Hoorn,
Nelly Litvak
Abstract:
The average nearest neighbor degree (ANND) of a node of degree $k$ is widely used to measure dependencies between degrees of neighbor nodes in a network. We formally analyze ANND in undirected random graphs when the graph size tends to infinity. The limiting behavior of ANND depends on the variance of the degree distribution. When the variance is finite, the ANND has a deterministic limit. When th…
▽ More
The average nearest neighbor degree (ANND) of a node of degree $k$ is widely used to measure dependencies between degrees of neighbor nodes in a network. We formally analyze ANND in undirected random graphs when the graph size tends to infinity. The limiting behavior of ANND depends on the variance of the degree distribution. When the variance is finite, the ANND has a deterministic limit. When the variance is infinite, the ANND scales with the size of the graph, and we prove a corresponding central limit theorem in the configuration model (CM, a network with random connections). As ANND proved uninformative in the infinite variance scenario, we propose an alternative measure, the average nearest neighbor rank (ANNR). We prove that ANNR converges to a deterministic function whenever the degree distribution has finite mean. We then consider the erased configuration model (ECM), where self-loops and multiple edges are removed, and investigate the well-known `structural negative correlations', or `finite-size effects', that arise in simple graphs, such as ECM, because large nodes can only have a limited number of large neighbors. Interestingly, we prove that for any fixed $k$, ANNR in ECM converges to the same limit as in CM. However, numerical experiments show that finite-size effects occur when $k$ scales with $n$.
△ Less
Submitted 29 December, 2017; v1 submitted 19 April, 2017;
originally announced April 2017.
-
Optimal Drift Rate Control and Impulse Control for a Stochastic Inventory/Production System
Authors:
Ping Cao,
Dacheng Yao
Abstract:
In this paper, we consider joint drift rate control and impulse control for a stochastic inventory system under long-run average cost criterion. Assuming the inventory level must be nonnegative, we prove that a $\{(0,q^{\star},Q^{\star},S^{\star}),\{μ^{\star}(x): x\in[0, S^{\star}]\}\}$ policy is an optimal joint control policy, where the impulse control follows the control band policy…
▽ More
In this paper, we consider joint drift rate control and impulse control for a stochastic inventory system under long-run average cost criterion. Assuming the inventory level must be nonnegative, we prove that a $\{(0,q^{\star},Q^{\star},S^{\star}),\{μ^{\star}(x): x\in[0, S^{\star}]\}\}$ policy is an optimal joint control policy, where the impulse control follows the control band policy $(0,q^{\star},Q^{\star},S^{\star})$, that brings the inventory level up to $q^{\star}$ once it drops to $0$ and brings it down to $Q^{\star}$ once it rises to $S^{\star}$, and the drift rate only depends on the current inventory level and is given by function $μ^{\star}(x)$ for the inventory level $x\in[0,S^{\star}]$. The optimality of the $\{(0,q^{\star},Q^{\star},S^{\star}),\{μ^{\star}(x): x\in[0,S^{\star}]\}\}$ policy is proven by using a lower bound approach, in which a critical step is to prove the existence and uniqueness of optimal policy parameters. To prove the existence and uniqueness, we develop a novel analytical method to solve a free boundary problem consisting of an ordinary differential equation (ODE) and several free boundary conditions. Furthermore, we find that the optimal drift rate $μ^{\star}(x)$ is firstly increasing and then decreasing as $x$ increases from $0$ to $S^{\star}$ with a turnover point between $Q^{\star}$ and $S^{\star}$.
△ Less
Submitted 25 September, 2017; v1 submitted 7 November, 2016;
originally announced November 2016.
-
Joint pricing and inventory control for a stochastic inventory system with Brownian motion demand
Authors:
Dacheng Yao
Abstract:
In this paper, we consider an infinite horizon, continuous-review, stochastic inventory system in which cumulative customers' demand is price-dependent and is modeled as a Brownian motion. Excess demand is backlogged. The revenue is earned by selling products and the costs are incurred by holding/shortage and ordering, the latter consists of a fixed cost and a proportional cost. Our objective is t…
▽ More
In this paper, we consider an infinite horizon, continuous-review, stochastic inventory system in which cumulative customers' demand is price-dependent and is modeled as a Brownian motion. Excess demand is backlogged. The revenue is earned by selling products and the costs are incurred by holding/shortage and ordering, the latter consists of a fixed cost and a proportional cost. Our objective is to simultaneously determine a pricing strategy and an inventory control strategy to maximize the expected long-run average profit. Specifically, the pricing strategy provides the price $p_t$ for any time $t\geq0$ and the inventory control strategy characterizes when and how much we need to order. We show that an $(s^*,S^*,p^*)$ policy is optimal and obtain the equations of optimal policy parameters, where $p^*=\{p_t^*:t\geq 0\}$. Furthermore, we find that at each time $t$, the optimal price $p_t^*$ depends on the current inventory level $z$, and it is increasing in $[s^*,z^*]$ and is decreasing in $[z^*,\infty)$, where $z^*$ is a negative level.
△ Less
Submitted 11 July, 2017; v1 submitted 9 August, 2016;
originally announced August 2016.
-
Optimal Control of a Levy Inventory System: The Optimality of Control Band Policy
Authors:
Jinbiao Wu,
Haolin Feng,
Dacheng Yao
Abstract:
We consider an inventory system whose state is modeled by a Lévy process. There are two types of costs--the running costs and the inventory control costs. The running costs (also known as the holding/penalty costs) are incurred continuously at some rate as a function of the inventory state. The inventory control costs, incurred only when interventions of the inventory state are placed, have both a…
▽ More
We consider an inventory system whose state is modeled by a Lévy process. There are two types of costs--the running costs and the inventory control costs. The running costs (also known as the holding/penalty costs) are incurred continuously at some rate as a function of the inventory state. The inventory control costs, incurred only when interventions of the inventory state are placed, have both a fixed and a variable component. The objective is to minimize the expectation of the infinite horizon discounted costs. We formulate this as a stochastic impulse control problem. In our setting, we obtain analytical results that are of significant implications. Specifically, we establish the existence of the optimal control, and we provide the solution in closed-form. More importantly, we prove the optimality of the simple control band policy. Furthermore, we investigate the transient and the steady-state behavior of the controlled process and the stochastic decomposition property.
△ Less
Submitted 31 August, 2016; v1 submitted 28 July, 2016;
originally announced July 2016.
-
Matching Supply and Demand in Production-Inventory Systems: Asymptotics and Optimization
Authors:
Yingdong Lu,
Mark S. Squillante,
David D. Yao
Abstract:
We consider a general class of high-volume, fast-moving production-inventory systems based on both lost-sales and backorder inventory models. Such systems require a fundamental understanding of the asymptotic behavior of key performance measures under various supply strategies, as well as the pre-planning of these strategies. Our analysis relies on a thorough study of the asymptotic behavior of a…
▽ More
We consider a general class of high-volume, fast-moving production-inventory systems based on both lost-sales and backorder inventory models. Such systems require a fundamental understanding of the asymptotic behavior of key performance measures under various supply strategies, as well as the pre-planning of these strategies. Our analysis relies on a thorough study of the asymptotic behavior of a random walk with power drift, which is of independent interest. In addition to providing key insights, our analysis leads to approximations of the corresponding optimization problem that yield simple solutions which are close to optimal. We also establish an equivalence between the lost-sales and backorder models when both have the same penalty cost that becomes large.
△ Less
Submitted 28 January, 2015;
originally announced January 2015.
-
Optimal Ordering Policy for Inventory Systems with Quantity-Dependent Setup Costs
Authors:
Shuangchi He,
Dacheng Yao,
Hanqin Zhang
Abstract:
We consider a continuous-review inventory system in which the setup cost of each order is a general function of the order quantity and the demand process is modeled as a Brownian motion with a positive drift. Assuming the holding and shortage cost to be a convex function of the inventory level, we obtain the optimal ordering policy that minimizes the long-run average cost by a lower bound approach…
▽ More
We consider a continuous-review inventory system in which the setup cost of each order is a general function of the order quantity and the demand process is modeled as a Brownian motion with a positive drift. Assuming the holding and shortage cost to be a convex function of the inventory level, we obtain the optimal ordering policy that minimizes the long-run average cost by a lower bound approach. To tackle some technical issues in the lower bound approach under the quantity-dependent setup cost assumption, we establish a comparison theorem that enables one to prove the global optimality of a policy by examining a tractable subset of admissible policies. Since the smooth pasting technique does not apply to our Brownian inventory model, we also propose a selection procedure for computing the optimal policy parameters when the setup cost is a step function.
△ Less
Submitted 25 March, 2016; v1 submitted 5 January, 2015;
originally announced January 2015.
-
Optimal Control of Brownian Inventory Models with Convex Inventory Cost: Discounted Cost Case
Authors:
Jim Dai,
Dacheng Yao
Abstract:
We consider an inventory system in which inventory level fluctuates as a Brownian motion in the absence of control. The inventory continuously accumulates cost at a rate that is a general convex function of the inventory level, which can be negative when there is a backlog. At any time, the inventory level can be adjusted by a positive or negative amount, which incurs a fixed positive cost and a p…
▽ More
We consider an inventory system in which inventory level fluctuates as a Brownian motion in the absence of control. The inventory continuously accumulates cost at a rate that is a general convex function of the inventory level, which can be negative when there is a backlog. At any time, the inventory level can be adjusted by a positive or negative amount, which incurs a fixed positive cost and a proportional cost. The challenge is to find an adjustment policy that balances the inventory cost and adjustment cost to minimize the expected total discounted cost. We provide a tutorial on using a three-step lower-bound approach to solving the optimal control problem under a discounted cost criterion. In addition, we prove that a four-parameter control band policy is optimal among all feasible policies. A key step is the constructive proof of the existence of a unique solution to the free boundary problem. The proof leads naturally to an algorithm to compute the four parameters of the optimal control band policy.
△ Less
Submitted 29 October, 2011;
originally announced October 2011.
-
Optimal Control of Brownian Inventory Models with Convex Holding Cost: Average Cost Case
Authors:
Jim Dai,
Dacheng Yao
Abstract:
We consider an inventory system in which inventory level fluctuates as a Brownian motion in the absence of control. The inventory continuously accumulates cost at a rate that is a general convex function of the inventory level, which can be negative when there is a backlog. At any time, the inventory level can be adjusted by a positive or negative amount, which incurs a fixed cost and a proportion…
▽ More
We consider an inventory system in which inventory level fluctuates as a Brownian motion in the absence of control. The inventory continuously accumulates cost at a rate that is a general convex function of the inventory level, which can be negative when there is a backlog. At any time, the inventory level can be adjusted by a positive or negative amount, which incurs a fixed cost and a proportional cost. The challenge is to find an adjustment policy that balances the holding cost and adjustment cost to minimize the long-run average cost. When both upward and downward fixed costs are positive, our model is an impulse control problem. When both fixed costs are zero, our model is a singular or instantaneous control problem. For the impulse control problem, we prove that a four-parameter control band policy is optimal among all feasible policies. For the singular control problem, we prove that a two-parameter control band policy is optimal.
We use a lower-bound approach, widely known as "the verification theorem", to prove the optimality of a control band policy for both the impulse and singular control problems. Our major contribution is to prove the existence of a "smooth" solution to the free boundary problem under some mild assumptions on the holding cost function. The existence proof leads naturally to numerical algorithms to compute the optimal control band parameters. We demonstrate that the lower-bound approach also works for Brownian inventory model in which no inventory backlog is allowed. In a companion paper, we will show how the lower-bound approach can be adapted to study a Brownian inventory model under a discounted cost criterion.
△ Less
Submitted 15 October, 2011; v1 submitted 12 October, 2011;
originally announced October 2011.
-
Linear Optimization over a Polymatroid with Side Constraints -- Scheduling Queues and Minimizing Submodular Functions
Authors:
Yingdong Lu,
David Yao
Abstract:
Two seemingly unrelated problems, scheduling a multiclass queueing system and minimizing a submodular function, share a rather deep connection via the polymatroid that is characterized by a submodular set function on the one hand and represents the performance polytope of the queueing system on the other hand. We first develop what we call a {\it grouping} algorithm that solves the queueing sche…
▽ More
Two seemingly unrelated problems, scheduling a multiclass queueing system and minimizing a submodular function, share a rather deep connection via the polymatroid that is characterized by a submodular set function on the one hand and represents the performance polytope of the queueing system on the other hand. We first develop what we call a {\it grouping} algorithm that solves the queueing scheduling problem under side constraints, with a computational effort of $O(n^3LP(n))$, $n$ being the number of job classes, and LP(n) being the computational efforts of solving a linear program with no more than $n$ variables and $n$ constraints. The algorithm organizes the job classes into groups, and identifies the optimal policy to be a priority rule across the groups and a randomized rule within each group (to enforce the side constraints). We then apply the grouping algorithm to the submodular function minimization, mapping the latter to a queueing scheduling problem with side constraints. %Each time the algorithm is applied, it identifies a subset; and We show the minimizing subset can be identified by applying the grouping algorithm $n$ times. Hence, this results in a algorithm that minimizes a submodular function with an effort of $O(n^4LP(n))$.
△ Less
Submitted 9 May, 2008; v1 submitted 9 April, 2008;
originally announced April 2008.
-
Stochastic Knapsack Problem Revisited: Switch-Over Policies and Dynamic Pricing
Authors:
Grace Lin,
Yingdong Lu,
David Yao
Abstract:
The stochastic knapsack has been used as a model in wide ranging applications from dynamic resource allocation to admission control in telecommunication. In recent years, a variation of the model has become a basic tool in studying problems that arise in revenue management and dynamic/flexible pricing; and it is in this context that our study is undertaken. Based on a dynamic programming formula…
▽ More
The stochastic knapsack has been used as a model in wide ranging applications from dynamic resource allocation to admission control in telecommunication. In recent years, a variation of the model has become a basic tool in studying problems that arise in revenue management and dynamic/flexible pricing; and it is in this context that our study is undertaken. Based on a dynamic programming formulation and associated properties of the value function, we study in this paper a class of control that we call switch-over policies -- start from accepting only orders of the highest price, and switch to including lower prices as time goes by, with the switch-over times optimally decided via convex programming. We establish the asymptotic optimality of the switch-over policy, and develop pricing models based on this policy to optimize the price reductions over the decision horizon.
△ Less
Submitted 8 August, 2007;
originally announced August 2007.
-
Heavy-Traffic Optimality of a Stochastic Network under Utility-Maximizing Resource Control
Authors:
Heng-Qing Ye,
David D. Yao
Abstract:
We study a stochastic network that consists of a set of servers processing multiple classes of jobs. Each class of jobs requires a concurrent occupancy of several servers while being processed, and each server is shared among the job classes in a head-of-the-line processor-sharing mechanism. The allocation of the service capacities is a real-time control mechanism: in each network state, the con…
▽ More
We study a stochastic network that consists of a set of servers processing multiple classes of jobs. Each class of jobs requires a concurrent occupancy of several servers while being processed, and each server is shared among the job classes in a head-of-the-line processor-sharing mechanism. The allocation of the service capacities is a real-time control mechanism: in each network state, the control is the solution to an optimization problem that maximizes a general utility function. Whereas this resource control optimizes in a ``greedy'' fashion, with respect to each state, we establish its asymptotic optimality in terms of (a) deriving the fluid and diffusion limits of the network under this control, and (b) identifying a cost function that is minimized in the diffusion limit, along with a characterization of the so-called fixed point state of the network.
△ Less
Submitted 5 January, 2006;
originally announced January 2006.