-
Structure-biased Maker-Breaker Games
Authors:
Wesley Pegden,
Francesca Yu
Abstract:
In classical Maker-Breaker games on graphs, Maker and Breaker take turns claiming edges; Maker's goal is to claim all of some structure (e.g., a spanning tree, Hamilton cycle, etc.), while Breaker aims to stop her. The standard question considered is how powerful a Breaker Maker can defeat; i.e., for the $(1:b)$-biased game where Breaker takes $b$ edges per turn, how large can $b$ be for Maker to still have a winning strategy, for various possible goal sets?
We introduce a variant of this question in which Breaker is required to choose their multiple edges as the edges of (a subgraph of) a given structure (e.g., a matching, clique, etc.) on each turn. We establish the order of magnitude of the threshold biases for triangle games, connectivity games, and Hamiltonicity games under clique, matching, and star biases respectively. We conclude that in many cases structure imposes a major obstruction on Breaker, opening up a set of games whose strategies deviate from the classical biased Maker-Breaker game strategies, and shedding light on the types of Breaker strategies that may or may not work to prove tighter bounds in the classical setting.
Submitted 27 May, 2025;
originally announced May 2025.
-
A Second-order method on graded meshes for fractional Laplacian via Riesz fractional derivative with a singular source term
Authors:
Minghua Chen,
Jianxing Han,
Jiankang Shi,
Fan Yu
Abstract:
The high-order numerical analysis for fractional Laplacian via the Riesz fractional derivative, under the low regularity solution, has presented significant challenges in the past decades. To fill in this gap, we design a grid mapping function on graded meshes to analyse the local truncation errors, which are far less than second-order convergence at the boundary layer. To restore the second-order global errors, we construct an appropriate right-preconditioner for the resulting matrix algebraic equation. We prove that the proposed scheme achieves second-order convergence on graded meshes even if the source term is singular or hypersingular. Numerical experiments illustrate the theoretical results. The proposed approach is applicable for multidimensional fractional diffusion equations, gradient flows and nonlinear equations.
Submitted 16 February, 2025;
originally announced February 2025.
-
Data integration using covariate summaries from external sources
Authors:
Facheng Yu,
Yuqian Zhang
Abstract:
In modern data analysis, information is frequently collected from multiple sources, often leading to challenges such as data heterogeneity and imbalanced sample sizes across datasets. Robust and efficient data integration methods are crucial for improving the generalization and transportability of statistical findings. In this work, we address scenarios where, in addition to having full access to individualized data from a primary source, supplementary covariate information from external sources is also available. While traditional data integration methods typically require individualized covariates from external sources, such requirements can be impractical due to limitations related to accessibility, privacy, storage, and cost. Instead, we propose novel data integration techniques that rely solely on external summary statistics, such as sample means and covariances, to construct robust estimators for the mean outcome under both homogeneous and heterogeneous data settings. Additionally, we extend this framework to causal inference, enabling the estimation of average treatment effects for both generalizability and transportability.
Submitted 26 November, 2024; v1 submitted 23 November, 2024;
originally announced November 2024.
-
The weighted and shifted seven-step BDF method for parabolic equations
Authors:
Georgios Akrivis,
Minghua Chen,
Fan Yu
Abstract:
Stability of the BDF methods of order up to five for parabolic equations can be established by the energy technique via Nevanlinna--Odeh multipliers. The nonexistence of Nevanlinna--Odeh multipliers makes the six-step BDF method special; however, the energy technique was recently extended by the authors in [Akrivis et al., SIAM J. Numer. Anal. \textbf{59} (2021) 2449--2472] and covers all six stable BDF methods. The seven-step BDF method is unstable for parabolic equations, since it is not even zero-stable. In this work, we construct and analyze a stable linear combination of two non zero-stable schemes, the seven-step BDF method and its shifted counterpart, referred to as the WSBDF7 method. The stability regions of the WSBDF$q$ methods, $q\leqslant 7$, with a weight $\vartheta\geqslant1$, increase as $\vartheta$ increases and are larger than the stability regions of the classical BDF$q$ methods, corresponding to $\vartheta=1$. We determine novel and suitable multipliers for the WSBDF7 method and establish stability for parabolic equations by the energy technique. The proposed approach is applicable to mean curvature flow, gradient flows, fractional equations and nonlinear equations.
Submitted 5 May, 2024;
originally announced May 2024.
-
Theoretical Guarantees for the Subspace-Constrained Tyler's Estimator
Authors:
Gilad Lerman,
Feng Yu,
Teng Zhang
Abstract:
This work analyzes the subspace-constrained Tyler's estimator (STE) designed for recovering a low-dimensional subspace within a dataset that may be highly corrupted with outliers. It assumes a weak inlier-outlier model and allows the fraction of inliers to be smaller than a fraction that leads to computational hardness of the robust subspace recovery problem. It shows that in this setting, if the initialization of STE, which is an iterative algorithm, satisfies a certain condition, then STE can effectively recover the underlying subspace. It further shows that under the generalized haystack model, STE initialized by Tyler's M-estimator (TME) can recover the subspace when the fraction of inliers is too small for TME to handle.
Submitted 12 April, 2024; v1 submitted 27 March, 2024;
originally announced March 2024.
-
Fill Probabilities in a Limit Order Book with State-Dependent Stochastic Order Flows
Authors:
Felix Lokin,
Fenghui Yu
Abstract:
This paper focuses on computing the fill probabilities for limit orders positioned at various price levels within the limit order book, which play a crucial role in optimizing executions. We adopt a generic stochastic model to capture the dynamics of the order book as a series of queueing systems. This generic model is state-dependent and also incorporates stylized factors. We subsequently derive semi-analytical expressions to compute the relevant probabilities within the context of state-dependent stochastic order flows. These probabilities cover various scenarios, including the probability of a change in the mid-price, the fill probabilities of orders posted at the best quotes, and those posted at a price level deeper than the best quotes in the book, before the opposite best quote moves. These expressions can be further generalized to accommodate orders posted even deeper in the order book, although the associated probabilities are typically very small in such cases. Lastly, we conduct extensive numerical experiments using real order book data from the foreign exchange spot market. Our findings suggest that the model is tractable and possesses the capability to effectively capture the dynamics of the limit order book. Moreover, the derived formulas and numerical methods demonstrate reasonably good accuracy in estimating the fill probabilities.
Submitted 4 March, 2024;
originally announced March 2024.
-
Multiple blowing-up solutions for a slightly critical Lane-Emden system with non-power nonlinearity
Authors:
Shengbing Deng,
Fang Yu
Abstract:
In this paper, we study the following Lane-Emden system with nearly critical non-power nonlinearity \begin{eqnarray*} \left\{ \arraycolsep=1.5pt
\begin{array}{lll} -Δu =\frac{|v|^{p-1}v}{[\ln(e+|v|)]^ε}\ \ &{\rm in}\ Ω, \\[2mm] -Δv =\frac{|u|^{q-1}u}{[\ln(e+|u|)]^ε}\ \ &{\rm in}\ Ω, \\[2mm] u= v=0 \ \ & {\rm on}\ \partialΩ, \end{array} \right. \end{eqnarray*} where $Ω$ is a bounded smooth domain in $\mathbb{R}^N$, $N\geq 3$, $ε>0$ is a small parameter, and $p$ and $q$ lie on the critical Sobolev hyperbola $\frac{1}{p+1}+\frac{1}{q+1}=\frac{N-2}{N}$. We construct multiple blowing-up solutions based on the finite dimensional Lyapunov-Schmidt reduction method as $ε$ goes to zero.
Submitted 8 November, 2023;
originally announced November 2023.
-
Filtration and splitting of the Hodge bundle on the non-varying strata of quadratic differentials
Authors:
Dawei Chen,
Fei Yu
Abstract:
We describe the Harder--Narasimhan filtration of the Hodge bundle for Teichmüller curves in the non-varying strata of quadratic differentials appearing in [CM2]. Moreover, we show that the Hodge bundle on the non-varying strata away from the irregular components can split as a direct sum of line bundles. As applications, we determine all individual Lyapunov exponents of algebraically primitive Teichmüller curves in the non-varying strata and derive new results regarding the asymptotic behavior of Lyapunov exponents.
Submitted 11 June, 2023;
originally announced June 2023.
-
Sign changing bubble tower solutions to a slightly subcritical elliptic problem with non-power nonlinearity
Authors:
Shengbing Deng,
Fang Yu
Abstract:
We study the following elliptic problem involving slightly subcritical non-power nonlinearity $$\left\{\begin{array}{lll} -Δu =\frac{|u|^{2^*-2}u}{[\ln(e+|u|)]^ε}\ \ &{\rm in}\ Ω, \\[2mm] u= 0 \ \ & {\rm on}\ \partialΩ, \end{array} \right.$$ where $Ω$ is a bounded smooth domain in $\mathbb{R}^n$, $n\geq 3$, $2^*=\frac{2n}{n-2}$ is the critical Sobolev exponent, $ε>0$ is a small parameter. By the finite dimensional Lyapunov-Schmidt reduction method, we construct a sign changing bubble tower solution with the shape of a tower of bubbles as $ε$ goes to zero.
Submitted 5 June, 2023;
originally announced June 2023.
-
Seeding with Differentially Private Network Information
Authors:
M. Amin Rahimian,
Fang-Yi Yu,
Yuxin Liu,
Carlos Hurtado
Abstract:
In public health interventions such as the distribution of preexposure prophylaxis (PrEP) for HIV prevention, decision makers rely on seeding algorithms to identify key individuals who can amplify the impact of their interventions. In such cases, building a complete sexual activity network is often infeasible due to privacy concerns. Instead, contact tracing can provide influence samples, that is, sequences of sexual contacts without requiring complete network information. This presents two challenges: protecting individual privacy in contact data and adapting seeding algorithms to work effectively with incomplete network information. To solve these two problems, we study privacy guarantees for influence maximization algorithms when the social network is unknown and the inputs are samples of prior influence cascades that are collected at random and need privacy protection. Building on recent results that address seeding with costly network information, our privacy-preserving algorithms introduce randomization in the collected data or the algorithm output and can bound the privacy loss of each node (or group of nodes) in deciding to include their data in the algorithm input. We provide theoretical guarantees of seeding performance with a limited sample size subject to differential privacy budgets in both central and local privacy regimes. Simulations on synthetic random graphs and empirically grounded sexual contacts of men who have sex with men reveal the diminishing value of network information with decreasing privacy budget in both regimes and graceful decrease in performance with decreasing privacy budget in the central regime. Achieving good performance with local privacy guarantees requires relatively higher privacy budgets that confirm our theoretical expectations.
Submitted 30 October, 2024; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Variable step-size BDF3 method for Allen-Cahn equation
Authors:
Minghua Chen,
Fan Yu,
Qingdong Zhang,
Zhimin Zhang
Abstract:
In this work, we analyze the three-step backward differentiation formula (BDF3) method for solving the Allen-Cahn equation on variable grids. For the BDF2 method, the discrete orthogonal convolution (DOC) kernels are positive, and the stability and convergence analysis is well established in [Liao and Zhang, Math. Comp., \textbf{90} (2021) 1207--1226; Chen, Yu, and Zhang, SIAM J. Numer. Anal., Major Revised]. However, the numerical analysis for the BDF3 method with variable steps seems to be highly nontrivial, since the DOC kernels are not always positive. By developing a novel spectral norm inequality, the unconditional stability and convergence are rigorously proved under the updated step ratio restriction $r_k:=τ_k/τ_{k-1}\leq 1.405$ (compared with $r_k\leq 1.199$ in [Calvo and Grigorieff, BIT \textbf{42} (2002) 689--701]) for the BDF3 method. Finally, numerical experiments are performed to illustrate the theoretical results. To the best of our knowledge, this is the first theoretical analysis of the variable-step BDF3 method for the Allen-Cahn equation.
Submitted 6 April, 2022; v1 submitted 27 December, 2021;
originally announced December 2021.
-
Unbiased Estimation of the Hessian for Partially Observed Diffusions
Authors:
Neil K. Chada,
Ajay Jasra,
Fangyuan Yu
Abstract:
In this article we consider the development of unbiased estimators of the Hessian, of the log-likelihood function with respect to parameters, for partially observed diffusion processes. These processes arise in numerous applications, where such diffusions require derivative information, either through the Jacobian or Hessian matrix. As time-discretizations of diffusions induce a bias, we provide an unbiased estimator of the Hessian. This is based on using Girsanov's Theorem and randomization schemes developed through McLeish [2011] and Rhee & Glynn [2015]. We demonstrate that our developed estimator of the Hessian is unbiased and of finite variance. We numerically test and verify this by comparing the methodology here to that of a newly proposed particle filtering methodology. We test this on a range of diffusion models, which include different Ornstein--Uhlenbeck processes and the Fitzhugh--Nagumo model, arising in neuroscience.
Submitted 6 September, 2021;
originally announced September 2021.
-
Weighted and shifted BDF2 methods on variable grids
Authors:
Minghua Chen,
Fan Yu,
Qingdong Zhang
Abstract:
Variable steps implicit-explicit multistep methods for PDEs have been presented in [17], where the zero-stability is studied for ODEs; however, the stability analysis still remains an open question for PDEs. Based on the idea of linear multistep methods, we present simple weighted and shifted BDF2 methods with variable steps for parabolic problems, which serve as a bridge between the BDF2 and Crank-Nicolson schemes. The contributions of this paper are as follows: we first establish the optimal adjacent time-step ratios for the weighted and shifted BDF2 methods, which greatly improve the maximum time-step ratios for BDF2 in [11,15]. Moreover, the unconditional stability and optimal convergence are rigorously proved, which fills the gap in the theory for PDEs in [17]. Finally, numerical experiments are given to illustrate the theoretical results.
Submitted 5 August, 2021;
originally announced August 2021.
-
Optimal Scoring Rule Design under Partial Knowledge
Authors:
Yiling Chen,
Fang-Yi Yu
Abstract:
This paper studies the design of optimal proper scoring rules when the principal has partial knowledge of an agent's signal distribution. Recent work characterizes the proper scoring rules that maximize the increase of an agent's payoff when the agent chooses to access a costly signal to refine a posterior belief from her prior prediction, under the assumption that the agent's signal distribution is fully known to the principal. In our setting, the principal only knows about a set of distributions where the agent's signal distribution belongs. We formulate the scoring rule design problem as a max-min optimization that maximizes the worst-case increase in payoff across the set of distributions.
We propose an efficient algorithm to compute an optimal scoring rule when the set of distributions is finite, and devise a fully polynomial-time approximation scheme that accommodates various infinite sets of distributions. We further remark that widely used scoring rules, such as the quadratic and log rules, as well as previously identified optimal scoring rules under full knowledge, can be far from optimal in our partial knowledge settings.
Submitted 11 October, 2024; v1 submitted 15 July, 2021;
originally announced July 2021.
-
Randomized multilevel Monte Carlo for embarrassingly parallel inference
Authors:
Ajay Jasra,
Kody J. H. Law,
Alexander Tarakanov,
Fangyuan Yu
Abstract:
This position paper summarizes a recently developed research program focused on inference in the context of data centric science and engineering applications, and forecasts its trajectory forward over the next decade. Often one endeavours in this context to learn complex systems in order to make more informed predictions and high stakes decisions under uncertainty. Some key challenges which must be met in this context are robustness, generalizability, and interpretability. The Bayesian framework addresses these three challenges elegantly, while bringing with it a fourth, undesirable feature: it is typically far more expensive than its deterministic counterparts. In the 21st century, and increasingly over the past decade, a growing number of methods have emerged which allow one to leverage cheap low-fidelity models in order to precondition algorithms for performing inference with more expensive models and make Bayesian inference tractable in the context of high-dimensional and expensive models. Notable examples are multilevel Monte Carlo (MLMC), multi-index Monte Carlo (MIMC), and their randomized counterparts (rMLMC), which are able to provably achieve a dimension-independent (including $\infty-$dimension) canonical complexity rate with respect to mean squared error (MSE) of $1/$MSE. Some parallelizability is typically lost in an inference context, but recently this has been largely recovered via novel double randomization approaches. Such an approach delivers i.i.d. samples of quantities of interest which are unbiased with respect to the infinite resolution target distribution. Over the coming decade, this family of algorithms has the potential to transform data centric science and engineering, as well as classical machine learning applications such as deep learning, by scaling up and scaling out fully Bayesian inference.
Submitted 3 December, 2021; v1 submitted 5 July, 2021;
originally announced July 2021.
-
BDF$6$ SAV schemes for time-fractional Allen-Cahn dissipative systems
Authors:
Fan Yu,
Minghua Chen
Abstract:
Recently, the error analysis of BDF$k$ $(1\leqslant k\leqslant5)$ SAV (scalar auxiliary variable) schemes was given in \cite{Huangg:20} for the classical Allen-Cahn equation. However, it remains unavailable for BDF$6$ SAV schemes. In this paper, we construct and analyze BDF$6$ SAV schemes for time-fractional dissipative systems. We carry out a rigorous error analysis for the time-fractional Allen-Cahn equation, which also fills a gap for the classical case. Finally, a numerical experiment is shown to illustrate the effectiveness of the presented methods. As far as we know, these are the first SAV schemes for time-fractional dissipative systems.
Submitted 18 April, 2021;
originally announced April 2021.
-
ALMA: Alternating Minimization Algorithm for Clustering Mixture Multilayer Network
Authors:
Xing Fan,
Marianna Pensky,
Feng Yu,
Teng Zhang
Abstract:
The paper considers a Mixture Multilayer Stochastic Block Model (MMLSBM), where layers can be partitioned into groups of similar networks, and networks in each group are equipped with a distinct Stochastic Block Model. The goal is to partition the multilayer network into clusters of similar layers, and to identify communities in those layers. Jing et al. (2020) introduced the MMLSBM and developed a clustering methodology, TWIST, based on regularized tensor decomposition. The present paper proposes a different technique, an alternating minimization algorithm (ALMA), that aims at simultaneous recovery of the layer partition, together with estimation of the matrices of connection probabilities of the distinct layers. Compared to TWIST, ALMA achieves higher accuracy both theoretically and numerically.
Submitted 12 October, 2021; v1 submitted 19 February, 2021;
originally announced February 2021.
-
Eigenvalue-corrected Natural Gradient Based on a New Approximation
Authors:
Kai-Xin Gao,
Xiao-Lei Liu,
Zheng-Hai Huang,
Min Wang,
Shuangling Wang,
Zidong Wang,
Dachuan Xu,
Fan Yu
Abstract:
Using second-order optimization methods for training deep neural networks (DNNs) has attracted many researchers. A recently proposed method, Eigenvalue-corrected Kronecker Factorization (EKFAC) (George et al., 2018), interprets the natural gradient update as a diagonal method and corrects the inaccurate re-scaling factor in the Kronecker-factored eigenbasis. Gao et al. (2020) consider a new approximation to the natural gradient, which approximates the Fisher information matrix (FIM) as a constant multiplied by the Kronecker product of two matrices and keeps the trace equal before and after the approximation. In this work, we combine the ideas of these two methods and propose Trace-restricted Eigenvalue-corrected Kronecker Factorization (TEKFAC). The proposed method not only corrects the inexact re-scaling factor under the Kronecker-factored eigenbasis, but also considers the new approximation method and the effective damping technique proposed in Gao et al. (2020). We also discuss the differences and relationships among the Kronecker-factored approximations. Empirically, our method outperforms SGD with momentum, Adam, EKFAC and TKFAC on several DNNs.
Submitted 27 November, 2020;
originally announced November 2020.
-
A Trace-restricted Kronecker-Factored Approximation to Natural Gradient
Authors:
Kai-Xin Gao,
Xiao-Lei Liu,
Zheng-Hai Huang,
Min Wang,
Zidong Wang,
Dachuan Xu,
Fan Yu
Abstract:
Second-order optimization methods have the ability to accelerate convergence by modifying the gradient through the curvature matrix. There have been many attempts to use second-order optimization methods for training deep neural networks. Inspired by diagonal approximations and factored approximations such as Kronecker-Factored Approximate Curvature (KFAC), we propose a new approximation to the Fisher information matrix (FIM) called Trace-restricted Kronecker-factored Approximate Curvature (TKFAC) in this work, which preserves a certain trace relationship between the exact and the approximate FIM. In TKFAC, we decompose each block of the approximate FIM as a Kronecker product of two smaller matrices, scaled by a coefficient related to the trace. We theoretically analyze TKFAC's approximation error and give an upper bound for it. We also propose a new damping technique for TKFAC on convolutional neural networks to maintain the superiority of second-order optimization methods during training. Experiments show that our method has better performance compared with several state-of-the-art algorithms on some deep network architectures.
Submitted 21 November, 2020;
originally announced November 2020.
-
Multilevel Ensemble Kalman-Bucy Filters
Authors:
Neil K. Chada,
Ajay Jasra,
Fangyuan Yu
Abstract:
In this article we consider the linear filtering problem in continuous-time. We develop and apply multilevel Monte Carlo (MLMC) strategies for ensemble Kalman-Bucy filters (EnKBFs). These filters can be viewed as approximations of conditional McKean-Vlasov-type diffusion processes. They are also interpreted as the continuous-time analogue of the \textit{ensemble Kalman filter}, which has proven to be successful due to its applicability and computational cost. We prove that an ideal version of our multilevel EnKBF can achieve a mean square error (MSE) of $\mathcal{O}(ε^2), \ ε>0$ with a cost of order $\mathcal{O}(ε^{-2}\log(ε)^2)$. In order to prove this result we provide a Monte Carlo convergence and approximation bounds associated to time-discretized EnKBFs. This implies a reduction in cost compared to the (single level) EnKBF which requires a cost of $\mathcal{O}(ε^{-3})$ to achieve an MSE of $\mathcal{O}(ε^2)$. We test our theory on a linear problem, which we motivate through high-dimensional examples of order $\sim \mathcal{O}(10^4)$ and $\mathcal{O}(10^5)$.
Submitted 5 April, 2021; v1 submitted 9 November, 2020;
originally announced November 2020.
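To give a feel for the two cost orders quoted in the abstract above, the following sketch evaluates $ε^{-2}\log(ε)^2$ and $ε^{-3}$ for a few target accuracies. It is only arithmetic on the stated orders: all constants hidden by the $\mathcal{O}(\cdot)$ notation are set to 1, which is an assumption, and the function names are hypothetical.

```python
import math

def multilevel_cost(eps):
    """Cost order eps^-2 * log(eps)^2 quoted for the multilevel EnKBF (constant set to 1)."""
    return eps ** -2 * math.log(eps) ** 2

def single_level_cost(eps):
    """Cost order eps^-3 quoted for the single-level EnKBF (constant set to 1)."""
    return eps ** -3

def cost_ratio(eps):
    """How many times more expensive the single-level order is at accuracy eps."""
    return single_level_cost(eps) / multilevel_cost(eps)

for eps in (1e-1, 1e-2, 1e-3):
    print(f"eps={eps:g}  multilevel~{multilevel_cost(eps):.3g}  "
          f"single-level~{single_level_cost(eps):.3g}  ratio~{cost_ratio(eps):.3g}")
```

Because the ratio behaves like $1/(ε\log(ε)^2)$, the gap widens as the target accuracy tightens; the actual crossover depends on the constants, which this sketch ignores.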
-
Backward difference formula: The energy technique for subdiffusion equation
Authors:
Minghua Chen,
Fan Yu,
Zhi Zhou
Abstract:
Based on the equivalence of A-stability and G-stability, the energy technique of the six-step BDF method for the heat equation has been discussed in [Akrivis, Chen, Yu, Zhou, Math. Comp., Revised]. Unfortunately, this theory is hard to extend to time-fractional PDEs. In this work, we consider three types of subdiffusion models, namely single-term, multi-term and distributed order fractional diffusion equations. We present a novel and concise stability analysis of time stepping schemes generated by the $k$-step backward difference formula (BDF$k$), for approximately solving the subdiffusion equation. The analysis mainly relies on the energy technique by applying the Grenander-Szegö theorem. This kind of argument has been widely used to confirm the stability of various $A$-stable schemes (e.g., $k=1,2$). However, it is not an easy task for the higher-order BDF methods, due to the loss of $A$-stability. The main objective of this paper is to fill this gap.
Submitted 25 October, 2020;
originally announced October 2020.
-
The energy technique for the six-step BDF method
Authors:
Georgios Akrivis,
Minghua Chen,
Fan Yu,
Zhi Zhou
Abstract:
In combination with the Grenander--Szegö theorem, we observe that a relaxed positivity condition on multipliers, milder than the basic requirement of the Nevanlinna--Odeh multipliers that the sum of the absolute values of their components is strictly less than $1$, makes the energy technique applicable to the stability analysis of BDF methods for parabolic equations with selfadjoint elliptic part. This is particularly useful for the six-step BDF method for which no Nevanlinna--Odeh multiplier exists. We introduce multipliers satisfying the positivity property for the six-step BDF method and establish stability of the method for parabolic equations.
Submitted 17 July, 2020;
originally announced July 2020.
-
Phase retrieval of complex-valued objects via a randomized Kaczmarz method
Authors:
Teng Zhang,
Feng Yu
Abstract:
This paper investigates the convergence of the randomized Kaczmarz algorithm for the problem of phase retrieval of complex-valued objects. While this algorithm has been studied for the real-valued case, its generalization to the complex-valued case is nontrivial and has been left as a conjecture. This paper establishes the connection between the convergence of the algorithm and the convexity of an objective function. Based on the connection, it demonstrates that when the sensing vectors are sampled uniformly from a unit sphere and the number of sensing vectors $m$ satisfies $m>O(n\log n)$ as $n, m\rightarrow\infty$, then this algorithm with a good initialization achieves linear convergence to the solution with high probability.
Submitted 13 October, 2020; v1 submitted 7 May, 2020;
originally announced May 2020.
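The setting in the abstract above (unit-sphere sensing vectors, magnitude-only measurements, a good initialization) can be sketched with the commonly used form of the randomized Kaczmarz update for phase retrieval: pick a random measurement, copy the phase of the current estimate onto the measured magnitude, and project onto the resulting linear constraint. This is a minimal sketch under stated assumptions, not the paper's code: the function names, problem sizes, perturbation level and iteration count are all illustrative choices.

```python
import numpy as np

def randomized_kaczmarz_phase(A, b, z0, n_iter, rng):
    """Randomized Kaczmarz sketch for b[r] = |a_r^* x| with rows a_r of A."""
    z = z0.astype(complex)
    m = A.shape[0]
    for _ in range(n_iter):
        r = rng.integers(m)          # sample one measurement uniformly
        a = A[r]
        inner = np.vdot(a, z)        # a_r^* z (vdot conjugates its first argument)
        if inner == 0:
            continue                 # phase undefined; skip this draw
        phase = inner / abs(inner)
        # Project z onto the hyperplane {w : a_r^* w = b[r] * phase}.
        z = z - ((inner - b[r] * phase) / np.vdot(a, a).real) * a
    return z

def dist_up_to_phase(z, x):
    """Distance between z and x after removing the best global phase."""
    c = np.vdot(z, x)
    phase = c / abs(c) if c != 0 else 1.0
    return np.linalg.norm(x - phase * z)

rng = np.random.default_rng(0)
n, m = 8, 80                         # illustrative sizes
x_true = rng.standard_normal(n) + 1j * rng.standard_normal(n)
A = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
A /= np.linalg.norm(A, axis=1, keepdims=True)   # rows uniform on the unit sphere
b = np.abs(A.conj() @ x_true)                   # magnitude-only measurements

# "Good initialization": the truth plus a 5% relative perturbation (an assumption).
noise = rng.standard_normal(n) + 1j * rng.standard_normal(n)
z0 = x_true + 0.05 * np.linalg.norm(x_true) * noise / np.linalg.norm(noise)

z = randomized_kaczmarz_phase(A, b, z0, 4000, rng)
err0 = dist_up_to_phase(z0, x_true)
err = dist_up_to_phase(z, x_true)
print(err0, err)
```

The error is measured up to a global phase because the measurements cannot distinguish $x$ from $e^{iθ}x$.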
-
Unbiased Filtering of a Class of Partially Observed Diffusions
Authors:
Ajay Jasra,
Kody Law,
Fangyuan Yu
Abstract:
In this article we consider a Monte Carlo-based method to filter partially observed diffusions observed at regular and discrete times. Given access only to Euler discretizations of the diffusion process, we present a new procedure which can return online estimates of the filtering distribution with no discretization bias and finite variance. Our approach is based upon a novel double application of the randomization methods of Rhee & Glynn (2015) along with the multilevel particle filter (MLPF) approach of Jasra et al (2017). A numerical comparison of our new approach with the MLPF, on a single processor, shows that similar errors are possible for a mild increase in computational cost. However, the new method scales strongly to arbitrarily many processors.
Submitted 11 February, 2020; v1 submitted 10 February, 2020;
originally announced February 2020.
-
An Algorithm for Graph-Fused Lasso Based on Graph Decomposition
Authors:
Feng Yu,
Yi Yang,
Teng Zhang
Abstract:
This work proposes a new algorithm for solving the graph-fused lasso (GFL), a method for parameter estimation that operates under the assumption that the signal tends to be locally constant over a predefined graph structure. The proposed method applies the alternating direction method of multipliers (ADMM) algorithm and is based on the decomposition of the objective function into two components. While ADMM has been widely used in this problem, existing works such as network lasso decompose the objective function into the loss function component and the total variation penalty component. In comparison, this work proposes to decompose the objective function into two components, where one component is the loss function plus part of the total variation penalty, and the other component is the remaining total variation penalty. Compared with the network lasso algorithm, this method has a smaller computational cost per iteration and converges faster in most simulations numerically.
Submitted 6 August, 2019;
originally announced August 2019.
-
Multilevel Particle Filters for the Non-Linear Filtering Problem in Continuous Time
Authors:
Ajay Jasra,
Fangyuan Yu,
Jeremy Heng
Abstract:
In the following article we consider the numerical approximation of the non-linear filter in continuous-time, where the observations and signal follow diffusion processes. Given access to high-frequency, but discrete-time observations, we resort to a first order time discretization of the non-linear filter, followed by an Euler discretization of the signal dynamics. In order to approximate the associated discretized non-linear filter, one can use a particle filter (PF). Under assumptions, this can achieve a mean square error of $\mathcal{O}(ε^2)$, for $ε>0$ arbitrary, such that the associated cost is $\mathcal{O}(ε^{-4})$. We prove, under assumptions, that the multilevel particle filter (MLPF) of Jasra et al (2017) can achieve a mean square error of $\mathcal{O}(ε^2)$, for cost $\mathcal{O}(ε^{-3})$. This is supported by numerical simulations in several examples.
Submitted 9 June, 2020; v1 submitted 15 July, 2019;
originally announced July 2019.
-
Finite difference/spectral approximations for the two-dimensional time Caputo-Fabrizio fractional diffusion equation
Authors:
Fan Yu,
Minghua Chen
Abstract:
The main contribution of this work is to construct and analyze stable and high order schemes to efficiently solve the two-dimensional time Caputo-Fabrizio fractional diffusion equation. Based on a third-order finite difference method in time and spectral methods in space, the proposed scheme is unconditionally stable and has the global truncation error $\mathcal{O}(τ^3+N^{-m})$, where $τ$, $N$ and $m$ are the time step size, polynomial degree and regularity in the space variable of the exact solution, respectively. It should be noted that the global truncation error $\mathcal{O}(τ^2+N^{-m})$ is well established in [Li, Lv and Xu, Numer. Methods Partial Differ. Equ. (2019)]. Finally, some numerical experiments are carried out to verify the theoretical analysis. To the best of our knowledge, this is the first proof for the stability of the third-order scheme for the Caputo-Fabrizio fractional operator.
Submitted 20 August, 2020; v1 submitted 1 June, 2019;
originally announced June 2019.
-
ADMM for Efficient Deep Learning with Global Convergence
Authors:
Junxiang Wang,
Fuxun Yu,
Xiang Chen,
Liang Zhao
Abstract:
Alternating Direction Method of Multipliers (ADMM) has been used successfully in many conventional machine learning applications and is considered to be a useful alternative to Stochastic Gradient Descent (SGD) as a deep learning optimizer. However, as an emerging domain, several challenges remain, including 1) The lack of global convergence guarantees, 2) Slow convergence towards solutions, and 3) Cubic time complexity with regard to feature dimensions. In this paper, we propose a novel optimization framework for deep learning via ADMM (dlADMM) to address these challenges simultaneously. The parameters in each layer are updated backward and then forward so that the parameter information in each layer is exchanged efficiently. The time complexity is reduced from cubic to quadratic in (latent) feature dimensions via a dedicated algorithm design for subproblems that enhances them utilizing iterative quadratic approximations and backtracking. Finally, we provide the first proof of global convergence for an ADMM-based method (dlADMM) in a deep neural network problem under mild conditions. Experiments on benchmark datasets demonstrated that our proposed dlADMM algorithm outperforms most of the comparison methods.
Submitted 5 July, 2021; v1 submitted 31 May, 2019;
originally announced May 2019.
-
Higher-order accurate diffuse-domain methods for partial differential equations with Dirichlet boundary conditions in complex, evolving geometries
Authors:
Fei Yu,
Zhenlin Guo,
John Lowengrub
Abstract:
The diffuse-domain, or smoothed boundary, method is an attractive approach for solving partial differential equations in complex geometries because of its simplicity and flexibility. In this method the complex geometry is embedded into a larger, regular domain. The original PDE is reformulated using a smoothed characteristic function of the complex domain and source terms are introduced to approximate the boundary conditions. The reformulated equation, which is independent of the dimension and domain geometry, can be solved by standard numerical methods and the same solver can be used for any domain geometry. A challenge is making the method higher-order accurate. For Dirichlet boundary conditions, which we focus on here, current implementations demonstrate a wide range in their accuracy but generally the methods yield at best first order accuracy in $ε$, the parameter that characterizes the width of the region over which the characteristic function is smoothed. Typically, $ε\propto h$, the grid size. Here, we analyze the diffuse-domain PDEs using matched asymptotic expansions and explain the observed behaviors. Our analysis also identifies simple modifications to the diffuse-domain PDEs that yield higher-order accuracy in $ε$, e.g., $O(ε^2)$ in the $L^2$ norm and $O(ε^p)$ with $1.5\le p\le 2$ in the $L^{\infty}$ norm. Our analytic results are confirmed numerically in stationary and moving domains where the level set method is used to capture the dynamics of the domain boundary and to construct the smoothed characteristic function.
Submitted 27 November, 2019; v1 submitted 9 December, 2018;
originally announced December 2018.
-
Central Limit Theorems for Coupled Particle Filters
Authors:
Ajay Jasra,
Fangyuan Yu
Abstract:
In this article we prove a new central limit theorem (CLT) for coupled particle filters (CPFs). CPFs are used for the sequential estimation of the difference of expectations w.r.t. filters which are in some sense close. Examples include the estimation of the filtering distribution associated to different parameters (finite difference estimation) and filters associated to partially observed discretized diffusion processes (PODDP) and the implementation of the multilevel Monte Carlo (MLMC) identity. We develop new theory for CPFs and based upon several results, we propose a new CPF which approximates the maximal coupling (MCPF) of a pair of predictor distributions. In the context of ML estimation associated to PODDP with discretization $Δ_l$ we show that the MCPF and the approach in Jasra et al. (2018) have, under assumptions, an asymptotic variance that is upper-bounded by an expression that is (almost) $\mathcal{O}(Δ_l)$, uniformly in time. The $\mathcal{O}(Δ_l)$ rate preserves the so-called forward rate of the diffusion in some scenarios which is not the case for the CPF in Jasra et al (2017).
Submitted 22 October, 2018; v1 submitted 11 October, 2018;
originally announced October 2018.
-
Bifurcation Analysis in a Continuous-Time Information Model with Discrete and Distributed Delays
Authors:
Jingli Ren,
Fangzhi Yu
Abstract:
In this paper, we consider a continuous-time model with discrete and distributed delays to describe how two pieces of information interact in online social networks. Sufficient conditions are derived for the stability of each equilibrium. Taking time delay as a bifurcation parameter, the system undergoes a sequence of Hopf bifurcations when this parameter passes through a critical value. Using the method of multiple scales, we prove that the direction of the Hopf bifurcation depends on a condition related to the delay.
Submitted 20 October, 2016;
originally announced October 2016.
-
Global well-posedness of advective Lotka-Volterra competition systems with nonlinear diffusion
Authors:
Qi Wang,
Jingyue Yang,
Feng Yu
Abstract:
This paper investigates the global well-posedness of a class of reaction-advection-diffusion models with nonlinear diffusion and Lotka-Volterra dynamics. We prove the existence and uniform boundedness of the global-in-time solutions to the fully parabolic systems under certain growth conditions on the diffusion and sensitivity functions. Global existence and uniform boundedness of the corresponding parabolic-elliptic system are also obtained. Our results suggest that attraction (positive taxis) inhibits blowups in Lotka-Volterra competition systems.
Submitted 18 May, 2019; v1 submitted 17 May, 2016;
originally announced May 2016.
-
On Okounkov's conjecture connecting Hilbert schemes of points and multiple q-zeta values
Authors:
Zhenbo Qin,
Fei Yu
Abstract:
We compute the generating series for the intersection pairings between the total Chern classes of the tangent bundles of the Hilbert schemes of points on a smooth projective surface and the Chern characters of tautological bundles over these Hilbert schemes. Modulo the lower weight term, we verify Okounkov's conjecture [Oko] connecting these Hilbert schemes and multiple $q$-zeta values. In addition, this conjecture is completely proved when the surface is abelian. We also determine some universal constants in the sense of Boissière and Nieper-Wisskirchen [Boi, BN] regarding the total Chern classes of the tangent bundles of these Hilbert schemes. The main approach of this paper is to use the set-up of Carlsson and Okounkov outlined in [Car, CO] and the structure of the Chern character operators proved in [LQW2].
Submitted 3 October, 2015;
originally announced October 2015.
-
Stationary and time periodic patterns of two-predator and one-prey systems with prey-taxis
Authors:
Ke Wang,
Qi Wang,
Feng Yu
Abstract:
This paper concerns pattern formation in a class of reaction-advection-diffusion systems modeling the population dynamics of two predators and one prey. We consider the biological situation in which both predators forage along the population density gradient of the prey, which can defend themselves as a group. We prove the global existence and uniform boundedness of positive classical solutions for the fully parabolic system over a bounded domain with space dimension $N=1,2$ and for the parabolic-parabolic-elliptic system over higher space dimensions. Linearized stability analysis shows that prey-taxis stabilizes the positive constant equilibrium if there is no group defense while it destabilizes the equilibrium otherwise. Then we obtain stationary and time-periodic nontrivial solutions of the system that bifurcate from the positive constant equilibrium. Moreover, the stability of these solutions is also analyzed in detail, which provides a wave mode selection mechanism of nontrivial patterns for this strongly coupled system. Finally, we perform numerical simulations to illustrate and support our theoretical results.
Submitted 25 October, 2016; v1 submitted 16 August, 2015;
originally announced August 2015.
-
Structurally Stable Singularities for a Nonlinear Wave Equation
Authors:
Alberto Bressan,
Tao Huang,
Fang Yu
Abstract:
For the nonlinear wave equation $u_{tt} - c(u)\big(c(u) u_x\big)_x~=~0$, it is well known that solutions can develop singularities in finite time. For an open dense set of initial data, the present paper provides a detailed asymptotic description of the solution in a neighborhood of each singular point, where $|u_x|\to\infty$. The different structure of conservative and dissipative solutions is analyzed.
Submitted 30 March, 2015;
originally announced March 2015.
-
Eigenvalues of Curvature, Lyapunov exponents and Harder-Narasimhan filtrations
Authors:
Fei Yu
Abstract:
Inspired by the Katz-Mazur theorem on crystalline cohomology and by Eskin-Kontsevich-Zorich's numerical experiments, we conjecture that the polygon of the Lyapunov spectrum lies above (or on) the Harder-Narasimhan polygon of the Hodge bundle over any Teichmüller curve. We also discuss the connections between the two polygons and the integral of eigenvalues of the curvature of the Hodge bundle by using Atiyah-Bott, Forni and Möller's works. We obtain several applications to Teichmüller dynamics conditional on the conjecture.
Submitted 11 October, 2016; v1 submitted 7 August, 2014;
originally announced August 2014.
-
Structural Stability of Supersonic Contact Discontinuities in Three-Dimensional Compressible Steady Flows
Authors:
Ya-Guang Wang,
Fang Yu
Abstract:
In this paper, we study the structurally nonlinear stability of supersonic contact discontinuities in three-dimensional compressible isentropic steady flows. Based on the weakly linear stability result and the $L^2$-estimates obtained by the authors in J. Diff. Equ. 255 (2013) for the linearized problems of three-dimensional compressible isentropic steady equations at a supersonic contact discontinuity satisfying certain stability conditions, we first derive tame estimates of solutions to the linearized problem in higher order norms by exploring the behavior of vorticities. Since the supersonic contact discontinuities are only weakly linearly stable, the tame estimates of solutions to the linearized problems have a loss of regularity with respect to both the background states and the initial data. To use the tame estimates to study the nonlinear problem, we adapt the Nash-Moser-Hörmander iteration scheme and conclude that weakly linearly stable supersonic contact discontinuities in three-dimensional compressible steady flows are also structurally nonlinearly stable.
Submitted 6 July, 2014;
originally announced July 2014.
-
Rescaling limits of the spatial Lambda-Fleming-Viot process with selection
Authors:
Alison Etheridge,
Amandine Veber,
Feng Yu
Abstract:
We consider the spatial Lambda-Fleming-Viot process model for frequencies of genetic types in a population living in R^d, with two types of individuals (0 and 1) and natural selection favouring individuals of type 1. We first prove that the model is well-defined and provide a measure-valued dual process encoding the locations of the `potential ancestors' of a sample taken from such a population. We then consider two cases, one in which the dynamics of the process are driven by events of bounded radii and one incorporating large-scale events whose radii have a polynomial tail distribution. In both cases, we consider a sequence of spatial Lambda-Fleming-Viot processes indexed by n, and we assume that the fraction of individuals replaced during a reproduction event and the relative frequency of events during which natural selection acts tend to 0 as n tends to infinity. We choose the decay of these parameters in such a way that when reproduction is only local, the measure-valued process describing the local frequencies of the less favoured type converges in distribution to a (measure-valued) solution to the stochastic Fisher-KPP equation in one dimension, and to a (measure-valued) solution to the deterministic Fisher-KPP equation in more than one dimension. When large-scale extinction-recolonisation events occur, the sequence of processes converges instead to the solution to the analogous equation in which the Laplacian is replaced by a fractional Laplacian. We also consider the process of `potential ancestors' of a sample of individuals taken from these populations, which we see as a system of branching and coalescing symmetric jump processes. We show their convergence in distribution towards a system of Brownian or stable motions which branch at some finite rate. In one dimension, in the limit, pairs of particles also coalesce at a rate proportional to their collision local time.
Submitted 9 August, 2020; v1 submitted 23 June, 2014;
originally announced June 2014.
-
Conditioning the logistic branching process on non-extinction
Authors:
Alison Etheridge,
Shidong Wang,
Feng Yu
Abstract:
We consider a birth and death process in which death is due to both `natural death' and to competition between individuals, modelled as a quadratic function of population size. The resulting `logistic branching process' has been proposed as a model for numbers of individuals in populations competing for some resource, or for numbers of species. However, because of the quadratic death rate, even if the intrinsic growth rate is positive, the population will, with probability one, die out in finite time. There is considerable interest in understanding the process conditioned on non-extinction.
In this paper, we exploit a connection with the ancestral selection graph of population genetics to find expressions for the transition rates in the logistic branching process conditioned on survival until some fixed time $T$, in terms of the distribution of a certain one-dimensional diffusion process at time $T$. We also find the probability generating function of the Yaglom distribution of the process and rather explicit expressions for the transition rates for the so-called Q-process, that is the logistic branching process conditioned to stay alive into the indefinite future. For this process, one can write down the joint generator of the (time-reversed) total population size and what in population genetics would be called the `genealogy' and in phylogenetics would be called the `reconstructed tree' of a sample from the population.
We explore some ramifications of these calculations numerically.
Submitted 21 October, 2013;
originally announced October 2013.
-
A new inequality on the Hodge number $h^{1,1}$ of algebraic surfaces
Authors:
Jun Lu,
Sheng-Li Tan,
Fei Yu,
Kang Zuo
Abstract:
We get a new inequality on the Hodge number $h^{1,1}(S)$ of fibred algebraic complex surfaces $S$, which is a generalization of an inequality of Beauville. Our inequality implies the Arakelov type inequalities due to Arakelov, Faltings, Viehweg and Zuo, respectively.
Submitted 11 March, 2013;
originally announced March 2013.
-
Weierstrass filtration on Teichmüller curves and Lyapunov exponents: Upper bounds
Authors:
Fei Yu,
Kang Zuo
Abstract:
We get an upper bound of the slope of each graded quotient for the Harder-Narasimhan filtration of the Hodge bundle of a Teichmüller curve. As an application, we show that the sum of Lyapunov exponents of a Teichmüller curve does not exceed ${(g+1)}/{2}$, with equality reached if and only if the curve lies in the hyperelliptic locus induced from $\mathcal{Q}(2k_1,...,2k_n,-1^{2g+2})$ or it is a special Teichmüller curve in $Ω\mathcal{M}_g(1^{2g-2})$. It also gives a unified interpretation for many known results about the special partial sums of Lyapunov exponents on Teichmüller curves.
Submitted 19 April, 2016; v1 submitted 12 September, 2012;
originally announced September 2012.
-
Weierstrass filtration on Teichmuller curves and Lyapunov exponents
Authors:
Fei Yu,
Kang Zuo
Abstract:
We define the Weierstrass filtration for Teichmuller curves and construct the Harder-Narasimhan filtration of the Hodge bundle of a Teichmuller curve in hyperelliptic loci and low-genus nonvarying strata. As a result we obtain the sum of Lyapunov exponents of Teichmuller curves in these strata.
Submitted 3 December, 2014; v1 submitted 27 March, 2012;
originally announced March 2012.
-
Instability of Truncated Symmetric Powers of sheaves
Authors:
Lingguang Li,
Fei Yu
Abstract:
Let $X$ be a smooth projective variety of dimension $n$ over an algebraically closed field $k$ of characteristic $p>0$. Let $F_X:X\rightarrow X$ be the absolute Frobenius morphism, and $\E$ a torsion free sheaf on $X$. We give an upper bound of instability of truncated symmetric powers $\mathrm{T}^l(\E)(0\leq l\leq\rk(\E)(p-1))$ in terms of $L_{\max}(\Omg^1_X)$, $\mathrm{I}(\Omg^1_X)$ and $\mathrm{I}(\E)$ (Theorem \ref{InstabTl}). As an application, we obtain an upper bound of Frobenius direct image ${F_X}_*(\E)$ and some sufficient conditions of slope semi-stability of ${F_X}_*(\E)$. In addition, we study the slope (semi)-stability of sheaves of locally exact (closed) forms $B^i_X$ ($Z^i_X$).
Submitted 1 January, 2012; v1 submitted 20 October, 2010;
originally announced October 2010.
-
Fixation Probability for Competing Selective Sweeps
Authors:
Feng Yu,
Alison Etheridge,
Charles Cuthbertson
Abstract:
We consider a biological population in which a beneficial mutation is undergoing a selective sweep when a second beneficial mutation arises at a linked locus and we investigate the probability that both mutations will eventually fix in the population. Previous work has dealt with the case where the second mutation to arise confers a smaller benefit than the first. In that case population size plays almost no role. Here we consider the opposite case and observe that, by contrast, the probability of both mutations fixing can be heavily dependent on population size. Indeed the key parameter is $ρN$, the product of the population size and the recombination rate between the two selected loci. If $ρN$ is small, the probability that both mutations fix can be reduced through interference to almost zero while for large $ρN$ the mutations barely influence one another. The main rigorous result is a method for calculating the fixation probability of a double mutant in the large population limit.
Submitted 29 November, 2008;
originally announced December 2008.
-
Stationary distribution for dioecious branching particle systems with rapid stirring
Authors:
Feng Yu
Abstract:
We study dioecious (i.e., two-sex) branching particle system models, where there are two types of particles, modeling the male and female populations, and where birth of new particles requires the presence of both male and female particles. We show that stationary distributions of various dioecious branching particle models are nontrivial under certain conditions, for example, when there is sufficiently fast stirring.
Submitted 29 October, 2007;
originally announced October 2007.
-
Asymptotic behavior of the rate of adaptation
Authors:
Feng Yu,
Alison Etheridge,
Charles Cuthbertson
Abstract:
We consider the accumulation of beneficial and deleterious mutations in large asexual populations. The rate of adaptation is affected by the total mutation rate, proportion of beneficial mutations and population size $N$. We show that regardless of mutation rates, as long as the proportion of beneficial mutations is strictly positive, the adaptation rate is at least $\mathcal{O}(\log^{1-δ}N)$ where $δ$ can be any small positive number, if the population size is sufficiently large. This shows that if the genome is modeled as continuous, there is no limit to natural selection, that is, the rate of adaptation grows in $N$ without bound.
Submitted 15 October, 2010; v1 submitted 25 August, 2007;
originally announced August 2007.
-
Stationary distributions of a model of sympatric speciation
Authors:
Feng Yu
Abstract:
This paper deals with a model of sympatric speciation, that is, speciation in the absence of geographical separation, originally proposed by U. Dieckmann and M. Doebeli in 1999. We modify their original model to obtain a Fleming--Viot type model and study its stationary distribution. We show that speciation may occur, that is, the stationary distribution puts most of the mass on a configuration that does not concentrate on the phenotype with maximum carrying capacity, if competition between phenotypes is intense enough. Conversely, if competition between phenotypes is not intense, then speciation will not occur and most of the population will have the phenotype with the highest carrying capacity. The length of time it takes speciation to occur also has a delicate dependence on the mutation parameter, and the exact shape of the carrying capacity function and the competition kernel.
Submitted 31 July, 2007;
originally announced July 2007.
-
Equilibrium States of Two Stochastic Models in Mathematical Ecology
Authors:
Feng Yu
Abstract:
This work deals with two problems arising in mathematical ecology. The first problem is concerned with diploid branching particle models and their behavior when rapid stirring is added to the interaction. The particle models involve two types of particles, male and female, and branching can only occur when both types of particles are present. We show that if the branching rate is sufficiently large, this particle model has a nontrivial stationary distribution, i.e. one that does not concentrate all weight on the all-0 state, using a comparison argument due to R. Durrett. We also show extinction for small branching rates, thereby establishing the existence of a phase transition. We then add two different rapid stirring mechanisms to the interactions and show that for the particle models with rapid stirring, there also exist nontrivial stationary distribution(s); for this, we analyze the limiting PDE and establish a condition on the PDE that guarantees existence of nontrivial stationary distributions for sufficiently fast stirring.
The second problem deals with a model of sympatric speciation, i.e. speciation in the absence of geographical separation, originally proposed by U. Dieckmann and M. Doebeli in 1999. We modify their original model to obtain several constant-population particle models. We concentrate on a continuous-time model that converges to a deterministic dynamical system as the number of particles becomes large. We establish various results regarding whether speciation occurs by studying the existence of bimodal stationary distributions for the limiting dynamical system.
Submitted 5 March, 2007;
originally announced March 2007.