-
The order of appearance of the product of the first and second Lucas numbers
Authors:
Huiming Xiao,
Hongjian Li,
Pingzhi Yuan
Abstract:
Let $\left(U_n\right)_{n\geq0}$ and $\left(V_n\right)_{n\geq0}$ be the first and second Lucas sequences, respectively. Let $m$ be a positive integer. Then the order of appearance of $m$ in the first Lucas sequence is defined as the smallest positive integer $k$ such that $m$ divides $U_k$ and denoted by $τ(m)$. In this paper, we give explicit formulae for the terms $τ(U_m V_n)$, $τ(U_m U_n)$, $τ(V_m V_n)$ and $τ(U_nU_{n+p}U_{n+2p})$, where $p\geq3$ is a prime number.
Submitted 25 February, 2025;
originally announced February 2025.
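The order of appearance $τ(m)$ defined in the abstract above can be computed directly for small $m$. The following is a minimal sketch, not the authors' method: it assumes the standard recurrence $U_0=0$, $U_1=1$, $U_k = P\,U_{k-1} - Q\,U_{k-2}$ for the first Lucas sequence, and the function name and the default parameters $P=1$, $Q=-1$ (the Fibonacci case) are illustrative choices.

```python
def order_of_appearance(m, P=1, Q=-1):
    """Smallest k >= 1 with m | U_k, or None if no such k exists.

    Assumes U_0 = 0, U_1 = 1, U_k = P*U_{k-1} - Q*U_{k-2}.
    The defaults P=1, Q=-1 give the Fibonacci numbers (an illustrative choice).
    """
    if m < 1:
        raise ValueError("m must be a positive integer")
    prev, cur = 0, 1 % m  # (U_0, U_1) reduced modulo m
    # There are at most m*m distinct pairs (U_{k-1}, U_k) mod m, so if no
    # zero shows up within m*m + 1 steps, none ever will.
    for k in range(1, m * m + 2):
        if cur == 0:
            return k
        prev, cur = cur, (P * cur - Q * prev) % m
    return None
```

For example, `order_of_appearance(10)` returns 15, since $F_{15} = 610$ is the first Fibonacci number divisible by 10. With $P=3$, $Q=2$ (so $U_k = 2^k - 1$), `order_of_appearance(2, 3, 2)` returns `None` because every term is odd.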
-
A LP-rounding based algorithm for soft capacitated facility location problem with submodular penalties
Authors:
Hanyin Xiao,
Jiaming Zhang,
Zhikang Zhang,
Weidong Li
Abstract:
The soft capacitated facility location problem (SCFLP) is a classic combinatorial optimization problem, with its variants widely applied in the fields of operations research and computer science. In the SCFLP, given a set $\mathcal{F}$ of facilities and a set $\mathcal{D}$ of clients, each facility has a capacity and an open cost and may be opened multiple times, and each client has a demand.
This problem is to find a subset of facilities in $\mathcal{F}$ and connect each client to the opened facilities, such that the total cost, including open cost and connection cost, is minimized. SCFLP is an NP-hard problem, which has led to a focus on approximation algorithms. We consider a variant, the soft capacitated facility location problem with submodular penalties (SCFLPSP), which allows some clients to go unserved at a penalty cost. We consider the integer-splittable case of demand, in which the demand of each client may be served by multiple facilities, each providing an integer service amount. Based on LP-rounding, we propose a $(λR+4)$-approximation algorithm, where $R=\frac{\max_{i \in \mathcal{F} }f_i}{\min_{i \in \mathcal{F} }f_i}$ and $λ=\frac{R+\sqrt{R^2+8R}}{2R}$. In particular, when the open cost is uniform, the approximation ratio is 6.
Submitted 16 February, 2025; v1 submitted 13 February, 2025;
originally announced February 2025.
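The ratio of 6 quoted in the abstract above for uniform open costs follows from substituting $R=1$ into the stated formulas. A short worked check, using only the definitions of $R$ and $λ$ given in the abstract:

```latex
% Uniform open cost: all f_i are equal, so R = max f_i / min f_i = 1.
\[
  \lambda \;=\; \frac{R+\sqrt{R^{2}+8R}}{2R}\bigg|_{R=1}
          \;=\; \frac{1+\sqrt{9}}{2} \;=\; 2,
  \qquad
  \lambda R + 4 \;=\; 2\cdot 1 + 4 \;=\; 6 .
\]
```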
-
Gaussian heat kernel asymptotics for conditioned random walks
Authors:
Ion Grama,
Hui Xiao
Abstract:
Consider a random walk $S_n=\sum_{i=1}^n X_i$ with independent and identically distributed real-valued increments with zero mean, finite variance and moment of order $2 + δ$ for some $δ>0$. For any starting point $x\in \mathbb R$, let $τ_x = \inf \left\{ k\geq 1: x+S_{k} < 0 \right\}$ denote the first time when the random walk $x+S_n$ exits the half-line $[0,\infty)$. We investigate the uniform asymptotic behavior over $x\in \mathbb R$ of the persistence probability $\mathbb P (τ_x >n)$ and the joint distribution $\mathbb{P} \left( x + S_n \leq u, τ_x > n \right)$, for $u\geq 0$, as $n \to \infty$. New limit theorems for these probabilities are established based on the heat kernel approximations. Additionally, we evaluate the rate of convergence by proving Berry-Esseen type bounds.
Submitted 11 December, 2024;
originally announced December 2024.
-
Conditioned random walks on linear groups II: local limit theorems
Authors:
Ion Grama,
Jean-François Quint,
Hui Xiao
Abstract:
We investigate random walks on the general linear group constrained within a specific domain, with a focus on their asymptotic behavior. In a previous work [38], we constructed the associated harmonic measure, a key element in formulating the local limit theorem for conditioned random walks on groups. The primary aim of this paper is to prove this theorem. The main challenge arises from studying the conditioned reverse walk, whose increments, in the context of random walks on groups, depend on the entire future. To achieve our goal, we combine a Caravenna-type conditioned local limit theorem with the conditioned version of the central limit theorem for the reversed walk. The resulting local limit theorem is then applied to derive the local behavior of the exit time.
Submitted 9 December, 2024; v1 submitted 8 October, 2024;
originally announced October 2024.
-
Conditioned random walks on linear groups I: construction of the target harmonic measure
Authors:
Ion Grama,
Jean-François Quint,
Hui Xiao
Abstract:
Our objective is to explore random walks on the general linear group, constrained to a specific domain, with a primary focus on establishing the conditioned local limit theorem. This paper marks the initial stride toward achieving this goal, specifically entailing the construction of a novel entity -- the target harmonic measure. This measure, together with the harmonic function, serves as a pivotal component in shaping the conditioned local limit theorem. Using a reversal identity, we introduce a reversed sequence characterized as a dual random walk with a perturbation depending on future observations. The investigation of such walks, which rely on future information, lies at the heart of this paper. To carry out this study, we develop an approach grounded in the finite-size approximation of perturbations, enabling us to simplify the investigation to an array of Markov chains with increasing dimensions.
Submitted 8 October, 2024;
originally announced October 2024.
-
On deformation quantizations of symplectic supervarieties
Authors:
Husileng Xiao
Abstract:
We classify deformation quantizations of the symplectic supervarieties that are smooth and admissible. This generalizes the corresponding result of Bezrukavnikov and Kaledin to the super case. We relate the equivalence classes of quantizations of supervarieties with that of their even reduced symplectic varieties. Finally, we prove that certain nilpotent orbits of basic Lie superalgebras are admissible and splitting, and classify their deformation quantizations.
Submitted 20 January, 2025; v1 submitted 28 July, 2024;
originally announced July 2024.
-
Twisting of Lie triple systems, $L_\infty$-algebras, and (generalized) matched pairs
Authors:
Jia Zhao,
Haobo Xia
Abstract:
In this paper, we introduce notions of (proto-, quasi-)twilled Lie triple systems and give their equivalent descriptions using the controlling algebra and bidegree convention. Then we construct an $L_\infty$-algebra via a twilled Lie triple system. Besides, we establish the twisting theory of Lie triple systems and then characterize the twisting as a Maurer-Cartan element in the constructed $L_\infty$-algebra. Finally, we clarify the relationship between twilled Lie triple systems and matched pairs and clarify the relationship between twilled Lie triple systems and relative Rota-Baxter operators respectively so that we obtain the relationship between matched pairs of Lie triple systems and relative Rota-Baxter operators.
Submitted 15 June, 2024;
originally announced June 2024.
-
Cohomology and Homotopy of Lie triple systems
Authors:
Haobo Xia,
Yunhe Sheng,
Rong Tang
Abstract:
In this paper, we first give the controlling algebra of Lie triple systems. In particular, the cohomology of Lie triple systems can be characterized by the controlling algebra. Then, using controlling algebras, we introduce the notions of homotopy Nambu algebras and homotopy Lie triple systems. We show that $2$-term homotopy Lie triple systems are equivalent to Lie triple $2$-systems, and the latter are the categorification of Lie triple systems. Finally, we study skeletal and strict Lie triple $2$-systems. We show that skeletal Lie triple $2$-systems can be classified by the third cohomology group, and strict Lie triple $2$-systems are equivalent to crossed modules of Lie triple systems.
Submitted 17 December, 2023;
originally announced December 2023.
-
Online Robust Mean Estimation
Authors:
Daniel M. Kane,
Ilias Diakonikolas,
Hanshen Xiao,
Sihan Liu
Abstract:
We study the problem of high-dimensional robust mean estimation in an online setting. Specifically, we consider a scenario where $n$ sensors are measuring some common, ongoing phenomenon. At each time step $t=1,2,\ldots,T$, the $i^{th}$ sensor reports its readings $x^{(i)}_t$ for that time step. The algorithm must then commit to its estimate $μ_t$ for the true mean value of the process at time $t$. We assume that most of the sensors observe independent samples from some common distribution $X$, but an $ε$-fraction of them may instead behave maliciously. The algorithm wishes to compute a good approximation $μ$ to the true mean $μ^\ast := \mathbf{E}[X]$. We note that if the algorithm is allowed to wait until time $T$ to report its estimate, this reduces to the well-studied problem of robust mean estimation. However, the requirement that our algorithm produces partial estimates as the data is coming in substantially complicates the situation.
We prove two main results about online robust mean estimation in this model. First, if the uncorrupted samples satisfy the standard condition of $(ε,δ)$-stability, we give an efficient online algorithm that outputs estimates $μ_t$, $t \in [T],$ such that with high probability it holds that $\|μ-μ^\ast\|_2 = O(δ\log(T))$, where $μ= (μ_t)_{t \in [T]}$. We note that this error bound is nearly competitive with the best offline algorithms, which would achieve $\ell_2$-error of $O(δ)$. Our second main result shows that with additional assumptions on the input (most notably that $X$ is a product distribution) there are inefficient algorithms whose error does not depend on $T$ at all.
Submitted 24 October, 2023;
originally announced October 2023.
-
Conditioned local limit theorems for products of positive random matrices
Authors:
Ion Grama,
Hui Xiao
Abstract:
Consider the random matrix products $G_n: = g_n \ldots g_1$, where $(g_{n})_{n\geq 1}$ is a sequence of independent and identically distributed positive random $d\times d$ matrices for any integer $d \geq 2$. For any starting point $x \in \mathbb R_+^d$ with $|x| = 1$ and $y \geq 0$, consider the exit time $τ_{x, y} = \inf \{ k \geq 1: y + \log |G_k x| < 0 \}$. In this paper, we study the conditioned local probability $\mathbb P ( y + \log |G_n x| \in [0, Δ] + z, \, τ_{x, y} > n )$ under various assumptions on $y$ and $z$. For $y = o(\sqrt{n})$, we prove precise upper and lower bounds when $z$ is in a compact interval and give exact asymptotics when $z \to \infty$. We also study the case when $y \asymp \sqrt{n}$ and establish the corresponding asymptotics as a function of the behaviour of $z$.
Submitted 11 October, 2023;
originally announced October 2023.
-
Limit theorems for first passage times of multivariate perpetuity sequences
Authors:
Sebastian Mentemeier,
Hui Xiao
Abstract:
We study the first passage time $τ_u = \inf \{ n \geq 1: |V_n| > u \}$ for the multivariate perpetuity sequence $V_n = Q_1 + M_1 Q_2 + \cdots + (M_1 \ldots M_{n-1}) Q_n$, where $(M_n, Q_n)$ is a sequence of independent and identically distributed random variables with $M_1$ a $d \times d$ ($d \geq 1$) random matrix with nonnegative entries, and $Q_1$ a nonnegative random vector in $\mathbb R^d$. Here $|\cdot|$ denotes the vector norm. The exact asymptotic for the probability $\mathbb P (τ_u < \infty)$ as $u \to \infty$ has been found by Kesten (Acta Math. 1973). In this paper we prove a conditioned weak law of large numbers for $τ_u$: conditioned on the event $\{ τ_u < \infty \}$, $\frac{τ_u}{\log u}$ converges in probability to a certain constant $ρ> 0$ as $u \to \infty$. A conditioned central limit theorem for $τ_u$ is also obtained. We further establish precise large deviation asymptotics for the lower probability $\mathbb P (τ_u \leq (β- l) \log u)$ as $u \to \infty$, where $β\in (0, ρ)$ and $l \geq 0$ is a vanishing perturbation satisfying $l \to 0$ as $u \to \infty$. Our results extend those of Buraczewski et al. (Ann. Probab. 2016) from the univariate case ($d=1$) to the multivariate case ($d>1$). As consequences, we deduce exact asymptotics for the pointwise probability $\mathbb P (τ_u = [(β- l) \log u] )$ and the local probability $\mathbb P (τ_u - (β- l) \log u \in (a, a + m ] )$, where $a<0$ and $m \in \mathbb Z_+$. We also establish analogous results for the first passage time $τ_u^y = \inf \{ n \geq 1: \langle y, V_n \rangle > u \}$, where $y$ is a nonnegative vector in $\mathbb R^d$ with $|y| = 1$.
Submitted 9 December, 2024; v1 submitted 10 July, 2023;
originally announced July 2023.
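The definitions of the perpetuity sequence $V_n$ and the first passage time $τ_u$ in the abstract above can be illustrated on a fixed, finite realization. The sketch below is only an illustration under simplifying assumptions: it treats the scalar case $d=1$ (which the abstract allows), takes the sequences $(M_n)$ and $(Q_n)$ as plain lists rather than random draws, and the function name is hypothetical.

```python
def first_passage_time(M, Q, u):
    """First n >= 1 with |V_n| > u, or None if the level is never exceeded.

    Scalar (d = 1) sketch of V_n = Q_1 + M_1 Q_2 + ... + (M_1 ... M_{n-1}) Q_n,
    evaluated on finite lists M = [M_1, M_2, ...] and Q = [Q_1, Q_2, ...].
    """
    prod = 1.0  # running product M_1 ... M_{n-1}; empty product when n = 1
    v = 0.0     # running value of V_n
    for n, (m_n, q_n) in enumerate(zip(M, Q), start=1):
        v += prod * q_n
        if abs(v) > u:
            return n
        prod *= m_n
    return None
```

With `M = [2, 2, 2]` and `Q = [1, 1, 1]` the partial sums are $V_1 = 1$, $V_2 = 3$, $V_3 = 7$, so `first_passage_time(M, Q, 2.5)` returns 2, while a level of 10 is never exceeded on this finite path and the function returns `None`.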
-
Efficient Dynamic Allocation Policy for Robust Ranking and Selection under Stochastic Control Framework
Authors:
Hui Xiao,
Zhihong Wei
Abstract:
This research considers the ranking and selection with input uncertainty. The objective is to maximize the posterior probability of correctly selecting the best alternative under a fixed simulation budget, where each alternative is measured by its worst-case performance. We formulate the dynamic simulation budget allocation decision problem as a stochastic control problem under a Bayesian framework. Following the approximate dynamic programming theory, we derive a one-step-ahead dynamic optimal budget allocation policy and prove that this policy achieves consistency and asymptotic optimality. Numerical experiments demonstrate that the proposed procedure can significantly improve performance.
Submitted 12 May, 2023;
originally announced May 2023.
-
The regularization continuation method for optimization problems with nonlinear equality constraints
Authors:
Xin-long Luo,
Hang Xiao,
Sen Zhang
Abstract:
This paper considers the regularization continuation method and the trust-region updating strategy for the nonlinearly equality-constrained optimization problem. Namely, it uses the inverse of the regularization quasi-Newton matrix as the pre-conditioner to improve its computational efficiency in the well-posed phase, and it adopts the inverse of the regularization two-sided projection of the Hessian as the pre-conditioner to improve its robustness in the ill-conditioned phase. Since it only solves a linear system of equations at every iteration and the sequential quadratic programming (SQP) needs to solve a quadratic programming subproblem at every iteration, it is faster than SQP. Numerical results also show that it is more robust and faster than SQP (the built-in subroutine fmincon.m of the MATLAB2020a environment and the subroutine SNOPT executed in GAMS v28.2 (2019) environment). The computational time of the new method is about one third of that of fmincon.m for the large-scale problem. Finally, the global convergence analysis of the new method is also given.
Submitted 4 August, 2023; v1 submitted 26 March, 2023;
originally announced March 2023.
-
Moderate deviations and local limit theorems for the coefficients of random walks on the general linear group
Authors:
Hui Xiao,
Ion Grama,
Quansheng Liu
Abstract:
Consider the random walk $G_n : = g_n \ldots g_1$, $n \geq 1$, where $(g_n)_{n\geq 1}$ is a sequence of independent and identically distributed random elements with law $μ$ on the general linear group ${\rm GL}(V)$ with $V=\mathbb R^d$. Under suitable conditions on $μ$, we establish Cramér type moderate deviation expansions and local limit theorems with moderate deviations for the coefficients $\langle f, G_n v \rangle$, where $v \in V$ and $f \in V^*$. Our approach is based on the Hölder regularity of the invariant measure of the Markov chain $G_n \!\cdot \! x = \mathbb R G_n v$ on the projective space of $V$ with the starting point $x = \mathbb R v$, under the changed measure.
Submitted 10 September, 2022;
originally announced September 2022.
-
Edgeworth expansion for the coefficients of random walks on the general linear group
Authors:
Hui Xiao,
Ion Grama,
Quansheng Liu
Abstract:
Let $(g_n)_{n\geq 1}$ be a sequence of independent and identically distributed random elements with law $μ$ on the general linear group $\textup{GL}(V)$, where $V=\mathbb R^d$. Consider the random walk $G_n : = g_n \ldots g_1$, $n \geq 1$. Under suitable conditions on $μ$, we establish the first-order Edgeworth expansion for the coefficients $\langle f, G_n v \rangle$ with $v \in V$ and $f \in V^*$, in which a new additional term appears compared to the case of vector norm $\|G_n v\|$.
Submitted 8 September, 2022;
originally announced September 2022.
-
Edgeworth expansion and large deviations for the coefficients of products of positive random matrices
Authors:
Hui Xiao,
Ion Grama,
Quansheng Liu
Abstract:
Consider the matrix products $G_n: = g_n \ldots g_1$, where $(g_{n})_{n\geq 1}$ is a sequence of independent and identically distributed positive random $d\times d$ matrices. Under the optimal third moment condition, we first establish a Berry-Esseen theorem and an Edgeworth expansion for the $(i,j)$-th entry $G_n^{i,j}$ of the matrix $G_n$, where $1 \leq i, j \leq d$. Using the Edgeworth expansion for $G_n^{i,j}$ under the changed probability measure, we then prove precise upper and lower large deviation asymptotics for the entries $G_n^{i,j}$ subject to an exponential moment assumption. As applications, we deduce local limit theorems with large deviations for $G_n^{i,j}$ and upper and lower large deviations bounds for the spectral radius $ρ(G_n)$ of $G_n$. A byproduct of our approach is the local limit theorem for $G_n^{i,j}$ under the optimal second moment condition. In the proofs we develop a spectral gap theory for the norm cocycle and for the coefficients, which is of independent interest.
Submitted 19 February, 2025; v1 submitted 7 September, 2022;
originally announced September 2022.
-
The extremal position of a branching random walk on the general linear group
Authors:
Ion Grama,
Sebastian Mentemeier,
Hui Xiao
Abstract:
Consider a branching random walk $(G_u)_{u\in \mathbb T}$ on the general linear group $\textrm{GL}(V)$ of a finite dimensional space $V$, where $\mathbb T$ is the associated genealogical tree with nodes $u$. For any starting point $v \in V \setminus\{0\}$ with $\|v\|=1$ and $x = \mathbb R v \in \mathbb P(V)$, let $M^x_n=\max_{|u| = n} \log \| G_u v \|$ denote the maximal position of the walk $\log \| G_u v \|$ in the generation $n$. We first show that under suitable conditions, $\lim_{n \to \infty} \frac{M_n^x }{n} = γ$ almost surely, where $γ\in \mathbb R$ is a constant. Then, in the case when $γ= 0$, under appropriate boundary conditions, we refine the last statement by determining the rate of convergence at which $M_n^x$ converges to $-\infty$. We prove in particular that $\lim_{n \to \infty} \frac{M_n^x}{\log n} = -\frac{3}{2α}$ in probability, where $α>0$ is a constant determined by the boundary conditions. Analogous properties are established for the minimal position. As a consequence we derive the asymptotic speed of the maximal and minimal positions for the coefficients, the operator norm and the spectral radius of $G_u$.
Submitted 10 December, 2024; v1 submitted 10 June, 2022;
originally announced June 2022.
-
Residual regularization path-following methods for linear complementarity problems
Authors:
Xin-long Luo,
Sen Zhang,
Hang Xiao
Abstract:
In this article, we consider the residual regularization path-following method with the trust-region updating strategy for the linear complementarity problem. This time-stepping selection based on the trust-region updating strategy overcomes the shortcoming of the line search method, which consumes the unnecessary trial steps in the transient-state phase. In order to improve the robustness of the path-following method, we use the residual regularization parameter to replace the traditional complementarity regularization parameter. Moreover, we prove the global convergence of the new method under the standard assumptions without the traditional assumption condition of the priority to feasibility over complementarity. Numerical results show that the new method is robust and efficient for the linear complementarity problem, especially for the dense cases. And it is more robust and faster than some state-of-the-art solvers such as the built-in subroutines PATH and MILES of the GAMS v28.2 (2019) environment. The computational time of the new method is about 1/3 to 1/10 of that of PATH for the dense linear complementarity problem.
Submitted 28 January, 2023; v1 submitted 21 May, 2022;
originally announced May 2022.
-
Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Authors:
Justin Baker,
Hedi Xia,
Yiwei Wang,
Elena Cherkaev,
Akil Narayan,
Long Chen,
Jack Xin,
Andrea L. Bertozzi,
Stanley J. Osher,
Bao Wang
Abstract:
Learning neural ODEs often requires solving very stiff ODE systems, primarily using explicit adaptive step size ODE solvers. These solvers are computationally expensive, requiring the use of tiny step sizes for numerical stability and accuracy guarantees. This paper considers learning neural ODEs using implicit ODE solvers of different orders leveraging proximal operators. The proximal implicit solver consists of inner-outer iterations: the inner iterations approximate each implicit update step using a fast optimization algorithm, and the outer iterations solve the ODE system over time. The proximal implicit ODE solver guarantees superiority over explicit solvers in numerical stability and computational efficiency. We validate the advantages of proximal implicit solvers over existing popular neural ODE solvers on various challenging benchmark tasks, including learning continuous-depth graph neural networks and continuous normalizing flows.
Submitted 18 April, 2022;
originally announced April 2022.
-
Higher regularity of homeomorphisms in the Hartman-Grobman theorem for semilinear evolution equations
Authors:
Weijie Lu,
Manuel Pinto,
Y. H Xia
Abstract:
Hein and Prüss [J. Differential Equations, 261 (2016) 4709-4727] presented a version of the Hartman-Grobman type $C^{0}$ linearization result for semilinear hyperbolic evolution equations. They showed that the linearising map (homeomorphism) and its inverse are Hölder continuous. An important question is whether the regularity of the homeomorphisms can be improved. In the present paper, we prove that if the mild solutions of the semilinear system are bounded, then the homeomorphism is Lipschitzian, while its inverse is merely Hölder continuous. We also give a generalized local linearization result in this paper. Finally, some applications end the paper. As pointed out by Backes [J. Differential Equations, 297 (2021) 536-574], even if the diffeomorphism $F$ is $C^{\infty}$, the homeomorphism can fail to be locally Lipschitz. The homeomorphisms are in general only locally Hölder continuous. However, by establishing two effective dichotomy integral inequalities, we prove that the conjugacy is Lipschitzian, while its inverse is Hölder continuous. Our result is the first one to observe the higher regularity of homeomorphisms in the Hartman-Grobman theorem.
Submitted 31 January, 2022;
originally announced January 2022.
-
A Hartman-Grobman theorem for algebraic dichotomies
Authors:
Chaofan Pan,
Manuel Pinto,
Y. H. Xia
Abstract:
Algebraic dichotomy is a generalization of exponential dichotomy (Lin, JDE2009). This paper gives a version of the Hartman-Grobman linearization theorem assuming that the linear system admits an algebraic dichotomy, which generalizes Palmer's linearization theorem. Besides, we prove that the homeomorphism in the linearization theorem is Hölder continuous (and has a Hölder continuous inverse). Compared with exponential dichotomy, algebraic dichotomy is more complicated. The exponential dichotomy leads to the estimates $\int_{-\infty}^{t}e^{-α(t-s)}ds$ and $\int_{t}^{+\infty}e^{-α(s-t)}ds$ which are convergent. However, the algebraic dichotomy leads us to $\int_{-\infty}^{t}\left(\frac{μ(t)}{μ(s)}\right)^{-α}ds$ or $\int_{t}^{+\infty}\left(\frac{μ(s)}{μ(t)}\right)^{-α}ds$, whose convergence is unknown in the sense of Riemann.
Submitted 29 April, 2022; v1 submitted 28 January, 2022;
originally announced January 2022.
-
Limit theorems for the coefficients of random walks on the general linear group
Authors:
Hui Xiao,
Ion Grama,
Quansheng Liu
Abstract:
Let $(g_n)_{n\geq 1}$ be a sequence of independent and identically distributed random elements with law $μ$ on the general linear group $\textrm{GL}(V)$, where $V=\mathbb R^d$. Consider the random walk $G_n : = g_n \ldots g_1$, $n \geq 1$, and the coefficients $\langle f, G_n v \rangle$, where $v \in V$ and $f \in V^*$. Under suitable moment assumptions on $μ$, we prove the strong and weak laws of large numbers and the central limit theorem for $\langle f, G_n v \rangle$, which improve the previous results established under the exponential moment condition on $μ$. We further demonstrate the Berry-Esseen bound, the Edgeworth expansion, the Cramér type moderate deviation expansion and the local limit theorem with moderate deviations for $\langle f, G_n v \rangle$ under the exponential moment condition. Under a subexponential moment condition on $μ$, we also show a Berry-Esseen type bound and the moderate deviation principle for $\langle f, G_n v \rangle$. Our approach is based on various versions of the Hölder regularity of the invariant measure of the Markov chain $G_n \!\cdot \! x = \mathbb R G_n v$ on the projective space of $V$ with the starting point $x = \mathbb R v$.
Submitted 20 November, 2021;
originally announced November 2021.
-
Two classes of minimal generic fundamental invariants for tensors
Authors:
Xin Li,
Liping Zhang,
Hanchen Xia
Abstract:
Motivated by the problems raised by Bürgisser and Ikenmeyer, we discuss two classes of minimal generic fundamental invariants for tensors of order 3. The first one is defined on $\otimes^3 \mathbb{C}^m$, where $m=n^2-1$. We study its construction by the obstruction design introduced by Bürgisser and Ikenmeyer, which partially answers one problem raised by them. The second one is defined on $\mathbb{C}^{\ell m}\otimes \mathbb{C}^{mn}\otimes \mathbb{C}^{n\ell}$. We study its evaluation on the matrix multiplication tensor $\langle\ell,m,n\rangle$ and the unit tensor $\langle n^2 \rangle$ when $\ell=m=n$. The evaluation on the unit tensor leads to the definition of Latin cubes and the 3-dimensional Alon-Tarsi problem. We generalize some results on Latin squares to Latin cubes, which enriches the understanding of the 3-dimensional Alon-Tarsi problem. It is also natural to generalize the constructions to tensors of other orders. We illustrate the distinction between even and odd dimensional generalizations by concrete examples. Finally, some open problems in related fields are raised.
Submitted 8 May, 2025; v1 submitted 14 November, 2021;
originally announced November 2021.
-
Characterizations of complex Finsler Metrics
Authors:
Hongjun Li,
Hongchuan Xia
Abstract:
Munteanu defined the canonical connection associated to a strongly pseudoconvex complex Finsler manifold $(M,F)$. We first prove that the holomorphic sectional curvature tensors of the canonical connection coincide with those of the Chern-Finsler connection associated to $F$ if and only if $F$ is a Kähler-Finsler metric. We also investigate the relationship of the Ricci curvatures (resp. scalar curvatures) of these two connections when $M$ is compact. As an application, two characterizations of balanced complex Finsler metrics are given. Next, we obtain a sufficient and necessary condition for a balanced complex Finsler metric to be Kähler-Finsler. Finally, we investigate conformal transformations of a balanced complex Finsler metric.
Submitted 6 February, 2023; v1 submitted 29 October, 2021;
originally announced November 2021.
-
Conditioned limit theorems for hyperbolic dynamical systems
Authors:
Ion Grama,
Jean-François Quint,
Hui Xiao
Abstract:
Let $(\mathbb X, T)$ be a subshift of finite type equipped with the Gibbs measure $ν$ and let $f$ be a real-valued Hölder continuous function on $\mathbb X$ such that $ν(f) = 0$. Consider the Birkhoff sums $S_n f = \sum_{k=0}^{n-1} f \circ T^{k}$, $n\geq 1$. For any $t \in \mathbb R$, denote by $τ_t^f$ the first time when the sum $t+ S_n f$ leaves the positive half-line for some $n\geq 1$. By analogy with the case of random walks with independent identically distributed increments, we study the asymptotic as $n\to\infty$ of the probabilities $ ν(x\in \mathbb X: τ_t^f(x)>n) $ and $ ν(x\in \mathbb X: τ_t^f(x)=n) $. We also establish integral and local type limit theorems for the sum $t+ S_n f(x)$ conditioned on the set $\{ x \in \mathbb X: τ_t^f(x)>n \}$.
Submitted 19 October, 2021;
originally announced October 2021.
-
How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies
Authors:
Bao Wang,
Hedi Xia,
Tan Nguyen,
Stanley Osher
Abstract:
We present and review an algorithmic and theoretical framework for improving neural network architecture design via momentum. As case studies, we consider how momentum can improve the architecture design for recurrent neural networks (RNNs), neural ordinary differential equations (ODEs), and transformers. We show that integrating momentum into neural network architectures has several remarkable theoretical and empirical benefits: 1) integrating momentum into RNNs and neural ODEs can overcome the vanishing gradient issues in training RNNs and neural ODEs, resulting in effective learning of long-term dependencies; 2) momentum in neural ODEs can reduce the stiffness of the ODE dynamics, which significantly enhances the computational efficiency in training and testing; 3) momentum can improve the efficiency and accuracy of transformers.
Submitted 18 October, 2021; v1 submitted 13 October, 2021;
originally announced October 2021.
-
Conditioned local limit theorems for random walks on the real line
Authors:
Ion Grama,
Hui Xiao
Abstract:
Consider a random walk $S_n=\sum_{i=1}^n X_i$ with independent and identically distributed real-valued increments $X_i$ of zero mean and finite variance. Assume that $X_i$ is non-lattice and has a moment of order $2+δ$. For any $x\geq 0$, let $τ_x = \inf \left\{ k\geq 1: x+S_{k} < 0 \right\}$ be the first time when the random walk $x+S_n$ leaves the half-line $[0,\infty)$. We study the asymptotic behavior of the probability $\mathbb{P} (τ_x >n)$ and that of the expectation $\mathbb{E} \left( f(x + S_n ), τ_x > n \right)$ for a large class of target functions $f$ and various values of $x$, $y$ possibly depending on $n$. This general setting implies limit theorems for the joint distribution $\mathbb{P} \left( x + S_n \in y+ [0, Δ], τ_x > n \right)$ where $Δ>0$ may also depend on $n$. In particular, the case of moderate deviations $y=σ\sqrt{q n\log n}$ is considered. We also deduce some new asymptotics for random walks with drift and give explicit constants in the asymptotic of the probability $\mathbb{P} (τ_x =n)$. For the proofs we establish new conditioned integral limit theorems with precise error terms.
Submitted 11 October, 2021;
originally announced October 2021.
-
Heavy Ball Neural Ordinary Differential Equations
Authors:
Hedi Xia,
Vai Suliafu,
Hangjie Ji,
Tan M. Nguyen,
Andrea L. Bertozzi,
Stanley J. Osher,
Bao Wang
Abstract:
We propose heavy ball neural ordinary differential equations (HBNODEs), leveraging the continuous limit of the classical momentum accelerated gradient descent, to improve neural ODEs (NODEs) training and inference. HBNODEs have two properties that imply practical advantages over NODEs: (i) The adjoint state of an HBNODE also satisfies an HBNODE, accelerating both forward and backward ODE solvers, thus significantly reducing the number of function evaluations (NFEs) and improving the utility of the trained models. (ii) The spectrum of HBNODEs is well structured, enabling effective learning of long-term dependencies from complex sequential data. We verify the advantages of HBNODEs over NODEs on benchmark tasks, including image classification, learning complex dynamics, and sequential modeling. Our method requires remarkably fewer forward and backward NFEs, is more accurate, and learns long-term dependencies more effectively than the other ODE-based neural network models. Code is available at https://github.com/hedixia/HeavyBallNODE.
Submitted 10 October, 2021;
originally announced October 2021.
-
On the Foundation of Sparse Sensing (Part II): Diophantine Sampling and Array Configuration
Authors:
Hanshen Xiao,
Beining Zhou,
Guoqiang Xiao
Abstract:
In the second part of this series of papers, we set out to study the algorithmic efficiency of sparse sensing. Stemming from co-prime sensing, we propose a generalized framework, termed Diophantine sensing, which utilizes generic Diophantine equation theory and higher-order sparse rulers to strengthen the sampling time, the degree of freedom (DoF), and the sampling sparsity, simultaneously. Resorting to higher-moment statistics, the proposed Diophantine framework presents two fundamental improvements. First, on frequency estimation, we prove that given arbitrarily large down-sampling rates, there exist sampling schemes where the number of samples needed is only proportional to the sum of DoF and the number of snapshots required, which implies a linear sampling time. Second, on direction-of-arrival (DoA) estimation, we propose two generic array constructions such that given $N$ sensors, the minimal distance between sensors can be as large as a polynomial of $N$, $O(N^q)$, which indicates that an arbitrarily sparse array (with arbitrarily small mutual coupling) exists given sufficiently many sensors. In addition, asymptotically, the proposed array configurations produce the best known DoF bound compared to existing sparse array designs.
Submitted 23 August, 2021;
originally announced August 2021.
-
Equivariant Variance Estimation for Multiple Change-point Model
Authors:
Ning Hao,
Yue Selena Niu,
Han Xiao
Abstract:
The variance of noise plays an important role in many change-point detection procedures and the associated inferences. Most commonly used variance estimators require strong assumptions on the true mean structure or normality of the error distribution, which may not hold in applications. More importantly, the qualities of these estimators have not been discussed systematically in the literature. In this paper, we introduce a framework of equivariant variance estimation for multiple change-point models. In particular, we characterize the set of all equivariant unbiased quadratic variance estimators for a family of change-point model classes, and develop a minimax theory for such estimators.
Submitted 15 November, 2023; v1 submitted 21 August, 2021;
originally announced August 2021.
-
Continuation Newton methods with deflation techniques for global optimization problems
Authors:
Xin-long Luo,
Hang Xiao,
Sen Zhang
Abstract:
The global minimum point of an optimization problem is of interest in engineering fields and it is difficult to find, especially for a nonconvex large-scale optimization problem. In this article, we consider a new memetic algorithm for this problem. That is to say, we use the continuation Newton method with the deflation technique to find multiple stationary points of the objective function and use those stationary points as the initial seeds of the evolutionary algorithm, rather than the random initial seeds of the known evolutionary algorithms. Meanwhile, in order to retain the usability of the derivative-free method and the fast convergence of the gradient-based method, we use the automatic differentiation technique to compute the gradient and replace the Hessian matrix with its finite difference approximation. According to our numerical experiments, this new algorithm works well for unconstrained optimization problems and finds their global minima efficiently, in comparison to the other representative global optimization methods such as the multi-start methods (the built-in subroutine GlobalSearch.m of MATLAB R2021b, GLODS and VRBBO), the branch-and-bound method (Couenne, a state-of-the-art open-source solver for mixed integer nonlinear programming problems), and the derivative-free algorithms (CMA-ES and MCS).
Submitted 13 December, 2023; v1 submitted 29 July, 2021;
originally announced July 2021.
-
The regularization continuation method with an adaptive time step control for linearly constrained optimization problems
Authors:
Xin-long Luo,
Hang Xiao
Abstract:
This paper considers the regularization continuation method and the trust-region updating strategy for the optimization problem with linear equality constraints. The proposed method utilizes the linear conservation law of the regularization continuation method such that it does not need to compute the correction step for preserving the feasibility, unlike the previous continuation methods and the quasi-Newton updating formulas for the linearly constrained optimization problem. Moreover, the new method uses the special limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) formula as the preconditioning technique to improve its computational efficiency in the well-posed phase, and it uses the inverse of the regularized two-sided projection of the Lagrangian Hessian as the pre-conditioner to improve its robustness. Numerical results also show that the new method is more robust and faster than traditional optimization methods such as the alternating direction method of multipliers (ADMM), the sequential quadratic programming (SQP) method (the built-in subroutine fmincon.m of the MATLAB2020a environment), and the recent continuation method (Ptctr). The computational time of the new method is about 1/3 of that of SQP (fmincon.m). Finally, the global convergence analysis of the new method is also given.
Submitted 8 April, 2022; v1 submitted 2 June, 2021;
originally announced June 2021.
-
Generalized continuation Newton methods and the trust-region updating strategy for the underdetermined system
Authors:
Xin-long Luo,
Hang Xiao
Abstract:
This paper considers the generalized continuation Newton method and the trust-region updating strategy for the underdetermined system of nonlinear equations. Moreover, in order to improve its computational efficiency, the new method will not update the Jacobian matrix when the current Jacobian matrix performs well. The numerical results show that the new method is more robust and faster than traditional optimization methods such as the Levenberg-Marquardt method (a variant of trust-region methods, the built-in subroutine fsolve.m of the MATLAB R2020a environment). The computational time of the new method is about 1/8 to 1/50 of that of fsolve. Furthermore, the paper also proves the global convergence and the local superlinear convergence of the new method under some standard assumptions.
Submitted 25 May, 2021; v1 submitted 9 March, 2021;
originally announced March 2021.
-
On finite dimensional representations of finite W-superalgebras
Authors:
Husileng Xiao
Abstract:
Let $\mathfrak{g}=\mathfrak{g}_{\bar{0}}+\mathfrak{g}_{\bar{1}}$ be a basic Lie superalgebra, and let $\mathcal{W}_0$ (resp. $\mathcal{W}$) be the finite W-algebra (resp. W-superalgebra) constructed from a fixed nilpotent element in $\mathfrak{g}_{\bar{0}}$. Based on a relation between the finite W-algebra $\mathcal{W}_0$ and the W-superalgebra $\mathcal{W}$ found recently by the author and Shu, we study the finite dimensional representations of finite W-superalgebras in this paper. We first formulate and prove a version of Premet's conjecture for the finite W-superalgebras from basic simple Lie superalgebras. As in the W-algebra case, Premet's conjecture comes very close to giving a classification of the finite dimensional simple $\mathcal{W}$-modules. In the case where $\mathfrak{g}$ is a Lie superalgebra of basic type I, we prove that the set of simple $\mathcal{W}$-supermodules is in bijection with that of simple $\mathcal{W}_0$-modules; presenting a triangular decomposition of the tensor product of $\mathcal{W}$ with a Clifford algebra, we also give an algorithm to compute the character of the finite dimensional simple $\mathcal{W}$-supermodules with integral central character.
Submitted 17 October, 2022; v1 submitted 18 January, 2021;
originally announced January 2021.
-
Explicit continuation methods with L-BFGS updating formulas for linearly constrained optimization problems
Authors:
Xin-long Luo,
Jia-hui Lv,
Hang Xiao
Abstract:
This paper considers an explicit continuation method with the trusty time-stepping scheme and the limited-memory BFGS (L-BFGS) updating formula (Eptctr) for the linearly constrained optimization problem. At every iteration, Eptctr only involves three pairs of vector inner products and one matrix-vector product, unlike the traditional and representative optimization methods such as sequential quadratic programming (SQP) or the latest continuation methods such as Ptctr \cite{LLS2020}, which need to solve a quadratic programming subproblem (SQP) or a linear system of equations (Ptctr). Thus, Eptctr can save much more computational time than SQP or Ptctr. Numerical results also show that the consumed time of Eptctr is about one tenth of that of Ptctr or one fifteenth to 0.4 percent of that of SQP. Furthermore, Eptctr can save the storage space of an $(n+m) \times (n+m)$ large-scale matrix, in comparison to SQP. The required memory of Eptctr is about one fifth of that of SQP. Finally, we also give the global convergence analysis of the new method under the standard assumptions.
Submitted 18 January, 2021;
originally announced January 2021.
-
Explicit pseudo-transient continuation and the trust-region updating strategy for unconstrained optimization
Authors:
Xin-long Luo,
Hang Xiao,
Jia-hui Lv,
Sen Zhang
Abstract:
This paper considers an explicit continuation method and the trust-region updating strategy for the unconstrained optimization problem. Moreover, in order to improve its computational efficiency and robustness, the new method uses the switching preconditioning technique. In the well-conditioned phase, the new method uses the L-BFGS method as the preconditioning technique in order to improve its computational efficiency. Otherwise, the new method uses the inverse of the Hessian matrix as the pre-conditioner in order to improve its robustness. Numerical results also show that the new method is more robust and faster than traditional optimization methods such as the trust-region method and the line search method. The computational time of the new method is about one percent of that of the trust-region method (the subroutine fminunc.m of the MATLAB2019a environment, set to use the trust-region method) or one fifth of that of the line search method (fminunc.m set to use the quasi-Newton method) for the large-scale problem. Finally, the global convergence analysis of the new method is also given.
Submitted 13 February, 2021; v1 submitted 29 December, 2020;
originally announced December 2020.
-
Strong averaging principle for a class of slow-fast singular SPDEs driven by $α$-stable process
Authors:
Xiaobin Sun,
Huilian Xia,
Yingchao Xie,
Xingcheng Zhou
Abstract:
In this paper, the strong averaging principle is studied for a class of slow-fast SPDEs with Hölder continuous drift driven by an $α$-stable process, using Zvonkin's transformation and the classical Khasminskii time discretization method. As an application, an example is also provided to illustrate our result.
Submitted 8 May, 2021; v1 submitted 24 November, 2020;
originally announced November 2020.
-
Berry-Esseen bounds and moderate deviations for the norm, entries and spectral radius of products of positive random matrices
Authors:
Hui Xiao,
Ion Grama,
Quansheng Liu
Abstract:
Let $(g_{n})_{n\geq 1}$ be a sequence of independent and identically distributed positive random $d\times d$ matrices and consider the matrix product $G_n: = g_n \ldots g_1$. Under suitable conditions, we establish the Berry-Esseen bounds on the rate of convergence in the central limit theorem and moderate deviation expansions of Cramér type, for the matrix norm $\| G_n \|$ of $G_n$, for its $(i,j)$-th entry $G_n^{i,j}$, and for its spectral radius $ρ(G_n)$.
Submitted 1 October, 2020;
originally announced October 2020.
-
Large deviation expansions for the coefficients of random walks on the general linear group
Authors:
Hui Xiao,
Ion Grama,
Quansheng Liu
Abstract:
Let $(g_n)_{n\geq 1}$ be a sequence of independent and identically distributed elements of the general linear group $GL(d, \mathbb R)$. Consider the random walk $G_n: = g_n \ldots g_1$. Under suitable conditions, we establish Bahadur-Rao-Petrov type large deviation expansion for the coefficients $\langle f, G_n v \rangle$, where $f \in (\mathbb R^d)^*$ and $v \in \mathbb R^d$. In particular, our result implies the large deviation principle with an explicit rate function, thus improving significantly the large deviation bounds established earlier. Moreover, we establish Bahadur-Rao-Petrov type large deviation expansion for the coefficients $\langle f, G_n v \rangle$ under the changed measure. Toward this end we prove the Hölder regularity of the stationary measure corresponding to the Markov chain $G_n v /|G_n v|$ under the changed measure, which is of independent interest. In addition, we also prove local limit theorems with large deviations for the coefficients of $G_n$.
Submitted 1 October, 2020;
originally announced October 2020.
-
A zero-one law for invariant measures and a local limit theorem for coefficients of random walks on the general linear group
Authors:
Ion Grama,
Jean-François Quint,
Hui Xiao
Abstract:
We prove a zero-one law for the stationary measure for algebraic sets generalizing the results of Furstenberg [13] and Guivarc'h and Le Page [20]. As an application, we establish a local limit theorem for the coefficients of random walks on the general linear group.
Submitted 24 September, 2020;
originally announced September 2020.
-
Recurrence relations of Li coefficients
Authors:
Huan Xiao
Abstract:
One of the equivalents of the Riemann hypothesis is Li's criterion that all Li coefficients are positive. We study recurrence relations of Li coefficients in this note.
Submitted 23 June, 2020;
originally announced June 2020.
-
Continuation Newton methods with the residual trust-region time-stepping scheme for nonlinear equations
Authors:
Xin-long Luo,
Hang Xiao,
Jia-hui Lv
Abstract:
For nonlinear equations, the homotopy methods (continuation methods) are popular in engineering fields since their convergence regions are large and they are quite reliable at finding a solution. The disadvantage of the classical homotopy methods is that their computational cost is heavy, since they need to solve many auxiliary nonlinear systems during the intermediate continuation processes. In order to overcome this shortcoming, we consider the special explicit continuation Newton method with the residual trust-region time-stepping scheme for this problem. According to our numerical experiments, the new method is more robust and faster at finding the required solution of the real-world problem than the traditional optimization method (the built-in subroutine fsolve.m of the MATLAB environment) and the homotopy continuation methods (HOMPACK90 and NAClab). Furthermore, we analyze the global convergence and the local superlinear convergence of the new method.
Submitted 26 March, 2021; v1 submitted 3 June, 2020;
originally announced June 2020.
-
On $U(n)$-invariant strongly convex complex Finsler metrics
Authors:
Kun Wang,
Hongchuan Xia,
Chunping Zhong
Abstract:
In this paper, we obtain a necessary and sufficient condition for a $U(n)$-invariant complex Finsler metric $F$ on domains in $\mathbb{C}^n$ to be strongly convex, which also makes it possible to investigate the relationship between real and complex Finsler geometry via concrete and computable examples. We prove a rigidity theorem which states that a $U(n)$-invariant strongly convex complex Finsler metric $F$ is a real Berwald metric if and only if $F$ comes from a $U(n)$-invariant Hermitian metric. We give a characterization of $U(n)$-invariant weakly complex Berwald metrics with vanishing holomorphic sectional curvature and obtain an explicit formula for the holomorphic curvature of $U(n)$-invariant strongly pseudoconvex complex Finsler metrics. Finally, we prove that the real geodesics of some $U(n)$-invariant complex Finsler metric restricted to the unit sphere $\pmb{S}^{2n-1}\subset\mathbb{C}^n$ share a specific property with those of the complex Wrona metric on $\mathbb{C}^n$.
Submitted 20 May, 2020;
originally announced May 2020.
-
Campana points on biequivariant compactifications of the Heisenberg group
Authors:
Huan Xiao
Abstract:
We study Campana points on biequivariant compactifications of the Heisenberg group and confirm the log Manin conjecture introduced by Pieropan, Smeets, Tanimoto and Várilly-Alvarado.
Submitted 14 January, 2021; v1 submitted 27 April, 2020;
originally announced April 2020.
-
On the Premet conjecture for finite W-superalgebras
Authors:
Husileng Xiao
Abstract:
Let $\bullet^†$ be the map, in the sense of Losev, that sends the set of two-sided ideals of a finite W-algebra to that of the universal enveloping algebra of the corresponding Lie algebra. The Premet conjecture, which was proved in \cite{Lo11}, says that, restricted to the set of primitive ideals of finite codimension, any fiber of the map $\bullet^†$ is a single orbit under the action of a finite group. In this article we formulate and prove a similar fact in the super case. This gives a classification of the finite-dimensional irreducible representations of W-superalgebras, provided $C_{e}$ is a trivial group and the set of primitive ideals of the universal enveloping algebra of the corresponding Lie superalgebra is known.
Submitted 5 July, 2020; v1 submitted 24 February, 2020;
originally announced February 2020.
-
KoPA: Automated Kronecker Product Approximation
Authors:
Chencheng Cai,
Rong Chen,
Han Xiao
Abstract:
We consider the problem of matrix approximation and denoising induced by the Kronecker product decomposition. Specifically, we propose to approximate a given matrix by the sum of a few Kronecker products of matrices, which we refer to as the Kronecker product approximation (KoPA). Because the Kronecker product is an extension of the outer product from vectors to matrices, KoPA extends the low rank matrix approximation, and includes it as a special case. Compared with the latter, KoPA also offers greater flexibility, since it allows the user to choose the configuration, that is, the dimensions of the two smaller matrices forming the Kronecker product. On the other hand, the configuration to be used is usually unknown, and needs to be determined from the data in order to achieve the optimal balance between accuracy and parsimony. We propose to use extended information criteria to select the configuration. Under the paradigm of high dimensional analysis, we show that the proposed procedure is able to select the true configuration with probability tending to one, under suitable conditions on the signal-to-noise ratio. We demonstrate the superiority of KoPA over the low rank approximations through numerical studies, and several benchmark image examples.
Submitted 26 August, 2020; v1 submitted 5 December, 2019;
originally announced December 2019.
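For a fixed configuration, the best single Kronecker product approximating a matrix can be found by rearranging the matrix so that a Kronecker product becomes a rank-one outer product, and then taking the leading singular pair. The sketch below shows that one-term case with a plain power iteration; the function names `rearrange`, `nearest_kronecker` and `kron` are hypothetical, and neither the multi-term fit nor the information-criterion selection of the configuration described in the abstract is attempted.

```python
import math

def kron(B, C):
    """Kronecker product of two matrices given as lists of rows."""
    return [[b * c for b in brow for c in crow] for brow in B for crow in C]

def rearrange(A, shape_b, shape_c):
    """Rearrange A (m1*m2 x n1*n2) into R (m1*n1 x m2*n2).

    Under this map kron(B, C) becomes the outer product vec(B) vec(C)^T,
    with both matrices flattened row by row.
    """
    (m1, n1), (m2, n2) = shape_b, shape_c
    return [[A[i1 * m2 + i2][j1 * n2 + j2]
             for i2 in range(m2) for j2 in range(n2)]
            for i1 in range(m1) for j1 in range(n1)]

def nearest_kronecker(A, shape_b, shape_c, iters=100):
    """One-term Kronecker approximation A ~ kron(B, C) via power iteration.

    Assumes the all-ones starting vector is not orthogonal to the leading
    right singular vector of R (otherwise the normalisation divides by zero).
    """
    (m1, n1), (m2, n2) = shape_b, shape_c
    R = rearrange(A, shape_b, shape_c)
    rows, cols = len(R), len(R[0])
    v = [1.0] * cols
    for _ in range(iters):
        u = [sum(R[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        norm_u = math.sqrt(sum(x * x for x in u))
        u = [x / norm_u for x in u]
        v = [sum(R[i][j] * u[i] for i in range(rows)) for j in range(cols)]
    # u has unit length and v = R^T u carries the scale, so R ~ u v^T.
    B = [[u[i1 * n1 + j1] for j1 in range(n1)] for i1 in range(m1)]
    C = [[v[i2 * n2 + j2] for j2 in range(n2)] for i2 in range(m2)]
    return B, C

B0 = [[1.0, 2.0], [3.0, 4.0]]
C0 = [[0.0, 1.0], [1.0, 1.0]]
A = kron(B0, C0)
B, C = nearest_kronecker(A, (2, 2), (2, 2))
A_hat = kron(B, C)  # reproduces A, since A is an exact Kronecker product
```

The factors are only determined up to a scalar (`B` may come back as a multiple of `B0`, with `C` scaled inversely), so the comparison is made on the product rather than on the factors.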
-
Equivariant K-theory approach to $\imath$-quantum groups
Authors:
Zhaobing Fan,
Haitao Ma,
Husileng Xiao
Abstract:
Various constructions for quantum groups have been generalized to $\imath$-quantum groups. This generalization is called the $\imath$-program. In this paper, we fill in one part of the $\imath$-program. Namely, we provide an equivariant K-theory approach to $\imath$-quantum groups associated to the Satake diagram in \eqref{eq1}, which is the Langlands dual picture of that constructed in \cite{BKLW14}, where a geometric realization of the $\imath$-quantum group is provided by using perverse sheaves. As an application of the main results, we prove Li's conjecture \cite{L18} for the special cases with the Satake diagram in \eqref{eq1}.
Submitted 3 November, 2019;
originally announced November 2019.
-
A hybrid stochastic differential reinsurance and investment game with bounded memory
Authors:
Yanfei Bai,
Zhongbao Zhou,
Helu Xiao,
Rui Gao,
Feimin Zhong
Abstract:
This paper investigates a hybrid stochastic differential reinsurance and investment game between one reinsurer and two insurers, including a stochastic Stackelberg differential subgame and a non-zero-sum stochastic differential subgame. The reinsurer, as the leader of the Stackelberg game, can price reinsurance premium and invest its wealth in a financial market that contains a risk-free asset and a risky asset. The two insurers, as the followers of the Stackelberg game, can purchase proportional reinsurance from the reinsurer and invest in the same financial market. The competitive relationship between two insurers is modeled by the non-zero-sum game, and their decision making will consider the relative performance measured by the difference in their terminal wealth. We consider wealth processes with delay to characterize the bounded memory feature. This paper aims to find the equilibrium strategy for the reinsurer and insurers by maximizing the expected utility of the reinsurer's terminal wealth with delay and maximizing the expected utility of the combination of insurers' terminal wealth and the relative performance with delay. By using the idea of backward induction and the dynamic programming approach, we derive the equilibrium strategy and value functions explicitly. Then, we provide the corresponding verification theorem. Finally, some numerical examples and sensitivity analysis are presented to demonstrate the effects of model parameters on the equilibrium strategy. We find the delay factor discourages or stimulates investment depending on the length of delay. Moreover, competitive factors between two insurers make their optimal reinsurance-investment strategy interact, and reduce reinsurance demand and reinsurance premium price.
Submitted 22 October, 2019;
originally announced October 2019.
-
Statistical Robust Chinese Remainder Theorem for Multiple Numbers
Authors:
Hanshen Xiao,
Nan Du,
Zhikang T. Wang,
Guoqiang Xiao
Abstract:
The generalized Chinese Remainder Theorem (CRT) is a well-known approach to solving ambiguity-resolution problems. In this paper, we study the robust CRT reconstruction for multiple numbers from a statistical viewpoint. To the best of our knowledge, this is the first rigorous analysis of the underlying statistical model of CRT-based multiple parameter estimation. To address the problem, two novel approaches are established. One is to directly calculate a conditional maximum a posteriori probability (MAP) estimation of the residue clustering, and the other is based on a generalized wrapped Gaussian mixture model to iteratively search for MAP of both estimands and clustering. Residue error correcting codes are introduced to improve the robustness further. Experimental results show that the statistical schemes achieve much stronger robustness compared to state-of-the-art deterministic schemes, especially in heavy-noise scenarios.
Submitted 31 August, 2019;
originally announced September 2019.
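As background for the abstract above, the classical noise-free CRT for a single number can be sketched as follows. It is not the robust or statistical scheme the paper proposes; the function name `crt` is hypothetical and the moduli are assumed to be pairwise coprime. The example also shows why robustness matters: an error of 1 in a single residue moves the reconstruction far from the true value.

```python
from math import prod

def crt(residues, moduli):
    """Reconstruct x in [0, M) from x mod m_i, for pairwise coprime moduli m_i."""
    M = prod(moduli)
    x = 0
    for r, m in zip(residues, moduli):
        Mi = M // m
        # pow(Mi, -1, m) is the modular inverse of Mi modulo m (Python 3.8+).
        x += r * Mi * pow(Mi, -1, m)
    return x % M

exact = crt([2, 3, 2], [3, 5, 7])   # residues of 23
noisy = crt([2, 3, 3], [3, 5, 7])   # last residue off by one
```

Here `exact` is 23 while `noisy` is 38: a unit error in one residue shifts the reconstruction by 15, which is the sensitivity that robust CRT schemes are designed to control.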
-
Precise large deviation asymptotics for products of random matrices
Authors:
Hui Xiao,
Ion Grama,
Quansheng Liu
Abstract:
Let $(g_{n})_{n\geq 1}$ be a sequence of independent identically distributed $d\times d$ real random matrices with Lyapunov exponent $γ$. For any starting point $x$ on the unit sphere in $\mathbb R^d$, we deal with the norm $ | G_n x | $, where $G_{n}:=g_{n} \ldots g_{1}$. The goal of this paper is to establish precise asymptotics for large deviation probabilities $\mathbb P(\log | G_n x | \geq n(q+l))$, where $q>γ$ is fixed and $l$ is vanishing as $n\to \infty$. We study both invertible matrices and positive matrices and give analogous results for the couple $(X_n^x,\log | G_n x |)$ with target functions, where $X_n^x= G_n x /| G_n x |$. As applications we improve previous results on the large deviation principle for the matrix norm $\|G_n\|$ and obtain a precise local limit theorem with large deviations.
Submitted 4 July, 2019;
originally announced July 2019.
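The quantities in this abstract are easy to compute numerically. The sketch below accumulates $\log | G_n x |$ for $G_n = g_n \ldots g_1$ while renormalising the vector at every step, so that the running direction is $X_n^x = G_n x / | G_n x |$ and no overflow occurs; dividing the result by $n$ gives an empirical estimate of the Lyapunov exponent. The function name `log_norm_growth` is hypothetical, and the matrices are passed in as plain lists rather than sampled from any particular distribution.

```python
import math

def log_norm_growth(matrices, x):
    """Return (log|G_n x|, X_n) for G_n = g_n ... g_1 applied to the vector x.

    `matrices` lists g_1, g_2, ..., g_n in order of application.
    The vector is renormalised after each multiplication, and the logs of
    the successive norms are summed, which equals log|G_n x| when |x| = 1.
    """
    total = 0.0
    for g in matrices:
        x = [sum(g[i][j] * x[j] for j in range(len(x))) for i in range(len(g))]
        norm = math.sqrt(sum(c * c for c in x))
        total += math.log(norm)
        x = [c / norm for c in x]
    return total, x

# Deterministic check: repeating diag(2, 1) ten times from x = (1, 0)
# gives |G_n x| = 2**10, so the growth rate per step is log 2.
n = 10
log_norm, direction = log_norm_growth([[[2.0, 0.0], [0.0, 1.0]]] * n, [1.0, 0.0])
rate = log_norm / n
```

With independent random matrices in place of the fixed one, repeating the computation over many samples and counting how often `log_norm >= n * q` gives a crude Monte Carlo view of the large deviation probabilities studied in the paper, although such events become too rare to sample directly as `n` grows.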