-
Three quantities arising from Bézout's identity and resultants of integer polynomials
Authors:
Zhiqian Liu,
Xiaoting Li,
Wenheng Liu,
Min Sha
Abstract:
In this paper, we study three quantities arising naturally from Bézout's identity, the resultant and the reduced resultant of two non-zero coprime integer polynomials. We establish several new divisibility relations among them. We also pose two conjectures by making computations.
In this paper, we study three quantities arising naturally from Bézout's identity, the resultant and the reduced resultant of two non-zero coprime integer polynomials. We establish several new divisibility relations among them. We also pose two conjectures by making computations.
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
Stability of 2-soliton solutions for the modified Camassa-Holm equation with cubic nonlinearity
Authors:
Xijun Deng,
Stéphane Lafortune,
Zhisu Liu
Abstract:
In this paper, we are concerned with the stability of 2-soliton solutions for the modified Camassa-Holm equation with cubic nonlinearity. By employing conserved quantities in terms of the momentum variable $m$, we show that the 2-soliton, when regarded as a solution to the initial-value problem for the modified Camassa-Holm equation, is nonlinearly stable to perturbations with respect to the momen…
▽ More
In this paper, we are concerned with the stability of 2-soliton solutions for the modified Camassa-Holm equation with cubic nonlinearity. By employing conserved quantities in terms of the momentum variable $m$, we show that the 2-soliton, when regarded as a solution to the initial-value problem for the modified Camassa-Holm equation, is nonlinearly stable to perturbations with respect to the momentum variable in the Sobolev space $H^2$.
△ Less
Submitted 9 June, 2025;
originally announced June 2025.
-
Invariant measures for stochastic Burgers equation on unbounded domains
Authors:
Zhenxin Liu,
Zhiyuan Shi
Abstract:
In this paper, we investigate the stochastic damped Burgers equation with multiplicative noise defined on the entire real line. We demonstrate the existence and uniqueness of a mild solution to the stochastic damped Burgers equation and establish that the solution is uniformly bounded in time. Furthermore, by employing the uniform estimates on the tails of the solution, we obtain the tightness of…
▽ More
In this paper, we investigate the stochastic damped Burgers equation with multiplicative noise defined on the entire real line. We demonstrate the existence and uniqueness of a mild solution to the stochastic damped Burgers equation and establish that the solution is uniformly bounded in time. Furthermore, by employing the uniform estimates on the tails of the solution, we obtain the tightness of a family of probability distributions of the solution. Subsequently, by applying the Krylov-Bogolioubov theorem, we establish the existence of invariant measures.
△ Less
Submitted 8 June, 2025;
originally announced June 2025.
-
Improved Last-Iterate Convergence of Shuffling Gradient Methods for Nonsmooth Convex Optimization
Authors:
Zijian Liu,
Zhengyuan Zhou
Abstract:
We study the convergence of the shuffling gradient method, a popular algorithm employed to minimize the finite-sum function with regularization, in which functions are passed to apply (Proximal) Gradient Descent (GD) one by one whose order is determined by a permutation on the indices of functions. In contrast to its easy implementation and effective performance in practice, the theoretical unders…
▽ More
We study the convergence of the shuffling gradient method, a popular algorithm employed to minimize the finite-sum function with regularization, in which functions are passed to apply (Proximal) Gradient Descent (GD) one by one whose order is determined by a permutation on the indices of functions. In contrast to its easy implementation and effective performance in practice, the theoretical understanding remains limited. A recent advance by (Liu & Zhou, 2024b) establishes the first last-iterate convergence results under various settings, especially proving the optimal rates for smooth (strongly) convex optimization. However, their bounds for nonsmooth (strongly) convex functions are only as fast as Proximal GD. In this work, we provide the first improved last-iterate analysis for the nonsmooth case demonstrating that the widely used Random Reshuffle ($\textsf{RR}$) and Single Shuffle ($\textsf{SS}$) strategies are both provably faster than Proximal GD, reflecting the benefit of randomness. As an important implication, we give the first (nearly) optimal convergence result for the suffix average under the $\textsf{RR}$ sampling scheme in the general convex case, matching the lower bound shown by (Koren et al., 2022).
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
A hybrid PDE-ABM model for angiogenesis and tumour microenvironment with application to resistance in cancer treatment
Authors:
Louis Shuo Wang,
Jiguang Yu,
Zonghao Liu
Abstract:
The main obstacle to effective cancer treatment is the development of drug resistance, which can be divided into two categories: spontaneous and acquired drug resistance. Non-small cell lung cancer (NSCLC) is the main cause of cancer-related deaths worldwide. A subset of lung cancer, adenocarcinomas, is characterised by mutations in the epidermal growth factor receptor (EGFR) gene. Treatment of EG…
▽ More
The main obstacle to effective cancer treatment is the development of drug resistance, which can be divided into two categories: spontaneous and acquired drug resistance. Non-small cell lung cancer (NSCLC) is the main cause of cancer-related deaths worldwide. A subset of lung cancer, adenocarcinomas, is characterised by mutations in the epidermal growth factor receptor (EGFR) gene. Treatment of EGFR-mutated lung adenocarcinomas has become less effective over time due to drug resistance development, which is associated with a second mutation in the EGFR gene. An important factor in the development of cancer is angiogenesis, which is the formation of blood vessels from the existing vasculature. These newly formed blood vessels provide oxygen and nutrients to tumour cells to maintain tumour growth and proliferation. We applied a hybrid discrete-continuous (HDC) model to capture the dynamic vasculature in the tumour microenvironment (TME). In the case of pre-existing resistance, the formation of angiogenic networks creates a microenvironment that supports tumour survival and enhances drug resistance. In the case of spontaneous mutation-induced resistance, earlier and more frequent mutations confer a greater survival advantage to the tumour population. There is also a mutually reinforcing relationship between a high proliferation rate and high resistance characteristics. These findings explain two conflicting experimental results about the second mutation in NSCLC.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
The strong hull property for affine irreducible Coxeter groups of rank 3
Authors:
Ziming Liu
Abstract:
The conjecture proposed by Gaetz and Gao asserts that the Cayley graph of any Coxeter group possesses the strong hull property. This conjecture has been proved for symmetric groups, hyperoctahedral groups, all right-angled Coxeter groups, and computationally verified for finite Coxeter groups of types $D_4$, $F_4$, $G_2$, and $H_3$. This paper investigates all affine irreducible Coxeter groups of…
▽ More
The conjecture proposed by Gaetz and Gao asserts that the Cayley graph of any Coxeter group possesses the strong hull property. This conjecture has been proved for symmetric groups, hyperoctahedral groups, all right-angled Coxeter groups, and computationally verified for finite Coxeter groups of types $D_4$, $F_4$, $G_2$, and $H_3$. This paper investigates all affine irreducible Coxeter groups of rank 3, specifically those of affine types $\widetilde{A}_2$, $\widetilde{C}_2$, and $\widetilde{G}_2$. By employing key concepts from building theory, we develop novel techniques: first reducing and classifying the convex hull in their Cayley graphs into finitely many cases, then proving the strong hull conjecture for these cases through combinatorial computations. Notably, for the case of affine type $\widetilde{G}_2$, we streamline the proof strategy by reducing it to a corollary of results established for affine type $\widetilde{A}_2$. The reduction techniques developed in this study demonstrate potential for generalization. Their possible algebraic reformulation may not only provide new perspectives for further investigation of this conjecture but also offer methodological insights for algebraic combinatorics and geometric group theory.
△ Less
Submitted 23 May, 2025; v1 submitted 21 May, 2025;
originally announced May 2025.
-
Explicit quadratic large sieve inequality
Authors:
Zihao Liu
Abstract:
In this article, we obtain an explicit version of Heath-Brown's large sieve inequality for quadratic characters and discuss its applications to $L$-functions and quadratic fields.
In this article, we obtain an explicit version of Heath-Brown's large sieve inequality for quadratic characters and discuss its applications to $L$-functions and quadratic fields.
△ Less
Submitted 5 May, 2025;
originally announced May 2025.
-
Quasi-3D beam theory based on equilibrium stress definition and mixed element model for accurate analysis of functionally graded beams
Authors:
Wenxiong Li,
Zhiwei Liu,
Suiyin Chen,
Gengying Li,
Jianhua Wen
Abstract:
This paper presents a novel quasi-3D theory and the corresponding mixed beam element model to achieve accurate solutions for functionally graded beams. The key innovations include the development of equilibrium-based stress expressions, the modified cross-sectional stiffness matrix, and the mixed beam element model based on semi-analytical definition of internal force fields. In contrast to the co…
▽ More
This paper presents a novel quasi-3D theory and the corresponding mixed beam element model to achieve accurate solutions for functionally graded beams. The key innovations include the development of equilibrium-based stress expressions, the modified cross-sectional stiffness matrix, and the mixed beam element model based on semi-analytical definition of internal force fields. In contrast to the conventional quasi-3D theory where stress expressions are derived from constitutive equations and geometric relations, the stress expressions in this study are derived from the differential equilibrium equations among stresses, ensuring strict adherence of stress solutions to equilibrium conditions. To incorporate the influence of equilibrium-derived stress distributions, the modified cross-sectional stiffness matrix is derived, enhancing the theoretical and practical feasibility of the beam model. For beam element construction, the mixed variational principle of two-field variables is employed, with generalized internal forces and generalized displacements regarded as two independent fields. Especially, semi-analytical internal force fields, which partially satisfy the differential equilibrium equations, are introduced to improve the element performance. Numerical examples are conducted to verify the accuracy and effectiveness of the proposed theory and beam element.
△ Less
Submitted 25 May, 2025; v1 submitted 14 May, 2025;
originally announced May 2025.
-
Pattern formation using an intrinsic optimal control approach
Authors:
Tianhao Li,
Yibei Li,
Zhixin Liu,
Xiaoming Hu
Abstract:
This paper investigates a pattern formation control problem for a multi-agent system modeled with given interaction topology, in which $m$ of the $n$ agents are chosen as leaders and consequently a control signal is added to each of the leaders. These agents interact with each other by Laplacian dynamics on a graph. The pattern formation control problem is formulated as an intrinsic infinite time-…
▽ More
This paper investigates a pattern formation control problem for a multi-agent system modeled with given interaction topology, in which $m$ of the $n$ agents are chosen as leaders and consequently a control signal is added to each of the leaders. These agents interact with each other by Laplacian dynamics on a graph. The pattern formation control problem is formulated as an intrinsic infinite time-horizon linear quadratic optimal control problem, namely, no error information is incorporated in the objective function. Under mild conditions, we show the existence of the optimal control strategy and the convergence to the desired pattern formation. Based on the optimal control strategy, we propose a distributed control strategy to achieve the given pattern. Finally, numerical simulation is given to illustrate theoretical results.
△ Less
Submitted 2 May, 2025;
originally announced May 2025.
-
A complement of the Erdős-Hajnal problem on paths with equal-degree endpoints
Authors:
Zhen Liu,
Qinghou Zeng
Abstract:
Answering a question of Erdős and Hajnal, Chen and Ma proved that for all $n\geq600$ every graph with $2n + 1$ vertices and at least $n^2 + n+1$ edges contains two vertices of equal degree connected by a path of length three, and the complete bipartite graph $K_{n,n+1}$ shows that this edge bound is sharp. In this paper, we obtain the above result for all $n\ge2$, and thus resolve the question of…
▽ More
Answering a question of Erdős and Hajnal, Chen and Ma proved that for all $n\geq600$ every graph with $2n + 1$ vertices and at least $n^2 + n+1$ edges contains two vertices of equal degree connected by a path of length three, and the complete bipartite graph $K_{n,n+1}$ shows that this edge bound is sharp. In this paper, we obtain the above result for all $n\ge2$, and thus resolve the question of Erdős and Hajnal completely.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
Equating three degrees of graphs
Authors:
Zhen Liu,
Qinghou Zeng
Abstract:
In this paper, we prove that, for every graph with at least 5 vertices, one can delete at most 3 vertices such that the subgraph obtained has at least three vertices with the same degree. This solves an open problem of Caro, Shapira and Yuster [Electron. J. Combin. 21 (2014) P1.24].
In this paper, we prove that, for every graph with at least 5 vertices, one can delete at most 3 vertices such that the subgraph obtained has at least three vertices with the same degree. This solves an open problem of Caro, Shapira and Yuster [Electron. J. Combin. 21 (2014) P1.24].
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
Sharp bounds for the $\boldsymbol{p}$-adic $\boldsymbol{n}$-dimensional fractional Hardy operator and a class of integral operators on $\boldsymbol{p}$-adic function spaces
Authors:
Tianyang He,
Zhiwen Liu,
Ting Yu
Abstract:
In this paper, we first study the sharp weak estimate for the $p$-adic $n$-dimensional fractional Hardy operator from $L^p$ to $L^{q,\infty}$. Secondly, we study the sharp bounds for the $m$-linear $n$-dimensional $p$-adic integral operator with a kernel on $p$-adic weighted spaces $H_α^{\infty}( \mathbb{Q} _{p}^{n} )$. As an application, the sharp bounds for $p$-adic Hardy and Hilbert operators o…
▽ More
In this paper, we first study the sharp weak estimate for the $p$-adic $n$-dimensional fractional Hardy operator from $L^p$ to $L^{q,\infty}$. Secondly, we study the sharp bounds for the $m$-linear $n$-dimensional $p$-adic integral operator with a kernel on $p$-adic weighted spaces $H_α^{\infty}( \mathbb{Q} _{p}^{n} )$. As an application, the sharp bounds for $p$-adic Hardy and Hilbert operators on $p$-adic weighted spaces are obtained. Finally, we also find the sharp bound for the Hausdorff operator on $p$-adic weighted spaces, which generalizes the previous results.
△ Less
Submitted 27 April, 2025;
originally announced April 2025.
-
New Primal-Dual Algorithm for Convex Problems
Authors:
Shuning Liu,
Zexian Liu
Abstract:
Primal-dual algorithm (PDA) is a classic and popular scheme for convex-concave saddle point problems. It is universally acknowledged that the proximal terms in the subproblems about the primal and dual variables are crucial to the convergence theory and numerical performance of primal-dual algorithms. By taking advantage of the information from the current and previous iterative points, we exploit…
▽ More
Primal-dual algorithm (PDA) is a classic and popular scheme for convex-concave saddle point problems. It is universally acknowledged that the proximal terms in the subproblems about the primal and dual variables are crucial to the convergence theory and numerical performance of primal-dual algorithms. By taking advantage of the information from the current and previous iterative points, we exploit two new proximal terms for the subproblems about the primal and dual variables. Based on two new proximal terms, we present a new primal-dual algorithm for convex-concave saddle point problems with bilinear coupling terms and establish its global convergence and O(1/N ) ergodic convergence rate. When either the primal function or the dual function is strongly convex, we accelerate the above proposed algorithm and show that the corresponding algorithm can achieve O(1/N^2) convergence rate. Since the conditions for the stepsizes of the proposed algorithm are related directly to the spectral norm of the linear transform, which is difficult to obtain in some applications, we also introduce a linesearch strategy for the above proposed primal-dual algorithm and establish its global convergence and O(1/N ) ergodic convergence rate . Some numerical experiments are conducted on matrix game and LASSO problems by comparing with other state-of-the-art algorithms, which demonstrate the effectiveness of the proposed three primal-dual algorithms.
△ Less
Submitted 23 April, 2025;
originally announced April 2025.
-
Extreme Points of Base Polytope of Submodular Set Functions and Limit for Quotient Convergent Graph Sequence
Authors:
Yaobin Chen,
Zhicheng Liu,
Yihang Xiao,
Junchi Zhang
Abstract:
Submodular set functions are of great importance in mathematics and theoretical computer science, serving as fundamental tools in optimization, combinatorics, and economics due to their natural properties and wide-ranging applications. In 2023, Lovász systematically extended the theory of submodular set functions from finite sets to general set algebras and proposed several open problems about the…
▽ More
Submodular set functions are of great importance in mathematics and theoretical computer science, serving as fundamental tools in optimization, combinatorics, and economics due to their natural properties and wide-ranging applications. In 2023, Lovász systematically extended the theory of submodular set functions from finite sets to general set algebras and proposed several open problems about the behavior of submodular functions in infinite settings, including the characterization of extreme points of the base polytope of submodular set functions.
We characterize conditions under which the extreme points of the base polytope of a submodular function are restricting measures with respect to its majorizing measure. Applying this result, we characterize the core of increasing subadditive non-atomic games and provide a positive answer to a question of Kristóf Bérzi, Márton Borbényi, László Lovász and László Márton Tóth regarding the rank function for graphing's cycle matroid.
Furthermore, building on the limit theory for set functions, we prove that the limit of convergent sequence of bounded-degree graphs' cycle matroids can be represented as the cycle matroid of a graphing, analogous to the completeness result for local-global convergence.
△ Less
Submitted 20 April, 2025;
originally announced April 2025.
-
Logarithmic Crystalline Representations
Authors:
Zhenmou Liu,
Jinbang Yang,
Kang Zuo
Abstract:
In 1989, Faltings proved the comparison theorem between étale cohomology and crystalline cohomology by studying Fontaine-Faltings modules and crystalline representations. In his paper, he mentioned these modules and representations can be extended to the logarithmic context, but without detail. This note aims to explicitly present the construction of logarithmic Fontaine-Faltings modules and logar…
▽ More
In 1989, Faltings proved the comparison theorem between étale cohomology and crystalline cohomology by studying Fontaine-Faltings modules and crystalline representations. In his paper, he mentioned these modules and representations can be extended to the logarithmic context, but without detail. This note aims to explicitly present the construction of logarithmic Fontaine-Faltings modules and logarithmic crystalline representations.
△ Less
Submitted 19 April, 2025;
originally announced April 2025.
-
Colorings of symmetric unions and partial knots
Authors:
Ben Clingenpeel,
Zongzheng Dai,
Gabriel Diraviam,
Kareem Jaber,
Krishnendu Kar,
Ziyun Liu,
Teo Miklethun,
Haritha Nagampoozhy,
Michael Perry,
Moses Samuelson-Lynn,
Eli Seamans,
Ana Wright,
Nicole Xie,
Ruiqi Zou,
Alexander Zupan
Abstract:
Motivated by work of Kinoshita and Teraska, Lamm introduced the notion of a symmetric union, which can be constructed from a partial knot $J$ by introducing additional crossings to a diagram of $J \# -\!J$ along its axis of symmetry. If both $J$ and $J'$ are partial knots for different symmetric union presentations of the same ribbon knot $K$, the knots $J$ and $J'$ are said to be symmetrically re…
▽ More
Motivated by work of Kinoshita and Teraska, Lamm introduced the notion of a symmetric union, which can be constructed from a partial knot $J$ by introducing additional crossings to a diagram of $J \# -\!J$ along its axis of symmetry. If both $J$ and $J'$ are partial knots for different symmetric union presentations of the same ribbon knot $K$, the knots $J$ and $J'$ are said to be symmetrically related. Lamm proved that if $J$ and $J'$ are symmetrically related, then $\det J = \det J'$, asking whether the converse is true. In this article, we give a negative answer to Lamm's question, constructing for any natural number $m$ a family of $2^m$ knots with the same determinant but such that no two knots in the family are symmetrically related. This result is a corollary to our main theorem, that if $J$ is the partial knot in a symmetric union presentation for $K$, then $\text{col}_p(J) \leq \text{col}_p(K) \leq \frac{(\text{col}_p(J))^2}{2}$, where $\text{col}_p(\cdot )$ denotes the number of $p$-colorings of a knot.
△ Less
Submitted 11 April, 2025;
originally announced April 2025.
-
Data-driven robust UAV position estimation in GPS signal-challenged environment
Authors:
Shenglun Yi,
Xuebo Jin,
Zhengjie Wang,
Zhijun Liu,
Mattia Zorzi
Abstract:
In this paper, we consider a position estimation problem for an unmanned aerial vehicle (UAV) equipped with both proprioceptive sensors, i.e. IMU, and exteroceptive sensors, i.e. GPS and a barometer. We propose a data-driven position estimation approach based on a robust estimator which takes into account that the UAV model is affected by uncertainties and thus it belongs to an ambiguity set. We p…
▽ More
In this paper, we consider a position estimation problem for an unmanned aerial vehicle (UAV) equipped with both proprioceptive sensors, i.e. IMU, and exteroceptive sensors, i.e. GPS and a barometer. We propose a data-driven position estimation approach based on a robust estimator which takes into account that the UAV model is affected by uncertainties and thus it belongs to an ambiguity set. We propose an approach to learn this ambiguity set from the data.
△ Less
Submitted 10 April, 2025;
originally announced April 2025.
-
The finite basis problem for additively idempotent semirings of order four, III
Authors:
Miaomiao Ren,
Zexi Liu,
Mengya Yue,
Yizhi Chen
Abstract:
We study the finite basis problem for $4$-element additively idempotent semirings whose additive reducts have two minimal elements and one coatom. Up to isomorphism, there are $112$ such algebras. We show that $106$ of them are finitely based and the remaining ones are nonfinitely based.
We study the finite basis problem for $4$-element additively idempotent semirings whose additive reducts have two minimal elements and one coatom. Up to isomorphism, there are $112$ such algebras. We show that $106$ of them are finitely based and the remaining ones are nonfinitely based.
△ Less
Submitted 10 April, 2025;
originally announced April 2025.
-
A stacky approach to prismatic crystals via $q$-prism charts
Authors:
Zeyu Liu
Abstract:
Let $Y$ be a locally complete intersection over $\mathcal{O}_K$ containing a $p$-power root of unity $ζ_p$. We classify the derived category of prismatic crystals on the absolute prismatic site of $Y$ by studying quasi-coherent complexes on the prismatization of $Y$ via $q$-prism charts. We also develop a Galois descent mechanism to remove the assumption on $\mathcal{O}_K$. As an application, we c…
▽ More
Let $Y$ be a locally complete intersection over $\mathcal{O}_K$ containing a $p$-power root of unity $ζ_p$. We classify the derived category of prismatic crystals on the absolute prismatic site of $Y$ by studying quasi-coherent complexes on the prismatization of $Y$ via $q$-prism charts. We also develop a Galois descent mechanism to remove the assumption on $\mathcal{O}_K$. As an application, we classify quasi-coherent complexes on the Cartier-Witt stack and give a purely algebraic calculation of the cohomology of the structure sheaf on the absolute prismatic site of $\mathbb{Z}_p$. Along the way, for $Y$ a locally complete intersection over $\overline{A}$ with $A$ lying over a $q$-prism, we classify quasi-coherent complexes on the relative prismatization of $Y$.
△ Less
Submitted 13 April, 2025; v1 submitted 9 April, 2025;
originally announced April 2025.
-
Homogeneous linear recurrence relations of the determinants of distance matrices of trees
Authors:
Zhiqi Liu,
Hui Zhou
Abstract:
In 1971, by induction on $n$ and using a two-term linear recurrence relation, Graham and Pollak got a beautiful formula $$\det(D_n)=-(n-1)(-2)^{n-2}$$ on the determinant of distance matrix $D_n$ of a tree $T_n$ on $n$ vertices. The recurrence relations are very crucial when proving this formula by inductive method: in 2006, Yan and Yeh used two-term and three-term recurrence relations; in 2020, Du…
▽ More
In 1971, by induction on $n$ and using a two-term linear recurrence relation, Graham and Pollak got a beautiful formula $$\det(D_n)=-(n-1)(-2)^{n-2}$$ on the determinant of distance matrix $D_n$ of a tree $T_n$ on $n$ vertices. The recurrence relations are very crucial when proving this formula by inductive method: in 2006, Yan and Yeh used two-term and three-term recurrence relations; in 2020, Du and Yeh used a homogeneous linear three-term recurrence relation. In this paper, we analyze the subtree structure of the tree and find four-term, five-term, six-term and seven-term homogeneous linear recurrence relations on $\det(D_n)$, as a corollary new proofs of Graham and Pollak's formula can be given.
△ Less
Submitted 5 April, 2025;
originally announced April 2025.
-
Trajectory Optimization of Stochastic Systems under Chance Constraints via Set Erosion
Authors:
Zishun Liu,
Liqian Ma,
Yongxin Chen
Abstract:
We study the trajectory optimization problem under chance constraints for continuous-time stochastic systems. To address chance constraints imposed on the entire stochastic trajectory, we propose a framework based on the set erosion strategy, which converts the chance constraints into safety constraints on an eroded subset of the safe set along the corresponding deterministic trajectory. The depth…
▽ More
We study the trajectory optimization problem under chance constraints for continuous-time stochastic systems. To address chance constraints imposed on the entire stochastic trajectory, we propose a framework based on the set erosion strategy, which converts the chance constraints into safety constraints on an eroded subset of the safe set along the corresponding deterministic trajectory. The depth of erosion is captured by the probabilistic bound on the distance between the stochastic trajectory and its deterministic counterpart, for which we utilize a novel and sharp probabilistic bound developed recently. By adopting this framework, a deterministic control input sequence can be obtained, whose feasibility and performance are demonstrated through theoretical analysis. Our framework is compatible with various deterministic optimal control techniques, offering great flexibility and computational efficiency in a wide range of scenarios. To the best of our knowledge, our method provides the first scalable trajectory optimization scheme for high-dimensional stochastic systems under trajectory level chance constraints. We validate the proposed method through two numerical experiments.
△ Less
Submitted 6 April, 2025;
originally announced April 2025.
-
On the averaging theorems for stochastic perturbation of conservative linear systems
Authors:
Jing Guo,
Sergei Kuksin,
Zhenxin Liu
Abstract:
For stochastic perturbations of linear systems with non-zero pure imaginary spectrum we discuss the averaging theorems in terms of the slow-fast action-angle variables and in the sense of Krylov-Bogoliubov. Then we show that if the diffusion matrix of the perturbation is uniformly elliptic, then in all cases the averaged dynamics does not depend on a hamiltonian part of the perturbation.
For stochastic perturbations of linear systems with non-zero pure imaginary spectrum we discuss the averaging theorems in terms of the slow-fast action-angle variables and in the sense of Krylov-Bogoliubov. Then we show that if the diffusion matrix of the perturbation is uniformly elliptic, then in all cases the averaged dynamics does not depend on a hamiltonian part of the perturbation.
△ Less
Submitted 12 May, 2025; v1 submitted 6 April, 2025;
originally announced April 2025.
-
PARQ: Piecewise-Affine Regularized Quantization
Authors:
Lisa Jin,
Jianhao Ma,
Zechun Liu,
Andrey Gromov,
Aaron Defazio,
Lin Xiao
Abstract:
We develop a principled method for quantization-aware training (QAT) of large-scale machine learning models. Specifically, we show that convex, piecewise-affine regularization (PAR) can effectively induce the model parameters to cluster towards discrete values. We minimize PAR-regularized loss functions using an aggregate proximal stochastic gradient method (AProx) and prove that it has last-itera…
▽ More
We develop a principled method for quantization-aware training (QAT) of large-scale machine learning models. Specifically, we show that convex, piecewise-affine regularization (PAR) can effectively induce the model parameters to cluster towards discrete values. We minimize PAR-regularized loss functions using an aggregate proximal stochastic gradient method (AProx) and prove that it has last-iterate convergence. Our approach provides an interpretation of the straight-through estimator (STE), a widely used heuristic for QAT, as the asymptotic form of PARQ. We conduct experiments to demonstrate that PARQ obtains competitive performance on convolution- and transformer-based vision tasks.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
A New Proof of Sub-Gaussian Norm Concentration Inequality
Authors:
Zishun Liu,
Sam Power,
Yongxin Chen
Abstract:
We present a new method for proving the norm concentration inequality of sub-Gaussian variables. Our proof is based on an averaged version of the moment generating function, termed the averaged moment generating function. Our method applies to both vector cases to bound the vector norm and matrix cases to bound the operator norm. Compared with the widely adopted $\varepsilon$-net technique-based p…
▽ More
We present a new method for proving the norm concentration inequality of sub-Gaussian variables. Our proof is based on an averaged version of the moment generating function, termed the averaged moment generating function. Our method applies to both vector cases to bound the vector norm and matrix cases to bound the operator norm. Compared with the widely adopted $\varepsilon$-net technique-based proof of the sub-Gaussian norm concentration inequality, our method does not rely on the union bound and promises a tighter concentration bound.
△ Less
Submitted 9 May, 2025; v1 submitted 18 March, 2025;
originally announced March 2025.
-
Spectrally-Corrected and Regularized QDA Classifier for Spiked Covariance Model
Authors:
Wenya Luo,
Hua Li,
Zhidong Bai,
Zhijun Liu
Abstract:
Quadratic discriminant analysis (QDA) is a widely used method for classification problems, particularly preferable over Linear Discriminant Analysis (LDA) for heterogeneous data. However, QDA loses its effectiveness in high-dimensional settings, where the data dimension and sample size tend to infinity. To address this issue, we propose a novel QDA method utilizing spectral correction and regulari…
▽ More
Quadratic discriminant analysis (QDA) is a widely used method for classification problems, particularly preferable over Linear Discriminant Analysis (LDA) for heterogeneous data. However, QDA loses its effectiveness in high-dimensional settings, where the data dimension and sample size tend to infinity. To address this issue, we propose a novel QDA method utilizing spectral correction and regularization techniques, termed SR-QDA. The regularization parameters in our method are selected by maximizing the Fisher-discriminant ratio. We compare SR-QDA with QDA, regularized quadratic discriminant analysis (R-QDA), and several other competitors. The results indicate that SR-QDA performs exceptionally well, especially in moderate and high-dimensional situations. Empirical experiments across diverse datasets further support this conclusion.
△ Less
Submitted 17 March, 2025;
originally announced March 2025.
-
The Kolmogorov-Smirnov Statistic Revisited
Authors:
Elvis Han Cui,
Yihao Li,
Zhuang Liu
Abstract:
The Kolmogorov-Smirnov (KS) statistic is a classical nonparametric test widely used for comparing an empirical distribution function with a reference distribution or for comparing two empirical distributions. Despite its broad applicability in statistical hypothesis testing and model validation, certain aspects of the KS statistic remain under-explored among the young generation, particularly unde…
▽ More
The Kolmogorov-Smirnov (KS) statistic is a classical nonparametric test widely used for comparing an empirical distribution function with a reference distribution or for comparing two empirical distributions. Despite its broad applicability in statistical hypothesis testing and model validation, certain aspects of the KS statistic remain under-explored among the young generation, particularly under finite sample conditions. This paper revisits the KS statistic in both one-sample and two-sample scenarios, considering one-sided and two-sided variants. We derive exact probabilities for the supremum of the empirical process and present a unified treatment of the KS statistic under diverse settings. Additionally, we explore the discrete nature of the hitting times of the normalized empirical process, providing practical insights into the computation of KS test p-values. The study also discusses the Dvoretzky-Kiefer-Wolfowitz-Massart (DKWM) inequality, highlighting its role in constructing confidence bands for distribution functions. Using empirical process theory, we establish the limit distribution of the KS statistic when the true distribution includes unknown parameters. Our findings extend existing results, offering improved methodologies for statistical analysis and hypothesis testing using the KS statistic, particularly in finite sample scenarios.
△ Less
Submitted 27 February, 2025;
originally announced March 2025.
-
Linear, decoupled and positivity-preserving staggered mesh schemes for general dissipative systems with arbitrary energy distributions
Authors:
Zhengguang Liu,
Nan Zheng,
Xiaoli Li
Abstract:
In this paper, we develop a novel staggered mesh (SM) approach for general nonlinear dissipative systems with arbitrary energy distributions (including cases with known or unknown energy lower bounds). Based on this framework, we propose several second-order semi-discrete schemes that maintain linearity, computational decoupling, and unconditional energy stability. Firstly, for dissipative systems…
▽ More
In this paper, we develop a novel staggered mesh (SM) approach for general nonlinear dissipative systems with arbitrary energy distributions (including cases with known or unknown energy lower bounds). Based on this framework, we propose several second-order semi-discrete schemes that maintain linearity, computational decoupling, and unconditional energy stability. Firstly, for dissipative systems with known energy lower bounds, we introduce a positive auxiliary variable $V(t)$ to substitute the total energy functional, subsequently discretizing it on staggered temporal meshes to ensure that the energy remains non-increasing regardless of the size of time step. The newly developed schemes achieve full computational decoupling, maintaining essentially the same computational expense as conventional implicit-explicit methods while demonstrating significantly improved accuracy. Furthermore, we rigorously establish the positivity preservation of the discrete variable $V^{n+1/2}$ which is a crucial property ensuring numerical stability and accuracy. Theoretical analysis confirms second-order temporal convergence for the proposed SM schemes. Secondly, for dissipative systems lacking well-defined energy lower bounds, we devise an alternative auxiliary variable formulation and extend the SM framework to maintain unconditional energy stability while preserving numerical effectiveness and accuracy. Finally, comprehensive numerical experiments, including benchmark problem simulations, validate the proposed schemes' efficacy and demonstrate their superior performance characteristics.
△ Less
Submitted 14 March, 2025;
originally announced March 2025.
-
A Unified Dual Consensus Approach to Distributed Optimization with Globally-Coupled Constraints
Authors:
Zixuan Liu,
Xuyang Wu,
Dandan Wang,
Jie Lu
Abstract:
This article explores distributed convex optimization with globally-coupled constraints, where the objective function is a general nonsmooth convex function, the constraints include nonlinear inequalities and affine equalities, and the feasible region is possibly unbounded. To address such problems, a unified DUal Consensus Algorithm (DUCA) and its proximal variant (Pro-DUCA) are proposed, which a…
▽ More
This article explores distributed convex optimization with globally-coupled constraints, where the objective function is a general nonsmooth convex function, the constraints include nonlinear inequalities and affine equalities, and the feasible region is possibly unbounded. To address such problems, a unified DUal Consensus Algorithm (DUCA) and its proximal variant (Pro-DUCA) are proposed, which are unified frameworks that approximate the method of multipliers applied to the corresponding dual problem in no need of a closed-form dual objective. With varied parameter settings, DUCA and Pro-DUCA not only extend a collection of existing consensus optimization methods to solve the dual problem that they used to be inapplicable to, but also aid in offering new efficient algorithms to the literature. The proposed unified algorithms are shown to achieve $O(1/k)$ convergence rates in terms of optimality and feasibility, providing new or enhanced convergence results for a number of existing methods. Simulations demonstrate that these algorithms outperform several state-of-the-art alternatives in terms of objective and feasibility errors.
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
Safety Control of Impulsive Systems with Control Barrier Functions and Adaptive Gains
Authors:
Zihan Liu,
Yuan-Hua Ni
Abstract:
This paper addresses the safety challenges in impulsive systems, where abrupt state jumps introduce significant complexities into system dynamics. A unified framework is proposed by integrating Quadratic Programming (QP), Control Barrier Functions (CBFs), and adaptive gain mechanisms to ensure system safety during impulsive events. The CBFs are constructed to enforce safety constraints by capturin…
▽ More
This paper addresses the safety challenges in impulsive systems, where abrupt state jumps introduce significant complexities into system dynamics. A unified framework is proposed by integrating Quadratic Programming (QP), Control Barrier Functions (CBFs), and adaptive gain mechanisms to ensure system safety during impulsive events. The CBFs are constructed to enforce safety constraints by capturing the system's continuous dynamics and the effects of impulsive state transitions. An adaptive gain mechanism dynamically adjusts control inputs based on the magnitudes of the impulses and the system's proximity to safety boundaries, maintaining safety during instantaneous state jumps. A tailored QP formulation incorporates CBFs constraints and adaptive gain adjustments, optimizing control inputs while ensuring compliance with safety-critical requirements. Theoretical analysis establishes the boundedness, continuity, and feasibility of the adaptive gain and the overall framework. The effectiveness of the method is demonstrated through simulations on a robotic manipulator, showcasing its practical applicability to impulsive systems with state jumps.
△ Less
Submitted 9 April, 2025; v1 submitted 13 March, 2025;
originally announced March 2025.
-
Optimizing AUV speed dynamics with a data-driven Koopman operator approach
Authors:
Zhiliang Liu,
Xin Zhao,
Peng Cai,
Bing Cong
Abstract:
Autonomous Underwater Vehicles (AUVs) play an essential role in modern ocean exploration, and their speed control systems are fundamental
to their efficient operation. Like many other robotic systems, AUVs exhibit multivariable nonlinear dynamics and face various constraints,
including state limitations, input constraints, and constraints on the increment input, making controller design challe…
▽ More
Autonomous Underwater Vehicles (AUVs) play an essential role in modern ocean exploration, and their speed control systems are fundamental
to their efficient operation. Like many other robotic systems, AUVs exhibit multivariable nonlinear dynamics and face various constraints,
including state limitations, input constraints, and constraints on the increment input, making controller design challenging
and requiring significant effort and time. This paper addresses these challenges by employing a data-driven Koopman operator theory combined
with Model Predictive Control (MPC), which takes into account the aforementioned constraints. The proposed approach not only ensures
the performance of the AUV under state and input limitations but also considers the variation in incremental input to prevent
rapid and potentially damaging changes to the vehicle's operation. Additionally, we develop a platform based on ROS2 and Gazebo
to validate the effectiveness of the proposed algorithms, providing new control strategies for underwater vehicles against the complex and dynamic nature of underwater environments.
△ Less
Submitted 11 March, 2025;
originally announced March 2025.
-
Backward Stochastic Differential Equations-guided Generative Model for Structural-to-functional Neuroimage Translator
Authors:
Zengjing Chen,
Lu Wang,
Yongkang Lin,
Jie Peng,
Zhiping Liu,
Jie Luo,
Bao Wang,
Yingchao Liu,
Nazim Haouchine,
Xu Qiao
Abstract:
A Method for structural-to-functional neuroimage translator
A Method for structural-to-functional neuroimage translator
△ Less
Submitted 23 February, 2025;
originally announced March 2025.
-
Notes on eventual continuity and ergodicity for SPDEs
Authors:
Ziyu Liu
Abstract:
These notes present an alternative approach to the asymptotic stability of stochastic partial differential equations driven by multiplicative noise, applicable to a wide range of dissipative systems. The method builds on general criteria established in \cite{GLLL2024b,L2023}, utilizing the eventual continuity and generalized coupling techniques.
These notes present an alternative approach to the asymptotic stability of stochastic partial differential equations driven by multiplicative noise, applicable to a wide range of dissipative systems. The method builds on general criteria established in \cite{GLLL2024b,L2023}, utilizing the eventual continuity and generalized coupling techniques.
△ Less
Submitted 12 March, 2025;
originally announced March 2025.
-
Enhanced Koopman Operator Approximation for Nonlinear Systems Using Broading Learning System
Authors:
Yangjun Sun,
Zhiliang Liu
Abstract:
Traditional control methods often show limitations in dealing with complex nonlinear systems, especially when it is difficult to accurately obtain the exact system model, and the control accuracy and stability are difficult to guarantee. To solve this problem, the Koopman operator theory provides an effective method to linearise nonlinear systems, which simplifies the analysis and control of the s…
▽ More
Traditional control methods often show limitations in dealing with complex nonlinear systems, especially when it is difficult to accurately obtain the exact system model, and the control accuracy and stability are difficult to guarantee. To solve this problem, the Koopman operator theory provides an effective method to linearise nonlinear systems, which simplifies the analysis and control of the system by mapping the nonlinear dynamics into a high-dimensional space. However, the existing extended dynamical mode decomposition (EDMD) methods suffer from randomness in the selection of basis functions, which leads to bias in the finite-dimensional approximation to the Koopman operator, thus affecting the accuracy of model prediction. To solve this problem, this paper proposes a BLS-EDMD method based on the Broad learning system (BLS) network. The method achieves a high-precision approximation to the Koopman operator by learning more accurate basis functions, which significantly improves the prediction ability of the model. Building on this, we further develop a model predictive controller (MPC) called BE-MPC. This controller directly utilises the high-dimensional and high-precision predictors generated by BLS-EDMD to predict the system state more accurately, thus achieving precise control of the underwater unmanned vehicle (UUV), and its effectiveness is verified by simulation.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Safety Verification of Nonlinear Stochastic Systems via Probabilistic Tube
Authors:
Zishun Liu,
Saber Jafarpour,
Yongxin Chen
Abstract:
We address the problem of safety verification for nonlinear stochastic systems, specifically the task of certifying that system trajectories remain within a safe set with high probability. To tackle this challenge, we adopt a set-erosion strategy, which decouples the effects of stochastic disturbances from deterministic dynamics. This approach converts the stochastic safety verification problem on…
▽ More
We address the problem of safety verification for nonlinear stochastic systems, specifically the task of certifying that system trajectories remain within a safe set with high probability. To tackle this challenge, we adopt a set-erosion strategy, which decouples the effects of stochastic disturbances from deterministic dynamics. This approach converts the stochastic safety verification problem on a safe set into a deterministic safety verification problem on an eroded subset of the safe set. The success of this strategy hinges on the depth of erosion, which is determined by a probabilistic tube that bounds the deviation of stochastic trajectories from their corresponding deterministic trajectories. Our main contribution is the establishment of a tight bound for the probabilistic tube of nonlinear stochastic systems. To obtain a probabilistic bound for stochastic trajectories, we adopt a martingale-based approach. The core innovation lies in the design of a novel energy function associated with the averaged moment generating function, which forms an affine martingale, a generalization of the traditional c-martingale. Using this energy function, we derive a precise bound for the probabilistic tube. Furthermore, we enhance this bound by incorporating the union-bound inequality for strictly contractive dynamics. By integrating the derived probabilistic tubes into the set-erosion strategy, we demonstrate that the safety verification problem for nonlinear stochastic systems can be reduced to a deterministic safety verification problem. Our theoretical results are validated through applications in reachability-based safety verification and safe controller synthesis, accompanied by several numerical examples that illustrate their effectiveness.
△ Less
Submitted 5 March, 2025;
originally announced March 2025.
-
Khintchine inequalities and $Z_2$ sets
Authors:
Chian Yeong Chuah,
Zhen-Chuan Liu,
Tao Mei
Abstract:
In this paper we show that the subset of integers that satisfies the Khintchine inequality for $p=1$ with the optimal constant ${\sqrt{2}}$ has to be a $Z_2$ set. We further prove a similar result for a large class of discrete groups. Our arguments rely on previous works by Haagerup/Musat \cite{Haagerup2007}, and Haagerup/Itoh \cite{Haagerup1995}.
In this paper we show that the subset of integers that satisfies the Khintchine inequality for $p=1$ with the optimal constant ${\sqrt{2}}$ has to be a $Z_2$ set. We further prove a similar result for a large class of discrete groups. Our arguments rely on previous works by Haagerup/Musat \cite{Haagerup2007}, and Haagerup/Itoh \cite{Haagerup1995}.
△ Less
Submitted 5 March, 2025;
originally announced March 2025.
-
Adaptive monotonicity testing in sublinear time
Authors:
Housen Li,
Zhi Liu,
Axel Munk
Abstract:
Modern large-scale data analysis increasingly faces the challenge of achieving computational efficiency as well as statistical accuracy, as classical statistically efficient methods often fall short in the first regard. In the context of testing monotonicity of a regression function, we propose FOMT (Fast and Optimal Monotonicity Test), a novel methodology tailored to meet these dual demands. FOMT…
▽ More
Modern large-scale data analysis increasingly faces the challenge of achieving computational efficiency as well as statistical accuracy, as classical statistically efficient methods often fall short in the first regard. In the context of testing monotonicity of a regression function, we propose FOMT (Fast and Optimal Monotonicity Test), a novel methodology tailored to meet these dual demands. FOMT employs a sparse collection of local tests, strategically generated at random, to detect violations of monotonicity scattered throughout the domain of the regression function. This sparsity enables significant computational efficiency, achieving sublinear runtime in most cases, and quasilinear runtime (i.e., linear up to a log factor) in the worst case. In contrast, existing statistically optimal tests typically require at least quadratic runtime. FOMT's statistical accuracy is achieved through the precise calibration of these local tests and their effective combination, ensuring both sensitivity to violations and control over false positives. More precisely, we show that FOMT separates the null and alternative hypotheses at minimax optimal rates over Hölder function classes of smoothness order in $(0,2]$. Further, when the smoothness is unknown, we introduce an adaptive version of FOMT, based on a modified Lepskii principle, which attains statistical optimality and meanwhile maintains the same computational complexity as if the intrinsic smoothness were known. Extensive simulations confirm the competitiveness and effectiveness of both FOMT and its adaptive variant.
△ Less
Submitted 30 March, 2025; v1 submitted 4 March, 2025;
originally announced March 2025.
-
On the Realized Joint Laplace Transform of Volatilities with Application to Test the Volatility Dependence
Authors:
XinWei Feng,
Yu Jiang,
Zhi Liu,
Zhe Meng
Abstract:
In this paper, we first investigate the estimation of the empirical joint Laplace transform of volatilities of two semi-martingales within a fixed time interval [0, T] by using overlapped increments of high-frequency data. The proposed estimator is robust to the presence of finite variation jumps in price processes. The related functional central limit theorem for the proposed estimator has been e…
▽ More
In this paper, we first investigate the estimation of the empirical joint Laplace transform of volatilities of two semi-martingales within a fixed time interval [0, T] by using overlapped increments of high-frequency data. The proposed estimator is robust to the presence of finite variation jumps in price processes. The related functional central limit theorem for the proposed estimator has been established. Compared with the estimator with non-overlapped increments, the estimator with overlapped increments improves the asymptotic estimation efficiency. Moreover, we study the asymptotic theory of estimator under a long-span setting and employ it to create a feasible test for the dependence between volatilities. Finally, simulation and empirical studies demonstrate the performance of proposed estimators.
△ Less
Submitted 4 March, 2025;
originally announced March 2025.
-
Infinite-dimensional Extension of the Linear Combination of Hamiltonian Simulation: Theorems and Applications
Authors:
Rundi Lu,
Hao-En Li,
Zhengwei Liu,
Jin-Peng Liu
Abstract:
We generalize the Linear Combination of Hamiltonian Simulation (LCHS) formula [An, Liu, Lin, Phys. Rev. Lett. 2023] to simulate time-evolution operators in infinite-dimensional spaces, including scenarios involving unbounded operators. This extension, named Inf-LCHS for short, bridges the gap between finite-dimensional quantum simulations and the broader class of infinite-dimensional quantum dynam…
▽ More
We generalize the Linear Combination of Hamiltonian Simulation (LCHS) formula [An, Liu, Lin, Phys. Rev. Lett. 2023] to simulate time-evolution operators in infinite-dimensional spaces, including scenarios involving unbounded operators. This extension, named Inf-LCHS for short, bridges the gap between finite-dimensional quantum simulations and the broader class of infinite-dimensional quantum dynamics governed by partial differential equations (PDEs). Furthermore, we propose two sampling methods by integrating the infinite-dimensional LCHS with Gaussian quadrature schemes (Inf-LCHS-Gaussian) or Monte Carlo integration schemes (Inf-LCHS-MC). We demonstrate the applicability of the Inf-LCHS theorem to a wide range of non-Hermitian dynamics, including linear parabolic PDEs, queueing models (birth-or-death processes), Schrödinger equations with complex potentials, Lindblad equations, and black hole thermal field equations. Our analysis provides insights into simulating general linear dynamics using a finite number of quantum dynamics and includes cost estimates for the corresponding quantum algorithms.
△ Less
Submitted 10 March, 2025; v1 submitted 26 February, 2025;
originally announced February 2025.
-
Kissing polytopes in dimension 3
Authors:
Antoine Deza,
Zhongyuan Liu,
Lionel Pournin
Abstract:
It is shown that the smallest possible distance between two disjoint lattice polytopes contained in the cube $[0,k]^3$ is exactly $$ \frac{1}{\sqrt{2(2k^2-4k+5)(2k^2-2k+1)}} $$ for every integer $k$ at least $4$. The proof relies on modeling this as a minimization problem over a subset of the lattice points in the hypercube $[-k,k]^9$. A precise characterization of this subset allows to reduce the…
▽ More
It is shown that the smallest possible distance between two disjoint lattice polytopes contained in the cube $[0,k]^3$ is exactly $$ \frac{1}{\sqrt{2(2k^2-4k+5)(2k^2-2k+1)}} $$ for every integer $k$ at least $4$. The proof relies on modeling this as a minimization problem over a subset of the lattice points in the hypercube $[-k,k]^9$. A precise characterization of this subset allows to reduce the problem to computing the roots of a finite number of degree at most $4$ polynomials, which is done using symbolic computation.
△ Less
Submitted 26 February, 2025;
originally announced February 2025.
-
Geometric Ergodicity and Optimal Error Estimates for a Class of Novel Tamed Schemes to Super-linear Stochastic PDEs
Authors:
Zhihui Liu,
Jie Shen
Abstract:
We construct a class of novel tamed schemes that can preserve the original Lyapunov functional for super-linear stochastic PDEs (SPDEs), including the stochastic Allen--Cahn equation, driven by multiplicative or additive noise, and provide a rigorous analysis of their long-time unconditional stability. We also show that the corresponding Galerkin-based fully discrete tamed schemes inherit the geom…
▽ More
We construct a class of novel tamed schemes that can preserve the original Lyapunov functional for super-linear stochastic PDEs (SPDEs), including the stochastic Allen--Cahn equation, driven by multiplicative or additive noise, and provide a rigorous analysis of their long-time unconditional stability. We also show that the corresponding Galerkin-based fully discrete tamed schemes inherit the geometric ergodicity of the SPDEs and establish their convergence towards the SPDEs with optimal strong rates in both the multiplicative and additive noise cases.
△ Less
Submitted 26 February, 2025;
originally announced February 2025.
-
Planar graphs without 4-, 7-, 9-cycles and 5-cycles normally adjacent to 3-cycles
Authors:
Zhengjiao Liu,
Tao Wang,
Xiaojing Yang
Abstract:
A graph is \emph{$(\mathcal{I}, \mathcal{F})$-partitionable} if its vertex set can be partitioned into two parts such that one part $\mathcal{I}$ is an independent set, and the other $\mathcal{F}$ induces a forest. A graph is \emph{$k$-degenerate} if every subgraph $H$ contains a vertex of degree at most $k$ in $H$. Bernshteyn and Lee defined a generalization of $k$-degenerate graphs, which is cal…
▽ More
A graph is \emph{$(\mathcal{I}, \mathcal{F})$-partitionable} if its vertex set can be partitioned into two parts such that one part $\mathcal{I}$ is an independent set, and the other $\mathcal{F}$ induces a forest. A graph is \emph{$k$-degenerate} if every subgraph $H$ contains a vertex of degree at most $k$ in $H$. Bernshteyn and Lee defined a generalization of $k$-degenerate graphs, which is called \emph{weakly $k$-degenerate}. In this paper, we show that planar graphs without $4$-, $7$-, $9$-cycles, and $5$-cycles normally adjacent to $3$-cycles are both $(\mathcal{I}, \mathcal{F})$-partitionable and weakly $2$-degenerate.
△ Less
Submitted 25 February, 2025;
originally announced February 2025.
-
Distributed Nash Equilibrium Seeking for Constrained Aggregative Games over Jointly Connected and Weight-Balanced Switching Networks
Authors:
Zhaocong Liu,
Jie Huang
Abstract:
The property of the communication network and the constraints on the strategic space are two factors that determine the complexity of the distributed Nash equilibrium (DNE) seeking problem. The DNE seeking problem of aggregative games has been studied for unconstrained case over all types of communication networks and for various types of constrained games over static and connected communication n…
▽ More
The property of the communication network and the constraints on the strategic space are two factors that determine the complexity of the distributed Nash equilibrium (DNE) seeking problem. The DNE seeking problem of aggregative games has been studied for unconstrained case over all types of communication networks and for various types of constrained games over static and connected communication networks. In this paper, we investigate the DNE seeking problem for constrained aggregative games over jointly connected and weight-balanced switching networks, which can be directed and disconnected at every time instant. By integrating the projected gradient technique and the dynamic average consensus algorithm, we convert our problem to the stability problem of a well-defined time-varying nonlinear system. By constructing a time-varying Lyapunov's function candidate for this time-varying nonlinear system, we conduct a rigorous Lyapunov's analysis to conclude the exponential stability of this system and hence solve our problem.
△ Less
Submitted 25 February, 2025;
originally announced February 2025.
-
Polygraphic resolutions for operated algebras
Authors:
Zuan Liu,
Philippe Malbos
Abstract:
This paper introduces the structure of operated polygraphs as a categorical model for rewriting in operated algebras, generalizing Gröbner-Shirshov bases with non-monomial termination orders. We provide a combinatorial description of critical branchings of operated polygraphs using the structure of polyautomata that we introduce in this paper. Polyautomata extend linear polygraphs equipped with an…
▽ More
This paper introduces the structure of operated polygraphs as a categorical model for rewriting in operated algebras, generalizing Gröbner-Shirshov bases with non-monomial termination orders. We provide a combinatorial description of critical branchings of operated polygraphs using the structure of polyautomata that we introduce in this paper. Polyautomata extend linear polygraphs equipped with an operator structure formalized by a pushdown automaton. We show how to construct polygraphic resolutions of free operated algebras from their confluent and terminating presentations. Finally, we apply our constructions to several families of operated algebras, including Rota-Baxter algebras, differential algebras, and differential Rota-Baxter algebras.
△ Less
Submitted 16 April, 2025; v1 submitted 22 February, 2025;
originally announced February 2025.
-
Efficient Over-parameterized Matrix Sensing from Noisy Measurements via Alternating Preconditioned Gradient Descent
Authors:
Zhiyu Liu,
Zhi Han,
Yandong Tang,
Shaojie Tang,
Yao Wang
Abstract:
We consider the noisy matrix sensing problem in the over-parameterization setting, where the estimated rank $r$ is larger than the true rank $r_\star$ of the target matrix $X_\star$. Specifically, our main objective is to recover a matrix $ X_\star \in \mathbb{R}^{n_1 \times n_2} $ with rank $ r_\star $ from noisy measurements using an over-parameterized factorization $ LR^\top $, where…
▽ More
We consider the noisy matrix sensing problem in the over-parameterization setting, where the estimated rank $r$ is larger than the true rank $r_\star$ of the target matrix $X_\star$. Specifically, our main objective is to recover a matrix $ X_\star \in \mathbb{R}^{n_1 \times n_2} $ with rank $ r_\star $ from noisy measurements using an over-parameterized factorization $ LR^\top $, where $ L \in \mathbb{R}^{n_1 \times r}, \, R \in \mathbb{R}^{n_2 \times r} $ and $ \min\{n_1, n_2\} \ge r > r_\star $, with $ r_\star $ being unknown. Recently, preconditioning methods have been proposed to accelerate the convergence of matrix sensing problem compared to vanilla gradient descent, incorporating preconditioning terms $ (L^\top L + λI)^{-1} $ and $ (R^\top R + λI)^{-1} $ into the original gradient. However, these methods require careful tuning of the damping parameter $λ$ and are sensitive to step size. To address these limitations, we propose the alternating preconditioned gradient descent (APGD) algorithm, which alternately updates the two factor matrices, eliminating the need for the damping parameter $λ$ and enabling faster convergence with larger step sizes. We theoretically prove that APGD convergences to a near-optimal error at a linear rate. We further show that APGD can be extended to deal with other low-rank matrix estimation tasks, also with a theoretical guarantee of linear convergence. To validate the effectiveness and scalability of the proposed APGD, we conduct simulated and real-world experiments on a wide range of low-rank estimation problems, including noisy matrix sensing, weighted PCA, 1-bit matrix completion, and matrix completion. The extensive results demonstrate that APGD consistently achieves the fastest convergence and the lowest computation time compared to the existing alternatives.
△ Less
Submitted 31 May, 2025; v1 submitted 1 February, 2025;
originally announced February 2025.
-
Double EPW cubes from twisted cubics on Gushel-Mukai fourfolds
Authors:
Soheyla Feyzbakhsh,
Hanfei Guo,
Zhiyu Liu,
Shizhuo Zhang
Abstract:
In this paper, we conduct the first systematic investigation of twisted cubics on Gushel-Mukai (GM) fourfolds. We then study the double EPW cube, a 6-dimensional hyperkähler manifold associated with a general GM fourfold $X$, through the Bridgeland moduli space, and show that it is the maximal rationally connected (MRC) quotient of the Hilbert scheme of twisted cubics on $X$. We also prove that a…
▽ More
In this paper, we conduct the first systematic investigation of twisted cubics on Gushel-Mukai (GM) fourfolds. We then study the double EPW cube, a 6-dimensional hyperkähler manifold associated with a general GM fourfold $X$, through the Bridgeland moduli space, and show that it is the maximal rationally connected (MRC) quotient of the Hilbert scheme of twisted cubics on $X$. We also prove that a general double EPW cube admits a covering by Lagrangian subvarieties constructed from the Hilbert schemes of twisted cubics on GM threefolds, which provides a new example for a conjecture of O'Grady.
△ Less
Submitted 22 January, 2025;
originally announced January 2025.
-
FOCUS: First Order Concentrated Updating Scheme
Authors:
Yizhou Liu,
Ziming Liu,
Jeff Gore
Abstract:
Large language models (LLMs) demonstrate remarkable performance, and improving their pre-training process appears to be key to enhancing their capabilities further. Based on the documented success of Adam, learning rate decay, and weight decay, we hypothesize that the pre-training loss landscape features a narrowing valley structure. Through experiments with synthetic loss functions, we discover t…
▽ More
Large language models (LLMs) demonstrate remarkable performance, and improving their pre-training process appears to be key to enhancing their capabilities further. Based on the documented success of Adam, learning rate decay, and weight decay, we hypothesize that the pre-training loss landscape features a narrowing valley structure. Through experiments with synthetic loss functions, we discover that when gradient query noise is high relative to the valley's sharpness, Adam's performance falls behind that of Signum because Adam reduces the effective step size too drastically. This observation led us to develop FOCUS, an optimizer that enhances Signum by incorporating attraction toward moving averaged parameters, allowing it to handle noise better while maintaining larger step sizes. In training GPT-2, FOCUS proves to be more stable than Signum and faster than Adam. These results suggest that gradient noise may be an underappreciated limiting factor in LLM training, and FOCUS offers promising solutions.
△ Less
Submitted 21 January, 2025;
originally announced January 2025.
-
A trajectorial approach to the gradient flow of McKean-Vlasov SDEs with mobility
Authors:
Zhenxin Liu,
Xuewei Wang
Abstract:
We establish the gradient flow representation of diffusion with mobility $b$ with respect to the modified Wasserstein quasi-metric $W_h$, where $h(r)=rb(r)$. The appropriate selection of the free energy functional depends on the specific form of the generalized entropy. Different from the JKO scheme, we derive the trajectorial version of the relative entropy dissipation identity for the McKean-Vla…
▽ More
We establish the gradient flow representation of diffusion with mobility $b$ with respect to the modified Wasserstein quasi-metric $W_h$, where $h(r)=rb(r)$. The appropriate selection of the free energy functional depends on the specific form of the generalized entropy. Different from the JKO scheme, we derive the trajectorial version of the relative entropy dissipation identity for the McKean-Vlasov stochastic differential equation (SDE) with Nemytskii-type coefficients, utilizing techniques from stochastic analysis. Based on this, we demonstrate that the trajectorial average of the solution process to the McKean-Vlasov SDE, with respect to the underlying measure, corresponds to the rate of dissipation of the free energy. As an application, we present the energy dissipation of the Fermi-Dirac-Fokker-Planck equation, a model widely used in physics and biology to describe saturation effects. Inspired by numerical simulations, we propose two questions on condensation phenomena and non-exponential convergence rate.
△ Less
Submitted 21 January, 2025;
originally announced January 2025.
-
Derived-natural automorphisms on Hilbert schemes of points on generic K3 surfaces
Authors:
Ziqi Liu
Abstract:
The article revisits birational and biregular automorphisms of the Hilbert scheme of points on a K3 surface from the perspective of derived categories. Under the assumption that the K3 surface is generic, the birational and biregular involutions induced by autoequivalences on the derived category of the underlying K3 surface are characterized.
The article revisits birational and biregular automorphisms of the Hilbert scheme of points on a K3 surface from the perspective of derived categories. Under the assumption that the K3 surface is generic, the birational and biregular involutions induced by autoequivalences on the derived category of the underlying K3 surface are characterized.
△ Less
Submitted 12 May, 2025; v1 submitted 15 January, 2025;
originally announced January 2025.
-
The Neumann problem for a class of Hessian quotient type equations
Authors:
Jiabao Gong,
Zixuan Liu,
Qiang Tu
Abstract:
In this paper, we consider the Neumann problem for a class of Hessian quotient equations involving a gradient term on the right-hand side in Euclidean space. More precisely, we derive the interior gradient estimates for the $(Λ, k)$-convex solution of Hessian quotient equation $\frac{σ_k(Λ(D^2 u))}{σ_l(Λ(D^2 u))}=ψ(x,u,D u)$ with $0\leq l<k\leq C^{p-1}_{n-1}$ under the assumption of the growth con…
▽ More
In this paper, we consider the Neumann problem for a class of Hessian quotient equations involving a gradient term on the right-hand side in Euclidean space. More precisely, we derive the interior gradient estimates for the $(Λ, k)$-convex solution of Hessian quotient equation $\frac{σ_k(Λ(D^2 u))}{σ_l(Λ(D^2 u))}=ψ(x,u,D u)$ with $0\leq l<k\leq C^{p-1}_{n-1}$ under the assumption of the growth condition. As an application, we obtain the global a priori estimates and the existence theorem for the Neumann problem of this Hessian quotient type equation.
△ Less
Submitted 9 January, 2025;
originally announced January 2025.
-
Multi-step Inertial Accelerated Doubly Stochastic Gradient Methods for Block Term Tensor Decomposition
Authors:
Zehui Liu,
Qingsong Wang,
Chunfeng Cui
Abstract:
In this paper, we explore a specific optimization problem that combines a differentiable nonconvex function with a nondifferentiable function for multi-block variables, which is particularly relevant to tackle the multilinear rank-($L_r$,$L_r$,1) block-term tensor decomposition model with a regularization term. While existing algorithms often suffer from high per-iteration complexity and slow conv…
▽ More
In this paper, we explore a specific optimization problem that combines a differentiable nonconvex function with a nondifferentiable function for multi-block variables, which is particularly relevant to tackle the multilinear rank-($L_r$,$L_r$,1) block-term tensor decomposition model with a regularization term. While existing algorithms often suffer from high per-iteration complexity and slow convergence, this paper employs a unified multi-step inertial accelerated doubly stochastic gradient descent method tailored for structured rank-$\left(L_r, L_r, 1\right)$ tensor decomposition, referred to as Midas-LL1. We also introduce an extended multi-step variance-reduced stochastic estimator framework. Our analysis under this new framework demonstrates the subsequential and sequential convergence of the proposed algorithm under certain conditions and illustrates the sublinear convergence rate of the subsequence, showing that the Midas-LL1 algorithm requires at most $\mathcal{O}(\varepsilon^{-2})$ iterations in expectation to reach an $\varepsilon$-stationary point. The proposed algorithm is evaluated on several datasets, and the results indicate that Midas-LL1 outperforms existing state-of-the-art algorithms in terms of both computational speed and solution quality.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.