-
Finite de Finetti theorems for free easy quantum groups
Authors:
Jianquan Wang
Abstract:
We prove various finite de Finetti theorems for non-commutative distributions which are invariant under the free easy quantum group actions. This complements the free de Finetti theorems by Banica, Curran and Speicher, which mostly focus on infinite sequences. We also discuss some refined results for the infinite setting.
Submitted 7 July, 2025;
originally announced July 2025.
-
A Lasry-Lions envelope approach for mathematical programs with complementarity constraints
Authors:
Jia Wang,
Andreas Themelis,
Ivan Markovsky,
Panagiotis Patrinos
Abstract:
We propose a homotopy method for solving mathematical programs with complementarity constraints (CCs). The indicator function of the CCs is relaxed by a Lasry-Lions double envelope, an extension of the Moreau envelope that enjoys an additional smoothness property that makes it amenable to fast optimization algorithms. The proposed algorithm mimics the behavior of homotopy methods for systems of nonlinear equations or penalty methods for constrained optimization: it solves a sequence of smooth subproblems that progressively approximate the original problem, using the solution of each subproblem as the starting point for the next one. In the limiting setting, we establish the convergence to Mordukhovich and Clarke stationary points. We also provide a worst-case complexity analysis for computing an approximate stationary point. Preliminary numerical results on a suite of benchmark problems demonstrate the effectiveness of the proposed approach.
Submitted 7 July, 2025;
originally announced July 2025.
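As a small illustration of the relaxation idea in the abstract above, the following Python sketch evaluates the Moreau envelope of the indicator function of a scalar complementarity set $C = \{(a,b) : a \ge 0,\ b \ge 0,\ ab = 0\}$. For an indicator function the Moreau envelope reduces to $\mathrm{dist}^2(x, C)/(2\lambda)$. The function names and the scalar setting are assumptions made for illustration; the second stage that turns this into a Lasry-Lions double envelope, and the homotopy loop itself, are not shown and are not taken from the paper.

```python
def dist_sq_to_complementarity_set(a: float, b: float) -> float:
    """Squared Euclidean distance from (a, b) to C = {a >= 0, b >= 0, a*b = 0}."""
    # C is the union of two rays: the non-negative a-axis and the non-negative b-axis.
    to_a_axis = min(a, 0.0) ** 2 + b ** 2  # distance to {(t, 0) : t >= 0}
    to_b_axis = a ** 2 + min(b, 0.0) ** 2  # distance to {(0, t) : t >= 0}
    return min(to_a_axis, to_b_axis)


def moreau_envelope_indicator(a: float, b: float, lam: float) -> float:
    """Moreau envelope (parameter lam > 0) of the indicator function of C.

    Hypothetical helper for illustration only: for an indicator function,
    the envelope equals dist^2(x, C) / (2 * lam).
    """
    return dist_sq_to_complementarity_set(a, b) / (2.0 * lam)
```

The envelope vanishes exactly on $C$ and grows quadratically with the distance to it, which is why shrinking `lam` tightens the relaxation.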
-
Hypergraph Turán problem of the generalized triangle with bounded matching number
Authors:
Jian Wang,
Wenbin Wang,
Weihua Yang
Abstract:
Let $\mathcal{H}$ be a 3-graph on $n$ vertices. The matching number $\nu(\mathcal{H})$ is defined as the maximum number of pairwise disjoint edges in $\mathcal{H}$. The generalized triangle $F_5$ is the 3-graph on the vertex set $\{a,b,c,d,e\}$ with the edge set $\{abc, abd, cde\}$. In this paper, we show that an $F_5$-free 3-graph $\mathcal{H}$ with matching number at most $s$ has at most $s\lfloor (n-s)^2/4\rfloor$ edges for $n\geq 30(s+1)$ and $s\geq 3$. For the proof, we establish a 2-colored version of Mantel's theorem, which may be of independent interest.
Submitted 6 July, 2025;
originally announced July 2025.
-
Stable deformed $\mathfrak{gl}_N$ homology of torus knots
Authors:
William Ballinger,
Eugene Gorsky,
Matthew Hogancamp,
Joshua Wang
Abstract:
We compute the $E_2$ page in the Rasmussen spectral sequence from triply graded to $\mathfrak{gl}_N$ Khovanov--Rozansky stable homology of torus knots. This confirms a weak form of the conjecture of the second author, Oblomkov, and Rasmussen. The main tool is the link-splitting deformation, or $y$-ification, of link homology; in the $y$-ified context, the relevant Rasmussen spectral sequence collapses and we explicitly compute the $y$-ified $\mathfrak{gl}_N$ stable Khovanov--Rozansky homology of torus knots for all $N$.
Submitted 30 June, 2025;
originally announced July 2025.
-
A scalar-mean curvature comparison theorem for manifolds with iterated conical singularities
Authors:
Milan Jovanovic,
Jinmin Wang
Abstract:
We use the Dirac operator method to prove a scalar-mean curvature comparison theorem for spin manifolds which carry iterated conical singularities. Our approach is to study the index theory of a twisted Dirac operator on such singular manifolds. A dichotomy argument is used to prove the comparison theorem without knowing precisely the index of the twisted Dirac operator. This framework also enables us to prove a rigidity theorem of Euclidean domains and a spin positive mass theorem for asymptotically flat manifolds with iterated conical singularities.
Submitted 30 June, 2025;
originally announced June 2025.
-
Totally acyclic complexes and homological invariants
Authors:
Jian Wang,
Yunxia Li,
Jiangsheng Hu,
Haiyan Zhu
Abstract:
In this paper, we study equivalent characterizations of the condition that every acyclic complex of projective (resp., injective and flat) modules is totally acyclic over a general ring R. This line of inquiry was initiated by Iyengar and Krause in 2006 for commutative Noetherian rings with dualizing complexes. We demonstrate that certain equivalent conditions are closely related to the invariants silp(R) and spli(R) defined by Gedrich and Gruenberg, as well as to the invariant sfli(R) defined by Ding and Chen. We also examine some sufficient conditions for the equality spli(R) = silp(R), which leads to a generalization of a result by Ballas and Chatzistavridis that was originally proved in the case where R is a left (and right) coherent ring that is isomorphic to its opposite ring. Finally, we provide examples to illustrate the relations among these conditions.
Submitted 29 June, 2025;
originally announced June 2025.
-
Compact Kähler manifolds with nef anti-canonical bundle
Authors:
Shin-ichi Matsumura,
Juanyong Wang,
Xiaojun Wu,
Qimin Zhang
Abstract:
In this paper, we prove that a compact Kähler manifold $X$ with the nef anti-canonical bundle $-K_{X}$ admits a locally trivial fibration $φ\colon X \to Y$, where the fiber $F$ is a rationally connected manifold and the base $Y$ is a Calabi--Yau manifold. We introduce a suitable approach that extends the strategy of Cao--Höring, originally developed for smooth projective varieties, to more general singular Kähler spaces. A key technical ingredient is a flatness criterion for pseudo-effective sheaves with vanishing first Chern class.
Submitted 29 June, 2025;
originally announced June 2025.
-
Average quantile regression: a new non-mean regression model and coherent risk measure
Authors:
Rong Jiang,
M. C. Jones,
Keming Yu,
Jiangfeng Wang
Abstract:
Regression models that go beyond the mean, alongside coherent risk measures, have been important tools in modern data analysis. This paper introduces the innovative concept of Average Quantile Regression (AQR), which is smooth at the quantile-like level, comonotonically additive, and explicitly accounts for the severity of tail losses relative to quantile regression. AQR serves as a versatile regression model capable of describing distributional information across all positions, akin to quantile regression, yet offering enhanced interpretability compared to expectiles. Numerous traditional regression models and coherent risk measures can be regarded as special cases of AQR. As a flexible non-parametric regression model, AQR demonstrates outstanding performance in analyzing high-dimensional and large datasets, particularly those generated by distributed systems, and provides a convenient framework for their statistical analysis. The corresponding estimators are rigorously derived, and their asymptotic properties are thoroughly developed. In a risk management context, the case study confirms AQR's effectiveness in risk assessment and portfolio optimization.
Submitted 28 June, 2025;
originally announced June 2025.
-
Strategic A/B testing via Maximum Probability-driven Two-armed Bandit
Authors:
Yu Zhang,
Shanshan Zhao,
Bokui Wan,
Jinjuan Wang,
Xiaodong Yan
Abstract:
Detecting a minor average treatment effect is a major challenge in large-scale applications, where even minimal improvements can have a significant economic impact. Traditional methods, reliant on normal distribution-based or expanded statistics, often fail to identify such minor effects because of their inability to handle small discrepancies with sufficient sensitivity. This work leverages a counterfactual outcome framework and proposes a maximum probability-driven two-armed bandit (TAB) process by weighting the mean volatility statistic, which controls Type I error. The implementation of permutation methods further enhances the robustness and efficacy. The established strategic central limit theorem (SCLT) demonstrates that our approach yields a more concentrated distribution under the null hypothesis and a less concentrated one under the alternative hypothesis, greatly improving statistical power. The experimental results indicate a significant improvement in the A/B testing, highlighting the potential to reduce experimental costs while maintaining high statistical power.
Submitted 27 June, 2025;
originally announced June 2025.
-
A Zeroth-Order Extra-Gradient Method For Black-Box Constrained Optimization
Authors:
Yuke Zhou,
Ruiyang Jin,
Siyang Gao,
Jianxiao Wang,
Jie Song
Abstract:
Non-analytical objectives and constraints often arise in control systems, particularly in problems with complex dynamics, which are challenging yet lack efficient solution methods. In this work, we consider general constrained optimization problems involving black-box objectives and constraints. To solve such a problem, we reformulate it as a min-max problem and propose a zeroth-order extra gradient (ZOEG) algorithm that combines the extra gradient method with a feedback-based stochastic zeroth-order gradient estimator. Then, we apply another coordinate gradient estimator to design the zeroth-order coordinate extra gradient algorithm (ZOCEG) to further improve efficiency. The theoretical analysis shows that ZOEG can achieve the best-known oracle complexity of $\mathcal{O}(d\epsilon^{-2})$ to get an $\epsilon$-optimal solution ($d$ is the dimension of the decision space), and ZOCEG can improve it to $\mathcal{O}(d\epsilon^{-1})$. Furthermore, we develop a variant of ZOCEG, which applies block coordinate updates to enhance the efficiency of single-step gradient estimation. Finally, numerical experiments on a load tracking problem validate our theoretical results and the effectiveness of the proposed algorithms.
Submitted 25 June, 2025;
originally announced June 2025.
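To make the min-max reformulation in the abstract above concrete, here is a minimal Python sketch of an extra-gradient iteration on the Lagrangian $L(x,\lambda) = f(x) + \lambda g(x)$ with $\lambda \ge 0$, using only function evaluations. It is not the paper's ZOEG or ZOCEG: the gradient estimator is a plain deterministic coordinate-wise central difference, and the toy problem, step size, and all function names are assumptions chosen for illustration.

```python
from typing import Callable, List, Tuple

Vector = List[float]


def fd_gradient(func: Callable[[Vector], float], x: Vector, h: float = 1e-4) -> Vector:
    """Coordinate-wise central-difference gradient estimate (function values only)."""
    grad = []
    for i in range(len(x)):
        forward, backward = x[:], x[:]
        forward[i] += h
        backward[i] -= h
        grad.append((func(forward) - func(backward)) / (2.0 * h))
    return grad


def lagrangian_field(f, g, x: Vector, lam: float) -> Tuple[Vector, float]:
    """Estimated gradient of L(x, lam) = f(x) + lam * g(x) in x, and its derivative in lam."""
    grad_f, grad_g = fd_gradient(f, x), fd_gradient(g, x)
    return [a + lam * b for a, b in zip(grad_f, grad_g)], g(x)


def zo_extra_gradient(f, g, x0: Vector, steps: int = 2000, eta: float = 0.1):
    """Extra-gradient: descend in x, ascend in lam, with a look-ahead (half) step."""
    x, lam = list(x0), 0.0
    for _ in range(steps):
        gx, gl = lagrangian_field(f, g, x, lam)
        x_half = [xi - eta * gi for xi, gi in zip(x, gx)]
        lam_half = max(0.0, lam + eta * gl)  # projection onto lam >= 0
        # Re-evaluate the field at the look-ahead point and step from the original point.
        gx, gl = lagrangian_field(f, g, x_half, lam_half)
        x = [xi - eta * gi for xi, gi in zip(x, gx)]
        lam = max(0.0, lam + eta * gl)
    return x, lam


# Hypothetical toy problem: minimize (x0 - 2)^2 + (x1 - 1)^2 subject to x0 + x1 <= 2.
def toy_objective(x: Vector) -> float:
    return (x[0] - 2.0) ** 2 + (x[1] - 1.0) ** 2


def toy_constraint(x: Vector) -> float:
    return x[0] + x[1] - 2.0
```

For this toy problem the KKT conditions give the solution $x = (1.5, 0.5)$ with multiplier $\lambda = 1$, which `zo_extra_gradient(toy_objective, toy_constraint, [0.0, 0.0])` should approach.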
-
A generalization of Deodhar's defect statistic for Iwahori--Hecke algebras of type $BC$
Authors:
Gavin Hobbs,
Tommy Parisi,
Mark Skandera,
Jiayuan Wang
Abstract:
Let $H$ be the Iwahori--Hecke algebra corresponding to any Coxeter group. Deodhar's defect statistic [Geom. Dedicata 36, (1990) pp.95--119] allows one to expand products of simple Kazhdan--Lusztig basis elements of $H$ in the natural basis of $H$. Clearwater and the third author gave a type-$A$ extension [Ann. Comb. 25, no. 3 (2021) pp.757--787] of this formula which combinatorially describes the natural expansion of products of Kazhdan--Lusztig basis elements indexed by smooth elements of the symmetric group. We similarly give a type-$BC$ extension of Deodhar's result which combinatorially describes the natural expansion of Kazhdan--Lusztig basis elements indexed by hyperoctahedral group elements which are simultaneously smooth in types $B$ and $C$.
Submitted 24 June, 2025;
originally announced June 2025.
-
Averaging principles for time-inhomogeneous multi-scale SDEs with partially dissipative coefficients
Authors:
Xiaobin Sun,
Jian Wang,
Yingchao Xie
Abstract:
In this paper, we study averaging principles for a class of time-inhomogeneous stochastic differential equations (SDEs) with slow and fast time-scales, where the drift term in the fast component is time-dependent and only partially dissipative. Under asymptotic assumptions on the coefficients, we prove that the slow component $(X^{\varepsilon}_t)_{t\geq 0}$ converges strongly to the unique solution $(\bar{X}_t)_{t\geq 0}$ to an averaged SDE, when the diffusion coefficient in the slow component is independent of the fast component; on the other hand, we establish the weak convergence of $(X_t^{\varepsilon})_{t\ge0}$ in the space $C([0,T];\mathbb{R}^n)$ and identify the limiting process by the martingale problem approach, when the diffusion coefficient of the slow component depends on the fast component. The proofs of strong and weak averaging principles are partly based on the study of the existence and uniqueness of an evolution system of measures for time-inhomogeneous SDEs with partially dissipative drift.
Submitted 23 June, 2025;
originally announced June 2025.
-
Towards Robust Learning to Optimize with Theoretical Guarantees
Authors:
Qingyu Song,
Wei Lin,
Juncheng Wang,
Hong Xu
Abstract:
Learning to optimize (L2O) is an emerging technique to solve mathematical optimization problems with learning-based methods. Despite great success in many real-world scenarios such as wireless communications, computer networks, and electronic design, existing L2O works lack theoretical demonstration of their performance and robustness in out-of-distribution (OOD) scenarios. We address this gap by providing comprehensive proofs. First, we prove a sufficient condition for a robust L2O model with homogeneous convergence rates over all In-Distribution (InD) instances. We assume an L2O model achieves robustness for an InD scenario. Based on our proposed methodology of aligning OOD problems to InD problems, we also demonstrate that the L2O model's convergence rate in OOD scenarios deteriorates according to an equation in the L2O model's input features. Moreover, we propose an L2O model with a concise gradient-only feature construction and a novel gradient-based history modeling method. Numerical simulation demonstrates that our proposed model outperforms the state-of-the-art baseline in both InD and OOD scenarios and achieves up to 10 $\times$ convergence speedup. The code of our method can be found at https://github.com/NetX-lab/GoMathL2O-Official.
Submitted 17 June, 2025;
originally announced June 2025.
-
Optimal ${L^2}$ error estimates for 2D/3D incompressible Cahn--Hilliard--magnetohydrodynamic equations
Authors:
Haiyan Su,
Jilu Wang,
Zeyu Xia,
Ke Zhang
Abstract:
This paper focuses on an optimal error analysis of a fully discrete finite element scheme for the Cahn--Hilliard--magnetohydrodynamic (CH-MHD) system. The method uses the standard inf-sup stable Taylor--Hood/MINI elements to solve the Navier--Stokes equations, Lagrange elements to solve the phase field, and, in particular, Nédélec elements to solve the magnetic induction field. Owing to the strong coupling and high nonlinearity, previous works provide only suboptimal error estimates for the phase field and velocity field in the $L^{2}/\mathbf{L}^2$-norm under the same order elements, and suboptimal error estimates for the magnetic induction field in the $\mathbf{H}(\mathrm{curl})$-norm. To overcome this, we utilize the Ritz, Stokes, and Maxwell quasi-projections to eliminate the low-order pollution of the phase field and magnetic induction field. In addition to the optimal $\mathbf{L}^2$-norm error estimates, we present the optimal convergence rates for the magnetic induction field in the $\mathbf{H}(\mathrm{curl})$-norm and for the velocity field in the $\mathbf{H}^1$-norm. Moreover, the unconditional energy stability and mass conservation of the proposed scheme are preserved. Numerical examples are presented to validate the theoretical analysis and show the performance of the proposed scheme.
Submitted 15 June, 2025;
originally announced June 2025.
-
Lagrange multiplier expressions for matrix polynomial optimization and tight relaxations
Authors:
Lei Huang,
Jiawang Nie,
Jiajia Wang,
Lingling Xie
Abstract:
This paper studies matrix constrained polynomial optimization. We investigate how to get explicit expressions for Lagrange multiplier matrices from the first order optimality conditions. The existence of these expressions can be shown under the nondegeneracy condition. Using Lagrange multiplier matrix expressions, we propose a strengthened Moment-SOS hierarchy for solving matrix polynomial optimization. Under some general assumptions, we show that this strengthened hierarchy is tight, or equivalently, it has finite convergence. We also study how to detect tightness and how to extract optimizers. Numerical experiments are provided to show the efficiency of the strengthened hierarchy.
Submitted 14 June, 2025;
originally announced June 2025.
-
Shortest filling geodesics on hyperbolic surfaces
Authors:
Yue Gao,
Jiajun Wang,
Zhongzi Wang
Abstract:
In this paper, we obtain the minimal length of a filling (multi-)geodesic on a genus $g$ hyperbolic surface in the moduli space of hyperbolic surfaces and show that it is realized by the geodesic whose complement is a right-angled regular $(8g-4)$-gon. A single geodesic realizing this minimum is provided.
Submitted 14 June, 2025;
originally announced June 2025.
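The extremal configuration in the abstract above can be made numerical with standard hyperbolic trigonometry. The sketch below assumes curvature $-1$ and uses the right-triangle identity $\cos A = \cosh a \,\sin B$ to get the side length $s$ of a regular right-angled $n$-gon, $\cosh(s/2) = \sqrt{2}\cos(\pi/n)$. It further assumes that each arc of the filling geodesic appears twice on the boundary of the complementary polygon, so that the geodesic length is half the perimeter. Both the function names and this half-perimeter reading are assumptions of the sketch and are not quoted from the paper.

```python
import math


def right_angled_regular_polygon_side(n: int) -> float:
    """Side length of a regular hyperbolic n-gon (curvature -1) with all angles pi/2.

    Such a polygon exists only for n >= 5.
    """
    if n < 5:
        raise ValueError("a right-angled regular hyperbolic n-gon needs n >= 5")
    return 2.0 * math.acosh(math.sqrt(2.0) * math.cos(math.pi / n))


def filling_geodesic_length(genus: int) -> float:
    """Half the perimeter of the right-angled regular (8g - 4)-gon.

    Assumption: cutting the surface along the geodesic traverses each arc twice
    on the polygon boundary, so length = perimeter / 2.
    """
    n = 8 * genus - 4
    return 0.5 * n * right_angled_regular_polygon_side(n)
```

As a consistency check, Gauss-Bonnet gives the area of this polygon as $(n-2)\pi - n\pi/2 = (4g-4)\pi$ for $n = 8g-4$, which matches the area of a genus $g$ hyperbolic surface.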
-
Clustering of large deviations events in heavy-tailed moving average processes: the catastrophe principle in the short-memory case
Authors:
Jiaqi Wang,
Gennady Samorodnitsky
Abstract:
How do large deviation events in a stationary process cluster? The answer depends not only on the type of large deviations, but also on the length of memory in the process. Somewhat unexpectedly, it may also depend on the tails of the process. In this paper we work in the context of large deviations for partial sums in moving average processes with short memory and regularly varying tails. We show that the structure of the large deviation cluster in this case markedly differs from the corresponding structure in the case of exponentially light tails, considered in Chakrabarty and Samorodnitsky (2024). This is due to the difference between the ``conspiracy'' and the ``catastrophe'' principles underlying the large deviation events in the light-tailed case and the heavy-tailed case, respectively.
Submitted 11 June, 2025;
originally announced June 2025.
-
Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning
Authors:
Xiangning Yu,
Zhuohan Wang,
Linyi Yang,
Haoxuan Li,
Anjie Liu,
Xiao Xue,
Jun Wang,
Mengyue Yang
Abstract:
Chain-of-Thought (CoT) prompting plays an indispensable role in endowing large language models (LLMs) with complex reasoning capabilities. However, CoT currently faces two fundamental challenges: (1) Sufficiency, which ensures that the generated intermediate inference steps comprehensively cover and substantiate the final conclusion; and (2) Necessity, which identifies the inference steps that are truly indispensable for the soundness of the resulting answer. We propose a causal framework that characterizes CoT reasoning through the dual lenses of sufficiency and necessity. Incorporating causal Probability of Sufficiency and Necessity allows us not only to determine which steps are logically sufficient or necessary to the prediction outcome, but also to quantify their actual influence on the final reasoning outcome under different intervention scenarios, thereby enabling the automated addition of missing steps and the pruning of redundant ones. Extensive experimental results on various mathematical and commonsense reasoning benchmarks confirm substantial improvements in reasoning efficiency and reduced token usage without sacrificing accuracy. Our work provides a promising direction for improving LLM reasoning performance and cost-effectiveness.
Submitted 11 June, 2025;
originally announced June 2025.
-
Enumerating several statistics of r-Colored Dyck paths with no dd-steps having the same colors
Authors:
Yidong Sun,
Jinyi Wang,
Xinyu Wang
Abstract:
An $r$-colored Dyck path is a Dyck path with all $\mathbf{d}$-steps having one of $r$ colors in $[r]=\{1, 2, \dots, r\}$. In this paper, we consider several statistics on the set $\mathcal{A}_{n,0}^{(r)}$ of $r$-colored Dyck paths of length $2n$ with no two consecutive $\mathbf{d}$-steps having the same color. Precisely, the paper studies the statistics ``number of points'' at level $\ell$, ``number of $\mathbf{u}$-steps'' at level $\ell+1$, ``number of peaks'' at level $\ell+1$ and ``number of $\mathbf{udu}$-steps'' on the set $\mathcal{A}_{n,0}^{(r)}$. The counting formulas of the first three statistics are established by Riordan arrays related to $S(a,b; x)$, the weighted generating function of $(a,b)$-Schröder paths. Using a useful and surprising relation satisfied by $S(a,b; x)$, several identities related to these counting formulas are also described.
Submitted 9 June, 2025;
originally announced June 2025.
-
Multiple mixing and fractional cohomological equation
Authors:
Zhenqi Jenny Wang
Abstract:
We introduce the notion of the (multiple) fractional cohomological equation and, by studying its solutions, develop a novel framework to obtain the decay of matrix coefficients for partially hyperbolic algebraic actions. In particular, we show that mere partial Hölder regularity of \(L^2\) vectors is sufficient for exponential decay of matrix coefficients.
As an application, under the assumption of ergodicity, we obtain explicit and sharp exponential mixing rates of all orders for a large class of partially hyperbolic algebraic actions. Furthermore, we introduce the concept of irrational automorphisms on nilmanifolds and prove that these automorphisms exhibit super-exponential mixing of all orders, marking the first such example in the literature.
Submitted 10 June, 2025; v1 submitted 9 June, 2025;
originally announced June 2025.
-
Uncovering the topology of an infinite-server queueing network from population data
Authors:
Hritika Gupta,
Michel Mandjes,
Liron Ravner,
Jiesen Wang
Abstract:
This paper studies statistical inference in a network of infinite-server queues, with the aim of estimating the underlying parameters (routing matrix, arrival rates, parameters pertaining to the service times) using observations of the network population vector at Poisson time points. We propose a method-of-moments estimator and establish its consistency. The method relies on deriving the covariance structure of different nodes at different sampling epochs. Numerical experiments demonstrate that the method yields accurate estimates, even in settings with a large number of parameters. Two model variants are considered: one that assumes a known parametric form for the service-time distributions, and a model-free version that does not require such assumptions.
Submitted 8 June, 2025;
originally announced June 2025.
-
The overflow in the Katona Theorem
Authors:
Peter Frankl,
Jian Wang
Abstract:
Let $n>2r>0$ be integers. We consider families $\mathcal{F}$ of subsets of an $n$-element set, in which the union of any two members has size at most $2r$. One of our results states that for $n\geq 6r$ the number of members of size exceeding $r$ in $\mathcal{F}$ is at most $\binom{n-2}{r-1}$. Another result shows that for $n>3.5r$ the number of sets of size at least $r$ is at most $\binom{n}{r}$. Both bounds are best possible and the latter sharpens the classical Katona Theorem. Similar results are proved for the odd case of the Katona Theorem as well.
Submitted 5 June, 2025;
originally announced June 2025.
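The two bounds in the abstract above can be illustrated on small cases. The Python sketch below builds two families in which any two members have union of size at most $2r$: all $r$-subsets (which has $\binom{n}{r}$ members of size at least $r$) and all $(r+1)$-subsets containing two fixed elements (which has $\binom{n-2}{r-1}$ members of size exceeding $r$). These constructions are natural candidates chosen for this sketch; the abstract does not state which families the paper uses to show the bounds are best possible, and the function names are hypothetical.

```python
from itertools import combinations
from math import comb
from typing import FrozenSet, List

Family = List[FrozenSet[int]]


def max_pairwise_union(family: Family) -> int:
    """Largest size of a union of two (not necessarily distinct) members."""
    return max(len(a | b) for a in family for b in family)


def all_r_sets(n: int, r: int) -> Family:
    """All r-element subsets of {0, ..., n-1}."""
    return [frozenset(c) for c in combinations(range(n), r)]


def large_sets_through_fixed_pair(n: int, r: int) -> Family:
    """All (r+1)-element subsets of {0, ..., n-1} that contain both 0 and 1."""
    return [frozenset((0, 1) + c) for c in combinations(range(2, n), r - 1)]
```

For example, with `n = 12` and `r = 2`, both families satisfy the union condition, and their sizes equal `comb(12, 2)` and `comb(10, 1)` respectively.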
-
Lions and Muons: Optimization via Stochastic Frank-Wolfe
Authors:
Maria-Eleni Sfyraki,
Jun-Kun Wang
Abstract:
Stochastic Frank-Wolfe is a classical optimization method for solving constrained optimization problems. On the other hand, recent optimizers such as Lion and Muon have gained quite significant popularity in deep learning. In this work, we provide a unifying perspective by interpreting these seemingly disparate methods through the lens of Stochastic Frank-Wolfe. Specifically, we show that Lion and Muon with weight decay can be viewed as special instances of a Stochastic Frank-Wolfe, and we establish their convergence guarantees in terms of the Frank-Wolfe gap, a standard stationarity measure in non-convex optimization for Frank-Wolfe methods. We further find that convergence to this gap implies convergence to a KKT point of the original problem under a norm constraint for Lion and Muon. Moreover, motivated by recent empirical findings that stochastic gradients in modern machine learning tasks often exhibit heavy-tailed distributions, we extend Stochastic Frank-Wolfe to settings with heavy-tailed noise by developing two robust variants with strong theoretical guarantees, which in turn yields new variants of Lion and Muon.
Submitted 4 June, 2025;
originally announced June 2025.
-
Large Deviations for Sequential Tests of Statistical Sequence Matching
Authors:
Lin Zhou,
Qianyun Wang,
Yun Wei,
Jingjing Wang
Abstract:
We revisit the problem of statistical sequence matching initiated by Unnikrishnan (TIT 2015) and derive theoretical performance guarantees for sequential tests that have bounded expected stopping times. Specifically, in this problem, one is given two databases of sequences and the task is to identify all matched pairs of sequences. In each database, each sequence is generated i.i.d. from a distinct distribution, and a pair of sequences is said to be matched if they are generated from the same distribution. The generating distribution of each sequence is \emph{unknown}. We first consider the case where the number of matches is known and derive the exact exponential decay rate of the mismatch (error) probability, a.k.a. the mismatch exponent, under each hypothesis for optimal sequential tests. Our results reveal the benefit of sequentiality by showing that optimal sequential tests have a larger mismatch exponent than the fixed-length tests of Zhou \emph{et al.} (TIT 2024). Subsequently, we generalize our achievability result to the case of an unknown number of matches. In this case, two additional error probabilities arise: false alarm and false reject probabilities. We propose a corresponding sequential test, show that the test has bounded expected stopping time under certain conditions, and characterize the tradeoff among the exponential decay rates of the three error probabilities. Furthermore, we reveal the benefit of sequentiality over the two-step fixed-length test by Zhou \emph{et al.} (TIT 2024) and propose a one-step fixed-length test that performs no worse than the fixed-length test by Zhou \emph{et al.} (TIT 2024). When specialized to the case where either database contains a single sequence, our results reduce to large deviations of sequential tests for statistical classification, the binary case of which was recently studied by Hsu, Li and Wang (ITW 2022).
Submitted 4 June, 2025;
originally announced June 2025.
-
Hyperbolicity and GCD for n+1 divisors with non-empty intersection
Authors:
Julie Tzu-Yueh Wang,
Zheng Xiao
Abstract:
We study hyperbolicity for quasi-projective varieties where the boundary divisor consists of n+1 numerically parallel effective divisors on a complex projective variety of dimension n, allowing non-empty intersection. Under explicit local conditions on beta constants or intersection multiplicities, we prove that all entire curves are algebraically degenerate. Our approach extends the method of Levin-Huang-Xiao to higher dimensions, establishing a second main theorem for regular sequences of closed subschemes. This also yields a GCD-type estimate in the same geometric setting.
Submitted 3 June, 2025;
originally announced June 2025.
-
Campana's orbifold conjecture for numerically equivalent divisors
Authors:
Min Ru,
Julie Tzu-Yueh Wang
Abstract:
We prove the following version of Campana's orbifold conjecture: Let $X$ be a complex non-singular projective variety of dimension $n$. Let $D_1,\ldots,D_{n+1}$ be $\mathbb Z$-linearly independent effective divisors in ${\rm Div}(X)$ and let $D:=D_1+\cdots+D_{n+1}$ be a normal crossing divisor of $X$. Assume furthermore that they are numerically parallel. Let $Δ=\sum_{i=1}^{n+1} (1-m_i^{-1}) D_i$ and let $f:\mathbb C\to (X,Δ) $ be an orbifold entire curve. Then there exists a positive integer $\ell$ such that the orbifold $ (X,Δ_{\ell}) $ is of general type, where $Δ_{\ell}=\sum_{i=1}^{n+1} (1-\frac1{\ell})D_i$, and if $f$ has multiplicity at least $\ell$ along $D_i$, $1\le i\le n+1$, then $f$ must be algebraically degenerate.
Submitted 1 June, 2025;
originally announced June 2025.
-
Fine-tuning for Data-enabled Predictive Control of Noisy Systems by Reinforcement Learning
Authors:
Jinbao Wang,
Shiliang Zhang,
Jun Liu,
Xuehui Ma,
Haolin Liu
Abstract:
Data-enabled predictive control (DeePC) leverages system measurements in characterizing system dynamics for optimal control. The performance of DeePC relies on optimizing its hyperparameters, especially in noisy systems where the optimal hyperparameters adapt over time. Existing hyperparameter tuning approaches for DeePC are, more often than not, computationally inefficient or overly conservative. This paper proposes an adaptive DeePC in which hyperparameter adaptation is guided by reinforcement learning. We start by establishing the relationship between the system I/O behavior and the DeePC hyperparameters. Then we formulate hyperparameter tuning as a sequential decision-making problem, which we address through reinforcement learning. We train a reinforcement learning model offline and integrate the trained model with DeePC to adjust its hyperparameters adaptively in real time. We conduct numerical simulations under diverse noise conditions, and the results demonstrate the identification of near-optimal hyperparameters and the robustness of the proposed approach against noise in the control.
Submitted 30 May, 2025;
originally announced May 2025.
-
GradPower: Powering Gradients for Faster Language Model Pre-Training
Authors:
Mingze Wang,
Jinbo Wang,
Jiaqi Zhang,
Wei Wang,
Peng Pei,
Xunliang Cai,
Weinan E,
Lei Wu
Abstract:
We propose GradPower, a lightweight gradient-transformation technique for accelerating language model pre-training. Given a gradient vector $g=(g_i)_i$, GradPower first applies the elementwise sign-power transformation $\varphi_p(g)=({\rm sign}(g_i)|g_i|^p)_{i}$ for a fixed $p>0$, and then feeds the transformed gradient into a base optimizer. Notably, GradPower requires only a single-line code change and no modifications to the base optimizer's internal logic, including the hyperparameters. When applied to Adam (termed AdamPower), GradPower consistently achieves lower terminal loss across diverse architectures (LLaMA, Qwen2MoE), parameter scales (66M to 2B), datasets (C4, OpenWebText), and learning-rate schedules (cosine, warmup-stable-decay). The most pronounced gains are observed when training modern mixture-of-experts models with warmup-stable-decay schedules. GradPower also integrates seamlessly with other state-of-the-art optimizers, such as Muon, yielding further improvements. Finally, we provide theoretical analyses that reveal the underlying mechanism of GradPower and highlight the influence of gradient noise.
Submitted 30 May, 2025;
originally announced May 2025.
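As a rough illustration of the sign-power transformation described in the GradPower abstract above, the sketch below applies $\varphi_p$ elementwise and then passes the result to a base optimizer. The function names `sign_power` and `sgd_step` are hypothetical, and plain gradient descent stands in for the base optimizer only to keep the example self-contained; the abstract itself pairs GradPower with Adam and Muon, and this is not the authors' implementation.

```python
import math


def sign_power(grad, p):
    """Elementwise sign-power transform: sign(g_i) * |g_i|**p, for p > 0."""
    if p <= 0:
        raise ValueError("p must be positive")
    # copysign keeps the sign of g while the magnitude becomes |g|**p.
    return [math.copysign(abs(g) ** p, g) for g in grad]


def sgd_step(params, grad, lr):
    """Hypothetical stand-in base optimizer: one plain gradient-descent step."""
    return [w - lr * g for w, g in zip(params, grad)]


params = [1.0, -2.0, 0.5]
grad = [4.0, -9.0, 0.0]

# The only change relative to the base optimizer is this transformation.
transformed = sign_power(grad, p=0.5)
params = sgd_step(params, transformed, lr=0.1)
```

With $p=1$ the transformation is the identity, so the base optimizer is recovered unchanged.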
-
Mean Field Control with Poissonian Common Noise: A Pathwise Compactification Approach
Authors:
Lijun Bo,
Jingfei Wang,
Xiaoli Wei,
Xiang Yu
Abstract:
This paper contributes to the compactification approach to tackle mean-field control (MFC) problems with Poissonian common noise. To overcome the lack of compactness and continuity issues due to common noise, we exploit the point process representation of the Poisson random measure with finite intensity and propose a pathwise formulation by freezing a sample path of the common noise. We first study a pathwise relaxed control problem in an auxiliary setup without common noise but with finite deterministic jumping times over the finite horizon. By employing the compactification argument for the pathwise relaxed control problem with Skorokhod topology, we establish the existence of optimal controls in the pathwise formulation. To address the original problem, the main challenge is to close the gap between the problem in the original model with common noise and the pathwise formulation. With the help of concatenation techniques over the sequence of deterministic jumping times, we develop a new tool, also interpreted as the superposition principle in the pathwise formulation, to draw a relationship between the pathwise relaxed control problem and the pathwise measure-valued control problem associated to Fokker-Planck equation. As a result, we can bridge the desired equivalence among different problem formulations. We also extend the methodology to solve mean-field games with Poissonian common noise, confirming the existence of a strong mean field equilibrium.
Submitted 29 May, 2025;
originally announced May 2025.
-
Cauchy problem and dependency analysis for logarithmic Schrödinger equation on waveguide manifold
Authors:
Hichem Hajaiej,
Jun Wang,
Zhaoyang Yin
Abstract:
In this paper, we develop a novel idea to study $y$-dependence for the logarithmic Schrödinger equation on $\mathbb{R}^d \times \mathbb{T}^n$. Unlike \cite{STNT2014}(Analysis \& PDE, 2014) and \cite{HHYL2024}(SIAM J. Math. Anal., 2024), the heart of the matter is that the scaling argument is invalid. Moreover, we also consider the Cauchy problem, which transforms the variational analysis into dynamical stability results.
Submitted 28 June, 2025; v1 submitted 29 May, 2025;
originally announced May 2025.
-
The spectral torsion for the one form rescaled Dirac operator
Authors:
Jian Wang,
Yong Wang
Abstract:
The spectral torsion is defined in terms of three vector fields, Dirac operators, and the noncommutative residue.
Motivated by the spectral torsion and the one form rescaled Dirac operator, we give a new spectral torsion, which extends the spectral torsion for Dirac operators, and compute the spectral torsion for the one form rescaled Dirac operator on even-dimensional spin manifolds without boundary.
Submitted 29 May, 2025;
originally announced May 2025.
-
Second Order Properties of Thinned Counts in Finite Birth--Death Processes
Authors:
Daryl. J. Daley,
Yoni Nazarathy,
Jiesen Wang
Abstract:
The paper studies the counting process arising as a subset of births and deaths in a birth--death process on a finite state space. Whenever a birth or death occurs, the process is incremented or not depending on the outcome of an independent Bernoulli experiment whose probability is a state-dependent function of the birth and death and also depends on whether it is a birth or death that has occurred. We establish a formula for the asymptotic variance rate of this process, also presented as the ratio of the asymptotic variance and the asymptotic mean. Several examples including queueing models illustrate the scope of applicability of the results. An analogous formula for the countably infinite state space is conjectured and tested.
Submitted 26 May, 2025;
originally announced May 2025.
-
AMG with Filtering: An Efficient Preconditioner for Interior Point Methods in Large-Scale Contact Mechanics Optimization
Authors:
Socratis Petrides,
Tucker Hartland,
Tzanio Kolev,
Chak Shing Lee,
Michael Puso,
Jerome Solberg,
Eric B. Chin,
Jingyi Wang,
Cosmin Petra
Abstract:
Large-scale contact mechanics simulations are crucial in many engineering fields such as structural design and manufacturing. In the frictionless case, contact can be modeled by minimizing an energy functional; however, these problems are often nonlinear, non-convex, and increasingly difficult to solve as mesh resolution increases. In this work, we employ a Newton-based interior-point (IP) filter line-search method, an effective approach for large-scale constrained optimization. While this method converges rapidly, each iteration requires solving a large saddle-point linear system that becomes ill-conditioned as the optimization process converges, largely due to the IP treatment of the contact constraints. Such ill-conditioning can hinder solver scalability and increase iteration counts with mesh refinement. To address this, we introduce a novel preconditioner, AMG with Filtering (AMGF), tailored to the Schur complement of the saddle-point system. Building on the classical algebraic multigrid (AMG) solver, commonly used for elasticity, we augment it with a specialized subspace correction that filters near-null-space components introduced by contact interface constraints. Through theoretical analysis and numerical experiments on a range of linear and nonlinear contact problems, we demonstrate that the proposed solver achieves mesh-independent convergence and maintains robustness against the ill-conditioning that notoriously plagues IP methods. These results indicate that AMGF makes contact mechanics simulations more tractable and broadens the applicability of Newton-based IP methods in challenging engineering scenarios. More broadly, AMGF is well suited for problems, optimization or otherwise, where solver performance is limited by a problematic low-dimensional subspace. This makes the method widely applicable beyond contact mechanics and constrained optimization.
Submitted 24 May, 2025;
originally announced May 2025.
-
Local projection stabilization methods for $\boldsymbol{H}({\rm curl})$ and $\boldsymbol{H}({\rm div})$ advection problems
Authors:
Yangfan Luo,
Jindong Wang,
Shuonan Wu
Abstract:
We devise local projection stabilization (LPS) methods for advection problems in the $\boldsymbol{H}$(curl) and $\boldsymbol{H}$(div) spaces, employing conforming finite element spaces of arbitrary order within a unified framework. The key ingredient is a local inf-sup condition, enabled by enriching the approximation space with appropriate $\boldsymbol{H}$(d) bubble functions (with d = curl or div). This enrichment allows for the construction of modified interpolation operators, which are crucial for establishing optimal a priori error estimates in the energy norm. Numerical examples are presented to verify both the theoretical results and the stabilization properties of the proposed method.
Submitted 22 May, 2025;
originally announced May 2025.
-
Existence theory for elliptic equations of general exponential nonlinearity on finite graphs
Authors:
Bobo Hua,
Linlin Sun,
Jiaxuan Wang
Abstract:
We study semilinear elliptic equations on finite graphs with fully general exponential nonlinearities, thereby extending classical equations such as the Kazdan-Warner and Chern-Simons equations. A key contribution of this work is the development of new techniques for deriving a priori estimates in this generalized setting, which reduce the original finite graph to a graph with only two vertices. This reduction enables us to explicitly compute the Brouwer degree and to establish the existence of solutions when the degree is nonzero. Furthermore, using the method of sub- and supersolutions, we also prove the existence of solutions in cases where the Brouwer degree vanishes.
Submitted 20 May, 2025;
originally announced May 2025.
-
The asymptotic uniform distribution of subset sums
Authors:
Jing Wang
Abstract:
Let $G$ be a finite abelian group of order $n$, and for each $a\in G$ and integer $1\le h\le n$ let $\mathcal{F}_a(h)$ denote the family of all $h$-element subsets of $G$ whose sum is $a$. A problem posed by Katona and Makar-Limanov is to determine whether the minimum and maximum sizes of the families $\mathcal{F}_a(h)$ (as $a$ ranges over $G$) become asymptotically equal as $n\rightarrow \infty$ when $h=\left\lfloor\frac{n}{2}\right\rfloor$. We affirmatively answer this question and in fact show that the same asymptotic equality holds for every $4\leq h\leq \left\lfloor\frac{n}{2}\right\rfloor+1$.
Submitted 18 May, 2025;
originally announced May 2025.
-
Sharp integral bound of scalar curvature on $3$-manifolds
Authors:
Ovidiu Munteanu,
Jiaping Wang
Abstract:
It is shown that the integral of the scalar curvature on a geodesic ball of radius $R$ in a three-dimensional complete manifold with nonnegative Ricci curvature is bounded above by $8πR$ asymptotically for large $R$ provided that the scalar curvature is bounded between two positive constants.
Submitted 15 May, 2025;
originally announced May 2025.
-
Distributed Stochastic Optimization for Non-Smooth and Weakly Convex Problems under Heavy-Tailed Noise
Authors:
Jun Hu,
Chao Sun,
Bo Chen,
Jianzheng Wang,
Zheming Wang
Abstract:
In existing distributed stochastic optimization studies, it is usually assumed that the gradient noise has a bounded variance. However, recent research shows that the heavy-tailed noise, which allows an unbounded variance, is closer to practical scenarios in many tasks. Under heavy-tailed noise, traditional optimization methods, such as stochastic gradient descent, may have poor performance and even diverge. Thus, it is of great importance to study distributed stochastic optimization algorithms applicable to the heavy-tailed noise scenario. However, most of the existing distributed algorithms under heavy-tailed noise are developed for convex and smooth problems, which limits their applications. This paper proposes a clipping-based distributed stochastic algorithm under heavy-tailed noise that is suitable for non-smooth and weakly convex problems. The convergence of the proposed algorithm is proven, and the conditions on the parameters are given. A numerical experiment is conducted to demonstrate the effectiveness of the proposed algorithm.
Submitted 14 May, 2025;
originally announced May 2025.
-
Largest $3$-uniform set systems with VC-dimension $2$
Authors:
Jian Wang,
Zixiang Xu,
Shengtong Zhang
Abstract:
We determine the largest size of $3$-uniform set systems on $[n]$ with VC-dimension $2$ for all $n$.
Submitted 12 May, 2025;
originally announced May 2025.
-
On lattice coverings by locally anti-blocking bodies and polytopes with few vertices
Authors:
Matthias Schymura,
Jun Wang,
Fei Xue
Abstract:
In 2021, Ordentlich, Regev and Weiss made a breakthrough by showing that the lattice covering density of any $n$-dimensional convex body is upper bounded by $cn^{2}$, improving on the best previous bound established by Rogers in 1959. However, for the Euclidean ball, Rogers obtained the better upper bound $n(\log_{e}n)^{c}$, and this result was extended to certain symmetric convex bodies by Gritzmann. The constant $c$ above is independent of $n$. In this paper, we show that such a bound can be achieved for more general classes of convex bodies without symmetry, including anti-blocking bodies, locally anti-blocking bodies and $n$-dimensional polytopes with $n+2$ vertices.
Submitted 2 June, 2025; v1 submitted 12 May, 2025;
originally announced May 2025.
-
Neural Operators for Adaptive Control of Traffic Flow Models
Authors:
Kaijing Lyu,
Junmin Wang,
Yihuai Zhang,
Huan Yu
Abstract:
The uncertainty in human driving behaviors leads to stop-and-go instabilities in freeway traffic. The traffic dynamics are typically modeled by the Aw-Rascle-Zhang (ARZ) Partial Differential Equation (PDE) models, in which the relaxation time parameter is usually unknown or hard to calibrate. This paper proposes an adaptive boundary control design based on neural operators (NO) for the ARZ PDE systems. In adaptive control, solving the backstepping kernel PDEs online requires significant computational resources at each timestep to update estimates of the unknown system parameters. To address this, we employ DeepONet to efficiently map model parameters to kernel functions. Simulations show that DeepONet generates kernel solutions nearly two orders of magnitude faster than traditional solvers while maintaining a loss on the order of \(10^{-2}\). Lyapunov analysis further validates the stability of the system when using DeepONet-approximated kernels in the adaptive controller. This result suggests that neural operators can significantly accelerate the acquisition of adaptive controllers for traffic control.
Submitted 12 May, 2025;
originally announced May 2025.
-
Projection-free approximation of flows of harmonic maps with quadratic constraint accuracy and variable step sizes
Authors:
Georgios Akrivis,
Sören Bartels,
Michele Ruggeri,
Jilu Wang
Abstract:
We construct and analyze a projection-free linearly implicit method for the approximation of flows of harmonic maps into spheres. The proposed method is unconditionally energy stable and, under a sharp discrete regularity condition, achieves second order accuracy with respect to the constraint violation. Furthermore, the method accommodates variable step sizes to speed up the convergence to stationary points and to improve the accuracy of the numerical solutions near singularities, without affecting the unconditional energy stability and the constraint violation property. We illustrate the accuracy in approximating the unit-length constraint and the performance of the method through a series of numerical experiments, and compare it with the linearly implicit Euler and two-step BDF methods.
Submitted 8 May, 2025;
originally announced May 2025.
-
Model Structures Arising from Extendable Cotorsion Pairs
Authors:
Qingyu Shao,
Junpeng Wang,
Xiaoxiang Zhang
Abstract:
The aim of this paper is to construct exact model structures from so-called extendable cotorsion pairs. Let $(\mathcal{C}, \mathcal{W}, \mathcal{F})$ be a hereditary Hovey triple in a weakly idempotent complete exact category. If one of the cotorsion pairs $(\mathcal{C}\cap\mathcal{W}, \mathcal{F})$ and $(\mathcal{C}, \mathcal{W}\cap\mathcal{F})$ is extendable, then there is a chain of hereditary Hovey triples whose corresponding homotopy categories coincide. As applications, we obtain a new description of the unbounded derived category $\mathbf{D}(R)$ over a ring $R$. Moreover, we can interpret Krause's recollement in terms of ``$n$-dimensional'' homotopy categories. Finally, we have two approaches to get ``$n$-dimensional'' hereditary Hovey triples, which are proved to coincide, in the category Rep$(Q,\mathcal{A})$ of all representations of a rooted quiver $Q$ with values in an abelian category $\mathcal{A}$.
Submitted 8 May, 2025;
originally announced May 2025.
-
Lie conformal superalgebras of rank (2 + 1)
Authors:
Jinrong Wang,
Xiaoqing Yue
Abstract:
In this paper, Lie conformal superalgebras of rank (2 + 1) are completely classified (up to isomorphism) and their automorphism groups are determined. Furthermore, we give the classification of the finite irreducible conformal modules over them and the actions are explicitly described.
Submitted 6 May, 2025;
originally announced May 2025.
-
On the local topology of non-collapsed Ricci bounded limit spaces
Authors:
Song Sun,
Jikang Wang,
Junsheng Zhang
Abstract:
We show that for a pointed Gromov-Hausdorff limit of non-collapsed Riemannian manifolds with bounded Ricci curvature, the local $b_1$
of the regular loci vanishes. We also discuss applications and some open questions.
Submitted 5 May, 2025;
originally announced May 2025.
-
Recurrence of the VRJP and Exponential Decay in the \(H^{2|2}\)-Model on the Hierarchical Lattice for \(d\le 2\)
Authors:
Jinglin Wang,
Xiaolin Zeng
Abstract:
We show that the vertex-reinforced jump processes on a \(d\)-dimensional hierarchical lattice are recurrent for \(d < 2\) and transient for \(d > 2\). We also explore certain regimes when \(d = 2\). The proof of recurrence relies on an exponential decay estimate of the fractional moment of the Green's function, which, unlike the classical approach used for \(\mathbb{Z}^d\), requires additional entropy estimates via stability of the model distribution under the coarse-graining operation, which leverages its linear reinforcement.
Submitted 2 May, 2025;
originally announced May 2025.
-
Monotone Peridynamic Neural Operator for Nonlinear Material Modeling with Conditionally Unique Solutions
Authors:
Jihong Wang,
Xiaochuan Tian,
Zhongqiang Zhang,
Stewart Silling,
Siavash Jafarzadeh,
Yue Yu
Abstract:
Data-driven methods have emerged as powerful tools for modeling the responses of complex nonlinear materials directly from experimental measurements. Among these methods, data-driven constitutive models offer advantages in physical interpretability and generalizability across different boundary conditions and domain settings. However, the well-posedness of these learned models is generally not guaranteed a priori, which makes the models prone to non-physical solutions in downstream simulation tasks. In this study, we introduce the monotone peridynamic neural operator (MPNO), a novel data-driven approach to learning nonlocal constitutive models based on neural operators. Our approach learns a nonlocal kernel together with a nonlinear constitutive relation, while ensuring solution uniqueness through a monotone gradient network. This architectural constraint on the gradient induces convexity of the learnt energy density function, thereby guaranteeing solution uniqueness of MPNO in small deformation regimes. To validate our approach, we evaluate MPNO's performance on both synthetic and real-world datasets. On synthetic datasets with manufactured kernel and constitutive relation, we show, both theoretically and numerically, that the learnt model converges to the ground truth as the measurement grid size decreases. Additionally, MPNO generalizes better than conventional neural networks: it yields smaller displacement solution errors in downstream tasks with new and unseen loadings. Finally, we showcase the practical utility of our approach through applications in learning a homogenized model from molecular dynamics data, highlighting its expressivity and robustness in real-world scenarios.
Submitted 2 May, 2025;
originally announced May 2025.
-
Eisenstein cocycles for imaginary quadratic fields
Authors:
Emmanuel Lecouturier,
Romyar Sharifi,
Sheng-Chi Shih,
Jun Wang
Abstract:
We construct Eisenstein cocycles for arithmetic subgroups of GL_2 of imaginary quadratic fields valued in second K-groups of products of two CM elliptic curves. We use these to construct maps from the first homology groups of Bianchi spaces to corresponding second K-groups of ray class fields and to verify the Eisenstein property of these maps for prime-to-level Hecke operators.
Submitted 27 April, 2025;
originally announced April 2025.
-
Stationary distributions of McKean-Vlasov SDEs with jumps: existence, uniqueness, and multiplicity
Authors:
Jianhai Bao,
Jian Wang
Abstract:
In this paper, we study the existence, uniqueness, and multiplicity of stationary distributions for McKean-Vlasov SDEs with jumps. In detail, for McKean-Vlasov SDEs driven by pure jump Lévy processes, we principally (i) explore the existence of stationary distributions via Schauder's fixed point theorem under an appropriate Lyapunov condition; (ii) tackle the uniqueness of stationary distributions and the convergence to the equilibria as long as the underlying drifts are continuous with respect to the measure variables under the weighted total variation distance and the $L^1$-Wasserstein distance, respectively; (iii) demonstrate the multiplicity of stationary distributions under a locally dissipative condition. In addition, some illustrative examples are provided to show that the associated McKean-Vlasov SDEs possess one, two, and three stationary distributions, respectively.
Submitted 22 April, 2025;
originally announced April 2025.
-
Mathematical Analysis of the PDE Model for the Consensus-based Optimization
Authors:
Jinhuan Wang,
Keyu Li,
Hui Huang
Abstract:
In this paper, we develop an analytical framework for the partial differential equation underlying the consensus-based optimization model. The main challenge arises from the nonlinear, nonlocal nature of the consensus point, coupled with a diffusion term that is both singular and degenerate. By employing a regularization procedure in combination with a compactness argument, we establish the global existence and uniqueness of weak solutions in $L^\infty(0,T;L^1\cap L^\infty(\mathbb{R}^d))$. Furthermore, we show that the weak solutions exhibit improved $H^2$-regularity when the initial data is regular.
Submitted 15 April, 2025;
originally announced April 2025.