-
Generalized Rellich's lemmas, uniqueness theorem and inside-out duality for scattering poles
Authors:
Xiaodong Liu,
Jiguang Sun,
Lei Zhang
Abstract:
Scattering poles correspond to non-trivial scattered fields in the absence of incident waves and play a crucial role in the study of wave phenomena. These poles are complex wavenumbers with negative imaginary parts. In this paper, we prove two generalized Rellich's lemmas for scattered fields associated with complex wavenumbers. These lemmas are then used to establish uniqueness results for invers…
▽ More
Scattering poles correspond to non-trivial scattered fields in the absence of incident waves and play a crucial role in the study of wave phenomena. These poles are complex wavenumbers with negative imaginary parts. In this paper, we prove two generalized Rellich's lemmas for scattered fields associated with complex wavenumbers. These lemmas are then used to establish uniqueness results for inverse scattering problems. We further explore the inside-out duality, which characterizes scattering poles through the linear sampling method applied to interior scattering problems. Notably, we demonstrate that exterior Dirichlet/Neumann poles can be identified without prior knowledge of the actual sound-soft or sound-hard obstacles. Numerical examples are provided to validate the theoretical results.
△ Less
Submitted 6 July, 2025;
originally announced July 2025.
-
Split-Merge Revisited: A Scalable Approach to Generalized Eigenvalue Problems
Authors:
Xiaozhi Liu,
Yong Xia
Abstract:
The generalized eigenvalue problem (GEP) serves as a cornerstone in a wide range of applications in numerical linear algebra and scientific computing. However, traditional approaches that aim to maximize the classical Rayleigh quotient often suffer from numerical instability and limited computational efficiency, especially in large-scale settings. In this work, we explore an alternative difference…
▽ More
The generalized eigenvalue problem (GEP) serves as a cornerstone in a wide range of applications in numerical linear algebra and scientific computing. However, traditional approaches that aim to maximize the classical Rayleigh quotient often suffer from numerical instability and limited computational efficiency, especially in large-scale settings. In this work, we explore an alternative difference-based formulation of GEP by minimizing a structured quadratic polynomial objective, which enables the application of efficient first-order optimization methods. We establish global convergence guarantees for these methods without requiring line search, and further introduce a transform-domain perspective that reveals the intrinsic connection and performance gap between classical first-order algorithms and the power method. Based on this insight, we develop an accelerated preconditioned mirror descent algorithm, which allows for flexible preconditioner design and improved convergence behavior. Lastly, we extend the recently proposed Split-Merge algorithm to the general GEP setting, incorporating richer second-order information to further accelerate convergence. Empirical results on both synthetic and real-world datasets demonstrate that our proposed methods achieve significant improvements over existing baselines in terms of both computational efficiency and numerical stability.
△ Less
Submitted 3 July, 2025;
originally announced July 2025.
-
Existence and concentration of nontrivial solutions for quasilinear Schrödinger equation with indefinite potential
Authors:
Lifeng Yin,
Xiaoqi Liu,
Yongyong Li
Abstract:
This paper is concerned with the quasilinear Schrödinger equation \begin{align*} -Δu+V(x)u+\frac{k}{2}Δ(u^2)u=f(u)\quad \text{in}~~\mathbb{R}^N\text{,} \end{align*} where $N\geq 3$, $k>0$, $V\in C(\R)$ is an indefinite potential. Under structural conditions on the potential $V$ and the nonlinearity $f$, we establish the existence of a nontrivial solution through a combination of a local linking ar…
▽ More
This paper is concerned with the quasilinear Schrödinger equation \begin{align*} -Δu+V(x)u+\frac{k}{2}Δ(u^2)u=f(u)\quad \text{in}~~\mathbb{R}^N\text{,} \end{align*} where $N\geq 3$, $k>0$, $V\in C(\R)$ is an indefinite potential. Under structural conditions on the potential $V$ and the nonlinearity $f$, we establish the existence of a nontrivial solution through a combination of a local linking argument, Morse theory, and the Moser iteration. Moreover, if $f$ is odd, we obtain an unbounded sequence of nontrivial solutions via the symmetric Mountain Pass Theorem. Additionally, as $k\rightarrow0$, we analyze the concentration behavior of nontrivial solutions.
△ Less
Submitted 2 July, 2025;
originally announced July 2025.
-
Turán density of tight cycles minus one edge in the $\ell_2$-norm
Authors:
Levente Bodnár,
Jinghua Deng,
Jianfeng Hou,
Xizhi Liu,
Hongbin Zhao
Abstract:
The $3$-uniform tight $\ell$-cycle minus one edge $C_{\ell}^{3-}$ is the $3$-graph on $\ell$ vertices consisting of $\ell-1$ consecutive triples in the cyclic order. We show that for every integer $\ell \ge 5$ satisfying $\ell\not\equiv 0\pmod3$, every $C_{\ell}^{3-}$-free $3$-graph whose $\ell_2$-norm, that is, the sum of codegree squares, is close to the maximum must be structurally close to the…
▽ More
The $3$-uniform tight $\ell$-cycle minus one edge $C_{\ell}^{3-}$ is the $3$-graph on $\ell$ vertices consisting of $\ell-1$ consecutive triples in the cyclic order. We show that for every integer $\ell \ge 5$ satisfying $\ell\not\equiv 0\pmod3$, every $C_{\ell}^{3-}$-free $3$-graph whose $\ell_2$-norm, that is, the sum of codegree squares, is close to the maximum must be structurally close to the iterative blowup of a single triple. This confirms a conjecture of Balogh--Clemen--Lidický~[Surveys in combinatorics 2022, 21-63] in a stronger form.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
Sabotage the Mantel Theorem
Authors:
Natalie Behague,
Debsoumya Chakraborti,
Xizhi Liu
Abstract:
One of the earliest results in extremal graph theory, Mantel's theorem, states that the maximum number of edges in a triangle-free graph $G$ on $n$ vertices is $\lfloor n^2/4 \rfloor$. We investigate how this extremal bound is affected when $G$ is additionally required to contain a prescribed graph $\mathbb{P}$ as a subgraph. We establish general upper and lower bounds for this problem, which are…
▽ More
One of the earliest results in extremal graph theory, Mantel's theorem, states that the maximum number of edges in a triangle-free graph $G$ on $n$ vertices is $\lfloor n^2/4 \rfloor$. We investigate how this extremal bound is affected when $G$ is additionally required to contain a prescribed graph $\mathbb{P}$ as a subgraph. We establish general upper and lower bounds for this problem, which are tight in the exponent for random triangle-free graphs and graphs generated by the triangle-free process, when the size of $\mathbb{P}$ lies within certain ranges.
△ Less
Submitted 30 June, 2025;
originally announced June 2025.
-
Spectral Approximation to Fractional Integral Operators
Authors:
Xiaolin Liu,
Kuan Xu
Abstract:
We propose a fast and stable method for constructing matrix approximations to fractional integral operators applied to series in the Chebyshev fractional polynomials. This method utilizes a recurrence relation satisfied by the fractional integrals of mapped Chebyshev polynomials and significantly outperforms existing methods. Through numerical examples, we highlight the broad applicability of thes…
▽ More
We propose a fast and stable method for constructing matrix approximations to fractional integral operators applied to series in the Chebyshev fractional polynomials. This method utilizes a recurrence relation satisfied by the fractional integrals of mapped Chebyshev polynomials and significantly outperforms existing methods. Through numerical examples, we highlight the broad applicability of these matrix approximations, including the solution of boundary value problems for fractional integral and differential equations. Additional applications include fractional differential equation initial value problems and fractional eigenvalue problems.
△ Less
Submitted 6 July, 2025; v1 submitted 24 June, 2025;
originally announced June 2025.
-
When and How Unlabeled Data Provably Improve In-Context Learning
Authors:
Yingcong Li,
Xiangyu Chang,
Muti Kara,
Xiaofeng Liu,
Amit Roy-Chowdhury,
Samet Oymak
Abstract:
Recent research shows that in-context learning (ICL) can be effective even when demonstrations have missing or incorrect labels. To shed light on this capability, we examine a canonical setting where the demonstrations are drawn according to a binary Gaussian mixture model (GMM) and a certain fraction of the demonstrations have missing labels. We provide a comprehensive theoretical study to show t…
▽ More
Recent research shows that in-context learning (ICL) can be effective even when demonstrations have missing or incorrect labels. To shed light on this capability, we examine a canonical setting where the demonstrations are drawn according to a binary Gaussian mixture model (GMM) and a certain fraction of the demonstrations have missing labels. We provide a comprehensive theoretical study to show that: (1) The loss landscape of one-layer linear attention models recover the optimal fully-supervised estimator but completely fail to exploit unlabeled data; (2) In contrast, multilayer or looped transformers can effectively leverage unlabeled data by implicitly constructing estimators of the form $\sum_{i\ge 0} a_i (X^\top X)^iX^\top y$ with $X$ and $y$ denoting features and partially-observed labels (with missing entries set to zero). We characterize the class of polynomials that can be expressed as a function of depth and draw connections to Expectation Maximization, an iterative pseudo-labeling algorithm commonly used in semi-supervised learning. Importantly, the leading polynomial power is exponential in depth, so mild amount of depth/looping suffices. As an application of theory, we propose looping off-the-shelf tabular foundation models to enhance their semi-supervision capabilities. Extensive evaluations on real-world datasets show that our method significantly improves the semisupervised tabular learning performance over the standard single pass inference.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
Stable Computation of Laplacian Eigenfunctions Corresponding to Clustered Eigenvalues
Authors:
Ryoki Endo,
Xuefeng Liu
Abstract:
The accurate computation of eigenfunctions corresponding to tightly clustered Laplacian eigenvalues remains an extremely difficult problem. In this paper, using the shape difference quotient of eigenvalues, we propose a stable computation method for the eigenfunctions of clustered eigenvalues caused by domain perturbation.
The accurate computation of eigenfunctions corresponding to tightly clustered Laplacian eigenvalues remains an extremely difficult problem. In this paper, using the shape difference quotient of eigenvalues, we propose a stable computation method for the eigenfunctions of clustered eigenvalues caused by domain perturbation.
△ Less
Submitted 8 June, 2025;
originally announced June 2025.
-
Optimal Fluctuations for Nonlinear Chemical Reaction Systems with General Rate Law
Authors:
Feng Zhao,
Jinjie Zhu,
Yang Li,
Xianbin Liu,
Dongping Jin
Abstract:
This paper investigates optimal fluctuations for chemical reaction systems with N species, M reactions, and general rate law. In the limit of large volume, large fluctuations for such models occur with overwhelming probability in the vicinity of the so-called optimal path, which is a basic consequence of the Freidlin-Wentzell theory, and is vital in biochemistry as it unveils the almost determinis…
▽ More
This paper investigates optimal fluctuations for chemical reaction systems with N species, M reactions, and general rate law. In the limit of large volume, large fluctuations for such models occur with overwhelming probability in the vicinity of the so-called optimal path, which is a basic consequence of the Freidlin-Wentzell theory, and is vital in biochemistry as it unveils the almost deterministic mechanism concealed behind rare noisy phenomena such as escapes from the attractive domain of a stable state and transitions between different metastable states. In this study, an alternative description for optimal fluctuations is proposed in both non-stationary and stationary settings by means of a quantity called prehistory probability in the same setting, respectively. The evolution law of each of them is derived, showing their relationship with the time reversal of a specified family of probability distributions respectively. The law of large numbers and the central limit theorem for the reversed processes are then proved. In doing so, the prehistorical approach to optimal fluctuations for Langevin dynamics is naturally generalized to the present case, thereby suggesting a strong connection between optimal fluctuations and the time reversal of the chemical reaction model.
△ Less
Submitted 7 June, 2025;
originally announced June 2025.
-
A Newton Augmented Lagrangian Method for Symmetric Cone Programming with Complexity Analysis
Authors:
Rui-Jin Zhang,
Ruoyu Diao,
Xin-Wei Liu,
Yu-Hong Dai
Abstract:
Symmetric cone programming incorporates a broad class of convex optimization problems, including linear programming, second-order cone programming, and semidefinite programming. Although the augmented Lagrangian method (ALM) is well-suited for large-scale scenarios, its subproblems are often not second-order continuously differentiable, preventing direct use of classical Newton methods. To address…
▽ More
Symmetric cone programming incorporates a broad class of convex optimization problems, including linear programming, second-order cone programming, and semidefinite programming. Although the augmented Lagrangian method (ALM) is well-suited for large-scale scenarios, its subproblems are often not second-order continuously differentiable, preventing direct use of classical Newton methods. To address this issue, we observe that barrier functions from interior-point methods (IPMs) naturally serve as effective smoothing terms to alleviate such nonsmoothness. By combining the strengths of ALM and IPMs, we construct a novel augmented Lagrangian function and subsequently develop a Newton augmented Lagrangian (NAL) method. By leveraging the self-concordance property of the barrier function, the proposed method is shown to achieve an $\mathcal{O}(ε^{-1})$ complexity bound. Furthermore, we demonstrate that the condition numbers of the Schur complement matrices in the NAL method are considerably better than those of classical IPMs, as visually evidenced by a heatmap of condition numbers. Numerical experiments conducted on standard benchmarks confirm that the NAL method exhibits significant performance improvements compared to several existing methods.
△ Less
Submitted 5 June, 2025;
originally announced June 2025.
-
Convergence of spectra of digraph limits
Authors:
Jan Grebík,
Daniel Král',
Xizhi Liu,
Oleg Pikhurko,
Julia Slipantschuk
Abstract:
The relation between densities of cycles and the spectrum of a graphon, which implies that the spectra of convergent graphons converge, fundamentally relies on the self-adjointness of the linear operator associated with a graphon. In this short paper, we consider the setting of digraphons, which are limits of directed graphs, and prove that the spectra of convergent digraphons converge. Using this…
▽ More
The relation between densities of cycles and the spectrum of a graphon, which implies that the spectra of convergent graphons converge, fundamentally relies on the self-adjointness of the linear operator associated with a graphon. In this short paper, we consider the setting of digraphons, which are limits of directed graphs, and prove that the spectra of convergent digraphons converge. Using this result, we establish the relation between densities of directed cycles and the spectrum of a digraphon.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
The Turán density of short tight cycles
Authors:
Levente Bodnár,
Jared León,
Xizhi Liu,
Oleg Pikhurko
Abstract:
The $3$-uniform tight $\ell$-cycle $C_\ell^{3}$ is the $3$-graph on $\{1,\dots,\ell\}$ consisting of all $\ell$ consecutive triples in the cyclic order. Let $\mathcal{C}$ be either the pair $\{C_{4}^{3}, C_{5}^{3}\}$ or the single tight $\ell$-cycle $C_{\ell}^{3}$ for some $\ell\ge 7$ not divisible by $3$.
We show that the Turán density of $\mathcal{C}$, that is, the asymptotically maximal edge…
▽ More
The $3$-uniform tight $\ell$-cycle $C_\ell^{3}$ is the $3$-graph on $\{1,\dots,\ell\}$ consisting of all $\ell$ consecutive triples in the cyclic order. Let $\mathcal{C}$ be either the pair $\{C_{4}^{3}, C_{5}^{3}\}$ or the single tight $\ell$-cycle $C_{\ell}^{3}$ for some $\ell\ge 7$ not divisible by $3$.
We show that the Turán density of $\mathcal{C}$, that is, the asymptotically maximal edge density of a large $\mathcal{C}$-free $3$-graph, is equal to $2\sqrt{3} - 3$. We also establish the corresponding Erdős-Simonovits-type stability result, informally stating that all almost maximum $\mathcal{C}$-free graphs are close in the edit distance to a 2-part recursive construction. This extends the earlier analogous results of Kamčev-Letzter-Pokrovskiy ["The Turán density of tight cycles in three-uniform hypergraphs", Int. Math. Res. Not. 6 (2024), 4804-4841] that apply for sufficiently large $\ell$ only.
Additionally, we prove a finer structural result that allows us to determine the maximum number of edges in a $\{C_{4}^{3}, C_{5}^{3}\}$-free $3$-graph with a given number of vertices up to an additive $O(1)$ error term.
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression
Authors:
Zhong-Zhi Li,
Xiao Liang,
Zihao Tang,
Lei Ji,
Peijie Wang,
Haotian Xu,
Xing W,
Haizhen Huang,
Weiwei Deng,
Yeyun Gong,
Zhijiang Guo,
Xiao Liu,
Fei Yin,
Cheng-Lin Liu
Abstract:
Large Language Models (LLMs) have recently achieved remarkable progress by leveraging Reinforcement Learning and extended Chain-of-Thought (CoT) techniques. However, the challenge of performing efficient language reasoning--especially during inference with extremely long outputs--has drawn increasing attention from the research community. In this work, we propose a dynamic ratio-based training pip…
▽ More
Large Language Models (LLMs) have recently achieved remarkable progress by leveraging Reinforcement Learning and extended Chain-of-Thought (CoT) techniques. However, the challenge of performing efficient language reasoning--especially during inference with extremely long outputs--has drawn increasing attention from the research community. In this work, we propose a dynamic ratio-based training pipeline that does not rely on sophisticated data annotations or interpolation between multiple models. We continuously balance the weights between the model's System-1 and System-2 data to eliminate redundant reasoning processes while preserving the model's reasoning capability. We validate our approach across models on DeepSeek-R1-Distill-7B and DeepSeek-R1-Distill-14B and on a diverse set of benchmarks with varying difficulty levels. Our method significantly reduces the number of output tokens by nearly 40% while maintaining the accuracy of the reasoning. Our code and data will be available soon.
△ Less
Submitted 14 June, 2025; v1 submitted 3 June, 2025;
originally announced June 2025.
-
Constrained Sliced Wasserstein Embedding
Authors:
Navid NaderiAlizadeh,
Darian Salehi,
Xinran Liu,
Soheil Kolouri
Abstract:
Sliced Wasserstein (SW) distances offer an efficient method for comparing high-dimensional probability measures by projecting them onto multiple 1-dimensional probability distributions. However, identifying informative slicing directions has proven challenging, often necessitating a large number of slices to achieve desirable performance and thereby increasing computational complexity. We introduc…
▽ More
Sliced Wasserstein (SW) distances offer an efficient method for comparing high-dimensional probability measures by projecting them onto multiple 1-dimensional probability distributions. However, identifying informative slicing directions has proven challenging, often necessitating a large number of slices to achieve desirable performance and thereby increasing computational complexity. We introduce a constrained learning approach to optimize the slicing directions for SW distances. Specifically, we constrain the 1D transport plans to approximate the optimal plan in the original space, ensuring meaningful slicing directions. By leveraging continuous relaxations of these transport plans, we enable a gradient-based primal-dual approach to train the slicer parameters, alongside the remaining model parameters. We demonstrate how this constrained slicing approach can be applied to pool high-dimensional embeddings into fixed-length permutation-invariant representations. Numerical results on foundation models trained on images, point clouds, and protein sequences showcase the efficacy of the proposed constrained learning approach in learning more informative slicing directions. Our implementation code can be found at https://github.com/Stranja572/constrainedswe.
△ Less
Submitted 2 June, 2025;
originally announced June 2025.
-
Computing matrix $\varphi$-functions arising in exponential integrators
Authors:
Awad H. Al-Mohy,
Xiaobo Liu
Abstract:
A new scaling and recovering algorithm is proposed for simultaneously computing the matrix $\varphi$-functions that arise in exponential integrator methods for the numerical solution of certain first-order systems of ordinary differential equations (ODEs). The algorithm initially scales the input matrix down by a nonnegative integer power of two, then computes the $[m/m]$ diagonal Padé approximant…
▽ More
A new scaling and recovering algorithm is proposed for simultaneously computing the matrix $\varphi$-functions that arise in exponential integrator methods for the numerical solution of certain first-order systems of ordinary differential equations (ODEs). The algorithm initially scales the input matrix down by a nonnegative integer power of two, then computes the $[m/m]$ diagonal Padé approximant to $\varphi_p$, where $p$ is the largest index of interest. The remaining $[m+p{-}j/m]$ Padé approximants to $\varphi_j$, $0 \le j < p$, are obtained implicitly via a recurrence relation. The effect of scaling is subsequently recovered using the double-argument formula. A rigorous backward error analysis, based on the $[m+p/m]$ Padé approximant to the exponential, enables sharp bounds on the relative backward errors. These bounds are expressed in terms of the sequence $\|A^k\|^{1/k}$, which can be much smaller than $\|A\|$ for nonnormal matrices. The scaling parameter and the degrees of the Padé approximants are selected to minimize the overall computational cost, which benefits from the a priori sharpness of the bounds and the optimal evaluation schemes for diagonal Padé approximants. Furthermore, if the input matrix is (quasi-)triangular, the algorithm exploits its structure in the recovering phase. Numerical experiments demonstrate the superiority of the proposed algorithm over existing alternatives in both accuracy and efficiency.
△ Less
Submitted 1 June, 2025;
originally announced June 2025.
-
A Hybrid Subgradient Method for Nonsmooth Nonconvex Bilevel Optimization
Authors:
Nachuan Xiao,
Xiaoyin Hu,
Xin Liu,
Kim-Chuan Toh
Abstract:
In this paper, we focus on the nonconvex-nonconvex bilevel optimization problem (BLO), where both upper-level and lower-level objectives are nonconvex, with the upper-level problem potentially being nonsmooth. We develop a two-timescale momentum-accelerated subgradient method (TMG) that employs two-timescale stepsizes, and establish its local convergence when initialized within a sufficiently smal…
▽ More
In this paper, we focus on the nonconvex-nonconvex bilevel optimization problem (BLO), where both upper-level and lower-level objectives are nonconvex, with the upper-level problem potentially being nonsmooth. We develop a two-timescale momentum-accelerated subgradient method (TMG) that employs two-timescale stepsizes, and establish its local convergence when initialized within a sufficiently small neighborhood of the feasible region. To develop a globally convergent algorithm for (BLO), we introduce a feasibility restoration scheme (FRG) that drives iterates toward the feasible region. Both (TMG) and (FRG) only require the first-order derivatives of the upper-level and lower-level objective functions, ensuring efficient computations in practice. We then develop a novel hybrid method that alternates between (TMG) and (FRG) and adaptively estimates its hyperparameters. Under mild conditions, we establish the global convergence properties of our proposed algorithm. Preliminary numerical experiments demonstrate the high efficiency and promising potential of our proposed algorithm.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
Asymptotic Efficiency Analysis of the Recursive Least-Squares Algorithm for ARX Systems Without Projection
Authors:
Xingrui Liu,
Jieming Ke,
Yanlong Zhao
Abstract:
This paper investigates the optimality analysis of the recursive least-squares (RLS) algorithm for autoregressive systems with exogenous inputs (ARX systems). A key challenge in analyzing is managing the potential unboundedness of the parameter estimates, which may diverge to infinity. Previous approaches addressed this issue by assuming that both the true parameter and the RLS estimates remain co…
▽ More
This paper investigates the optimality analysis of the recursive least-squares (RLS) algorithm for autoregressive systems with exogenous inputs (ARX systems). A key challenge in analyzing is managing the potential unboundedness of the parameter estimates, which may diverge to infinity. Previous approaches addressed this issue by assuming that both the true parameter and the RLS estimates remain confined within a known compact set, thereby ensuring uniform boundedness throughout the analysis. In contrast, we propose a new analytical framework that eliminates the need for such a boundness assumption. Specifically, we establish a quantitative relationship between the bounded moment conditions of quasi-stationary input/output signals and the convergence rate of the tail probability of the RLS estimation error. Based on this technique, we prove that when system inputs/outputs have bounded twentieth-order moments, the RLS algorithm achieves asymptotic normality and the covariance matrix of the RLS algorithm converges to the Cramér-Rao lower bound (CRLB), confirming its asymptotic efficiency. These results demonstrate that the RLS algorithm is an asymptotically optimal identification algorithm for ARX systems, even without the projection operators to ensure that parameter estimates reside within a prior known compact set.
△ Less
Submitted 25 May, 2025;
originally announced May 2025.
-
Identifying convex obstacles from backscattering far field data
Authors:
Jialei Li,
Xiaodong Liu,
Qingxiang Shi
Abstract:
The recovery of anomalies from backscattering far field data is a long-standing open problem in inverse scattering theory. We make a first step in this direction by establishing the unique identifiability of convex impenetrable obstacles from backscattering far field measurements. Specifically, we prove that both the boundary and the boundary conditions of the convex obstacle are uniquely determin…
▽ More
The recovery of anomalies from backscattering far field data is a long-standing open problem in inverse scattering theory. We make a first step in this direction by establishing the unique identifiability of convex impenetrable obstacles from backscattering far field measurements. Specifically, we prove that both the boundary and the boundary conditions of the convex obstacle are uniquely determined by the far field pattern measured in backscattering directions for all frequencies. The key tool is Majda's asymptotic estimate of the far field patterns in the high-frequency regime. Furthermore, we introduce a fast and stable numerical algorithm for reconstructing the boundary and computing the boundary condition. A key feature of the algorithm is that the boundary condition can be computed even if the boundary is not known, and vice versa. Numerical experiments demonstrate the validity and robustness of the proposed algorithm.
△ Less
Submitted 17 May, 2025;
originally announced May 2025.
-
Sufficient conditions for $t$-tough graphs to be Hamiltonian and pancyclic or bipartite
Authors:
Xiangge Liu,
Caili Jia,
Yong Lu,
Jiaxu Zhong
Abstract:
The toughness of graph $G$, denoted by $τ(G)$, is $τ(G)=\min\{\frac{|S|}{c(G-S)}:S\subseteq V(G),c(G-S)\geq2\}$ for every vertex cut $S$ of $V(G)$ and the number of components of $G$ is denoted by $c(G)$. Bondy in 1973, suggested the ``metaconjecture" that almost any nontrivial condition on a graph which implies that the graph is Hamiltonian also implies that the graph is pancyclic. Recently, Bene…
▽ More
The toughness of graph $G$, denoted by $τ(G)$, is $τ(G)=\min\{\frac{|S|}{c(G-S)}:S\subseteq V(G),c(G-S)\geq2\}$ for every vertex cut $S$ of $V(G)$ and the number of components of $G$ is denoted by $c(G)$. Bondy in 1973, suggested the ``metaconjecture" that almost any nontrivial condition on a graph which implies that the graph is Hamiltonian also implies that the graph is pancyclic. Recently, Benediktovich [Discrete Applied Mathematics. 365 (2025) 130--137] confirmed the Bondy's metaconjecture for $t$-tough graphs in the case when $t\in\{1;2;3\}$ in terms of the size, the spectral radius and the signless Laplacian spectral radius of the graph. In this paper, we will confirm the Bondy's metaconjecture for $t$-tough graphs in the case when $t\geq4$ in terms of the size, the spectral radius, the signless Laplacian spectral radius, the distance spectral radius and the distance signless Laplacian spectral radius of graphs.
△ Less
Submitted 16 May, 2025;
originally announced May 2025.
-
Nonlinear optical response in kagome lattice with inversion symmetry breaking
Authors:
Xiangyang Liu,
Junwen Lai,
Jie Zhan,
Tianye Yu,
Peitao Liu,
Seiji Yunoki,
Xing-Qiu Chen,
Yan Sun
Abstract:
The kagome lattice is a fundamental model structure in condensed matter physics and materials science featuring symmetry-protected flat bands, saddle points, and Dirac points. This structure has emerged as an ideal platform for exploring various quantum physics. By combining effective model analysis and first-principles calculations, we propose that the synergy among inversion symmetry breaking, f…
▽ More
The kagome lattice is a fundamental model structure in condensed matter physics and materials science featuring symmetry-protected flat bands, saddle points, and Dirac points. This structure has emerged as an ideal platform for exploring various quantum physics. By combining effective model analysis and first-principles calculations, we propose that the synergy among inversion symmetry breaking, flat bands, and saddle point-related van Hove singularities within the kagome lattice holds significant potential for generating strong second-order nonlinear optical response. This property provides an inspiring insight into the practical application of the kagome-like materials, which is helpful for a comprehensive understanding of kagome lattice-related physics. Moreover, this work offers an alternative approach for designing materials with strong a second-order nonlinear optical response.
△ Less
Submitted 13 May, 2025;
originally announced May 2025.
-
The defocusing energy-supercritical inhomogeneous NLS in four space dimension
Authors:
Xuan Liu,
Chengbin Xu
Abstract:
In this paper, we investigate the global well-posedness and scattering theory for the defocusing energy supcritical inhomogeneous nonlinear Schrödinger equation $iu_t + Δu =|x|^{-b} |u|^αu$ in four space dimension, where $s_c := 2- \frac{2-b}α \in (1, 2)$ and $0<b<\min \{ (s_c-1)^2+1,3-s_c\}$.
We prove that if the solution has a prior bound in the critical Sobolev space, that is,…
▽ More
In this paper, we investigate the global well-posedness and scattering theory for the defocusing energy supcritical inhomogeneous nonlinear Schrödinger equation $iu_t + Δu =|x|^{-b} |u|^αu$ in four space dimension, where $s_c := 2- \frac{2-b}α \in (1, 2)$ and $0<b<\min \{ (s_c-1)^2+1,3-s_c\}$.
We prove that if the solution has a prior bound in the critical Sobolev space, that is, $u \in L_t^\infty(I; \dot{H}_x^{s_c}(\mathbb{R}^4))$, then $u$ is global and scatters. The proof of the main results is based on the concentration-compactness/rigidity framework developed by Kenig and Merle [Invent. Math. 166 (2006)], together with a long-time Strichartz estimate, a spatially localized Morawetz estimate, and a frequency-localized Morawetz estimate.
△ Less
Submitted 8 May, 2025;
originally announced May 2025.
-
Time-lagged marginal expected shortfall
Authors:
Jiajun Liu,
Xuannan Liu,
Yuwei Zhao
Abstract:
Marginal expected shortfall (MES) is an important measure when assessing and quantifying the contribution of the financial institution to a systemic crisis. In this paper, we propose time-lagged marginal expected shortfall (TMES) as a dynamic extension of the MES, accounting for time lags in assessing systemic risks. A natural estimator for the TMES is proposed, and its asymptotic properties are s…
▽ More
Marginal expected shortfall (MES) is an important measure when assessing and quantifying the contribution of the financial institution to a systemic crisis. In this paper, we propose time-lagged marginal expected shortfall (TMES) as a dynamic extension of the MES, accounting for time lags in assessing systemic risks. A natural estimator for the TMES is proposed, and its asymptotic properties are studied. To address challenges in constructing confidence intervals for the TMES in practice, we apply the stationary bootstrap method to generate confidence bands for the TMES estimator. Extensive simulation studies were conducted to investigate the asymptotic properties of empirical and bootstrapped TMES. Two practical applications of TMES, supported by real data analyses, effectively demonstrate its ability to account for time lags in risk assessment.
△ Less
Submitted 7 May, 2025;
originally announced May 2025.
-
Global well-posedness of the elastic-viscous-plastic sea-ice model with the inviscid Voigt-regularisation
Authors:
Daniel W. Boutros,
Xin Liu,
Marita Thomas,
Edriss S. Titi
Abstract:
In this paper, we initiate the rigorous mathematical analysis of the elastic-viscous-plastic (EVP) sea-ice model, which was introduced in [E. C. Hunke and J. K. Dukowicz, J. Phys. Oceanogr., 27, 9 (1997), 1849-1867]. The EVP model is one of the standard and most commonly used dynamical sea-ice models. We study a regularized version of this model. In particular, we prove the global well-posedness o…
▽ More
In this paper, we initiate the rigorous mathematical analysis of the elastic-viscous-plastic (EVP) sea-ice model, which was introduced in [E. C. Hunke and J. K. Dukowicz, J. Phys. Oceanogr., 27, 9 (1997), 1849-1867]. The EVP model is one of the standard and most commonly used dynamical sea-ice models. We study a regularized version of this model. In particular, we prove the global well-posedness of the EVP model with the inviscid Voigt-regularisation of the evolution equation for the stress tensor. Due to the elastic relaxation and the Voigt regularisation, we are able to handle the case of viscosity coefficients without cutoff, which has been a major issue and a setback in the computational study and analysis of the related Hibler sea-ice model, which was originally introduced in [W. D. Hibler, J. Phys. Oceanogr., 9, 4 (1979), 815-846]. The EVP model shares some structural characteristics with the Oldroyd-B model and related models for viscoelastic non-Newtonian complex fluids.
△ Less
Submitted 5 May, 2025;
originally announced May 2025.
-
Integral Representations of Sobolev Spaces via ReLU$^k$ Activation Function and Optimal Error Estimates for Linearized Networks
Authors:
Xinliang Liu,
Tong Mao,
Jinchao Xu
Abstract:
This paper presents two main theoretical results concerning shallow neural networks with ReLU$^k$ activation functions. We establish a novel integral representation for Sobolev spaces, showing that every function in $H^{\frac{d+2k+1}{2}}(Ω)$ can be expressed as an $L^2$-weighted integral of ReLU$^k$ ridge functions over the unit sphere. This result mirrors the known representation of Barron spaces…
▽ More
This paper presents two main theoretical results concerning shallow neural networks with ReLU$^k$ activation functions. We establish a novel integral representation for Sobolev spaces, showing that every function in $H^{\frac{d+2k+1}{2}}(Ω)$ can be expressed as an $L^2$-weighted integral of ReLU$^k$ ridge functions over the unit sphere. This result mirrors the known representation of Barron spaces and highlights a fundamental connection between Sobolev regularity and neural network representations. Moreover, we prove that linearized shallow networks -- constructed by fixed inner parameters and optimizing only the linear coefficients -- achieve optimal approximation rates $O(n^{-\frac{1}{2}-\frac{2k+1}{2d}})$ in Sobolev spaces.
△ Less
Submitted 12 May, 2025; v1 submitted 1 May, 2025;
originally announced May 2025.
-
On the small mass limit of stochastic wave equation driven by cylindrical stable process
Authors:
Qingming Zhao,
Xueru Liu,
Wei Wang
Abstract:
We explore the small mass limit of a stochastic wave equation (SWE) driven by cylindrical $α$-stable noise, where $α\in (1,2)$, and prove that it converges to a stochastic heat equation. We establish its well-posedness, and in particular, the càdlàg property, which is not trivial in the infinite dimensional case. Using a splitting technique, we decompose the velocity component into three parts, wh…
▽ More
We explore the small mass limit of a stochastic wave equation (SWE) driven by cylindrical $α$-stable noise, where $α\in (1,2)$, and prove that it converges to a stochastic heat equation. We establish its well-posedness, and in particular, the càdlàg property, which is not trivial in the infinite dimensional case. Using a splitting technique, we decompose the velocity component into three parts, which gives convenience to the moment estimate. We show the tightness of solution of SWE by verifying the infinite dimensional version of Aldous condition. After these preparation, we pass the limit and derive the approximation equation.
△ Less
Submitted 30 April, 2025;
originally announced April 2025.
-
A state reduction approach for learning-based model predictive control for train rescheduling
Authors:
Caio Fabio Oliveira da Silva,
Xiaoyu Liu,
Azita Dabiri,
Bart De Schutter
Abstract:
This paper proposes a state reduction method for learning-based model predictive control (MPC) for train rescheduling in urban rail transit systems. The state reduction integrates into a control framework where the discrete decision variables are determined by a learning-based classifier and the continuous decision variables are computed by MPC. Herein, the state representation is designed separat…
▽ More
This paper proposes a state reduction method for learning-based model predictive control (MPC) for train rescheduling in urban rail transit systems. The state reduction integrates into a control framework where the discrete decision variables are determined by a learning-based classifier and the continuous decision variables are computed by MPC. Herein, the state representation is designed separately for each component of the control framework. While a reduced state is employed for learning, a full state is used in MPC. Simulations on a large-scale train network highlight the effectiveness of the state reduction mechanism in improving the performance and reducing the memory usage.
△ Less
Submitted 28 April, 2025;
originally announced April 2025.
-
Fast convolution solver based on far-field smooth approximation
Authors:
Xin Liu,
Yong Zhang
Abstract:
The convolution potential arises in a wide variety of application areas, and its efficient and accurate evaluation encounters three challenges: singularity, nonlocality and anisotropy. We introduce a fast algorithm based on a far-field smooth approximation of the kernel, where the bounded domain Fourier transform, one of the most essential difficulties, is well approximated by the whole space Four…
▽ More
The convolution potential arises in a wide variety of application areas, and its efficient and accurate evaluation encounters three challenges: singularity, nonlocality and anisotropy. We introduce a fast algorithm based on a far-field smooth approximation of the kernel, where the bounded domain Fourier transform, one of the most essential difficulties, is well approximated by the whole space Fourier transform which usually admits explicit formula. The convolution is split into a regular and singular integral, and they are well resolved by trapezoidal rule and Fourier spectral method respectively. The scheme is simplified to a discrete convolution and is implemented efficiently with Fast Fourier Transform (FFT). Importantly, the tensor generation procedure is quite simple, highly efficient and independent of the anisotropy strength. It is easy to implement and achieves spectral accuracy with nearly optimal efficiency and minimum memory requirement. Rigorous error estimates and extensive numerical investigations, together with a comprehensive comparison, showcase its superiorities for different kernels.
△ Less
Submitted 27 April, 2025;
originally announced April 2025.
-
Mean Curvature Flow for Isoparametric Submanifolds in Hyperbolic Spaces
Authors:
X. Liu,
W. Yang
Abstract:
Mean curvature flows of isoparametric submanifolds in Euclidean spaces and spheres have been studied by Liu and Terng in \cite{X.CT} and \cite{X.C}. In particular, it was proved that such flows always have ancient solutions. This is also true for mean curvature flows of isoparametric hypersurfaces in hyperbolic spaces by a result of Reis and Tenenblat in \cite{S.H.T}. In this paper, we study mean…
▽ More
Mean curvature flows of isoparametric submanifolds in Euclidean spaces and spheres have been studied by Liu and Terng in \cite{X.CT} and \cite{X.C}. In particular, it was proved that such flows always have ancient solutions. This is also true for mean curvature flows of isoparametric hypersurfaces in hyperbolic spaces by a result of Reis and Tenenblat in \cite{S.H.T}. In this paper, we study mean curvature flows of isoparametric submanifolds in hyperbolic spaces with arbitrary codimension. In particular, we will show that they always have ancient solutions and study their limiting behaviors.
△ Less
Submitted 22 April, 2025;
originally announced April 2025.
-
Convergence in natural parametrization of random walk frontier
Authors:
Yifan Gao,
Xinyi Li,
Runsheng Liu,
Xiangyi Liu,
Daisuke Shiraishi
Abstract:
In this paper, we show that the frontier of planar random walk converges weakly under natural parametrization to that of planar Brownian motion. As an intermediate result, we also show the convergence of the renormalized occupation measure.
In this paper, we show that the frontier of planar random walk converges weakly under natural parametrization to that of planar Brownian motion. As an intermediate result, we also show the convergence of the renormalized occupation measure.
△ Less
Submitted 18 April, 2025;
originally announced April 2025.
-
Whittaker modules for $U_q(\mathfrak{sl}_3)$
Authors:
Xiangqian Guo,
Xuewen Liu,
Limeng Xia
Abstract:
In this paper, we study the Whittaker modules for the quantum enveloping algebra $U_q(\sl_3)$ with respect to a fixed Whittaker function. We construct the universal Whittaker module, find all its Whittaker vectors and investigate the submodules generated by subsets of Whittaker vectors and corresponding quotient modules. We also find Whittaker vectors and determine the irreducibility of these quot…
▽ More
In this paper, we study the Whittaker modules for the quantum enveloping algebra $U_q(\sl_3)$ with respect to a fixed Whittaker function. We construct the universal Whittaker module, find all its Whittaker vectors and investigate the submodules generated by subsets of Whittaker vectors and corresponding quotient modules. We also find Whittaker vectors and determine the irreducibility of these quotient modules and show that they exhaust all irreducible Whittaker modules. Finally, we can determine all maximal submodules of the universal Whittaker module. The Whittaker model of $U_q(\sl_3)$ are quite different from that of $U_q(\sl_2)$ and finite-dimensional simple Lie algebras, since the center of our algebra is not a polynomial algebra.
△ Less
Submitted 14 April, 2025;
originally announced April 2025.
-
Distance signless Laplacian spectral radius and tough graphs involving minimun degree
Authors:
Xiangge Liu,
Yong Lu,
Caili Jia,
Qiannan Zhou,
Yue Cui
Abstract:
Let $G=(V(G),E(G))$ be a simple graph, where $V(G)$ and $E(G)$ are the vertex set and the edge set of $G$, respectively. The number of components of $G$ is denoted by $c(G)$. Let $t$ be a positive real number, and a connected graph $G$ is $t$-tough if $t c(G-S)\leq|S|$ for every vertex cut $S$ of $V(G)$. The toughness of graph $G$, denoted by $τ(G)$, is the largest value of $t$ for which $G$ is…
▽ More
Let $G=(V(G),E(G))$ be a simple graph, where $V(G)$ and $E(G)$ are the vertex set and the edge set of $G$, respectively. The number of components of $G$ is denoted by $c(G)$. Let $t$ be a positive real number, and a connected graph $G$ is $t$-tough if $t c(G-S)\leq|S|$ for every vertex cut $S$ of $V(G)$. The toughness of graph $G$, denoted by $τ(G)$, is the largest value of $t$ for which $G$ is $t$-tough. Recently, Fan, Lin and Lu [European J. Combin. 110(2023), 103701] presented sufficient conditions based on the spectral radius for graphs to be 1-tough with minimum degree $δ(G)$ and graphs to be $t$-tough with $t\geq 1$ being an integer, respectively. In this paper, we establish sufficient conditions in terms of the distance signless Laplacian spectral radius for graphs to be 1-tough with minimum degree $δ(G)$ and graphs to be $t$-tough, where $\frac{1}{t}$ is a positive integer. Moreover, we consider the relationship between the distance signless Laplacian spectral radius and $t$-tough graphs in terms of the order $n$.
△ Less
Submitted 11 April, 2025; v1 submitted 10 April, 2025;
originally announced April 2025.
-
Sparsified-Learning for Heavy-Tailed Locally Stationary Processes
Authors:
Yingjie Wang,
Mokhtar Z. Alaya,
Salim Bouzebda,
Xinsheng Liu
Abstract:
Sparsified Learning is ubiquitous in many machine learning tasks. It aims to regularize the objective function by adding a penalization term that considers the constraints made on the learned parameters. This paper considers the problem of learning heavy-tailed LSP. We develop a flexible and robust sparse learning framework capable of handling heavy-tailed data with locally stationary behavior and…
▽ More
Sparsified Learning is ubiquitous in many machine learning tasks. It aims to regularize the objective function by adding a penalization term that considers the constraints made on the learned parameters. This paper considers the problem of learning heavy-tailed LSP. We develop a flexible and robust sparse learning framework capable of handling heavy-tailed data with locally stationary behavior and propose concentration inequalities. We further provide non-asymptotic oracle inequalities for different types of sparsity, including $\ell_1$-norm and total variation penalization for the least square loss.
△ Less
Submitted 8 April, 2025;
originally announced April 2025.
-
Viscous pressureless flows with free boundary in one space dimension: The constant viscosity case
Authors:
Xin Liu
Abstract:
We establish the global well-posedness of the free boundary problem of the viscous pressureless and almost pressureless heat conductive flows in one space dimension. In both cases, arbitrarily large but smooth initial data is considered, and the evolving fluid domains remain bounded for all time. In the viscous pressureless case, we are able to identify the terminal flow domain in terms of the ini…
▽ More
We establish the global well-posedness of the free boundary problem of the viscous pressureless and almost pressureless heat conductive flows in one space dimension. In both cases, arbitrarily large but smooth initial data is considered, and the evolving fluid domains remain bounded for all time. In the viscous pressureless case, we are able to identify the terminal flow domain in terms of the initial data. In the viscous almost pressureless case, we construct the flow as a perturbation of the viscous pressureless flow, and establish the first result for the Navier-Stokes-Fourier system in the current setting.
△ Less
Submitted 7 April, 2025;
originally announced April 2025.
-
Strict Hölder equivalence of self-similar sets
Authors:
Yanfang Zhang,
Xinhui Liu
Abstract:
The study of Lipschitz equivalence of fractals is a very active topic in recent years. It is natural to ask when two fractal sets are strictly Hölder equivalent. In the present paper, we completely characterize the strict Hölder equivalence for two classes of self-similar sets: the first class is totally-disconnected fractal cubes and the second class is self-similar sets with two branches which s…
▽ More
The study of Lipschitz equivalence of fractals is a very active topic in recent years. It is natural to ask when two fractal sets are strictly Hölder equivalent. In the present paper, we completely characterize the strict Hölder equivalence for two classes of self-similar sets: the first class is totally-disconnected fractal cubes and the second class is self-similar sets with two branches which satisfy the strong separation condition.
△ Less
Submitted 5 April, 2025;
originally announced April 2025.
-
The growth of transcendental entire solutions of linear difference equations with polynomial coefficients
Authors:
Xiong-Feng Liu,
Zhi-Tao Wen,
Can-Xin Zhu
Abstract:
In this paper, we study the growth of transcendental entire solutions of linear difference equations
\begin{equation}
P_m(z)Δ^mf(z)+\cdots+P_1(z)Δf(z)+P_0(z)f(z)=0,\tag{+}
\end{equation} where $P_j(z)$ are polynomials for $j=0,\ldots,m$. At first, we reveal type of binomial series in terms of its coefficients. Second, we give a list of all possible orders, which are less than 1, and types of…
▽ More
In this paper, we study the growth of transcendental entire solutions of linear difference equations
\begin{equation}
P_m(z)Δ^mf(z)+\cdots+P_1(z)Δf(z)+P_0(z)f(z)=0,\tag{+}
\end{equation} where $P_j(z)$ are polynomials for $j=0,\ldots,m$. At first, we reveal type of binomial series in terms of its coefficients. Second, we give a list of all possible orders, which are less than 1, and types of transcendental entire solutions of linear difference equations $(+)$. In particular, we give so far the best precise growth estimate of transcendental entire solutions of order less than 1 of $(+)$, which improves results in [3, 4], [5], [7]. Third, for any given rational number $ρ\in(0,1)$ and real number $σ\in(0,\infty)$, we can construct a linear difference equation with polynomial coefficients which has a transcendental entire solution of order $ρ$ and type $σ$. At last, some examples are illustrated for our main theorem.
△ Less
Submitted 2 April, 2025;
originally announced April 2025.
-
A note on the cross matrices
Authors:
Xiaobo Liu
Abstract:
A cross matrix $X$ can have nonzero elements located only on the main diagonal and the anti-diagonal, so that the sparsity pattern has the shape of a cross. It is shown that $X$ can be factorized into products of matrices that are at most rank-two perturbations to the identity matrix and can be symmetrically permuted to block diagonal form with $2\times 2$ diagonal blocks and, if $n$ is odd, a…
▽ More
A cross matrix $X$ can have nonzero elements located only on the main diagonal and the anti-diagonal, so that the sparsity pattern has the shape of a cross. It is shown that $X$ can be factorized into products of matrices that are at most rank-two perturbations to the identity matrix and can be symmetrically permuted to block diagonal form with $2\times 2$ diagonal blocks and, if $n$ is odd, a $1\times 1$ diagonal block. The permutation similarity implies that any well-defined analytic function of $X$ remains a cross matrix. By exploiting these properties, explicit formulae for the determinant, inverse, and characteristic polynomial are derived. It is also shown that the structure of cross matrix can be preserved under matrix factorizations, including the LU, QR, and SVD decompositions.
△ Less
Submitted 31 March, 2025;
originally announced April 2025.
-
Differentially Private Joint Independence Test
Authors:
Xingwei Liu,
Yuexin Chen,
Wangli Xu
Abstract:
Identification of joint dependence among more than two random vectors plays an important role in many statistical applications, where the data may contain sensitive or confidential information. In this paper, we consider the the $d$-variable Hilbert-Schmidt independence criterion (dHSIC) in the context of differential privacy. Given the limiting distribution of the empirical estimate of dHSIC is c…
▽ More
Identification of joint dependence among more than two random vectors plays an important role in many statistical applications, where the data may contain sensitive or confidential information. In this paper, we consider the the $d$-variable Hilbert-Schmidt independence criterion (dHSIC) in the context of differential privacy. Given the limiting distribution of the empirical estimate of dHSIC is complicated Gaussian chaos, constructing tests in the non-privacy regime is typically based on permutation and bootstrap. To detect joint dependence in privacy, we propose a dHSIC-based testing procedure by employing a differentially private permutation methodology. Our method enjoys privacy guarantee, valid level and pointwise consistency, while the bootstrap counterpart suffers inconsistent power. We further investigate the uniform power of the proposed test in dHSIC metric and $L_2$ metric, indicating that the proposed test attains the minimax optimal power across different privacy regimes. As a byproduct, our results also contain the pointwise and uniform power of the non-private permutation dHSIC, addressing an unsolved question remained in Pfister et al. (2018). Both numerical simulations and real data analysis on causal inference suggest our proposed test performs well empirically.
△ Less
Submitted 8 April, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
-
Optimization over Trained Neural Networks: Difference-of-Convex Algorithm and Application to Data Center Scheduling
Authors:
Xinwei Liu,
Vladimir Dvorkin
Abstract:
When solving decision-making problems with mathematical optimization, some constraints or objectives may lack analytic expressions but can be approximated from data. When an approximation is made by neural networks, the underlying problem becomes optimization over trained neural networks. Despite recent improvements with cutting planes, relaxations, and heuristics, the problem remains difficult to…
▽ More
When solving decision-making problems with mathematical optimization, some constraints or objectives may lack analytic expressions but can be approximated from data. When an approximation is made by neural networks, the underlying problem becomes optimization over trained neural networks. Despite recent improvements with cutting planes, relaxations, and heuristics, the problem remains difficult to solve in practice. We propose a new solution based on a bilinear problem reformulation that penalizes ReLU constraints in the objective function. This reformulation makes the problem amenable to efficient difference-of-convex algorithms (DCA), for which we propose a principled approach to penalty selection that facilitates convergence to stationary points of the original problem. We apply the DCA to the problem of the least-cost allocation of data center electricity demand in a power grid, reporting significant savings in congested cases.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
A central limit theorem and its application to the limiting distribution of volatility target index
Authors:
Xuan Liu,
Michel Gauthier
Abstract:
We study the limiting distribution of a volatility target index as the discretisation time step converges to zero. Two limit theorems (a strong law of large numbers and a central limit theorem) are established, and as an application, the exact limiting distribution is derived. We demonstrate that the volatility of the limiting distribution is consistently larger than the target volatility, and con…
▽ More
We study the limiting distribution of a volatility target index as the discretisation time step converges to zero. Two limit theorems (a strong law of large numbers and a central limit theorem) are established, and as an application, the exact limiting distribution is derived. We demonstrate that the volatility of the limiting distribution is consistently larger than the target volatility, and converges to the target volatility as the observation-window parameter $λ$ in the definition of the realised variance converges to $1$. Besides the exact formula for the drift and the volatility of the limiting distribution, their upper and lower bounds are derived. As a corollary of the exact limiting distribution, we obtain a vega conversion formula which converts the rho sensitivity of a financial derivative on the limiting diffusion to the vega sensitivity of the same financial derivative on the underlying of the volatility target index.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
Optimization-based method for conjugate heat transfer problems
Authors:
Liang Fang,
Xiandong Liu,
Lei Zhang
Abstract:
We propose a numerical approach for solving conjugate heat transfer problems using the finite volume method. This approach combines a semi-implicit scheme for fluid flow, governed by the incompressible Navier-Stokes equations, with an optimization-based approach for heat transfer across the fluid-solid interface. In the semi-implicit method, the convective term in the momentum equation is treated…
▽ More
We propose a numerical approach for solving conjugate heat transfer problems using the finite volume method. This approach combines a semi-implicit scheme for fluid flow, governed by the incompressible Navier-Stokes equations, with an optimization-based approach for heat transfer across the fluid-solid interface. In the semi-implicit method, the convective term in the momentum equation is treated explicitly, ensuring computational efficiency, while maintaining stability when a CFL condition involving fluid velocity is satisfied. Heat exchange between the fluid and solid domains is formulated as a constrained optimization problem, which is efficiently solved using a sequential quadratic programming method. Numerical results are presented to demonstrate the effectiveness and performance of the proposed approach.
△ Less
Submitted 16 March, 2025;
originally announced March 2025.
-
The second Dirichlet eigenvalue is simple on every non-equilateral triangle
Authors:
Ryoki Endo,
Xuefeng Liu
Abstract:
The Dirichlet eigenvalues of the Laplacian on a triangle that collapses into a line segment diverge to infinity. In this paper, to track the behavior of the eigenvalues during the collapsing process of a triangle, we establish a quantitative error estimate for the Dirichlet eigenvalues on collapsing triangles. As an application, we solve the open problem concerning the simplicity of the second Dir…
▽ More
The Dirichlet eigenvalues of the Laplacian on a triangle that collapses into a line segment diverge to infinity. In this paper, to track the behavior of the eigenvalues during the collapsing process of a triangle, we establish a quantitative error estimate for the Dirichlet eigenvalues on collapsing triangles. As an application, we solve the open problem concerning the simplicity of the second Dirichlet eigenvalue for nearly degenerate triangles, offering a complete solution to Conjecture 6.47 posed by R. Laugesen and B. Siudeja in A. Henrot's book ``Shape Optimization and Spectral Theory".
△ Less
Submitted 29 March, 2025; v1 submitted 9 March, 2025;
originally announced March 2025.
-
A quantitative sampling method for elastic and electromagnetic sources
Authors:
Xiaodong Liu,
Qingxiang Shi
Abstract:
This work is dedicated to a novel sampling method for accurately reconstructing elastic and electromagnetic sources from the far field patterns. We show that the proposed indicators in the form of integrals with full far field patterns are exactly the source functions. These facts not only give constructive uniqueness proofs of the inverse source problems, but also establish the theoretical basis…
▽ More
This work is dedicated to a novel sampling method for accurately reconstructing elastic and electromagnetic sources from the far field patterns. We show that the proposed indicators in the form of integrals with full far field patterns are exactly the source functions. These facts not only give constructive uniqueness proofs of the inverse source problems, but also establish the theoretical basis of the proposed sampling methods. Furthermore, we derive the stability estimates for the corresponding discrete indicators using the far field patterns with finitely many observations and frequencies. We have also proposed the indicators with partial far field patterns and proved their validity for providing the derivative information of the unknown sources. Numerical examples are presented to verify the accuracy and stability of the proposed quantitative sampling method.
△ Less
Submitted 9 March, 2025;
originally announced March 2025.
-
Identifying point sources for biharmonic wave equation from the scattered fields at sparse sensors
Authors:
Xiaodong Liu,
Qingxiang Shi,
Jing Wang
Abstract:
This work is dedicated to uniqueness and numerical algorithms for determining the point sources of the biharmonic wave equation using scattered fields at sparse sensors. We first show that the point sources in both $\mathbb{R}^2$ and $\mathbb{R}^3$ can be uniquely determined from the multifrequency sparse scattered fields. In particular, to deal with the challenges arising from the fundamental sol…
▽ More
This work is dedicated to uniqueness and numerical algorithms for determining the point sources of the biharmonic wave equation using scattered fields at sparse sensors. We first show that the point sources in both $\mathbb{R}^2$ and $\mathbb{R}^3$ can be uniquely determined from the multifrequency sparse scattered fields. In particular, to deal with the challenges arising from the fundamental solution of the biharmonic wave equation in $\mathbb{R}^2$, we present an innovative approach that leverages the Fourier transform and Funk-Hecke formula. Such a technique can also be applied for identifying the point sources of the Helmholtz equation. Moreover, we present the uniqueness results for identifying multiple point sources in $\mathbb{R}^3$ from the scattered fields at sparse sensors with finitely many frequencies. Based on the constructive uniqueness proofs, we propose three numerical algorithms for identifying the point sources by using multifrequency sparse scattered fields. The numerical experiments are presented to verify the effectiveness and robustness of the algorithms.
△ Less
Submitted 9 March, 2025;
originally announced March 2025.
-
A Graph-Partitioning Based Continuous Optimization Approach to Semi-supervised Clustering Problems
Authors:
Wei Liu,
Xin Liu,
Michael K. Ng,
Zaikun Zhang
Abstract:
Semi-supervised clustering is a basic problem in various applications. Most existing methods require knowledge of the ideal cluster number, which is often difficult to obtain in practice. Besides, satisfying the must-link constraints is another major challenge for these methods. In this work, we view the semi-supervised clustering task as a partitioning problem on a graph associated with the given…
▽ More
Semi-supervised clustering is a basic problem in various applications. Most existing methods require knowledge of the ideal cluster number, which is often difficult to obtain in practice. Besides, satisfying the must-link constraints is another major challenge for these methods. In this work, we view the semi-supervised clustering task as a partitioning problem on a graph associated with the given dataset, where the similarity matrix includes a scaling parameter to reflect the must-link constraints. Utilizing a relaxation technique, we formulate the graph partitioning problem into a continuous optimization model that does not require the exact cluster number, but only an overestimate of it. We then propose a block coordinate descent algorithm to efficiently solve this model, and establish its convergence result. Based on the obtained solution, we can construct the clusters that theoretically meet the must-link constraints under mild assumptions. Furthermore, we verify the effectiveness and efficiency of our proposed method through comprehensive numerical experiments.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Mixed-precision algorithms for solving the Sylvester matrix equation
Authors:
Andrii Dmytryshyn,
Massimiliano Fasi,
Nicholas J. Higham,
Xiaobo Liu
Abstract:
We consider the solution of the general Sylvester equation $AX+XB=C$ in mixed precision. First, we investigate the use of GMRES-based iterative refinement (GMRES-IR) to solve the equation using implicitly its Kronecker product form: we propose an efficient scheme to use the Schur factors of the coefficient matrices as preconditioners, but we demonstrate that this approach is not suitable in the ca…
▽ More
We consider the solution of the general Sylvester equation $AX+XB=C$ in mixed precision. First, we investigate the use of GMRES-based iterative refinement (GMRES-IR) to solve the equation using implicitly its Kronecker product form: we propose an efficient scheme to use the Schur factors of the coefficient matrices as preconditioners, but we demonstrate that this approach is not suitable in the case of the Sylvester equation. By revisiting a stationary iteration for linear systems, we therefore derive a new iterative refinement scheme for the quasi-triangular Sylvester equation, and our rounding error analysis provides sufficient conditions for convergence and a bound on the attainable relative residual. We leverage this iterative scheme to solve the general Sylvester equation in mixed precision. The new algorithms compute the Schur decomposition of the matrix coefficients in low precision, use the low-precision Schur factors to obtain an approximate solution to the quasi-triangular equation, and iteratively refine it to obtain a working-precision solution to the quasi-triangular equation. However, being only orthonormal to low precision, the unitary Schur factors of $A$ and $B$ cannot be used to recover the solution to the original equation. We propose two effective approaches to address this issue: one is based on re-orthonormalization in the working precision, and the other on explicit inversion of the almost-unitary factors. We test these mixed-precision algorithms on various Sylvester and Lyapunov equations from the literature. Our numerical experiments show that, for both classes of equations, the new algorithms are at least as accurate as existing ones. Our cost analysis, on the other hand, suggests that they would typically be faster than mono-precision alternatives if implemented on hardware that natively supports low precision.
△ Less
Submitted 5 March, 2025;
originally announced March 2025.
-
A Local Version of Hardy-type Spaces Associated with Ball Quasi-Banach Spaces and Non-negative Self-adjoint Operators on Spaces of Homogeneous Type and Their Applications
Authors:
Xiong Liu,
Wenhua Wang,
Tiantian Zhao
Abstract:
Let $(\mathbb{X},\,d,\,μ)$ be a space of homogeneous type in the sense of Coifman and Weiss, $X$ be a ball quasi-Banach function space on $\mathbb{X}$, $L$ be a non-negative self-adjoint operator on $L^2(\mathbb{X})$, and assume that, for all $t>0$, the semigroup $e^{-tL}$ has an integral representation whose kernel satisfies a Gaussian upper bound condition. In this paper, we first study a local…
▽ More
Let $(\mathbb{X},\,d,\,μ)$ be a space of homogeneous type in the sense of Coifman and Weiss, $X$ be a ball quasi-Banach function space on $\mathbb{X}$, $L$ be a non-negative self-adjoint operator on $L^2(\mathbb{X})$, and assume that, for all $t>0$, the semigroup $e^{-tL}$ has an integral representation whose kernel satisfies a Gaussian upper bound condition. In this paper, we first study a local version of Hardy space $h^{X}_L(\mathbb{X})$ associated with ball quasi-Banach space $X$ and non-negative self-adjoint operator $L$, which is an extension of Goldberg's result [Duke Math. J. {\bf46} (1979), no. 1, 27-42; MR0523600]. Even in the case of Euclidean space (that is, $\mathbb{X}=\mathbb{R}^d$), all of these results are still new.
△ Less
Submitted 5 March, 2025;
originally announced March 2025.
-
The Distributionally Robust Optimization Model of Sparse Principal Component Analysis
Authors:
Lei Wang,
Xin Liu,
Xiaojun Chen
Abstract:
We consider sparse principal component analysis (PCA) under a stochastic setting where the underlying probability distribution of the random parameter is uncertain. This problem is formulated as a distributionally robust optimization (DRO) model based on a constructive approach to capturing uncertainty in the covariance matrix, which constitutes a nonsmooth constrained min-max optimization problem…
▽ More
We consider sparse principal component analysis (PCA) under a stochastic setting where the underlying probability distribution of the random parameter is uncertain. This problem is formulated as a distributionally robust optimization (DRO) model based on a constructive approach to capturing uncertainty in the covariance matrix, which constitutes a nonsmooth constrained min-max optimization problem. We further prove that the inner maximization problem admits a closed-form solution, reformulating the original DRO model into an equivalent minimization problem on the Stiefel manifold. This transformation leads to a Riemannian optimization problem with intricate nonsmooth terms, a challenging formulation beyond the reach of existing algorithms. To address this issue, we devise an efficient smoothing manifold proximal gradient algorithm. We prove the Riemannian gradient consistency and global convergence of our algorithm to a stationary point of the nonsmooth minimization problem. Moreover, we establish the iteration complexity of our algorithm. Finally, numerical experiments are conducted to validate the effectiveness and scalability of our algorithm, as well as to highlight the necessity and rationality of adopting the DRO model for sparse PCA.
△ Less
Submitted 4 March, 2025;
originally announced March 2025.
-
A unified recursive identification algorithm with quantized observations based on weighted least-squares type criteria
Authors:
Xingrui Liu,
Ying Wang,
Yanlong Zhao
Abstract:
This paper investigates system identification problems with Gaussian inputs and quantized observations under fixed thresholds. A new formulation for the predictor of quantized observations is introduced, establishing a linear correlation with the parameter estimations through a probabilistic relationship among quantized observations, Gaussian inputs, and system parameters. Subsequently, a novel we…
▽ More
This paper investigates system identification problems with Gaussian inputs and quantized observations under fixed thresholds. A new formulation for the predictor of quantized observations is introduced, establishing a linear correlation with the parameter estimations through a probabilistic relationship among quantized observations, Gaussian inputs, and system parameters. Subsequently, a novel weighted least-squares criterion is proposed, and a two-step recursive identification algorithm is constructed, which is capable of addressing both noisy and noise-free linear systems. Convergence analysis of this identification algorithm is conducted, demonstrating convergence in both almost sure and $L^{p}$ senses under mild conditions, with respective rates of $O(\sqrt{ \log \log k/k})$ and $O(1/k^{p/2})$, where $k$ denotes the time step. In particular, this algorithm offers an asymptotically efficient estimation of the variance of Gaussian variables using quantized observations. Additionally, asymptotic normality is established, and an expression for the asymptotic variance is provided when the weight coefficients are properly selected. Furthermore, extensions to output-error systems are discussed, enhancing the applicability and relevance of the proposed methods. Two numerical examples are provided to validate these theoretical advancements.
△ Less
Submitted 27 February, 2025;
originally announced February 2025.
-
Splitting finite element approximations for quasi-static electroporoelasticity equations
Authors:
Xuan Liu,
Yongkui Zou,
Ran Zhang,
Yanzhao Cao,
Amnon J. Meir
Abstract:
The electroporoelasticity model, which couples Maxwell's equations with Biot's equations, plays a critical role in applications such as water conservancy exploration, earthquake early warning, and various other fields. This work focuses on investigating its well-posedness and analyzing error estimates for a splitting backward Euler finite element method. We first define a weak solution consistent…
▽ More
The electroporoelasticity model, which couples Maxwell's equations with Biot's equations, plays a critical role in applications such as water conservancy exploration, earthquake early warning, and various other fields. This work focuses on investigating its well-posedness and analyzing error estimates for a splitting backward Euler finite element method. We first define a weak solution consistent with the finite element framework. Then, we prove the uniqueness and existence of such a solution using the Galerkin method and derive a priori estimates for high-order regularity. Using a splitting technique, we define an approximate splitting solution and analyze its convergence order. Next, we apply Nedelec's curl-conforming finite elements, Lagrange elements, and the backward Euler method to construct a fully discretized scheme. We demonstrate the stability of the splitting numerical solution and provide error estimates for its convergence order in both temporal and spatial variables. Finally, we present numerical experiments to validate the theoretical results, showing that our method significantly reduces computational complexity compared to the classical finite element method.
△ Less
Submitted 23 February, 2025;
originally announced February 2025.
-
Long-time asymptotics of the KdV equation with delta function initial profile
Authors:
Xuliang Liu,
Deng-Shan Wang
Abstract:
This work investigates the long-time asymptotic behaviors of the solution to the KdV equation with delta function initial profiles in different regions, employing the Riemann-Hilbert formulation and Deift-Zhou nonlinear steepest descent method. When the initial value is a delta potential well, the asymptotic solution is predominantly dominated by a single soliton in certain region for $x>0$, while…
▽ More
This work investigates the long-time asymptotic behaviors of the solution to the KdV equation with delta function initial profiles in different regions, employing the Riemann-Hilbert formulation and Deift-Zhou nonlinear steepest descent method. When the initial value is a delta potential well, the asymptotic solution is predominantly dominated by a single soliton in certain region for $x>0$, while in other regions, the dispersive tails including self-similar region, collisionless shock region and dispersive wave region, play a more significant role. Conversely, when the initial value is a delta potential barrier, the soliton region is absent, although the dispersive tails still persist. Moreover, the general delta function initial profile with $L$-spikes is also studied and it is proved that one to $L$ solitons will be generated in soliton region, which depends on the sizes of the distance and height of the spikes. The leading-order terms of the solution in each region are derived, highlighting the efficacy of the Riemann-Hilbert formulation in elucidating the long-time behaviors of integrable systems.
△ Less
Submitted 28 March, 2025; v1 submitted 22 February, 2025;
originally announced February 2025.