-
Judicious Partitions in Edge-Weighted Graphs with Bounded Maximum Weighted Degree
Authors:
G. Gutin,
M. A. Nielsen,
A. Yeo,
Y. Zhou
Abstract:
In this paper, we investigate bounds for the following judicious $k$-partitioning problem: Given an edge-weighted graph $G$, find a $k$-partition $(V_1,V_2,\dots ,V_k)$ of $V(G)$ such that the total weight of edges in the heaviest induced subgraph, $\max_{i=1}^k w(G[V_i])$, is minimized. In our bounds, we also take into account the weight $w(V_1,V_2,\dots,V_k)$ of the cut induced by the partition…
▽ More
In this paper, we investigate bounds for the following judicious $k$-partitioning problem: Given an edge-weighted graph $G$, find a $k$-partition $(V_1,V_2,\dots ,V_k)$ of $V(G)$ such that the total weight of edges in the heaviest induced subgraph, $\max_{i=1}^k w(G[V_i])$, is minimized. In our bounds, we also take into account the weight $w(V_1,V_2,\dots,V_k)$ of the cut induced by the partition (i.e., the total weight of edges with endpoints in different parts) and show the existence of a partition satisfying tight bounds for both quantities simultaneously. We establish such tight bounds for the case $k=2$ and, to the best of our knowledge, present the first (even for unweighted graphs) completely tight bound for $k=3$. We also show that, in general, these results cannot be extended to $k \geq 4$ without introducing an additional lower-order term, and we propose a corresponding conjecture. Moreover, we prove that there always exists a $k$-partition satisfying $\max \left\{ w(G[V_i]) : i \in [k] \right\} \leq \frac{w(G)}{k^2} + \frac{k - 1}{2k^2} Δ_w(G),$ where $Δ_w(G)$ denotes the maximum weighted degree of $G$. This bound is tight for every integer $k\geq 2$.
△ Less
Submitted 8 July, 2025;
originally announced July 2025.
-
On the well-posedness of time-space fractional Schrödinger equation on $\mathbb{R}^{d}$
Authors:
Yong Zhen Yang,
Yong Zhou
Abstract:
This paper considers the well-posedness of a class of time-space fractional Schrödinger equations introduced by Naber. In contrast to the classical Schrödinger equation, the solution operator here exhibits derivative loss and lacks the structure of a semigroup, which makes the classical Strichartz estimates inapplicable. By using harmonic analysis tools -- including the smoothing effect theory of…
▽ More
This paper considers the well-posedness of a class of time-space fractional Schrödinger equations introduced by Naber. In contrast to the classical Schrödinger equation, the solution operator here exhibits derivative loss and lacks the structure of a semigroup, which makes the classical Strichartz estimates inapplicable. By using harmonic analysis tools -- including the smoothing effect theory of Kenig and Ponce for Korteweg-de Vries equations \cite[\emph{Commun.~Pure Appl.~Math.}]{Kenig}, real interpolation techniques, and the Van der Corput lemma -- we establish novel dispersive estimates for the solution operator. These estimates generalize Ponce's regularity results \cite[\emph{J.~Funct.~Anal.}]{Ponce} for oscillatory integrals and enable us to address the derivative loss in the Schrödinger kernel. For the cases $β<2$~(in one space dimension) and $β>2$~(in higher dimensions), we prove local and global well-posedness in Sobolev and Lorentz-type spaces, respectively. Additionally, we analyze the asymptotic behavior of solutions and demonstrate the existence of self-similar solutions under homogeneous initial data. The results highlight the interplay between fractional derivatives, dispersive properties, and nonlinear dynamics, extending the understanding of nonlocal evolution equations in quantum mechanics and related fields.
△ Less
Submitted 6 July, 2025;
originally announced July 2025.
-
Local/global well-posedness analysis of time-space fractional Schrödinger equation on $\mathbb{R}^{d}$
Authors:
Yong Zhen Yang,
Yong Zhou
Abstract:
Based on the $φ(-Δ)$-type operator studied by Kim \cite[\emph{Adv. Math.}]{Kim2}, where $φ$ is the Bernstein function, this paper investigates a class of nonlinear time-space fractional Schrödinger equations that exhibit nonlocal effects in both time and space. The time part is derived from the model proposed by Narahari Achar, and the space part is a $φ(-Δ)$-type operator. Due to nonlocal effects…
▽ More
Based on the $φ(-Δ)$-type operator studied by Kim \cite[\emph{Adv. Math.}]{Kim2}, where $φ$ is the Bernstein function, this paper investigates a class of nonlinear time-space fractional Schrödinger equations that exhibit nonlocal effects in both time and space. The time part is derived from the model proposed by Narahari Achar, and the space part is a $φ(-Δ)$-type operator. Due to nonlocal effects, this invalidates the classical Strichartz estimate. Combining the asymptotic behavior of Mittag-Leffler functions, Hörmander multiplier theory and other methods of harmonic analysis, we establish the Gagliardo-Nirenberg inequality in the $φ$-Triebel-Lizorkin space studied by Mikulevičius \cite[\emph{Potential Anal.}]{Mikulevicius} and obtain some Sobolev estimates for the solution operator, thus establishing the global/local well-posedness of the equations in some Banach space. In particular, our results are complementary to those of Su \cite[\emph{J. Math. Anal. Appl.}]{Su}, and the methods are quite different.
△ Less
Submitted 6 July, 2025;
originally announced July 2025.
-
Muckenhoupt-weighted $L_q(L_p)$ boundedness for time-space fractional nonlocal operators
Authors:
Yong Zhen Yang,
Yong Zhou
Abstract:
Based on the $φ(Δ)$-type operator studied by Kim \cite[\emph{Adv.~Math.}]{Kim2}, where $φ$ is a Bernstein function, we establish weighted $L_{q}(L_{p})$ estimates for solutions to the following fractional evolution equation: $$ \partial_{t}^αw(t,x) = φ(Δ)w(t,x) + h(t,x), \quad t > 0, \; x \in \mathbb{R}^{d}, $$ where $\partial_{t}^α$ denotes the Caputo derivative of $0 < α< 1$. To be specific, for…
▽ More
Based on the $φ(Δ)$-type operator studied by Kim \cite[\emph{Adv.~Math.}]{Kim2}, where $φ$ is a Bernstein function, we establish weighted $L_{q}(L_{p})$ estimates for solutions to the following fractional evolution equation: $$ \partial_{t}^αw(t,x) = φ(Δ)w(t,x) + h(t,x), \quad t > 0, \; x \in \mathbb{R}^{d}, $$ where $\partial_{t}^α$ denotes the Caputo derivative of $0 < α< 1$. To be specific, for all $1 < p, q < \infty$, we demonstrate that $$ \int_{0}^{\infty} \left( \int_{\mathbb{R}^{d}} \left| φ(Δ)w \right|^{p} μ_{1}(x) \, dx \right)^{\frac{q}{p}} μ_{2}(t) \, dt \leq C \int_{0}^{\infty} \left( \int_{\mathbb{R}^{d}} |h|^{p} μ_{1}(x) \, dx \right)^{\frac{q}{p}} μ_{2}(t) \, dt, $$ where $μ_{1}(x) \in A_{p}(\mathbb{R}^{d})$ and $μ_{2}(t) \in A_{q}(\mathbb{R})$ are \emph{Muckenhoupt} weights.~Our proof relies on harmonic analysis techniques, using fundamental tools including the \emph{Fefferman-Stein} inequality and \emph{Hardy-Littlewood} maximal estimates in weighted $L_q(L_p)$ spaces, and \emph{sharp function} estimates for solution operators. In particular, our results extend the work of Han and Kim (2020, J. Differ. Equ.,269:3515-3550) and complement the work of Dong (2023, Calc. Var. Partial Differ. Equ., 62:96).
△ Less
Submitted 2 July, 2025;
originally announced July 2025.
-
Physical Space Proof of Bilinear Estimates and Applications to Nonlinear Dispersive Equations (II)
Authors:
Xinfeng Hu,
Li Tu,
Yi Zhou
Abstract:
We study the Zakharov system in two and three spatial dimensions, reproducing the optimal local well-posedness results from Bejenaru-Herr-Holmer-Tataru [2] and Bejenaru-Herr [1]. The main tools are similar to [16], based on a bilinear estimate, which is proved in a physical space approach by a new type of div-curl lemma. The new ingredient of our proof is a Strichartz estimate with mixed spatial i…
▽ More
We study the Zakharov system in two and three spatial dimensions, reproducing the optimal local well-posedness results from Bejenaru-Herr-Holmer-Tataru [2] and Bejenaru-Herr [1]. The main tools are similar to [16], based on a bilinear estimate, which is proved in a physical space approach by a new type of div-curl lemma. The new ingredient of our proof is a Strichartz estimate with mixed spatial integrability.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
Data-Driven Exploration for a Class of Continuous-Time Linear--Quadratic Reinforcement Learning Problems
Authors:
Yilie Huang,
Xun Yu Zhou
Abstract:
We study reinforcement learning (RL) for the same class of continuous-time stochastic linear--quadratic (LQ) control problems as in \cite{huang2024sublinear}, where volatilities depend on both states and controls while states are scalar-valued and running control rewards are absent. We propose a model-free, data-driven exploration mechanism that adaptively adjusts entropy regularization by the cri…
▽ More
We study reinforcement learning (RL) for the same class of continuous-time stochastic linear--quadratic (LQ) control problems as in \cite{huang2024sublinear}, where volatilities depend on both states and controls while states are scalar-valued and running control rewards are absent. We propose a model-free, data-driven exploration mechanism that adaptively adjusts entropy regularization by the critic and policy variance by the actor. Unlike the constant or deterministic exploration schedules employed in \cite{huang2024sublinear}, which require extensive tuning for implementations and ignore learning progresses during iterations, our adaptive exploratory approach boosts learning efficiency with minimal tuning. Despite its flexibility, our method achieves a sublinear regret bound that matches the best-known model-free results for this class of LQ problems, which were previously derived only with fixed exploration schedules. Numerical experiments demonstrate that adaptive explorations accelerate convergence and improve regret performance compared to the non-adaptive model-free and model-based counterparts.
△ Less
Submitted 30 June, 2025;
originally announced July 2025.
-
Faster Diffusion Models via Higher-Order Approximation
Authors:
Gen Li,
Yuchen Zhou,
Yuting Wei,
Yuxin Chen
Abstract:
In this paper, we explore provable acceleration of diffusion models without any additional retraining. Focusing on the task of approximating a target data distribution in $\mathbb{R}^d$ to within $\varepsilon$ total-variation distance, we propose a principled, training-free sampling algorithm that requires only the order of
$$ d^{1+2/K} \varepsilon^{-1/K} $$
score function evaluations (up to l…
▽ More
In this paper, we explore provable acceleration of diffusion models without any additional retraining. Focusing on the task of approximating a target data distribution in $\mathbb{R}^d$ to within $\varepsilon$ total-variation distance, we propose a principled, training-free sampling algorithm that requires only the order of
$$ d^{1+2/K} \varepsilon^{-1/K} $$
score function evaluations (up to log factor) in the presence of accurate scores, where $K$ is an arbitrarily large fixed integer. This result applies to a broad class of target data distributions, without the need for assumptions such as smoothness or log-concavity. Our theory is robust vis-a-vis inexact score estimation, degrading gracefully as the score estimation error increases -- without demanding higher-order smoothness on the score estimates as assumed in previous work. The proposed algorithm draws insight from high-order ODE solvers, leveraging high-order Lagrange interpolation and successive refinement to approximate the integral derived from the probability flow ODE.
△ Less
Submitted 30 June, 2025;
originally announced June 2025.
-
StructMG: A Fast and Scalable Structured Algebraic Multigrid
Authors:
Yi Zong,
Peinan Yu,
Haopeng Huang,
Zhengding Hu,
Xinliang Wang,
Qin Wang,
Chensong Zhang,
Xiaowen Xu,
Jian Sun,
Yongxiao Zhou,
Wei Xue
Abstract:
Parallel multigrid is widely used as preconditioners in solving large-scale sparse linear systems. However, the current multigrid library still needs more satisfactory performance for structured grid problems regarding speed and scalability. Based on the classical 'multigrid seesaw', we derive three necessary principles for an efficient structured multigrid, which instructs our design and implemen…
▽ More
Parallel multigrid is widely used as preconditioners in solving large-scale sparse linear systems. However, the current multigrid library still needs more satisfactory performance for structured grid problems regarding speed and scalability. Based on the classical 'multigrid seesaw', we derive three necessary principles for an efficient structured multigrid, which instructs our design and implementation of StructMG, a fast and scalable algebraic multigrid that constructs hierarchical grids automatically. As a preconditioner, StructMG can achieve both low cost per iteration and good convergence when solving large-scale linear systems with iterative methods in parallel. A stencil-based triple-matrix product via symbolic derivation and code generation is proposed for multi-dimensional Galerkin coarsening to reduce grid complexity, operator complexity, and implementation effort. A unified parallel framework of sparse triangular solver is presented to achieve fast convergence and high parallel efficiency for smoothers, including dependence-preserving Gauss-Seidel and incomplete LU methods. Idealized and real-world problems from radiation hydrodynamics, petroleum reservoir simulation, numerical weather prediction, and solid mechanics, are evaluated on ARM and X86 platforms to show StructMG's effectiveness. In comparison to \textit{hypre}'s structured and general multigrid preconditioners, StructMG achieves the fastest time-to-solutions in all cases with average speedups of 15.5x, 5.5x, 6.7x, 7.3x over SMG, PFMG, SysPFMG, and BoomerAMG, respectively. StructMG also significantly improves strong and weak scaling efficiencies.
△ Less
Submitted 27 June, 2025;
originally announced June 2025.
-
A Zeroth-Order Extra-Gradient Method For Black-Box Constrained Optimization
Authors:
Yuke Zhou,
Ruiyang Jin,
Siyang Gao,
Jianxiao Wang,
Jie Song
Abstract:
Non-analytical objectives and constraints often arise in control systems, particularly in problems with complex dynamics, which are challenging yet lack efficient solution methods. In this work, we consider general constrained optimization problems involving black-box objectives and constraints. To solve it, we reformulate it as a min-max problem and propose a zeroth-order extra gradient (ZOEG) al…
▽ More
Non-analytical objectives and constraints often arise in control systems, particularly in problems with complex dynamics, which are challenging yet lack efficient solution methods. In this work, we consider general constrained optimization problems involving black-box objectives and constraints. To solve it, we reformulate it as a min-max problem and propose a zeroth-order extra gradient (ZOEG) algorithm that combines the extra gradient method with a feedback-based stochastic zeroth-order gradient estimator. Then, we apply another coordinate gradient estimator to design the zeroth-order coordinate extra gradient algorithm (ZOCEG) to further improve efficiency. The theoretical analysis shows that ZOEG can achieve the best-known oracle complexity of $\mathcal{O}(dε^{-2})$ to get an $ε$-optimal solution ($d$ is the dimension of decision space), and ZOCEG can improve it to $\mathcal{O}(dε^{-1})$. Furthermore, we develop a variant of ZOCEG, which applies block coordinate updates to enhance the efficiency of single-step gradient estimation. Finally, numerical experiments on a load tracking problem validate our theoretical results and the effectiveness of the proposed algorithms.
△ Less
Submitted 25 June, 2025;
originally announced June 2025.
-
FICA: Faster Inner Convex Approximation of Chance Constrained Grid Dispatch with Decision-Coupled Uncertainty
Authors:
Yihong Zhou,
Hanbin Yang,
Thomas Morstyn
Abstract:
This paper proposes a Faster Inner Convex Approximation (FICA) method for solving power system dispatch problems with Wasserstein distributionally robust joint chance constraints (WJCC) and incorporating the modelling of the automatic generation control factors. The problem studied belongs to the computationally challenging class of WJCC with left-hand-side uncertainty (LHS-WJCC). By exploiting th…
▽ More
This paper proposes a Faster Inner Convex Approximation (FICA) method for solving power system dispatch problems with Wasserstein distributionally robust joint chance constraints (WJCC) and incorporating the modelling of the automatic generation control factors. The problem studied belongs to the computationally challenging class of WJCC with left-hand-side uncertainty (LHS-WJCC). By exploiting the special one-dimensional structure (even if only partially present) of the problem, the proposed FICA incorporates a set of strong valid inequalities to accelerate the solution process. We prove that FICA achieves the same optimality as the well-known conditional value-at-risk (CVaR) inner convex approximation method. Our numerical experiments demonstrate that the proposed FICA can yield 40x computational speedup compared to CVaR, and can even reach up to 500x speedup when the optimisation horizon exceeds 16 time steps. This speedup is achieved when only 50% of constraints in a WJCC have the one-dimensional structure. The approximation quality is numerically verified to be the same as CVaR, and the quality gap is below 1% when compared to the computationally demanding exact reformulation of the LHS-WJCC in most cases. We also discuss the applications of FICA in optimisation problems from other domains that (partially) exhibit the one-dimensional structure.
△ Less
Submitted 23 June, 2025;
originally announced June 2025.
-
Circular Directional Flow Decomposition of Networks
Authors:
Marc Homs-Dones,
Robert S. MacKay,
Bazil Sansom,
Yijie Zhou
Abstract:
We introduce the Circular Directional Flow Decomposition (CDFD), a new framework for analyzing circularity in weighted directed networks. CDFD separates flow into two components: a circular (divergence-free) component and an acyclic component that carries all nett directional flow. This yields a normalized circularity index between 0 (fully acyclic) and 1 (for networks formed solely by the superpo…
▽ More
We introduce the Circular Directional Flow Decomposition (CDFD), a new framework for analyzing circularity in weighted directed networks. CDFD separates flow into two components: a circular (divergence-free) component and an acyclic component that carries all nett directional flow. This yields a normalized circularity index between 0 (fully acyclic) and 1 (for networks formed solely by the superposition of cycles), with the complement measuring directionality. This index captures the proportion of flow involved in cycles, and admits a range of interpretations - such as system closure, feedback, weighted strong connectivity, structural redundancy, or inefficiency. Although the decomposition is generally non-unique, we show that the set of all decompositions forms a well-structured geometric space with favourable topological properties. Within this space, we highlight two benchmark decompositions aligned with distinct analytical goals: the maximum circularity solution, which minimizes nett flow, and the Balanced Flow Forwarding (BFF) solution, a unique, locally computable decomposition that distributes circular flow across all feasible cycles in proportion to the original network structure. We demonstrate the interpretive value and computational tractability of both decompositions on synthetic and empirical networks. They outperform existing circularity metrics in detecting meaningful structural variation. The decomposition also enables structural analysis - such as mapping the distribution of cyclic flow - and supports practical applications that require explicit flow allocation or routing, including multilateral netting and efficient transport.
△ Less
Submitted 14 June, 2025;
originally announced June 2025.
-
An Efficient Augmented Lagrangian Method for Dynamic Optimal Transport on Surfaces Based on Second-Order Cone Programming
Authors:
Liang Chen,
Youyicun Lin,
Yuxuan Zhou
Abstract:
This paper proposes an efficient numerical optimization approach for solving dynamic optimal transport (DOT) problems on general smooth surfaces, computing both the quadratic Wasserstein distance and the associated transportation path. Building on the convex DOT model of Benamou and Brenier, we first properly reformulate its dual problem, discretized on a triangular mesh for space together with a…
▽ More
This paper proposes an efficient numerical optimization approach for solving dynamic optimal transport (DOT) problems on general smooth surfaces, computing both the quadratic Wasserstein distance and the associated transportation path. Building on the convex DOT model of Benamou and Brenier, we first properly reformulate its dual problem, discretized on a triangular mesh for space together with a staggered grid for time, to a linear second-order cone programming. Then the resulting finite-dimensional convex optimization problem is solved via an inexact semi-proximal augmented Lagrangian method with a highly efficient numerical implementation, and the algorithm is guaranteed to converge to a Karush-Kuhn-Tucker solution without imposing any additional assumptions. Finally, we implement the proposed methodology as an open-source software package. The effectiveness, robustness, and computational efficiency of the software are demonstrated through extensive numerical experiments across diverse datasets, where it consistently outperforms state-of-the-art solvers by several times in speed.
△ Less
Submitted 10 June, 2025;
originally announced June 2025.
-
Some series connecting Fibonacci numbers to $π$
Authors:
Zhi-Wei Sun,
Yajun Zhou
Abstract:
Exploring the theory of Guillera--Rogers, we evaluate some infinite series whose summands are quadratic irrationals, in terms of $π$ and special values of Dirichlet $L$-functions. For example, we show that \[\sum_{k=1}^\infty\frac{3 \left(16 \sqrt{5}-35\right) k-4 \left(5 \sqrt{5}-11\right)}{k^{3}\binom{2k}{k}^3}\left(\frac{1+\sqrt{5}}{2} \right)^{8 k}=\frac{71π^{2}}{30}\]and\begin{align*}&\sum_{k…
▽ More
Exploring the theory of Guillera--Rogers, we evaluate some infinite series whose summands are quadratic irrationals, in terms of $π$ and special values of Dirichlet $L$-functions. For example, we show that \[\sum_{k=1}^\infty\frac{3 \left(16 \sqrt{5}-35\right) k-4 \left(5 \sqrt{5}-11\right)}{k^{3}\binom{2k}{k}^3}\left(\frac{1+\sqrt{5}}{2} \right)^{8 k}=\frac{71π^{2}}{30}\]and\begin{align*}&\sum_{k=1}^\infty\frac{6 \left(17 \sqrt{7}+35\right) k- 35 \sqrt{7}-89}{k^{3}\binom{2k}{k}^3}\left(-2^{11}\right)^k\big(45-17\sqrt{7}\big)^{2k}\\={}&128\left[ 20L_{-8}(2)-7\sqrt{7}L_{-56}(2) \right],\end{align*}where the central binomial coefficients are given by $ \binom{2k}k:=\frac{(2k)!}{(k!)^{2}} $, and the special Dirichlet $L$-values $ L_d(2):= \sum_{k=1}^\infty\left( \frac{d}{k} \right)\frac1{k^2}$ are defined through the Kronecker symbol $ \left(\frac{d}{\cdot}\right)$.
△ Less
Submitted 12 June, 2025; v1 submitted 2 June, 2025;
originally announced June 2025.
-
Characterizing the limiting critical Potts measures on locally regular-tree-like expander graphs
Authors:
Hang Du,
Yanxin Zhou
Abstract:
For any integers $d,q\ge 3$, we consider the $q$-state ferromagnetic Potts model with an external field on a sequence of expander graphs that converges to the $d$-regular tree $\mathtt{T}_d$ in the Benjamini-Schramm sense. We show that along the critical line, any subsequential local weak limit of the Potts measures is a mixture of the free and wired Potts Gibbs measures on $\mathtt{T}_d$. Further…
▽ More
For any integers $d,q\ge 3$, we consider the $q$-state ferromagnetic Potts model with an external field on a sequence of expander graphs that converges to the $d$-regular tree $\mathtt{T}_d$ in the Benjamini-Schramm sense. We show that along the critical line, any subsequential local weak limit of the Potts measures is a mixture of the free and wired Potts Gibbs measures on $\mathtt{T}_d$. Furthermore, we show the possibility of an arbitrary extent of strong phase coexistence: for any $α\in [0,1]$, there exists a sequence of locally $\mathtt{T}_d$-like expander graphs $\{G_n\}$, such that the Potts measures on $\{G_n\}$ locally weakly converges to the $(α,1-α)$-mixture of the free and wired Potts Gibbs measures. Our result extends results of \cite{HJP23} which restrict to the zero-field case and also require $q$ to be sufficiently large relative to $d$, and results of \cite{BDS23} which restrict to the even $d$ case. We also confirm the phase coexistence prediction of \cite{BDS23}, asserting that the Potts local weak limit is a genuine mixture of the free and wired states in a generic setting. We further characterize the subsequential local weak limits of random cluster measures on such graph sequences, for any cluster parameter $q>2$ (not necessarily integer).
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
Asymptotic-preserving schemes for the initial-boundary value problem of hyperbolic relaxation systems
Authors:
Yizhou Zhou
Abstract:
In this work, we present a numerical method for the initial-boundary value problem (IBVP) of first-order hyperbolic systems with source terms. The scheme directly solves the relaxation system using a relatively coarse mesh and captures the equilibrium behavior quite well, even in the presence of boundary layers. This method extends the concept of asymptotic-preserving schemes from initial-value pr…
▽ More
In this work, we present a numerical method for the initial-boundary value problem (IBVP) of first-order hyperbolic systems with source terms. The scheme directly solves the relaxation system using a relatively coarse mesh and captures the equilibrium behavior quite well, even in the presence of boundary layers. This method extends the concept of asymptotic-preserving schemes from initial-value problems to IBVPs. Moreover, we apply this idea to design a unified numerical scheme for the interface problem of relaxation systems.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
Wasserstein Transfer Learning
Authors:
Kaicheng Zhang,
Sinian Zhang,
Doudou Zhou,
Yidong Zhou
Abstract:
Transfer learning is a powerful paradigm for leveraging knowledge from source domains to enhance learning in a target domain. However, traditional transfer learning approaches often focus on scalar or multivariate data within Euclidean spaces, limiting their applicability to complex data structures such as probability distributions. To address this, we introduce a novel framework for transfer lear…
▽ More
Transfer learning is a powerful paradigm for leveraging knowledge from source domains to enhance learning in a target domain. However, traditional transfer learning approaches often focus on scalar or multivariate data within Euclidean spaces, limiting their applicability to complex data structures such as probability distributions. To address this, we introduce a novel framework for transfer learning in regression models, where outputs are probability distributions residing in the Wasserstein space. When the informative subset of transferable source domains is known, we propose an estimator with provable asymptotic convergence rates, quantifying the impact of domain similarity on transfer efficiency. For cases where the informative subset is unknown, we develop a data-driven transfer learning procedure designed to mitigate negative transfer. The proposed methods are supported by rigorous theoretical analysis and are validated through extensive simulations and real-world applications.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
Liouville theorem for subcritical nonlinear heat equation
Authors:
Yang Zhou
Abstract:
We obtain a Li-Yau-type estimate for nonnegative ancient solutions to the subcritical semilinear heat equation $\frac{\p u}{\p t}=\De u+u^p$ in $\rz^n\times(-\infty,0)$. Then, we combine the Li-Yau type estimate and Melre-Zaag's result to prove the Liouville theorem of this equation.
We obtain a Li-Yau-type estimate for nonnegative ancient solutions to the subcritical semilinear heat equation $\frac{\p u}{\p t}=\De u+u^p$ in $\rz^n\times(-\infty,0)$. Then, we combine the Li-Yau type estimate and Melre-Zaag's result to prove the Liouville theorem of this equation.
△ Less
Submitted 18 May, 2025;
originally announced May 2025.
-
Bilevel Transmission Expansion Planning with Joint Chance-Constrained Dispatch
Authors:
Yuxin Xia,
Yihong Zhou,
Iacopo Savelli,
Thomas Morstyn
Abstract:
In transmission expansion planning (TEP), network planners make long-term investment decisions while anticipating market clearing outcomes that are increasingly affected by renewable generation uncertainty. Additionally, market participants' sensitivity to network charges and the requirement for cost recovery by the network planner introduce further complexity. Since the day-ahead market clears be…
▽ More
In transmission expansion planning (TEP), network planners make long-term investment decisions while anticipating market clearing outcomes that are increasingly affected by renewable generation uncertainty. Additionally, market participants' sensitivity to network charges and the requirement for cost recovery by the network planner introduce further complexity. Since the day-ahead market clears before uncertainty realizes, explicitly modelling these uncertainties at the lower-level market clearing becomes important in bilevel TEP problems. In this paper, we introduce a novel bilevel TEP framework with lower-level joint chance-constrained market clearing that manages line flow constraints under wind uncertainty and accounts for the effect of network tariffs on participants' actual marginal costs and utility. To solve this complex problem, we propose a Strengthened Linear Approximation (SLA) technique for handling Wasserstein distributionally robust joint chance constraints with right-hand-side uncertainties (RHS-WDRJCC). The proposed method offers more efficient approximations without additional conservativeness and avoids the numerical issues encountered in existing approaches by introducing valid inequalities. The case study demonstrates that the proposed model achieves the desired out-of-sample constraint satisfaction probability. Moreover, the numerical results highlight the significant computational advantage of SLA, achieving up to a 26x speedup compared to existing methods such as worst-case conditional value-at-risk, while maintaining high solution quality.
△ Less
Submitted 16 May, 2025;
originally announced May 2025.
-
Asynchronous Decentralized SGD under Non-Convexity: A Block-Coordinate Descent Framework
Authors:
Yijie Zhou,
Shi Pu
Abstract:
Decentralized optimization has become vital for leveraging distributed data without central control, enhancing scalability and privacy. However, practical deployments face fundamental challenges due to heterogeneous computation speeds and unpredictable communication delays. This paper introduces a refined model of Asynchronous Decentralized Stochastic Gradient Descent (ADSGD) under practical assum…
▽ More
Decentralized optimization has become vital for leveraging distributed data without central control, enhancing scalability and privacy. However, practical deployments face fundamental challenges due to heterogeneous computation speeds and unpredictable communication delays. This paper introduces a refined model of Asynchronous Decentralized Stochastic Gradient Descent (ADSGD) under practical assumptions of bounded computation and communication times. To understand the convergence of ADSGD, we first analyze Asynchronous Stochastic Block Coordinate Descent (ASBCD) as a tool, and then show that ADSGD converges under computation-delay-independent step sizes. The convergence result is established without assuming bounded data heterogeneity. Empirical experiments reveal that ADSGD outperforms existing methods in wall-clock convergence time across various scenarios. With its simplicity, efficiency in memory and communication, and resilience to communication and computation delays, ADSGD is well-suited for real-world decentralized learning tasks.
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
An efficient second-order cone programming approach for dynamic optimal transport on staggered grid discretization
Authors:
Liang Chen,
Youyicun Lin,
Yuxuan Zhou
Abstract:
This paper proposes an efficient numerical method based on second-order cone programming (SOCP) to solve dynamic optimal transport (DOT) problems with quadratic cost on staggered grid discretization. By properly reformulating discretized DOT problems into a linear SOCP, the proposed method eliminates the interpolation matrices and thus avoids solving a series of cubic equations and linear systems…
▽ More
This paper proposes an efficient numerical method based on second-order cone programming (SOCP) to solve dynamic optimal transport (DOT) problems with quadratic cost on staggered grid discretization. By properly reformulating discretized DOT problems into a linear SOCP, the proposed method eliminates the interpolation matrices and thus avoids solving a series of cubic equations and linear systems induced by interpolation. Then, by taking advantage of the SOCP reformulation, we can solve them efficiently by a computationally highly economical implementation of an inexact decomposition-based proximal augmented Lagrangian method. Moreover, we have made the proposed approach an open-source software package. Numerical experiments on various DOT problems suggest that the proposed approach performs significantly more efficiently than state-of-the-art software packages. In addition, it exhibits prominent robustness to problems with non-negative measures.
△ Less
Submitted 8 May, 2025;
originally announced May 2025.
-
Modeling Cascading Driver Interventions in Partially Automated Traffic: A Semi-Markov Chain Approach
Authors:
Zihao Li,
Fan Pu,
Soyoung Ahn,
Yang Zhou
Abstract:
This paper presents an analytical modeling framework for partially automated traffic, incorporating cascading driver intervention behaviors. In this framework, drivers of partially automated vehicles have the flexibility to switch driving modes (either AV or HDV) under lockout constraints. The cascading impact is captured by making the switching probability leader-dependent, highlighting the influ…
▽ More
This paper presents an analytical modeling framework for partially automated traffic, incorporating cascading driver intervention behaviors. In this framework, drivers of partially automated vehicles have the flexibility to switch driving modes (either AV or HDV) under lockout constraints. The cascading impact is captured by making the switching probability leader-dependent, highlighting the influence of the leading vehicle on mode choice and the potential propagation of mode changes throughout traffic. Due to the complexity of this system, traditional Markov-based methods are insufficient. To address this, the paper introduces an innovative semi-Markov chain framework with lockout constraints, ideally suited for modeling the system dynamics. This framework reformulates the system as a nonlinear model whose solution can be efficiently approximated using numerical methods from control theory, such as the Runge-Kutta algorithm. Moreover, the system is proven to be a piecewise affine bilinear system, with the existence of solutions and both local and global stability established via Brouwer's Fixed Point Theorem and the 1D Uncertainty Polytopes Theorem. Numerical experiments corroborate these theoretical findings, confirming the presence of cascading impacts and elucidating the influence of modeling parameters on traffic throughput, thereby deepening our understanding of the system's properties.
△ Less
Submitted 6 May, 2025;
originally announced May 2025.
-
Global well-posedness in the critical Besov space of the skew mean curvature flow in $\mathbb{R}^d: d\geq 4$
Authors:
Ning-An Lai,
Jie Shao,
Yi Zhou
Abstract:
In this paper, we are devoted to studying the global regularity for the skew mean curvature flow with small initial data in $\mathbb{R}^d\, (d\geq 4)$. By using a new div-curl lemma which was first introduced by the third author to establish a bilinear estimate, and also the interaction Morawetz estimate, the global well-posedness for the skew mean curvature flow in the critical Besov space is est…
▽ More
In this paper, we are devoted to studying the global regularity for the skew mean curvature flow with small initial data in $\mathbb{R}^d\, (d\geq 4)$. By using a new div-curl lemma which was first introduced by the third author to establish a bilinear estimate, and also the interaction Morawetz estimate, the global well-posedness for the skew mean curvature flow in the critical Besov space is established, and hence the corresponding result obtained by Huang, Li and Tataru (Int. Math. Res. Not. 2024, no. 5, 3748-3798) is substantially improved.
△ Less
Submitted 6 May, 2025;
originally announced May 2025.
-
A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems
Authors:
Xin Chen,
Yuze Chen,
Yuan Zhou
Abstract:
We study a class of sequential decision-making problems with augmented predictions, potentially provided by a machine learning algorithm. In this setting, the decision-maker receives prediction intervals for unknown parameters that become progressively refined over time, and seeks decisions that are competitive with the hindsight optimal under all possible realizations of both parameters and predi…
▽ More
We study a class of sequential decision-making problems with augmented predictions, potentially provided by a machine learning algorithm. In this setting, the decision-maker receives prediction intervals for unknown parameters that become progressively refined over time, and seeks decisions that are competitive with the hindsight optimal under all possible realizations of both parameters and predictions. We propose a minimax Markov Decision Process (minimax-MDP) framework, where the system state consists of an adversarially evolving environment state and an internal state controlled by the decision-maker. We introduce a set of future-imposed conditions that characterize the feasibility of minimax-MDPs and enable the design of efficient, often closed-form, robustly competitive policies. We illustrate the framework through three applications: multi-period inventory ordering with refining demand predictions, resource allocation with uncertain utility functions, and a multi-phase extension of the minimax-MDP applied to the inventory problem with time-varying ordering costs. Our results provide a tractable and versatile approach to robust online decision-making under predictive uncertainty.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
One Dimensional Asymptotic Plateau Problem in $n$-Dimensional Asymptotically Conical Manifolds
Authors:
Jiayin Liu,
Shijin Zhang,
Yuan Zhou
Abstract:
Let $(M,g)$ be an asymptotically conical Riemannian manifold having dimension $n\ge 2$, opening angle $α\in (0,π/2) \setminus \{\arcsin \frac{1}{2k+1}\}_{k \in \mathbb{N}}$ and positive asymptotic rate. Under the assumption that the exponential map is proper at each point, we give a solution to the one dimensional asymptotic Plateau problem on $M$. Precisely, for any pair of antipodal points in th…
▽ More
Let $(M,g)$ be an asymptotically conical Riemannian manifold having dimension $n\ge 2$, opening angle $α\in (0,π/2) \setminus \{\arcsin \frac{1}{2k+1}\}_{k \in \mathbb{N}}$ and positive asymptotic rate. Under the assumption that the exponential map is proper at each point, we give a solution to the one dimensional asymptotic Plateau problem on $M$. Precisely, for any pair of antipodal points in the ideal boundary $\partial_\infty M = \mathbb S^{n-1}$, we prove the existence of a geodesic line with asymptotic prescribed boundaries and the Morse index $\le n-1$.
△ Less
Submitted 22 April, 2025; v1 submitted 21 April, 2025;
originally announced April 2025.
-
Perturbed Proximal Gradient ADMM for Nonconvex Composite Optimization
Authors:
Yuan Zhou,
Xinli Shi,
Luyao Guo,
Jinde Cao,
Mahmoud Abdel-Aty
Abstract:
This paper proposes a Perturbed Proximal Gradient ADMM (PPG-ADMM) framework for solving general nonconvex composite optimization problems, where the objective function consists of a smooth nonconvex term and a nonsmooth weakly convex term for both primal variables.
Unlike existing ADMM-based methods which necessitate the function associated with the last updated primal variable to be smooth, the…
▽ More
This paper proposes a Perturbed Proximal Gradient ADMM (PPG-ADMM) framework for solving general nonconvex composite optimization problems, where the objective function consists of a smooth nonconvex term and a nonsmooth weakly convex term for both primal variables.
Unlike existing ADMM-based methods which necessitate the function associated with the last updated primal variable to be smooth, the proposed PPG-ADMM removes this restriction by introducing a perturbation mechanism, which also helps reduce oscillations in the primal-dual updates, thereby improving convergence stability.
By employing a linearization technique for the smooth term and the proximal operator for the nonsmooth and weakly convex term, the subproblems have closed-form solutions, significantly reducing computational complexity. The convergence is established through a technically constructed Lyapunov function, which guarantees sufficient descent and has a well-defined lower bound.
With properly chosen parameters, PPG-ADMM converges to an $ε$-approximate stationary point at a sublinear convergence rate of $\mathcal{O}(1/\sqrt{K})$.
Furthermore, by appropriately tuning the perturbation parameter $β$, it achieves an $ε$-stationary point, providing stronger optimality guarantees. We further apply PPG-ADMM to two practical distributed nonconvex composite optimization problems, i.e., the distributed partial consensus problem and the resource allocation problem. The algorithm operates in a fully decentralized manner without a central coordinating node. Finally, numerical experiments validate the effectiveness of PPG-ADMM, demonstrating its improved convergence performance.
△ Less
Submitted 17 April, 2025;
originally announced April 2025.
-
Decentralized Nonconvex Composite Federated Learning with Gradient Tracking and Momentum
Authors:
Yuan Zhou,
Xinli Shi,
Xuelong Li,
Jiachen Zhong,
Guanghui Wen,
Jinde Cao
Abstract:
Decentralized Federated Learning (DFL) eliminates the reliance on the server-client architecture inherent in traditional federated learning, attracting significant research interest in recent years. Simultaneously, the objective functions in machine learning tasks are often nonconvex and frequently incorporate additional, potentially nonsmooth regularization terms to satisfy practical requirements…
▽ More
Decentralized Federated Learning (DFL) eliminates the reliance on the server-client architecture inherent in traditional federated learning, attracting significant research interest in recent years. Simultaneously, the objective functions in machine learning tasks are often nonconvex and frequently incorporate additional, potentially nonsmooth regularization terms to satisfy practical requirements, thereby forming nonconvex composite optimization problems. Employing DFL methods to solve such general optimization problems leads to the formulation of Decentralized Nonconvex Composite Federated Learning (DNCFL), a topic that remains largely underexplored. In this paper, we propose a novel DNCFL algorithm, termed \bf{DEPOSITUM}. Built upon proximal stochastic gradient tracking, DEPOSITUM mitigates the impact of data heterogeneity by enabling clients to approximate the global gradient. The introduction of momentums in the proximal gradient descent step, replacing tracking variables, reduces the variance introduced by stochastic gradients. Additionally, DEPOSITUM supports local updates of client variables, significantly reducing communication costs. Theoretical analysis demonstrates that DEPOSITUM achieves an expected $ε$-stationary point with an iteration complexity of $\mathcal{O}(1/ε^2)$. The proximal gradient, consensus errors, and gradient estimation errors decrease at a sublinear rate of $\mathcal{O}(1/T)$. With appropriate parameter selection, the algorithm achieves network-independent linear speedup without requiring mega-batch sampling. Finally, we apply DEPOSITUM to the training of neural networks on real-world datasets, systematically examining the influence of various hyperparameters on its performance. Comparisons with other federated composite optimization algorithms validate the effectiveness of the proposed method.
△ Less
Submitted 17 April, 2025;
originally announced April 2025.
-
FedCanon: Non-Convex Composite Federated Learning with Efficient Proximal Operation on Heterogeneous Data
Authors:
Yuan Zhou,
Jiachen Zhong,
Xinli Shi,
Guanghui Wen,
Xinghuo Yu
Abstract:
Composite federated learning offers a general framework for solving machine learning problems with additional regularization terms. However, many existing methods require clients to perform multiple proximal operations to handle non-smooth terms and their performance are often susceptible to data heterogeneity. To overcome these limitations, we propose a novel composite federated learning algorith…
▽ More
Composite federated learning offers a general framework for solving machine learning problems with additional regularization terms. However, many existing methods require clients to perform multiple proximal operations to handle non-smooth terms and their performance are often susceptible to data heterogeneity. To overcome these limitations, we propose a novel composite federated learning algorithm called \textbf{FedCanon}, designed to solve the optimization problems comprising a possibly non-convex loss function and a weakly convex, potentially non-smooth regularization term. By decoupling proximal mappings from local updates, FedCanon requires only a single proximal evaluation on the server per iteration, thereby reducing the overall proximal computation cost. It also introduces control variables that incorporate global gradient information into client updates, which helps mitigate the effects of data heterogeneity. Theoretical analysis demonstrates that FedCanon achieves sublinear convergence rates under general non-convex settings and linear convergence under the Polyak-Łojasiewicz condition, without relying on bounded heterogeneity assumptions. Experiments demonstrate that FedCanon outperforms the state-of-the-art methods in terms of both accuracy and computational efficiency, particularly under heterogeneous data distributions.
△ Less
Submitted 16 April, 2025;
originally announced April 2025.
-
Low-Rank Tensor Recovery via Theta Nuclear p-Norm
Authors:
Felix Röhrich,
Yuhuai Zhou
Abstract:
We investigate the low-rank tensor recovery problem using a relaxation of the nuclear p-norm by theta bodies.
We provide algebraic descriptions of the norms and compute their Gröbner bases.
Moreover, we develop geometric properties of these bodies.
Finally, our numerical results suggest that for
$n\times\cdots\times n$ tensors,
$m\geq O(n)$ measurements should be sufficient to recover lo…
▽ More
We investigate the low-rank tensor recovery problem using a relaxation of the nuclear p-norm by theta bodies.
We provide algebraic descriptions of the norms and compute their Gröbner bases.
Moreover, we develop geometric properties of these bodies.
Finally, our numerical results suggest that for
$n\times\cdots\times n$ tensors,
$m\geq O(n)$ measurements should be sufficient to recover low-rank tensors via theta body relaxation.
△ Less
Submitted 11 April, 2025;
originally announced April 2025.
-
Multi-bubble solutions for the Dirichlet problem of the $H$-system with higher degree
Authors:
Xiang Fang,
Juncheng Wei,
Youquan Zheng,
Yifu Zhou
Abstract:
We consider a Dirichlet problem of the $H$-system \begin{equation*} \begin{cases} Δv = 2v_x\wedge v_y ~& \text{ in }\mathcal{D},\\ v=\varepsilon \tilde g ~& \text{ on }\partial{\mathcal{D}}, \end{cases} \end{equation*} where $\mathcal D\subset \mathbb{R}^2$ is the unit disk, $v:\mathcal D\to \mathbb{R}^3$, and $\tilde g:\partial \mathcal D\to \mathbb{R}^3$ is a given smooth map. As…
▽ More
We consider a Dirichlet problem of the $H$-system \begin{equation*} \begin{cases} Δv = 2v_x\wedge v_y ~& \text{ in }\mathcal{D},\\ v=\varepsilon \tilde g ~& \text{ on }\partial{\mathcal{D}}, \end{cases} \end{equation*} where $\mathcal D\subset \mathbb{R}^2$ is the unit disk, $v:\mathcal D\to \mathbb{R}^3$, and $\tilde g:\partial \mathcal D\to \mathbb{R}^3$ is a given smooth map. As $\varepsilon\to 0^+$, we construct multi-bubble solutions concentrating at distinct points, taking around each point the profile of degree 2 $H$-bubble. This gives a partial answer to a conjecture due to Brezis-Coron and Chanillo-Malchiodi concerning the limiting configuration in the case of higher degrees. This seems to be the first construction in employing higher-degree harmonic maps as the primary configurations.
△ Less
Submitted 2 June, 2025; v1 submitted 8 April, 2025;
originally announced April 2025.
-
Trisimplicial vertices in (fork, odd parachute)-free graphs
Authors:
Kaiyang Lan,
Feng Liu,
Di Wu,
Yidong Zhou
Abstract:
An {\em odd hole} in a graph is an induced subgraph which is a cycle of odd length at least five. An {\em odd parachute} is a graph obtained from an odd hole $H$ by adding a new edge $uv$ such that $x$ is adjacent to $u$ but not to $v$ for each $x\in V(H)$. A graph $G$ is perfectly divisible if for each induced subgraph $H$ of $G$, $V(H)$ can be partitioned into $A$ and $B$ such that $H[A]$ is per…
▽ More
An {\em odd hole} in a graph is an induced subgraph which is a cycle of odd length at least five. An {\em odd parachute} is a graph obtained from an odd hole $H$ by adding a new edge $uv$ such that $x$ is adjacent to $u$ but not to $v$ for each $x\in V(H)$. A graph $G$ is perfectly divisible if for each induced subgraph $H$ of $G$, $V(H)$ can be partitioned into $A$ and $B$ such that $H[A]$ is perfect and $ω(H[B])<ω(H)$. A vertex of a graph is {\em trisimplicial} if its neighbourhood is the union of three cliques. In this paper, we prove that $χ(G)\leq \binom{ω(G)+1}{2}$ if $G$ is a (fork, odd parachute)-free graph by showing that $G$ contains a trisimplicial vertex when $G$ is nonperfectly divisible. This generalizes some results of Karthick, Kaufmann and Sivaraman [{\em Electron. J. Combin.} \textbf{29} (2022) \#P3.19], and Wu and Xu [{\em Discrete Math.} \textbf{347} (2024) 114121]. As a corollary, every nonperfectly divisible claw-free graph contains a trisimplicial vertex.
△ Less
Submitted 6 April, 2025;
originally announced April 2025.
-
Existence of Full Replica Symmetry Breaking for the Sherrington-Kirkpatrick Model at Low Temperature
Authors:
Yuxin Zhou
Abstract:
We prove the existence of full replica symmetry breaking (FRSB) for the Sherrington-Kirkpatrick (SK) model at low temperature. More specifically, we prove that slightly beyond the critical temperature, the Parisi measure for the SK model is supported on an interval starting at the origin and only has one jump discontinuity at the right endpoint.
We prove the existence of full replica symmetry breaking (FRSB) for the Sherrington-Kirkpatrick (SK) model at low temperature. More specifically, we prove that slightly beyond the critical temperature, the Parisi measure for the SK model is supported on an interval starting at the origin and only has one jump discontinuity at the right endpoint.
△ Less
Submitted 15 April, 2025; v1 submitted 31 March, 2025;
originally announced April 2025.
-
Nested Stochastic Algorithm for Generalized Sinkhorn distance-Regularized Distributionally Robust Optimization
Authors:
Yufeng Yang,
Yi Zhou,
Zhaosong Lu
Abstract:
Distributionally robust optimization (DRO) is a powerful technique to train robust models against data distribution shift. This paper aims to solve regularized nonconvex DRO problems, where the uncertainty set is modeled by a so-called generalized Sinkhorn distance and the loss function is nonconvex and possibly unbounded. Such a distance allows to model uncertainty of distributions with different…
▽ More
Distributionally robust optimization (DRO) is a powerful technique to train robust models against data distribution shift. This paper aims to solve regularized nonconvex DRO problems, where the uncertainty set is modeled by a so-called generalized Sinkhorn distance and the loss function is nonconvex and possibly unbounded. Such a distance allows to model uncertainty of distributions with different probability supports and divergence functions. For this class of regularized DRO problems, we derive a novel dual formulation taking the form of nested stochastic optimization, where the dual variable depends on the data sample. To solve the dual problem, we provide theoretical evidence to design a nested stochastic gradient descent (SGD) algorithm, which leverages stochastic approximation to estimate the nested stochastic gradients. We study the convergence rate of nested SGD and establish polynomial iteration and sample complexities that are independent of the data size and parameter dimension, indicating its potential for solving large-scale DRO problems. We conduct numerical experiments to demonstrate the efficiency and robustness of the proposed algorithm.
△ Less
Submitted 26 June, 2025; v1 submitted 28 March, 2025;
originally announced March 2025.
-
Uniform vector bundles over $\mathbb{P}^4$
Authors:
Rong Du,
Yuhang Zhou
Abstract:
There is a long-standing conjecture which states that every uniform algebraic vector bundle of rank $r<2n$ on the $n$-dimensional projective space $\mathbb{P}^n$ over an algebraically closed field of characteristic $0$ is homogeneous. This conjecture is valid for $n\leq3$. In this paper, we classify all uniform vector bundles of rank $r<8$ over $\mathbb{P}^4$ and show that the conjecture holds for…
▽ More
There is a long-standing conjecture which states that every uniform algebraic vector bundle of rank $r<2n$ on the $n$-dimensional projective space $\mathbb{P}^n$ over an algebraically closed field of characteristic $0$ is homogeneous. This conjecture is valid for $n\leq3$. In this paper, we classify all uniform vector bundles of rank $r<8$ over $\mathbb{P}^4$ and show that the conjecture holds for $n=4$.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
A proof of the multi-component $q$-Baker--Forrester conjecture
Authors:
Yue Zhou
Abstract:
The Selberg integral, an $n$-dimensional generalization of the Euler beta integral, plays a central role in random matrix theory, Calogero--Sutherland quantum many body systems, Knizhnik--Zamolodchikov equations, and multivariable orthogonal polynomial theory. The Selberg integral is known to be equivalent to the Morris constant term identity. In 1998, Baker and Forrester conjectured a $(p+1)$-com…
▽ More
The Selberg integral, an $n$-dimensional generalization of the Euler beta integral, plays a central role in random matrix theory, Calogero--Sutherland quantum many body systems, Knizhnik--Zamolodchikov equations, and multivariable orthogonal polynomial theory. The Selberg integral is known to be equivalent to the Morris constant term identity. In 1998, Baker and Forrester conjectured a $(p+1)$-component generalization of the $q$-Morris identity. It in turn yields a generalization of the Selberg integral. The $p=1$ case of Baker and Forrester's conjecture was proved by Károlyi, Nagy, Petrov and Volkov in 2015. In this paper, we give a proof of the $(p+1)$-component $q$-Baker--Forrester conjecture, thereby settling this 26-year-old conjecture.
△ Less
Submitted 23 March, 2025;
originally announced March 2025.
-
Optimal Investment Portfolio of Thyristor- and IGBT-based Electrolysis Rectifiers in Utility-scale Renewable P2H Systems
Authors:
Yangjun Zeng,
Yiwei Qiu,
Liuchao Xu,
Chenjia Gu,
Yi Zhou,
Jiarong Li,
Shi Chen,
Buxiang Zhou
Abstract:
Renewable power-to-hydrogen (ReP2H) systems require rectifiers to supply power to electrolyzers (ELZs). Two main types of rectifiers, insulated-gate bipolar transistor rectifiers (IGBT-Rs) and thyristor rectifiers (TRs), offer distinct tradeoffs. IGBT-Rs provide flexible reactive power control but are costly, whereas TRs are more affordable with lower power loss but consume a large amount of uncon…
▽ More
Renewable power-to-hydrogen (ReP2H) systems require rectifiers to supply power to electrolyzers (ELZs). Two main types of rectifiers, insulated-gate bipolar transistor rectifiers (IGBT-Rs) and thyristor rectifiers (TRs), offer distinct tradeoffs. IGBT-Rs provide flexible reactive power control but are costly, whereas TRs are more affordable with lower power loss but consume a large amount of uncontrollable reactive power. A mixed configuration of rectifiers in utility-scale ReP2H systems could achieve an decent tradeoff and increase overall profitability. To explore this potential, this paper proposes an optimal investment portfolio model. First, we model and compare the active and reactive power characteristics of ELZs powered by TRs and IGBT-Rs. Second, we consider the investment of ELZs, rectifiers, and var resources and coordinate the operation of renewables, energy storage, var resources, and the on-off switching and load allocation of multiple ELZs. Subsequently, a two-stage stochastic programming (SP) model based on weighted information gap decision theory (W-IGDT) is developed to address the uncertainties of the renewable power and hydrogen price, and we apply the progressive hedging (PH) algorithm to accelerate its solution. Case studies demonstrate that optimal rectifier configurations increase revenue by at most 2.56% compared with using only TRs or IGBT-Rs, as well as those in existing projects. Under the optimal portfolio, reactive power compensation investment is nearly eliminated, with a preferred TR-to-IGBT-R ratio of 3:1.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
Maximal $L_p$-regularity for fractional problem driven by non-autonomous forms
Authors:
Jia Wei He,
Shi Long Li,
Yong Zhou
Abstract:
We investigate the maximal $L_p$-regularity in J.L. Lions' problem involving a time-fractional derivative and a non-autonomous form $a(t;\cdot,\cdot)$ on a Hilbert space $H$. This problem says whether the maximal $L_p$-regularity in $H$ hold when $t \mapsto a(t ; u, v)$ is merely continuous or even merely measurable. We prove the maximal $L_p$-regularity results when the coefficients satisfy gener…
▽ More
We investigate the maximal $L_p$-regularity in J.L. Lions' problem involving a time-fractional derivative and a non-autonomous form $a(t;\cdot,\cdot)$ on a Hilbert space $H$. This problem says whether the maximal $L_p$-regularity in $H$ hold when $t \mapsto a(t ; u, v)$ is merely continuous or even merely measurable. We prove the maximal $L_p$-regularity results when the coefficients satisfy general Dini-type continuity conditions. In particular, we construct a counterexample to negatively answer this problem, indicating the minimal Hölder-scale regularity required for positive results.
△ Less
Submitted 17 March, 2025; v1 submitted 12 March, 2025;
originally announced March 2025.
-
Notes on certain binomial harmonic sums of Sun's type
Authors:
Yajun Zhou
Abstract:
We prove and generalize some recent conjectures of Z.-W. Sun on infinite series whose summands involve products of harmonic numbers and several binomial coefficients. We evaluate various classes of infinite sums in closed form by interpreting them as automorphic objects on the moduli spaces for Legendre curves $Y^{ g+1}=(1-X)^{ g}X(1-t X)$ of positive genera $ g\in\{1,2,3,5\}$.
We prove and generalize some recent conjectures of Z.-W. Sun on infinite series whose summands involve products of harmonic numbers and several binomial coefficients. We evaluate various classes of infinite sums in closed form by interpreting them as automorphic objects on the moduli spaces for Legendre curves $Y^{ g+1}=(1-X)^{ g}X(1-t X)$ of positive genera $ g\in\{1,2,3,5\}$.
△ Less
Submitted 7 March, 2025;
originally announced March 2025.
-
A spectral Levenberg-Marquardt-Deflation method for multiple solutions of semilinear elliptic systems
Authors:
Lin Li,
Yuheng Zhou,
Pengcheng Xie,
Huiyuan Li
Abstract:
Many nonlinear differential equations arising from practical problems may permit nontrivial multiple solutions relevant to applications, and these multiple solutions are helpful to deeply understand these practical problems and to improve some applications. Developing an efficient numerical method for finding multiple solutions is very necessary due to the nonlinearity and multiple solutions of th…
▽ More
Many nonlinear differential equations arising from practical problems may permit nontrivial multiple solutions relevant to applications, and these multiple solutions are helpful to deeply understand these practical problems and to improve some applications. Developing an efficient numerical method for finding multiple solutions is very necessary due to the nonlinearity and multiple solutions of these equations. Moreover, providing an efficient iteration plays an important role in successfully obtaining multiple solutions with fast and stable convergence. In the current paper, an efficient algorithm for finding multiple solutions of semilinear elliptic systems is proposed, where the trust region Levenberg-Marquardt method is firstly used to iterate the resulted nonlinear algebraic system. When the nonlinear term in these equations has only the first derivative, our algorithm can efficiently find multiple solutions as well. Several numerical experiments are tested to show the efficiency of our algorithm, and some solutions which have not been shown in the literature are also found and shown.
△ Less
Submitted 16 April, 2025; v1 submitted 1 March, 2025;
originally announced March 2025.
-
Hyperbolic Monopoles, (Semi-)Holomorphic Chern-Simons Theories, and Generalized Chiral Potts Models
Authors:
Seyed Faroogh Moosavian,
Masahito Yamazaki,
Yehao Zhou
Abstract:
We study the relation between spectral data of magnetic monopoles in hyperbolic space and the curve of the spectral parameter of generalized chiral Potts models (gCPM) through the lens of (semi-)holomorphic field theories. We realize the identification of the data on the two sides, which we call the hyperbolic monopole/gCPM correspondence. For the group $\text{SU}(2)$, this correspondence had been…
▽ More
We study the relation between spectral data of magnetic monopoles in hyperbolic space and the curve of the spectral parameter of generalized chiral Potts models (gCPM) through the lens of (semi-)holomorphic field theories. We realize the identification of the data on the two sides, which we call the hyperbolic monopole/gCPM correspondence. For the group $\text{SU}(2)$, this correspondence had been observed by Atiyah and Murray in the 80s. Here, we revisit and generalize this correspondence and establish its origin. By invoking the work of Murray and Singer on hyperbolic monopoles, we first generalize the observation of Atiyah and Murray to the group $\text{SU}(n)$. We then propose a technology to engineer gCPM within the 4d Chern-Simons (CS) theory, which explains various features of the model, including the lack of rapidity-difference property of its R-matrix and its peculiarity of having a genus$\,\ge 2$ curve of the spectral parameter. Finally, we investigate the origin of the correspondence. We first clarify how the two sides of the correspondence can be realized from the 6d holomorphic CS theory on $\mathbb{P}S(M)$, the projective spinor bundle of the Minkowski space $M=\mathbb{R}^{1,3}$, for hyperbolic $\text{SU}(n)$-monopoles, and the Euclidean space $M=\mathbb{R}^4$, for the gCPM. We then establish that $\mathbb{P}S(M)$ can be holomorphically embedded into $\mathbb{P}S(\mathbb{C}^{1,3})$, the projective spinor bundle of $\mathbb{C}^{1,3}$, of complex dimension five with a fixed complex structure. We finally explain how the 6d CS theory on $\mathbb{P}S(M)$ can be realized as the dimensional reduction of the 10d holomorphic CS theory on $\mathbb{P}S(\mathbb{C}^{1,3})$. As the latter theory is only sensitive to the complex structure of $\mathbb{P}S(\mathbb{C}^{1,3})$, which has been fixed, we realize the correspondence as two incarnations of the same physics in ten dimensions.
△ Less
Submitted 24 February, 2025;
originally announced February 2025.
-
Dynamics near a class of nonhyperbolic fixed points
Authors:
Meihua Jin,
Shihao Meng,
Yunhua Zhou
Abstract:
In this paper, we investigate some dynamical properties near a nonhyperbolic fixed point. Under some conditions on the higher nonlinear terms, we establish a stable manifold theorem and a degenerate Hartman theorem. Furthermore, the finite shadowing property also be discussed.
In this paper, we investigate some dynamical properties near a nonhyperbolic fixed point. Under some conditions on the higher nonlinear terms, we establish a stable manifold theorem and a degenerate Hartman theorem. Furthermore, the finite shadowing property also be discussed.
△ Less
Submitted 24 February, 2025;
originally announced February 2025.
-
The Stability of Pointwise Hyperbolic Systems
Authors:
Haiye Guo,
Yunhua Zhou
Abstract:
The stability of the system is an important part of the research on differential dynamical systems. This paper considers a pointwise hyperbolic system defined on a connected open subset N of a compact smooth Riemannian manifold M. The hyperbolicity may weaken when approaching the boundary of the open set. By analogy with the stability of hyperbolic systems, this paper constructs the expansive prop…
▽ More
The stability of the system is an important part of the research on differential dynamical systems. This paper considers a pointwise hyperbolic system defined on a connected open subset N of a compact smooth Riemannian manifold M. The hyperbolicity may weaken when approaching the boundary of the open set. By analogy with the stability of hyperbolic systems, this paper constructs the expansive property and the shadowing lemma on the pointwise pseudo orbits and thus obtains the stability of pointwise hyperbolic systems.
△ Less
Submitted 23 February, 2025;
originally announced February 2025.
-
Analysis and Improvement of Eviction Enforcement
Authors:
Baris Ata,
Yuwei Zhou
Abstract:
Each year, nearly 13,000 eviction orders are issued in Cook County, Illinois. While most of these orders have an enforcement deadline, a portion does not. The Cook County Sheriff's Office (CCSO) is responsible for enforcing these orders, which involves selecting the orders to prioritize and planning daily enforcement routes. This task presents a challenge: balancing "equity" (i.e., prioritizing or…
▽ More
Each year, nearly 13,000 eviction orders are issued in Cook County, Illinois. While most of these orders have an enforcement deadline, a portion does not. The Cook County Sheriff's Office (CCSO) is responsible for enforcing these orders, which involves selecting the orders to prioritize and planning daily enforcement routes. This task presents a challenge: balancing "equity" (i.e., prioritizing orders that have been waiting longer) with "efficiency" (i.e., maximizing the number of orders served). Although the current CCSO policy is highly efficient, a significant fraction of eviction orders miss their deadline. Motivated by the CCSO's operations, we study a model of eviction enforcement planning and propose a policy that dynamically prioritizes orders based on their type (deadline or no deadline), location, and waiting time. Our approach employs a budgeted prize-collecting vehicle routing problem (VRP) for daily planning, where the "prizes" are determined by solving a stochastic control problem. This stochastic control problem, which relies on the VRP for determining feasible actions at each decision point, is high-dimensional due to its spatial nature, leading to the curse of dimensionality. We overcome this challenge by building on recent advances in high-dimensional stochastic control using deep neural networks. We compare the performance of our proposed policy with two practical benchmark policies, including one that mimics the current CCSO policy, using data from CCSO. Similar to the CCSO policy, our proposed policy leads to efficient resource utilization, but it also reduces the percentage of orders that miss their deadline by 72.38% without degrading the overall service effort for either type of orders. In a counterfactual study, we show that increasing the service capacity or extending the enforcement deadline further reduces the fraction of orders missing their deadline.
△ Less
Submitted 22 February, 2025;
originally announced February 2025.
-
A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees
Authors:
Yuhao Zhou,
Jintao Xu,
Chenglong Bao,
Chao Ding,
Jun Zhu
Abstract:
We consider the problem of finding an $ε$-stationary point of a nonconvex function with a Lipschitz continuous Hessian and propose a quadratic regularized Newton method incorporating a new class of regularizers constructed from the current and previous gradients. The method leverages a recently developed linear conjugate gradient approach with a negative curvature monitor to solve the regularized…
▽ More
We consider the problem of finding an $ε$-stationary point of a nonconvex function with a Lipschitz continuous Hessian and propose a quadratic regularized Newton method incorporating a new class of regularizers constructed from the current and previous gradients. The method leverages a recently developed linear conjugate gradient approach with a negative curvature monitor to solve the regularized Newton equation. Notably, our algorithm is adaptive, requiring no prior knowledge of the Lipschitz constant of the Hessian, and achieves a global complexity of $O(ε^{-\frac{3}{2}}) + \tilde O(1)$ in terms of the second-order oracle calls, and $\tilde O(ε^{-\frac{7}{4}})$ for Hessian-vector products, respectively. Moreover, when the iterates converge to a point where the Hessian is positive definite, the method exhibits quadratic local convergence. Preliminary numerical results illustrate the competitiveness of our algorithm.
△ Less
Submitted 14 February, 2025; v1 submitted 7 February, 2025;
originally announced February 2025.
-
Global $C^{1,α}$ regularity for Monge-Ampère equations on planar convex domains
Authors:
Qing Han,
Jiakun Liu,
Yang Zhou
Abstract:
In this paper, we establish the global Hölder gradient estimate for solutions to the Dirichlet problem of the Monge-Ampère equation $\det D^2u = f$ on strictly convex but not uniformly convex domain $Ω$.
In this paper, we establish the global Hölder gradient estimate for solutions to the Dirichlet problem of the Monge-Ampère equation $\det D^2u = f$ on strictly convex but not uniformly convex domain $Ω$.
△ Less
Submitted 28 January, 2025;
originally announced January 2025.
-
Dynamic Operation and Control of a Multi-Stack Alkaline Water Electrolysis System with Shared Gas Separators and Lye Circulation: A Model-Based Study
Authors:
Yiwei Qiu,
Jiatong Li,
Yangjun Zeng,
Yi Zhou,
Shi Chen,
Xiaoyan Qiu,
Buxiang Zhou,
Ge He,
Xu Ji,
Wenying Li
Abstract:
An emerging approach for large-scale hydrogen production using renewable energy is to integrate multiple alkaline water electrolysis (AWE) stacks into a single balance of plant (BoP) system, sharing components such as gas-lye separation and lye circulation. This configuration, termed the $N$-in-1 AWE system, packs $N$ stacks into a modular system, reducing land requirements, the complexity of plan…
▽ More
An emerging approach for large-scale hydrogen production using renewable energy is to integrate multiple alkaline water electrolysis (AWE) stacks into a single balance of plant (BoP) system, sharing components such as gas-lye separation and lye circulation. This configuration, termed the $N$-in-1 AWE system, packs $N$ stacks into a modular system, reducing land requirements, the complexity of plant topology, and overall capital costs. However, the coupling of these stacks through the shared BoP introduces challenges in dynamic operation under varying energy inputs, making their performance unclear compared to traditional 1-in-1 systems. To address this, we develop a state-space model of the $N$-in-1 AWE system, capturing the dynamic behaviors of lye circulation, temperature, and HTO impurity, and their impact on energy conversion efficiency. We then propose a nonlinear model predictive controller (NMPC) to coordinately optimize inter-stack electrolytic current distribution, lye flow, and cooling, enabling the system to dynamically track varying load commands while maximizing efficiency, stabilizing temperature, and limiting HTO impurity accumulation. Simulation studies on a 4,000 Nm$^3$/h-rated 4-in-1 system verify the proposed controller under dynamic operation. Comparison with 4 independent 1-in-1 systems reveals that, with proper control, the $N$-in-1 configuration offers comparable flexibility in accommodating real-world wind power inputs. The average differences in the root-mean-square errors (RMSEs) for load-tracking and stack temperature stabilization, and specific energy consumption are below 0.014 MW, 2.356 K, and 0.003 kWh/Nm$^3$.
△ Less
Submitted 24 January, 2025;
originally announced January 2025.
-
Solving Random Hyperbolic Conservation Laws Using Linear Programming
Authors:
Shaoshuai Chu,
Michael Herty,
Maria Lukacova-Medvid'ova,
Yizhou Zhou
Abstract:
A novel structure-preserving numerical method to solve random hyperbolic systems of conservation laws is presented. The method uses a concept of generalized, measure-valued solutions to random conservation laws. This yields a linear partial differential equation with respect to the Young measure and allows to compute the approximation based on linear programming problems. We analyze the structure-…
▽ More
A novel structure-preserving numerical method to solve random hyperbolic systems of conservation laws is presented. The method uses a concept of generalized, measure-valued solutions to random conservation laws. This yields a linear partial differential equation with respect to the Young measure and allows to compute the approximation based on linear programming problems. We analyze the structure-preserving properties of the derived numerical method and discuss its advantages and disadvantages. Numerical results for one-dimensional Burgers equation and the isentropic Euler equations and comparisons with stochastic collocation method illustrate the behavior of the proposed numerical method.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
Feedback Arc Sets and Feedback Arc Set Decompositions in Weighted and Unweighted Oriented Graphs
Authors:
Gregory Gutin,
Mads Anker Nielsen,
Anders Yeo,
Yacong Zhou
Abstract:
For any arc-weighted oriented graph $D=(V(D), A(D),w)$, we write
${\rm fas}_w(D)$ to denote the minimum weight of a feedback arc set in $D$. In this paper, we consider upper bounds on ${\rm fas}_w(D)$ for arc-weight oriented graphs $D$ with bounded maximum degrees and directed girth. We obtain such bounds by introducing a new parameter ${\rm fasd}(D)$, which is the maximum integer such that…
▽ More
For any arc-weighted oriented graph $D=(V(D), A(D),w)$, we write
${\rm fas}_w(D)$ to denote the minimum weight of a feedback arc set in $D$. In this paper, we consider upper bounds on ${\rm fas}_w(D)$ for arc-weight oriented graphs $D$ with bounded maximum degrees and directed girth. We obtain such bounds by introducing a new parameter ${\rm fasd}(D)$, which is the maximum integer such that $A(D)$ can be partitioned into ${\rm fasd}(D)$ feedback arc sets. This new parameter seems to be interesting in its own right.
We obtain several bounds for both ${\rm fas}_w(D)$ and ${\rm fasd}(D)$ when $D$ has maximum degree $Δ(D)\le Δ$ and directed girth $g(D)\geq g$. In particular, we show that if $Δ(D)\leq~4$ and $g(D)\geq 3$, then ${\rm fasd}(D) \geq 3$ and therefore ${\rm fas}_w(D)\leq \frac{w(D)}{3}$ which generalizes a tight bound for an unweighted oriented graph with maximum degree at most 4. We also show that ${\rm fasd}(D)\geq g$ and ${\rm fas}_w(D) \leq \frac{w(D)}{g}$ if $Δ(D)\leq 3$ and $g(D)\geq g$ for $g\in \{3,4,5\}$ and these bounds are tight. However, for $g=10$ the bound ${\rm fasd}(D)\geq g$ does not always hold when $Δ(D)\leq 3$. Finally we give some bounds for the cases when $Δ$ or $g$ are large.
△ Less
Submitted 30 May, 2025; v1 submitted 12 January, 2025;
originally announced January 2025.
-
Oriented discrepancy of Hamilton cycles and paths in digraphs
Authors:
Qiwen Guo,
Gregory Gutin,
Yongxin Lan,
Qi Shao,
Anders Yeo,
Yacong Zhou
Abstract:
Erd{\H o}s (1963) initiated extensive graph discrepancy research on 2-edge-colored graphs. Gishboliner, Krivelevich, and Michaeli (2023) launched similar research on oriented graphs. They conjectured the following generalization of Dirac's theorem: If the minimum degree $δ$ of an $n$-vertex oriented graph $G$ is greater or equal to $n/2$,then $G$ has a Hamilton oriented cycle with at least $δ$ for…
▽ More
Erd{\H o}s (1963) initiated extensive graph discrepancy research on 2-edge-colored graphs. Gishboliner, Krivelevich, and Michaeli (2023) launched similar research on oriented graphs. They conjectured the following generalization of Dirac's theorem: If the minimum degree $δ$ of an $n$-vertex oriented graph $G$ is greater or equal to $n/2$,then $G$ has a Hamilton oriented cycle with at least $δ$ forward arcs. This conjecture was proved by Freschi and Lo (2024) who posed an open problem to extend their result to an Ore-type condition. We propose two conjectures for such extensions and prove some results which provide support to the conjectures. For forward arc maximization on Hamilton oriented cycles and paths in semicomplete multipartite digraphs and locally semicomplete digraphs, we obtain characterizations which lead to polynomial-time algorithms.
△ Less
Submitted 10 January, 2025;
originally announced January 2025.
-
Evaluation of Rail Decarbonization Alternatives: Framework and Application
Authors:
Adrian Hernandez,
Max TM Ng,
Nazib Siddique,
Pablo L. Durango-Cohen,
Amgad Elgowainy,
Hani S. Mahmassani,
Michael Wang,
Yan Zhou
Abstract:
The Northwestern University Freight Rail Infrastructure and Energy Network Decarbonization (NUFRIEND) framework is a comprehensive industry-oriented tool for simulating the deployment of new energy technologies including biofuels, e-fuels, battery-electric, and hydrogen locomotives. By classifying fuel types into two categories based on deployment requirements, the associated optimal charging/fuel…
▽ More
The Northwestern University Freight Rail Infrastructure and Energy Network Decarbonization (NUFRIEND) framework is a comprehensive industry-oriented tool for simulating the deployment of new energy technologies including biofuels, e-fuels, battery-electric, and hydrogen locomotives. By classifying fuel types into two categories based on deployment requirements, the associated optimal charging/fueling facility location and sizing problem are solved with a five-step framework. Life cycle analyses (LCA) and techno-economic analyses (TEA) are used to estimate carbon reduction, capital investments, cost of carbon reduction, and operational impacts, enabling sensitivity analysis with operational and technological parameters. The framework is illustrated on lower-carbon drop-in fuels as well as battery-electric technology deployments for US Eastern and Western Class I railroad networks. Drop-in fuel deployments are modeled as admixtures with diesel in existing locomotives, while battery-electric deployments are shown for varying technology penetration levels and locomotive ranges. When mixed in a 50 percent ratio with diesel, results show biodiesel's capacity to reduce emissions at 36 percent with a cost of 0.13 USD per kilogram of CO2 reduced, while e-fuels offer a 50 percent emissions reduction potential at a cost of 0.22 USD per kilogram of CO2 reduced. Battery-electric results for 50 percent deployment over all ton-miles highlight the value of future innovations in battery energy densities as scenarios assuming 800-mile range locomotives show an estimated emissions reduction of 46 percent with a cost of 0.06 USD per kilogram of CO2 reduced, compared to 16 percent emissions reduction at a cost of 0.11 USD per kilogram of CO2 reduced for 400-mile range locomotives.
△ Less
Submitted 2 January, 2025;
originally announced January 2025.
-
Exploring low-rank structure for an inverse scattering problem with far-field data
Authors:
Yuyuan Zhou,
Lorenzo Audibert,
Shixu Meng,
Bo Zhang
Abstract:
In this work, we introduce a novel low-rank structure tailored for solving the inverse scattering problem. The particular low-rank structure is given by the generalized prolate spheroidal wave functions, computed stably and accurately via a Sturm-Liouville problem. We first process the far-field data to obtain a post-processed data set within a disk domain. Subsequently, the post-processed data ar…
▽ More
In this work, we introduce a novel low-rank structure tailored for solving the inverse scattering problem. The particular low-rank structure is given by the generalized prolate spheroidal wave functions, computed stably and accurately via a Sturm-Liouville problem. We first process the far-field data to obtain a post-processed data set within a disk domain. Subsequently, the post-processed data are projected onto a low-rank space given by the low-rank structure. The unknown is approximately solved in this low-rank space, by dropping higher-order terms. The low-rank structure leads to an explicit stability estimate for unknown functions belonging to standard Sobolev spaces, and a Lipschitz stability estimate for unknowns belonging to a finite dimensional low-rank space. Various numerical experiments are conducted to validate its performance, encompassing assessments of resolution capability, robustness against randomly added noise and modeling errors, and demonstration of increasing stability.
△ Less
Submitted 22 May, 2025; v1 submitted 27 December, 2024;
originally announced December 2024.