-
Split-Merge Revisited: A Scalable Approach to Generalized Eigenvalue Problems
Authors:
Xiaozhi Liu,
Yong Xia
Abstract:
The generalized eigenvalue problem (GEP) serves as a cornerstone in a wide range of applications in numerical linear algebra and scientific computing. However, traditional approaches that aim to maximize the classical Rayleigh quotient often suffer from numerical instability and limited computational efficiency, especially in large-scale settings. In this work, we explore an alternative difference…
▽ More
The generalized eigenvalue problem (GEP) serves as a cornerstone in a wide range of applications in numerical linear algebra and scientific computing. However, traditional approaches that aim to maximize the classical Rayleigh quotient often suffer from numerical instability and limited computational efficiency, especially in large-scale settings. In this work, we explore an alternative difference-based formulation of GEP by minimizing a structured quadratic polynomial objective, which enables the application of efficient first-order optimization methods. We establish global convergence guarantees for these methods without requiring line search, and further introduce a transform-domain perspective that reveals the intrinsic connection and performance gap between classical first-order algorithms and the power method. Based on this insight, we develop an accelerated preconditioned mirror descent algorithm, which allows for flexible preconditioner design and improved convergence behavior. Lastly, we extend the recently proposed Split-Merge algorithm to the general GEP setting, incorporating richer second-order information to further accelerate convergence. Empirical results on both synthetic and real-world datasets demonstrate that our proposed methods achieve significant improvements over existing baselines in terms of both computational efficiency and numerical stability.
△ Less
Submitted 3 July, 2025;
originally announced July 2025.
-
A new sparsity promoting residual transform operator for Lasso regression
Authors:
Yao Xiao,
Anne Gelb,
Aditya Viswanathan
Abstract:
Lasso regression is a widely employed approach within the $\ell_1$ regularization framework used to promote sparsity and recover piecewise smooth signals $f:[a,b) \rightarrow \mathbb{R}$ when the given observations are obtained from noisy, blurred, and/or incomplete data environments. In choosing the regularizing sparsity-promoting operator, it is assumed that the particular type of variability of…
▽ More
Lasso regression is a widely employed approach within the $\ell_1$ regularization framework used to promote sparsity and recover piecewise smooth signals $f:[a,b) \rightarrow \mathbb{R}$ when the given observations are obtained from noisy, blurred, and/or incomplete data environments. In choosing the regularizing sparsity-promoting operator, it is assumed that the particular type of variability of the underlying signal, for example, piecewise constant or piecewise linear behavior across the entire domain, is both known and fixed. Such an assumption is problematic in more general cases, e.g.~when a signal exhibits piecewise oscillatory behavior with varying wavelengths and magnitudes. To address the limitations of assuming a fixed (and typically low order) variability when choosing a sparsity-promoting operator, this investigation proposes a novel residual transform operator that can be used within the Lasso regression formulation. In a nutshell, the idea is that for a general piecewise smooth signal $f$, it is possible to design two operators $\mathcal L_1$ and $\mathcal L_2$ such that $\mathcal L_1{\boldsymbol f} \approx \mathcal L_2{\boldsymbol f}$, where ${\boldsymbol f} \in \mathbb{R}^n$ is a discretized approximation of $f$, but $\mathcal L_1 \not\approx \mathcal L_2$. The corresponding residual transform operator, $\mathcal L = \mathcal L_1- \mathcal L_2$, yields a result that (1) effectively reduces the variability dependent error that occurs when applying either $\mathcal L_1$ or $\mathcal L_2$ to ${\boldsymbol f}$, a property that holds even when $\mathcal L_1{\boldsymbol f} \approx \mathcal L_2{\boldsymbol f}$ is not a good approximation to the true sparse domain vector of ${\boldsymbol f}$, and (2) does not require $\mathcal L_1$ or $\mathcal L_2$ to have prior information regarding the variability of the underlying signal.
△ Less
Submitted 27 June, 2025;
originally announced June 2025.
-
First-order methods for stochastic and finite-sum convex optimization with deterministic constraints
Authors:
Zhaosong Lu,
Yifeng Xiao
Abstract:
In this paper, we study a class of stochastic and finite-sum convex optimization problems with deterministic constraints. Existing methods typically aim to find an $ε$-$expectedly\ feasible\ stochastic\ optimal$ solution, in which the expected constraint violation and expected optimality gap are both within a prescribed tolerance $ε$. However, in many practical applications, constraints must be ne…
▽ More
In this paper, we study a class of stochastic and finite-sum convex optimization problems with deterministic constraints. Existing methods typically aim to find an $ε$-$expectedly\ feasible\ stochastic\ optimal$ solution, in which the expected constraint violation and expected optimality gap are both within a prescribed tolerance $ε$. However, in many practical applications, constraints must be nearly satisfied with certainty, rendering such solutions potentially unsuitable due to the risk of substantial violations. To address this issue, we propose stochastic first-order methods for finding an $ε$-$surely\ feasible\ stochastic\ optimal$ ($ε$-SFSO) solution, where the constraint violation is deterministically bounded by $ε$ and the expected optimality gap is at most $ε$. Our methods apply an accelerated stochastic gradient (ASG) scheme or a modified variance-reduced ASG scheme $only\ once$ to a sequence of quadratic penalty subproblems with appropriately chosen penalty parameters. We establish first-order oracle complexity bounds for the proposed methods in computing an $ε$-SFSO solution. As a byproduct, we also derive first-order oracle complexity results for sample average approximation method in computing an $ε$-SFSO solution of the stochastic optimization problem using our proposed methods to solve the sample average problem.
△ Less
Submitted 25 June, 2025;
originally announced June 2025.
-
Fast convergence of a primal-dual dynamical system with implicit Hessian damping and Tikhonov regularization
Authors:
Hong-lu Li,
Xin He,
Yi-bin Xiao
Abstract:
This paper proposes two primal-dual dynamical systems for solving linear equality constrained convex optimization problems: one with implicit Hessian damping only, and the other further incorporating Tikhonov regularization. We analyze the fast convergence properties of both dynamical systems and show that they achieve the same convergence rates. Moreover, we show that the trajectory generated by…
▽ More
This paper proposes two primal-dual dynamical systems for solving linear equality constrained convex optimization problems: one with implicit Hessian damping only, and the other further incorporating Tikhonov regularization. We analyze the fast convergence properties of both dynamical systems and show that they achieve the same convergence rates. Moreover, we show that the trajectory generated by the dynamical system with Tikhonov regularization converges strongly to the minimum-norm solution of the underlying problem. Finally, numerical experiments are conducted to validate the theoretical findings. Interestingly, the trajectories exhibit smooth behavior even when the objective function is only continuously differentiable.
△ Less
Submitted 26 June, 2025; v1 submitted 24 June, 2025;
originally announced June 2025.
-
Enhanced Trust Region Sequential Convex Optimization for Multi-Drone Thermal Screening Trajectory Planning in Urban Environments
Authors:
Kaiyuan Chen,
Zhengjie Hu,
Shaolin Zhang,
Yuanqing Xia,
Wannian Liang,
Shuo Wang
Abstract:
The rapid detection of abnormal body temperatures in urban populations is essential for managing public health risks, especially during outbreaks of infectious diseases. Multi-drone thermal screening systems offer promising solutions for fast, large-scale, and non-intrusive human temperature monitoring. However, trajectory planning for multiple drones in complex urban environments poses significan…
▽ More
The rapid detection of abnormal body temperatures in urban populations is essential for managing public health risks, especially during outbreaks of infectious diseases. Multi-drone thermal screening systems offer promising solutions for fast, large-scale, and non-intrusive human temperature monitoring. However, trajectory planning for multiple drones in complex urban environments poses significant challenges, including collision avoidance, coverage efficiency, and constrained flight environments. In this study, we propose an enhanced trust region sequential convex optimization (TR-SCO) algorithm for optimal trajectory planning of multiple drones performing thermal screening tasks. Our improved algorithm integrates a refined convex optimization formulation within a trust region framework, effectively balancing trajectory smoothness, obstacle avoidance, altitude constraints, and maximum screening coverage. Simulation results demonstrate that our approach significantly improves trajectory optimality and computational efficiency compared to conventional convex optimization methods. This research provides critical insights and practical contributions toward deploying efficient multi-drone systems for real-time thermal screening in urban areas. For reader who are interested in our research, we release our source code at https://github.com/Cherry0302/Enhanced-TR-SCO.
△ Less
Submitted 19 June, 2025; v1 submitted 6 June, 2025;
originally announced June 2025.
-
Optimal Pricing Strategies for Heterogeneous Customers in Dual-Channel Closed-Loop Supply Chains: A Modeling Approach
Authors:
Yang Xiao,
Hisashi Kurata,
Ting Wang
Abstract:
Dual-channel closed-loop supply chains (DCCLSCs) play a vital role in attaining both sustainability and profitability. This paper introduces a game-theoretic model to analyze optimal pricing strategies for primary and replacement customers within three distinct recycling frameworks: manufacturer-led, retailer-led, and collaborative recycling. The model identifies equilibrium pricing and subsidy de…
▽ More
Dual-channel closed-loop supply chains (DCCLSCs) play a vital role in attaining both sustainability and profitability. This paper introduces a game-theoretic model to analyze optimal pricing strategies for primary and replacement customers within three distinct recycling frameworks: manufacturer-led, retailer-led, and collaborative recycling. The model identifies equilibrium pricing and subsidy decisions for each scenario, considering the primary customer's preference for the direct channel and the specific roles in recycling. The findings indicate that manufacturers tend to set lower prices in direct channels compared to retailers, aiming to stimulate demand and promote trade-ins. Manufacturer-led recycling initiatives result in stable pricing, whereas retailer-led recycling necessitates higher subsidies. Collaborative recycling strategies yield lower prices and an increase in trade-ins. Primary customers' preference for the direct channel significantly impacts pricing strategies, with a stronger preference leading to lower direct-channel prices and higher manufacturer subsidies. This paper contributes to the field by incorporating primary customer channel preferences and diverse recycling frameworks into DCCLSC pricing models. These insights assist manufacturers and retailers in adjusting pricing strategies and trade-in incentives according to primary customer preferences and associated costs, thereby enhancing profitability and recycling efficiency within DCCLSCs.
△ Less
Submitted 27 May, 2025;
originally announced May 2025.
-
A novel and application-oriented inverse nodal problem for Sturm-Liouville operators
Authors:
Yuchao He,
Mengda Wu,
Yonghui Xia,
Meirong Zhang
Abstract:
This paper develops a methodological framework for addressing a novel and application-oriented inverse nodal problem in Sturm-Liouville operators, having significant applications in seismic wave analysis and submarine underwater radar (sonar) detection. By utilizing a given finite set of nodal data, we propose an optimization framework to find the potential $\hat q$ that is most closely approximat…
▽ More
This paper develops a methodological framework for addressing a novel and application-oriented inverse nodal problem in Sturm-Liouville operators, having significant applications in seismic wave analysis and submarine underwater radar (sonar) detection. By utilizing a given finite set of nodal data, we propose an optimization framework to find the potential $\hat q$ that is most closely approximating a predefined target potential $q_0$. The inverse nodal optimization problem is reformulated as a solvability problem for a class of nonlinear Schrödinger equations, enabling systematic investigation of the inverse nodal problem. {As an example, when the constant target potential $q_0$ is considered, we find that the Schrödinger equations are completely integrable and conclude that the potential $\hat q$ is `periodic' in a certain sense. Furthermore, the reconstruction of $\hat q$ is reduced to solving a system of three featured parameters, thereby establishing an explicit quantitative relationship between $\|\hat q\|_{Lp}$ and $T_*$. Of importance, we prove the uniqueness of the potential $\hat q$ when $p>3/2$. These new findings represent a substantial advancement in this field of study. Our methodology also bridges theoretical rigor with practical applicability, addressing scenarios where only partial nodal information is available.
△ Less
Submitted 26 June, 2025; v1 submitted 25 May, 2025;
originally announced May 2025.
-
On the PPW Conjecture For Hopf-symmetric Sets In Non-compact Rank One Symmetric Space
Authors:
Yusen Xia
Abstract:
In this paper, we proved that for a bounded Hopf-symmetric domain $Ω$ in a noncompact rank one symmetric space $M$, the second Dirichlet eigenvalue $λ_2 (Ω) \leq λ_2 (B_1)$ where $B_1$ is a geodesic ball in $M$ such that $λ_1 (Ω) =λ_1 (B_1)$. This generalizes the work of Ashbaugh & Benguria, Benguria & Linde for bounded domains in constant curvature spaces.
In this paper, we proved that for a bounded Hopf-symmetric domain $Ω$ in a noncompact rank one symmetric space $M$, the second Dirichlet eigenvalue $λ_2 (Ω) \leq λ_2 (B_1)$ where $B_1$ is a geodesic ball in $M$ such that $λ_1 (Ω) =λ_1 (B_1)$. This generalizes the work of Ashbaugh & Benguria, Benguria & Linde for bounded domains in constant curvature spaces.
△ Less
Submitted 9 June, 2025; v1 submitted 23 May, 2025;
originally announced May 2025.
-
Bilevel Transmission Expansion Planning with Joint Chance-Constrained Dispatch
Authors:
Yuxin Xia,
Yihong Zhou,
Iacopo Savelli,
Thomas Morstyn
Abstract:
In transmission expansion planning (TEP), network planners make long-term investment decisions while anticipating market clearing outcomes that are increasingly affected by renewable generation uncertainty. Additionally, market participants' sensitivity to network charges and the requirement for cost recovery by the network planner introduce further complexity. Since the day-ahead market clears be…
▽ More
In transmission expansion planning (TEP), network planners make long-term investment decisions while anticipating market clearing outcomes that are increasingly affected by renewable generation uncertainty. Additionally, market participants' sensitivity to network charges and the requirement for cost recovery by the network planner introduce further complexity. Since the day-ahead market clears before uncertainty realizes, explicitly modelling these uncertainties at the lower-level market clearing becomes important in bilevel TEP problems. In this paper, we introduce a novel bilevel TEP framework with lower-level joint chance-constrained market clearing that manages line flow constraints under wind uncertainty and accounts for the effect of network tariffs on participants' actual marginal costs and utility. To solve this complex problem, we propose a Strengthened Linear Approximation (SLA) technique for handling Wasserstein distributionally robust joint chance constraints with right-hand-side uncertainties (RHS-WDRJCC). The proposed method offers more efficient approximations without additional conservativeness and avoids the numerical issues encountered in existing approaches by introducing valid inequalities. The case study demonstrates that the proposed model achieves the desired out-of-sample constraint satisfaction probability. Moreover, the numerical results highlight the significant computational advantage of SLA, achieving up to a 26x speedup compared to existing methods such as worst-case conditional value-at-risk, while maintaining high solution quality.
△ Less
Submitted 16 May, 2025;
originally announced May 2025.
-
SPP-SBL: Space-Power Prior Sparse Bayesian Learning for Block Sparse Recovery
Authors:
Yanhao Zhang,
Zhihan Zhu,
Yong Xia
Abstract:
The recovery of block-sparse signals with unknown structural patterns remains a fundamental challenge in structured sparse signal reconstruction. By proposing a variance transformation framework, this paper unifies existing pattern-based block sparse Bayesian learning methods, and introduces a novel space power prior based on undirected graph models to adaptively capture the unknown patterns of bl…
▽ More
The recovery of block-sparse signals with unknown structural patterns remains a fundamental challenge in structured sparse signal reconstruction. By proposing a variance transformation framework, this paper unifies existing pattern-based block sparse Bayesian learning methods, and introduces a novel space power prior based on undirected graph models to adaptively capture the unknown patterns of block-sparse signals. By combining the EM algorithm with high-order equation root-solving, we develop a new structured sparse Bayesian learning method, SPP-SBL, which effectively addresses the open problem of space coupling parameter estimation in pattern-based methods. We further demonstrate that learning the relative values of space coupling parameters is key to capturing unknown block-sparse patterns and improving recovery accuracy. Experiments validate that SPP-SBL successfully recovers various challenging structured sparse signals (e.g., chain-structured signals and multi-pattern sparse signals) and real-world multi-modal structured sparse signals (images, audio), showing significant advantages in recovery accuracy across multiple metrics.
△ Less
Submitted 13 May, 2025;
originally announced May 2025.
-
Extreme Points of Base Polytope of Submodular Set Functions and Limit for Quotient Convergent Graph Sequence
Authors:
Yaobin Chen,
Zhicheng Liu,
Yihang Xiao,
Junchi Zhang
Abstract:
Submodular set functions are of great importance in mathematics and theoretical computer science, serving as fundamental tools in optimization, combinatorics, and economics due to their natural properties and wide-ranging applications. In 2023, Lovász systematically extended the theory of submodular set functions from finite sets to general set algebras and proposed several open problems about the…
▽ More
Submodular set functions are of great importance in mathematics and theoretical computer science, serving as fundamental tools in optimization, combinatorics, and economics due to their natural properties and wide-ranging applications. In 2023, Lovász systematically extended the theory of submodular set functions from finite sets to general set algebras and proposed several open problems about the behavior of submodular functions in infinite settings, including the characterization of extreme points of the base polytope of submodular set functions.
We characterize conditions under which the extreme points of the base polytope of a submodular function are restricting measures with respect to its majorizing measure. Applying this result, we characterize the core of increasing subadditive non-atomic games and provide a positive answer to a question of Kristóf Bérzi, Márton Borbényi, László Lovász and László Márton Tóth regarding the rank function for graphing's cycle matroid.
Furthermore, building on the limit theory for set functions, we prove that the limit of convergent sequence of bounded-degree graphs' cycle matroids can be represented as the cycle matroid of a graphing, analogous to the completeness result for local-global convergence.
△ Less
Submitted 23 June, 2025; v1 submitted 20 April, 2025;
originally announced April 2025.
-
Finite pattern problems related to Engel expansion
Authors:
Chun-Yun Cao,
Yang Xiao
Abstract:
Let $\mathcal{F}$ be a countable collection of functions $f$ defined on the integers with integer values, such that for every $f\in \mathcal{F}$, $f(n)\to +\infty$ as $n\to +\infty$. This paper primarily investigates the Hausdorff dimension of the set of points whose digit sequences of the Engel expansion are strictly increasing and contain any finite pattern of $\mathcal{F}$, demonstrating applic…
▽ More
Let $\mathcal{F}$ be a countable collection of functions $f$ defined on the integers with integer values, such that for every $f\in \mathcal{F}$, $f(n)\to +\infty$ as $n\to +\infty$. This paper primarily investigates the Hausdorff dimension of the set of points whose digit sequences of the Engel expansion are strictly increasing and contain any finite pattern of $\mathcal{F}$, demonstrating applications with representative examples.
△ Less
Submitted 10 April, 2025;
originally announced April 2025.
-
Kernel compensation method for Maxwell eigenproblem with mimetic finite difference discretization
Authors:
Chenhao Jin,
Yinhua Xia,
Yan Xu
Abstract:
We present a kernel compensation method for Maxwell eigenproblem for photonic crystals to avoid the infinite-dimensional kernels that cause many difficulties in the calculation of energy gaps. The quasi-periodic problem is first transformed into a periodic one on the cube by the Floquet-Bloch theory. Then the compensation operator is introduced in Maxwell's equation with the shifted curl operator.…
▽ More
We present a kernel compensation method for Maxwell eigenproblem for photonic crystals to avoid the infinite-dimensional kernels that cause many difficulties in the calculation of energy gaps. The quasi-periodic problem is first transformed into a periodic one on the cube by the Floquet-Bloch theory. Then the compensation operator is introduced in Maxwell's equation with the shifted curl operator. The discrete problem depends on the compatible discretization of the de Rham complex, which is implemented by the mimetic finite difference method in this paper. We prove that the compensation term exactly fills up the kernel of the original problem and avoids spurious eigenvalues. Also, we propose an efficient preconditioner and its FFT and multigrid solvers, which allow parallel computing. Numerical experiments for different three-dimensional lattices are performed to validate the accuracy and effectiveness of the method.
△ Less
Submitted 25 March, 2025;
originally announced March 2025.
-
Minimax Rate-Optimal Inference for Individualized Quantile Treatment Effects in High-dimensional Models
Authors:
Jiachen Sun,
Yin Xia
Abstract:
The quantification of treatment effects plays an important role in a wide range of applications, including policy making and bio-pharmaceutical research. In this article, we study the quantile treatment effect (QTE) while addressing two specific types of heterogeneities: (a) personalized heterogeneity, which captures the varying treatment effects for different individuals, and (b) quantile heterog…
▽ More
The quantification of treatment effects plays an important role in a wide range of applications, including policy making and bio-pharmaceutical research. In this article, we study the quantile treatment effect (QTE) while addressing two specific types of heterogeneities: (a) personalized heterogeneity, which captures the varying treatment effects for different individuals, and (b) quantile heterogeneity, which accounts for how the impact of covariates varies across different quantile levels. A well-designed debiased estimator for the individualized quantile treatment effect (IQTE) is proposed to capture such heterogeneities effectively. We show that this estimator converges weakly to a Gaussian process as a function of the quantile levels and propose valid statistical inference methods, including the construction of confidence intervals and the development of hypothesis testing decision rules. In addition, the minimax optimality frameworks for these inference procedures are established. Specifically, we derive the minimax optimal rates for the expected length of confidence intervals and the magnitude of the detection boundary for hypothesis testing procedures, illustrating the superiority of the proposed estimator. The effectiveness of our methods is demonstrated through extensive simulations and an analysis of the National Health and Nutrition Examination Survey (NHANES) datasets.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
Rational maps whose Julia sets are generalized Sierpiński gaskets
Authors:
Xiaole He,
Yingqing Xiao,
Fei Yang
Abstract:
It has been shown that the Sierpiński gasket-like sets can appear as the Julia sets of some geometrically finite rational maps. In this paper we prove that such type of Julia sets can also appear in the rational maps containing Siegel disks, Cremer points or which are infinitely renormalizable. Based on this, we prove the existence of gasket Julia sets with positive area. Moreover, we present a cr…
▽ More
It has been shown that the Sierpiński gasket-like sets can appear as the Julia sets of some geometrically finite rational maps. In this paper we prove that such type of Julia sets can also appear in the rational maps containing Siegel disks, Cremer points or which are infinitely renormalizable. Based on this, we prove the existence of gasket Julia sets with positive area. Moreover, we present a criterion which guarantees the existence of gasket Julia sets in some rational maps having exactly one fixed attracting or parabolic basin.
△ Less
Submitted 23 March, 2025;
originally announced March 2025.
-
Finite Samples for Shallow Neural Networks
Authors:
Yu Xia,
Zhiqiang Xu
Abstract:
This paper investigates the ability of finite samples to identify two-layer irreducible shallow networks with various nonlinear activation functions, including rectified linear units (ReLU) and analytic functions such as the logistic sigmoid and hyperbolic tangent. An ``irreducible" network is one whose function cannot be represented by another network with fewer neurons. For ReLU activation funct…
▽ More
This paper investigates the ability of finite samples to identify two-layer irreducible shallow networks with various nonlinear activation functions, including rectified linear units (ReLU) and analytic functions such as the logistic sigmoid and hyperbolic tangent. An ``irreducible" network is one whose function cannot be represented by another network with fewer neurons. For ReLU activation functions, we first establish necessary and sufficient conditions for determining the irreducibility of a network. Subsequently, we prove a negative result: finite samples are insufficient for definitive identification of any irreducible ReLU shallow network. Nevertheless, we demonstrate that for a given irreducible network, one can construct a finite set of sampling points that can distinguish it from other network with the same neuron count. Conversely, for logistic sigmoid and hyperbolic tangent activation functions, we provide a positive result. We construct finite samples that enable the recovery of two-layer irreducible shallow analytic networks. To the best of our knowledge, this is the first study to investigate the exact identification of two-layer irreducible networks using finite sample function values. Our findings provide insights into the comparative performance of networks with different activation functions under limited sampling conditions.
△ Less
Submitted 16 March, 2025;
originally announced March 2025.
-
Stable recovery guarantees for blind deconvolution under random mask assumption
Authors:
Song Li,
Yu Xia
Abstract:
This study addresses the blind deconvolution problem with modulated inputs, focusing on a measurement model where an unknown blurring kernel $\boldsymbol{h}$ is convolved with multiple random modulations $\{\boldsymbol{d}_l\}_{l=1}^{L}$(coded masks) of a signal $\boldsymbol{x}$, subject to $\ell_2$-bounded noise. We introduce a more generalized framework for coded masks, enhancing the versatility…
▽ More
This study addresses the blind deconvolution problem with modulated inputs, focusing on a measurement model where an unknown blurring kernel $\boldsymbol{h}$ is convolved with multiple random modulations $\{\boldsymbol{d}_l\}_{l=1}^{L}$(coded masks) of a signal $\boldsymbol{x}$, subject to $\ell_2$-bounded noise. We introduce a more generalized framework for coded masks, enhancing the versatility of our approach. Our work begins within a constrained least squares framework, where we establish a robust recovery bound for both $\boldsymbol{h}$ and $\boldsymbol{x}$, demonstrating its near-optimality up to a logarithmic factor. Additionally, we present a new recovery scheme that leverages sparsity constraints on $\boldsymbol{x}$. This approach significantly reduces the sampling complexity to the order of $L=O(\log n)$ when the non-zero elements of $\boldsymbol{x}$ are sufficiently separated. Furthermore, we demonstrate that incorporating sparsity constraints yields a refined error bound compared to the traditional constrained least squares model. The proposed method results in more robust and precise signal recovery, as evidenced by both theoretical analysis and numerical simulations. These findings contribute to advancing the field of blind deconvolution and offer potential improvements in various applications requiring signal reconstruction from modulated inputs.
△ Less
Submitted 27 February, 2025;
originally announced March 2025.
-
A new fuzzy fractional differential variational inequality with Mittag-Leffler kernel of order $q \in (1,2]$
Authors:
Zeng-bao Wu,
Tao Chen,
Quan-guo Zhang,
Yi-bin Xiao
Abstract:
This paper considers a new fuzzy fractional differential variational inequality with Mittag-Leffler kernel of order $q \in (1,2]$ comprising a fuzzy fractional differential inclusion with Mittag-Leffler kernel of order $q \in (1,2]$ and a variational inequality in Euclidean spaces. The existence of solutions for such a novel system is obtained under some mild conditions.
This paper considers a new fuzzy fractional differential variational inequality with Mittag-Leffler kernel of order $q \in (1,2]$ comprising a fuzzy fractional differential inclusion with Mittag-Leffler kernel of order $q \in (1,2]$ and a variational inequality in Euclidean spaces. The existence of solutions for such a novel system is obtained under some mild conditions.
△ Less
Submitted 4 March, 2025;
originally announced March 2025.
-
A new fractional differential quasi-variational inequality with Mittag-Leffler kernel in Hilbert spaces and its applications
Authors:
Zeng-bao Wu,
Quan-guo Zhang,
Tao Chen,
Yue Zeng,
Nan-jing Huang,
Yi-bin Xiao
Abstract:
This paper considers a new fractional differential quasi-variational inequality with Mittag-Leffler kernel comprising a fractional differential equation with Mittag-Leffler kernel and a quasi-variational inequality in Hilbert spaces. Qualitative properties of the solution for the parameterized quasi-variational inequality are investigated, which improve some known results in the literature. Moreov…
▽ More
This paper considers a new fractional differential quasi-variational inequality with Mittag-Leffler kernel comprising a fractional differential equation with Mittag-Leffler kernel and a quasi-variational inequality in Hilbert spaces. Qualitative properties of the solution for the parameterized quasi-variational inequality are investigated, which improve some known results in the literature. Moreover, the unique existence of the solution and Hyers-Ulam stability are obtained for such a novel system under mild conditions. Finally, the obtained abstract results are applied to analyze the unique solvability and stability for a multi-agent optimization problem and a price control problem.
△ Less
Submitted 13 May, 2025; v1 submitted 4 March, 2025;
originally announced March 2025.
-
Scalable Multi-Level Optimization for Sequentially Cleared Energy Markets with a Case Study on Gas and Carbon Aware Unit Commitment
Authors:
Yuxin Xia,
Iacopo Savelli,
Thomas Morstyn
Abstract:
This paper examines Mixed-Integer Multi-Level problems with Sequential Followers (MIMLSF), a specialized optimization model aimed at enhancing upper-level decision-making by incorporating anticipated outcomes from lower-level sequential market-clearing processes. We introduce a novel approach that combines lexicographic optimization with a weighted-sum method to asymptotically approximate the MIML…
▽ More
This paper examines Mixed-Integer Multi-Level problems with Sequential Followers (MIMLSF), a specialized optimization model aimed at enhancing upper-level decision-making by incorporating anticipated outcomes from lower-level sequential market-clearing processes. We introduce a novel approach that combines lexicographic optimization with a weighted-sum method to asymptotically approximate the MIMLSF as a single-level problem, capable of managing multi-level problems exceeding three levels. To enhance computational efficiency and scalability, we propose a dedicated Benders decomposition method with multi-level subproblem separability. To demonstrate the practical application of our MIMLSF solution technique, we tackle a unit commitment problem (UC) within an integrated electricity, gas, and carbon market clearing framework in the Northeastern United States, enabling the incorporation of anticipated costs and revenues from gas and carbon markets into UC decisions. This ensures that only profitable gas-fired power plants (GFPPs) are committed, allowing system operators to make informed decisions that prevent GFPP economic losses and reduce total operational costs under stressed electricity and gas systems. The case study not only demonstrates the applicability of the MIMLSF model but also highlights the computational benefits of the dedicated Benders decomposition technique, achieving average reductions of 32.23% in computing time and 94.23% in optimality gaps compared to state-of-the-art methods.
△ Less
Submitted 27 February, 2025; v1 submitted 19 February, 2025;
originally announced February 2025.
-
Best Subset Selection: Optimal Pursuit for Feature Selection and Elimination
Authors:
Zhihan Zhu,
Yanhao Zhang,
Yong Xia
Abstract:
This paper introduces two novel criteria: one for feature selection and another for feature elimination in the context of best subset selection, which is a benchmark problem in statistics and machine learning. From the perspective of optimization, we revisit the classical selection and elimination criteria in traditional best subset selection algorithms, revealing that these classical criteria cap…
▽ More
This paper introduces two novel criteria: one for feature selection and another for feature elimination in the context of best subset selection, which is a benchmark problem in statistics and machine learning. From the perspective of optimization, we revisit the classical selection and elimination criteria in traditional best subset selection algorithms, revealing that these classical criteria capture only partial variations of the objective function after the entry or exit of features. By formulating and solving optimization subproblems for feature entry and exit exactly, new selection and elimination criteria are proposed, proved as the optimal decisions for the current entry-and-exit process compared to classical criteria. Replacing the classical selection and elimination criteria with the proposed ones generates a series of enhanced best subset selection algorithms. These generated algorithms not only preserve the theoretical properties of the original algorithms but also achieve significant meta-gains without increasing computational cost across various scenarios and evaluation metrics on multiple tasks such as compressed sensing and sparse regression.
△ Less
Submitted 30 May, 2025; v1 submitted 28 January, 2025;
originally announced January 2025.
-
Split-Merge: A Difference-based Approach for Dominant Eigenvalue Problem
Authors:
Xiaozhi Liu,
Yong Xia
Abstract:
The computation of the dominant eigenvector of symmetric positive semidefinite matrices is a cornerstone operation in numerous optimization-driven applications. Traditional methods, typically based on the \textit{Quotient} formulation, often suffer from challenges related to computational efficiency and reliance on prior spectral knowledge. In this work, we leverage the alternative \textit{Differe…
▽ More
The computation of the dominant eigenvector of symmetric positive semidefinite matrices is a cornerstone operation in numerous optimization-driven applications. Traditional methods, typically based on the \textit{Quotient} formulation, often suffer from challenges related to computational efficiency and reliance on prior spectral knowledge. In this work, we leverage the alternative \textit{Difference} formulation to reinterpret the classical power method as a first-order optimization algorithm. This perspective allows for a novel convergence analysis and facilitates the development of accelerated variants with larger step-sizes, achieving faster convergence without additional computational cost. Building on this insight, we introduce a generalized family of Difference-based methods, with the power method as a special case. Within this family, we propose Split-Merge, an algorithm that attains accelerated convergence without requiring spectral knowledge and operates solely via matrix-vector products. Extensive experiments on both synthetic and real-world datasets demonstrate that Split-Merge consistently outperforms state-of-the-art methods in both efficiency and scalability. In particular, it achieves more than a $\boldsymbol{10\times}$ speedup over the classical power method, underscoring its practical effectiveness for large-scale problems.
△ Less
Submitted 25 June, 2025; v1 submitted 25 January, 2025;
originally announced January 2025.
-
Hitting probabilities, thermal capacity, and Hausdorff dimension results for the Brownian sheet
Authors:
Cheuk Yin Lee,
Yimin Xiao
Abstract:
Let $W= \{W(t): t \in \mathbb{R}_+^N \}$ be an $(N, d)$-Brownian sheet and let $E \subset (0, \infty)^N$ and $F \subset \mathbb{R}^d$ be compact sets. We prove a necessary and sufficient condition for $W(E)$ to intersect $F$ with positive probability and determine the essential supremum of the Hausdorff dimension of the intersection set $W(E)\cap F$ in terms of the thermal capacity of…
▽ More
Let $W= \{W(t): t \in \mathbb{R}_+^N \}$ be an $(N, d)$-Brownian sheet and let $E \subset (0, \infty)^N$ and $F \subset \mathbb{R}^d$ be compact sets. We prove a necessary and sufficient condition for $W(E)$ to intersect $F$ with positive probability and determine the essential supremum of the Hausdorff dimension of the intersection set $W(E)\cap F$ in terms of the thermal capacity of $E \times F$. This extends the previous results of Khoshnevisan and Xiao (2015) for the Brownian motion and Khoshnevisan and Shi (1999) for the Brownian sheet in the special case when $E \subset (0, \infty)^N$ is an interval.
△ Less
Submitted 24 January, 2025;
originally announced January 2025.
-
Matched pairs of actions on the Kac-Paljutkin algebra $H_8$
Authors:
Yongyue Xiao,
Yunnan Li
Abstract:
The notion of matched pair of actions on a Hopf algebra generalizes the braided group construction of Lu, Yan and Zhu, and efficiently provides Yang-Baxter operators. In this paper, we classify matched pairs of actions on the Kac-Paljutkin Hopf algebra $H_8$. Through calculations, we obtain 6 matched pairs of actions on $H_8$. Based on such a classification result, we find that four of them can be…
▽ More
The notion of matched pair of actions on a Hopf algebra generalizes the braided group construction of Lu, Yan and Zhu, and efficiently provides Yang-Baxter operators. In this paper, we classify matched pairs of actions on the Kac-Paljutkin Hopf algebra $H_8$. Through calculations, we obtain 6 matched pairs of actions on $H_8$. Based on such a classification result, we find that four of them can be derived from the coquasitriangular structures of $H_8$, while the other two can not. Furthermore, we discover that the Yang-Baxter operators associated to exactly these two distinguished matched pairs of actions are involutive.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
An Efficient Dual ADMM for Huber Regression with Fused Lasso Penalty
Authors:
Mengjiao Shi,
Yunhai Xiao
Abstract:
The ordinary least squares estimate in linear regression is sensitive to the influence of errors with large variance, which reduces its robustness, especially when dealing with heavy-tailed errors or outliers frequently encountered in real-world scenarios. To address this issue and accommodate the sparsity of coefficients along with their sequential disparities, we combine the adaptive robust Hube…
▽ More
The ordinary least squares estimate in linear regression is sensitive to the influence of errors with large variance, which reduces its robustness, especially when dealing with heavy-tailed errors or outliers frequently encountered in real-world scenarios. To address this issue and accommodate the sparsity of coefficients along with their sequential disparities, we combine the adaptive robust Huber loss function with a fused lasso penalty. This combination yields a robust estimator capable of simultaneously achieving estimation and variable selection. Furthermore, we utilize an efficient alternating direction method of multipliers to solve this regression model from a dual perspective. The effectiveness and efficiency of our proposed approach is demonstrated through numerical experiments carried out on both simulated and real datasets.
△ Less
Submitted 30 April, 2025; v1 submitted 9 January, 2025;
originally announced January 2025.
-
Temporal regularity for the stochastic heat equation with rough dependence in space
Authors:
Bin Qian,
Min Wang,
Ran Wang,
Yimin Xiao
Abstract:
Consider the nonlinear stochastic heat equation $$
\frac{\partial u (t,x)}{\partial t}=\frac{\partial^2 u (t,x)}{\partial x^2}+ σ(u (t,x))\dot{W}(t,x),\quad t> 0,\,
x\in \mathbb{R}, $$ where $\dot W$ is a Gaussian noise which is white in time and has the covariance of a fractional Brownian motion with Hurst parameter $H\in(\frac 14,\frac 12)$ in the space variable. When $σ(0)=0$, the well-pose…
▽ More
Consider the nonlinear stochastic heat equation $$
\frac{\partial u (t,x)}{\partial t}=\frac{\partial^2 u (t,x)}{\partial x^2}+ σ(u (t,x))\dot{W}(t,x),\quad t> 0,\,
x\in \mathbb{R}, $$ where $\dot W$ is a Gaussian noise which is white in time and has the covariance of a fractional Brownian motion with Hurst parameter $H\in(\frac 14,\frac 12)$ in the space variable. When $σ(0)=0$, the well-posedness of the solution and its Hölder continuity have been proved by Hu et al. \cite{HHLNT2017}. In this paper, we study the asymptotic properties of the temporal gradient $u(t+\varepsilon, x)-u(t, x)$ at any fixed $t \ge 0$ and $x\in \mathbb R$, as $\varepsilon\downarrow 0$. As applications, we deduce Khintchine's law of iterated logarithm, Chung's law of iterated logarithm, and a result on the $q$-variations of the temporal process $\{u(t, x)\}_{t \ge 0}$, where $x\in \mathbb R$ is fixed.
△ Less
Submitted 7 January, 2025;
originally announced January 2025.
-
Grid homology for singular links in lens space and a resolution cube
Authors:
Yonghan Xiao
Abstract:
In this paper, we define grid homologies for singular links in lens spaces and use them to construct a resolution cube for knot Floer homology of regular links in lens spaces. The results will first be proved over $\mathbb{Z}/2\mathbb{Z}$ and then over $\mathbb{Z}$ with the help of sign assignments. We will also identify the signed grid homology and classical knot Floer homology over $\mathbb{Z}$…
▽ More
In this paper, we define grid homologies for singular links in lens spaces and use them to construct a resolution cube for knot Floer homology of regular links in lens spaces. The results will first be proved over $\mathbb{Z}/2\mathbb{Z}$ and then over $\mathbb{Z}$ with the help of sign assignments. We will also identify the signed grid homology and classical knot Floer homology over $\mathbb{Z}$ for regular links in lens spaces, illustrating the fact that our resolution cube is genuinely one for knot Floer homology. The main advancement in the paper is that we give a complete description of singular knot theory in lens spaces which was only defined in $S^3$ previously and we construct a signed combinatorial resolution cube for knot Floer homology in lens spaces which may be powerful in relating $HFK^\circ$ to other link homology theories.
△ Less
Submitted 27 March, 2025; v1 submitted 7 January, 2025;
originally announced January 2025.
-
Hilbert's tenth problem for systems of diagonal quadratic forms, and Büchi's problem
Authors:
Stanley Yao Xiao
Abstract:
In this paper we complete Büchi's proof that there is no decision algorithm for the solubility in integers of arbitrary systems of diagonal quadratic form equations, by proving the assertion that whenever $x_1^2, \cdots, x_5^2$ are five squares such that the second differences satisfy
\[x_{k+2}^2 - 2 x_{k+1}^2 + x_k^2 = 2\]
for $k = 1,2,3$, then they must be consecutive. This answers a questio…
▽ More
In this paper we complete Büchi's proof that there is no decision algorithm for the solubility in integers of arbitrary systems of diagonal quadratic form equations, by proving the assertion that whenever $x_1^2, \cdots, x_5^2$ are five squares such that the second differences satisfy
\[x_{k+2}^2 - 2 x_{k+1}^2 + x_k^2 = 2\]
for $k = 1,2,3$, then they must be consecutive. This answers a question of J.~Richard~Büchi.
△ Less
Submitted 7 June, 2025; v1 submitted 21 December, 2024;
originally announced December 2024.
-
Strengthened and Faster Linear Approximation to Joint Chance Constraints with Wasserstein Ambiguity
Authors:
Yihong Zhou,
Yuxin Xia,
Hanbin Yang,
Thomas Morstyn
Abstract:
Many real-world decision-making problems in energy systems, transportation, and finance have uncertain parameters in their constraints. Wasserstein distributionally robust joint chance constraints (WDRJCC) offer a promising solution by explicitly guaranteeing the probability of the simultaneous satisfaction of multiple constraints. WDRJCC are computationally demanding, and although manageable for…
▽ More
Many real-world decision-making problems in energy systems, transportation, and finance have uncertain parameters in their constraints. Wasserstein distributionally robust joint chance constraints (WDRJCC) offer a promising solution by explicitly guaranteeing the probability of the simultaneous satisfaction of multiple constraints. WDRJCC are computationally demanding, and although manageable for small problems, practical applications often demand more tractable approaches -- especially for large-scale and complex problems, such as power system unit commitment problems and multilevel problems with chance-constrained lower levels. To address this, this paper proposes a novel inner-approximation for a specific type of WDRJCC, namely WDRJCC with right-hand-side uncertainties (RHS-WDRJCC). We propose a Strengthened and Faster Linear Approximation (SFLA) by strengthening an existing convex inner-approximation that is equivalent to the worst-case conditional value-at-risk (W-CVaR) method under specific hyperparameters. This strengthening process reduces the number of constraints and tightens the feasible region for ancillary variables, leading to significant computational speedup. Despite the tightening, we prove that the proposed SFLA does not introduce additional conservativeness and can even lead to less conservativeness. The significance and superiority of the proposed SFLA are validated in two important real-world problems. In a power system unit commitment problem, the proposed SFLA achieves up to 10x and on average 3.8x computational speedup compared to the strengthened and exact mixed-integer reformulation in finding comparable high-quality feasible solutions. In a bilevel strategic bidding problem where the exact reformulation is not applicable due to non-convexity, we show that the proposed SFLA can lead to 90x speedup compared to existing convex approximation methods such as W-CVaR.
△ Less
Submitted 17 December, 2024;
originally announced December 2024.
-
Arborescences of Random Covering Graphs
Authors:
Muchen Ju,
Junjie Ni,
Kaixin Wang,
Yihan Xiao
Abstract:
A rooted arborescence of a directed graph is a spanning tree directed towards a particular vertex. A recent work of Chepuri et al. showed that the arborescences of a covering graph of a directed graph G are closely related to the arborescences of G. In this paper, we study the weighted sum of arborescences of a random covering graph and give a formula for the expected value, resolving a conjecture…
▽ More
A rooted arborescence of a directed graph is a spanning tree directed towards a particular vertex. A recent work of Chepuri et al. showed that the arborescences of a covering graph of a directed graph G are closely related to the arborescences of G. In this paper, we study the weighted sum of arborescences of a random covering graph and give a formula for the expected value, resolving a conjecture of Chepuri et al.
△ Less
Submitted 5 June, 2025; v1 submitted 17 December, 2024;
originally announced December 2024.
-
Many-insurer robust games of reinsurance and investment under model uncertainty in incomplete markets
Authors:
Guohui Guan,
Zongxia Liang,
Yi Xia
Abstract:
This paper studies the robust reinsurance and investment games for competitive insurers. Model uncertainty is characterized by a class of equivalent probability measures. Each insurer is concerned with relative performance under the worst-case scenario. Insurers' surplus processes are approximated by drifted Brownian motion with common and idiosyncratic insurance risks. The insurers can purchase p…
▽ More
This paper studies the robust reinsurance and investment games for competitive insurers. Model uncertainty is characterized by a class of equivalent probability measures. Each insurer is concerned with relative performance under the worst-case scenario. Insurers' surplus processes are approximated by drifted Brownian motion with common and idiosyncratic insurance risks. The insurers can purchase proportional reinsurance to divide the insurance risk with the reinsurance premium calculated by the variance principle. We consider an incomplete market driven by the 4/2 stochastic volatility mode. This paper formulates the robust mean-field game for a non-linear system originating from the variance principle and the 4/2 model. For the case of an exponential utility function, we derive closed-form solutions for the $n$-insurer game and the corresponding mean-field game. We show that relative concerns lead to new hedging terms in the investment and reinsurance strategies. Model uncertainty can significantly change the insurers' hedging demands. The hedging demands in the investment-reinsurance strategies exhibit highly non-linear dependence with the insurers' competitive coefficients, risk aversion and ambiguity aversion coefficients. Finally, numerical results demonstrate the herd effect of competition.
△ Less
Submitted 12 December, 2024;
originally announced December 2024.
-
Opinion Dynamic Under Malicious Agent Influence in Multi-Agent Systems: From the Perspective of Opinion Evolution Cost
Authors:
Yuhan Suo,
Runqi Chai,
Senchun Chai,
Ishrak MD Farhan,
Xudong Zhao,
Yuanqing Xia
Abstract:
In human social systems, debates are often seen as a means to resolve differences of opinion. However, in reality, debates frequently incur significant communication costs, especially when dealing with stubborn opponents. Inspired by this phenomenon, this paper examines the impact of malicious agents on the evolution of normal agents' opinions from the perspective of opinion evolution cost, and pr…
▽ More
In human social systems, debates are often seen as a means to resolve differences of opinion. However, in reality, debates frequently incur significant communication costs, especially when dealing with stubborn opponents. Inspired by this phenomenon, this paper examines the impact of malicious agents on the evolution of normal agents' opinions from the perspective of opinion evolution cost, and proposes corresponding solutions for the scenario in which malicious agents hold different opinions in multi-agent systems(MASs). First, this paper analyzes the negative impact of malicious agents on the opinion evolution process, reveals the additional evolution cost it brings, and provides a theoretical basis for the subsequent solutions. Secondly, based on the characteristics of opinion evolution, the malicious agent isolation algorithm based on opinion evolution direction vector is proposed, which does not strongly restrict the proportion of malicious agents. Additionally, an evolution rate adjustment mechanism is introduced, allowing the system to flexibly regulate the evolution process in complex situations, effectively achieving the trade-off between opinion evolution rate and cost. Extensive numerical simulations demonstrate that the algorithm can effectively eliminate the negative influence of malicious agents and achieve a balance between opinion evolution costs and convergence speed.
△ Less
Submitted 13 December, 2024; v1 submitted 2 December, 2024;
originally announced December 2024.
-
The Tammes Problem in $\mathbb{R}^{n}$ and Linear Programming Method
Authors:
Yanlu Lian,
Qun Mo,
Yu Xia
Abstract:
The Tammes problem delves into the optimal arrangement of $N$ points on the surface of the $n$-dimensional unit sphere (denoted as $\mathbb{S}^{n-1}$), aiming to maximize the minimum distance between any two points. In this paper, we articulate the sufficient conditions requisite for attaining the optimal value of the Tammes problem for arbitrary $n, N \in \mathbb{N}^{+}$, employing the linear pro…
▽ More
The Tammes problem delves into the optimal arrangement of $N$ points on the surface of the $n$-dimensional unit sphere (denoted as $\mathbb{S}^{n-1}$), aiming to maximize the minimum distance between any two points. In this paper, we articulate the sufficient conditions requisite for attaining the optimal value of the Tammes problem for arbitrary $n, N \in \mathbb{N}^{+}$, employing the linear programming framework pioneered by Delsarte et al. Furthermore, we showcase several illustrative examples across various dimensions $n$ and select values of $N$ that yield optimal configurations. The findings illuminate the intricate structure of optimal point distributions on spheres, thereby enriching the existing body of research in this domain.
△ Less
Submitted 24 November, 2024;
originally announced November 2024.
-
Strassen's local law of the iterated logarithm for the generalized fractional Brownian motion
Authors:
Ran Wang,
Yimin Xiao
Abstract:
Let $X:=\{X(t)\}_{t\ge0}$ be a generalized fractional Brownian motion: $$
\{X(t)\}_{t\ge0}\overset{d}{=}\left\{ \int_{\mathbb R} \left((t-u)_+^α-(-u)_+^α \right) |u|^{-γ/2} B(du) \right\}_{t\ge0}, $$ with parameters $γ\in (0, 1)$ and $α\in \left(-1/2+ γ/2, \, 1/2+ γ/2 \right)$. This is a self-similar Gaussian process introduced by Pang and Taqqu (2019) as the scaling limit of power-law shot nois…
▽ More
Let $X:=\{X(t)\}_{t\ge0}$ be a generalized fractional Brownian motion: $$
\{X(t)\}_{t\ge0}\overset{d}{=}\left\{ \int_{\mathbb R} \left((t-u)_+^α-(-u)_+^α \right) |u|^{-γ/2} B(du) \right\}_{t\ge0}, $$ with parameters $γ\in (0, 1)$ and $α\in \left(-1/2+ γ/2, \, 1/2+ γ/2 \right)$. This is a self-similar Gaussian process introduced by Pang and Taqqu (2019) as the scaling limit of power-law shot noise processes. The parameters $α$ and $γ$ determine the probabilistic and statistical properties of $X$. In particular, the parameter $γ$ introduces non-stationarity of the increments. In this paper, we prove Strassen's local law of the iterated logarithm of $X$ at any fixed point $t_0 \in (0, \infty)$, which describes explicitly the roles played by the parameters $α, γ$ and the location $t_0$. Our result is different from the previous Strassen's LIL for $X$ at infinity proved by Ichiba, Pang and Taqqu (2022).
△ Less
Submitted 23 November, 2024;
originally announced November 2024.
-
Iterative Reweighted Framework Based Algorithms for Sparse Linear Regression with Generalized Elastic Net Penalty
Authors:
Yanyun Ding,
Zhenghua Yao,
Peili Li,
Yunhai Xiao
Abstract:
The elastic net penalty is frequently employed in high-dimensional statistics for parameter regression and variable selection. It is particularly beneficial compared to lasso when the number of predictors greatly surpasses the number of observations. However, empirical evidence has shown that the $\ell_q$-norm penalty (where $0 < q < 1$) often provides better regression compared to the $\ell_1$-no…
▽ More
The elastic net penalty is frequently employed in high-dimensional statistics for parameter regression and variable selection. It is particularly beneficial compared to lasso when the number of predictors greatly surpasses the number of observations. However, empirical evidence has shown that the $\ell_q$-norm penalty (where $0 < q < 1$) often provides better regression compared to the $\ell_1$-norm penalty, demonstrating enhanced robustness in various scenarios. In this paper, we explore a generalized elastic net model that employs a $\ell_r$-norm (where $r \geq 1$) in loss function to accommodate various types of noise, and employs a $\ell_q$-norm (where $0 < q < 1$) to replace the $\ell_1$-norm in elastic net penalty. Theoretically, we establish the computable lower bounds for the nonzero entries of the generalized first-order stationary points of the proposed generalized elastic net model. For implementation, we develop two efficient algorithms based on the locally Lipschitz continuous $ε$-approximation to $\ell_q$-norm. The first algorithm employs an alternating direction method of multipliers (ADMM), while the second utilizes a proximal majorization-minimization method (PMM), where the subproblems are addressed using the semismooth Newton method (SNN). We also perform extensive numerical experiments with both simulated and real data, showing that both algorithms demonstrate superior performance. Notably, the PMM-SSN is efficient than ADMM, even though the latter provides a simpler implementation.
△ Less
Submitted 5 December, 2024; v1 submitted 22 November, 2024;
originally announced November 2024.
-
Sample path properties and small ball probabilities for stochastic fractional diffusion equations
Authors:
Yuhui Guo,
Jian Song,
Ran Wang,
Yimin Xiao
Abstract:
We consider the following stochastic space-time fractional diffusion equation with vanishing initial condition:$$ \partial^β u(t, x)=- \left(-Δ\right)^{α/ 2} u(t, x)+ I_{0+}^γ\left[\dot{W}(t, x)\right],\quad t\in[0,T],\: x \in \mathbb{R}^d,$$ where $α>0$, $β\in(0,2)$, $γ\in[0,1)$, $\left(-Δ\right)^{α/2}$ is the fractional/power of Laplacian and $\dot{W}$ is a fractional space-time Gaussian noise.…
▽ More
We consider the following stochastic space-time fractional diffusion equation with vanishing initial condition:$$ \partial^β u(t, x)=- \left(-Δ\right)^{α/ 2} u(t, x)+ I_{0+}^γ\left[\dot{W}(t, x)\right],\quad t\in[0,T],\: x \in \mathbb{R}^d,$$ where $α>0$, $β\in(0,2)$, $γ\in[0,1)$, $\left(-Δ\right)^{α/2}$ is the fractional/power of Laplacian and $\dot{W}$ is a fractional space-time Gaussian noise. We prove the existence and uniqueness of the solution and then focus on various sample path regularity properties of the solution. More specifically, we establish the exact uniform and local moduli of continuity and Chung-type laws of the iterated logarithm. The small ball probability is also studied.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
A polynomially solvable case of unconstrained (-1,1)-quadratic fractional optimization
Authors:
Meijia Yang,
Yong Xia
Abstract:
In this paper, we consider an unconstrained (-1,1)-quadratic fractional optimization in the following form: $\min_{x\in\{-1,1\}^n}~(x^TAx+α)/(x^TBx+β)$, where $A$ and $B$, given by their nonzero eigenvalues and associated eigenvectors, have ranks not exceeding fixed integers $r_a$ and $r_b$, respectively. We show that this problem can be solved in $O(n^{r_a+r_b+1}\log^2 n)$ by the accelerated Newt…
▽ More
In this paper, we consider an unconstrained (-1,1)-quadratic fractional optimization in the following form: $\min_{x\in\{-1,1\}^n}~(x^TAx+α)/(x^TBx+β)$, where $A$ and $B$, given by their nonzero eigenvalues and associated eigenvectors, have ranks not exceeding fixed integers $r_a$ and $r_b$, respectively. We show that this problem can be solved in $O(n^{r_a+r_b+1}\log^2 n)$ by the accelerated Newton-Dinkelbach method when the matrices $A$ has nonpositive diagonal entries only, $B$ has nonnegative diagonal entries only. Furthermore, this problem can be solved in $O(n^{r_a+r_b+2}\log^2 n)$ when $A$ has $O(\log(n))$ positive diagonal entries, $B$ has $O(\log(n))$ negative diagonal entries.
△ Less
Submitted 14 November, 2024;
originally announced November 2024.
-
Closing the duality gap of the generalized trace ratio problem
Authors:
Meijia Yang,
Yong Xia
Abstract:
The generalized trace ratio problem {\rm (GTRP)} is to maximize a quadratic fractional objective function in trace formulation over the Stiefel manifold. In this paper, based on a newly developed matrix S-lemma, we show that {\rm (GTRP)}, if a redundant constraint is added and well scaled, has zero Lagrangian duality gap. However, this is not always true without the technique of scaling or adding…
▽ More
The generalized trace ratio problem {\rm (GTRP)} is to maximize a quadratic fractional objective function in trace formulation over the Stiefel manifold. In this paper, based on a newly developed matrix S-lemma, we show that {\rm (GTRP)}, if a redundant constraint is added and well scaled, has zero Lagrangian duality gap. However, this is not always true without the technique of scaling or adding the redundant constraint.
△ Less
Submitted 13 November, 2024;
originally announced November 2024.
-
Revisiting Atomic Norm Minimization: A Sequential Approach for Atom Identification and Refinement
Authors:
Xiaozhi Liu,
Jinjiang Wei,
Yong Xia
Abstract:
Atomic norm minimization (ANM) is a key approach for line spectral estimation (LSE). Most related algorithms formulate ANM as a semidefinite programming (SDP), which incurs high computational cost. In this letter, we revisit the ANM problem and present a novel limit-based formulation, which dissects the essential components of the semidefinite characterization of ANM. Our new formulation does not…
▽ More
Atomic norm minimization (ANM) is a key approach for line spectral estimation (LSE). Most related algorithms formulate ANM as a semidefinite programming (SDP), which incurs high computational cost. In this letter, we revisit the ANM problem and present a novel limit-based formulation, which dissects the essential components of the semidefinite characterization of ANM. Our new formulation does not depend on SDP and can be extended to handle more general atomic sets beyond mixture of complex sinusoids. Furthermore, we reveal the connection between ANM and Bayesian LSE approaches, bridging the gap between these two methodologies. Based on this new formulation, we propose a low-complexity algorithm called Sequential Atom Identification and Refinement (SAIR) for ANM. Simulation results demonstrate that SAIR achieves superior estimation accuracy and computational efficiency compared to other state-of-the-art methods.
△ Less
Submitted 13 November, 2024;
originally announced November 2024.
-
Pattern formation and global analysis of a systematically reduced plant model in dryland environment
Authors:
Yonghui Xia,
Jianglong Xiao,
Jianshe Yu
Abstract:
This paper delves into a systematically reduced plant system proposed by Jaïbi et al. [Phys. D, 2020] in arid area. They used the method of geometric singular perturbation to study the existence of abundant orbits. Instead, we deliberate the stability and distributed patterns of this system. For a non-diffusive scenario for the model, we scrutinize the local and global stability of equilibria and…
▽ More
This paper delves into a systematically reduced plant system proposed by Jaïbi et al. [Phys. D, 2020] in arid area. They used the method of geometric singular perturbation to study the existence of abundant orbits. Instead, we deliberate the stability and distributed patterns of this system. For a non-diffusive scenario for the model, we scrutinize the local and global stability of equilibria and derive conditions for the existence or non-existence of the limit cycle. The bifurcation behaviors are also explored. For the spatial model, we investigate Hopf, Turing, Hopf-Turing, Turing-Turing bifurcations. Specially, the evolution process from periodic solutions to spatially nonconstant steady states is observed near the Hopf-Turing bifurcation point. And mixed nonconstant steady states near the Turing-Turing bifurcation point are observed. Furthermore, it's found that there exist gap, spot, stripe and mixed patterns. The seed-dispersal rate enables the transformation of pattern structures. Reasonable control of system parameters may prevent desertification from occurring.
△ Less
Submitted 30 October, 2024;
originally announced November 2024.
-
Intelligent acceleration adaptive control of linear $2\times2$ hyperbolic PDE systems
Authors:
Xianhe Zhang,
Yu Xiao,
Xiaodong Xu,
Biao Luo
Abstract:
Traditional approaches to stabilizing hyperbolic PDEs, such as PDE backstepping, often encounter challenges when dealing with high-dimensional or complex nonlinear problems. Their solutions require high computational and analytical costs. Recently, neural operators (NOs) for the backstepping design of first-order hyperbolic partial differential equations (PDEs) have been introduced, which rapidly…
▽ More
Traditional approaches to stabilizing hyperbolic PDEs, such as PDE backstepping, often encounter challenges when dealing with high-dimensional or complex nonlinear problems. Their solutions require high computational and analytical costs. Recently, neural operators (NOs) for the backstepping design of first-order hyperbolic partial differential equations (PDEs) have been introduced, which rapidly generate gain kernel without requiring online numerical solution. In this paper we apply neural operators to a more complex class of $2\times2$ hyperbolic PDE systems for adaptive stability control. Once the NO has been well-trained offline on a sufficient training set obtained using a numerical solver, the kernel equation no longer needs to be solved again, thereby avoiding the high computational cost during online operations.Specifically, we introduce the deep operator network (DeepONet), a neural network framework, to learn the nonlinear operator of the system parameters to the kernel gain. The approximate backstepping kernel is obtained by utilizing the network after learning, instead of numerically solving the kernel equations in the form of PDEs, to further derive the approximate controller and the target system. We analyze the existence and approximation of DeepONet operators and provide stability and convergence proofs for the closed-loop systems with NOs. Finally, the effectiveness of the proposed NN-adaptive control scheme is verified by comparative simulation, which shows that the NN operator achieved up to three orders of magnitude faster compared to conventional PDE solvers, significantly improving real-time control performance.
△ Less
Submitted 7 November, 2024;
originally announced November 2024.
-
Hybrid Pipe Dreams for Key Polynomials
Authors:
Yihan Xiao,
Rui Xiong,
Haofeng Zhang
Abstract:
We develop a family of new combinatorial models for key polynomials. It is similar to the hybrid pipe dream model for Schubert polynomials defined recently by Knutson and Udell.
We develop a family of new combinatorial models for key polynomials. It is similar to the hybrid pipe dream model for Schubert polynomials defined recently by Knutson and Udell.
△ Less
Submitted 3 November, 2024;
originally announced November 2024.
-
On the existence of $L^p$-Optimal Transport maps for norms on $\mathbb{R}^N$
Authors:
Guoxi Liu,
Mattia Magnabosco,
Yicheng Xia
Abstract:
In this paper, we prove existence of $L^p$-optimal transport maps with $p \in (1,\infty)$ in a class of branching metric spaces defined on $\mathbb{R}^N$. In particular, we introduce the notion of cylinder-like convex function and we prove an existence result for the Monge problem with cost functions of the type $c(x, y) = f(g(y - x))$, where $f: [0, \infty) \rightarrow [0, \infty)$ is an increasi…
▽ More
In this paper, we prove existence of $L^p$-optimal transport maps with $p \in (1,\infty)$ in a class of branching metric spaces defined on $\mathbb{R}^N$. In particular, we introduce the notion of cylinder-like convex function and we prove an existence result for the Monge problem with cost functions of the type $c(x, y) = f(g(y - x))$, where $f: [0, \infty) \rightarrow [0, \infty)$ is an increasing strictly convex function and $g: \mathbb{R}^N \rightarrow [0, \infty)$ is a cylinder-like convex function. When specialised to cylinder-like norm, our results shows existence of $L^p$-optimal transport maps for several "branching'" norms, including all norms in $\mathbb{R}^2$ and all crystalline norms.
△ Less
Submitted 29 October, 2024;
originally announced October 2024.
-
Lipschitz-free Projected Subgradient Method with Time-varying Step-size
Authors:
Yong Xia,
Yanhao Zhang,
Zhihan Zhu
Abstract:
We introduce a novel family of time-varying step-sizes for the classical projected subgradient method, offering optimal ergodic convergence. Importantly, this approach does not depend on the Lipschitz assumption of the objective function, thereby broadening the convergence result of projected subgradient method to non-Lipschitz case.
We introduce a novel family of time-varying step-sizes for the classical projected subgradient method, offering optimal ergodic convergence. Importantly, this approach does not depend on the Lipschitz assumption of the objective function, thereby broadening the convergence result of projected subgradient method to non-Lipschitz case.
△ Less
Submitted 17 December, 2024; v1 submitted 6 October, 2024;
originally announced October 2024.
-
Integer tile and Spectrality of Cantor-Moran measures with equidifferent digit sets
Authors:
Sha Wu,
Yingqing Xiao
Abstract:
Let $\left\{b_{k}\right\}_{k=1}^{\infty}$ be a sequence of integers with $|b_{k}|\geq2$ and $\left\{D_{k}\right\}_{k=1}^{\infty} $ be a sequence of equidifferent digit sets with $D_{k}=\left\{0,1, \cdots, N-1\right\}t_{k},$ where $N\geq2$ is a prime number and $\{t_{k}\}_{k=1}^{\infty}$ is bounded. In this paper, we study the existence of the Cantor-Moran measure $μ_{\{b_k\},\{D_k\}}$ and show tha…
▽ More
Let $\left\{b_{k}\right\}_{k=1}^{\infty}$ be a sequence of integers with $|b_{k}|\geq2$ and $\left\{D_{k}\right\}_{k=1}^{\infty} $ be a sequence of equidifferent digit sets with $D_{k}=\left\{0,1, \cdots, N-1\right\}t_{k},$ where $N\geq2$ is a prime number and $\{t_{k}\}_{k=1}^{\infty}$ is bounded. In this paper, we study the existence of the Cantor-Moran measure $μ_{\{b_k\},\{D_k\}}$ and show that $$\mathbf{D}_k:=D_k\oplus b_{k} D_{k-1}\oplus b_{k}b_{k-1} D_{k-2}\oplus\cdots\oplus b_{k}b_{k-1}\cdots b_2D_{1}$$ is an integer tile for all $k\in\mathbb{N}^+$ if and only if $\mathbf{s}_i\neq\mathbf{s}_j$ for all $i\neq j\in\mathbb{N}^{+}$, where $\mathbf{s}_i$ is defined as the numbers of factor $N$ in $\frac{b_1b_2\cdots b_i}{Nt_i}$. Moreover, we prove that $\mathbf{D}_k$ being an integer tile for all $k\in\mathbb{N}^+$ is a necessary condition for the Cantor-Moran measure to be a spectral measure, and we provide an example to demonstrate that it cannot become a sufficient condition. Furthermore, under some additional assumptions, we establish that the Cantor-Moran measure to be a spectral measure is equivalent to $\mathbf{D}_k$ being an integer tile for all $k\in\mathbb{N}^+$.
△ Less
Submitted 28 October, 2024;
originally announced October 2024.
-
Twisted Rota-Baxter family operators on Hom-associative algebras
Authors:
Wen Teng,
Yunpeng Xiao
Abstract:
In this paper, we first define twisted Rota-Baxter family operators on Hom-associative algebras indexed by a semigroup $Ω$. Then we introduce and study Hom-NS-family algebras as the underlying structures of twisted Rota-Baxter family operators. Meanwhile, We show that a Hom-NS-family algebra induces an ordinary Hom-NS-algebra on the tensor product with the semigroup algebra. Moreover, we define th…
▽ More
In this paper, we first define twisted Rota-Baxter family operators on Hom-associative algebras indexed by a semigroup $Ω$. Then we introduce and study Hom-NS-family algebras as the underlying structures of twisted Rota-Baxter family operators. Meanwhile, We show that a Hom-NS-family algebra induces an ordinary Hom-NS-algebra on the tensor product with the semigroup algebra. Moreover, we define the cohomology of a twisted Rota-Baxter family operator. This cohomology can also be viewed as the cohomology of a certain Hom-$Ω$-associative algebra with coefficients in a suitable bimodule. Finally, we examine deformations of twisted Rota-Baxter family operators and demonstrate that they are governed by the aforementioned cohomology. The concept of Nijenhuis elements linked to a twisted Rota-Baxter family operator is introduced to provide a sufficient condition for its rigidity.
△ Less
Submitted 26 October, 2024;
originally announced October 2024.
-
Two-Dimensional Quaternion Linear Canonical Transform A Novel Framework for Probability Modeling
Authors:
Muhammad Adnan Samad,
Yuanqing Xia,
Saima Siddiqui,
Muhammad Younus Bhat
Abstract:
The linear canonical transform (LCT) serves as a powerful generalization of the Fourier transform (FT), encapsulating various integral transforms within a unified framework. This versatility has made it a cornerstone in fields such as signal processing, optics, and quantum mechanics. Extending this concept to quaternion algebra, the Quaternion Fourier Transform (QFT) emerged, enriching the analysi…
▽ More
The linear canonical transform (LCT) serves as a powerful generalization of the Fourier transform (FT), encapsulating various integral transforms within a unified framework. This versatility has made it a cornerstone in fields such as signal processing, optics, and quantum mechanics. Extending this concept to quaternion algebra, the Quaternion Fourier Transform (QFT) emerged, enriching the analysis of multidimensional and complex-valued signals. The Quaternion Linear Canonical Transform (QLCT), a further generalization, has now positioned itself as a central tool across various disciplines, including applied mathematics, engineering, computer science, and statistics. In this paper, we introduce the Two Dimensional Quaternion Linear Canonical Transform (2DQLCT) as a novel framework for probability modeling. By leveraging the 2DQLCT, we aim to provide a more comprehensive understanding of probability distributions, particularly in the context of multi-dimensional and complex-valued signals. This framework not only broadens the theoretical underpinnings of probability theory but also opens new avenues for researchers
△ Less
Submitted 18 October, 2024;
originally announced October 2024.
-
Spectral theoretic characterisation of Markov chain convergence
Authors:
Bryn Davies,
Yu Xiao
Abstract:
In this work, we characterise the statistics of Markov chains by constructing an associated sequence of periodic differential operators. Studying the density of states of these operators reveals the absolutely continuous invariant measure of the Markov chain. This approach also leads to a direct proof of convergence to the invariant measure, along with explicit convergence rates. We show how our m…
▽ More
In this work, we characterise the statistics of Markov chains by constructing an associated sequence of periodic differential operators. Studying the density of states of these operators reveals the absolutely continuous invariant measure of the Markov chain. This approach also leads to a direct proof of convergence to the invariant measure, along with explicit convergence rates. We show how our method can be applied to a class of related Markov chains including the logistic map, the tent map and Chebyshev maps of arbitrary order.
△ Less
Submitted 21 October, 2024;
originally announced October 2024.
-
A New Linear Programming Method in Sphere Packing
Authors:
Qun Mo,
Jinming Wen,
Yu Xia
Abstract:
Inspired by the linear programming method developed by Cohn and Elkies (Ann. Math. 157(2): 689-714, 2003), we introduce a new linear programming method to solve the sphere packing problem. More concretely, we consider sequences of auxiliary functions $\{g_m\}_{m\in \mathbb{N}^{+}}$, where $g_m$ is a $mΛ$-periodic auxiliary function defined on $\mathbb{R}^n$, with $Λ$ being a given full-rank lattic…
▽ More
Inspired by the linear programming method developed by Cohn and Elkies (Ann. Math. 157(2): 689-714, 2003), we introduce a new linear programming method to solve the sphere packing problem. More concretely, we consider sequences of auxiliary functions $\{g_m\}_{m\in \mathbb{N}^{+}}$, where $g_m$ is a $mΛ$-periodic auxiliary function defined on $\mathbb{R}^n$, with $Λ$ being a given full-rank lattice in $\mathbb{R}^n$. This new method extends the original approach and offers a greater flexibility. Furthermore, using this new linear programming framework, we construct several effective auxiliary functions for dimensions $n=1,2,3$. We hope this approach provides valuable insights into solving sphere packing problems for $n=2,3$ and even higher dimensions.
△ Less
Submitted 2 December, 2024; v1 submitted 7 October, 2024;
originally announced October 2024.
-
Tikhonov regularized mixed-order primal-dual dynamical system for convex optimization problems with linear equality constraints
Authors:
Honglu Li,
Xin He,
Yibin Xiao
Abstract:
In Hilbert spaces, we consider a Tikhonov regularized mixed-order primal-dual dynamical system for a convex optimization problem with linear equality constraints. The dynamical system with general time-dependent parameters: viscous damping and temporal scaling can derive certain existing systems when special parameters are selected. When these parameters satisfy appropriate conditions and the Tikh…
▽ More
In Hilbert spaces, we consider a Tikhonov regularized mixed-order primal-dual dynamical system for a convex optimization problem with linear equality constraints. The dynamical system with general time-dependent parameters: viscous damping and temporal scaling can derive certain existing systems when special parameters are selected. When these parameters satisfy appropriate conditions and the Tikhonov regularization parameter ε(t) approaches zero at an appropriate rate, we analyze the asymptotic convergence properties of the proposed system by constructing suitable Lyapunov functions. And we obtain that the objective function error enjoys O(1/(t^2β(t))) convergence rate. Under suitable conditions, it can be better than O(1/(t^2)). In addition, we utilize the Lyapunov analysis method to obtain the strong convergence of the trajectory generated by the Tikhonov regularized dynamical system. In particular, when Tikhonov regularization parameter ε(t) vanishes to 0 at some suitable rate, the convergence rate of the primal-dual gap can be o(1/(β(t))). The effectiveness of our theoretical results has been demonstrated through numerical experiments.
△ Less
Submitted 26 September, 2024; v1 submitted 25 September, 2024;
originally announced September 2024.