-
Efficient First-Order Optimization on the Pareto Set for Multi-Objective Learning under Preference Guidance
Authors:
Lisha Chen,
Quan Xiao,
Ellen Hidemi Fukuda,
Xinyi Chen,
Kun Yuan,
Tianyi Chen
Abstract:
Multi-objective learning under user-specified preference is common in real-world problems such as multi-lingual speech recognition under fairness. In this work, we frame such a problem as a semivectorial bilevel optimization problem, whose goal is to optimize a pre-defined preference function, subject to the constraint that the model parameters are weakly Pareto optimal. To solve this problem, we…
▽ More
Multi-objective learning under user-specified preference is common in real-world problems such as multi-lingual speech recognition under fairness. In this work, we frame such a problem as a semivectorial bilevel optimization problem, whose goal is to optimize a pre-defined preference function, subject to the constraint that the model parameters are weakly Pareto optimal. To solve this problem, we convert the multi-objective constraints to a single-objective constraint through a merit function with an easy-to-evaluate gradient, and then, we use a penalty-based reformulation of the bilevel optimization problem. We theoretically establish the properties of the merit function, and the relations of solutions for the penalty reformulation and the constrained formulation. Then we propose algorithms to solve the reformulated single-level problem, and establish its convergence guarantees. We test the method on various synthetic and real-world problems. The results demonstrate the effectiveness of the proposed method in finding preference-guided optimal solutions to the multi-objective problem.
△ Less
Submitted 26 March, 2025;
originally announced April 2025.
-
On Dirichlet non-improvable numbers and shrinking target problems
Authors:
Qian Xiao
Abstract:
In one-dimensional Diophantine approximation, the Diophantine properties of a real number are characterized by its partial quotients, especially the growth of its large partial quotients. Notably, Kleinbock and Wadleigh [Proc. Amer. Math. Soc. 2018] made a seminal contribution by linking the improvability of Dirichlet's theorem to the growth of the product of consecutive partial quotients. In this…
▽ More
In one-dimensional Diophantine approximation, the Diophantine properties of a real number are characterized by its partial quotients, especially the growth of its large partial quotients. Notably, Kleinbock and Wadleigh [Proc. Amer. Math. Soc. 2018] made a seminal contribution by linking the improvability of Dirichlet's theorem to the growth of the product of consecutive partial quotients. In this paper, we extend the concept of Dirichlet non-improvable sets within the framework of shrinking target problems. Specifically, consider the dynamical system $([0,1), T)$ of continued fractions. Let $\{z_n\}_{n \ge 1}$ be a sequence of real numbers in $[0,1]$ and let $B > 1$. We determine the Hausdorff dimension of the following set:
\[
\begin{split}
\{x\in[0,1):|T^nx-z_n||T^{n+1}x-Tz_n|<B^{-n}\text{ infinitely often}\}.
\end{split}
\]
△ Less
Submitted 10 May, 2025; v1 submitted 13 March, 2025;
originally announced March 2025.
-
A First-order Generative Bilevel Optimization Framework for Diffusion Models
Authors:
Quan Xiao,
Hui Yuan,
A F M Saif,
Gaowen Liu,
Ramana Kompella,
Mengdi Wang,
Tianyi Chen
Abstract:
Diffusion models, which iteratively denoise data samples to synthesize high-quality outputs, have achieved empirical success across domains. However, optimizing these models for downstream tasks often involves nested bilevel structures, such as tuning hyperparameters for fine-tuning tasks or noise schedules in training dynamics, where traditional bilevel methods fail due to the infinite-dimensiona…
▽ More
Diffusion models, which iteratively denoise data samples to synthesize high-quality outputs, have achieved empirical success across domains. However, optimizing these models for downstream tasks often involves nested bilevel structures, such as tuning hyperparameters for fine-tuning tasks or noise schedules in training dynamics, where traditional bilevel methods fail due to the infinite-dimensional probability space and prohibitive sampling costs. We formalize this challenge as a generative bilevel optimization problem and address two key scenarios: (1) fine-tuning pre-trained models via an inference-only lower-level solver paired with a sample-efficient gradient estimator for the upper level, and (2) training diffusion models from scratch with noise schedule optimization by reparameterizing the lower-level problem and designing a computationally tractable gradient estimator. Our first-order bilevel framework overcomes the incompatibility of conventional bilevel methods with diffusion processes, offering theoretical grounding and computational practicality. Experiments demonstrate that our method outperforms existing fine-tuning and hyperparameter search baselines.
△ Less
Submitted 12 February, 2025;
originally announced February 2025.
-
Analog In-memory Training on General Non-ideal Resistive Elements: The Impact of Response Functions
Authors:
Zhaoxian Wu,
Quan Xiao,
Tayfun Gokmen,
Omobayode Fagbohungbe,
Tianyi Chen
Abstract:
As the economic and environmental costs of training and deploying large vision or language models increase dramatically, analog in-memory computing (AIMC) emerges as a promising energy-efficient solution. However, the training perspective, especially its training dynamic, is underexplored. In AIMC hardware, the trainable weights are represented by the conductance of resistive elements and updated…
▽ More
As the economic and environmental costs of training and deploying large vision or language models increase dramatically, analog in-memory computing (AIMC) emerges as a promising energy-efficient solution. However, the training perspective, especially its training dynamic, is underexplored. In AIMC hardware, the trainable weights are represented by the conductance of resistive elements and updated using consecutive electrical pulses. Among all the physical properties of resistive elements, the response to the pulses directly affects the training dynamics. This paper first provides a theoretical foundation for gradient-based training on AIMC hardware and studies the impact of response functions. We demonstrate that noisy update and asymmetric response functions negatively impact Analog SGD by imposing an implicit penalty term on the objective. To overcome the issue, Tiki-Taka, a residual learning algorithm, converges exactly to a critical point by optimizing a main array and a residual array bilevelly. The conclusion is supported by simulations validating our theoretical insights.
△ Less
Submitted 14 February, 2025; v1 submitted 10 February, 2025;
originally announced February 2025.
-
Unfitted boundary algebraic equation method based on difference potentials and lattice Green's function in 3D
Authors:
Qing Xia
Abstract:
This work presents an unfitted boundary algebraic equation (BAE) method for solving three-dimensional elliptic partial differential equations on complex geometries using finite difference on structured meshes. We demonstrate that replacing finite auxiliary domains with free-space LGFs streamlines the computation of difference potentials, enabling matrix-free implementations and significant cost re…
▽ More
This work presents an unfitted boundary algebraic equation (BAE) method for solving three-dimensional elliptic partial differential equations on complex geometries using finite difference on structured meshes. We demonstrate that replacing finite auxiliary domains with free-space LGFs streamlines the computation of difference potentials, enabling matrix-free implementations and significant cost reductions. We establish theoretical foundations by showing the equivalence between direct formulations in difference potentials framework and indirect single/double layer formulations and analyzing their spectral properties. The spectral analysis demonstrates that discrete double layer formulations provide better-conditioned systems for iterative solvers, similarly as in boundary integral method. The method is validated through matrix-free numerical experiments on both Poisson and modified Helmholtz equations in 3D implicitly defined geometries, showing optimal convergence rates and computational efficiency. This framework naturally extends to unbounded domains and provides a foundation for applications to more complex systems like Helmholtz and Stokes equations.
△ Less
Submitted 8 February, 2025;
originally announced February 2025.
-
Optimal convergence speed in the classical limits of relativistic Cucker-Smale models
Authors:
Seung-Yeal Ha,
Tommaso Ruggeri,
Qinghua Xiao
Abstract:
We study quantitative estimates for the flocking and uniform-time classical limit to the relativistic Cucker-Smale (in short RCS) model introduced in \cite{Ha-Kim-Ruggeri-ARMA-2020}. Different from previous works, we do not neglect the relativistic effect on the presence of the pressure in momentum equation. For the RCS model, we provide a quantitative estimate on the uniform-time classical limit…
▽ More
We study quantitative estimates for the flocking and uniform-time classical limit to the relativistic Cucker-Smale (in short RCS) model introduced in \cite{Ha-Kim-Ruggeri-ARMA-2020}. Different from previous works, we do not neglect the relativistic effect on the presence of the pressure in momentum equation. For the RCS model, we provide a quantitative estimate on the uniform-time classical limit with an optimal convergence rate which is the same as in finite-time classical limit under a relaxed initial condition. We also allow corresponding initial data for the RCS and Cucker-Smale (CS) model to be different in the classical limit. This removes earlier constraints employed in the previous classical limit. As a direct application of this optimal convergence rate in the classical limit of the RCS model, we derive an optimal convergence rate for the corresponding uniform-time classical limit for the kinetic RCS model.
△ Less
Submitted 17 December, 2024;
originally announced December 2024.
-
Pipeline Gradient-based Model Training on Analog In-memory Accelerators
Authors:
Zhaoxian Wu,
Quan Xiao,
Tayfun Gokmen,
Hsinyu Tsai,
Kaoutar El Maghraoui,
Tianyi Chen
Abstract:
Aiming to accelerate the training of large deep neural models (DNN) in an energy-efficient way, an analog in-memory computing (AIMC) accelerator emerges as a solution with immense potential. In AIMC accelerators, trainable weights are kept in memory without the need to move from memory to processors during the training, reducing a bunch of overhead. However, although the in-memory feature enables…
▽ More
Aiming to accelerate the training of large deep neural models (DNN) in an energy-efficient way, an analog in-memory computing (AIMC) accelerator emerges as a solution with immense potential. In AIMC accelerators, trainable weights are kept in memory without the need to move from memory to processors during the training, reducing a bunch of overhead. However, although the in-memory feature enables efficient computation, it also constrains the use of data parallelism since copying weights from one AIMC to another is expensive. To enable parallel training using AIMC, we propose synchronous and asynchronous pipeline parallelism for AIMC accelerators inspired by the pipeline in digital domains. This paper provides a theoretical convergence guarantee for both synchronous and asynchronous pipelines in terms of both sampling and clock cycle complexity, which is non-trivial since the physical characteristic of AIMC accelerators leads to analog updates that suffer from asymmetric bias. The simulations of training DNN on real datasets verify the efficiency of pipeline training.
△ Less
Submitted 19 October, 2024;
originally announced October 2024.
-
Unlocking Global Optimality in Bilevel Optimization: A Pilot Study
Authors:
Quan Xiao,
Tianyi Chen
Abstract:
Bilevel optimization has witnessed a resurgence of interest, driven by its critical role in trustworthy and efficient AI applications. While many recent works have established convergence to stationary points or local minima, obtaining the global optimum of bilevel optimization remains an important yet open problem. The difficulty lies in the fact that, unlike many prior non-convex single-level pr…
▽ More
Bilevel optimization has witnessed a resurgence of interest, driven by its critical role in trustworthy and efficient AI applications. While many recent works have established convergence to stationary points or local minima, obtaining the global optimum of bilevel optimization remains an important yet open problem. The difficulty lies in the fact that, unlike many prior non-convex single-level problems, bilevel problems often do not admit a benign landscape, and may indeed have multiple spurious local solutions. Nevertheless, attaining global optimality is indispensable for ensuring reliability, safety, and cost-effectiveness, particularly in high-stakes engineering applications that rely on bilevel optimization. In this paper, we first explore the challenges of establishing a global convergence theory for bilevel optimization, and present two sufficient conditions for global convergence. We provide algorithm-dependent proofs to rigorously substantiate these sufficient conditions on two specific bilevel learning scenarios: representation learning and data hypercleaning (a.k.a. reweighting). Experiments corroborate the theoretical findings, demonstrating convergence to the global minimum in both cases.
△ Less
Submitted 24 December, 2024; v1 submitted 28 August, 2024;
originally announced August 2024.
-
A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with Coupled Constraints
Authors:
Liuyuan Jiang,
Quan Xiao,
Victor M. Tenorio,
Fernando Real-Rojas,
Antonio G. Marques,
Tianyi Chen
Abstract:
Interest in bilevel optimization has grown in recent years, partially due to its applications to tackle challenging machine-learning problems. Several exciting recent works have been centered around developing efficient gradient-based algorithms that can solve bilevel optimization problems with provable guarantees. However, the existing literature mainly focuses on bilevel problems either without…
▽ More
Interest in bilevel optimization has grown in recent years, partially due to its applications to tackle challenging machine-learning problems. Several exciting recent works have been centered around developing efficient gradient-based algorithms that can solve bilevel optimization problems with provable guarantees. However, the existing literature mainly focuses on bilevel problems either without constraints, or featuring only simple constraints that do not couple variables across the upper and lower levels, excluding a range of complex applications. Our paper studies this challenging but less explored scenario and develops a (fully) first-order algorithm, which we term BLOCC, to tackle BiLevel Optimization problems with Coupled Constraints. We establish rigorous convergence theory for the proposed algorithm and demonstrate its effectiveness on two well-known real-world applications - hyperparameter selection in support vector machine (SVM) and infrastructure planning in transportation networks using the real data from the city of Seville.
△ Less
Submitted 25 August, 2024; v1 submitted 14 June, 2024;
originally announced June 2024.
-
Almost Ricci solitons on Finsler spaces
Authors:
Qiaoling Xia
Abstract:
In this paper, (gradient) almost Ricci solitons on Finsler measure spaces $(M, F, m)$ are introduced and investigated. We prove that $(M, F, m)$ is a gradient almost Ricci soliton if and only if the infinity-Ricci curvature Ric$_\infty$ is a scalar function on $M$ when $M$ is compact. Moreover, we give an equivalent characterization of (gradient) almost Ricci solitons for Randers metrics $F=α+β$,…
▽ More
In this paper, (gradient) almost Ricci solitons on Finsler measure spaces $(M, F, m)$ are introduced and investigated. We prove that $(M, F, m)$ is a gradient almost Ricci soliton if and only if the infinity-Ricci curvature Ric$_\infty$ is a scalar function on $M$ when $M$ is compact. Moreover, we give an equivalent characterization of (gradient) almost Ricci solitons for Randers metrics $F=α+β$, which implies that every Randers (gradient) almost Ricci soliton is of isotropic S$_{BH}$-curvature. Based on this and the navigation technique, we further classify Randers almost Ricci solitons (resp. gradient almost Ricci solitons) up to classifications of Randers Einstein metrics $F$ (resp. Riemannian gradient almost Ricci solitons) and the homothetic vector fields of $F$ (resp. solutions of the equation which the weight function $f$ of $m$ satisfies) when $F$ has isotropic S$_{BH}$-curvature. As applications, we obtain some rigidity results for compact Randers (gradient) Ricci solitons and construct several Randers gradient Ricci solitons, which are the first nontrivial examples of gradient Ricci solitons in Finsler geometry.
△ Less
Submitted 8 November, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Multifractal Formalism from Large Deviations
Authors:
Mirmukhsin Makhmudov,
Evgeny Verbitskiy,
Qian Xiao
Abstract:
It has often been observed that the Multifractal Formalism and the Large Deviation Principles are intimately related. In fact, Multifractal Formalism was heuristically derived using the Large Deviations ideas. In numerous examples in which the multifractal results have been rigorously established, the corresponding Large Deviation results are valid as well. Moreover, the proofs of multifractal and…
▽ More
It has often been observed that the Multifractal Formalism and the Large Deviation Principles are intimately related. In fact, Multifractal Formalism was heuristically derived using the Large Deviations ideas. In numerous examples in which the multifractal results have been rigorously established, the corresponding Large Deviation results are valid as well. Moreover, the proofs of multifractal and large deviations are remarkably similar. The natural question then is whether under which conditions multifractal formalism can be deduced from the corresponding large deviations results. More specifically, given a sequence of random variables $\{ {X_n} \}_{n\in\N}$, satisfying a Large Deviation Principle, what can be said about the multifractal nature of the level sets $K_α=\{ω: \lim_{n} \frac{X_n(ω)}{n}=α\}$. Under some technical assumptions, we establish the upper and lower bounds for multifractal spectra in terms of the large deviation rate functions, and show that many known results of multifractal formalism are covered by our setup.
△ Less
Submitted 23 February, 2024;
originally announced February 2024.
-
Transport multi-paths with capacity constraints
Authors:
Qinglan Xia,
Haotian Sun
Abstract:
This article generalizes the study of branched/ramified optimal transportation to those with capacity constraints. Each admissible transport network studied here is represented by a transport multi-path between measures, with a capacity constraint on each of its components. The associated transport cost is given by the sum of the $\textbf{M}_α$-cost of each component. Using this new formulation, w…
▽ More
This article generalizes the study of branched/ramified optimal transportation to those with capacity constraints. Each admissible transport network studied here is represented by a transport multi-path between measures, with a capacity constraint on each of its components. The associated transport cost is given by the sum of the $\textbf{M}_α$-cost of each component. Using this new formulation, we prove the existence of an optimal solution and provide an upper bound on the number of components for the solution. Additionally, we conduct analytical examinations of the properties (e.g. ``map-compatibility", and ``simple common-source property") of each solution component and explore the interplay among components, particularly in the discrete case.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
A Bilevel Optimization Method for Inverse Mean-Field Games
Authors:
Jiajia Yu,
Quan Xiao,
Tianyi Chen,
Rongjie Lai
Abstract:
In this paper, we introduce a bilevel optimization framework for addressing inverse mean-field games, alongside an exploration of numerical methods tailored for this bilevel problem. The primary benefit of our bilevel formulation lies in maintaining the convexity of the objective function and the linearity of constraints in the forward problem. Our paper focuses on inverse mean-field games charact…
▽ More
In this paper, we introduce a bilevel optimization framework for addressing inverse mean-field games, alongside an exploration of numerical methods tailored for this bilevel problem. The primary benefit of our bilevel formulation lies in maintaining the convexity of the objective function and the linearity of constraints in the forward problem. Our paper focuses on inverse mean-field games characterized by unknown obstacles and metrics. We show numerical stability for these two types of inverse problems. More importantly, we, for the first time, establish the identifiability of the inverse mean-field game with unknown obstacles via the solution of the resultant bilevel problem. The bilevel approach enables us to employ an alternating gradient-based optimization algorithm with a provable convergence guarantee. To validate the effectiveness of our methods in solving the inverse problems, we have designed comprehensive numerical experiments, providing empirical evidence of its efficacy.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Global Hilbert expansion for some non-relativistic kinetic equations
Authors:
Yuanjie Lei,
Shuangqian Liu,
Qinghua Xiao,
Huijiang Zhao
Abstract:
The Vlasov-Maxwell-Landau (VML) system and the Vlasov-Maxwell-Boltzmann (VMB) system are fundamental models in dilute collisional plasmas. In this paper, we are concerned with the hydrodynamic limits of both the VML and the non-cutoff VMB systems in the entire space. Our primary objective is to rigorously prove that, within the framework of Hilbert expansion, the unique classical solution of the V…
▽ More
The Vlasov-Maxwell-Landau (VML) system and the Vlasov-Maxwell-Boltzmann (VMB) system are fundamental models in dilute collisional plasmas. In this paper, we are concerned with the hydrodynamic limits of both the VML and the non-cutoff VMB systems in the entire space. Our primary objective is to rigorously prove that, within the framework of Hilbert expansion, the unique classical solution of the VML or non-cutoff VMB system converges globally over time to the smooth global solution of the Euler-Maxwell system as the Knudsen number approaches zero.
The core of our analysis hinges on deriving novel interplay energy estimates for the solutions of these two systems, concerning both a local Maxwellian and a global Maxwellian, respectively. Our findings address a problem in the hydrodynamic limit for Landau-type equations and non-cutoff Boltzmann-type equations with a magnetic field. Furthermore, the approach developed in this paper can be seamlessly extended to assess the validity of the Hilbert expansion for other types of kinetic equations.
△ Less
Submitted 26 November, 2023; v1 submitted 18 October, 2023;
originally announced October 2023.
-
Map-compatible decomposition of transport paths
Authors:
Qinglan Xia,
Haotian Sun
Abstract:
In the Monge-Kantorovich transport problem, the transport cost is expressed in terms of transport maps or transport plans, which play crucial roles there. A variant of the Monge-Kantorovich problem is the ramified (branching) transport problem that models branching transport systems via transport paths. In this article, we showed that any cycle-free transport path between two atomic measures can b…
▽ More
In the Monge-Kantorovich transport problem, the transport cost is expressed in terms of transport maps or transport plans, which play crucial roles there. A variant of the Monge-Kantorovich problem is the ramified (branching) transport problem that models branching transport systems via transport paths. In this article, we showed that any cycle-free transport path between two atomic measures can be decomposed into the sum of a map-compatible path and a plan-compatible path. Moreover, we showed that each stair-shaped transport path can be decomposed into the difference of two map-compatible transport paths.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Generalized Li-Yau's inequalities on Finsler measure spaces
Authors:
Qiaoling Xia
Abstract:
It is known that the Finsler heat flow is a nonlinear flow. This leads to the study of the linearized heat semigroup for the Finsler heat flow. In this paper, we first study its properties. By means of the linearized heat semigroup, we give two different kinds of generalized Li-Yau's inequalities for the positive solutions to the heat equation on $n$-dimensional complete Finsler measure spaces wit…
▽ More
It is known that the Finsler heat flow is a nonlinear flow. This leads to the study of the linearized heat semigroup for the Finsler heat flow. In this paper, we first study its properties. By means of the linearized heat semigroup, we give two different kinds of generalized Li-Yau's inequalities for the positive solutions to the heat equation on $n$-dimensional complete Finsler measure spaces with Ric$_N\geq K$ for some $N\in [n, \infty)$ and $K\in \mathbb R$. These inequalities almost recover all known Li-Yau's type inequalities on complete Finsler and Riemannian manifolds with lower Ricci curvature bounds. In particular, we obtain some new Li-Yau's type inequalities on complete Finsler and Riemannian measure spaces both in negative and positive Ricci curvature. As applications, we obtain two generalized Harnack inequalities. Finally we give several equivalent characterizations of Ric$_\infty\geq K (K\in \mathbb R)$ by the linearized heat semigroup approach and their applications.
△ Less
Submitted 1 July, 2023;
originally announced July 2023.
-
A Generalized Alternating Method for Bilevel Learning under the Polyak-Łojasiewicz Condition
Authors:
Quan Xiao,
Songtao Lu,
Tianyi Chen
Abstract:
Bilevel optimization has recently regained interest owing to its applications in emerging machine learning fields such as hyperparameter optimization, meta-learning, and reinforcement learning. Recent results have shown that simple alternating (implicit) gradient-based algorithms can match the convergence rate of single-level gradient descent (GD) when addressing bilevel problems with a strongly c…
▽ More
Bilevel optimization has recently regained interest owing to its applications in emerging machine learning fields such as hyperparameter optimization, meta-learning, and reinforcement learning. Recent results have shown that simple alternating (implicit) gradient-based algorithms can match the convergence rate of single-level gradient descent (GD) when addressing bilevel problems with a strongly convex lower-level objective. However, it remains unclear whether this result can be generalized to bilevel problems beyond this basic setting. In this paper, we first introduce a stationary metric for the considered bilevel problems, which generalizes the existing metric, for a nonconvex lower-level objective that satisfies the Polyak-Łojasiewicz (PL) condition. We then propose a Generalized ALternating mEthod for bilevel opTimization (GALET) tailored to BLO with convex PL LL problem and establish that GALET achieves an $ε$-stationary point for the considered problem within $\tilde{\cal O}(ε^{-1})$ iterations, which matches the iteration complexity of GD for single-level smooth nonconvex problems.
△ Less
Submitted 5 October, 2023; v1 submitted 4 June, 2023;
originally announced June 2023.
-
Partial Plateau's Problem with $H$-mass
Authors:
Enrique Alvarado,
Qinglan Xia
Abstract:
Classically, Plateau's problem asks to find a surface of the least area with a given boundary $B$. In this article, we investigate a version of Plateau's problem, where the boundary of an admissible surface is only required to partially span $B$. Our boundary data is given by a flat $(m-1)$-chain $B$ and a smooth compactly supported differential $(m-1)$-form $Φ$. We are interested in minimizing…
▽ More
Classically, Plateau's problem asks to find a surface of the least area with a given boundary $B$. In this article, we investigate a version of Plateau's problem, where the boundary of an admissible surface is only required to partially span $B$. Our boundary data is given by a flat $(m-1)$-chain $B$ and a smooth compactly supported differential $(m-1)$-form $Φ$. We are interested in minimizing $
\mathbf{M}(T) - \int_{\partial T} Φ$ over all $m$-dimensional rectifiable currents $T$ in $\mathbb{R}^n$ such that $\partial T$ is a subcurrent of the given boundary $B$. The existence of a rectifiable minimizer is proven with Federer and Fleming's compactness theorem. We generalize this problem by replacing the mass $\mathbf{M}$ with the $H$-mass of rectifiable currents. By minimizing over a larger class of objects, called scans with boundary, and by defining their $H$-mass as a type of lower-semicontinuous envelope over the $H$-mass of rectifiable currents, we prove an existence result for this problem by using Hardt and De Pauw's BV compactness theorem.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
Non-Iterative Solution for Coordinated Optimal Dispatch via Equivalent Projection-Part II: Method and Applications
Authors:
Zhenfei Tan,
Zheng Yan,
Haiwang Zhong,
Qing Xia
Abstract:
This two-part paper develops a non-iterative coordinated optimal dispatch framework, i.e., free of iterative information exchange, via the innovation of the equivalent projection (EP) theory. The EP eliminates internal variables from technical and economic operation constraints of the subsystem and obtains an equivalent model with reduced scale, which is the key to the non-iterative coordinated op…
▽ More
This two-part paper develops a non-iterative coordinated optimal dispatch framework, i.e., free of iterative information exchange, via the innovation of the equivalent projection (EP) theory. The EP eliminates internal variables from technical and economic operation constraints of the subsystem and obtains an equivalent model with reduced scale, which is the key to the non-iterative coordinated optimization. In Part II of this paper, a novel projection algorithm with the explicit error guarantee measured by the Hausdorff distance is proposed, which characterizes the EP model by the convex hull of its vertices. This algorithm is proven to yield a conservative approximation within the prespecified error tolerance and can obtain the exact EP model if the error tolerance is set to zero, which provides flexibility to balance the computation accuracy and effort. Applications of the EP-based coordinated dispatch are demonstrated based on the multi-area coordination and transmission-distribution coordination. Case studies with a wide range of system scales verify the superiority of the proposed projection algorithm in terms of computational efficiency and scalability, and validate the effectiveness of the EP-based coordinated dispatch in comparison with the joint optimization.
△ Less
Submitted 26 February, 2023;
originally announced February 2023.
-
Non-Iterative Solution for Coordinated Optimal Dispatch via Equivalent Projection-Part I: Theory
Authors:
Zhenfei Tan,
Zheng Yan,
Haiwang Zhong,
Qing Xia
Abstract:
Coordinated optimal dispatch is of utmost importance for the efficient and secure operation of hierarchically structured power systems. Conventional coordinated optimization methods, such as the Lagrangian relaxation and Benders decomposition, require iterative information exchange among subsystems. Iterative coordination methods have drawbacks including slow convergence, risk of oscillation and d…
▽ More
Coordinated optimal dispatch is of utmost importance for the efficient and secure operation of hierarchically structured power systems. Conventional coordinated optimization methods, such as the Lagrangian relaxation and Benders decomposition, require iterative information exchange among subsystems. Iterative coordination methods have drawbacks including slow convergence, risk of oscillation and divergence, and incapability of multi-level optimization problems. To this end, this paper aims at the non-iterative coordinated optimization method for hierarchical power systems. The theory of the equivalent projection (EP) is proposed, which makes external equivalence of the optimal dispatch model of the subsystem. Based on the EP theory, a coordinated optimization framework is developed, where each subsystem submits the EP model as a substitute for its original model to participate in the cross-system coordination. The proposed coordination framework is proven to guarantee the same optimality as the joint optimization, with additional benefits of avoiding iterative information exchange, protecting privacy, compatibility with practical dispatch scheme, and capability of multi-level problems.
△ Less
Submitted 26 February, 2023;
originally announced February 2023.
-
On Penalty-based Bilevel Gradient Descent Method
Authors:
Han Shen,
Quan Xiao,
Tianyi Chen
Abstract:
Bilevel optimization enjoys a wide range of applications in emerging machine learning and signal processing problems such as hyper-parameter optimization, image reconstruction, meta-learning, adversarial training, and reinforcement learning. However, bilevel optimization problems are traditionally known to be difficult to solve. Recent progress on bilevel algorithms mainly focuses on bilevel optim…
▽ More
Bilevel optimization enjoys a wide range of applications in emerging machine learning and signal processing problems such as hyper-parameter optimization, image reconstruction, meta-learning, adversarial training, and reinforcement learning. However, bilevel optimization problems are traditionally known to be difficult to solve. Recent progress on bilevel algorithms mainly focuses on bilevel optimization problems through the lens of the implicit-gradient method, where the lower-level objective is either strongly convex or unconstrained. In this work, we tackle a challenging class of bilevel problems through the lens of the penalty method. We show that under certain conditions, the penalty reformulation recovers the (local) solutions of the original bilevel problem. Further, we propose the penalty-based bilevel gradient descent (PBGD) algorithm and establish its finite-time convergence for the constrained bilevel problem with lower-level constraints yet without lower-level strong convexity. Experiments on synthetic and real datasets showcase the efficiency of the proposed PBGD algorithm.
△ Less
Submitted 6 January, 2025; v1 submitted 10 February, 2023;
originally announced February 2023.
-
Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization
Authors:
Quan Xiao,
Han Shen,
Wotao Yin,
Tianyi Chen
Abstract:
Stochastic bilevel optimization, which captures the inherent nested structure of machine learning problems, is gaining popularity in many recent applications. Existing works on bilevel optimization mostly consider either unconstrained problems or constrained upper-level problems. This paper considers the stochastic bilevel optimization problems with equality constraints both in the upper and lower…
▽ More
Stochastic bilevel optimization, which captures the inherent nested structure of machine learning problems, is gaining popularity in many recent applications. Existing works on bilevel optimization mostly consider either unconstrained problems or constrained upper-level problems. This paper considers the stochastic bilevel optimization problems with equality constraints both in the upper and lower levels. By leveraging the special structure of the equality constraints problem, the paper first presents an alternating implicit projected SGD approach and establishes the $\tilde{\cal O}(ε^{-2})$ sample complexity that matches the state-of-the-art complexity of ALSET \citep{chen2021closing} for unconstrained bilevel problems. To further save the cost of projection, the paper presents two alternating implicit projection-efficient SGD approaches, where one algorithm enjoys the $\tilde{\cal O}(ε^{-2}/T)$ upper-level and $\tilde{\cal O}(ε^{-1.5}/T^{\frac{3}{4}})$ lower-level projection complexity with ${\cal O}(T)$ lower-level batch size, and the other one enjoys $\tilde{\cal O}(ε^{-1.5})$ upper-level and lower-level projection complexity with ${\cal O}(1)$ batch size. Application to federated bilevel optimization has been presented to showcase the empirical performance of our algorithms. Our results demonstrate that equality-constrained bilevel optimization with strongly-convex lower-level problems can be solved as efficiently as stochastic single-level optimization problems.
△ Less
Submitted 12 February, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
Local-basis Difference Potentials Method for elliptic PDEs in complex geometry
Authors:
Qing Xia
Abstract:
We develop efficient and high-order accurate finite difference methods for elliptic partial differential equations in complex geometry in the Difference Potentials framework. The main novelty of the developed schemes is the use of local basis functions defined at near-boundary grid points. The use of local basis functions allow unified numerical treatment of (i) explicitly and implicitly defined g…
▽ More
We develop efficient and high-order accurate finite difference methods for elliptic partial differential equations in complex geometry in the Difference Potentials framework. The main novelty of the developed schemes is the use of local basis functions defined at near-boundary grid points. The use of local basis functions allow unified numerical treatment of (i) explicitly and implicitly defined geometry; (ii) geometry of more complicated shapes, such as those with corners, multi-connected domain, etc; and (iii) different types of boundary conditions. This geometrically flexible approach is complementary to the classical difference potentials method using global basis functions, especially in the case where a large number of global basis functions are needed to resolve the boundary, or where the optimal global basis functions are difficult to obtain. Fast Poisson solvers based on FFT are employed for standard centered finite difference stencils regardless of the designed order of accuracy. Proofs of convergence of difference potentials in maximum norm are outlined both theoretically and numerically.
△ Less
Submitted 11 November, 2022;
originally announced November 2022.
-
Bounding the diameter and eigenvalues of amply regular graphs via Lin-Lu-Yau curvature
Authors:
Xueping Huang,
Shiping Liu,
Qing Xia
Abstract:
An amply regular graph is a regular graph such that any two adjacent vertices have $α$ common neighbors and any two vertices with distance $2$ have $β$ common neighbors. We prove a sharp lower bound estimate for the Lin--Lu--Yau curvature of any amply regular graph with girth $3$ and $β>α$. The proof involves new ideas relating discrete Ricci curvature with local matching properties: This includes…
▽ More
An amply regular graph is a regular graph such that any two adjacent vertices have $α$ common neighbors and any two vertices with distance $2$ have $β$ common neighbors. We prove a sharp lower bound estimate for the Lin--Lu--Yau curvature of any amply regular graph with girth $3$ and $β>α$. The proof involves new ideas relating discrete Ricci curvature with local matching properties: This includes a novel construction of a regular bipartite graph from the local structure and related distance estimates. As a consequence, we obtain sharp diameter and eigenvalue bounds for amply regular graphs.
△ Less
Submitted 11 June, 2024; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Hilbert expansion for kinetic equations with non-relativistic Coulomb collision
Authors:
Yuanjie Lei,
Shuangqian Liu,
Qinghua Xiao,
Huijiang Zhao
Abstract:
In this paper, we study the hydrodynamic limits of both the Landau equation and the Vlasov-Maxwell-Landau system in the whole space. Our main purpose is two-fold: the first one is to give a rigorous derivation of the compressible Euler equations from the Landau equation via the Hilbert expansion; while the second one is to prove, still in the setting of Hilbert expansion, that the unique classical…
▽ More
In this paper, we study the hydrodynamic limits of both the Landau equation and the Vlasov-Maxwell-Landau system in the whole space. Our main purpose is two-fold: the first one is to give a rigorous derivation of the compressible Euler equations from the Landau equation via the Hilbert expansion; while the second one is to prove, still in the setting of Hilbert expansion, that the unique classical solution of the Vlasov-Maxwell-Landau system converges, which is shown to be globally in time, to the resulting global smooth solution of the Euler-Maxwell system, as the Knudsen number goes to zero. The main ingredient of our analysis is to derive some novel interplay energy estimates on the solutions of the Landau equation and the Vlasov-Maxwell-Landau system which are small perturbations of both a local Maxwellian and a global Maxwellian, respectively. Our result solves an open problem in the hydrodynamic limit for the Landau-type equations with Coulomb potential and the approach developed in this paper can seamlessly be used to deal with the problem on the validity of the Hilbert expansion for other types of kinetic equations.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Hilbert Expansion for Coulomb Collisional Kinetic Models
Authors:
Zhimeng Ouyang,
Lei Wu,
Qinghua Xiao
Abstract:
The relativistic Vlasov-Maxwell-Landau (r-VML) system and the relativistic Landau equation (r-LAN) are fundamental models that describe the dynamics of an electron gas. In this paper, we introduce a novel weighted energy method and establish the validity of the Hilbert expansion for the r-VML system and r-LAN equation. As the Knudsen number shrinks to zero, we rigorously demonstrate the relativist…
▽ More
The relativistic Vlasov-Maxwell-Landau (r-VML) system and the relativistic Landau equation (r-LAN) are fundamental models that describe the dynamics of an electron gas. In this paper, we introduce a novel weighted energy method and establish the validity of the Hilbert expansion for the r-VML system and r-LAN equation. As the Knudsen number shrinks to zero, we rigorously demonstrate the relativistic Euler-Maxwell limit and relativistic Euler limit, respectively. This successfully resolves the long-standing open problem regarding the hydrodynamic limits of Landau-type equations.
△ Less
Submitted 7 June, 2023; v1 submitted 30 June, 2022;
originally announced July 2022.
-
Lazy Queries Can Reduce Variance in Zeroth-order Optimization
Authors:
Quan Xiao,
Qing Ling,
Tianyi Chen
Abstract:
A major challenge of applying zeroth-order (ZO) methods is the high query complexity, especially when queries are costly. We propose a novel gradient estimation technique for ZO methods based on adaptive lazy queries that we term as LAZO. Different from the classic one-point or two-point gradient estimation methods, LAZO develops two alternative ways to check the usefulness of old queries from pre…
▽ More
A major challenge of applying zeroth-order (ZO) methods is the high query complexity, especially when queries are costly. We propose a novel gradient estimation technique for ZO methods based on adaptive lazy queries that we term as LAZO. Different from the classic one-point or two-point gradient estimation methods, LAZO develops two alternative ways to check the usefulness of old queries from previous iterations, and then adaptively reuses them to construct the low-variance gradient estimates. We rigorously establish that through judiciously reusing the old queries, LAZO can reduce the variance of stochastic gradient estimates so that it not only saves queries per iteration but also achieves the regret bound for the symmetric two-point method. We evaluate the numerical performance of LAZO, and demonstrate the low-variance property and the performance gain of LAZO in both regret and query complexity relative to several existing ZO methods. The idea of LAZO is general, and can be applied to other variants of ZO methods.
△ Less
Submitted 14 June, 2022;
originally announced June 2022.
-
Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning
Authors:
Momin Abbas,
Quan Xiao,
Lisha Chen,
Pin-Yu Chen,
Tianyi Chen
Abstract:
Model-agnostic meta learning (MAML) is currently one of the dominating approaches for few-shot meta-learning. Albeit its effectiveness, the optimization of MAML can be challenging due to the innate bilevel problem structure. Specifically, the loss landscape of MAML is much more complex with possibly more saddle points and local minimizers than its empirical risk minimization counterpart. To addres…
▽ More
Model-agnostic meta learning (MAML) is currently one of the dominating approaches for few-shot meta-learning. Albeit its effectiveness, the optimization of MAML can be challenging due to the innate bilevel problem structure. Specifically, the loss landscape of MAML is much more complex with possibly more saddle points and local minimizers than its empirical risk minimization counterpart. To address this challenge, we leverage the recently invented sharpness-aware minimization and develop a sharpness-aware MAML approach that we term Sharp-MAML. We empirically demonstrate that Sharp-MAML and its computation-efficient variant can outperform the plain-vanilla MAML baseline (e.g., $+3\%$ accuracy on Mini-Imagenet). We complement the empirical study with the convergence rate analysis and the generalization bound of Sharp-MAML. To the best of our knowledge, this is the first empirical and theoretical study on sharpness-aware minimization in the context of bilevel learning. The code is available at https://github.com/mominabbass/Sharp-MAML.
△ Less
Submitted 14 August, 2022; v1 submitted 8 June, 2022;
originally announced June 2022.
-
Hilbert Expansion for the Relativistic Landau Equation
Authors:
Zhimeng Ouyang,
Lei Wu,
Qinghua Xiao
Abstract:
In this paper, we study the local-in-time validity of the Hilbert expansion for the relativistic Landau equation. We justify that solutions of the relativistic Landau equation converge to small classical solutions of the limiting relativistic Euler equations as the Knudsen number shrinks to zero in a weighted Sobolev space. The key difficulty comes from the temporal and spatial derivatives of the…
▽ More
In this paper, we study the local-in-time validity of the Hilbert expansion for the relativistic Landau equation. We justify that solutions of the relativistic Landau equation converge to small classical solutions of the limiting relativistic Euler equations as the Knudsen number shrinks to zero in a weighted Sobolev space. The key difficulty comes from the temporal and spatial derivatives of the local Maxwellian, which produce momentum growth terms and are uncontrollable by the standard $L^2$-based energy and dissipation. We introduce novel time-dependent weight functions to generate additional dissipation terms to suppress the large momentum. The argument relies on a hierarchy of energy-dissipation structures with or without weights. As far as the authors are aware of, this is the first result of the Hilbert expansion for the Landau-type equation.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
General 2-path Problem
Authors:
Qianghui Xiao
Abstract:
In this paper, some preliminaries about signal flow graph, linear time-invariant system on F(z) and computational complexity are first introduced in detail. In order to synthesize the necessary and sufficient condition on F(z) for a general 2-path problem, the sufficient condition on F(z) or R and necessary conditions on F(z) for a general 2-path problem are secondly analyzed respectively. Moreove…
▽ More
In this paper, some preliminaries about signal flow graph, linear time-invariant system on F(z) and computational complexity are first introduced in detail. In order to synthesize the necessary and sufficient condition on F(z) for a general 2-path problem, the sufficient condition on F(z) or R and necessary conditions on F(z) for a general 2-path problem are secondly analyzed respectively. Moreover, an equivalent sufficient and necessary condition on R whether there exists a general 2-path is deduced in detail. Finally, the computational complexity of the algorithm for this equivalent sufficient and necessary condition is introduced so that it means that the general 2-path problem is a P problem.
△ Less
Submitted 22 April, 2023; v1 submitted 30 January, 2022;
originally announced January 2022.
-
Solution to Morgan Problem
Authors:
Qianghui Xiao
Abstract:
In this paper, some preliminaries about Morgan problem, signal flow graph and controllable linear time-invariant standard system are first introduced in detail. In order to synthesize the necessary and sufficient condition for decoupling system, the first and second necessary conditions, and a sufficient condition for decoupling a controllable linear time-invariant system are secondly analyzed res…
▽ More
In this paper, some preliminaries about Morgan problem, signal flow graph and controllable linear time-invariant standard system are first introduced in detail. In order to synthesize the necessary and sufficient condition for decoupling system, the first and second necessary conditions, and a sufficient condition for decoupling a controllable linear time-invariant system are secondly analyzed respectively. Therefore, the nonregular static state feedback expression for decoupling system that ensures the internal stability of the uncontrollable subsystem of the decoupled system at the same time is deduced. Then the pole assignment for the controllable subsystem of a decoupled system while ensuring its decoupled state is introduced. Finally, two examples combined with their corresponding signal flow graphs show the simplicity and feasibility of the necessary and sufficient condition for decoupling system described in the paper.
△ Less
Submitted 11 April, 2022; v1 submitted 8 January, 2022;
originally announced January 2022.
-
Approximating continuous function on orbit spaces
Authors:
Qianqian Xia
Abstract:
In this paper we study a subclass of subcartesian space-the orbit space of a proper action of Lie group on smooth manifold. We show that continuous functions on orbit space can be approximated by smooth functions.
In this paper we study a subclass of subcartesian space-the orbit space of a proper action of Lie group on smooth manifold. We show that continuous functions on orbit space can be approximated by smooth functions.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
On controlled invariance for regular distributions
Authors:
Qianqian Xia
Abstract:
This paper considers the problem of controlled invariance of involutive regular distribution, both for smooth and real analytic cases. After a review of some existing work, a precise formulation of the problem of local and global controlled invariance of involutive regular distributions for both affine control systems and affine distributions is introduced. A complete characterization for local co…
▽ More
This paper considers the problem of controlled invariance of involutive regular distribution, both for smooth and real analytic cases. After a review of some existing work, a precise formulation of the problem of local and global controlled invariance of involutive regular distributions for both affine control systems and affine distributions is introduced. A complete characterization for local controlled invariance of involutive regular distributions for affine control systems is presented. A geometric interpretation for this characterization is provided. A result on local controlled invariance for real analytic affine distribution is given. Then we investigate conditions that allow passages from local controlled invariance to global controlled invariance, for both smooth and real analytic affine distributions. We clarify existing results in the literature. Finally, for manifolds with a symmetry Lie group action, the problem of global controlled invariance is considered.
△ Less
Submitted 16 November, 2021;
originally announced November 2021.
-
Smooth distributions on subcartesian spaces are globally finitely generated
Authors:
Qianqian Xia
Abstract:
We prove that a connected subcartesian space admits embedding in a Euclidean space. The Whitney Embedding Theorem is then stated as a corollary of our result. Based on the above result together with the theory of distribution on smooth manifolds, we show that smooth generalized distributions on connected subcartesian spaces are globally finitely generated. We also show that smooth generalized subb…
▽ More
We prove that a connected subcartesian space admits embedding in a Euclidean space. The Whitney Embedding Theorem is then stated as a corollary of our result. Based on the above result together with the theory of distribution on smooth manifolds, we show that smooth generalized distributions on connected subcartesian spaces are globally finitely generated. We also show that smooth generalized subbundles of vector bundles on connected subcartesian spaces are globally finitely generated.
△ Less
Submitted 16 November, 2021;
originally announced November 2021.
-
On the Curvature of Metric Triples
Authors:
Qinglan Xia
Abstract:
In this article, we introduce a notion of curvature, denoted by $ k_X(T)$, for a metric triple $T$ inside a (possibly discrete) metric space $X$. Such a notion enables us to consider curvature information of any metric space, including discrete metric spaces such as those generated by scientific data. To define the notion, we employ the information consisting of side lengths of the triple as well…
▽ More
In this article, we introduce a notion of curvature, denoted by $ k_X(T)$, for a metric triple $T$ inside a (possibly discrete) metric space $X$. Such a notion enables us to consider curvature information of any metric space, including discrete metric spaces such as those generated by scientific data. To define the notion, we employ the information consisting of side lengths of the triple as well as the minimum total distance from vertices of the triple to points of the metric space. Those information provides us a unique number $k_X(T)$ such that the triple $T$ can be isometrically embedded into the model space $M_k^2$ up to $k\le k_X(T)$. The value $k_X(T)$ agrees with the usual curvature when $X$ is a convex subset of a model space. We also show that the curvature $k_X(T)$ of any metric triple $T$ inside a $CAT(k)$ space is bounded above by $k$.
△ Less
Submitted 3 September, 2021;
originally announced September 2021.
-
High-order accurate schemes for Maxwell's equations with nonlinear active media and material interfaces
Authors:
Qing Xia,
Jeffrey W. Banks,
William D. Henshaw,
Alexander V. Kildishev,
Gregor Kovačič,
Ludmila J. Prokopeva,
Donald W. Schwendeman
Abstract:
We describe a fourth-order accurate finite-difference time-domain scheme for solving dispersive Maxwell's equations with nonlinear multi-level carrier kinetics models. The scheme is based on an efficient single-step three time-level modified equation approach for Maxwell's equations in second-order form for the electric field coupled to ODEs for the polarization vectors and population densities of…
▽ More
We describe a fourth-order accurate finite-difference time-domain scheme for solving dispersive Maxwell's equations with nonlinear multi-level carrier kinetics models. The scheme is based on an efficient single-step three time-level modified equation approach for Maxwell's equations in second-order form for the electric field coupled to ODEs for the polarization vectors and population densities of the atomic levels. The resulting scheme has a large CFL-one time-step. Curved interfaces between different materials are accurately treated with curvilinear grids and compatibility conditions. A novel hierarchical modified equation approach leads to an explicit scheme that does not require any nonlinear iterations. The hierarchical approach at interfaces leads to local updates at the interface with no coupling in the tangential directions. Complex geometry is treated with overset grids. Numerical stability is maintained using high-order upwind dissipation designed for Maxwell's equations in second-order form. The scheme is carefully verified for a number of two and three-dimensional problems. The resulting numerical model with generalized dispersion and arbitrary nonlinear multi-level system can be used for many plasmonic applications such as for ab initio time domain modeling of nonlinear engineered materials for nanolasing applications, where nano-patterned plasmonic dispersive arrays are used to enhance otherwise weak nonlinearity in the active media.
△ Less
Submitted 21 August, 2021;
originally announced August 2021.
-
A Single-Timescale Method for Stochastic Bilevel Optimization
Authors:
Tianyi Chen,
Yuejiao Sun,
Quan Xiao,
Wotao Yin
Abstract:
Stochastic bilevel optimization generalizes the classic stochastic optimization from the minimization of a single objective to the minimization of an objective function that depends the solution of another optimization problem. Recently, stochastic bilevel optimization is regaining popularity in emerging machine learning applications such as hyper-parameter optimization and model-agnostic meta lea…
▽ More
Stochastic bilevel optimization generalizes the classic stochastic optimization from the minimization of a single objective to the minimization of an objective function that depends the solution of another optimization problem. Recently, stochastic bilevel optimization is regaining popularity in emerging machine learning applications such as hyper-parameter optimization and model-agnostic meta learning. To solve this class of stochastic optimization problems, existing methods require either double-loop or two-timescale updates, which are sometimes less efficient. This paper develops a new optimization method for a class of stochastic bilevel problems that we term Single-Timescale stochAstic BiLevEl optimization (STABLE) method. STABLE runs in a single loop fashion, and uses a single-timescale update with a fixed batch size. To achieve an $ε$-stationary point of the bilevel problem, STABLE requires ${\cal O}(ε^{-2})$ samples in total; and to achieve an $ε$-optimal solution in the strongly convex case, STABLE requires ${\cal O}(ε^{-1})$ samples. To the best of our knowledge, this is the first bilevel optimization algorithm achieving the same order of sample complexity as the stochastic gradient descent method for the single-level stochastic optimization.
△ Less
Submitted 30 March, 2022; v1 submitted 9 February, 2021;
originally announced February 2021.
-
Topological $R$-pressure and topological pressure of free semigroup actions
Authors:
Yinan Zheng,
Qian Xiao
Abstract:
In this paper we introduce the definition of topological $r$-pressure of free semigroup actions on compact metric space and provide some properties of it. Through skew-product transformation into a medium, we can obtain the following two main results. 1. We extend the result that the topological pressure is the limit of topological $r$-pressure in\cite{C} to free semigroup actions ($r\to 0$). 2. L…
▽ More
In this paper we introduce the definition of topological $r$-pressure of free semigroup actions on compact metric space and provide some properties of it. Through skew-product transformation into a medium, we can obtain the following two main results. 1. We extend the result that the topological pressure is the limit of topological $r$-pressure in\cite{C} to free semigroup actions ($r\to 0$). 2. Let $f_i,$ $i=0, 1, \cdots, m-1$, be homeomorphisms on a compact metric space. For any continuous function, we verify that the topological pressure of $f_0, \cdots, f_{m-1}$ equals the topological pressure of $f_0^{-1}, \cdots, f_{m-1}^{-1}.$
△ Less
Submitted 31 December, 2020;
originally announced December 2020.
-
Topological pressure of free semigroup actions for non-compact sets and Bowen's equation, II
Authors:
Qian Xiao,
Dongkui Ma
Abstract:
Inspired to the work of Ma and Wu\cite{Ma} and Climenhaga\cite{Climenhaga}, we introduce the new nation of topological pressure of a semigroup of maps by using the Carathéodory-Pesin structure (C-P structure) with respect to arbitrary subset in this paper. Moreover, by Bowen's equation, we characterize the Hausdorff dimension of an arbitrary subset, where the points of the subset have the positive…
▽ More
Inspired to the work of Ma and Wu\cite{Ma} and Climenhaga\cite{Climenhaga}, we introduce the new nation of topological pressure of a semigroup of maps by using the Carathéodory-Pesin structure (C-P structure) with respect to arbitrary subset in this paper. Moreover, by Bowen's equation, we characterize the Hausdorff dimension of an arbitrary subset, where the points of the subset have the positive lower Lyapunov exponents and satisfy a so called tempered contraction condition.
△ Less
Submitted 20 December, 2020;
originally announced December 2020.
-
Ramified optimal transportation with payoff on the boundary
Authors:
Qinglan Xia,
Shaofeng Xu
Abstract:
This paper studies a variant of ramified/branched optimal transportation problems. Given the distributions of production capacities and market sizes, a firm looks for an allocation of productions over factories, a distribution of sales across markets, and a transport path that delivers the product to maximize its profit. Mathematically, given any two measures $μ$ and $ν$ on $X$, and a payoff funct…
▽ More
This paper studies a variant of ramified/branched optimal transportation problems. Given the distributions of production capacities and market sizes, a firm looks for an allocation of productions over factories, a distribution of sales across markets, and a transport path that delivers the product to maximize its profit. Mathematically, given any two measures $μ$ and $ν$ on $X$, and a payoff function $h$, the planner wants to minimize $\mathbf{M}_{α}(T)-\int_{X}hd(\partial T)$ among all transport paths $T$ from $\tildeμ$ to $\tildeν$ with $\tildeμ\leq μ$ and $\tildeν\leq ν$, where $\mathbf{M}_{α}$ is the standard cost functional used in ramified transportation. After proving the existence result, we provide a characterization of the boundary measures of the optimal solution. They turn out to be the original measures restricted on some Borel subsets up to a Delta mass on each connected component. Our analysis further finds that as the boundary payoff increases, the corresponding solution of the current problem converges to an optimal transport path, which is the solution of the standard ramified transportation.
△ Less
Submitted 30 August, 2021; v1 submitted 16 September, 2020;
originally announced September 2020.
-
Topological pressure of free semigroup actions for non-compact sets and Bowen's equation
Authors:
Qian Xiao,
Dongkui Ma
Abstract:
Climenhaga showed the applicability of Bowen equation to arbitrary subset of a compact metric space. The main purpose of this paper is to generalize the main result of Climenhaga to free semigroup actions for non-compact sets. We introduce the notions of the topological pressure and lower and upper capacity topological pressure of a free semigroup action for non-compact sets by using the Caratheod…
▽ More
Climenhaga showed the applicability of Bowen equation to arbitrary subset of a compact metric space. The main purpose of this paper is to generalize the main result of Climenhaga to free semigroup actions for non-compact sets. We introduce the notions of the topological pressure and lower and upper capacity topological pressure of a free semigroup action for non-compact sets by using the Caratheodory- Pesin structure. Some properties of these notions are given, followed by three main results. One is to characterize the Hausdorff dimension of arbitrary subset in term of the topological pressure by Bowen equation, whose points have the positive lower Lyapunov exponents and satisfy a tempered contraction condition, the other is the estimation of topological pressure of a free semigroup action on arbitrary subset of X and the third is the relationship between the upper capacity topological pressure of a skew-product transformation and the upper capacity topological pressure of a free semigroup action with respect to arbitrary subset.
△ Less
Submitted 13 July, 2020;
originally announced July 2020.
-
The existence of minimizers for an isoperimetric problem with Wasserstein penalty term in unbounded domains
Authors:
Qinglan Xia,
Bohan Zhou
Abstract:
In this article, we consider the (double) minimization problem $$\min\left\{P(E;Ω)+λW_p(E,F):~E\subseteqΩ,~F\subseteq \mathbb{R}^d,~\lvert E\cap F\rvert=0,~ \lvert E\rvert=\lvert F\rvert=1\right\},$$ where $p\geqslant 1$, $Ω$ is a (possibly unbounded) domain in $\mathbb{R}^d$, $P(E;Ω)$ denotes the relative perimeter of $E$ in $Ω$ and $W_p$ denotes the $p$-Wasserstein distance. When $Ω$ is unbounde…
▽ More
In this article, we consider the (double) minimization problem $$\min\left\{P(E;Ω)+λW_p(E,F):~E\subseteqΩ,~F\subseteq \mathbb{R}^d,~\lvert E\cap F\rvert=0,~ \lvert E\rvert=\lvert F\rvert=1\right\},$$ where $p\geqslant 1$, $Ω$ is a (possibly unbounded) domain in $\mathbb{R}^d$, $P(E;Ω)$ denotes the relative perimeter of $E$ in $Ω$ and $W_p$ denotes the $p$-Wasserstein distance. When $Ω$ is unbounded and $d\geqslant 3$, it is an open problem proposed by Buttazzo, Carlier and Laborde in the paper ON THE WASSERSTEIN DISTANCE BETWEEN MUTUALLY SINGULAR MEASURES. We prove the existence of minimizers to this problem when $\frac{1}{p}+\frac{2}{d}>1$, $Ω=\mathbb{R}^d$ and $λ$ is sufficiently small.
△ Less
Submitted 17 February, 2020;
originally announced February 2020.
-
Global Hilbert expansion for the relativistic Vlasov-Maxwell-Boltzmann system
Authors:
Yan Guo,
Qinghua Xiao
Abstract:
Consider the relativistic Vlasov-Maxwell-Boltzmann system describing the dynamics of an electron gas in the presence of a fixed ion background. Thanks to recent works \cite{Germain-Masmoudi-ASENS-2014, Guo-Ionescu-Pausader-JMP-2014} and \cite{Deng-Ionescu-Pausader-ARMA-2017}, we establish the global-in-time validity of its Hilbert expansion and derive the limiting relativistic Euler-Maxwell system…
▽ More
Consider the relativistic Vlasov-Maxwell-Boltzmann system describing the dynamics of an electron gas in the presence of a fixed ion background. Thanks to recent works \cite{Germain-Masmoudi-ASENS-2014, Guo-Ionescu-Pausader-JMP-2014} and \cite{Deng-Ionescu-Pausader-ARMA-2017}, we establish the global-in-time validity of its Hilbert expansion and derive the limiting relativistic Euler-Maxwell system as the mean free path goes to zero. Our method is based on the $L^2-L^{\infty}$ framework and the Glassey-Strauss Representation of the electromagnetic field, with auxiliary $H^1$ estimate and $W^{1,\infty}$ estimates to control the characteristic curves and corresponding $L^{\infty}$ norm.
△ Less
Submitted 10 January, 2023; v1 submitted 31 January, 2020;
originally announced January 2020.
-
Transmission Expansion Planning with Seasonal Network Optimization
Authors:
Xingpeng Li,
Qianxue Xia
Abstract:
Transmission expansion planning (TEP) is critical for the power grid to meet fast growing demand in the future. Traditional TEP model does not utilize the flexibility in the transmission network that is considered as static assets. However, as the load profile may have different seasonal patterns, the optimal network configuration could be very different for different seasons in the planning horiz…
▽ More
Transmission expansion planning (TEP) is critical for the power grid to meet fast growing demand in the future. Traditional TEP model does not utilize the flexibility in the transmission network that is considered as static assets. However, as the load profile may have different seasonal patterns, the optimal network configuration could be very different for different seasons in the planning horizon. Therefore, this paper proposes to incorporate seasonal network optimization (SNO) into the traditional TEP model. SNO dynamically optimizes the network for each season of each planning epoch. Two TEP-SNO models are proposed to investigate the benefits of optimizing the status of (i) existing branches, and (ii) existing and new branches, respectively. Numerical simulations demonstrate the effectiveness of the proposed TEP-SNO models. It is shown that SNO can improve system operational efficiency, defer investment of new transmission elements, and reduce the total cost.
△ Less
Submitted 29 November, 2019;
originally announced November 2019.
-
Stochastic Optimal Power Flow with Network Reconfiguration: Congestion Management and Facilitating Grid Integration of Renewables
Authors:
Xingpeng Li,
Qianxue Xia
Abstract:
There has been a significant growth of variable renewable generation in the power grid today. However, the industry still uses deterministic optimization to model and solve the optimal power flow (OPF) problem for real-time generation dispatch that ignores the uncertainty associated with intermittent renewable power. Thus, it is necessary to study stochastic OPF (SOPF) that can better handle uncer…
▽ More
There has been a significant growth of variable renewable generation in the power grid today. However, the industry still uses deterministic optimization to model and solve the optimal power flow (OPF) problem for real-time generation dispatch that ignores the uncertainty associated with intermittent renewable power. Thus, it is necessary to study stochastic OPF (SOPF) that can better handle uncertainty since SOPF is able to consider the probabilistic forecasting information of intermittent renewables. Transmission network congestion is one of the main reasons for renewable energy curtailment. Prior efforts in the literature show that utilizing transmission network reconfiguration can relieve congestion and resolve congestion-induced issues. This paper enhances SOPF by incorporating network reconfiguration into the dispatch model. Numerical simulations show that renewable curtailment can be avoided with the proposed network reconfiguration scheme that relieves transmission congestion in post-contingency situations. It is also shown that network reconfiguration can substantially reduce congestion cost, especially the contingency-case congestion cost.
△ Less
Submitted 29 November, 2019;
originally announced November 2019.
-
Morphisms of tautological control systems
Authors:
Qianqian Xia
Abstract:
In this paper, we investigate morphisms of tautological control systems. Given a tautological control system $\mathfrak{H}$ on the manifold N and a mapping $Φ: M \to N$, we study existence of tautological control system $\mathfrak{G}$ on the manifold $M$ such that there exists a trajectory-preserving morphism $(Φ, Φ^ #)$ from $\mathfrak{G}$ to $\mathfrak{H}$. Sufficient conditions are given such t…
▽ More
In this paper, we investigate morphisms of tautological control systems. Given a tautological control system $\mathfrak{H}$ on the manifold N and a mapping $Φ: M \to N$, we study existence of tautological control system $\mathfrak{G}$ on the manifold $M$ such that there exists a trajectory-preserving morphism $(Φ, Φ^ #)$ from $\mathfrak{G}$ to $\mathfrak{H}$. Sufficient conditions are given such that reachability of $\mathfrak{H}$ implies the reachability of $\mathfrak{G}$. Correspondence between the notion of lifting ordinary control systems and morphisms of tautological control systems are examined. We give an application of the above results to the class of second-order type control systems, where the special structure of second-order type leads to additional results.
△ Less
Submitted 8 August, 2019;
originally announced August 2019.
-
Quotients of affine connection control systems
Authors:
Qianqian Xia,
Zhiyong Geng
Abstract:
In this paper, we investigate the existence of a subclass of quotients of affine connection control systems, which preserve the mechanical structures. Both local and global sufficient and necessary conditions are given for the geodesically accessible affine connection control systems such that they can admit this subclass of quotients. The structural properties of the quotient map and the quotient…
▽ More
In this paper, we investigate the existence of a subclass of quotients of affine connection control systems, which preserve the mechanical structures. Both local and global sufficient and necessary conditions are given for the geodesically accessible affine connection control systems such that they can admit this subclass of quotients. The structural properties of the quotient map and the quotient mechanical control system are discussed.
△ Less
Submitted 5 August, 2019;
originally announced August 2019.
-
m-Order Time Optimal Control Synthesis Function of Discrete System
Authors:
Qianghui Xiao,
Yang Zhang
Abstract:
In this paper, first of all, we introduce the basic concepts of generating function in combinatorics and some combinatorial identities. In order to facilitate the understanding of m-order time optimal control synthesis function of discrete system (referred as m-order synthesis function), secondly, we introduce the derivation process and control ideas of 2nd-order synthesis function, and then deduc…
▽ More
In this paper, first of all, we introduce the basic concepts of generating function in combinatorics and some combinatorial identities. In order to facilitate the understanding of m-order time optimal control synthesis function of discrete system (referred as m-order synthesis function), secondly, we introduce the derivation process and control ideas of 2nd-order synthesis function, and then deduce in detail the m-order synthesis function by means of generating function. By use of the m-order tracking-form synthesis function with filter factor, the methods of signal extraction and its predictive compensation are presented in this paper, and their immunity and effectiveness are verified by numerical simulation.
△ Less
Submitted 14 February, 2022; v1 submitted 16 May, 2019;
originally announced May 2019.
-
Difference Potentials Method for Models with Dynamic Boundary Conditions and Bulk-Surface Problems
Authors:
Yekaterina Epshteyn,
Qing Xia
Abstract:
In this work, we consider parabolic models with dynamic boundary conditions and parabolic bulk-surface problems in 3D. Such partial differential equations based models describe phenomena that happen both on the surface and in the bulk/domain. These problems may appear in many applications, ranging from cell dynamics in biology, to grain growth models in polycrystalline materials. Using Difference…
▽ More
In this work, we consider parabolic models with dynamic boundary conditions and parabolic bulk-surface problems in 3D. Such partial differential equations based models describe phenomena that happen both on the surface and in the bulk/domain. These problems may appear in many applications, ranging from cell dynamics in biology, to grain growth models in polycrystalline materials. Using Difference Potentials framework, we develop novel numerical algorithms for the approximation of the problems. The constructed algorithms efficiently and accurately handle the coupling of the models in the bulk and on the surface, approximate 3D irregular geometry in the bulk by the use of only Cartesian meshes, employ Fast Poisson Solvers, and utilize spectral approximation on the surface. Several numerical tests are given to illustrate the robustness of the developed numerical algorithms.
△ Less
Submitted 5 February, 2020; v1 submitted 17 April, 2019;
originally announced April 2019.
-
Efficient Numerical Algorithms based on Difference Potentials for Chemotaxis Systems in 3D
Authors:
Yekaterina Epshteyn,
Qing Xia
Abstract:
In this work, we propose efficient and accurate numerical algorithms based on Difference Potentials Method for numerical solution of chemotaxis systems and related models in 3D. The developed algorithms handle 3D irregular geometry with the use of only Cartesian meshes and employ Fast Poisson Solvers. In addition, to further enhance computational efficiency of the methods, we design a Difference-P…
▽ More
In this work, we propose efficient and accurate numerical algorithms based on Difference Potentials Method for numerical solution of chemotaxis systems and related models in 3D. The developed algorithms handle 3D irregular geometry with the use of only Cartesian meshes and employ Fast Poisson Solvers. In addition, to further enhance computational efficiency of the methods, we design a Difference-Potentials-based domain decomposition approach which allows mesh adaptivity and easy parallelization of the algorithm in space. Extensive numerical experiments are presented to illustrate the accuracy, efficiency and robustness of the developed numerical algorithms.
△ Less
Submitted 9 February, 2019; v1 submitted 8 November, 2018;
originally announced November 2018.