-
Mathematical Modeling of Carbon Dioxide Emissions with GDP Linkage: Sensitivity Analysis and Optimal Control Strategy
Authors:
Hua Liu,
Zhuoma Gangji,
Yumei Wei,
Jianhua Ye,
Gang Ma
Abstract:
Climate change and global warming are among the most significant issues that humanity is currently facing, and also among the issues that pose the greatest threats to all mankind. These issues are primarily driven by abnormal increases in greenhouse gas concentrations. Mathematical modeling serves as a powerful approach to analyze the dynamic patterns of atmospheric carbon dioxide. In this paper,…
▽ More
Climate change and global warming are among the most significant issues that humanity is currently facing, and also among the issues that pose the greatest threats to all mankind. These issues are primarily driven by abnormal increases in greenhouse gas concentrations. Mathematical modeling serves as a powerful approach to analyze the dynamic patterns of atmospheric carbon dioxide. In this paper, we established a mathmetical model with four state variables to investigate the dynamic behavior of the interaction between atmospheric carbon dioxide, GDP, forest area and human population. Relevant theories were employed to analyze the system's boundedness and the stability of equilibrium points. The parameter values were estimated with the help of the actual data in China and numerical fitting was carried out to verify the results of the theoretical analysis. The sensitivity analysis of the compartments with respect to the model parameters was analyzed by using the Partial Rank Correlation Coefficient (PRCC) and the Latin Hypercube Sampling test. Apply the optimal control theory to regulate the atmospheric carbon dioxide level and provide the corresponding numerical fitting. Finally, corresponding discussions and suggestions were put forward with the help of the results of the theoretical analysis and numerical fitting.
△ Less
Submitted 23 June, 2025;
originally announced June 2025.
-
Weil polynomials of abelian varieties over finite fields
Authors:
Michael Cerchia,
Zeyu Liu,
Diana Mocanu,
Haodong Yao,
Jing Ye
Abstract:
In this paper, we investigate Weil polynomials and their relationship with isogeny classes of abelian varieties over finite fields. We give a necessary condition for a degree 12 polynomial with integer coefficients to be a Weil polynomial. Moreover, we provide explicit criteria that determine when a Weil polynomial of degree 14 occurs as the characteristic polynomial of a Frobenius endomorphism ac…
▽ More
In this paper, we investigate Weil polynomials and their relationship with isogeny classes of abelian varieties over finite fields. We give a necessary condition for a degree 12 polynomial with integer coefficients to be a Weil polynomial. Moreover, we provide explicit criteria that determine when a Weil polynomial of degree 14 occurs as the characteristic polynomial of a Frobenius endomorphism acting on an abelian variety.
△ Less
Submitted 29 June, 2025; v1 submitted 16 June, 2025;
originally announced June 2025.
-
Upper cluster structure on Kac--Moody Richardson varieties
Authors:
Huanchen Bao,
Jeff York Ye
Abstract:
We show coordinate rings of open Richardson varieties are upper cluster algebras for any symmetrizable Kac--Moody type. We further show the coordinate rings of (generalized) open Richardson varieties on the twisted product of flag varieties are upper cluster algebras for any symmetrizable Kac--Moody type. This includes, as special cases, reduced double Bruhat cells, Bott-Samelson varieties, braid…
▽ More
We show coordinate rings of open Richardson varieties are upper cluster algebras for any symmetrizable Kac--Moody type. We further show the coordinate rings of (generalized) open Richardson varieties on the twisted product of flag varieties are upper cluster algebras for any symmetrizable Kac--Moody type. This includes, as special cases, reduced double Bruhat cells, Bott-Samelson varieties, braid varieties. Our results generalize various results by Casals--Gorsky--Gorsky--Le--Shen--Simental and Galashin--Lam--Sherman-Bennett--Speyer in finite types.
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
Guided Diffusion Sampling on Function Spaces with Applications to PDEs
Authors:
Jiachen Yao,
Abbas Mammadov,
Julius Berner,
Gavin Kerrigan,
Jong Chul Ye,
Kamyar Azizzadenesheli,
Anima Anandkumar
Abstract:
We propose a general framework for conditional sampling in PDE-based inverse problems, targeting the recovery of whole solutions from extremely sparse or noisy measurements. This is accomplished by a function-space diffusion model and plug-and-play guidance for conditioning. Our method first trains an unconditional discretization-agnostic denoising model using neural operator architectures. At inf…
▽ More
We propose a general framework for conditional sampling in PDE-based inverse problems, targeting the recovery of whole solutions from extremely sparse or noisy measurements. This is accomplished by a function-space diffusion model and plug-and-play guidance for conditioning. Our method first trains an unconditional discretization-agnostic denoising model using neural operator architectures. At inference, we refine the samples to satisfy sparse observation data via a gradient-based guidance mechanism. Through rigorous mathematical analysis, we extend Tweedie's formula to infinite-dimensional Hilbert spaces, providing the theoretical foundation for our posterior sampling approach. Our method (FunDPS) accurately captures posterior distributions in function spaces under minimal supervision and severe data scarcity. Across five PDE tasks with only 3% observation, our method achieves an average 32% accuracy improvement over state-of-the-art fixed-resolution diffusion baselines while reducing sampling steps by 4x. Furthermore, multi-resolution fine-tuning ensures strong cross-resolution generalizability. To the best of our knowledge, this is the first diffusion-based framework to operate independently of discretization, offering a practical and flexible solution for forward and inverse problems in the context of PDEs. Code is available at https://github.com/neuraloperator/FunDPS
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
When is $A + x A =\mathbb{R}$
Authors:
Jinhe Ye,
Liang Yu,
Xuanheng zhao
Abstract:
We show that there is an additive $F_σ$ subgroup $A$ of $\mathbb{R}$ and $x \in \mathbb{R}$ such that $\mathrm{dim_H} (A) = \frac{1}{2}$ and $A + x A =\mathbb{R}$. However, if $A \subseteq \mathbb{R}$ is a subring of $\mathbb{R}$ and there is $x \in \mathbb{R}$ such that $A + x A =\mathbb{R}$, then $A =\mathbb{R}$. Moreover, assuming the continuum hypothesis (CH), there is a subgroup $A$ of…
▽ More
We show that there is an additive $F_σ$ subgroup $A$ of $\mathbb{R}$ and $x \in \mathbb{R}$ such that $\mathrm{dim_H} (A) = \frac{1}{2}$ and $A + x A =\mathbb{R}$. However, if $A \subseteq \mathbb{R}$ is a subring of $\mathbb{R}$ and there is $x \in \mathbb{R}$ such that $A + x A =\mathbb{R}$, then $A =\mathbb{R}$. Moreover, assuming the continuum hypothesis (CH), there is a subgroup $A$ of $\mathbb{R}$ with $\mathrm{dim_H} (A) = 0$ such that $x \not\in \mathbb{Q}$ if and only if $A + x A =\mathbb{R}$ for all $x \in \mathbb{R}$. A key ingredient in the proof of this theorem consists of some techniques in recursion theory and algorithmic randomness. We believe it may lead to applications to other constructions of exotic sets of reals. Several other theorems on measurable, and especially Borel and analytic subgroups and subfields of the reals are presented. We also discuss some of these results in the $p$-adics.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
$α$-scaled strong convergence and stability of stochastic theta method for time-changed stochastic differential equations with local Lipschitz coefficients
Authors:
Jingwei Chen,
Jun Ye,
Jinwen Chen,
Zhidong Wang
Abstract:
We propose the first $α$-parameterized framework for solving time-changed stochastic differential equations (SDEs), explicitly linking convergence rates and key driving parameter of the underlying stochastic processes. Theoretically, we derive exact moment estimates and exponential moment estimate of inverse $α$-stable subordinator $E$ using Mittag-Leffler functions. The stochastic theta (ST) meth…
▽ More
We propose the first $α$-parameterized framework for solving time-changed stochastic differential equations (SDEs), explicitly linking convergence rates and key driving parameter of the underlying stochastic processes. Theoretically, we derive exact moment estimates and exponential moment estimate of inverse $α$-stable subordinator $E$ using Mittag-Leffler functions. The stochastic theta (ST) method is investigated for a class of SDEs driven by a time-changed Brownian motion, whose coefficients are time-space-dependent and satisfy the local Lipschitz condition. We prove that the convergence order dynamically responds to the stability index $α$ of stable subordinator $D$, filling a critical gap in traditional methods that treat these factors independently. We also investigate the criteria of asympotical mean square stability of the ST method. Finally, some numerical simulations are presented to illustrate the theoretical results.
△ Less
Submitted 1 June, 2025; v1 submitted 27 March, 2025;
originally announced March 2025.
-
Determining some graph joins by the signless Laplacian spectrum
Authors:
Jiachang Ye,
Jianguo Qian,
Zoran Stanić
Abstract:
A graph is determined by its signless Laplacian spectrum if there is no other non-isomorphic graph sharing the same signless Laplacian spectrum. Let $C_l$, $P_l$, $K_l$ and $K_{s,l-s}$ be the cycle, the path, the complete graph and the complete bipartite graph with $l$ vertices, respectively. We prove that $$G\cong K_1\vee (C_{l_1}\cup C_{l_2}\cup\cdots \cup C_{l_t}\cup sK_1),$$ with…
▽ More
A graph is determined by its signless Laplacian spectrum if there is no other non-isomorphic graph sharing the same signless Laplacian spectrum. Let $C_l$, $P_l$, $K_l$ and $K_{s,l-s}$ be the cycle, the path, the complete graph and the complete bipartite graph with $l$ vertices, respectively. We prove that $$G\cong K_1\vee (C_{l_1}\cup C_{l_2}\cup\cdots \cup C_{l_t}\cup sK_1),$$ with $s\ge 0, t\ge 1, n\geq 22$, is determined by the signless Laplacian spectrum if and only if either $s=0$ or $s\ge 1$ and $l_i\ne 3$ holds for all $1\leq i\leq t$, where $n$ is the order of $G$, and $\cup$ and $\vee$ stand for the disjoint union and the join of two graphs, respectively. Moreover, for $s\ge 1$ and $l_t=3$, $K_1\vee (K_{1,3}\cup C_{l_1}\cup C_{l_2}\cup\cdots \cup C_{l_{t-1}}\cup (s-1)K_1)$ is fixed as a graph sharing the signless Laplacian spectrum with $G$. This contribution extends some recently published results.
△ Less
Submitted 23 March, 2025;
originally announced March 2025.
-
Which $L$-cospectral graphs have same degree sequences
Authors:
Jiachang Ye
Abstract:
Let $λ_{i}(G)$ be the $i$-th largest Laplacian eigenvalues of graph $G$, where $1\le i\le |V(G)|$. Liu, Yuan, You and Chen [Discrete Math., 341 (2018) 2969--2976] raised the problem for ``Which cospectral graphs have same degree sequences". In this paper, let $W_3$ and $W_5$ be the two graphs as shown in Fig. 2 and let $G$ be a connected graph with $n\ge 18$ vertices. We shall show that:
$(1)$ I…
▽ More
Let $λ_{i}(G)$ be the $i$-th largest Laplacian eigenvalues of graph $G$, where $1\le i\le |V(G)|$. Liu, Yuan, You and Chen [Discrete Math., 341 (2018) 2969--2976] raised the problem for ``Which cospectral graphs have same degree sequences". In this paper, let $W_3$ and $W_5$ be the two graphs as shown in Fig. 2 and let $G$ be a connected graph with $n\ge 18$ vertices. We shall show that:
$(1)$ If $λ_{2}(G)<5<n-1<λ_{1}(G)$, $λ_{1}(G) \notin \{λ_{1}(W_3),λ_{1}(W_5)\}$ and $H$ is Laplacian cospectral with $G$, then $H$ must have the same degree sequence with $G$;
$(2)$ If $λ_2(G)\le 4.7<n-2< λ_1(G)$, and $H$ is Laplacian cospectral with $G$, then $H$ must have the same degree sequence with $G$.
The former result easily leads to the unique theorem result of [Discrete Math., 308 (2008) 4267--4271], that is: Every multi-fan graph $K_1\vee (P_{l_1}\cup P_{l_1}\cup\cdots \cup P_{l_t})$ is determined by the Laplacian spectrum. Moreover, it can also deduce a new conclusion: $K_1\vee (P_{l_1}\cup P_{l_1}\cup\cdots \cup P_{l_t}\cup C_{s_1}\cup C_{s_2}\cup\cdots \cup C_{s_k})$ $(t\ge 1, k\ge 1)$ is determined by the Laplacian spectrum if the graph order $n\ge 18$ and each $s_i$ $(i=1,2,\ldots, k)$ is odd.
△ Less
Submitted 15 November, 2024;
originally announced November 2024.
-
ScoreFusion: Fusing Score-based Generative Models via Kullback-Leibler Barycenters
Authors:
Hao Liu,
Junze Tony Ye,
Jose Blanchet,
Nian Si
Abstract:
We introduce ScoreFusion, a theoretically grounded method for fusing multiple pre-trained diffusion models that are assumed to generate from auxiliary populations. ScoreFusion is particularly useful for enhancing the generative modeling of a target population with limited observed data. Our starting point considers the family of KL barycenters of the auxiliary populations, which is proven to be an…
▽ More
We introduce ScoreFusion, a theoretically grounded method for fusing multiple pre-trained diffusion models that are assumed to generate from auxiliary populations. ScoreFusion is particularly useful for enhancing the generative modeling of a target population with limited observed data. Our starting point considers the family of KL barycenters of the auxiliary populations, which is proven to be an optimal parametric class in the KL sense, but difficult to learn. Nevertheless, by recasting the learning problem as score matching in denoising diffusion, we obtain a tractable way of computing the optimal KL barycenter weights. We prove a dimension-free sample complexity bound in total variation distance, provided that the auxiliary models are well-fitted for their own task and the auxiliary tasks combined capture the target well. The sample efficiency of ScoreFusion is demonstrated by learning handwritten digits. We also provide a simple adaptation of a Stable Diffusion denoising pipeline that enables sampling from the KL barycenter of two auxiliary checkpoints; on a portrait generation task, our method produces faces that enhance population heterogeneity relative to the auxiliary distributions.
△ Less
Submitted 16 April, 2025; v1 submitted 27 June, 2024;
originally announced June 2024.
-
Lang-Weil Type Estimates in Finite Difference Fields
Authors:
Martin Hils,
Ehud Hrushovski,
Jinhe Ye,
Tingxiang Zou
Abstract:
We prove a uniform estimate of the number of points for difference algebraic varieties in finite difference fields in the spirit of Lang-Weil. More precisely, we give uniform lower and upper bounds for the number of rational points of a difference variety in terms of its transformal dimension. As a main technical ingredient, we prove an equidimensionality result for Frobenius reductions of differe…
▽ More
We prove a uniform estimate of the number of points for difference algebraic varieties in finite difference fields in the spirit of Lang-Weil. More precisely, we give uniform lower and upper bounds for the number of rational points of a difference variety in terms of its transformal dimension. As a main technical ingredient, we prove an equidimensionality result for Frobenius reductions of difference varieties.
△ Less
Submitted 2 June, 2024;
originally announced June 2024.
-
Zilber's Trichotomy in Hausdorff Geometric Structures
Authors:
Benjamin Castle,
Assaf Hasson,
Jinhe Ye
Abstract:
We give a new axiomatic treatment of the Zilber trichotomy, and use it to complete the proof of the trichotomy for relics of algebraically closed fields, i.e., reducts of the ACF-induced structure on ACF-definable sets. More precisely, we introduce a class of geometric structures equipped with a Hausdorff topology, called \textit{Hausdorff geometric structures}. Natural examples include the comple…
▽ More
We give a new axiomatic treatment of the Zilber trichotomy, and use it to complete the proof of the trichotomy for relics of algebraically closed fields, i.e., reducts of the ACF-induced structure on ACF-definable sets. More precisely, we introduce a class of geometric structures equipped with a Hausdorff topology, called \textit{Hausdorff geometric structures}. Natural examples include the complex field; algebraically closed valued fields; o-minimal expansions of real closed fields; and characteristic zero Henselian fields (in particular $p$-adically closed fields). We then study the Zilber trichotomy for relics of Hausdorff geometric structures, showing that under additional assumptions, every non-locally modular strongly minimal relic on a real sort interprets a one-dimensional group. Combined with recent results, this allows us to prove the trichotomy for strongly minimal relics on the real sorts of algebraically closed valued fields. Finally, we make progress on the imaginary sorts, reducing the trichotomy for \textit{all} ACVF relics (in all sorts) to a conjectural technical condition that we prove in characteristic $(0,0)$.
△ Less
Submitted 28 April, 2025; v1 submitted 3 May, 2024;
originally announced May 2024.
-
New second-order optimality conditions for directional optimality of a general set-constrained optimization problem
Authors:
Wei Ouyang,
Jane Ye,
Binbin Zhang
Abstract:
In this paper we derive new second-order optimality conditions for a very general set-constrained optimization problem where the underlying set may be nononvex. We consider local optimality in specific directions (i.e., optimal in a directional neighborhood) in pursuit of developing these new optimality conditions. First-order necessary conditions for local optimality in given directions are provi…
▽ More
In this paper we derive new second-order optimality conditions for a very general set-constrained optimization problem where the underlying set may be nononvex. We consider local optimality in specific directions (i.e., optimal in a directional neighborhood) in pursuit of developing these new optimality conditions. First-order necessary conditions for local optimality in given directions are provided by virtue of the corresponding directional normal cones. Utilizing the classical and/or the lower generalized support function, we obtain new second-order necessary and sufficient conditions for local optimality of general nonconvex constrained optimization problem in given directions via both the corresponding asymptotic second-order tangent cone and outer second-order tangent set. Our results do not require convexity and/or nonemptyness of the outer second-order tangent set. This is an important improvement to other results in the literature since the outer second-order tangent set can be nonconvex and empty even when the set is convex.
△ Less
Submitted 3 March, 2025; v1 submitted 26 April, 2024;
originally announced April 2024.
-
Nonlinear kernel-free quadratic hyper-surface support vector machine with 0-1 loss function
Authors:
Mingyang Wu,
Zhixia Yang,
Junyou Ye
Abstract:
For the binary classification problem, a novel nonlinear kernel-free quadratic hyper-surface support vector machine with 0-1 loss function (QSSVM$_{0/1}$) is proposed. Specifically, the task of QSSVM$_{0/1}$ is to seek a quadratic separating hyper-surface to divide the samples into two categories. And it has better interpretability than the methods using kernel functions, since each feature of the…
▽ More
For the binary classification problem, a novel nonlinear kernel-free quadratic hyper-surface support vector machine with 0-1 loss function (QSSVM$_{0/1}$) is proposed. Specifically, the task of QSSVM$_{0/1}$ is to seek a quadratic separating hyper-surface to divide the samples into two categories. And it has better interpretability than the methods using kernel functions, since each feature of the sample acts both independently and synergistically. By introducing the 0-1 loss function to construct the optimization model makes the model obtain strong sample sparsity. The proximal stationary point of the optimization problem is defined by the proximal operator of the 0-1 loss function, which figures out the problem of non-convex discontinuity of the optimization problem due to the 0-1 loss function. A new iterative algorithm based on the alternating direction method of multipliers (ADMM) framework is designed to solve the optimization problem, which relates to the working set defined by support vectors. The computational complexity and convergence of the algorithm are discussed. Numerical experiments on 4 artificial datasets and 14 benchmark datasets demonstrate that our QSSVM$_{0/1}$ achieves higher classification accuracy, fewer support vectors and less CPU time cost than other state-of-the-art methods.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Hyperbolicity and model-complete fields
Authors:
Michał Szachniewicz,
Jinhe Ye
Abstract:
We study model-complete fields that avoid a given quasi-project variety $V$. There is a close connection between hyperbolicity of $V$ and the existence of the model companion for the theory of characteristic-zero fields avoiding rational points on $V$. This gives a model theoretic notion of hyperbolicity that we call excludability.
In particular, we show that if $V$ is a Brody hyperbolic project…
▽ More
We study model-complete fields that avoid a given quasi-project variety $V$. There is a close connection between hyperbolicity of $V$ and the existence of the model companion for the theory of characteristic-zero fields avoiding rational points on $V$. This gives a model theoretic notion of hyperbolicity that we call excludability.
In particular, we show that if $V$ is a Brody hyperbolic projective variety over $\mathbb{Q}$ with $V(\mathbb{Q}) = \varnothing$, then the model companion, called $V\XF$, exists. We also study some model-theoretic properties of $V\mathrm{XF}$. This extends the results for curves by Will Johnson and the second author.
△ Less
Submitted 9 January, 2025; v1 submitted 22 March, 2024;
originally announced March 2024.
-
Uncertainty Propagation and Bayesian Fusion on Unimodular Lie Groups from a Parametric Perspective
Authors:
Jikai Ye,
Gregory S. Chirikjian
Abstract:
We address the problem of uncertainty propagation and Bayesian fusion on unimodular Lie groups. Starting from a stochastic differential equation (SDE) defined on Lie groups via Mckean-Gangolli injection, we first convert it to a parametric SDE in exponential coordinates. The coefficient transform method for the conversion is stated for both Ito's and Stratonovich's interpretation of the SDE. Then…
▽ More
We address the problem of uncertainty propagation and Bayesian fusion on unimodular Lie groups. Starting from a stochastic differential equation (SDE) defined on Lie groups via Mckean-Gangolli injection, we first convert it to a parametric SDE in exponential coordinates. The coefficient transform method for the conversion is stated for both Ito's and Stratonovich's interpretation of the SDE. Then we derive a mean and covariance fitting formula for probability distributions on Lie groups defined by a concentrated distribution on the exponential coordinate. It is used to derive the mean and covariance propagation equations for the SDE defined by injection, which coincides with the result derived from a Fokker-Planck equation in previous work. We also propose a simple modification to the update step of Kalman filters using the fitting formula, which improves the fusion accuracy with moderate computation time.
△ Less
Submitted 7 March, 2025; v1 submitted 7 January, 2024;
originally announced January 2024.
-
Of model completeness and algebraic groups
Authors:
Daniel Max Hoffmann,
Piotr Kowalski,
Chieu-Minh Tran,
Jinhe Ye
Abstract:
We show that if G is a split semisimple algebraic group over a model complete field K, then the groups G(K) and G(K)' (the commutator group which is a ``Chevalley group'' as for example the group PSL_2(K)) are model complete as well.
We show that if G is a split semisimple algebraic group over a model complete field K, then the groups G(K) and G(K)' (the commutator group which is a ``Chevalley group'' as for example the group PSL_2(K)) are model complete as well.
△ Less
Submitted 2 March, 2025; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Optimality conditions for bilevel programs via Moreau envelope reformulation
Authors:
Kuang Bai,
Jane Ye,
Shangzhi Zeng
Abstract:
For bilevel programs with a convex lower level program, the classical approach replaces the lower level program with its Karush-Kuhn-Tucker condition and solve the resulting mathematical program with complementarity constraint (MPCC). It is known that when the set of lower level multipliers is not unique, MPCC may not be equivalent to the original bilevel problem, and many MPCC-tailored constraint…
▽ More
For bilevel programs with a convex lower level program, the classical approach replaces the lower level program with its Karush-Kuhn-Tucker condition and solve the resulting mathematical program with complementarity constraint (MPCC). It is known that when the set of lower level multipliers is not unique, MPCC may not be equivalent to the original bilevel problem, and many MPCC-tailored constraint qualifications do not hold. In this paper, we study bilevel programs where the lower level is generalized convex. Applying the equivalent reformulation via Moreau envelope, we derive new directional optimality conditions. Even in the nondirectional case, the new optimality condition is stronger than the strong stationarity for the corresponding MPCC.
△ Less
Submitted 11 March, 2024; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Directional derivative of the value function for parametric set-constrained optimization problems
Authors:
Kuang Bai,
Jane Ye
Abstract:
This paper is concerned with the directional derivative of the value function for a very general set-constrained optimization problem under perturbation. Under reasonable assumptions, we obtain upper and lower estimates for the upper and lower Dini directional derivative of the value function respectively, from which we obtain Hadamard directional differentiability of the value function when the s…
▽ More
This paper is concerned with the directional derivative of the value function for a very general set-constrained optimization problem under perturbation. Under reasonable assumptions, we obtain upper and lower estimates for the upper and lower Dini directional derivative of the value function respectively, from which we obtain Hadamard directional differentiability of the value function when the set of multipliers is a singleton. Our results do not require convexity of the set involved. Even in the case of a parametric nonlinear program, our results improve the classical ones in that our regularity conditions are weaker and the directional solution set is used which is in general smaller than its nondirectional counterparts.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Unified Enhancement of Privacy Bounds for Mixture Mechanisms via $f$-Differential Privacy
Authors:
Chendi Wang,
Buxin Su,
Jiayuan Ye,
Reza Shokri,
Weijie J. Su
Abstract:
Differentially private (DP) machine learning algorithms incur many sources of randomness, such as random initialization, random batch subsampling, and shuffling. However, such randomness is difficult to take into account when proving differential privacy bounds because it induces mixture distributions for the algorithm's output that are difficult to analyze. This paper focuses on improving privacy…
▽ More
Differentially private (DP) machine learning algorithms incur many sources of randomness, such as random initialization, random batch subsampling, and shuffling. However, such randomness is difficult to take into account when proving differential privacy bounds because it induces mixture distributions for the algorithm's output that are difficult to analyze. This paper focuses on improving privacy bounds for shuffling models and one-iteration differentially private gradient descent (DP-GD) with random initializations using $f$-DP. We derive a closed-form expression of the trade-off function for shuffling models that outperforms the most up-to-date results based on $(ε,δ)$-DP. Moreover, we investigate the effects of random initialization on the privacy of one-iteration DP-GD. Our numerical computations of the trade-off function indicate that random initialization can enhance the privacy of DP-GD. Our analysis of $f$-DP guarantees for these mixture mechanisms relies on an inequality for trade-off functions introduced in this paper. This inequality implies the joint convexity of $F$-divergences. Finally, we study an $f$-DP analog of the advanced joint convexity of the hockey-stick divergence related to $(ε,δ)$-DP and apply it to analyze the privacy of mixture mechanisms.
△ Less
Submitted 1 November, 2023; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Distribution System Flexibility Characterization: A Network-Informed Data-Driven Approach
Authors:
Qi Li,
Jianzhe Liu,
Bai Cui,
Wenzhan Song,
Jin Ye
Abstract:
A distribution system can flexibly adjust its substation-level power output by aggregating its local distributed energy resources (DERs). Due to DER and network constraints, characterizing the exact feasible power output region is computationally intensive. Hence, existing results usually rely on unpractical assumptions or suffer from conservativeness issues. Sampling-based data-driven methods can…
▽ More
A distribution system can flexibly adjust its substation-level power output by aggregating its local distributed energy resources (DERs). Due to DER and network constraints, characterizing the exact feasible power output region is computationally intensive. Hence, existing results usually rely on unpractical assumptions or suffer from conservativeness issues. Sampling-based data-driven methods can potentially address these limitations. Still, existing works usually exhibit computational inefficiency issues as they use a random sampling approach, which carries little information from network physics and provides few insights into the iterative search process. This letter proposes a novel network-informed data-driven method to close this gap. A computationally efficient data sampling approach is developed to obtain high-quality training data, leveraging network information and legacy learning experience. Then, a classifier is trained to estimate the feasible power output region with high accuracy. Numerical studies based on a real-world Southern California Edison network validate the performance of the proposed work.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Convergence Rate of LQG Mean Field Games with Common Noise
Authors:
Jiamin Jian,
Qingshuo Song,
Jiaxuan Ye
Abstract:
This paper focuses on exploring the convergence properties of a generic player's trajectory and empirical measures in an N-player Linear-Quadratic-Gaussian Nash game, where Brownian motion serves as the common noise. The study establishes three distinct convergence rates concerning the representative player and empirical measure. To investigate the convergence, the methodology relies on a specific…
▽ More
This paper focuses on exploring the convergence properties of a generic player's trajectory and empirical measures in an N-player Linear-Quadratic-Gaussian Nash game, where Brownian motion serves as the common noise. The study establishes three distinct convergence rates concerning the representative player and empirical measure. To investigate the convergence, the methodology relies on a specific decomposition of the equilibrium path in the N-player game and utilizes the associated Mean Field Game framework.
△ Less
Submitted 2 July, 2023;
originally announced July 2023.
-
Calm local optimality for nonconvex-nonconcave minimax problems
Authors:
Xiaoxiao Ma,
Wei Yao,
Jane J. Ye,
Jin Zhang
Abstract:
Nonconvex-nonconcave minimax problems have found numerous applications in various fields including machine learning. However, questions remain about what is a good surrogate for local minimax optimum and how to characterize the minimax optimality. Recently Jin, Netrapalli, and Jordan (ICML 2020) introduced a concept of local minimax point and derived optimality conditions for the smooth and uncons…
▽ More
Nonconvex-nonconcave minimax problems have found numerous applications in various fields including machine learning. However, questions remain about what is a good surrogate for local minimax optimum and how to characterize the minimax optimality. Recently Jin, Netrapalli, and Jordan (ICML 2020) introduced a concept of local minimax point and derived optimality conditions for the smooth and unconstrained case. In this paper, we introduce the concept of calm local minimax point, which is a local minimax point with a calm radius function. With the extra calmness property we obtain first and second-order sufficient and necessary optimality conditions for a very general class of nonsmooth nonconvex-nonconcave minimax problem. Moreover we show that the calm local minimax optimality and the local minimax optimality coincide under a weak sufficient optimality condition for the maximization problem. This equivalence allows us to derive stronger optimality conditions under weaker assumptions for local minimax optimality.
△ Less
Submitted 30 June, 2023;
originally announced June 2023.
-
Moreau Envelope Based Difference-of-weakly-Convex Reformulation and Algorithm for Bilevel Programs
Authors:
Lucy L. Gao,
Jane J. Ye,
Haian Yin,
Shangzhi Zeng,
Jin Zhang
Abstract:
Bilevel programming has emerged as a valuable tool for hyperparameter selection, a central concern in machine learning. In a recent study by Ye et al. (2023), a value function-based difference of convex algorithm was introduced to address bilevel programs. This approach proves particularly powerful when dealing with scenarios where the lower-level problem exhibits convexity in both the upper-level…
▽ More
Bilevel programming has emerged as a valuable tool for hyperparameter selection, a central concern in machine learning. In a recent study by Ye et al. (2023), a value function-based difference of convex algorithm was introduced to address bilevel programs. This approach proves particularly powerful when dealing with scenarios where the lower-level problem exhibits convexity in both the upper-level and lower-level variables. Examples of such scenarios include support vector machines and $\ell_1$ and $\ell_2$ regularized regression. In this paper, we significantly expand the range of applications, now requiring convexity only in the lower-level variables of the lower-level program. We present an innovative single-level difference of weakly convex reformulation based on the Moreau envelope of the lower-level problem. We further develop a sequentially convergent Inexact Proximal Difference of Weakly Convex Algorithm (iP-DwCA). To evaluate the effectiveness of the proposed iP-DwCA, we conduct numerical experiments focused on tuning hyperparameters for kernel support vector machines on simulated data.
△ Less
Submitted 20 January, 2024; v1 submitted 29 June, 2023;
originally announced June 2023.
-
Non-trivial higher homotopy of first-order theories
Authors:
Tim Campion,
Jinhe Ye
Abstract:
Let $T$ be the theory of dense cyclically ordered sets with at least two elements. We determine the classifying space of $\mathsf{Mod}(T)$ to be homotopically equivalent to $\mathbb{CP}^\infty$. In particular, $π_2(\lvert\mathsf{Mod}(T)\rvert)=\mathbb{Z}$, which answers a question in our previous work. The computation is based on Connes' cycle category $Λ$.
Let $T$ be the theory of dense cyclically ordered sets with at least two elements. We determine the classifying space of $\mathsf{Mod}(T)$ to be homotopically equivalent to $\mathbb{CP}^\infty$. In particular, $π_2(\lvert\mathsf{Mod}(T)\rvert)=\mathbb{Z}$, which answers a question in our previous work. The computation is based on Connes' cycle category $Λ$.
△ Less
Submitted 12 January, 2024; v1 submitted 21 June, 2023;
originally announced June 2023.
-
PLMEs and Disjunctive Decompositions for Bilevel Optimization
Authors:
Jiawang Nie,
Jane J. Ye,
Suhan Zhong
Abstract:
This paper studies bilevel polynomial optimization in which lower level constraining functions depend linearly on lower level variables. We show that such a bilevel program can be reformulated as a disjunctive program using partial Lagrange multiplier expressions (PLMEs). An advantage of this approach is that branch problems of the disjunctive program are easier to solve. In particular, since the…
▽ More
This paper studies bilevel polynomial optimization in which lower level constraining functions depend linearly on lower level variables. We show that such a bilevel program can be reformulated as a disjunctive program using partial Lagrange multiplier expressions (PLMEs). An advantage of this approach is that branch problems of the disjunctive program are easier to solve. In particular, since the PLME can be easily obtained, these branch problems can be efficiently solved by polynomial optimization techniques. Solving each branch problem either returns infeasibility or gives a candidate local or global optimizer for the original bilevel optimization. We give necessary and sufficient conditions for these candidates to be global optimizers, and sufficient conditions for the local optimality. Numerical experiments are also presented to show the efficiency of the method.
△ Less
Submitted 2 April, 2023;
originally announced April 2023.
-
Extremal spectral radius of weighted adjacency matrices of bicyclic graphs
Authors:
Jiachang Ye,
Junli Hu,
Xiaodan Chen
Abstract:
The weighted adjacency matrix $A_{f}(G)$ of a simple graph $G=(V,E)$ is the $|V|\times|V|$ matrix whose $ij$-entry equals $f(d_{i},d_j)$, where $f(x,y)$ is a symmetric function such that $f(d_i,d_j)>0$ if $ij\in E$ and $f(d_i,d_j)=0$ if $ij\notin E$ and $d_i$ is the degree of the vertex $i$. In this paper, we determine the unique graph having the largest spectral radius of $A_{f}(G)$ among all the…
▽ More
The weighted adjacency matrix $A_{f}(G)$ of a simple graph $G=(V,E)$ is the $|V|\times|V|$ matrix whose $ij$-entry equals $f(d_{i},d_j)$, where $f(x,y)$ is a symmetric function such that $f(d_i,d_j)>0$ if $ij\in E$ and $f(d_i,d_j)=0$ if $ij\notin E$ and $d_i$ is the degree of the vertex $i$. In this paper, we determine the unique graph having the largest spectral radius of $A_{f}(G)$ among all the bicyclic graphs under the assumption that $f(x,y)$ is increasing and convex in $x$ and $f(x_1,y_1)\geq f(x_2,y_2)$ when $|x_1-y_1|>|x_2-y_2|$ and $x_1+y_1=x_2+y_2$. Moreover, we determine the unique graph having the second largest spectral radius of $A_{f}(G)$ among all the bicyclic graphs when $f(x,y)=x+y$, $(x+y)^2$ or $x^2+y^2$, which corresponds to the well-known first Zagreb index, first hyper-Zagreb index, and forgotten index, respectively. In addition, we also characterize the bicyclic graphs with the first two largest spectral radii of $A_{f}(G)$ when $f(x,y)=\frac{1}{2}(x/y+y/x)$, corresponding to the extended index.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
Curve-excluding fields
Authors:
Will Johnson,
Jinhe Ye
Abstract:
If $C$ is a curve over $\mathbb{Q}$ with genus at least $2$ and $C(\mathbb{Q})$ is empty, then the class of fields $K$ of characteristic 0 such that $C(K) = \varnothing$ has a model companion, which we call $C\mathrm{XF}$. The theory $C\mathrm{XF}$ is not complete, but we characterize the completions. Using $C\mathrm{XF}$, we produce examples of fields with interesting combinations of properties.…
▽ More
If $C$ is a curve over $\mathbb{Q}$ with genus at least $2$ and $C(\mathbb{Q})$ is empty, then the class of fields $K$ of characteristic 0 such that $C(K) = \varnothing$ has a model companion, which we call $C\mathrm{XF}$. The theory $C\mathrm{XF}$ is not complete, but we characterize the completions. Using $C\mathrm{XF}$, we produce examples of fields with interesting combinations of properties. For example, we produce (1) a model-complete field with unbounded Galois group, (2) an infinite field with a decidable first-order theory that is not ``large'' in the sense of Pop, (3) a field that is algebraically bounded but not ``very slim'' in the sense of Junker and Koenigsmann, and (4) a pure field that is strictly NSOP$_4$, i.e., NSOP$_4$ but not NSOP$_3$. Lastly, we give a new construction of fields that are virtually large but not large.
△ Less
Submitted 12 December, 2024; v1 submitted 10 March, 2023;
originally announced March 2023.
-
Sensitivity analysis of the maximal value function with applications in nonconvex minimax programs
Authors:
L. Guo,
J. J. Ye,
J. Zhang
Abstract:
In this paper, we perform sensitivity analysis for the maximal value function which is the optimal value function for a parametric maximization problem. Our aim is to study various subdifferentials for the maximal value function. We obtain upper estimates of Fréchet, limiting, and horizon subdifferentials of the maximal value function by using some sensitivity analysis techniques sophisticatedly.…
▽ More
In this paper, we perform sensitivity analysis for the maximal value function which is the optimal value function for a parametric maximization problem. Our aim is to study various subdifferentials for the maximal value function. We obtain upper estimates of Fréchet, limiting, and horizon subdifferentials of the maximal value function by using some sensitivity analysis techniques sophisticatedly. The derived upper estimates depend only on the union of all solutions and not on its convex hull or only one solution from the solution set. Finally, we apply the derived results to develop some new necessary optimality conditions for nonconvex minimax problems. In the nonconvex-concave setting, our Wolfe duality approach compare favourably with the first order approach in that the necessary condition is sharper and the constraint qualification is weaker.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
On the polynomiality conjecture of cluster realization of quantum groups
Authors:
Ivan Chi-Ho Ip,
Jeff York Ye
Abstract:
In this paper, we give a sufficient and necessary condition for a regular element of a quantum cluster algebra $\mathcal{O}_q(\mathcal{X})$ to be universally polynomial. This resolves several conjectures by the first author on the polynomiality of the cluster realization of quantum group generators in different families of positive representations.
In this paper, we give a sufficient and necessary condition for a regular element of a quantum cluster algebra $\mathcal{O}_q(\mathcal{X})$ to be universally polynomial. This resolves several conjectures by the first author on the polynomiality of the cluster realization of quantum group generators in different families of positive representations.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Winning Strategies for Generalized Zeckendorf Game
Authors:
Steven J. Miller,
Eliel Sosis,
Jingkai Ye
Abstract:
Zeckendorf proved that every positive integer $n$ can be written uniquely as the sum of non-adjacent Fibonacci numbers; a similar result holds for other positive linear recurrence sequences. These legal decompositions can be used to construct a game that starts with a fixed integer $n$, and players take turns using moves relating to a given recurrence relation. The game eventually terminates in a…
▽ More
Zeckendorf proved that every positive integer $n$ can be written uniquely as the sum of non-adjacent Fibonacci numbers; a similar result holds for other positive linear recurrence sequences. These legal decompositions can be used to construct a game that starts with a fixed integer $n$, and players take turns using moves relating to a given recurrence relation. The game eventually terminates in a unique legal decomposition, and the player who makes the final move wins.
For the Fibonacci game, Player $2$ has the winning strategy for all $n>2$. We give a non-constructive proof that for the two-player $(c, k)$-nacci game, for all $k$ and sufficiently large $n$, Player $1$ has a winning strategy when $c$ is even and Player $2$ has a winning strategy when $c$ is odd. Interestingly, the player with the winning strategy can make a mistake as early as the $c + 1$ turn, in which case the other player gains the winning strategy. Furthermore, we proved that for the $(c, k)$-nacci game with players $p \ge c + 2$, no player has a winning strategy for any $n \ge 3c^2 + 6c + 3$. We find a stricter lower boundary, $n \ge 7$, in the case of the three-player $(1, 2)$-nacci game. Then we extend the result from the multiplayer game to multialliance games, showing which alliance has a winning strategy or when no winning strategy exists for some special cases of multialliance games.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
Directional subdifferential of the value function
Authors:
Kuang Bai,
Jane J. Ye
Abstract:
The directional subdifferential of the value function gives an estimate on how much the optimal value changes under a perturbation in a certain direction. In this paper we derive upper estimates for the directional limiting and singular subdifferential of the value function for a very general parametric optimization problem. We obtain a characterization for the directional Lipschitzness of a local…
▽ More
The directional subdifferential of the value function gives an estimate on how much the optimal value changes under a perturbation in a certain direction. In this paper we derive upper estimates for the directional limiting and singular subdifferential of the value function for a very general parametric optimization problem. We obtain a characterization for the directional Lipschitzness of a locally lower semicontinuous function in terms of the directional subdifferentials. Based on this characterization and the derived upper estimate for the directional singular subdifferential, we are able to obtain a sufficient condition for the directional Lipschitzness of the value function. Finally, we specify these results for various cases when all functions involved are smooth, when the perturbation is additive, when the constraint is independent of the parameter, or when the constraints are equalities and inequalities. Our results extend the corresponding results on the sensitivity of the value function to allow directional perturbations. Even in the case of full perturbations, our results recover or even extend some existing results, including the Danskin's theorem.
△ Less
Submitted 17 April, 2023; v1 submitted 22 November, 2022;
originally announced November 2022.
-
Tropical functions on a skeleton
Authors:
Antoine Ducros,
Ehud Hrushovski,
François Loeser,
Jinhe Ye
Abstract:
We prove a general finiteness statement for the ordered abelian group of tropical functions on skeleta in Berkovich analytifications of algebraic varieties. Our approach consists in working in the framework of stable completions of algebraic varieties, a model-theoretic version of Berkovich analytifications, for which we prove a similar result, of which the former one is a consequence.
We prove a general finiteness statement for the ordered abelian group of tropical functions on skeleta in Berkovich analytifications of algebraic varieties. Our approach consists in working in the framework of stable completions of algebraic varieties, a model-theoretic version of Berkovich analytifications, for which we prove a similar result, of which the former one is a consequence.
△ Less
Submitted 19 June, 2024; v1 submitted 8 October, 2022;
originally announced October 2022.
-
Optimality conditions and constraint qualifications for cardinality constrained optimization problems
Authors:
Zhuoyu Xiao,
Jane J. Ye
Abstract:
The cardinality constrained optimization problem (CCOP) is an optimization problem where the maximum number of nonzero components of any feasible point is bounded. In this paper, we consider CCOP as a mathematical program with disjunctive subspaces constraints (MPDSC). Since a subspace is a special case of a convex polyhedral set, MPDSC is a special case of the mathematical program with disjunctiv…
▽ More
The cardinality constrained optimization problem (CCOP) is an optimization problem where the maximum number of nonzero components of any feasible point is bounded. In this paper, we consider CCOP as a mathematical program with disjunctive subspaces constraints (MPDSC). Since a subspace is a special case of a convex polyhedral set, MPDSC is a special case of the mathematical program with disjunctive constraints (MPDC). Using the special structure of subspaces, we are able to obtain more precise formulas for the tangent and (directional) normal cones for the disjunctive set of subspaces. We then obtain first and second order optimality conditions by using the corresponding results from MPDC. Thanks to the special structure of the subspace, we are able to obtain some results for MPDSC that do not hold in general for MPDC. In particular we show that the relaxed constant positive linear dependence (RCPLD) is a sufficient condition for the metric subregularity/error bound property for MPDSC which is not true for MPDC in general. Finally we show that under all constraint qualifications presented in this paper, certain exact penalization holds for CCOP.
△ Less
Submitted 17 September, 2022;
originally announced September 2022.
-
Topology Optimization with Frictional Self-Contact
Authors:
Zeshun Zong,
Xuan Li,
Jianping Ye,
Sian Wen,
Yin Yang,
Danny M. Kaufman,
Minchen Li,
Chenfanfu Jiang
Abstract:
Contact-aware topology optimization faces challenges in robustness, accuracy, and applicability to internal structural surfaces under self-contact. This work builds on the recently proposed barrier-based Incremental Potential Contact (IPC) model and presents a new self-contact-aware topology optimization framework. A combination of SIMP, adjoint sensitivity analysis, and the IPC frictional-contact…
▽ More
Contact-aware topology optimization faces challenges in robustness, accuracy, and applicability to internal structural surfaces under self-contact. This work builds on the recently proposed barrier-based Incremental Potential Contact (IPC) model and presents a new self-contact-aware topology optimization framework. A combination of SIMP, adjoint sensitivity analysis, and the IPC frictional-contact model is investigated. Numerical examples for optimizing varying objective functions under contact are presented. The resulting algorithm proposed solves topology optimization for large deformation and complex frictionally contacting scenarios with accuracy and robustness.
△ Less
Submitted 24 August, 2022; v1 submitted 6 August, 2022;
originally announced August 2022.
-
When is the étale open topology a field topology?
Authors:
Philip Dittmann,
Erik Walsberg,
Jinhe Ye
Abstract:
We investigate the following question: Given a field $K$, when is the étale open topology $\mathcal{E}_K$ induced by a field topology? On the positive side, when $K$ is the fraction field of a local domain $R\neq K$, using a weak form of resolution of singularities due to Gabber, we show that $\mathcal{E}_K$ agrees with the $R$-adic topology when $R$ is quasi-excellent and henselian. Various patho…
▽ More
We investigate the following question: Given a field $K$, when is the étale open topology $\mathcal{E}_K$ induced by a field topology? On the positive side, when $K$ is the fraction field of a local domain $R\neq K$, using a weak form of resolution of singularities due to Gabber, we show that $\mathcal{E}_K$ agrees with the $R$-adic topology when $R$ is quasi-excellent and henselian. Various pathologies appear when dropping the quasi-excellence assumption. For locally bounded field topologies, we introduce the notion of generalized t-henselianity (gt-henselianity) following Prestel and Ziegler. We establish the following: For a locally bounded field topology $τ$, the étale open topology is induced by $τ$ if and only if $τ$ is gt-henselian and some non-empty étale image is $τ$-bounded open. On the negative side, we obtain that for a pseudo-algebraically closed field $K$, $\mathcal{E}_K$ is never induced by a field topology.
△ Less
Submitted 23 June, 2025; v1 submitted 3 August, 2022;
originally announced August 2022.
-
A note on geometric theories of fields
Authors:
Will Johnson,
Jinhe Ye
Abstract:
Let $T$ be a complete theory of fields, possibly with extra structure. Suppose that model-theoretic algebraic closure agrees with field-theoretic algebraic closure, or more generally that model-theoretic algebraic closure has the exchange property. Then $T$ has uniform finiteness, or equivalently, it eliminates the quantifier $\exists^\infty$. It follows that very slim fields in the sense of Junke…
▽ More
Let $T$ be a complete theory of fields, possibly with extra structure. Suppose that model-theoretic algebraic closure agrees with field-theoretic algebraic closure, or more generally that model-theoretic algebraic closure has the exchange property. Then $T$ has uniform finiteness, or equivalently, it eliminates the quantifier $\exists^\infty$. It follows that very slim fields in the sense of Junker and Koenigsmann are the same thing as geometric fields in the sense of Hrushovski and Pillay. Modulo some fine print, these two concepts are also equivalent to algebraically bounded fields in the sense of van den Dries.
From the proof, one gets a one-cardinal theorem for geometric theories of fields: any infinite definable set has the same cardinality as the field. We investigate whether this extends to interpretable sets. We show that positive dimensional interpretable sets must have the same cardinality as the field, but zero-dimensional interpretable sets can have smaller cardinality. As an application, we show that any geometric theory of fields has an uncountable model with only countably many finite algebraic extensions.
△ Less
Submitted 7 March, 2023; v1 submitted 31 July, 2022;
originally announced August 2022.
-
An invitation to extension domination
Authors:
Kyle Gannon,
Jinhe Ye
Abstract:
Motivated by the theory of domination for types, we introduce a notion of domination for Keisler measures called extension domination. We argue that this variant of domination behaves similarly to its type setting counterpart. We prove that extension domination extends domination for types and that it forms a preorder on the space of global Keisler measures. We then explore some basic properties r…
▽ More
Motivated by the theory of domination for types, we introduce a notion of domination for Keisler measures called extension domination. We argue that this variant of domination behaves similarly to its type setting counterpart. We prove that extension domination extends domination for types and that it forms a preorder on the space of global Keisler measures. We then explore some basic properties related to this notion (e.g. approximations by formulas, closure under localizations, convex combinations). We also prove a few preservation theorems and provide some explicit examples.
△ Less
Submitted 8 January, 2024; v1 submitted 17 July, 2022;
originally announced July 2022.
-
Value Function Based Difference-of-Convex Algorithm for Bilevel Hyperparameter Selection Problems
Authors:
Lucy Gao,
Jane J. Ye,
Haian Yin,
Shangzhi Zeng,
Jin Zhang
Abstract:
Gradient-based optimization methods for hyperparameter tuning guarantee theoretical convergence to stationary solutions when for fixed upper-level variable values, the lower level of the bilevel program is strongly convex (LLSC) and smooth (LLS). This condition is not satisfied for bilevel programs arising from tuning hyperparameters in many machine learning algorithms. In this work, we develop a…
▽ More
Gradient-based optimization methods for hyperparameter tuning guarantee theoretical convergence to stationary solutions when for fixed upper-level variable values, the lower level of the bilevel program is strongly convex (LLSC) and smooth (LLS). This condition is not satisfied for bilevel programs arising from tuning hyperparameters in many machine learning algorithms. In this work, we develop a sequentially convergent Value Function based Difference-of-Convex Algorithm with inexactness (VF-iDCA). We show that this algorithm achieves stationary solutions without LLSC and LLS assumptions for bilevel programs from a broad class of hyperparameter tuning applications. Our extensive experiments confirm our theoretical findings and show that the proposed VF-iDCA yields superior performance when applied to tune hyperparameters.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Relaxed constant positive linear dependence constraint qualification for disjunctive programs
Authors:
Mengwei Xu,
Jane J. Ye
Abstract:
The disjunctive system is a system involving a disjunctive set which is the union of finitely many polyhedral convex sets. In this paper, we introduce a notion of the relaxed constant positive linear dependence constraint qualification (RCPLD) for the disjunctive system. For a disjunctive system, our notion is weaker than the one we introduced for a more general system recently (J. Glob. Optim. 20…
▽ More
The disjunctive system is a system involving a disjunctive set which is the union of finitely many polyhedral convex sets. In this paper, we introduce a notion of the relaxed constant positive linear dependence constraint qualification (RCPLD) for the disjunctive system. For a disjunctive system, our notion is weaker than the one we introduced for a more general system recently (J. Glob. Optim. 2020) and is still a constraint qualification. To obtain the local error bound for the disjunctive system, we introduce the piecewise RCPLD under which the error bound property holds if all inequality constraint functions are subdifferentially regular and the rest of the constraint functions are smooth. We then specialize our results to the ortho-disjunctive program, which includes the mathematical program with equilibrium constraints (MPEC), the mathematical program with vanishing constraints (MPVC) and the mathematical program with switching constraints (MPSC) as special cases. For MPEC, we recover MPEC-RCPLD, an MPEC variant of RCPLD and propose the MPEC piecewise RCPLD to obtain the {error bound property}. For MPVC, we introduce new constraint qualifications MPVC-RCPLD and the piecewise RCPLD, which also implies the local error bound. For MPSC, we show that both RCPLD and the piecewise RCPLD coincide and hence it leads to the local error bound.
△ Less
Submitted 5 March, 2023; v1 submitted 21 April, 2022;
originally announced April 2022.
-
Second-order optimality conditions for general nonconvex optimization problems and variational analysis of disjunctive systems
Authors:
Matus Benko,
Helmut Gfrerer,
Jane Ye,
Jin Zhang,
Jinchuan Zhou
Abstract:
In this paper, we propose second-order sufficient optimality conditions for a very general nonconvex constrained optimization problem, which covers many prominent mathematical programs.Unlike the existing results in the literature, our conditions prove to be sufficient, for an essential local minimizer of second order, under merely basic smoothness and closedness assumptions on the data defining t…
▽ More
In this paper, we propose second-order sufficient optimality conditions for a very general nonconvex constrained optimization problem, which covers many prominent mathematical programs.Unlike the existing results in the literature, our conditions prove to be sufficient, for an essential local minimizer of second order, under merely basic smoothness and closedness assumptions on the data defining the problem.In the second part, we propose a comprehensive first- and second-order variational analysis of disjunctive systems and demonstrate how the second-order objects appearing in the optimality conditions can be effectively computed in this case.
△ Less
Submitted 22 November, 2022; v1 submitted 18 March, 2022;
originally announced March 2022.
-
Beautiful pairs
Authors:
Pablo Cubides Kovacsics,
Martin Hils,
Jinhe Ye
Abstract:
We introduce an abstract framework to study certain classes of stably embedded pairs of models of a complete $\mathcal{L}$-theory $T$, called \textit{beautiful pairs}, which comprises Poizat's belles paires of stable structures and van den Dries-Lewenberg's tame pairs of o-minimal structures. Using an amalgamation construction, we relate several properties of beautiful pairs with properties analog…
▽ More
We introduce an abstract framework to study certain classes of stably embedded pairs of models of a complete $\mathcal{L}$-theory $T$, called \textit{beautiful pairs}, which comprises Poizat's belles paires of stable structures and van den Dries-Lewenberg's tame pairs of o-minimal structures. Using an amalgamation construction, we relate several properties of beautiful pairs with properties analogous to properties in Fraïssé classes.
After characterizing beautiful pairs of various theories of ordered abelian groups and valued fields, including the theories of algebraically, $p$-adically and real closed valued fields, we show an Ax-Kochen-Ershov type result for beautiful pairs of henselian valued fields. As an application, we derive strict pro-definability of particular classes of definable types. When $T$ is one of the theories of valued fields mentioned above, the corresponding classes of types are related to classical geometric spaces and our main result specializes to their strict pro-definability. Most notably, we exhibit the strict pro-definability of a natural space of types associated to Huber's analytification. In this way, we also recover a result of Hrushovski-Loeser on the strict pro-definability of stably dominated types in algebraically closed valued fields, which corresponds to Berkovich's analytification.
△ Less
Submitted 31 March, 2025; v1 submitted 1 December, 2021;
originally announced December 2021.
-
Performance Analysis of Fractional Learning Algorithms
Authors:
Abdul Wahab,
Shujaat Khan,
Imran Naseem,
Jong Chul Ye
Abstract:
Fractional learning algorithms are trending in signal processing and adaptive filtering recently. However, it is unclear whether the proclaimed superiority over conventional algorithms is well-grounded or is a myth as their performance has never been extensively analyzed. In this article, a rigorous analysis of fractional variants of the least mean squares and steepest descent algorithms is perfor…
▽ More
Fractional learning algorithms are trending in signal processing and adaptive filtering recently. However, it is unclear whether the proclaimed superiority over conventional algorithms is well-grounded or is a myth as their performance has never been extensively analyzed. In this article, a rigorous analysis of fractional variants of the least mean squares and steepest descent algorithms is performed. Some critical schematic kinks in fractional learning algorithms are identified. Their origins and consequences on the performance of the learning algorithms are discussed and swift ready-witted remedies are proposed. Apposite numerical experiments are conducted to discuss the convergence and efficiency of the fractional learning algorithms in stochastic environments.
△ Less
Submitted 11 October, 2021;
originally announced October 2021.
-
Adaptive Uncertainty-Weighted ADMM for Distributed Optimization
Authors:
Jianping Ye,
Caleb Wan,
Samy Wu Fung
Abstract:
We present AUQ-ADMM, an adaptive uncertainty-weighted consensus ADMM method for solving large-scale convex optimization problems in a distributed manner. Our key contribution is a novel adaptive weighting scheme that empirically increases the progress made by consensus ADMM scheme and is attractive when using a large number of subproblems. The weights are related to the uncertainty associated with…
▽ More
We present AUQ-ADMM, an adaptive uncertainty-weighted consensus ADMM method for solving large-scale convex optimization problems in a distributed manner. Our key contribution is a novel adaptive weighting scheme that empirically increases the progress made by consensus ADMM scheme and is attractive when using a large number of subproblems. The weights are related to the uncertainty associated with the solutions of each subproblem, and are efficiently computed using low-rank approximations. We show AUQ-ADMM provably converges and demonstrate its effectiveness on a series of machine learning applications, including elastic net regression, multinomial logistic regression, and support vector machines. We provide an implementation based on the PyTorch package.
△ Less
Submitted 19 April, 2022; v1 submitted 2 September, 2021;
originally announced September 2021.
-
Extremal Polygonal Cacti for General Sombor Index
Authors:
Jiachang Ye,
Jianguo Qian
Abstract:
The Sombor index of a graph $G$ was recently introduced by Gutman from the geometric point of view, defined as $SO(G)=\sum_{uv\in E(G)}\sqrt{d(u)^2+d(v)^2}$, where $d(u)$ is the degree of a vertex $u$. For two real numbers $α$ and $β$, the $α$-Sombor index and general Sombor index of $G$ are two generalized forms of the Sombor index defined as $SO_α(G)=\sum_{uv\in E(G)}(d(u)^α+d(v)^α)^{1/α}$ and…
▽ More
The Sombor index of a graph $G$ was recently introduced by Gutman from the geometric point of view, defined as $SO(G)=\sum_{uv\in E(G)}\sqrt{d(u)^2+d(v)^2}$, where $d(u)$ is the degree of a vertex $u$. For two real numbers $α$ and $β$, the $α$-Sombor index and general Sombor index of $G$ are two generalized forms of the Sombor index defined as $SO_α(G)=\sum_{uv\in E(G)}(d(u)^α+d(v)^α)^{1/α}$ and $SO_α(G;β)=\sum_{uv\in E(G)}(d(u)^α+d(v)^α)^β$, respectively. A $k$-polygonal cactus is a connected graph in which every block is a cycle of length $k$. In this paper, we establish a lower bound on $α$-Sombor index for $k$-polygonal cacti and show that the bound is attained only by chemical $k$-polygonal cacti. The extremal $k$-polygonal cacti for $SO_α(G;β)$ with some particular $α$ and $β$ are also considered.
△ Less
Submitted 3 October, 2021; v1 submitted 29 August, 2021;
originally announced August 2021.
-
The étale open topology over the fraction field of a henselian local domain
Authors:
Will Johnson,
Erik Walsberg,
Jinhe Ye
Abstract:
Suppose that $R$ is a local domain with fraction field $K$. If $R$ is Henselian then the $R$-adic topology over $K$ refines the étale open topology. If $R$ is regular then the étale open topology over $K$ refines the $R$-adic topology. In particular the étale open topology over $L((t_1,\ldots,t_n))$ agrees with the $L[[t_1,\ldots,t_n]]$-adic topology for any field $L$ and $n \ge 1$.
Suppose that $R$ is a local domain with fraction field $K$. If $R$ is Henselian then the $R$-adic topology over $K$ refines the étale open topology. If $R$ is regular then the étale open topology over $K$ refines the $R$-adic topology. In particular the étale open topology over $L((t_1,\ldots,t_n))$ agrees with the $L[[t_1,\ldots,t_n]]$-adic topology for any field $L$ and $n \ge 1$.
△ Less
Submitted 23 August, 2022; v1 submitted 4 August, 2021;
originally announced August 2021.
-
Combined approach with second-order optimality conditions for bilevel programming problems
Authors:
Xiaoxiao Ma,
Wei Yao,
Jane J. Ye,
Jin Zhang
Abstract:
In this paper, we propose a combined approach with second-order optimality conditions of the lower level problem to study constraint qualifications and optimality conditions for bilevel programming problems. The new method is inspired by the combined approach developed by Ye and Zhu in 2010, where the authors combined the classical first-order and the value function approaches to derive new necess…
▽ More
In this paper, we propose a combined approach with second-order optimality conditions of the lower level problem to study constraint qualifications and optimality conditions for bilevel programming problems. The new method is inspired by the combined approach developed by Ye and Zhu in 2010, where the authors combined the classical first-order and the value function approaches to derive new necessary optimality conditions. In our approach, we add a second-order optimality condition to the combined program as a new constraint. We show that when all known approaches fail, adding the second-order optimality condition as a constraint makes the corresponding partial calmness condition and the resulting necessary optimality condition easier to hold. We also give some discussions on advantages and disadvantages of the combined approaches with the first-order and the second-order information.
△ Less
Submitted 7 February, 2023; v1 submitted 31 July, 2021;
originally announced August 2021.
-
Generic property of the partial calmness condition for bilevel programming problems
Authors:
Rongzhu Ke,
Wei Yao,
Jane J. Ye,
Jin Zhang
Abstract:
The partial calmness for the bilevel programming problem (BLPP) is an important condition which ensures that a local optimal solution of BLPP is a local optimal solution of a partially penalized problem where the lower level optimality constraint is moved to the objective function and hence a weaker constraint qualification can be applied. In this paper we propose a sufficient condition in the for…
▽ More
The partial calmness for the bilevel programming problem (BLPP) is an important condition which ensures that a local optimal solution of BLPP is a local optimal solution of a partially penalized problem where the lower level optimality constraint is moved to the objective function and hence a weaker constraint qualification can be applied. In this paper we propose a sufficient condition in the form of a partial error bound condition which guarantees the partial calmness condition. We analyse the partial calmness for the combined program based on the Bouligand (B-) and the Fritz John (FJ) stationary conditions from a generic point of view. Our main result states that the partial error bound condition for the combined programs based on B and FJ conditions are generic for an important setting with applications in economics and hence the partial calmness for the combined program is not a particularly stringent assumption. Moreover we derive optimality conditions for the combined program for the generic case without any extra constraint qualifications and show the exact equivalence between our optimality condition and the one by Jongen and Shikhman given in implicit form. Our arguments are based on Jongen, Jonker and Twilt's generic (five type) classification of the so-called generalized critical points for one-dimensional parametric optimization problems and Jongen and Shikhman's generic local reductions of BLPPs.
△ Less
Submitted 2 November, 2021; v1 submitted 30 July, 2021;
originally announced July 2021.
-
The convergence rate of the equilibrium measure for the hybrid LQG Mean Field Game
Authors:
Jiamin Jian,
Peiyao Lai,
Qingshuo Song,
Jiaxuan Ye
Abstract:
In this work, we study the convergence rate of the $N$-player LQG game with a Markov chain common noise towards its asymptotic Mean Field Game. By postulating a Markovian structure via two auxiliary processes for the first and second moments of the Mean Field Game equilibrium and applying the fixed point condition in Mean Field Game, we first provide the characterization of the equilibrium measure…
▽ More
In this work, we study the convergence rate of the $N$-player LQG game with a Markov chain common noise towards its asymptotic Mean Field Game. By postulating a Markovian structure via two auxiliary processes for the first and second moments of the Mean Field Game equilibrium and applying the fixed point condition in Mean Field Game, we first provide the characterization of the equilibrium measure in Mean Field Game with a finite-dimensional Riccati system of ODEs. Additionally, with an explicit coupling of the optimal trajectory of the $N$-player game driven by $N$ dimensional Brownian motion and Mean Field Game counterpart driven by one-dimensional Brownian motion, we obtain the convergence rate $O(N^{-1/2})$ with respect to 2-Wasserstein distance.
△ Less
Submitted 28 August, 2023; v1 submitted 8 June, 2021;
originally announced June 2021.
-
Éz fields
Authors:
Erik Walsberg,
Jinhe Ye
Abstract:
Let $K$ be a field. The étale open topology on the $K$-points $V(K)$ of a $K$-variety $V$ was introduced in our previous work. The étale open topology is non-discrete if and only if $K$ is large. If $K$ is separably, real, $p$-adically closed then the étale open topology agrees with the Zariski, order, valuation topology, respectively. We show that existentially definable sets in perfect large fie…
▽ More
Let $K$ be a field. The étale open topology on the $K$-points $V(K)$ of a $K$-variety $V$ was introduced in our previous work. The étale open topology is non-discrete if and only if $K$ is large. If $K$ is separably, real, $p$-adically closed then the étale open topology agrees with the Zariski, order, valuation topology, respectively. We show that existentially definable sets in perfect large fields behave well with respect to this topology: such sets are finite unions of étale open subsets of Zariski closed sets. This implies that existentially definable sets in arbitrary perfect large fields enjoy some of the well-known topological properties of definable sets in algebraically, real, and $p$-adically closed fields. We introduce and study the class of éz fields: $K$ is éz if $K$ is large and every definable set is a finite union of étale open subsets of Zariski closed sets. This should be seen as a generalized notion of model completeness for large fields. Algebraically closed, real closed, $p$-adically closed, and bounded $\mathrm{PAC}$ fields are éz. (In particular pseudofinite fields and infinite algebraic extensions of finite fields are éz.) We develop the basics of a theory of definable sets in éz fields. This gives a uniform approach to the theory of definable sets across all characteristic zero local fields and a new topological theory of definable sets in bounded $\mathrm{PAC}$ fields. We also show that some prominent examples of possibly non-model complete model-theoretically tame fields (characteristic zero Henselian fields and Frobenius fields) are éz.
△ Less
Submitted 14 October, 2022; v1 submitted 11 March, 2021;
originally announced March 2021.
-
Difference of convex algorithms for bilevel programs with applications in hyperparameter selection
Authors:
Jane J. Ye,
Xiaoming Yuan,
Shangzhi Zeng,
Jin Zhang
Abstract:
In this paper, we present difference of convex algorithms for solving bilevel programs in which the upper level objective functions are difference of convex functions, and the lower level programs are fully convex. This nontrivial class of bilevel programs provides a powerful modelling framework for dealing with applications arising from hyperparameter selection in machine learning. Thanks to the…
▽ More
In this paper, we present difference of convex algorithms for solving bilevel programs in which the upper level objective functions are difference of convex functions, and the lower level programs are fully convex. This nontrivial class of bilevel programs provides a powerful modelling framework for dealing with applications arising from hyperparameter selection in machine learning. Thanks to the full convexity of the lower level program, the value function of the lower level program turns out to be convex and hence the bilevel program can be reformulated as a difference of convex bilevel program. We propose two algorithms for solving the reformulated difference of convex program and show their convergence under very mild assumptions. Finally we conduct numerical experiments to a bilevel model of support vector machine classification.
△ Less
Submitted 28 August, 2022; v1 submitted 17 February, 2021;
originally announced February 2021.