-
h-calibration: Rethinking Classifier Recalibration with Probabilistic Error-Bounded Objective
Authors:
Wenjian Huang,
Guiping Cao,
Jiahao Xia,
Jingkun Chen,
Hao Wang,
Jianguo Zhang
Abstract:
Deep neural networks have demonstrated remarkable performance across numerous learning tasks but often suffer from miscalibration, resulting in unreliable probability outputs. This has inspired many recent works on mitigating miscalibration, particularly through post-hoc recalibration methods that aim to obtain calibrated probabilities without sacrificing the classification performance of pre-trai…
▽ More
Deep neural networks have demonstrated remarkable performance across numerous learning tasks but often suffer from miscalibration, resulting in unreliable probability outputs. This has inspired many recent works on mitigating miscalibration, particularly through post-hoc recalibration methods that aim to obtain calibrated probabilities without sacrificing the classification performance of pre-trained models. In this study, we summarize and categorize previous works into three general strategies: intuitively designed methods, binning-based methods, and methods based on formulations of ideal calibration. Through theoretical and practical analysis, we highlight ten common limitations in previous approaches. To address these limitations, we propose a probabilistic learning framework for calibration called h-calibration, which theoretically constructs an equivalent learning formulation for canonical calibration with boundedness. On this basis, we design a simple yet effective post-hoc calibration algorithm. Our method not only overcomes the ten identified limitations but also achieves markedly better performance than traditional methods, as validated by extensive experiments. We further analyze, both theoretically and experimentally, the relationship and advantages of our learning objective compared to traditional proper scoring rule. In summary, our probabilistic framework derives an approximately equivalent differentiable objective for learning error-bounded calibrated probabilities, elucidating the correspondence and convergence properties of computational statistics with respect to theoretical bounds in canonical calibration. The theoretical effectiveness is verified on standard post-hoc calibration benchmarks by achieving state-of-the-art performance. This research offers valuable reference for learning reliable likelihood in related fields.
△ Less
Submitted 22 June, 2025;
originally announced June 2025.
-
On the equivalent p-th von Neumann-Jordan constant associated with isosceles orthogonality in Banach spaces
Authors:
Yuxin Wang,
Qi Liu,
Yongmo Hu,
Jinyu Xia,
Mengmeng Bao
Abstract:
In this paper, we define a new geometric constant based on isosceles orthogonality, denoted by . Through research, we find that this constant is the equivalent p-th von Neumann Jordan constant in the sense of isosceles orthogonality. First, we obtain some basic properties of the constant. Then, we calculate the upper and lower bounds of the constant. Through three examples, it is found that the up…
▽ More
In this paper, we define a new geometric constant based on isosceles orthogonality, denoted by . Through research, we find that this constant is the equivalent p-th von Neumann Jordan constant in the sense of isosceles orthogonality. First, we obtain some basic properties of the constant. Then, we calculate the upper and lower bounds of the constant. Through three examples, it is found that the upper bound of the constant is attainable. We also compare the relationship between this constant and other constants. Finally, we establish the connection between the constant and some geometric properties in Banach spaces, such as uniform non-squareness, uniform smoothness.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
Numerical characterization of the hard Lefschetz classes of dimension two, II: supercritical collections of free divisor classes
Authors:
Jiajun Hu,
Jian Xiao
Abstract:
For $(n-2)$ free divisor classes on a smooth projective variety of dimension $n$, the product of these free divisor classes induces a Lefschetz type operator acting on the Néron-Severi space or the cohomology group of $(1,1)$ classes. We give a characterization of this kernel space, when the collection of these free divisor classes is supercritical. This resolves Shenfeld-van Handel's open problem…
▽ More
For $(n-2)$ free divisor classes on a smooth projective variety of dimension $n$, the product of these free divisor classes induces a Lefschetz type operator acting on the Néron-Severi space or the cohomology group of $(1,1)$ classes. We give a characterization of this kernel space, when the collection of these free divisor classes is supercritical. This resolves Shenfeld-van Handel's open problem in this setting. As consequences, we provide an algebro-geometric proof of the characterization of the extremals of the Alexandrov-Fenchel inequality for a supercritical collection of rational convex polytopes; we also give a characterization of the extremals of the Khovanskii-Teissier inequality given by the intersection numbers of two arbitrary free divisor classes.
△ Less
Submitted 24 May, 2025;
originally announced May 2025.
-
Notes on Chevalley Groups and Root Category I
Authors:
Buyan Li,
Jie Xiao
Abstract:
Based on the construction of simple Lie algebras via root category and following Chevalley's results, we construct Chevalley groups from the root category. Then we prove the Bruhat decomposition and the simplicity of the Chevalley groups, and calculate the orders of finite Chevalley groups.
Based on the construction of simple Lie algebras via root category and following Chevalley's results, we construct Chevalley groups from the root category. Then we prove the Bruhat decomposition and the simplicity of the Chevalley groups, and calculate the orders of finite Chevalley groups.
△ Less
Submitted 23 May, 2025;
originally announced May 2025.
-
The skew generalized von Neumann-Jordan type constant in Banach spaces
Authors:
Yuxin Wang,
Qi Liu,
Yueyue Feng,
Jinyu Xia,
Muhammad Sarfraz
Abstract:
Recently, the von Neumann-Jordan type constants C(X) has defined by Takahashi. A new skew generalized constant Cp(λ,μ,X) based on C(X) constant is given in this paper. First, we will obtain some basic properties of this new constant. Moreover, some relations between this new constant and other constants are investigated. Specially, with the Banach-Mazur distance, we use this new constant to study…
▽ More
Recently, the von Neumann-Jordan type constants C(X) has defined by Takahashi. A new skew generalized constant Cp(λ,μ,X) based on C(X) constant is given in this paper. First, we will obtain some basic properties of this new constant. Moreover, some relations between this new constant and other constants are investigated. Specially, with the Banach-Mazur distance, we use this new constant to study isomorphic Banach spaces. Ultimately, by leveraging the connection between the newly introduced constant and the weak orthogonality coefficient ω(X), a sufficient condition for normal structure is established.
△ Less
Submitted 23 May, 2025;
originally announced May 2025.
-
On some generalized geometric constants with two parameters in Banach spaces
Authors:
Yuxin Wang,
Qi Liu,
Haoyu Zhou,
Jinyu Xia,
Muhammad Toseef
Abstract:
In this paper, we build upon the TX constant that was introduced by Alonso and Llorens-Fuster in 2008. Through the incorporation of suitable parameters, we have successfully generalized the aforementioned constant into two novel forms of geometric constants, which are denoted as T1(λ,μ,X ) and T2(\k{appa},τ,X ). First, we obtained some basic properties of these two constants, such as the upper and…
▽ More
In this paper, we build upon the TX constant that was introduced by Alonso and Llorens-Fuster in 2008. Through the incorporation of suitable parameters, we have successfully generalized the aforementioned constant into two novel forms of geometric constants, which are denoted as T1(λ,μ,X ) and T2(\k{appa},τ,X ). First, we obtained some basic properties of these two constants, such as the upper and lower bounds. Next, these two constants served as the basis for our characterization of Hilbert spaces. More significantly, our findings reveal that these two constants exhibit a profound and intricate interrelation with other well-known constants in Banach spaces. Finally, we characterized uniformly non-square spaces by means of these two constants.
△ Less
Submitted 23 May, 2025;
originally announced May 2025.
-
Positivity in the shadow of Hodge index theorem
Authors:
Jiajun Hu,
Jian Xiao
Abstract:
Taking a compact Kähler manifold as playground, we explore the powerfulness of Hodge index theorem. A main object is the Lorentzian classes on a compact Kähler manifold, behind which the characterization via Lorentzian polynomials over the Kähler cone and hence the validity of Hodge index theorem. Along the exploration, we discover several applications in complex geometry that may be unexpected be…
▽ More
Taking a compact Kähler manifold as playground, we explore the powerfulness of Hodge index theorem. A main object is the Lorentzian classes on a compact Kähler manifold, behind which the characterization via Lorentzian polynomials over the Kähler cone and hence the validity of Hodge index theorem. Along the exploration, we discover several applications in complex geometry that may be unexpected before. (1) For a Lefschetz type operator given by the complete intersection of nef classes, we give a complete characterization of its kernel face against the pseudo-effective cone. (2) We provide a new approach to Teissier's proportionality problem from the validity of hard Lefschetz property. This perspective enables us to establish the extremals for the Brunn-Minkowski inequality on a strictly Lorentzian class, and thus also characterize the most extremal case for a log-concavity sequence given by the intersection numbers of two nef classes. These Lorentzian classes include the fundamental classes of smooth projective varieties or compact Kähler manifolds as typical examples, hence our result extends Boucksom-Favre-Jonsson's and Fu-Xiao's results in respective settings to broader contexts, e.g. certain algebraic cycle classes given by reducible subvarieties. (3) Furthermore, we also strengthen the proportionality characterization by comparing various quantitative deficits and establishing stability estimates. Two quantitative sharper stability estimates with close relation with complex Monge--Ampère equations and Newton-Okounkov bodies are also discussed.
△ Less
Submitted 10 May, 2025;
originally announced May 2025.
-
A new geometric constant to compare p-angular and skew p-angular distances
Authors:
Yuxin Wang,
Qi Liu,
Jinyu Xia,
Muhammad Sarfraz
Abstract:
The $p$-angular distance was first introduced by Maligranda in 2006, while the skew $p$-angular distance was first introduced by Rooin in 2018. In this paper, we shall introduce a new geometric constant named Maligranda-Rooin constant in Banach spaces to compare $p$-angular distance and skew $p$-angular distance. We denote the Maligranda-Rooin constant as $\mathcal{M} \mathcal{R}_p(\mathcal{X})$.…
▽ More
The $p$-angular distance was first introduced by Maligranda in 2006, while the skew $p$-angular distance was first introduced by Rooin in 2018. In this paper, we shall introduce a new geometric constant named Maligranda-Rooin constant in Banach spaces to compare $p$-angular distance and skew $p$-angular distance. We denote the Maligranda-Rooin constant as $\mathcal{M} \mathcal{R}_p(\mathcal{X})$. First, the upper and lower bounds for the $\mathcal{M} \mathcal{R}_p(\mathcal{X})$ constant is given. Next, it's shown that, a normed linear space is an inner space if and only if $\mathcal{M} \mathcal{R}_p(\mathcal{X})=1$. Moreover, an equivalent form of this new constant is established. By means of the $\mathcal{M} \mathcal{R}_p(\mathcal{X})$ constant, we carry out the quantification of the characterization of uniform nonsquareness. Finally, we study the relationship between the $\mathcal{M} \mathcal{R}_p(\mathcal{X})$ constant, uniform convexity, uniform smooth and normal structure.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
Symmetric form geometric constant related to isosceles orthogonality in Banach spaces
Authors:
Qichuan Ni,
Qi Liu,
Yuxin Wang,
Jinyu Xia,
Ranran Wang
Abstract:
In this article, we introduce a novel geometric constant $L_X(t)$, which provides an equivalent definition of the von Neumann-Jordan constant from an orthogonal perspective. First, we present some fundamental properties of the constant $L_X(t)$ in Banach spaces, including its upper and lower bounds, as well as its convexity, non-increasing continuity. Next, we establish the identities of $L_X(t)$…
▽ More
In this article, we introduce a novel geometric constant $L_X(t)$, which provides an equivalent definition of the von Neumann-Jordan constant from an orthogonal perspective. First, we present some fundamental properties of the constant $L_X(t)$ in Banach spaces, including its upper and lower bounds, as well as its convexity, non-increasing continuity. Next, we establish the identities of $L_X(t)$ and the function $γ_X(t)$, the von Neumann-Jordan constant, respectively. We also delve into the relationship between this novel constant and several renowned geometric constants (such as the James constant and the modulus of convexity). Furthermore, by utilizing the lower bound of this new constant, we characterize Hilbert spaces. Finally, based on these findings, we further investigate the connection between this novel constant and the geometric properties of Banach spaces, including uniformly non-square, uniformly normal structure, uniformly smooth, etc.
△ Less
Submitted 30 March, 2025;
originally announced April 2025.
-
Enhanced gradient recovery-based a posteriori error estimator and adaptive finite element method for elliptic equations
Authors:
Ying Liu,
Jingjing Xiao,
Nianyu Yi,
Huihui Cao
Abstract:
Recovery type a posteriori error estimators are popular, particularly in the engineering community, for their computationally inexpensive, easy to implement, and generally asymptotically exactness. Unlike the residual type error estimators, one can not establish upper and lower a posteriori error bounds for the classical recovery type error estimators without the saturation assumption. In this pap…
▽ More
Recovery type a posteriori error estimators are popular, particularly in the engineering community, for their computationally inexpensive, easy to implement, and generally asymptotically exactness. Unlike the residual type error estimators, one can not establish upper and lower a posteriori error bounds for the classical recovery type error estimators without the saturation assumption. In this paper, we first present three examples to show the unsatisfactory performance in the practice of standard residual or recovery-type error estimators, then, an improved gradient recovery-based a posteriori error estimator is constructed. The proposed error estimator contains two parts, one is the difference between the direct and post-processed gradient approximations, and the other is the residual of the recovered gradient. The reliability and efficiency of the enhanced estimator are derived. Based on the improved recovery-based error estimator and the newest-vertex bisection refinement method with a tailored mark strategy, an adaptive finite element algorithm is designed. We then prove the convergence of the adaptive method by establishing the contraction of gradient error plus oscillation. Numerical experiments are provided to illustrate the asymptotic exactness of the new recovery-based a posteriori error estimator and the high efficiency of the corresponding adaptive algorithm.
△ Less
Submitted 25 March, 2025;
originally announced March 2025.
-
High Accuracy Techniques Based Adaptive Finite Element Methods for Elliptic PDEs
Authors:
Jingjing Xiao,
Ying Liu,
Nianyu Yi
Abstract:
This paper aims to develop an efficient adaptive finite element method for the second-order elliptic problem. Although the theory for adaptive finite element methods based on residual-type a posteriori error estimator and bisection refinement has been well established, in practical computations, the use of non-asymptotic exact of error estimator and the excessive number of adaptive iteration steps…
▽ More
This paper aims to develop an efficient adaptive finite element method for the second-order elliptic problem. Although the theory for adaptive finite element methods based on residual-type a posteriori error estimator and bisection refinement has been well established, in practical computations, the use of non-asymptotic exact of error estimator and the excessive number of adaptive iteration steps often lead to inefficiency of the adaptive algorithm. We propose an efficient adaptive finite element method based on high-accuracy techniques including the superconvergence recovery technique and high-quality mesh optimization. The centroidal Voronoi Delaunay triangulation mesh optimization is embedded in the mesh adaption to provide high-quality mesh, and then assure that the superconvergence property of the recovered gradient and the asymptotical exactness of the error estimator. A tailored adaptive strategy, which could generate high-quality meshes with a target number of vertices, is developed to ensure the adaptive computation process terminated within $7$ steps. The effectiveness and robustness of the adaptive algorithm is numerically demonstrated.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium
Authors:
Kaizhao Liu,
Qi Long,
Zhekun Shi,
Weijie J. Su,
Jiancong Xiao
Abstract:
Aligning large language models (LLMs) with diverse human preferences is critical for ensuring fairness and informed outcomes when deploying these models for decision-making. In this paper, we seek to uncover fundamental statistical limits concerning aligning LLMs with human preferences, with a focus on the probabilistic representation of human preferences and the preservation of diverse preference…
▽ More
Aligning large language models (LLMs) with diverse human preferences is critical for ensuring fairness and informed outcomes when deploying these models for decision-making. In this paper, we seek to uncover fundamental statistical limits concerning aligning LLMs with human preferences, with a focus on the probabilistic representation of human preferences and the preservation of diverse preferences in aligned LLMs. We first show that human preferences can be represented by a reward model if and only if the preference among LLM-generated responses is free of any Condorcet cycle. Moreover, we prove that Condorcet cycles exist with probability converging to one exponentially fast under a probabilistic preference model, thereby demonstrating the impossibility of fully aligning human preferences using reward-based approaches such as reinforcement learning from human feedback. Next, we explore the conditions under which LLMs would employ mixed strategies -- meaning they do not collapse to a single response -- when aligned in the limit using a non-reward-based approach, such as Nash learning from human feedback (NLHF). We identify a necessary and sufficient condition for mixed strategies: the absence of a response that is preferred over all others by a majority. As a blessing, we prove that this condition holds with high probability under the probabilistic preference model, thereby highlighting the statistical possibility of preserving minority preferences without explicit regularization in aligning LLMs. Finally, we leverage insights from our statistical results to design a novel, computationally efficient algorithm for finding Nash equilibria in aligning LLMs with NLHF. Our experiments show that Llama-3.2-1B, aligned with our algorithm, achieves a win rate of 60.55\% against the base model.
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
Unitary Friedberg-Jacquet periods and their twists: Relative trace formulas
Authors:
Spencer Leslie,
Jingwei Xiao,
Wei Zhang
Abstract:
In a companion paper, we formulated a global conjecture for the automorphic period integral associated to the symmetric pairs defined by unitary groups over number fields, generalizing a theorem of Waldspurger's toric period for $\mathrm{GL}(2)$. In this paper, we introduce a new relative trace formula to prove our global conjecture under some local hypotheses. A new feature is the presence of rel…
▽ More
In a companion paper, we formulated a global conjecture for the automorphic period integral associated to the symmetric pairs defined by unitary groups over number fields, generalizing a theorem of Waldspurger's toric period for $\mathrm{GL}(2)$. In this paper, we introduce a new relative trace formula to prove our global conjecture under some local hypotheses. A new feature is the presence of relative endoscopy in the comparison. We also establish several local results on relative characters.
△ Less
Submitted 27 March, 2025; v1 submitted 12 March, 2025;
originally announced March 2025.
-
Unitary Friedberg-Jacquet periods and their twists: Fundamental lemmas
Authors:
Spencer Leslie,
Jingwei Xiao,
Wei Zhang
Abstract:
We formulate a global conjecture for the automorphic period integral associated to the symmetric pairs defined by unitary groups over number fields, generalizing a theorem of Waldspurger's toric period for $\mathrm{GL}(2)$. We introduce a new relative trace formula to prove our global conjecture under some local hypotheses. A new feature is the presence of the relative endoscopy. In this paper we…
▽ More
We formulate a global conjecture for the automorphic period integral associated to the symmetric pairs defined by unitary groups over number fields, generalizing a theorem of Waldspurger's toric period for $\mathrm{GL}(2)$. We introduce a new relative trace formula to prove our global conjecture under some local hypotheses. A new feature is the presence of the relative endoscopy. In this paper we prove the main local theorem: a new relative fundamental lemma comparing certain orbital integrals of functions matched in terms of Hironaka and Satake transforms.
△ Less
Submitted 12 March, 2025;
originally announced March 2025.
-
Isogenies of CM Elliptic Curves
Authors:
Edgar Assing,
Yingkun Li,
Tian Wang,
Jiacheng Xia
Abstract:
Given two CM elliptic curves over a number field and a natural number $m$, we establish a polynomial lower bound (in terms of $m$) for the number of rational primes $p$ such that the reductions of these elliptic curves modulo a prime above $p$ are $m$-isogenous. The proof relies on higher Green functions and theorems of Gross-Zagier and Gross-Kohnen-Zagier. A crucial observation is that the Fourie…
▽ More
Given two CM elliptic curves over a number field and a natural number $m$, we establish a polynomial lower bound (in terms of $m$) for the number of rational primes $p$ such that the reductions of these elliptic curves modulo a prime above $p$ are $m$-isogenous. The proof relies on higher Green functions and theorems of Gross-Zagier and Gross-Kohnen-Zagier. A crucial observation is that the Fourier coefficients of incoherent Eisenstein series can be approximated by those of coherent Eisenstein series of increasing level. Another key ingredient is an explicit upper bound for the Petersson norm of an arbitrary elliptic modular form in terms of finitely many of its Fourier coefficients at the cusp infinity, which is a result of independent interest.
△ Less
Submitted 7 March, 2025;
originally announced March 2025.
-
An Efficient Quantum Approximate Optimization Algorithm with Fixed Linear Ramp Schedule for Truss Structure Optimization
Authors:
Junsen Xiao,
Naruethep Sukulthanasorn,
Reika Nomura,
Shuji Moriguchi,
Kenjiro Terada
Abstract:
This study proposes a novel structural optimization framework based on quantum variational circuits, in which the multiplier acting on the cross-sectional area of each rod in a truss structure as an updater is used as a design variable. Specifically, we employ a classical processor for structural analysis with the finite element method, and the Quantum Approximate Optimization Algorithm (QAOA) is…
▽ More
This study proposes a novel structural optimization framework based on quantum variational circuits, in which the multiplier acting on the cross-sectional area of each rod in a truss structure as an updater is used as a design variable. Specifically, we employ a classical processor for structural analysis with the finite element method, and the Quantum Approximate Optimization Algorithm (QAOA) is subsequently performed to update the cross-sectional area so that the compliance is minimized. The advantages of this framework can be seen in three key aspects. First, by defining design variables as multipliers, rather than simply reducing the design variable to a binary candidate of inclusion or exclusion (corresponding to qubit states, ``0" and ``1"), it provides greater flexibility in adjusting the cross-sectional area of the rod at each iteration of the optimization process. Second, the multipliers acting on rods are encoded with on-off encoding, eliminating additional constraints in the convergence judgement. As a result, the objective function is in a simple format, enabling efficient optimization using QAOA.Third, a fixed linear ramp schedule (FLRS) for variational parameter setting bypasses the classical optimization process, thereby improving the operational efficiency of the framework. In the two structural cases investigated in this study, the proposed approach highlights the feasibility and applicability potential of quantum computing in advancing engineering design and optimization. Numerical experiments have demonstrated the effectiveness of this framework, providing a firm foundation for future research on quantum-assisted optimization methods in engineering fields.
△ Less
Submitted 23 February, 2025;
originally announced February 2025.
-
Essential $p$-capacity-volume estimates for rotationally symmetric manifolds
Authors:
Xiaoshang Jin,
Jie Xiao
Abstract:
Given $p\in [1,\infty]$, this article presents the novel basic volumetric estimates for the relative $p$-capacities with their applications to finding not only the sharp weak $(p,q)$-imbeddings but also the precise lower bounds of the principal $p$-frequencies, which principally live in the rotationally symmetric manifolds.
Given $p\in [1,\infty]$, this article presents the novel basic volumetric estimates for the relative $p$-capacities with their applications to finding not only the sharp weak $(p,q)$-imbeddings but also the precise lower bounds of the principal $p$-frequencies, which principally live in the rotationally symmetric manifolds.
△ Less
Submitted 19 February, 2025;
originally announced February 2025.
-
Sharply estimating hyperbolic capacities
Authors:
Xiaoshang Jin,
Jie Xiao
Abstract:
This paper is devoted to establishing four types of sharp capacitary inequalities within the hyperbolic space as detailed in Theorems 2.1-3.1-4.1-5.1.
This paper is devoted to establishing four types of sharp capacitary inequalities within the hyperbolic space as detailed in Theorems 2.1-3.1-4.1-5.1.
△ Less
Submitted 19 February, 2025;
originally announced February 2025.
-
A note on the maximum diversity of intersecting families of symmetric groups
Authors:
Jian Wang,
Jimeng Xiao
Abstract:
Let $\mathcal{S}_n$ be the symmetric group on the set $[n]:=\{1,2,\ldots,n\}$. A family $\mathcal{F}\subset \mathcal{S}_n$ is called intersecting if for every $σ,π\in \mathcal{F}$ there exists some $i\in [n]$ such that $σ(i)=π(i)$. Deza and Frankl proved that the largest intersecting family of permutations is the full star, that is, the collection of all permutations with a fixed position. The div…
▽ More
Let $\mathcal{S}_n$ be the symmetric group on the set $[n]:=\{1,2,\ldots,n\}$. A family $\mathcal{F}\subset \mathcal{S}_n$ is called intersecting if for every $σ,π\in \mathcal{F}$ there exists some $i\in [n]$ such that $σ(i)=π(i)$. Deza and Frankl proved that the largest intersecting family of permutations is the full star, that is, the collection of all permutations with a fixed position. The diversity of an intersecting family $\mathcal{F}$ is defined as the minimum number of permutations in $\mathcal{F}$, which deletion results in a star. In the present paper, by applying the spread approximation method developed recently by Kupavskii and Zakharov, we prove that for $n\geq 500$ the diversity of an intersecting subfamily of $\mathcal{S}_n$ is at most $(n-3)(n-3)!$, which is best possible.
△ Less
Submitted 12 January, 2025;
originally announced January 2025.
-
Self-embedding similitudes of Bedford-McMullen carpets with dependent ratios
Authors:
Jian-Ci Xiao
Abstract:
We prove that any non-degenerate Bedford-McMullen carpet does not allow oblique self-embedding similitudes; that is, if $f$ is a similitude sending the carpet into itself, then the image of the $x$-axis under $f$ must be parallel to one of the principal axes. We also establish a logarithmic commensurability result on the contraction ratios of such embeddings. This completes a previous study of Alg…
▽ More
We prove that any non-degenerate Bedford-McMullen carpet does not allow oblique self-embedding similitudes; that is, if $f$ is a similitude sending the carpet into itself, then the image of the $x$-axis under $f$ must be parallel to one of the principal axes. We also establish a logarithmic commensurability result on the contraction ratios of such embeddings. This completes a previous study of Algom and Hochman [Ergod. Th. & Dynam. Sys. 39 (2019), 577--603] on Bedford-McMullen carpets generated by multiplicatively independent exponents, together with a new proof on their non-obliqueness statement.
For the self-similar case, however, we construct a generalized Sierpinski carpet that is symmetric with respect to an appropriate oblique line and hence allows a reflectional oblique self-embedding. As a complement, we prove that if a generalized Sierpinski carpet satisfies the strong separation condition and permits an oblique rotational self-embedding similitude, then the tangent of the rotation angle takes values $\pm 1$.
△ Less
Submitted 31 January, 2025; v1 submitted 2 December, 2024;
originally announced December 2024.
-
Non-uniform Cross-intersecting Families
Authors:
Zhen Jia,
Qing Xiang,
Jimeng Xiao,
Huajun Zhang
Abstract:
Let $m\geq 2$, $n$ be positive integers, and $R_i=\{k_{i,1} >k_{i,2} >\cdots> k_{i,t_i}\}$ be subsets of $[n]$ for $i=1,2,\ldots,m$. The families $\mathcal{F}_1\subseteq \binom{[n]}{R_1},\mathcal{F}_2\subseteq \binom{[n]}{R_2},\ldots,\mathcal{F}_m\subseteq \binom{[n]}{R_m}$ are said to be non-empty cross-intersecting if for each $i\in [m]$, $\mathcal{F}_i\neq\emptyset$ and for any…
▽ More
Let $m\geq 2$, $n$ be positive integers, and $R_i=\{k_{i,1} >k_{i,2} >\cdots> k_{i,t_i}\}$ be subsets of $[n]$ for $i=1,2,\ldots,m$. The families $\mathcal{F}_1\subseteq \binom{[n]}{R_1},\mathcal{F}_2\subseteq \binom{[n]}{R_2},\ldots,\mathcal{F}_m\subseteq \binom{[n]}{R_m}$ are said to be non-empty cross-intersecting if for each $i\in [m]$, $\mathcal{F}_i\neq\emptyset$ and for any $A\in \mathcal{F}_i,B\in\mathcal{F}_j$, $1\leq i<j\leq m$, $|A\bigcap B|\geq1$. In this paper, we determine the maximum value of $\sum_{j=1}^{m}|\mathcal{F}_j|$ for non-empty cross-intersecting family $\mathcal{F}_1, \mathcal{F}_2,\ldots,\mathcal{F}_m$ when $n\geq k_1+k_2$, where $k_1$ (respectively, $k_2$) is the largest (respectively, second largest) value in $\{k_{1,1},k_{2,1},\ldots,k_{m,1}\}$. This result is a generalization of the results by Shi, Frankl and Qian \cite{shi2022non} on non-empty cross-intersecting families. Moreover, the extremal families are completely characterized.
△ Less
Submitted 27 November, 2024;
originally announced November 2024.
-
The skew generalized Von Neumann Jordan constant in the unit sphere
Authors:
Yuxin Wang,
Qi Liu,
Jinyu Xia,
Shuaizhe Huang
Abstract:
In this paper, we introduce a new constant for Banach spaces, denoted as $\widetilde{C}_{\mathrm{NJ}}^p(ξ, v, X)$. We provide calculations for both the lower and upper bounds of this constant, as well as its exact values in certain Banach spaces. Furthermore, we give the inequality relationship between the $\widetilde{C}_{\mathrm{NJ}}^p(ξ, v, X)$ constant and the other two constants. Besides, we e…
▽ More
In this paper, we introduce a new constant for Banach spaces, denoted as $\widetilde{C}_{\mathrm{NJ}}^p(ξ, v, X)$. We provide calculations for both the lower and upper bounds of this constant, as well as its exact values in certain Banach spaces. Furthermore, we give the inequality relationship between the $\widetilde{C}_{\mathrm{NJ}}^p(ξ, v, X)$ constant and the other two constants. Besides, we establish an equivalent relationship between the $\widetilde{C}_{\mathrm{NJ}}^p(ξ, v, X)$ constant and the $\widetilde{C}_{\mathrm{NJ}}^{(p)}(X)$ constant. Specifically, we shall exhibit the connections between the constant $\widetilde{C}_{\mathrm{NJ}}^p(ξ, v, X)$ and certain geometric characteristics of Ba nach spaces, including uniform convexity and uniform nonsquareness. Additionally, a sufficient condition for uniform normal structure about the $\widetilde{C}_{\mathrm{NJ}}^p(ξ, v, X)$ constant is also established.
△ Less
Submitted 15 November, 2024;
originally announced November 2024.
-
Lusztig sheaves and integrable highest weight modules in symmetrizable cases
Authors:
Yixin Lan,
Yumeng Wu,
Jie Xiao
Abstract:
The present paper continues the work of [10] and [6]. For any symmetrizable generalized Cartan Matrix $C$ and the corresponding quantum group $\mathbf{U}$, we consider the associated quiver $Q$ with an admissible automorphism $a$. We construct the category $\widetilde{\mathcal{Q}/\mathcal{N}}$ of the localization of Lusztig sheaves for the quiver with the automorphism of corresponding framed quive…
▽ More
The present paper continues the work of [10] and [6]. For any symmetrizable generalized Cartan Matrix $C$ and the corresponding quantum group $\mathbf{U}$, we consider the associated quiver $Q$ with an admissible automorphism $a$. We construct the category $\widetilde{\mathcal{Q}/\mathcal{N}}$ of the localization of Lusztig sheaves for the quiver with the automorphism of corresponding framed quiver and 2-framed quiver. Their Grothendieck groups give realizations of integrable highest weight module $L(λ)$ and the tensor product of integrable highest weights $\mathbf{U}-$module $L(λ_1)\otimes L(λ_2)$, and modulo the traceless ones Lusztig sheaves provide the (signed) canonical basis of $L(λ)$ and $L(λ_1)\otimes L(λ_2)$. As an application, the symmetrizable crystal structures on Nakajima's quiver/tensor product varieties and Lusztig's nilpotent varieties of preprojective algebras are deduced.
△ Less
Submitted 6 July, 2025; v1 submitted 14 November, 2024;
originally announced November 2024.
-
Finiteness Results for Non-Scattering Herglotz Waves: The case of inhomogeneities obtained by very general perturbations of disks
Authors:
Michael S. Vogelius,
Jingni Xiao
Abstract:
We study non-scattering phenomena associated with the time-harmonic Helmholtz equation in two dimensions. For very general classes of star-shaped domains, we show that there are at most finitely many wave numbers such that Herglotz incident waves with a fixed density function are non-scattering.
We study non-scattering phenomena associated with the time-harmonic Helmholtz equation in two dimensions. For very general classes of star-shaped domains, we show that there are at most finitely many wave numbers such that Herglotz incident waves with a fixed density function are non-scattering.
△ Less
Submitted 16 June, 2025; v1 submitted 12 November, 2024;
originally announced November 2024.
-
Pattern formation and global analysis of a systematically reduced plant model in dryland environment
Authors:
Yonghui Xia,
Jianglong Xiao,
Jianshe Yu
Abstract:
This paper delves into a systematically reduced plant system proposed by Jaïbi et al. [Phys. D, 2020] in arid area. They used the method of geometric singular perturbation to study the existence of abundant orbits. Instead, we deliberate the stability and distributed patterns of this system. For a non-diffusive scenario for the model, we scrutinize the local and global stability of equilibria and…
▽ More
This paper delves into a systematically reduced plant system proposed by Jaïbi et al. [Phys. D, 2020] in arid area. They used the method of geometric singular perturbation to study the existence of abundant orbits. Instead, we deliberate the stability and distributed patterns of this system. For a non-diffusive scenario for the model, we scrutinize the local and global stability of equilibria and derive conditions for the existence or non-existence of the limit cycle. The bifurcation behaviors are also explored. For the spatial model, we investigate Hopf, Turing, Hopf-Turing, Turing-Turing bifurcations. Specially, the evolution process from periodic solutions to spatially nonconstant steady states is observed near the Hopf-Turing bifurcation point. And mixed nonconstant steady states near the Turing-Turing bifurcation point are observed. Furthermore, it's found that there exist gap, spot, stripe and mixed patterns. The seed-dispersal rate enables the transformation of pattern structures. Reasonable control of system parameters may prevent desertification from occurring.
△ Less
Submitted 30 October, 2024;
originally announced November 2024.
-
Robust globally divergence-free weak Galerkin methods for unsteady incompressible convective Brinkman-Forchheimer equations
Authors:
Xiaojuan Wang,
Jihong Xiao,
Xiaoping Xie,
Shiquan Zhang
Abstract:
This paper develops and analyzes a class of semi-discrete and fully discrete weak Galerkin finite element methods for unsteady incompressible convective Brinkman-Forchheimer equations. For the spatial discretization, the methods adopt the piecewise polynomials of degrees $m\ (m\geq1)$ and $m-1$ respectively to approximate the velocity and pressure inside the elements, and piecewise polynomials of…
▽ More
This paper develops and analyzes a class of semi-discrete and fully discrete weak Galerkin finite element methods for unsteady incompressible convective Brinkman-Forchheimer equations. For the spatial discretization, the methods adopt the piecewise polynomials of degrees $m\ (m\geq1)$ and $m-1$ respectively to approximate the velocity and pressure inside the elements, and piecewise polynomials of degree $m$ to approximate their numerical traces on the interfaces of elements. In the fully discrete method, the backward Euler difference scheme is used to approximate the time derivative. The methods are shown to yield globally divergence-free velocity approximation. Optimal a priori error estimates in the energy norm and $L^2$ norm are established. A convergent linearized iterative algorithm is designed for solving the fully discrete system. Numerical experiments are provided to verify the theoretical results.
△ Less
Submitted 12 October, 2024;
originally announced October 2024.
-
Time-Consistent Portfolio Selection for Rank-Dependent Utilities in an Incomplete Market
Authors:
Jiaqin Wei,
Jianming Xia,
Qian Zhao
Abstract:
We investigate the portfolio selection problem for an agent with rank-dependent utility in an incomplete financial market. For a constant-coefficient market and CRRA utilities, we characterize the deterministic strict equilibrium strategies. In the case of time-invariant probability weighting function, we provide a comprehensive characterization of the deterministic strict equilibrium strategy. Th…
▽ More
We investigate the portfolio selection problem for an agent with rank-dependent utility in an incomplete financial market. For a constant-coefficient market and CRRA utilities, we characterize the deterministic strict equilibrium strategies. In the case of time-invariant probability weighting function, we provide a comprehensive characterization of the deterministic strict equilibrium strategy. The unique non-zero equilibrium, if exists, can be determined by solving an autonomous ODE. In the case of time-variant probability weighting functions, we observe that there may be infinitely many non-zero deterministic strict equilibrium strategies, which are derived from the positive solutions to a nonlinear singular ODE. By specifying the maximal solution to the singular ODE, we are able to identify all the positive solutions. In addition, we address the issue of selecting an optimal strategy from the numerous equilibrium strategies available.
△ Less
Submitted 28 September, 2024;
originally announced September 2024.
-
On the asymptotics of real solutions for the Painlevé I equation
Authors:
Wen-Gao Long,
Jun Xia
Abstract:
In this paper, we revisit the asymptotic formulas of real Painlevé I transcendents as the independent variable tends to negative infinity, which were initially derived by Kapaev with the complex WKB method. Using the Riemann-Hilbert method, we improve the error estimates of the oscillatory type asymptotics and provide precise error estimates of the singular type asymptotics. We also establish the…
▽ More
In this paper, we revisit the asymptotic formulas of real Painlevé I transcendents as the independent variable tends to negative infinity, which were initially derived by Kapaev with the complex WKB method. Using the Riemann-Hilbert method, we improve the error estimates of the oscillatory type asymptotics and provide precise error estimates of the singular type asymptotics. We also establish the corresponding asymptotics for the associated Hamiltonians of real Painlevé I transcendents. In addition, two typos in the mentioned asymptotic behaviors in literature are corrected.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
A conservative, implicit solver for 0D-2V multi-species nonlinear Fokker-Planck collision equations
Authors:
Yanpeng Wang,
Jianyuan Xiao,
Yifeng Zheng,
Zhihui Zou,
Pengfei Zhang,
Ge Zhuang
Abstract:
In this study, we present an optimal implicit algorithm specifically designed to accurately solve the multi-species nonlinear 0D-2V axisymmetric Fokker-Planck-Rosenbluth (FPR) collision equation while preserving mass, momentum, and energy. Our approach relies on the utilization of nonlinear Shkarofsky's formula of FPR (FPRS) collision operator in the spherical-polar coordinate. The key innovation…
▽ More
In this study, we present an optimal implicit algorithm specifically designed to accurately solve the multi-species nonlinear 0D-2V axisymmetric Fokker-Planck-Rosenbluth (FPR) collision equation while preserving mass, momentum, and energy. Our approach relies on the utilization of nonlinear Shkarofsky's formula of FPR (FPRS) collision operator in the spherical-polar coordinate. The key innovation lies in the introduction of a new function named King, with the adoption of the Legendre polynomial expansion for the angular coordinate and King function expansion for the speed coordinate. The Legendre polynomial expansion will converge exponentially and the King method, a moment convergence algorithm, could ensure the conservation with high precision in discrete form. Additionally, post-step projection onto manifolds is employed to exactly enforce symmetries of the collision operators. Through solving several typical problems across various nonequilibrium configurations, we demonstrate the high accuracy and superior performance of the presented algorithm for weakly anisotropic plasmas.
△ Less
Submitted 4 December, 2024; v1 submitted 2 August, 2024;
originally announced August 2024.
-
Generalization Error Analysis of Deep Backward Dynamic Programming for Solving Nonlinear PDEs
Authors:
Du Ouyang,
Jichang Xiao,
Xiaoqun Wang
Abstract:
We explore the application of the quasi-Monte Carlo (QMC) method in deep backward dynamic programming (DBDP) (Hure et al. 2020) for numerically solving high-dimensional nonlinear partial differential equations (PDEs). Our study focuses on examining the generalization error as a component of the total error in the DBDP framework, discovering that the rate of convergence for the generalization error…
▽ More
We explore the application of the quasi-Monte Carlo (QMC) method in deep backward dynamic programming (DBDP) (Hure et al. 2020) for numerically solving high-dimensional nonlinear partial differential equations (PDEs). Our study focuses on examining the generalization error as a component of the total error in the DBDP framework, discovering that the rate of convergence for the generalization error is influenced by the choice of sampling methods. Specifically, for a given batch size $m$, the generalization error under QMC methods exhibits a convergence rate of $O(m^{-1+\varepsilon})$, where $\varepsilon$ can be made arbitrarily small. This rate is notably more favorable than that of the traditional Monte Carlo (MC) methods, which is $O(m^{-1/2+\varepsilon})$. Our theoretical analysis shows that the generalization error under QMC methods achieves a higher order of convergence than their MC counterparts. Numerical experiments demonstrate that QMC indeed surpasses MC in delivering solutions that are both more precise and stable.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
A novel design update framework for topology optimization with quantum annealing: Application to truss and continuum structures
Authors:
Naruethep Sukulthanasorn,
Junsen Xiao,
Koya Wagatsuma,
Reika Nomura,
Shuji Moriguchi,
Kenjiro Terada
Abstract:
This paper presents a novel design update strategy for topology optimization, as an iterative optimization. The key contribution lies in incorporating a design updater concept with quantum annealing, applicable to both truss and continuum structures. To align with density-based approaches in topology optimization, these updaters are formulated through a multiplicative relationship to represent the…
▽ More
This paper presents a novel design update strategy for topology optimization, as an iterative optimization. The key contribution lies in incorporating a design updater concept with quantum annealing, applicable to both truss and continuum structures. To align with density-based approaches in topology optimization, these updaters are formulated through a multiplicative relationship to represent the design material and serve as design variables. Specifically, structural analysis is conducted on a classical computer using the finite element method, while quantum annealing is utilized for topology updates. The primary objective of the framework is to minimize compliance under a volume constraint. An encoding formulation for the design variables is derived, and the penalty method along with a slack variable is employed to transform the inequality volume constraint. Subsequently, the optimization problem for determining the updater is formulated as a Quadratic Unconstrained Binary Optimization (QUBO) model. To demonstrate its performance, the developed design framework is tested on different computing platforms to perform design optimization for truss structures, as well as 2D and 3D continuum structures. Numerical results indicate that the proposed framework successfully finds optimal topologies similar to benchmark results. Furthermore, the results show the advantage of reduced time in finding an optimal design using quantum annealing compared to simulated annealing.
△ Less
Submitted 22 January, 2025; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Examples of non-scattering inhomogeneities
Authors:
Lucas Chesnel,
Houssem Haddar,
Hongjie Li,
Jingni Xiao
Abstract:
We consider the scattering of waves by a penetrable inclusion embedded in some reference medium. We exhibit examples of materials and geometries for which non-scattering frequencies exist, i.e., for which at some frequencies there are incident fields which produce null scattered fields outside of the inhomogeneity. We show in particular that certain domains with corners or even cusps can support n…
▽ More
We consider the scattering of waves by a penetrable inclusion embedded in some reference medium. We exhibit examples of materials and geometries for which non-scattering frequencies exist, i.e., for which at some frequencies there are incident fields which produce null scattered fields outside of the inhomogeneity. We show in particular that certain domains with corners or even cusps can support non-scattering frequencies. We relate the latter, for some inclusions, to resonance frequencies for Dirichlet or Neumann cavities. We also find situations where incident non-scattering fields solve the Helmholtz equation in a neighborhood of the inhomogeneity and not in the whole space. Finally, in relation with invisibility, we give examples of inclusions of anisotropic materials which are non-scattering for all real frequencies. We prove that corresponding material indices must have a special structure on the boundary.
△ Less
Submitted 18 December, 2024; v1 submitted 25 June, 2024;
originally announced June 2024.
-
Roots and Logarithms of Multipliers
Authors:
Jingbo Xia,
Congquan Yan,
Danjun Zhao,
Jingming Zhu
Abstract:
By now it is a well-known fact that if $f$ is a multiplier for the Drury-Arveson space $H^2_n$, and if there is a $c>0$ such that $|f(z)|\geq c$ for every $z\in B$, then the reciprocal function 1/f is also a multiplier for $H^2_n$. We show that for such an $f$ and for every $t\in \mathbb{R}$, $f^t$ is also a multiplier for $H^2_n$. We do so by deriving a differentiation formula for $R^m(f^th)$.Mor…
▽ More
By now it is a well-known fact that if $f$ is a multiplier for the Drury-Arveson space $H^2_n$, and if there is a $c>0$ such that $|f(z)|\geq c$ for every $z\in B$, then the reciprocal function 1/f is also a multiplier for $H^2_n$. We show that for such an $f$ and for every $t\in \mathbb{R}$, $f^t$ is also a multiplier for $H^2_n$. We do so by deriving a differentiation formula for $R^m(f^th)$.Moreover, by this formula the same result holds for spaces $H_{m,s}$ of the Besov-Dirichlet type. The same technique also gives us the result that for a non-vanishing multiplier $f$ of $H^2_n$, $log f$ is a multiplier of $H^2_n$ if and only if log $f$ is bounded on $B$.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
A Structure-Guided Gauss-Newton Method for Shallow ReLU Neural Network
Authors:
Zhiqiang Cai,
Tong Ding,
Min Liu,
Xinyu Liu,
Jianlin Xia
Abstract:
In this paper, we propose a structure-guided Gauss-Newton (SgGN) method for solving least squares problems using a shallow ReLU neural network. The method effectively takes advantage of both the least squares structure and the neural network structure of the objective function. By categorizing the weights and biases of the hidden and output layers of the network as nonlinear and linear parameters,…
▽ More
In this paper, we propose a structure-guided Gauss-Newton (SgGN) method for solving least squares problems using a shallow ReLU neural network. The method effectively takes advantage of both the least squares structure and the neural network structure of the objective function. By categorizing the weights and biases of the hidden and output layers of the network as nonlinear and linear parameters, respectively, the method iterates back and forth between the nonlinear and linear parameters. The nonlinear parameters are updated by a damped Gauss-Newton method and the linear ones are updated by a linear solver. Moreover, at the Gauss-Newton step, a special form of the Gauss-Newton matrix is derived for the shallow ReLU neural network and is used for efficient iterations. It is shown that the corresponding mass and Gauss-Newton matrices in the respective linear and nonlinear steps are symmetric and positive definite under reasonable assumptions. Thus, the SgGN method naturally produces an effective search direction without the need of additional techniques like shifting in the Levenberg-Marquardt method to achieve invertibility of the Gauss-Newton matrix. The convergence and accuracy of the method are demonstrated numerically for several challenging function approximation problems, especially those with discontinuities or sharp transition layers that pose significant challenges for commonly used training algorithms in machine learning.
△ Less
Submitted 7 April, 2024;
originally announced April 2024.
-
A self-similar set with non-locally connected components
Authors:
Jian-Ci Xiao
Abstract:
Luo, Rao and Xiong [Topol. Appl. 322 (2022), 108271] conjectured that if a planar self-similar iterated function system with the open set condition does not involve rotations or reflections, then every connected component of the attractor is locally connected. We create a homogeneous counterexample of Lalley-Gatzouras type, which disproves this conjecture.
Luo, Rao and Xiong [Topol. Appl. 322 (2022), 108271] conjectured that if a planar self-similar iterated function system with the open set condition does not involve rotations or reflections, then every connected component of the attractor is locally connected. We create a homogeneous counterexample of Lalley-Gatzouras type, which disproves this conjecture.
△ Less
Submitted 29 March, 2024;
originally announced March 2024.
-
Some study of the approximate orthogonality connected with integral orthogonalities
Authors:
Ranran Wang,
Qi Liu,
Jinyu Xia,
Yongmo Hu
Abstract:
In this paper, we investigate a novel form of approximate orthogonality that is based on integral orthogonality. Additionally, we establish the fundamental properties of this new approximate orthogonality and examine its capability to preserve mappings of orthogonality. Moreover, we explore the relationship between this new approximate orthogonality and other forms of approximate orthogonality.
In this paper, we investigate a novel form of approximate orthogonality that is based on integral orthogonality. Additionally, we establish the fundamental properties of this new approximate orthogonality and examine its capability to preserve mappings of orthogonality. Moreover, we explore the relationship between this new approximate orthogonality and other forms of approximate orthogonality.
△ Less
Submitted 16 March, 2024;
originally announced March 2024.
-
Tame quivers and affine bases II: nonsimply-laced cases
Authors:
Jie Xiao,
Han Xu
Abstract:
In [Tame_quivers_and_affine_bases_I], we give a Ringel-Hall algebra approach to the canonical bases in the symmetric affine cases. In this paper, we extend the results to general symmetrizable affine cases by using Ringel-Hall algebras of representations of a valued quiver. We obtain a bar-invariant basis $\mathbf{B}'=\{C(\mathbf{c},t_λ)|(\mathbf{c},t_λ)\in\mathcal{G}^a\}$ in the generic compositi…
▽ More
In [Tame_quivers_and_affine_bases_I], we give a Ringel-Hall algebra approach to the canonical bases in the symmetric affine cases. In this paper, we extend the results to general symmetrizable affine cases by using Ringel-Hall algebras of representations of a valued quiver. We obtain a bar-invariant basis $\mathbf{B}'=\{C(\mathbf{c},t_λ)|(\mathbf{c},t_λ)\in\mathcal{G}^a\}$ in the generic composition algebra $\mathcal{C}^*$ and prove that $\mathcal{B}'=\mathbf{B}'\sqcup(-\mathbf{B}')$ coincides with Lusztig's signed canonical basis $\mathcal{B}$. Moreover, in type $\tilde{B}_n,\tilde{C}_n$, $\mathbf{B}'$ is the canonical basis $\mathbf{B}$.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Fock space: A bridge between Fredholm index and the quantum Hall effect
Authors:
Guo Chuan Thiang,
Jingbo Xia
Abstract:
We compute the quantized Hall conductance at various Landau levels by using the classic trace. The computations reduce to the single elementary one for the lowest Landau level. By using the theories of Helton-Howe-Carey-Pincus, and Toeplitz operators on the classic Fock space and higher Fock spaces, the Hall conductance is naturally identified with a Fredholm index. This brings new mathematical in…
▽ More
We compute the quantized Hall conductance at various Landau levels by using the classic trace. The computations reduce to the single elementary one for the lowest Landau level. By using the theories of Helton-Howe-Carey-Pincus, and Toeplitz operators on the classic Fock space and higher Fock spaces, the Hall conductance is naturally identified with a Fredholm index. This brings new mathematical insights to the extraordinary precision of quantization observed in quantum Hall measurements.
△ Less
Submitted 20 June, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
Equilibrium stochastic control with implicitly defined objective functions
Authors:
Zongxia Liang,
Jianming Xia,
Keyu Zhang
Abstract:
This paper considers a class of stochastic control problems with implicitly defined objective functions, which are the sources of time-inconsistency. We study the closed-loop equilibrium solutions in a general controlled diffusion framework. First, we provide a sufficient and necessary condition for a strategy to be an equilibrium. Then, we apply the result to discuss two problems of dynamic portf…
▽ More
This paper considers a class of stochastic control problems with implicitly defined objective functions, which are the sources of time-inconsistency. We study the closed-loop equilibrium solutions in a general controlled diffusion framework. First, we provide a sufficient and necessary condition for a strategy to be an equilibrium. Then, we apply the result to discuss two problems of dynamic portfolio selection for a class of betweenness preferences, allowing for closed convex constraints on portfolio weights and borrowing cost, respectively. The equilibrium portfolio strategies are explicitly characterized in terms of the solutions of some first-order ordinary differential equations for the case of deterministic market coefficients.
△ Less
Submitted 26 December, 2023; v1 submitted 23 December, 2023;
originally announced December 2023.
-
Deep Learning Based on Randomized Quasi-Monte Carlo Method for Solving Linear Kolmogorov Partial Differential Equation
Authors:
Jichang Xiao,
Fengjiang Fu,
Xiaoqun Wang
Abstract:
Deep learning algorithms have been widely used to solve linear Kolmogorov partial differential equations~(PDEs) in high dimensions, where the loss function is defined as a mathematical expectation. We propose to use the randomized quasi-Monte Carlo (RQMC) method instead of the Monte Carlo (MC) method for computing the loss function. In theory, we decompose the error from empirical risk minimizatio…
▽ More
Deep learning algorithms have been widely used to solve linear Kolmogorov partial differential equations~(PDEs) in high dimensions, where the loss function is defined as a mathematical expectation. We propose to use the randomized quasi-Monte Carlo (RQMC) method instead of the Monte Carlo (MC) method for computing the loss function. In theory, we decompose the error from empirical risk minimization~(ERM) into the generalization error and the approximation error. Notably, the approximation error is independent of the sampling methods. We prove that the convergence order of the mean generalization error for the RQMC method is $O(n^{-1+ε})$ for arbitrarily small $ε>0$, while for the MC method it is $O(n^{-1/2+ε})$ for arbitrarily small $ε>0$. Consequently, we find that the overall error for the RQMC method is asymptotically smaller than that for the MC method as $n$ increases. Our numerical experiments show that the algorithm based on the RQMC method consistently achieves smaller relative $L^{2}$ error than that based on the MC method.
△ Less
Submitted 23 June, 2024; v1 submitted 27 October, 2023;
originally announced October 2023.
-
Error analysis for empirical risk minimization over clipped ReLU networks in solving linear Kolmogorov partial differential equations
Authors:
Jichang Xiao,
Xiaoqun Wang
Abstract:
Deep learning algorithms have been successfully applied to numerically solve linear Kolmogorov partial differential equations~(PDEs). A recent research shows that if the initial functions are bounded, the empirical risk minimization (ERM) over clipped ReLU networks generalizes well for solving the linear Kolmogorov PDE. In this paper, we propose to use a truncation technique to extend the generali…
▽ More
Deep learning algorithms have been successfully applied to numerically solve linear Kolmogorov partial differential equations~(PDEs). A recent research shows that if the initial functions are bounded, the empirical risk minimization (ERM) over clipped ReLU networks generalizes well for solving the linear Kolmogorov PDE. In this paper, we propose to use a truncation technique to extend the generalization results for polynomially growing initial functions. Specifically, we prove that under an assumption, the sample size required to achieve an generalization error within $\varepsilon$ with a confidence level $\varrho$ grows polynomially in the size of the clipped neural networks and $(\varepsilon^{-1},\varrho^{-1})$, which means that the curse of dimensionality is broken. Moreover, we verify that the required assumptions hold for Black-Scholes PDEs and heat equations which are two important cases of linear Kolmogorov PDEs. For the approximation error, under certain assumptions, we establish approximation results for clipped ReLU neural networks when approximating the solution of Kolmogorov PDEs. Consequently, we establish that the ERM over artificial neural networks indeed overcomes the curse of dimensionality for a larger class of linear Kolmogorov PDEs.
△ Less
Submitted 23 June, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
On a self-embedding problem of self-similar sets
Authors:
Jian-Ci Xiao
Abstract:
Let $K\subset\mathbb{R}^d$ be a self-similar set generated by an iterated function system $\{\varphi_i\}_{i=1}^m$ satisfying the strong separation condition and let $f$ be a contracting similitude with $f(K)\subset K$. We show that $f(K)$ is relative open in $K$ if all $\varphi_i$'s share a common contraction ratio and orthogonal part. We also provide a counterexample when the orthogonal parts are…
▽ More
Let $K\subset\mathbb{R}^d$ be a self-similar set generated by an iterated function system $\{\varphi_i\}_{i=1}^m$ satisfying the strong separation condition and let $f$ be a contracting similitude with $f(K)\subset K$. We show that $f(K)$ is relative open in $K$ if all $\varphi_i$'s share a common contraction ratio and orthogonal part. We also provide a counterexample when the orthogonal parts are allowed to vary. This partially answers a question in Elekes, Keleti and M{á}th{é} [Ergodic Theory Dynam. Systems 30 (2010)].
As a byproduct of our argument, when $d=1$ and $K$ admits two homogeneous generating iterated function systems satisfying the strong separation condition but with contraction parts of opposite signs, we show that $K$ is symmetric. This partially answers a question in Feng and Wang [Adv. Math. 222 (2009)].
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
Decentralized Gradient-Free Methods for Stochastic Non-Smooth Non-Convex Optimization
Authors:
Zhenwei Lin,
Jingfan Xia,
Qi Deng,
Luo Luo
Abstract:
We consider decentralized gradient-free optimization of minimizing Lipschitz continuous functions that satisfy neither smoothness nor convexity assumption. We propose two novel gradient-free algorithms, the Decentralized Gradient-Free Method (DGFM) and its variant, the Decentralized Gradient-Free Method$^+$ (DGFM$^{+}$). Based on the techniques of randomized smoothing and gradient tracking, DGFM r…
▽ More
We consider decentralized gradient-free optimization of minimizing Lipschitz continuous functions that satisfy neither smoothness nor convexity assumption. We propose two novel gradient-free algorithms, the Decentralized Gradient-Free Method (DGFM) and its variant, the Decentralized Gradient-Free Method$^+$ (DGFM$^{+}$). Based on the techniques of randomized smoothing and gradient tracking, DGFM requires the computation of the zeroth-order oracle of a single sample in each iteration, making it less demanding in terms of computational resources for individual computing nodes. Theoretically, DGFM achieves a complexity of $\mathcal O(d^{3/2}δ^{-1}\varepsilon ^{-4})$ for obtaining an $(δ,\varepsilon)$-Goldstein stationary point. DGFM$^{+}$, an advanced version of DGFM, incorporates variance reduction to further improve the convergence behavior. It samples a mini-batch at each iteration and periodically draws a larger batch of data, which improves the complexity to $\mathcal O(d^{3/2}δ^{-1} \varepsilon^{-3})$. Moreover, experimental results underscore the empirical advantages of our proposed algorithms when applied to real-world datasets.
△ Less
Submitted 28 January, 2025; v1 submitted 18 October, 2023;
originally announced October 2023.
-
Motivic cluster multiplication formulas in 2-Calabi-Yau categories
Authors:
Jie Xiao,
Fan Xu,
Fang Yang
Abstract:
We introduce a notion of motivic cluster characters via virtual Poincaré polynomials, and prove a motivic version of multiplication formulas obtained by Chen-Xiao-Xu for weighted quantum cluster characters associated to a 2-Calabi-Yau triangulated category $\mathcal{C}$ with a cluster tilting object. Furthermore, a refined form of this formula is also given. When $\mathcal{C}$ is the cluster categ…
▽ More
We introduce a notion of motivic cluster characters via virtual Poincaré polynomials, and prove a motivic version of multiplication formulas obtained by Chen-Xiao-Xu for weighted quantum cluster characters associated to a 2-Calabi-Yau triangulated category $\mathcal{C}$ with a cluster tilting object. Furthermore, a refined form of this formula is also given. When $\mathcal{C}$ is the cluster category of an acyclic quiver, our certain refined multiplication formula is a motivic version of the multiplication formula in [International Mathematics Research Notices, rnad172(2023)].
△ Less
Submitted 22 January, 2024; v1 submitted 7 October, 2023;
originally announced October 2023.
-
Deterministic stack-sorting for set partitions
Authors:
Janabel Xia
Abstract:
A sock sequence is a sequence of elements, which we will refer to as socks, from a finite alphabet. A sock sequence is sorted if all occurrences of a sock appear consecutively. We define equivalence classes of sock sequences called sock patterns, which are in bijection with set partitions. The notion of stack-sorting for set partitions was originally introduced by Defant and Kravitz. In this paper…
▽ More
A sock sequence is a sequence of elements, which we will refer to as socks, from a finite alphabet. A sock sequence is sorted if all occurrences of a sock appear consecutively. We define equivalence classes of sock sequences called sock patterns, which are in bijection with set partitions. The notion of stack-sorting for set partitions was originally introduced by Defant and Kravitz. In this paper, we define a new deterministic stack-sorting map $φ_σ$ for sock sequences that uses a $σ$-avoiding stack, where pattern containment need not be consecutive. When $σ= aba$, we show that our stack-sorting map sorts any sock sequence with $n$ distinct socks in at most $n$ iterations, and that this bound is tight for $n \geq 3$. We obtain a fine-grained enumeration of the number of sock patterns of length $n$ on $r$ distinct socks that are $1$-stack-sortable under $φ_{aba}$, and we also obtain asymptotics for the number of sock patterns of length $n$ that are $1$-stack-sortable under $φ_{aba}$. Finally, we show that for all unsorted sock patterns $σ\neq a\cdots a b a \cdots a$, the map $φ_σ$ cannot eventually sort all sock sequences on any multiset $M$ unless every sock sequence on $M$ is already sorted.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Numerical characterization of the hard Lefschetz classes of dimension two
Authors:
Jiajun Hu,
Jian Xiao
Abstract:
We study the numerical characterization of two dimensional hard Lefschetz classes given by the complete intersections of nef classes. In Shenfeld and van Handel's breakthrough work on the characterization of the extremals of the Alexandrov-Fenchel inequality for convex polytopes, they proposed an open question on the algebraic analogue of the characterization. By taking further inspiration from ou…
▽ More
We study the numerical characterization of two dimensional hard Lefschetz classes given by the complete intersections of nef classes. In Shenfeld and van Handel's breakthrough work on the characterization of the extremals of the Alexandrov-Fenchel inequality for convex polytopes, they proposed an open question on the algebraic analogue of the characterization. By taking further inspiration from our previous work with Shang on hard Lefschetz theorems for free line bundles, we formulate and refine the conjectural picture more precisely and settle the open question when the collection of nef classes is given by a rearrangement of supercriticality, which in particular includes the big nef collection as a special case. The main results enable us to refine some previous results and study the extremals of Hodge index inequality, and more importantly provide the first series of examples of hard Lefschetz classes of dimension two both in algebraic geometry and analytic geometry, in which one can allow nontrivial augmented base locus and thus drop the semi-ampleness or semi-positivity assumption. As a key ingredient of the numerical characterization, we establish a local Hodge index inequality for Lorentzian polynomials, which is the algebraic analogue of the local Alexandrov-Fenchel inequality obtained by Shenfeld-van Handel for convex polytopes. This result holds in broad contexts, e.g., it holds on a smooth projective variety, on a compact Kähler manifold and on a Lorentzian fan, which contains the Bergman fan of a matroid or polymatroid as a typical example.
△ Less
Submitted 10 September, 2023;
originally announced September 2023.
-
The inequalities of Chern classes and Riemann-Roch type inequalities
Authors:
Xing Lu,
Jian Xiao
Abstract:
Motivated by Kollár-Matsusaka's Riemann-Roch type inequalities, applying effective very ampleness of adjoint bundles on Fujita conjecture and log-concavity given by Khovanskii-Teissier inequalities, we show that for any partition $λ$ of the positive integer $d$ there exists a universal bivariate polynomial $Q_λ(x, y)$ which has deg $Q \leq d$ and whose coefficients depend only on $n$, such that fo…
▽ More
Motivated by Kollár-Matsusaka's Riemann-Roch type inequalities, applying effective very ampleness of adjoint bundles on Fujita conjecture and log-concavity given by Khovanskii-Teissier inequalities, we show that for any partition $λ$ of the positive integer $d$ there exists a universal bivariate polynomial $Q_λ(x, y)$ which has deg $Q \leq d$ and whose coefficients depend only on $n$, such that for any projective manifold $X$ of dimension $n$ and any ample line bundle $L$ on $X$, \begin{equation*}
\left|c_λ(X)\cdot L^{n -d}\right|\leq
\frac{Q_λ(L^{n}, K_X \cdot L^{n -1} )}{(L^{n})^{d-1}}, \end{equation*} where $K_X$ is the canonical bundle of $X$ and $c_λ(X)$ is the monomial Chern class given by the partition $λ$. As a special case, when $K_X$ or $-K_X$ is ample, this implies that there exists a constant $c_n$ depending only on $n$ such that for any monomial Chern classes of top degree, the Chern number ratios \begin{equation*} \left|\frac{c_λ(X)}{c_1 (X) ^{n}}\right|\leq c_n, \end{equation*} which recovers a recent result of Du-Sun. The main result also yields an asymptotic version of the sharper Riemann-Roch type inequality. Furthermore, using similar method we also obtain inequalities for Chern classes of the logarithmic tangent bundle.
△ Less
Submitted 28 October, 2024; v1 submitted 23 August, 2023;
originally announced August 2023.
-
Lusztig sheaves and integrable highest weight modules
Authors:
Jiepeng Fang,
Yixin Lan,
Jie Xiao
Abstract:
We consider the localization $\mathcal{Q}_{\mathbf{V},\mathbf{W}}/\mathcal{N}_{\mathbf{V}}$ of Lusztig's sheaves for framed quivers, and define functors $E^{(n)}_{i},F^{(n)}_{i},K^{\pm}_{i},n\in \mathbb{N},i \in I$ between the localizations. With these functors, the Grothendieck group of localizations realizes the irreducible integrable highest weight modules $L(Λ)$ of quantum groups. Moreover, th…
▽ More
We consider the localization $\mathcal{Q}_{\mathbf{V},\mathbf{W}}/\mathcal{N}_{\mathbf{V}}$ of Lusztig's sheaves for framed quivers, and define functors $E^{(n)}_{i},F^{(n)}_{i},K^{\pm}_{i},n\in \mathbb{N},i \in I$ between the localizations. With these functors, the Grothendieck group of localizations realizes the irreducible integrable highest weight modules $L(Λ)$ of quantum groups. Moreover, the nonzero simple perverse sheaves in localizations form the canonical bases of $L(Λ)$. We also compare our realization (at $v \rightarrow 1$) with Nakajima's realization via quiver varieties and prove that the transition matrix between canonical bases and fundamental classes is upper triangular with diagonal entries all equal to $\pm 1$.
△ Less
Submitted 16 March, 2025; v1 submitted 30 July, 2023;
originally announced July 2023.
-
Making the Nyström method highly accurate for low-rank approximations
Authors:
Jianlin Xia
Abstract:
The Nyström method is a convenient heuristic method to obtain low-rank approximations to kernel matrices in nearly linear complexity. Existing studies typically use the method to approximate positive semidefinite matrices with low or modest accuracies. In this work, we propose a series of heuristic strategies to make the Nyström method reach high accuracies for nonsymmetric and/or rectangular matr…
▽ More
The Nyström method is a convenient heuristic method to obtain low-rank approximations to kernel matrices in nearly linear complexity. Existing studies typically use the method to approximate positive semidefinite matrices with low or modest accuracies. In this work, we propose a series of heuristic strategies to make the Nyström method reach high accuracies for nonsymmetric and/or rectangular matrices. The resulting methods (called high-accuracy Nyström methods) treat the Nyström method and a skinny rank-revealing factorization as a fast pivoting strategy in a progressive alternating direction refinement process. Two refinement mechanisms are used: alternating the row and column pivoting starting from a small set of randomly chosen columns, and adaptively increasing the number of samples until a desired rank or accuracy is reached. A fast subset update strategy based on the progressive sampling of Schur complements is further proposed to accelerate the refinement process. Efficient randomized accuracy control is also provided. Relevant accuracy and singular value analysis is given to support some of the heuristics. Extensive tests with various kernel functions and data sets show how the methods can quickly reach prespecified high accuracies in practice, sometimes with quality close to SVDs, using only small numbers of progressive sampling steps.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Conformal invariance of random currents: a stability result
Authors:
Hong-Bin Chen,
Jiaming Xia
Abstract:
We show the convergence of the single sourceless critical random current to a limit identifiable with the nested CLE(3). Our approach is based on viewing the random current as a perturbation of the Ising interface, which is known to converge to CLE(3). Instead of focusing solely on the random current, we provide a general framework for the stability of scaling limits under the perturbation by supe…
▽ More
We show the convergence of the single sourceless critical random current to a limit identifiable with the nested CLE(3). Our approach is based on viewing the random current as a perturbation of the Ising interface, which is known to converge to CLE(3). Instead of focusing solely on the random current, we provide a general framework for the stability of scaling limits under the perturbation by superimposing an independent Bernoulli percolation.
△ Less
Submitted 18 June, 2023;
originally announced June 2023.