-
The inverse $Z$-polynomial of a matroid
Authors:
Alice L. L. Gao,
Xuan Ruan,
Matthew H. Y. Xie
Abstract:
Motivated by the $Z$-polynomials of matroids, Ferroni, Matherne, Stevens, and Vecchi introduced the inverse $Z$-polynomial of a matroid. In this paper, we prove several fundamental properties of the inverse $Z$-polynomial, including non-negativity and multiplicativity, and show that it is a valuative invariant. We also provide explicit formulas for the inverse $Z$-polynomials of uniform matroids and a broader class of matroids, namely sparse paving matroids, which include uniform matroids as a special case. Furthermore, we establish the unimodality and log-concavity of these polynomials in the case of sparse paving matroids. Based on the properties of the $Z$-polynomial, we conjecture that the coefficients of the inverse $Z$-polynomial are unimodal and log-concave.
Submitted 1 July, 2025;
originally announced July 2025.
-
Computing the Bogoliubov-de Gennes excitations of two-component Bose-Einstein condensates
Authors:
Manting Xie,
Yong Zhang
Abstract:
In this paper, we present an efficient and spectrally accurate numerical method to compute elementary/collective excitations in two-component Bose-Einstein condensates (BEC), around their mean-field ground state, by solving the associated Bogoliubov-de Gennes (BdG) equation. The BdG equation is essentially an eigenvalue problem for a non-Hermitian differential operator with an eigenfunction normalization constraint. Firstly, we investigate its analytical properties, including the exact eigenpairs, generalized nullspace structure and bi-orthogonality of eigenspaces. Subsequently, by combining the Fourier spectral method for spatial discretization and a stable modified Gram-Schmidt bi-orthogonal algorithm, we propose a structure-preserving iterative method for the resulting large-scale dense non-Hermitian discrete eigenvalue problem. Our method is matrix-free, and the matrix-vector multiplication (or the operator-function evaluation) is implemented with a near-optimal complexity ${\mathcal O}(N_{\rm t}\log(N_{\rm t}))$, where $N_{\rm t}$ is the total number of grid points, thanks to the utilization of the discrete Fast Fourier Transform (FFT). Therefore, it is memory-friendly, spectrally accurate, and highly efficient. Finally, we carry out a comprehensive numerical investigation to showcase its superiority in terms of accuracy and efficiency, alongside some applications to compute the excitation spectrum and Bogoliubov amplitudes in one, two, and three-dimensional problems.
Submitted 14 June, 2025;
originally announced June 2025.
-
An efficient Fourier spectral algorithm for the Bogoliubov-de Gennes excitation eigenvalue problem
Authors:
Yu Li,
Zhixuan Li,
Manting Xie,
Yong Zhang
Abstract:
In this paper, we propose an efficient Fourier spectral algorithm for an eigenvalue problem, that is, the Bogoliubov-de Gennes (BdG) equation arising from spin-1 Bose-Einstein condensates (BEC) to describe the elementary/collective excitations around the mean-field ground state. The BdG equation is essentially a constrained eigenvalue/eigenfunction system. Firstly, we investigate its analytical properties, including exact eigenpairs, generalized nullspace, and bi-orthogonality of eigenspaces. Secondly, by combining the standard Fourier spectral method for spatial discretization and a stable Gram-Schmidt bi-orthogonal algorithm, we develop a subspace iterative solver for such a large-scale dense eigenvalue problem, and it proves to be numerically stable, efficient, and accurate. Our solver is matrix-free and the operator-function evaluation is accelerated by discrete Fast Fourier Transform (FFT) with almost optimal efficiency. Therefore, it is memory-friendly and efficient for large-scale problems. Furthermore, we give a rigorous and detailed numerical analysis on the stability and spectral convergence. Finally, we present extensive numerical results to illustrate the spectral accuracy and efficiency, and investigate the excitation spectrum and Bogoliubov amplitudes around the ground state in 1-3 spatial dimensions.
Submitted 9 June, 2025;
originally announced June 2025.
-
Parabolic scaling of a stochastic wave map with co-normal noise: limit and fluctuations
Authors:
Sandra Cerrai,
Mengzi Xie
Abstract:
This paper investigates the parabolic scaling limit of a damped stochastic wave map from the real line into the two-dimensional sphere, perturbed by multiplicative Gaussian noise of co-normal type. We prove that under this rescaling, the solutions converge to those of the deterministic heat flow for harmonic maps, revealing a transition from stochastic hyperbolic to deterministic parabolic dynamics. We further analyze the fluctuations around this limit, proving a weak central limit theorem and identifying the limiting process as the solution to a linear stochastic partial differential equation. The study combines tools from geometric analysis, stochastic calculus, and functional analysis, offering insights into the interplay between geometry, noise, and scaling in nonlinear stochastic systems.
Submitted 6 June, 2025;
originally announced June 2025.
-
Log-concavity of inverse Kazhdan-Lusztig polynomials of paving matroids
Authors:
Matthew H. Y. Xie,
Philip B. Zhang
Abstract:
Gao and Xie (2021) conjectured that the inverse Kazhdan-Lusztig polynomial of any matroid is log-concave. Although the inverse Kazhdan-Lusztig polynomial may not always have only real roots, we conjecture that the Hadamard product of an inverse Kazhdan-Lusztig polynomial of degree $n$ and $(1+t)^n$ has only real roots. Using interlacing polynomials and multiplier sequences, we confirm this conjecture for paving matroids. This result allows us to confirm the log-concavity conjecture for these matroids by applying Newton's inequalities.
Submitted 24 April, 2025;
originally announced April 2025.
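The step from real-rootedness to log-concavity in the abstract above can be illustrated numerically. By Newton's inequalities, if $\sum_k \binom{n}{k} a_k t^k$ has only real roots, then $a_k^2 \ge a_{k-1} a_{k+1}$ for all interior $k$, which is exactly log-concavity of the sequence $(a_k)$. The following is a minimal sketch, not taken from the paper, of how one might check log-concavity and unimodality of a given coefficient list; the function names `is_log_concave` and `is_unimodal` are hypothetical.

```python
def is_log_concave(coeffs):
    """Return True if a_k^2 >= a_{k-1} * a_{k+1} for every interior index k."""
    return all(
        coeffs[k] ** 2 >= coeffs[k - 1] * coeffs[k + 1]
        for k in range(1, len(coeffs) - 1)
    )


def is_unimodal(coeffs):
    """Return True if the sequence weakly increases and then weakly decreases."""
    falling = False
    for prev, cur in zip(coeffs, coeffs[1:]):
        if cur < prev:
            falling = True          # the descent has started
        elif cur > prev and falling:
            return False            # a rise after a descent breaks unimodality
    return True


# Coefficients of (1 + t)^4, a standard log-concave and unimodal example.
binomial_row = [1, 4, 6, 4, 1]
print(is_log_concave(binomial_row), is_unimodal(binomial_row))
```

The check uses exact integer arithmetic, so it is suitable for polynomial coefficients that are integers, as matroid invariants typically are.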
-
Global Solutions for 5D Quadratic Fourth-Order Schrödinger Equations
Authors:
Ebru Toprak,
Mengyi Xie
Abstract:
We prove small data scattering for the fourth-order Schrödinger equation with quadratic nonlinearity \begin{equation*}
i\partial_t u+Δ^2 u+αu^2 + β\bar{u}^2=0\qquad\text{in }\mathbb{R}^5 \end{equation*} for $α, β\in \mathbb{R}$. We extend the space-time resonance method, originally introduced by Germain, Masmoudi, and Shatah, to the setting involving the bilaplacian. We show that under a smallness condition on the initial data measured in a suitable norm, the solution satisfies $\|u\|_{L^{\infty}_x }\lesssim t^{-\frac{5}{4}} $ and scatters to the solution to the free equation. Although our work builds upon an established method, the fourth-order nature of the equation presents substantial challenges, requiring different techniques to overcome them.
Submitted 22 April, 2025;
originally announced April 2025.
-
Nishida-Smoller type large solutions for the compressible Navier-Stokes equations with slip boundary conditions in 3D exterior domains
Authors:
Minghong Xie,
Saiguo Xu,
Yinghui Zhang
Abstract:
This paper investigates the global existence of classical solutions to the isentropic compressible Navier-Stokes equations with slip boundary condition in a three-dimensional (3D) exterior domain. It is shown that the classical solutions with large initial energy and vacuum exist globally in time when the adiabatic exponent $γ>1$ is sufficiently close to 1 (near-isothermal regime). This constitutes an extension of the celebrated result for the one-dimensional Cauchy problem of the isentropic Euler equations that was established in 1973 by Nishida and Smoller (Comm. Pure Appl. Math. 26 (1973), 183-200). To the best of our knowledge, we establish the first result on the global existence of large-energy solutions with vacuum to the compressible Navier-Stokes equations with slip boundary condition in a 3D exterior domain, which improves previous related works where either small initial energy is required or boundary effects are ignored.
Submitted 1 April, 2025;
originally announced April 2025.
-
Incompressible Limit of Strong Solutions to the Diffuse Interface Model for Two-phase Flows
Authors:
Yinghua Li,
Manrou Xie
Abstract:
This paper is concerned with the incompressible limit problem for strong solutions of compressible two-phase flow models under periodic boundary conditions, where the Navier-Stokes equations are nonlinearly coupled with either Cahn-Hilliard equations or Allen-Cahn equations. The viscosity coefficients are allowed to depend both on the density and the phase field variable. We establish rigorous convergence of both local and global strong solutions of compressible systems to their incompressible systems as the Mach number tends to zero. This theoretical framework establishes an essential linkage between compressible and incompressible phase field models, demonstrating that both formulations exhibit consistent physical fidelity in capturing interfacial flow dynamics. Furthermore, we provide some convergence rate estimates of the solutions.
Submitted 2 March, 2025;
originally announced March 2025.
-
A universal preprocessing algorithm of average kernel method with Gauss-Laguerre quadrature for double integrals
Authors:
Kejun Pan,
Mingliang Xie
Abstract:
To address the computational challenges posed by nonlinear collision kernels in the Smoluchowski equation, this study proposes a universal preprocessing algorithm for the average kernel method based on the Gauss-Laguerre quadrature for double integrals. With this algorithm, the numerical code accurately and efficiently determines the pre-exponential factor of the average kernel. Additionally, the exact pre-exponential factors of the four fundamental average kernels and their associated truncation error estimations were analyzed. The results demonstrate the reasonability and reliability of the preprocessing algorithm.
Submitted 18 February, 2025;
originally announced February 2025.
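As a rough illustration of the quadrature named in the abstract above, the sketch below applies a tensor-product Gauss-Laguerre rule to a double integral of the form $\int_0^\infty\!\int_0^\infty f(x,y)\,e^{-x-y}\,dx\,dy$. It is not the authors' preprocessing algorithm and does not involve any collision kernel; the function name `gauss_laguerre_double`, the node count, and the test integrand are assumptions made for illustration. The only library call used is NumPy's `numpy.polynomial.laguerre.laggauss`.

```python
import numpy as np


def gauss_laguerre_double(f, n=16):
    """Approximate the double integral of f(x, y) * exp(-x - y) over [0, inf)^2.

    Uses an n-by-n tensor product of one-dimensional Gauss-Laguerre rules.
    `f` must accept NumPy arrays and evaluate elementwise.
    """
    nodes, weights = np.polynomial.laguerre.laggauss(n)
    # Build the two-dimensional grid of nodes and the matching weight products.
    X, Y = np.meshgrid(nodes, nodes, indexing="ij")
    W = np.outer(weights, weights)
    return float(np.sum(W * f(X, Y)))


# Hypothetical test integrand: f(x, y) = x + y, whose weighted integral is 2.
approx = gauss_laguerre_double(lambda x, y: x + y)
print(approx)
```

An n-point Gauss-Laguerre rule is exact for polynomial integrands of degree up to 2n - 1 in each variable, so the polynomial test case is reproduced to rounding error; non-polynomial integrands would generally require a convergence check in n.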
-
Weak Serrin-type blowup criterion for the 3D full compressible Navier-Stokes equations
Authors:
Minghong Xie,
Saiguo Xu,
Yinghui Zhang
Abstract:
We investigate a weak Serrin-type blowup criterion for the three-dimensional full compressible Navier-Stokes equations for the Cauchy problem, the Dirichlet problem, and the Navier-slip boundary condition. It is shown that the strong or smooth solution exists globally if the density is bounded from above, and either the absolute temperature or velocity satisfies the weak Serrin's condition. Therefore, if the weak Serrin norm of the absolute temperature or the velocity remains bounded, it is not possible for other kinds of singularities (such as vacuum states vanishing, vacuum appearing in the non-vacuum region, or even milder singularities) to form before the density becomes unbounded. In particular, this criterion extends the Serrin-type blowup criteria in (Math. Ann. 390 (2024): 1201-1248; Arch. Ration. Mech. Anal. 207(2013): 303-316). Furthermore, as a by-product, for the isentropic compressible Navier-Stokes equations, we succeed in removing the technical assumption $ρ_0\in L^1$ in (J. Lond. Math. Soc. (2) 102(2020): 125--142). The initial data can be arbitrarily large and are allowed to contain vacuum states.
Submitted 19 December, 2024; v1 submitted 7 December, 2024;
originally announced December 2024.
-
On z-Superstable and Critical Configurations of Chip Firing Pairs
Authors:
Zach Benton,
Jane Kwak,
SuHo Oh,
Mateo Torres,
Mckinley Xie
Abstract:
It is well known that there is a duality map between the superstable configurations and the critical configurations of a graph. This was extended to all M-matrices in (Guzmàn-Klivans 2015). We show a natural way to extend this to all $(L,M)$-chip firing pairs introduced in (Guzmàn-Klivans 2016). In addition, we study various properties of this map.
Submitted 13 June, 2025; v1 submitted 3 December, 2024;
originally announced December 2024.
-
The small-mass limit for some constrained wave equations with nonlinear conservative noise
Authors:
Sandra Cerrai,
Mengzi Xie
Abstract:
We study the small-mass limit, also known as the Smoluchowski-Kramers diffusion approximation (see \cite{kra} and \cite{smolu}), for a system of stochastic damped wave equations, whose solution is constrained to live in the unit sphere of the space of square-integrable functions on the interval $(0,L)$. The stochastic perturbation is given by a nonlinear multiplicative Gaussian noise, where the stochastic differential is understood in the Stratonovich sense. Due to its particular structure, such noise not only conserves $\mathbb{P}$-a.s. the constraint, but also preserves a suitable energy functional. In the limit, we derive a deterministic system that remains confined to the unit sphere of $L^2$, but includes additional terms. These terms depend on the reproducing kernel of the noise and account for the interaction between the constraint and the particular conservative noise we choose.
Submitted 12 September, 2024;
originally announced September 2024.
-
Multigrid method for nonlinear eigenvalue problems based on Newton iteration
Authors:
Fei Xu,
Manting Xie,
Meiling Yue
Abstract:
In this paper, a novel multigrid method based on Newton iteration is proposed to solve nonlinear eigenvalue problems. Instead of handling the eigenvalue $λ$ and eigenfunction $u$ separately, we treat the eigenpair $(λ, u)$ as one element in a product space $\mathbb R \times H_0^1(Ω)$. Then in the presented multigrid method, only one discrete linear boundary value problem needs to be solved for each level of the multigrid sequence. Because we avoid solving large-scale nonlinear eigenvalue problems directly, the overall efficiency is significantly improved. The optimal error estimate and linear computational complexity can be derived simultaneously. In addition, we also provide an improved multigrid method coupled with a mixing scheme to further guarantee the convergence and stability of the iteration scheme. More importantly, we prove convergence for the residuals after each iteration step. For nonlinear eigenvalue problems, such theoretical analysis is missing from the existing literature on the mixing iteration scheme.
Submitted 29 April, 2024;
originally announced April 2024.
-
Repro Samples Method for a Performance Guaranteed Inference in General and Irregular Inference Problems
Authors:
Minge Xie,
Peng Wang
Abstract:
Rapid advancements in data science require us to have fundamentally new frameworks to tackle prevalent but highly non-trivial "irregular" inference problems, to which the large sample central limit theorem does not apply. Typical examples are those involving discrete or non-numerical parameters and those involving non-numerical data, etc. In this article, we present an innovative, wide-reaching, and effective approach, called "repro samples method," to conduct statistical inference for these irregular problems plus more. The development relates to but improves several existing simulation-inspired inference approaches, and we provide both exact and approximate theories to support our development. Moreover, the proposed approach is broadly applicable and subsumes the classical Neyman-Pearson framework as a special case. For the often-seen irregular inference problems that involve both discrete/non-numerical and continuous parameters, we propose an effective three-step procedure to make inferences for all parameters. We also develop a unique matching scheme that turns the discreteness of discrete/non-numerical parameters from an obstacle for forming inferential theories into a beneficial attribute for improving computational efficiency. We demonstrate the effectiveness of the proposed general methodology using various examples, including a case study example on a Gaussian mixture model with unknown number of components. This case study example provides a solution to a long-standing open inference question in statistics on how to quantify the estimation uncertainty for the unknown number of components and other associated parameters. Real data and simulation studies, with comparisons to existing approaches, demonstrate the far superior performance of the proposed method.
Submitted 22 February, 2024;
originally announced February 2024.
-
A Stabilised Semi-Implicit Double-Point Material Point Method for Soil-Water Coupled Problems
Authors:
Mian Xie,
Pedro Navas,
Susana Lopez-Querol
Abstract:
A semi-implicit two-phase double-point Material Point Method (MPM) formulation, based on the incremental fractional-step method, has been derived to model large deformation geotechnical problems. The semi-implicit formulation has two advantages compared with the explicit approach: the time step is independent of the water phase, and the pore pressure field is more stable. The semi-implicit MPM models based on the incremental fractional-step method available in the literature consist of modelling the soil and water mixture using a single set of material points only, in order to save computational time. In this study, we further derive this formulation with two sets of material points to represent the soil and water phases separately. The stress oscillations that are frequently found in the water and soil phases are stabilised with this approach. A new stabilisation method is developed based on the modified F-bar method. The proposed method is validated with two numerical examples under small and large deformations, respectively. After that, the Nor-Sand constitutive soil model is used to simulate landslides. Numerical examples show an excellent performance of the proposed coupled MPM and the stabilisation method. The formulation with two sets of material points yields significantly different but more reliable results in the landslides analysis, compared with the single-point approach. Additionally, this research shows that the additional computational cost caused by the additional water material points is acceptable. Therefore, it is recommended to use two sets of material points for certain large deformation geotechnical problems.
Submitted 28 February, 2025; v1 submitted 22 January, 2024;
originally announced January 2024.
-
Well-posedness and invariant measure for quasilinear parabolic SPDE on a bounded domain
Authors:
Mengzi Xie
Abstract:
We study quasilinear parabolic stochastic partial differential equations with general multiplicative noise on a bounded domain in $\mathbb{R}^{d}$, with homogeneous Dirichlet boundary condition. We establish the existence and uniqueness of solutions in an $L^{1}$ setting, and we prove a comparison result and an $L^{1}$-contraction property for the solutions. In addition, we show the existence of an invariant measure in the case of non-degenerate diffusion. Finally, we show the uniqueness and ergodicity of the invariant measure in $L^{1}$, in the case of bounded diffusion and additive noise.
Submitted 24 October, 2023; v1 submitted 27 September, 2023;
originally announced September 2023.
-
On the small-mass limit for stationary solutions of stochastic wave equations with state dependent friction
Authors:
Sandra Cerrai,
Mengzi Xie
Abstract:
We investigate the convergence, in the small mass limit, of the stationary solutions of a class of stochastic damped wave equations, where the friction coefficient depends on the state and the noisy perturbation is of multiplicative type. We show that the Smoluchowski-Kramers approximation, which has been previously shown to be true in any fixed time interval, is still valid in the long time regime. Namely, we prove that the first marginals of any sequence of stationary solutions for the damped wave equation converge to the unique invariant measure of the limiting stochastic quasilinear parabolic equation. The convergence is proved with respect to the Wasserstein distance associated with the $H^{-1}$ norm.
Submitted 4 September, 2023;
originally announced September 2023.
-
First- and Second-Order Stochastic Adaptive Regularization with Cubics: High Probability Iteration and Sample Complexity
Authors:
Katya Scheinberg,
Miaolan Xie
Abstract:
We present high-probability (and expectation) complexity bounds for two versions of stochastic adaptive regularization methods with cubics (SARC), also known as regularized Newton methods. The first algorithm aims to find first-order stationary points, while the second targets second-order optimality conditions. Both methods employ stochastic zeroth-, first-, and second-order oracles with specific accuracy and reliability requirements. These oracles, which have been previously used with other stochastic adaptive methods like trust-region and line-search algorithms, are applicable to various optimization settings including expected risk minimization and simulation optimization. In this paper, we establish the first high-probability iteration and sample complexity bounds for both first- and second-order SARC algorithms. Our analysis demonstrates that as in the deterministic case, they outperform other stochastic adaptive methods.
Submitted 21 April, 2025; v1 submitted 24 August, 2023;
originally announced August 2023.
-
Induced log-concavity of equivariant matroid invariants
Authors:
Alice L. L. Gao,
Ethan Y. H. Li,
Matthew H. Y. Xie,
Arthur L. B. Yang,
Zhong-Xue Zhang
Abstract:
Inspired by the notion of equivariant log-concavity, we introduce the concept of induced log-concavity for a sequence of representations of a finite group. For an equivariant matroid equipped with a symmetric group action or a finite general linear group action, we transform the problem of proving the induced log-concavity of matroid invariants to that of proving the Schur positivity of symmetric functions. We prove the induced log-concavity of the equivariant Kazhdan-Lusztig polynomials of $q$-niform matroids equipped with the action of a finite general linear group, as well as that of the equivariant Kazhdan-Lusztig polynomials of uniform matroids equipped with the action of a symmetric group.
As a consequence of the former, we obtain the log-concavity of Kazhdan-Lusztig polynomials of $q$-niform matroids, thus providing further positive evidence for Elias, Proudfoot and Wakefield's log-concavity conjecture on the matroid Kazhdan-Lusztig polynomials. From the latter we obtain the log-concavity of Kazhdan-Lusztig polynomials of uniform matroids, which was recently proved by Xie and Zhang by using a computer algebra approach. We also establish the induced log-concavity of the equivariant characteristic polynomials and the equivariant inverse Kazhdan-Lusztig polynomials for $q$-niform matroids and uniform matroids.
Submitted 19 July, 2023;
originally announced July 2023.
-
Sample Complexity Analysis for Adaptive Optimization Algorithms with Stochastic Oracles
Authors:
Billy Jin,
Katya Scheinberg,
Miaolan Xie
Abstract:
Several classical adaptive optimization algorithms, such as line search and trust region methods, have been recently extended to stochastic settings where function values, gradients, and Hessians in some cases, are estimated via stochastic oracles. Unlike the majority of stochastic methods, these methods do not use a pre-specified sequence of step size parameters, but adapt the step size parameter according to the estimated progress of the algorithm and use it to dictate the accuracy required from the stochastic approximations. The requirements on stochastic approximations are, thus, also adaptive and the oracle costs can vary from iteration to iteration. The step size parameters in these methods can increase and decrease based on the perceived progress, but unlike the deterministic case they are not bounded away from zero due to possible oracle failures, and bounds on the step size parameter have not been previously derived. This creates obstacles in the total complexity analysis of such methods, because the oracle costs are typically decreasing in the step size parameter, and could be arbitrarily large as the step size parameter goes to 0. Thus, until now only the total iteration complexity of these methods has been analyzed. In this paper, we derive a lower bound on the step size parameter that holds with high probability for a large class of adaptive stochastic methods. We then use this lower bound to derive a framework for analyzing the expected and high probability total oracle complexity of any method in this class. Finally, we apply this framework to analyze the total sample complexity of two particular algorithms, STORM and SASS, in the expected risk minimization problem.
Submitted 28 September, 2023; v1 submitted 12 March, 2023;
originally announced March 2023.
-
A Stochastic Quasi-Newton Method in the Absence of Common Random Numbers
Authors:
Matt Menickelly,
Stefan M. Wild,
Miaolan Xie
Abstract:
We present a quasi-Newton method for unconstrained stochastic optimization. Most existing literature on this topic assumes a setting of stochastic optimization in which a finite sum of component functions is a reasonable approximation of an expectation, and hence one can design a quasi-Newton method to exploit common random numbers. In contrast, and motivated by problems in variational quantum algorithms, we assume that function values and gradients are available only through inexact probabilistic zeroth- and first-order oracles and no common random numbers can be exploited. Our algorithmic framework -- based on prior work on the SASS algorithm -- is general and does not assume common random numbers. We derive a high-probability tail bound on the iteration complexity of the algorithm for nonconvex and strongly convex functions. We present numerical results demonstrating the empirical benefits of augmenting SASS with our quasi-Newton updating scheme, both on synthetic problems and on real problems in quantum chemistry.
Submitted 1 September, 2024; v1 submitted 17 February, 2023;
originally announced February 2023.
-
A Sequential Quadratic Programming Method with High Probability Complexity Bounds for Nonlinear Equality Constrained Stochastic Optimization
Authors:
Albert S. Berahas,
Miaolan Xie,
Baoyu Zhou
Abstract:
A step-search sequential quadratic programming method is proposed for solving nonlinear equality constrained stochastic optimization problems. It is assumed that constraint function values and derivatives are available, but only stochastic approximations of the objective function and its associated derivatives can be computed via inexact probabilistic zeroth- and first-order oracles. Under reasonable assumptions, a high-probability bound on the iteration complexity of the algorithm to approximate first-order stationarity is derived. Numerical results on standard nonlinear optimization test problems illustrate the advantages and limitations of our proposed method.
Submitted 5 October, 2024; v1 submitted 1 January, 2023;
originally announced January 2023.
-
Finite- and Large- Sample Inference for Model and Coefficients in High-dimensional Linear Regression with Repro Samples
Authors:
Peng Wang,
Min-Ge Xie,
Linjun Zhang
Abstract:
In this paper, we present a new and effective simulation-based approach to conduct both finite- and large-sample inference for high-dimensional linear regression models. This approach is developed under the so-called repro samples framework, in which we conduct statistical inference by creating and studying the behavior of artificial samples that are obtained by mimicking the sampling mechanism of the data. We obtain confidence sets for (a) the true model corresponding to the nonzero coefficients, (b) a single or any collection of regression coefficients, and (c) both the model and regression coefficients jointly. We also extend our approaches to drawing inferences on functions of the regression coefficients. The proposed approach fills in two major gaps in the high-dimensional regression literature: (1) lack of effective approaches to address model selection uncertainty and provide valid inference for the underlying true model; (2) lack of effective inference approaches that guarantee finite-sample performances. We provide both finite-sample and asymptotic results to theoretically guarantee the performances of the proposed methods. In addition, our numerical results demonstrate that the proposed methods are valid and achieve better coverage with smaller confidence sets than the existing state-of-art approaches, such as debiasing and bootstrap approaches.
Submitted 9 December, 2022; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Inference of high quantiles of a heavy-tailed distribution from block data
Authors:
Yongcheng Qi,
Mengzi Xie,
Jingping Yang
Abstract:
In this paper we consider the estimation problem for high quantiles of a heavy-tailed distribution from block data when only a few largest values are observed within blocks. We propose estimators for high quantiles and prove that these estimators are asymptotically normal. Furthermore, we employ the empirical likelihood method and the adjusted empirical likelihood method to construct confidence intervals for high quantiles. Through a simulation study we also compare the performance of the normal approximation method and the adjusted empirical likelihood methods in terms of the coverage probability and length of the confidence intervals.
Submitted 24 June, 2023; v1 submitted 16 July, 2022;
originally announced July 2022.
-
Repro Samples Method for Finite- and Large-Sample Inferences
Authors:
Min-ge Xie,
Peng Wang
Abstract:
This article presents a novel, general, and effective simulation-inspired approach, called {\it repro samples method}, to conduct statistical inference. The approach studies the performance of artificial samples, referred to as {\it repro samples}, obtained by mimicking the true observed sample to achieve uncertainty quantification and construct confidence sets for parameters of interest with guaranteed coverage rates. Both exact and asymptotic inferences are developed. An attractive feature of the general framework developed is that it does not rely on the large sample central limit theorem and is likelihood-free. As such, it is thus effective for complicated inference problems which we can not solve using the large sample central limit theorem. The proposed method is applicable to a wide range of problems, including many open questions where solutions were previously unavailable, for example, those involving discrete or non-numerical parameters. To reduce the large computational cost of such inference problems, we develop a unique matching scheme to obtain a data-driven candidate set. Moreover, we show the advantages of the proposed framework over the classical Neyman-Pearson framework. We demonstrate the effectiveness of the proposed approach on various models throughout the paper and provide a case study that addresses an open inference question on how to quantify the uncertainty for the unknown number of components in a normal mixture model. To evaluate the empirical performance of our repro samples method, we conduct simulations and study real data examples with comparisons to existing approaches. Although the development pertains to the settings where the large sample central limit theorem does not apply, it also has direct extensions to the cases where the central limit theorem does hold.
Submitted 13 June, 2022;
originally announced June 2022.
-
Approximate confidence distribution computing
Authors:
Suzanne Thornton,
Wentao Li,
Minge Xie
Abstract:
Approximate confidence distribution computing (ACDC) offers a new take on the rapidly developing field of likelihood-free inference from within a frequentist framework. The appeal of this computational method for statistical inference hinges upon the concept of a confidence distribution, a special type of estimator which is defined with respect to the repeated sampling principle. An ACDC method provides frequentist validation for computational inference in problems with unknown or intractable likelihoods. The main theoretical contribution of this work is the identification of a matching condition necessary for frequentist validity of inference from this method. In addition to providing an example of how a modern understanding of confidence distribution theory can be used to connect Bayesian and frequentist inferential paradigms, we present a case to expand the current scope of so-called approximate Bayesian inference to include non-Bayesian inference by targeting a confidence distribution rather than a posterior. The main practical contribution of this work is the development of a data-driven approach to drive ACDC in both Bayesian and frequentist contexts. The ACDC algorithm is data-driven by the selection of a data-dependent proposal function, the structure of which is quite general and adaptable to many settings. We explore two numerical examples that both verify the theoretical arguments in the development of ACDC and suggest instances in which ACDC outperforms approximate Bayesian computing methods computationally.
Submitted 12 October, 2022; v1 submitted 3 June, 2022;
originally announced June 2022.
-
On the small noise limit in the Smoluchowski-Kramers approximation of nonlinear wave equations with variable friction
Authors:
Sandra Cerrai,
Mengzi Xie
Abstract:
We study the validity of a large deviation principle for a class of stochastic nonlinear damped wave equations, of Klein-Gordon type, in the joint small mass and small noise limit. The friction term is assumed to be state dependent.
Submitted 27 August, 2022; v1 submitted 11 March, 2022;
originally announced March 2022.
-
Substantial but heterogeneous impacts of high-speed rail on talent flow in China
Authors:
Mei Xie,
Jian Gao,
Tao Zhou
Abstract:
The great expansion of high-speed rail (HSR) in China facilitates communications and interactions among people across cities. Despite extensive literature documenting the effects of HSR on a variety of variables such as local economic development, research collaboration, tourism, and capital mobility, not much is known about how HSR affects the flow of well-educated workers (talents). Here we estimate talent flow among Chinese cities based on large-scale resume data of online job seekers and explore how it is affected by HSR. Specifically, we employ both a multiple linear regression model that controls for several socioeconomic factors and a two-stage least squares regression model that instruments the introduction of HSR to a city to address endogeneity concerns. We find that the introduction of HSR has an overall positive effect on the talent net inflow of a city although both inflow and outflow are increased. Moreover, the effects of HSR on talent flow are rather heterogeneous for cities with different levels of economic development and for talents working in different industries. Specifically, developed cities benefit from HSR, whereas less-developed cities are relatively impaired. Cities connected by HSR show significant advantage in attracting talents from secondary and tertiary industries. These substantial but heterogeneous effects of HSR suggest a critical need for more comprehensive thinking about the long-term benefits of entering the HSR network, especially for less-developed cities and those with comparative advantage in manufacturing and service industries.
Submitted 23 December, 2021; v1 submitted 13 December, 2021;
originally announced December 2021.
-
High Probability Complexity Bounds for Adaptive Step Search Based on Stochastic Oracles
Authors:
Billy Jin,
Katya Scheinberg,
Miaolan Xie
Abstract:
We consider a step search method for continuous optimization under a stochastic setting where the function values and gradients are available only through inexact probabilistic zeroth- and first-order oracles. Unlike the stochastic gradient method and its many variants, the algorithm does not use a pre-specified sequence of step sizes but increases or decreases the step size adaptively according to the estimated progress of the algorithm. These oracles capture multiple standard settings including expected loss minimization and zeroth-order optimization. Moreover, our framework is very general and allows the function and gradient estimates to be biased. The proposed algorithm is simple to describe and easy to implement. Under fairly general conditions on the oracles, we derive a high probability tail bound on the iteration complexity of the algorithm when it is applied to non-convex, convex, and strongly convex (more generally, those satisfying the PL condition) functions. Our analysis strengthens and extends prior results for stochastic step and line search methods.
Submitted 1 November, 2023; v1 submitted 11 June, 2021;
originally announced June 2021.
-
The equivariant inverse Kazhdan-Lusztig polynomials of uniform matroids
Authors:
Alice L. L. Gao,
Matthew H. Y. Xie,
Arthur L. B. Yang
Abstract:
Motivated by the concepts of the inverse Kazhdan-Lusztig polynomial and the equivariant Kazhdan-Lusztig polynomial, Proudfoot defined the equivariant inverse Kazhdan-Lusztig polynomial for a matroid. In this paper, we show that the equivariant inverse Kazhdan-Lusztig polynomial of a matroid is very useful for determining its equivariant Kazhdan-Lusztig polynomials, and we determine the equivariant inverse Kazhdan-Lusztig polynomials for Boolean matroids and uniform matroids. As an application, we give a new proof of Gedeon, Proudfoot and Young's formula for the equivariant Kazhdan-Lusztig polynomials of uniform matroids. Inspired by Lee, Nasr and Radcliffe's combinatorial interpretation for the ordinary Kazhdan-Lusztig polynomials of uniform matroids, we further present a new formula for the corresponding equivariant Kazhdan-Lusztig polynomials.
Submitted 18 May, 2021;
originally announced May 2021.
-
Bridging Bayesian, frequentist and fiducial (BFF) inferences using confidence distribution
Authors:
Suzanne Thornton,
Minge Xie
Abstract:
Bayesian, frequentist and fiducial (BFF) inferences are much more congruous than they have been perceived historically in the scientific community (cf., Reid and Cox 2015; Kass 2011; Efron 1998). Most practitioners are probably more familiar with the two dominant statistical inferential paradigms, Bayesian inference and frequentist inference. The third, lesser known fiducial inference paradigm was pioneered by R.A. Fisher in an attempt to define an inversion procedure for inference as an alternative to Bayes' theorem. Although each paradigm has its own strengths and limitations subject to their different philosophical underpinnings, this article intends to bridge these different inferential methodologies through the lenses of confidence distribution theory and Monte-Carlo simulation procedures. This article attempts to understand how these three distinct paradigms, Bayesian, frequentist, and fiducial inference, can be unified and compared on a foundational level, thereby increasing the range of possible techniques available to both statistical theorists and practitioners across all fields.
Submitted 15 June, 2022; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Optimization Fabrics for Behavioral Design
Authors:
Nathan D. Ratliff,
Karl Van Wyk,
Mandy Xie,
Anqi Li,
Muhammad Asif Rana
Abstract:
A common approach to the provably stable design of reactive behavior, exemplified by operational space control, is to reduce the problem to the design of virtual classical mechanical systems (energy shaping). This framework is widely used, and through it we gain stability, but at the price of expressivity. This work presents a comprehensive theoretical framework expanding this approach showing that there is a much larger class of differential equations generalizing classical mechanical systems (and the broader class of Lagrangian systems) and greatly expanding their expressivity while maintaining the same governing stability principles. At the core of our framework is a class of differential equations we call fabrics which constitute a behavioral medium across which we can optimize a potential function. These fabrics shape the system's behavior during optimization but still always provably converge to a local minimum, making them a building block of stable behavioral design. We build the theoretical foundations of our framework here and provide a simple empirical demonstration of a practical class of geometric fabrics, which additionally exhibit a natural geometric path consistency making them convenient for flexible and intuitive behavioral design.
Submitted 25 June, 2021; v1 submitted 28 October, 2020;
originally announced October 2020.
-
Optimization Fabrics
Authors:
Nathan D. Ratliff,
Karl Van Wyk,
Mandy Xie,
Anqi Li,
Muhammad Asif Rana
Abstract:
This paper presents a theory of optimization fabrics, second-order differential equations that encode nominal behaviors on a space and can be used to define the behavior of a smooth optimizer. Optimization fabrics can encode commonalities among optimization problems that reflect the structure of the space itself, enabling smooth optimization processes to intelligently navigate each problem even when optimizing simple naive potential functions. Importantly, optimization over a fabric is inherently asymptotically stable. The majority of this paper is dedicated to the development of a tool set for the design and use of a broad class of fabrics called geometric fabrics. Geometric fabrics encode behavior as general nonlinear geometries which are covariant second-order differential equations with a special homogeneity property that ensures their behavior is independent of the system's speed through the medium. A class of Finsler Lagrangian energies can be used to both define how these nonlinear geometries combine with one another and how they react when potential functions force them from their nominal paths. Furthermore, these geometric fabrics are closed under the standard operations of pullback and combination on a transform tree. For behavior representation, this class of geometric fabrics constitutes a broad class of spectral semi-sprays (specs), also known as Riemannian Motion Policies (RMPs) in the context of robotic motion generation, that captures both the intuitive separation between acceleration policy and priority metric critical for modular design and are inherently stable. Therefore, geometric fabrics are safe and easier to use by less experienced behavioral designers. Application of this theory to policy representation and generalization in learning are discussed as well.
Submitted 21 August, 2020; v1 submitted 5 August, 2020;
originally announced August 2020.
-
The inverse Kazhdan-Lusztig polynomial of a matroid
Authors:
Alice L. L. Gao,
Matthew H. Y. Xie
Abstract:
In analogy with the classical Kazhdan-Lusztig polynomials for Coxeter groups, Elias, Proudfoot and Wakefield introduced the concept of Kazhdan-Lusztig polynomials for matroids. It is known that both the classical Kazhdan-Lusztig polynomials and the matroid Kazhdan-Lusztig polynomials can be considered as special cases of the Kazhdan-Lusztig-Stanley polynomials for locally finite posets. In the framework of Kazhdan-Lusztig-Stanley polynomials, we study the inverse of Kazhdan-Lusztig-Stanley functions and define the inverse Kazhdan-Lusztig polynomials for matroids. We also compute these polynomials for boolean matroids and uniform matroids. As an unexpected application of the inverse Kazhdan-Lusztig polynomials, we obtain a new formula to compute the Kazhdan-Lusztig polynomials for uniform matroids. Similar to the Kazhdan-Lusztig polynomial of a matroid, we conjecture that the coefficients of its inverse Kazhdan-Lusztig polynomial are nonnegative and log-concave.
Submitted 30 July, 2020;
originally announced July 2020.
-
A system of $k$ Sylvester-type quaternion matrix equations with $3k+1$ variables
Authors:
Qing-Wen Wang,
Mengyan Xie
Abstract:
In this paper, we provide some solvability conditions in terms of ranks for the existence of a general solution to a system of $k$ Sylvester-type quaternion matrix equations with $3k+1$ variables $A_{i}X_{i}+Y_{i}B_{i}+C_{i}Z_{i}D_{i}+F_{i}Z_{i+1}G_{i}=E_{i},~i=\overline{1,k}$. As applications of this system, we present rank equalities as the necessary and sufficient conditions for the existence of a general solution to some systems of quaternion matrix equations $A_{i}X_{i}+(A_{i}X_{i})_φ+C_{i}Z_{i}(C_{i})_φ+F_{i}Z_{i+1}(F_{i})_φ=E_{i},~i=\overline{1,k}$.
Submitted 30 July, 2020; v1 submitted 28 July, 2020;
originally announced July 2020.
-
Leveraging the Fisher randomization test using confidence distributions: inference, combination and fusion learning
Authors:
Xiaokang Luo,
Tirthankar Dasgupta,
Minge Xie,
Regina Liu
Abstract:
The flexibility and wide applicability of the Fisher randomization test (FRT) make it an attractive tool for assessment of causal effects of interventions from modern-day randomized experiments that are increasing in size and complexity. This paper provides a theoretical inferential framework for FRT by establishing its connection with confidence distributions. Such a connection leads to development of (i) an unambiguous procedure for inversion of FRTs to generate confidence intervals with guaranteed coverage, (ii) generic and specific methods to combine FRTs from multiple independent experiments with theoretical guarantees and (iii) new insights on the effect of size of the Monte Carlo sample on the results of FRT. Our developments pertain to finite sample settings but have direct extensions to large samples. Simulations and a case example demonstrate the benefit of these new developments.
Submitted 17 April, 2020;
originally announced April 2020.
-
Homeostasis phenomenon in predictive inference when using a wrong learning model: a tale of random split of data into training and test sets
Authors:
Min-ge Xie,
Zheshi Zheng
Abstract:
This note uses a conformal prediction procedure to provide further support on several points discussed by Professor Efron (Efron, 2020) concerning prediction, estimation and the IID assumption. It aims to convey the following messages: (1) Under the IID (e.g., random split of training and testing data sets) assumption, prediction is indeed an easier task than estimation, since prediction has a 'homeostasis property' in this case -- even if the model used for learning is completely wrong, the prediction results remain valid. (2) If the IID assumption is violated (e.g., a targeted prediction on specific individuals), the homeostasis property is often disrupted and the prediction results under a wrong model are usually invalid. (3) Better model estimation typically leads to more accurate prediction in both IID and non-IID cases. Good modeling and estimation practices are important and, in many cases, crucial for obtaining good prediction results. The discussion also provides one explanation why the deep learning method works so well in academic exercises (with experiments set up by randomly splitting the entire data into training and testing data sets), but fails to deliver many 'killer applications' in real-world applications.
Submitted 19 March, 2020;
originally announced March 2020.
-
Geometric Conditions for the Discrepant Posterior Phenomenon and Connections to Simpson's Paradox
Authors:
Yang Chen,
Ruobin Gong,
Min-ge Xie
Abstract:
The discrepant posterior phenomenon (DPP) is a counter-intuitive phenomenon that can frequently occur in a Bayesian analysis of multivariate parameters. It refers to the phenomenon that a parameter estimate based on a posterior is more extreme than both of those inferred based on either the prior or the likelihood alone. Inferential claims that exhibit DPP defy the common intuition that the posterior is a prior-data compromise, and the phenomenon can be surprisingly ubiquitous in well-behaved Bayesian models. In this paper we revisit this phenomenon and, using point estimation as an example, derive conditions under which the DPP occurs in Bayesian models with exponential quadratic likelihoods and conjugate multivariate Gaussian priors. The family of exponential quadratic likelihood models includes Gaussian models and models with the local asymptotic normality property. We provide an intuitive geometric interpretation of the phenomenon and show that there exists a nontrivial space of marginal directions such that the DPP occurs. We further relate the phenomenon to Simpson's paradox and discover their deep-rooted connection that is associated with marginalization. We also draw connections with Bayesian computational algorithms when difficult geometry exists. Our discovery demonstrates that DPP is more prevalent than previously understood and anticipated. Theoretical results are complemented by numerical illustrations. Scenarios covered in this study have implications for parameterization, sensitivity analysis, and prior choice for Bayesian modeling.
Submitted 12 January, 2022; v1 submitted 22 January, 2020;
originally announced January 2020.
-
Spectral Radii of Products of Random Rectangular Matrices
Authors:
Yongcheng Qi,
Mengzi Xie
Abstract:
We consider m independent random rectangular matrices whose entries are independent and identically distributed standard complex Gaussian random variables. Assume the product of the m rectangular matrices is an n by n square matrix. The maximum absolute value of the n eigenvalues of the product matrix is called the spectral radius. In this paper, we study the limiting spectral radii of the product when m changes with n and can even diverge. We give a complete description of the limiting distribution of the spectral radius. Our results reduce to those in Jiang and Qi [26] when the rectangular matrices are square ones.
Submitted 15 July, 2022; v1 submitted 11 September, 2019;
originally announced September 2019.
-
Equivariant Kazhdan-Lusztig polynomials of thagomizer matroids
Authors:
Matthew H. Y. Xie,
Philip B. Zhang
Abstract:
The equivariant Kazhdan-Lusztig polynomial of a matroid was introduced by Gedeon, Proudfoot, and Young. Gedeon conjectured an explicit formula for the equivariant Kazhdan-Lusztig polynomials of thagomizer matroids with an action of symmetric groups. In this paper, we discover a new formula for these polynomials which is related to the equivariant Kazhdan-Lusztig polynomials of uniform matroids. Based on our new formula, we confirm Gedeon's conjecture by the Pieri rule.
Submitted 4 February, 2019;
originally announced February 2019.
-
The Kazhdan-Lusztig polynomials of uniform matroids
Authors:
Alice L. L. Gao,
Linyuan Lu,
Matthew H. Y. Xie,
Arthur L. B. Yang,
Philip B. Zhang
Abstract:
The Kazhdan-Lusztig polynomial of a matroid was introduced by Elias, Proudfoot, and Wakefield [{\it Adv. Math. 2016}]. Let $U_{m,d}$ denote the uniform matroid of rank $d$ on a set of $m+d$ elements. Gedeon, Proudfoot, and Young [{\it J. Combin. Theory Ser. A, 2017}] pointed out that they can derive an explicit formula of the Kazhdan-Lusztig polynomials of $U_{m,d}$ using equivariant Kazhdan-Lusztig polynomials. In this paper we give two alternative explicit formulas, which allow us to prove the real-rootedness of the Kazhdan-Lusztig polynomials of $U_{m,d}$ for $2\leq m\leq 15$ and all $d$'s. The case $m=1$ was previously proved by Gedeon, Proudfoot, and Young [{\it Sém. Lothar. Combin. 2017}]. We further determine the $Z$-polynomials of all $U_{m,d}$'s and prove the real-rootedness of the $Z$-polynomials of $U_{m,d}$ for $2\leq m\leq 15$ and all $d$'s. Our formula also enables us to give an alternative proof of Gedeon, Proudfoot, and Young's formula for the Kazhdan-Lusztig polynomials of $U_{m,d}$'s without using the equivariant Kazhdan-Lusztig polynomials.
Submitted 28 June, 2018;
originally announced June 2018.
-
Kazhdan-Lusztig polynomials of fan matroids, wheel matroids and whirl matroids
Authors:
Linyuan Lu,
Matthew H. Y. Xie,
Arthur L. B. Yang
Abstract:
The Kazhdan-Lusztig polynomial of a matroid was introduced by Elias, Proudfoot and Wakefield, and its properties remain to be further explored. In this paper we prove that the Kazhdan-Lusztig polynomials of fan matroids coincide with Motzkin polynomials, which was recently conjectured by Gedeon. As a byproduct, we determine the Kazhdan-Lusztig polynomials of graphic matroids of squares of paths. We further obtain explicit formulas for the Kazhdan-Lusztig polynomials of wheel matroids and whirl matroids. We prove the real-rootedness of the Kazhdan-Lusztig polynomials of these matroids, which provides positive evidence for a conjecture due to Gedeon, Proudfoot and Young. Based on the results on the Kazhdan-Lusztig polynomials, we also determine the $Z$-polynomials of fan matroids, wheel matroids and whirl matroids, and prove their real-rootedness, which provides further evidence in support of a conjecture of Proudfoot, Xu, and Young.
Submitted 11 February, 2018;
originally announced February 2018.
-
Group-Server Queues
Authors:
Quan-Lin Li,
Jing-Yu Ma,
Mingzhou Xie,
Li Xia
Abstract:
By analyzing energy-efficient management of data centers, this paper proposes and develops a class of interesting {\it Group-Server Queues}, and establishes two representative group-server queues through loss networks and impatient customers, respectively. Furthermore, model descriptions and the necessary interpretation are given for these two group-server queues. A simple mathematical discussion is also provided, and simulations are carried out to study the expected queue lengths, the expected sojourn times and the expected virtual service times. In addition, this paper shows that this class of group-server queues is often encountered in many other practical areas, including communication networks, manufacturing systems, transportation networks, financial networks and healthcare systems. Note that group-server queues are used to design effective dynamic control mechanisms by regrouping and recombining the many servers in a large-scale service system by means of, for example, bilateral threshold control and the transfer of customers to the buffer or to server groups. As a result, the large-scale service system is divided into several adaptive and self-organizing subsystems through the scheduling of batch customers and the regrouping of service resources, which allows the middle layer of the service system to be managed more effectively and strengthened under a dynamic, real-time and even reward-optimal framework. On this basis, the performance of such a large-scale service system may be improved greatly by introducing and analyzing such group-server queues. Therefore, not only is the analysis of group-server queues a new and interesting research direction, but there also exist many theoretical challenges, basic difficulties and open problems in the area of queueing networks.
Submitted 21 July, 2017; v1 submitted 11 June, 2017;
originally announced June 2017.
-
An effective likelihood-free approximate computing method with statistical inferential guarantees
Authors:
Suzanne Thornton,
Wentao Li,
Min-ge Xie
Abstract:
Approximate Bayesian computing is a powerful likelihood-free method that has grown increasingly popular since early applications in population genetics. However, complications arise in the theoretical justification for Bayesian inference conducted from this method with a non-sufficient summary statistic. In this paper, we seek to re-frame approximate Bayesian computing within a frequentist context and justify its performance by standards set on the frequency coverage rate. In doing so, we develop a new computational technique called approximate confidence distribution computing, yielding theoretical support for the use of non-sufficient summary statistics in likelihood-free methods. Furthermore, we demonstrate that approximate confidence distribution computing extends the scope of approximate Bayesian computing to include data-dependent priors without damaging the inferential integrity. This data-dependent prior can be viewed as an initial `distribution estimate' of the target parameter which is updated with the results of the approximate confidence distribution computing method. A general strategy for constructing an appropriate data-dependent prior is also discussed and is shown to often increase the computing speed while maintaining statistical inferential guarantees. We supplement the theory with simulation studies illustrating the benefits of the proposed method, namely the potential for broader applications and the increased computing speed compared to the standard approximate Bayesian computing methods.
Submitted 30 November, 2018; v1 submitted 29 May, 2017;
originally announced May 2017.
-
Schur positivity and log-concavity related to longest increasing subsequences
Authors:
Alice L. L. Gao,
Matthew H. Y. Xie,
Arthur L. B. Yang
Abstract:
Chen proposed a conjecture on the log-concavity of the generating function for the symmetric group with respect to the length of longest increasing subsequences of permutations. Motivated by Chen's log-concavity conjecture, Bóna, Lackner and Sagan further studied similar problems by restricting the whole symmetric group to certain of its subsets. They obtained the log-concavity of the corresponding generating functions for these subsets by using the hook-length formula. In this paper, we generalize and prove their results by establishing the Schur positivity of certain symmetric functions. This also enables us to propose a new approach to Chen's original conjecture.
Submitted 18 March, 2017;
originally announced March 2017.
-
The Smith normal form of a specialized Giambelli-type matrix
Authors:
Alice L. L. Gao,
Matthew H. Y. Xie,
Arthur L. B. Yang
Abstract:
In the study of determinant formulas for Schur functions, Hamel and Goulden introduced a class of Giambelli-type matrices with respect to outside decompositions of partition diagrams, which unify the Jacobi-Trudi matrices, the Giambelli matrices and the Lascoux-Pragacz matrices. Stanley determined the Smith normal form of a specialized Jacobi-Trudi matrix. Motivated by Stanley's work, we obtain the Smith normal form of a specialized Giambelli matrix and a specialized Lascoux-Pragacz matrix. Furthermore, we show that, for a given partition, the Smith normal form of any specialized Giambelli-type matrix can be obtained from that of the corresponding specialization of the classical Giambelli matrix by a sequence of stabilization operations.
Submitted 5 March, 2017;
originally announced March 2017.
-
Computable Error Estimates for Ground State Solution of Bose-Einstein Condensates
Authors:
Hehu Xie,
Manting Xie
Abstract:
In this paper, we propose a computable error estimate of the Gross-Pitaevskii equation for ground state solution of Bose-Einstein condensates by general conforming finite element methods on general meshes. Based on the proposed error estimate, asymptotic lower bounds of the smallest eigenvalue and ground state energy can be obtained. Several numerical examples are presented to validate our theoretical results in this paper.
Submitted 18 April, 2016;
originally announced April 2016.
-
Multi-Objective Optimization of a Port-of-Entry Inspection Policy
Authors:
Christina M. Young,
Mingyu Li,
Yada Zhu,
Minge Xie,
Elsayed A. Elsayed,
Tsvetan Asamov
Abstract:
At the port-of-entry containers are inspected through a specific sequence of sensor stations to detect the presence of nuclear materials, biological and chemical agents, and other illegal cargo. The inspection policy, which includes the sequence in which sensors are applied and the threshold levels used at the inspection stations, affects the probability of misclassifying a container as well as the cost and time spent in inspection. In this paper we consider a system operating with a Boolean decision function combining station results and present a multi-objective optimization approach to determine the optimal sensor arrangement and threshold levels while considering cost and time. The total cost includes cost incurred by misclassification errors and the total expected cost of inspection, while the time represents the total expected time a container spends in the inspection system. An example which applies the approach in a theoretical inspection system is presented.
Submitted 19 May, 2015;
originally announced May 2015.
-
A Full Multigrid Method for Nonlinear Eigenvalue Problems
Authors:
Shanghui Jia,
Hehu Xie,
Manting Xie,
Fei Xu
Abstract:
This paper introduces a type of full multigrid method for nonlinear eigenvalue problems. The main idea is to transform the solution of the nonlinear eigenvalue problem into a series of solutions of the corresponding linear boundary value problems on a sequence of finite element spaces and of nonlinear eigenvalue problems on the coarsest finite element space. The linearized boundary value problems are solved by multigrid iterations. Besides multigrid iteration, any other efficient iteration method for solving boundary value problems can serve as the linear problem solver. We prove that the computational work of this new scheme is truly optimal, the same as that of solving the corresponding linear boundary value problem. This type of iteration scheme therefore improves the overall efficiency of solving nonlinear eigenvalue problems. Some numerical experiments are presented to validate the efficiency of the new method.
Submitted 16 February, 2015;
originally announced February 2015.
-
Melham's Conjecture on Odd Power Sums of Fibonacci Numbers
Authors:
Brian Y. Sun,
Matthew H. Y. Xie,
Arthur L. B. Yang
Abstract:
Ozeki and Prodinger showed that the odd power sum of the first several consecutive Fibonacci numbers of even order is equal to a polynomial evaluated at a certain Fibonacci number of odd order. We prove that this polynomial and its derivative both vanish at $1$, and that it becomes an integer polynomial after multiplication by a product of the first consecutive Lucas numbers of odd order. This gives an affirmative answer to a conjecture of Melham.
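The cubic case gives a concrete instance of the statement in this abstract. The sketch below numerically checks the identity $4\sum_{k=1}^{n} F_{2k}^3 = (F_{2n+1}-1)^2 (F_{2n+1}+2)$, where the factor $4 = L_1 L_3$ is the product of the first two Lucas numbers of odd order. This identity and the helper names are assumptions of the sketch and are not quoted from the abstract. The double root at $1$ is visible in the factor $(x-1)^2$.

```python
# Numerical check of the cubic case (an illustrative assumption, not from the abstract).

def fib(n):
    """Return the n-th Fibonacci number with fib(0) = 0 and fib(1) = 1."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

def even_index_cube_sum(n):
    """Sum of the cubes of F_2, F_4, ..., F_{2n}."""
    return sum(fib(2 * k) ** 3 for k in range(1, n + 1))

def scaled_cubic_polynomial(x):
    """4 * P(x) for P(x) = (x - 1)^2 (x + 2) / 4, with 4 = L_1 * L_3 = 1 * 4."""
    return (x - 1) ** 2 * (x + 2)

# Compare both sides for the first few values of n using exact integer arithmetic.
checks = [
    4 * even_index_cube_sum(n) == scaled_cubic_polynomial(fib(2 * n + 1))
    for n in range(1, 30)
]
print(all(checks))
```

Multiplying by $4$ keeps every quantity an integer, which mirrors the role of the Lucas-number product in the abstract.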
Submitted 11 February, 2015;
originally announced February 2015.