-
Almost Global Solutions of Kirchhoff Equation
Authors:
Jianjun Liu,
Duohui Xiang
Abstract:
This paper is concerned with the original Kirchhoff equation $$\left\{\begin{aligned}
& \partial_{tt}u-\Big(1+\int_{0}^{\pi}|\partial_xu|^2 dx\Big)\partial_{xx}u=0,
\\&u(t,0)=u(t,\pi)=0. \end{aligned}\right.$$ We obtain almost global existence and stability of solutions for almost any small initial data of size $\varepsilon$. In Sobolev spaces, the time of existence and stability is of order $\varepsilon^{-r}$ for arbitrary positive integer $r$. In Gevrey and analytic spaces, the time is of order $e^{\frac{|\ln\varepsilon|^2}{c\ln|\ln\varepsilon|}}$ with some positive constant $c$. To achieve these, we build rational normal form for infinite dimensional reversible vector fields without external parameters. We emphasize that for vector fields, the homological equation and the definition of rational normal form are significantly different from those for Hamiltonian functions.
Submitted 2 May, 2025;
originally announced May 2025.
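As a reading aid for the equation in the abstract above, the following sketch rewrites it in Fourier sine modes. The expansion and the coefficient names $q_k$ are notation introduced here for illustration and are not taken from the paper.

```latex
% Assumed notation (not from the paper): sine-series coefficients q_k(t).
u(t,x) = \sum_{k\ge 1} q_k(t)\,\sin(kx),
\qquad
\int_0^{\pi} |\partial_x u|^2\,dx = \frac{\pi}{2}\sum_{j\ge 1} j^2 q_j^2 ,
\\
\ddot q_k + k^2\Big(1 + \frac{\pi}{2}\sum_{j\ge 1} j^2 q_j^2\Big) q_k = 0,
\qquad k \ge 1 .
```

The Dirichlet condition holds automatically in this basis. The linear frequencies are the integers $k$, fixed by the equation itself, which is one way to read the phrase "without external parameters".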
-
A frequentist local false discovery rate
Authors:
Daniel Xiang,
Jake A. Soloff,
William Fithian
Abstract:
The local false discovery rate (lfdr) of Efron et al. (2001) enjoys major conceptual and decision-theoretic advantages over the false discovery rate (FDR) as an error criterion in multiple testing, but is only well-defined in Bayesian models where the truth status of each null hypothesis is random. We define a frequentist counterpart to the lfdr based on the relative frequency of nulls at each point in the sample space. The frequentist lfdr is defined without reference to any prior, but preserves several important properties of the Bayesian lfdr: For continuous test statistics, $\text{lfdr}(t)$ gives the probability, conditional on observing some statistic equal to $t$, that the corresponding null hypothesis is true. Evaluating the lfdr at an individual test statistic also yields a calibrated forecast of whether its null hypothesis is true. Finally, thresholding the lfdr at $\frac{1}{1+λ}$ gives the best separable rejection rule under the weighted classification loss where Type I errors are $λ$ times as costly as Type II errors. The lfdr can be estimated efficiently using parametric or non-parametric methods, and a closely related error criterion can be provably controlled in finite samples under independence assumptions. Whereas the FDR measures the average quality of all discoveries in a given rejection region, our lfdr measures how the quality of discoveries varies across the rejection region, allowing for a more fine-grained analysis without requiring the introduction of a prior.
Submitted 21 February, 2025;
originally announced February 2025.
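The thresholding claim in the abstract above can be checked numerically in a simple setting. The sketch below uses the classical Bayesian two-groups model as a stand-in, not the frequentist definition from the paper; the null proportion `pi0`, the alternative mean `alt_mean`, and all function names are assumptions chosen for illustration.

```python
import math


def normal_pdf(x, mean=0.0, sd=1.0):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def lfdr(t, pi0=0.9, alt_mean=3.0):
    """Two-groups lfdr: posterior probability that the null is true at t.

    Assumed model: N(0, 1) under the null, N(alt_mean, 1) otherwise.
    """
    null_part = pi0 * normal_pdf(t)
    alt_part = (1.0 - pi0) * normal_pdf(t, mean=alt_mean)
    return null_part / (null_part + alt_part)


def reject_by_threshold(t, lam):
    """Reject when the lfdr is at most 1 / (1 + lam)."""
    return lfdr(t) <= 1.0 / (1.0 + lam)


def reject_by_cost(t, lam):
    """Reject when rejecting has the smaller expected loss.

    A Type I error costs lam, a Type II error costs 1.
    """
    p_null = lfdr(t)
    cost_reject = lam * p_null       # loss incurred only if the null is true
    cost_accept = 1.0 - p_null       # loss incurred only if the null is false
    return cost_reject <= cost_accept
```

The two rules agree because $λ\,\text{lfdr} \le 1 - \text{lfdr}$ rearranges to $\text{lfdr} \le \frac{1}{1+λ}$.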
-
A Birkhoff Normal Form Theorem for Partial Differential Equations on torus
Authors:
Jianjun Liu,
Duohui Xiang
Abstract:
We prove an abstract Birkhoff normal form theorem for Hamiltonian partial differential equations on the torus. The normal form is complete up to arbitrary finite order. The proof is based on a valid non-resonant condition and a suitable norm of the Hamiltonian function. Then, as two examples, we apply this theorem to the nonlinear wave equation in one dimension and the nonlinear Schrödinger equation in high dimension. Consequently, polynomially long time stability is proved in Sobolev spaces $H^s$ with the index $s$ being much smaller than before. Further, by letting the number of iterative steps depend on the size of the initial datum, we prove sub-exponentially long time stability for these two equations.
Submitted 20 November, 2024;
originally announced November 2024.
-
Non-standard boundary behaviour in binary mixture models
Authors:
Heather Battey,
Peter McCullagh,
Daniel Xiang
Abstract:
Consider a binary mixture model of the form $F_θ= (1-θ)F_0 + θF_1$, where $F_0$ is standard Gaussian and $F_1$ is a completely specified heavy-tailed distribution with the same support. For a sample of $n$ independent and identically distributed values $X_i \sim F_θ$, the maximum likelihood estimator $\hatθ_n$ is asymptotically normal provided that $0 < θ< 1$ is an interior point. This paper investigates the large-sample behaviour for boundary points, which is entirely different and strikingly asymmetric for $θ=0$ and $θ=1$. The reason for the asymmetry has to do with typical choices such that $F_0$ is an extreme boundary point and $F_1$ is usually not extreme. On the right boundary, well known results on boundary parameter problems are recovered, giving $\lim \mathbb{P}_1(\hatθ_n < 1)=1/2$. On the left boundary, $\lim\mathbb{P}_0(\hatθ_n > 0)=1-1/α$, where $1\leq α\leq 2$ indexes the domain of attraction of the density ratio $f_1(X)/f_0(X)$ when $X\sim F_0$. For $α=1$, which is the most important case in practice, we show how the tail behaviour of $F_1$ governs the rate at which $\mathbb{P}_0(\hatθ_n > 0)$ tends to zero. A new limit theorem for the joint distribution of the sample maximum and sample mean conditional on positivity establishes multiple inferential anomalies. Most notably, given $\hatθ_n > 0$, the likelihood ratio statistic has a conditional null limit distribution $G\neqχ^2_1$ determined by the joint limit theorem. We show through this route that no advantage is gained by extending the single distribution $F_1$ to the nonparametric composite mixture generated by the same tail-equivalence class.
Submitted 15 August, 2024; v1 submitted 29 July, 2024;
originally announced July 2024.
-
Sharp phase transitions in high-dimensional changepoint detection
Authors:
Daniel Xiang,
Chao Gao
Abstract:
We study a hypothesis testing problem in the context of high-dimensional changepoint detection. Given a matrix $X \in \mathbb{R}^{p \times n}$ with independent Gaussian entries, the goal is to determine whether or not a sparse, non-null fraction of rows in $X$ exhibits a shift in mean at a common index between $1$ and $n$. We focus on three aspects of this problem: the sparsity of non-null rows, the presence of a single, common changepoint in the non-null rows, and the signal strength associated with the changepoint. Within an asymptotic regime relating the data dimensions $n$ and $p$ to the signal sparsity and strength, the information-theoretic limits of this testing problem are characterized by a formula that determines whether or not there exists a testing procedure whose sum of Type I and II errors tends to zero as $n,p \to \infty$. The formula, called the detection boundary, partitions the parameter space into two regions: one where it is possible to detect the presence of a single aligned changepoint (detectable region), and another where no test is able to consistently distinguish the mean matrix from one with constant rows (undetectable region).
Submitted 25 March, 2025; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Interpretation of local false discovery rates under the zero assumption
Authors:
Daniel Xiang,
Nikolaos Ignatiadis,
Peter McCullagh
Abstract:
In large-scale studies with parallel signal-plus-noise observations, the local false discovery rate is a summary statistic that is often presumed to be equal to the posterior probability that the signal is null. We prefer to call the latter quantity the local null-signal rate to emphasize our view that a null signal and a false discovery are not identical events. The local null-signal rate is commonly estimated through empirical Bayes procedures that build on the `zero density assumption,' which attributes the density of observations near zero entirely to null signals. In this paper, we argue that this strategy does not furnish estimates of the local null-signal rate, but instead of a quantity we call the complementary local activity rate (clar). Although it is likely to be small, an inactive signal is not necessarily zero. The clar dominates both the local null-signal rate and the local false sign rate and is a weakly continuous functional of the signal distribution. As a consequence, it takes on sensible values when the signal is sparse but not exactly zero. Our findings clarify the interpretation of local false discovery rates estimated under the zero density assumption.
Submitted 24 March, 2025; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Exact Global Control of Small Divisors in Rational Normal Form
Authors:
Jianjun Liu,
Duohui Xiang
Abstract:
Rational normal form is a powerful tool to deal with Hamiltonian partial differential equations without external parameters. In this paper, we build rational normal form with exact global control of small divisors. As an application to nonlinear Schrödinger equations in Gevrey spaces, we prove sub-exponentially long time stability results for generic small initial data.
Submitted 24 July, 2023;
originally announced July 2023.
-
Sparse-limit approximation for t-statistics
Authors:
Micol Tresoldi,
Daniel Xiang,
Peter McCullagh
Abstract:
In a range of genomic applications, it is of interest to quantify the evidence that the signal at site $i$ is active given conditionally independent replicate observations summarized by the sample mean and variance $(\bar Y, s^2)$ at each site. We study the version of the problem in which the signal distribution is sparse, and the error distribution has an unknown site-specific variance so that the null distribution of the standardized statistic is Student-$t$ rather than Gaussian. The main contribution of this paper is a sparse-mixture approximation to the non-null density of the $t$-ratio. This formula demonstrates the effect of low degrees of freedom on the Bayes factor, or the conditional probability that the site is active. We illustrate some differences on an HIV dataset for gene-expression data previously analyzed by Efron (2012).
Submitted 19 December, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
The edge of discovery: Controlling the local false discovery rate at the margin
Authors:
Jake A. Soloff,
Daniel Xiang,
William Fithian
Abstract:
Despite the popularity of the false discovery rate (FDR) as an error control metric for large-scale multiple testing, its close Bayesian counterpart, the local false discovery rate (lfdr), defined as the posterior probability that a particular null hypothesis is true, is a more directly relevant standard for justifying and interpreting individual rejections. However, the lfdr is difficult to work with in small samples, as the prior distribution is typically unknown. We propose a simple multiple testing procedure and prove that it controls the expectation of the maximum lfdr across all rejections; equivalently, it controls the probability that the rejection with the largest p-value is a false discovery. Our method operates without knowledge of the prior, assuming only that the p-value density is uniform under the null and decreasing under the alternative. We also show that our method asymptotically implements the oracle Bayes procedure for a weighted classification risk, optimally trading off between false positives and false negatives. We derive the limiting distribution of the attained maximum lfdr over the rejections, and the limiting empirical Bayes regret relative to the oracle procedure.
Submitted 21 September, 2023; v1 submitted 15 July, 2022;
originally announced July 2022.
-
Elementary analysis of isolated zeroes of a polynomial system
Authors:
Mitali Bafna,
Madhu Sudan,
Santhoshini Velusamy,
David Xiang
Abstract:
Wooley (J. Number Theory, 1996) gave an elementary proof of a Bézout-like theorem allowing one to count the number of isolated integer roots of a system of polynomial equations modulo some prime power.
In this article, we adapt the proof to a slightly different setting. Specifically, we consider polynomials with coefficients from a polynomial ring $\mathbb{F}[t]$ for an arbitrary field $\mathbb{F}$ and give an upper bound on the number of isolated roots modulo $t^s$ for an arbitrary positive integer $s$. In particular, using $s=1$, we can bound the number of isolated roots of a system of polynomials over an arbitrary field $\mathbb{F}$.
Submitted 31 January, 2021;
originally announced February 2021.
-
Permanental Graphs
Authors:
Daniel Xiang,
Peter McCullagh
Abstract:
The two components for infinite exchangeability of a sequence of distributions $(P_n)$ are (i) consistency, and (ii) finite exchangeability for each $n$. A consequence of the Aldous-Hoover theorem is that any node-exchangeable, subselection-consistent sequence of distributions that describes a randomly evolving network yields a sequence of random graphs whose expected number of edges grows quadratically in the number of nodes. In this note, another notion of consistency is considered, namely, delete-and-repair consistency; it is motivated by the sense in which infinitely exchangeable permutations defined by the Chinese restaurant process (CRP) are consistent. A goal is to exploit delete-and-repair consistency to obtain a nontrivial sequence of distributions on graphs $(P_n)$ that is sparse, exchangeable, and consistent with respect to delete-and-repair, a well known example being the Ewens permutations \cite{tavare}. A generalization of the CRP$(α)$ as a distribution on a directed graph using the $α$-weighted permanent is presented along with the corresponding normalization constant and degree distribution; it is dubbed the Permanental Graph Model (PGM). A negative result is obtained: no setting of parameters in the PGM allows for a consistent sequence $(P_n)$ in the sense of either subselection or delete-and-repair.
Submitted 22 September, 2020;
originally announced September 2020.
-
A nonuniform Littlewood-Offord inequality for all norms
Authors:
Kyle Luh,
David Xiang
Abstract:
Let $\mathbf{v}_i$ be vectors in $\mathbb{R}^d$ and $\{\varepsilon_i\}$ be independent Rademacher random variables. Then the Littlewood-Offord problem entails finding the best upper bound for $\sup_{\mathbf{x} \in \mathbb{R}^d} \mathbb{P}(\sum \varepsilon_i \mathbf{v}_i = \mathbf{x})$. Generalizing the uniform bounds of Littlewood-Offord, Erdős and Kleitman, a recent result of Dzindzalieta and Juškevičius provides a non-uniform bound that is optimal in its dependence on $\|\mathbf{x}\|_2$. In this short note, we provide a simple alternative proof of their result. Furthermore, our proof demonstrates that the bound applies to any norm on $\mathbb{R}^d$, not just the $\ell_2$ norm. This resolves a conjecture of Dzindzalieta and Juškevičius.
Submitted 1 September, 2020; v1 submitted 27 August, 2020;
originally announced August 2020.
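The quantity $\sup_{\mathbf{x}} \mathbb{P}(\sum \varepsilon_i \mathbf{v}_i = \mathbf{x})$ from the abstract above can be computed by brute force for small inputs. The sketch below enumerates all sign patterns; the function name and the restriction to integer-valued vectors (so that sums compare exactly) are assumptions made for illustration, and the code says nothing about the non-uniform bound itself.

```python
from collections import Counter
from itertools import product
from math import comb


def max_atom_probability(vectors):
    """Largest point mass of the random signed sum of the given vectors.

    `vectors` is a list of equal-length tuples of integers. Every one of the
    2**n Rademacher sign patterns is enumerated, so this is only practical
    for small n.
    """
    n = len(vectors)
    dim = len(vectors[0])
    counts = Counter()
    for signs in product((-1, 1), repeat=n):
        total = tuple(
            sum(s * v[k] for s, v in zip(signs, vectors)) for k in range(dim)
        )
        counts[total] += 1
    return max(counts.values()) / 2 ** n


# Four copies of the same vector attain the classical uniform bound
# binom(n, floor(n / 2)) / 2**n for nonzero vectors.
print(max_atom_probability([(1, 0)] * 4), comb(4, 2) / 2 ** 4)
```

Less aligned vectors give a smaller value: for `[(1, 0), (0, 1), (1, 1)]` the function returns 0.25, below the uniform bound of 0.375 for three vectors.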
-
A General Sensitivity Analysis Approach for Demand Response Optimizations
Authors:
Ding Xiang,
Ermin Wei
Abstract:
It is well known that demand response can improve system efficiency as well as lower consumers' (prosumers') electricity bills. However, it is not clear how we can either qualitatively identify the prosumer with the most impact potential or quantitatively estimate each prosumer's contribution to the total social welfare improvement when additional resource capacity/flexibility is introduced to the system with demand response, such as allowing net-selling behavior. In this work, we build upon existing literature on the electricity market, which consists of price-taking prosumers each with various appliances, an electric utility company and a social welfare optimizing distribution system operator, to design a general sensitivity analysis approach (GSAA) that can estimate the potential of each consumer's contribution to the social welfare when given more resource capacity. GSAA is based on the existence of an efficient competitive equilibrium, which we establish in the paper. When prosumers' utility functions are quadratic, GSAA can give a closed-form characterization of the social welfare improvement based on duality analysis. Furthermore, we extend GSAA to general convex settings, i.e., utility functions with strong convexity and Lipschitz continuous gradient. Even without knowing the specific forms of the utility functions, we can derive upper and lower bounds on the social welfare improvement potential of each prosumer when extra resources are introduced. For both settings, several applications and numerical examples are provided, including extending the AC comfort zone, the ability of EVs to discharge, and net selling. The estimation results show that GSAA can be used to decide how to allocate potentially limited market resources in the most impactful way.
Submitted 7 October, 2018;
originally announced October 2018.
-
The facets of the matroid polytope and the independent set polytope of a positroid
Authors:
Suho Oh,
David Xiang
Abstract:
A positroid is a special case of a realizable matroid that arose from the study of the totally nonnegative part of the Grassmannian by Postnikov. In this paper, we study the facets of its matroid polytope and the independent set polytope. This allows one to describe the bases and independent sets directly from the decorated permutation, bypassing the use of the Grassmann necklace. We also describe a criterion for determining whether a given cyclic interval is a flat or not using the decorated permutation, then show how it applies to checking the concordancy of positroids.
Submitted 15 August, 2021; v1 submitted 30 January, 2017;
originally announced January 2017.