-
Precise gradient descent training dynamics for finite-width multi-layer neural networks
Authors:
Qiyang Han,
Masaaki Imaizumi
Abstract:
In this paper, we provide the first precise distributional characterization of gradient descent iterates for general multi-layer neural networks under the canonical single-index regression model, in the `finite-width proportional regime' where the sample size and feature dimension grow proportionally while the network width and depth remain bounded. Our non-asymptotic state evolution theory captur…
▽ More
In this paper, we provide the first precise distributional characterization of gradient descent iterates for general multi-layer neural networks under the canonical single-index regression model, in the `finite-width proportional regime' where the sample size and feature dimension grow proportionally while the network width and depth remain bounded. Our non-asymptotic state evolution theory captures Gaussian fluctuations in first-layer weights and concentration in deeper-layer weights, and remains valid for non-Gaussian features.
Our theory differs from existing neural tangent kernel (NTK), mean-field (MF) theories and tensor program (TP) in several key aspects. First, our theory operates in the finite-width regime whereas these existing theories are fundamentally infinite-width. Second, our theory allows weights to evolve from individual initializations beyond the lazy training regime, whereas NTK and MF are either frozen at or only weakly sensitive to initialization, and TP relies on special initialization schemes. Third, our theory characterizes both training and generalization errors for general multi-layer neural networks beyond the uniform convergence regime, whereas existing theories study generalization almost exclusively in two-layer settings.
As a statistical application, we show that vanilla gradient descent can be augmented to yield consistent estimates of the generalization error at each iteration, which can be used to guide early stopping and hyperparameter tuning. As a further theoretical implication, we show that despite model misspecification, the model learned by gradient descent retains the structure of a single-index function with an effective signal determined by a linear combination of the true signal and the initialization.
△ Less
Submitted 7 May, 2025;
originally announced May 2025.
-
Global $C^{1,α}$ regularity for Monge-Ampère equations on planar convex domains
Authors:
Qing Han,
Jiakun Liu,
Yang Zhou
Abstract:
In this paper, we establish the global Hölder gradient estimate for solutions to the Dirichlet problem of the Monge-Ampère equation $\det D^2u = f$ on strictly convex but not uniformly convex domain $Ω$.
In this paper, we establish the global Hölder gradient estimate for solutions to the Dirichlet problem of the Monge-Ampère equation $\det D^2u = f$ on strictly convex but not uniformly convex domain $Ω$.
△ Less
Submitted 28 January, 2025;
originally announced January 2025.
-
The Mass-Angular Momentum Inequality for Multiple Black Holes
Authors:
Qing Han,
Marcus Khuri,
Gilbert Weinstein,
Jingang Xiong
Abstract:
This is the second in a series of two papers to establish the conjectured mass-angular momentum inequality for multiple black holes, modulo the extreme black hole 'no hair theorem'. More precisely it is shown that either there is a counterexample to black hole uniqueness, in the form of a regular axisymmetric stationary vacuum spacetime with an asymptotically flat end and multiple degenerate horiz…
▽ More
This is the second in a series of two papers to establish the conjectured mass-angular momentum inequality for multiple black holes, modulo the extreme black hole 'no hair theorem'. More precisely it is shown that either there is a counterexample to black hole uniqueness, in the form of a regular axisymmetric stationary vacuum spacetime with an asymptotically flat end and multiple degenerate horizons which is 'ADM minimizing', or the following statement holds. Complete, simply connected, maximal initial data sets for the Einstein equations with multiple ends that are either asymptotically flat or asymptotically cylindrical, admit an ADM mass lower bound given by the square root of total angular momentum, under the assumption of nonnegative energy density and axisymmetry. Moreover, equality is achieved in the mass lower bound only for a constant time slice of an extreme Kerr spacetime. The proof is based on a novel flow of singular harmonic maps with hyperbolic plane target, under which the renormalized harmonic map energy is monotonically nonincreasing. Relevant properties of the flow are achieved through a refined asymptotic analysis of solutions to the harmonic map equations and their linearization.
△ Less
Submitted 25 January, 2025;
originally announced January 2025.
-
Global Schauder Regularity and Convergence for Uniformly Degenerate Parabolic Equations
Authors:
Qing Han,
Jiongduo Xie
Abstract:
In this paper, we study the global Hölder regularity of solutions to uniformly degenerate parabolic equations. We also study the convergence of solutions as time goes to infinity under extra assumptions on the characteristic exponents of the limit uniformly degenerate elliptic equations.
In this paper, we study the global Hölder regularity of solutions to uniformly degenerate parabolic equations. We also study the convergence of solutions as time goes to infinity under extra assumptions on the characteristic exponents of the limit uniformly degenerate elliptic equations.
△ Less
Submitted 13 January, 2025;
originally announced January 2025.
-
Solutions of the Special Lagrangian Equation near Infinity
Authors:
Qing Han,
Ilya Marchenko
Abstract:
Solutions to special Lagrangian equations near infinity, with supercritical phases or with semiconvexity on solutions, are known to be asymptotic to quadratic polynomials for dimension $n\ge 3$, with an extra logarithmic term for $n=2$. Via modified Kelvin transforms, we characterize remainders in the asymptotic expansions by a single function near the origin. Such a function is smooth in even dim…
▽ More
Solutions to special Lagrangian equations near infinity, with supercritical phases or with semiconvexity on solutions, are known to be asymptotic to quadratic polynomials for dimension $n\ge 3$, with an extra logarithmic term for $n=2$. Via modified Kelvin transforms, we characterize remainders in the asymptotic expansions by a single function near the origin. Such a function is smooth in even dimension, but only $C^{n-1,α}$ in odd dimension $n$, for any $α\in (0,1)$.
△ Less
Submitted 7 January, 2025;
originally announced January 2025.
-
Gradient descent inference in empirical risk minimization
Authors:
Qiyang Han,
Xiaocong Xu
Abstract:
Gradient descent is one of the most widely used iterative algorithms in modern statistical learning. However, its precise algorithmic dynamics in high-dimensional settings remain only partially understood, which has therefore limited its broader potential for statistical inference applications.
This paper provides a precise, non-asymptotic distributional characterization of gradient descent iter…
▽ More
Gradient descent is one of the most widely used iterative algorithms in modern statistical learning. However, its precise algorithmic dynamics in high-dimensional settings remain only partially understood, which has therefore limited its broader potential for statistical inference applications.
This paper provides a precise, non-asymptotic distributional characterization of gradient descent iterates in a broad class of empirical risk minimization problems, in the so-called mean-field regime where the sample size is proportional to the signal dimension. Our non-asymptotic state evolution theory holds for both general non-convex loss functions and non-Gaussian data, and reveals the central role of two Onsager correction matrices that precisely characterize the non-trivial dependence among all gradient descent iterates in the mean-field regime.
Although the Onsager correction matrices are typically analytically intractable, our state evolution theory facilitates a generic gradient descent inference algorithm that consistently estimates these matrices across a broad class of models. Leveraging this algorithm, we show that the state evolution can be inverted to construct (i) data-driven estimators for the generalization error of gradient descent iterates and (ii) debiased gradient descent iterates for inference of the unknown signal. Detailed applications to two canonical models--linear regression and (generalized) logistic regression--are worked out to illustrate model-specific features of our general theory and inference methods.
△ Less
Submitted 7 January, 2025; v1 submitted 12 December, 2024;
originally announced December 2024.
-
UCB algorithms for multi-armed bandits: Precise regret and adaptive inference
Authors:
Qiyang Han,
Koulik Khamaru,
Cun-Hui Zhang
Abstract:
Upper Confidence Bound (UCB) algorithms are a widely-used class of sequential algorithms for the $K$-armed bandit problem. Despite extensive research over the past decades aimed at understanding their asymptotic and (near) minimax optimality properties, a precise understanding of their regret behavior remains elusive. This gap has not only hindered the evaluation of their actual algorithmic effici…
▽ More
Upper Confidence Bound (UCB) algorithms are a widely-used class of sequential algorithms for the $K$-armed bandit problem. Despite extensive research over the past decades aimed at understanding their asymptotic and (near) minimax optimality properties, a precise understanding of their regret behavior remains elusive. This gap has not only hindered the evaluation of their actual algorithmic efficiency, but also limited further developments in statistical inference in sequential data collection.
This paper bridges these two fundamental aspects--precise regret analysis and adaptive statistical inference--through a deterministic characterization of the number of arm pulls for an UCB index algorithm [Lai87, Agr95, ACBF02]. Our resulting precise regret formula not only accurately captures the actual behavior of the UCB algorithm for finite time horizons and individual problem instances, but also provides significant new insights into the regimes in which the existing theory remains informative. In particular, we show that the classical Lai-Robbins regret formula is exact if and only if the sub-optimality gaps exceed the order $σ\sqrt{K\log T/T}$. We also show that its maximal regret deviates from the minimax regret by a logarithmic factor, and therefore settling its strict minimax optimality in the negative.
The deterministic characterization of the number of arm pulls for the UCB algorithm also has major implications in adaptive statistical inference. Building on the seminal work of [Lai82], we show that the UCB algorithm satisfies certain stability properties that lead to quantitative central limit theorems in two settings including the empirical means of unknown rewards in the bandit setting. These results have an important practical implication: conventional confidence sets designed for i.i.d. data remain valid even when data are collected sequentially.
△ Less
Submitted 8 December, 2024;
originally announced December 2024.
-
Uniformly Degenerate Elliptic Equations with Varying Characteristic Exponents
Authors:
Qing Han,
Jiongduo Xie
Abstract:
In this paper, we study the regularity of solutions to uniformly degenerate elliptic equations in bounded domains under the condition that the characteristic polynomials have varying characteristic exponents.
In this paper, we study the regularity of solutions to uniformly degenerate elliptic equations in bounded domains under the condition that the characteristic polynomials have varying characteristic exponents.
△ Less
Submitted 25 November, 2024;
originally announced November 2024.
-
Optimal Boundary Regularity for Uniformly Degenerate Elliptic Equations
Authors:
Qing Han,
Jiongduo Xie
Abstract:
In this survey paper, we study the optimal regularity of solutions to uniformly degenerate elliptic equations in bounded domains and establish the Hölder continuity of solutions and their derivatives up to the boundary.
In this survey paper, we study the optimal regularity of solutions to uniformly degenerate elliptic equations in bounded domains and establish the Hölder continuity of solutions and their derivatives up to the boundary.
△ Less
Submitted 25 November, 2024;
originally announced November 2024.
-
SymILO: A Symmetry-Aware Learning Framework for Integer Linear Optimization
Authors:
Qian Chen,
Tianjian Zhang,
Linxin Yang,
Qingyu Han,
Akang Wang,
Ruoyu Sun,
Xiaodong Luo,
Tsung-Hui Chang
Abstract:
Integer linear programs (ILPs) are commonly employed to model diverse practical problems such as scheduling and planning. Recently, machine learning techniques have been utilized to solve ILPs. A straightforward idea is to train a model via supervised learning, with an ILP as the input and an optimal solution as the label. An ILP is symmetric if its variables can be permuted without changing the p…
▽ More
Integer linear programs (ILPs) are commonly employed to model diverse practical problems such as scheduling and planning. Recently, machine learning techniques have been utilized to solve ILPs. A straightforward idea is to train a model via supervised learning, with an ILP as the input and an optimal solution as the label. An ILP is symmetric if its variables can be permuted without changing the problem structure, resulting in numerous equivalent and optimal solutions. Randomly selecting an optimal solution as the label can introduce variability in the training data, which may hinder the model from learning stable patterns. In this work, we incorporate the intrinsic symmetry of ILPs and propose a novel training framework called SymILO. Specifically, we modify the learning task by introducing solution permutation along with neural network weights as learnable parameters and then design an alternating algorithm to jointly optimize the loss function. We conduct extensive experiments on ILPs involving different symmetries and the computational results demonstrate that our symmetry-aware approach significantly outperforms three existing methods -- achieving $50.3\%$, $66.5\%$, and $45.4\%$ average improvements, respectively.
△ Less
Submitted 6 January, 2025; v1 submitted 29 September, 2024;
originally announced September 2024.
-
A novel second order scheme with one step for forward backward stochastic differential equations
Authors:
Qiang Han,
Shihao Lan,
Quanxin Zhu
Abstract:
In this paper, we present a novel explicit second order scheme with one step for solving the forward backward stochastic differential equations, with the Crank-Nicolson method as a specific instance within our proposed framework. We first present a rigorous stability result, followed by precise error estimates that confirm the proposed novel scheme achieves second-order convergence. The theoretica…
▽ More
In this paper, we present a novel explicit second order scheme with one step for solving the forward backward stochastic differential equations, with the Crank-Nicolson method as a specific instance within our proposed framework. We first present a rigorous stability result, followed by precise error estimates that confirm the proposed novel scheme achieves second-order convergence. The theoretical results for the proposed methods are supported by numerical experiments.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
Accelerating Low-Rank Factorization-Based Semidefinite Programming Algorithms on GPU
Authors:
Qiushi Han,
Zhenwei Lin,
Hanwen Liu,
Caihua Chen,
Qi Deng,
Dongdong Ge,
Yinyu Ye
Abstract:
In this paper, we address a long-standing challenge: how to achieve both efficiency and scalability in solving semidefinite programming problems. We propose breakthrough acceleration techniques for a wide range of low-rank factorization-based first-order methods using GPUs, making the computation much more efficient and scalable. To illustrate the idea and effectiveness of our approach, we use the…
▽ More
In this paper, we address a long-standing challenge: how to achieve both efficiency and scalability in solving semidefinite programming problems. We propose breakthrough acceleration techniques for a wide range of low-rank factorization-based first-order methods using GPUs, making the computation much more efficient and scalable. To illustrate the idea and effectiveness of our approach, we use the low-rank factorization-based SDP solver, LoRADS, as an example, which involves both the classic Burer-Monterio method and a novel splitting scheme with a starting logarithmic rank. Our numerical results demonstrate that the accelerated GPU version of LoRADS, cuLoRADS, can solve huge-scale semidefinite programming problems with remarkable efficiency. By effectively leveraging GPU computational power, cuLoRADS exhibits outstanding performance. Specifically, it can solve a set of MaxCut problems with $10^7 \times 10^7$ matrix variables in 10 seconds to 1 minute each on an NVIDIA H100 GPU with 80GB memory, whereas previous solvers demonstrated the capability of handling problems of this scale, required at least dozens of hours per problem on CPUs. Additionally, cuLoRADS shows exceptional scalability by solving 1) a MaxCut problem with a $170 \text{ million} \times 170 \text{ million}$ matrix variable and 2) a Matrix Completion problem with a 20 million $\times$ 20 million matrix variable and approximately 200 million constraints, both in a matter of minutes.
△ Less
Submitted 23 August, 2024; v1 submitted 21 July, 2024;
originally announced July 2024.
-
Entrywise dynamics and universality of general first order methods
Authors:
Qiyang Han
Abstract:
General first order methods (GFOMs), including various gradient descent and AMP algorithms, constitute a broad class of iterative algorithms in modern statistical learning problems. Some GFOMs also serve as constructive proof devices, iteratively characterizing the empirical distributions of statistical estimators in the large system limits for any fixed number of iterations.
This paper develops…
▽ More
General first order methods (GFOMs), including various gradient descent and AMP algorithms, constitute a broad class of iterative algorithms in modern statistical learning problems. Some GFOMs also serve as constructive proof devices, iteratively characterizing the empirical distributions of statistical estimators in the large system limits for any fixed number of iterations.
This paper develops a non-asymptotic, entrywise characterization for a general class of GFOMs. Our characterizations capture the precise entrywise behavior of the GFOMs, and hold universally across a broad class of heterogeneous random matrix models. As a corollary, we provide the first non-asymptotic description of the empirical distributions of the GFOMs beyond Gaussian ensembles.
We demonstrate the utility of these general results in two applications. In the first application, we prove entrywise universality for regularized least squares estimators in the linear model, by controlling the entrywise error relative to a suitably constructed GFOM. This algorithmic proof method also leads to systematically improved averaged universality results for regularized regression estimators in the linear model, and resolves the universality conjecture for (regularized) MLEs in logistic regression. In the second application, we obtain entrywise Gaussian approximations for a class of gradient descent algorithms. Our approach provides non-asymptotic state evolution for the bias and variance of the algorithm along the iteration path, applicable for non-convex loss functions.
The proof relies on a new recursive leave-k-out method that provides almost delocalization for the GFOMs and their derivatives. Crucially, our method ensures entrywise universality for up to poly-logarithmic many iterations, which facilitates effective $\ell_2/\ell_\infty$ control between certain GFOMs and statistical estimators in applications.
△ Less
Submitted 29 May, 2025; v1 submitted 27 June, 2024;
originally announced June 2024.
-
A Low-Rank ADMM Splitting Approach for Semidefinite Programming
Authors:
Qiushi Han,
Chenxi Li,
Zhenwei Lin,
Caihua Chen,
Qi Deng,
Dongdong Ge,
Huikang Liu,
Yinyu Ye
Abstract:
We introduce a new first-order method for solving general semidefinite programming problems, based on the alternating direction method of multipliers (ADMM) and a matrix-splitting technique. Our algorithm has an advantage over the Burer-Monteiro approach as it only involves much easier quadratically regularized subproblems in each iteration. For a linear objective, the subproblems are well-conditi…
▽ More
We introduce a new first-order method for solving general semidefinite programming problems, based on the alternating direction method of multipliers (ADMM) and a matrix-splitting technique. Our algorithm has an advantage over the Burer-Monteiro approach as it only involves much easier quadratically regularized subproblems in each iteration. For a linear objective, the subproblems are well-conditioned quadratic programs that can be efficiently solved by the standard conjugate gradient method. We show that the ADMM algorithm achieves sublinear or linear convergence rates to the KKT solutions under different conditions. Building on this theoretical development, we present LoRADS, a new solver for linear SDP based on the Low-Rank ADMM Splitting approach. LoRADS incorporates several strategies that significantly increase its efficiency. Firstly, it initiates with a warm-start phase that uses the Burer-Monteiro approach. Moreover, motivated by the SDP low-rank theory [So et al. 2008], LoRADS chooses an initial rank of logarithmic order and then employs a dynamic approach to increase the rank. Numerical experiments indicate that LoRADS exhibits promising performance on various SDP problems. A noteworthy achievement of LoRADS is its successful solving of a matrix completion problem with $15,694,167$ constraints and a matrix variable of size $40,000 \times 40,000$ in $351$ seconds.
△ Less
Submitted 26 July, 2024; v1 submitted 14 March, 2024;
originally announced March 2024.
-
Borel lemma: geometric progression and zeta-functions
Authors:
Qi Han,
Jingbo Liu,
Nadeem Malik
Abstract:
In the proof of the classical Borel lemma \cite{eB} by Hayman \cite{wkH}, each continuous increasing function $T(r)\geq1$ satisfies $T\bigl(r+\frac{1}{T(r)}\bigr)<2T(r)$ outside a possible exceptional set of linear measure $2$. We note in this work $T(r)$ satisfies a sharper inequality $T\bigl(r+\frac{1}{T(r)}\bigr)<\bigl(\sqrt{T(r)}+1\bigr)^2\leq2T(r)$, if $T(r)\geq\bigl(\sqrt{2}+1\bigr)^2$, outs…
▽ More
In the proof of the classical Borel lemma \cite{eB} by Hayman \cite{wkH}, each continuous increasing function $T(r)\geq1$ satisfies $T\bigl(r+\frac{1}{T(r)}\bigr)<2T(r)$ outside a possible exceptional set of linear measure $2$. We note in this work $T(r)$ satisfies a sharper inequality $T\bigl(r+\frac{1}{T(r)}\bigr)<\bigl(\sqrt{T(r)}+1\bigr)^2\leq2T(r)$, if $T(r)\geq\bigl(\sqrt{2}+1\bigr)^2$, outside a possible exceptional set of linear measure $ζ\bigl(2,\sqrt{2}+1\bigr)\leq0.52<2$ for the Hurwitz zeta-function $ζ(s,a)$. This result is worth noting, provided the set of $r$ in which $1\leq T(r)<\bigl(\sqrt{2}+1\bigr)^2$ has linear measure less than $1.48$. Focusing exclusively on meromorphic functions of infinite order, we utilize Hinkkanen's Second Main Theorem \cite{aH}, draw comparisons with Borel \cite{eB}, Nevanlinna \cite{rN}, and Hayman \cite{wkH}, and finally generalize Fernández Árias \cite{aFA1}.
△ Less
Submitted 22 May, 2025; v1 submitted 25 January, 2024;
originally announced January 2024.
-
Efficient simulation of mixed boundary value problems and conformal mappings
Authors:
Qiansheng Han,
Antti Rasila,
Tommi Sottinen
Abstract:
In this paper, we present a stochastic method for the simulation of Laplace's equation with a mixed boundary condition in planar domains that are polygonal or bounded by circular arcs. We call this method the Reflected Walk-on-Spheres algorithm. The method combines a traditional Walk-on-Spheres algorithm with use of reflections at the Neumann boundaries. We apply our algorithm to simulate numerica…
▽ More
In this paper, we present a stochastic method for the simulation of Laplace's equation with a mixed boundary condition in planar domains that are polygonal or bounded by circular arcs. We call this method the Reflected Walk-on-Spheres algorithm. The method combines a traditional Walk-on-Spheres algorithm with use of reflections at the Neumann boundaries. We apply our algorithm to simulate numerical conformal mappings from certain quadrilaterals to the corresponding canonical domains, and to compute their conformal moduli. Finally, we give examples of the method on three dimensional polyhedral domains, and use it to simulate the heat flow on an L-shaped insulated polyhedron.
△ Less
Submitted 9 July, 2024; v1 submitted 23 December, 2023;
originally announced December 2023.
-
A leave-one-out approach to approximate message passing
Authors:
Zhigang Bao,
Qiyang Han,
Xiaocong Xu
Abstract:
Approximate message passing (AMP) has emerged both as a popular class of iterative algorithms and as a powerful analytic tool in a wide range of statistical estimation problems and statistical physics models. A well established line of AMP theory proves Gaussian approximations for the empirical distributions of the AMP iterate in the high dimensional limit, under the GOE random matrix model and it…
▽ More
Approximate message passing (AMP) has emerged both as a popular class of iterative algorithms and as a powerful analytic tool in a wide range of statistical estimation problems and statistical physics models. A well established line of AMP theory proves Gaussian approximations for the empirical distributions of the AMP iterate in the high dimensional limit, under the GOE random matrix model and its variants.
This paper provides a non-asymptotic, leave-one-out representation for the AMP iterate that holds under a broad class of Gaussian random matrix models with general variance profiles. In contrast to the typical AMP theory that describes the empirical distributions of the AMP iterate via a low dimensional state evolution, our leave-one-out representation yields an intrinsically high dimensional state evolution formula which provides non-asymptotic characterizations for the possibly heterogeneous, entrywise behavior of the AMP iterate under the prescribed random matrix models.
To exemplify some distinct features of our AMP theory in applications, we analyze, in the context of regularized linear estimation, the precise stochastic behavior of the Ridge estimator for independent and non-identically distributed observations whose covariates exhibit general variance profiles. We find that its finite-sample distribution is characterized via a weighted Ridge estimator in a heterogeneous Gaussian sequence model. Notably, in contrast to the i.i.d. sampling scenario, the effective noise and regularization are now full dimensional vectors determined via a high dimensional system of equations.
Our leave-one-out method of proof differs significantly from the widely adopted conditioning approach for rotational invariant ensembles, and relies instead on an inductive method that utilizes almost solely integration-by-parts and concentration techniques.
△ Less
Submitted 25 December, 2023; v1 submitted 10 December, 2023;
originally announced December 2023.
-
Blow-up sets of Ricci curvatures of complete conformal metrics
Authors:
Qing Han,
Weiming Shen,
Yue Wang
Abstract:
A version of the singular Yamabe problem in smooth domains in a closed manifold yields complete conformal metrics with negative constant scalar curvatures. In this paper, we study the blow-up phenomena of Ricci curvatures of these metrics on domains whose boundary is close to a certain limit set of a lower dimension. We will characterize the blow-up set according to the Yamabe invariant of the und…
▽ More
A version of the singular Yamabe problem in smooth domains in a closed manifold yields complete conformal metrics with negative constant scalar curvatures. In this paper, we study the blow-up phenomena of Ricci curvatures of these metrics on domains whose boundary is close to a certain limit set of a lower dimension. We will characterize the blow-up set according to the Yamabe invariant of the underlying manifold. In particular, we will prove that all points in the lower dimension part of the limit set belong to the blow-up set on manifolds not conformally equivalent to the standard sphere and that all but one point in the lower dimension part of the limit set belong to the blow-up set on manifolds conformally equivalent to the standard sphere. In certain cases, the blow-up set can be the entire manifold. We will demonstrate by examples that these results are optimal.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Isometric immersions and applications
Authors:
Qing Han,
Marta Lewicka
Abstract:
We provide an introduction to the old-standing problem of isometric immersions. We combine a historical account of its multifaceted advances, which have fascinated geometers and analysts alike, with some of the applications in the mathematical physics and mathematical materials science, old and new.
We provide an introduction to the old-standing problem of isometric immersions. We combine a historical account of its multifaceted advances, which have fascinated geometers and analysts alike, with some of the applications in the mathematical physics and mathematical materials science, old and new.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
On partial differential equations of Waring's-problem form in several complex variables
Authors:
Qi Han
Abstract:
In this paper, we first consider the pseudoprimeness of meromorphic solutions $u$ to a family of partial differential equations (PDEs) $H(u_{z_1},u_{z_2},\ldots,u_{z_n})=P(u)$ of Waring's-problem form, where $H(z_1,z_2,\ldots,z_n)$ is a nontrivial homogenous polynomial of degree $\ell$ in $\mathbf{C}^n$ and $P(w)$ is a polynomial of degree $\hbar$ in $\mathbf{C}$ with all zeros distinct. Then, we…
▽ More
In this paper, we first consider the pseudoprimeness of meromorphic solutions $u$ to a family of partial differential equations (PDEs) $H(u_{z_1},u_{z_2},\ldots,u_{z_n})=P(u)$ of Waring's-problem form, where $H(z_1,z_2,\ldots,z_n)$ is a nontrivial homogenous polynomial of degree $\ell$ in $\mathbf{C}^n$ and $P(w)$ is a polynomial of degree $\hbar$ in $\mathbf{C}$ with all zeros distinct. Then, we study when these PDEs can admit entire solutions in $\mathbf{C}^n$ and further find these solutions for important cases including particularly $u^\ell_{z_1}+u^\ell_{z_2}+\cdots+u^\ell_{z_n}=u^\hbar$, which are (often said to be) PDEs of super-Fermat form if $\hbar=0,\ell$ and an eikonal equation if $\ell=2$ and $\hbar=0$.
△ Less
Submitted 27 September, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
The Isometric Immersion of Negatively Curved Surfaces with Finite Total Curvature
Authors:
Wentao Cao,
Qing Han,
Feimin Huang,
Dehua Wang
Abstract:
In this paper, we study the smooth isometric immersion of a complete, simply connected surface with a negative Gauss curvature into the three-dimensional Euclidean space. A fundamental and longstanding problem is to find a sufficient condition for a complete negatively curved surface to be isometrically embedded in R^3 [67]. It can be described as an initial and/or boundary value problem for a hyp…
▽ More
In this paper, we study the smooth isometric immersion of a complete, simply connected surface with a negative Gauss curvature into the three-dimensional Euclidean space. A fundamental and longstanding problem is to find a sufficient condition for a complete negatively curved surface to be isometrically embedded in R^3 [67]. It can be described as an initial and/or boundary value problem for a hyperbolic system of nonlinear partial differential equations derived from the Gauss-Codazzi equations. The mathematical theory associated with this system is largely incomplete. The global smooth isometric immersion has been proven in the literature when the Gauss curvature decays rapidly and monotonically. However, when the Gauss curvature oscillates or decays slowly, the problem becomes much more challenging and little is known. In our paper, we find a sufficient condition, consisting of a finite total Gauss curvature and appropriate oscillations of the Gauss curvature. Under this condition we prove the global existence of a smooth solution to the Gauss-Codazzi system, achieving a global smooth isometric immersion of the surface into R^3. Furthermore, we show that the finite total Gauss curvature is necessary for the existence of a solution in a special case of the Gauss-Codazzi system. New techniques are developed to overcome the difficulties posed by the slow decay and oscillations of the Gauss curvature. By observing that certain combinations of the Riemann invariants decay faster than others, we reformulate the Gauss-Codazzi equations as a symmetric hyperbolic system and uncover a crucial structure of partial dampings. These partial dampings, along with the finite total curvature and appropriate oscillations of the Gauss curvature, enable us to obtain a global smooth solution through delicate analysis, and consequently establish a global smooth isometric immersion of such surfaces.
△ Less
Submitted 22 September, 2024; v1 submitted 5 August, 2023;
originally announced August 2023.
-
The distribution of Ridgeless least squares interpolators
Authors:
Qiyang Han,
Xiaocong Xu
Abstract:
The Ridgeless minimum $\ell_2$-norm interpolator in overparametrized linear regression has attracted considerable attention in recent years. While it seems to defy the conventional wisdom that overfitting leads to poor prediction, recent research reveals that its norm minimizing property induces an `implicit regularization' that helps prediction in spite of interpolation. This renders the Ridgeles…
▽ More
The Ridgeless minimum $\ell_2$-norm interpolator in overparametrized linear regression has attracted considerable attention in recent years. While it seems to defy the conventional wisdom that overfitting leads to poor prediction, recent research reveals that its norm minimizing property induces an `implicit regularization' that helps prediction in spite of interpolation. This renders the Ridgeless interpolator a theoretically tractable proxy that offers useful insights into the mechanisms of modern machine learning methods.
This paper takes a different perspective that aims at understanding the precise stochastic behavior of the Ridgeless interpolator as a statistical estimator. Specifically, we characterize the distribution of the Ridgeless interpolator in high dimensions, in terms of a Ridge estimator in an associated Gaussian sequence model with positive regularization, which plays the role of the prescribed implicit regularization in the context of prediction risk. Our distributional characterizations hold for general random designs and extend uniformly to positively regularized Ridge estimators. As a demonstration of the analytic power of these characterizations, we derive approximate formulae for a general class of weighted $\ell_q$ risks for Ridge(less) estimators that were previously available only for $\ell_2$. Our theory also provides certain further conceptual reconciliation with the conventional wisdom: given any data covariance, a certain amount of regularization in Ridge regression remains beneficial for `most' signals across various statistical tasks including prediction, estimation and inference, as long as the noise level is non-trivial. Surprisingly, optimal tuning can be achieved simultaneously for all the designated statistical tasks by a single generalized or $k$-fold cross-validation scheme, despite being designed specifically for tuning prediction risk.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Railway Virtual Coupling: A Survey of Emerging Control Techniques
Authors:
Qing Wu,
Xiaohua Ge,
Qing-Long Han,
Yafei Liu
Abstract:
This paper provides a systematic review of emerging control techniques used for railway Virtual Coupling (VC) studies. Train motion models are first reviewed, including model formulations and the force elements involved. Control objectives and typical design constraints are then elaborated. Next, the existing VC control techniques are surveyed and classified into five groups: consensus-based contr…
▽ More
This paper provides a systematic review of emerging control techniques used for railway Virtual Coupling (VC) studies. Train motion models are first reviewed, including model formulations and the force elements involved. Control objectives and typical design constraints are then elaborated. Next, the existing VC control techniques are surveyed and classified into five groups: consensus-based control, model prediction control, sliding mode control, machine learning-based control, and constraints-following control. Their advantages and disadvantages for VC applications are also discussed in detail. Furthermore, several future studies for achieving better controller development and implementation, respectively, are presented. The purposes of this survey are to help researchers to achieve a better systematic understanding regarding VC control, to spark more research into VC and to further speed-up the realization of this emerging technology in railway and other relevant fields such as road vehicles.
△ Less
Submitted 19 February, 2023;
originally announced February 2023.
-
A GNN-Guided Predict-and-Search Framework for Mixed-Integer Linear Programming
Authors:
Qingyu Han,
Linxin Yang,
Qian Chen,
Xiang Zhou,
Dong Zhang,
Akang Wang,
Ruoyu Sun,
Xiaodong Luo
Abstract:
Mixed-integer linear programming (MILP) is widely employed for modeling combinatorial optimization problems. In practice, similar MILP instances with only coefficient variations are routinely solved, and machine learning (ML) algorithms are capable of capturing common patterns across these MILP instances. In this work, we combine ML with optimization and propose a novel predict-and-search framewor…
▽ More
Mixed-integer linear programming (MILP) is widely employed for modeling combinatorial optimization problems. In practice, similar MILP instances with only coefficient variations are routinely solved, and machine learning (ML) algorithms are capable of capturing common patterns across these MILP instances. In this work, we combine ML with optimization and propose a novel predict-and-search framework for efficiently identifying high-quality feasible solutions. Specifically, we first utilize graph neural networks to predict the marginal probability of each variable, and then search for the best feasible solution within a properly defined ball around the predicted solution. We conduct extensive experiments on public datasets, and computational results demonstrate that our proposed framework achieves 51.1% and 9.9% performance improvements to MILP solvers SCIP and Gurobi on primal gaps, respectively.
△ Less
Submitted 6 March, 2023; v1 submitted 11 February, 2023;
originally announced February 2023.
-
Asymptotic Analysis of Harmonic Maps With Prescribed Singularities
Authors:
Qing Han,
Marcus Khuri,
Gilbert Weinstein,
Jingang Xiong
Abstract:
This is the first in a series of two papers to establish the mass-angular momentum inequality for multiple black holes. We study singular harmonic maps from domains of 3-dimensional Euclidean space to the hyperbolic plane having bounded hyperbolic distance to extreme Kerr harmonic maps. We prove that every such harmonic map admits a unique tangent harmonic map at the extreme black hole horizon. Th…
▽ More
This is the first in a series of two papers to establish the mass-angular momentum inequality for multiple black holes. We study singular harmonic maps from domains of 3-dimensional Euclidean space to the hyperbolic plane having bounded hyperbolic distance to extreme Kerr harmonic maps. We prove that every such harmonic map admits a unique tangent harmonic map at the extreme black hole horizon. The possible tangent maps are classified and shown to be shifted `extreme Kerr' geodesics in the hyperbolic plane that depend on two parameters, one determined by angular momentum and another by conical singularities. In addition, rates of convergence to the tangent map are established. Similarly, expansions in the asymptotically flat end are presented. These results, together with those of Li-Tian [24, 25] and Weinstein [35,36], provide a complete regularity theory for harmonic maps from $\mathbb R^3\setminus z\text{-axis}$ to $\mathbb H^2$ with these prescribed singularities. The analysis is additionally utilized to prove existence of the so called near horizon limit, and to compute the associated near horizon geometries of extreme black holes.
△ Less
Submitted 30 August, 2024; v1 submitted 30 December, 2022;
originally announced December 2022.
-
Online Statistical Inference in Decision-Making with Matrix Context
Authors:
Qiyu Han,
Will Wei Sun,
Yichen Zhang
Abstract:
The study of online decision-making problems that leverage contextual information has drawn notable attention due to their significant applications in fields ranging from healthcare to autonomous systems. In modern applications, contextual information can be rich and is often represented as a matrix. Moreover, while existing online decision algorithms mainly focus on reward maximization, less atte…
▽ More
The study of online decision-making problems that leverage contextual information has drawn notable attention due to their significant applications in fields ranging from healthcare to autonomous systems. In modern applications, contextual information can be rich and is often represented as a matrix. Moreover, while existing online decision algorithms mainly focus on reward maximization, less attention has been devoted to statistical inference. To address these gaps, in this work, we consider an online decision-making problem with a matrix context where the true model parameters have a low-rank structure. We propose a fully online procedure to conduct statistical inference with adaptively collected data. The low-rank structure of the model parameter and the adaptive nature of the data collection process make this difficult: standard low-rank estimators are biased and cannot be obtained in a sequential manner while existing inference approaches in sequential decision-making algorithms fail to account for the low-rankness and are also biased. To overcome these challenges, we introduce a new online debiasing procedure to simultaneously handle both sources of bias. Our inference framework encompasses both parameter inference and optimal policy value inference. In theory, we establish the asymptotic normality of the proposed online debiased estimators and prove the validity of the constructed confidence intervals for both inference tasks. Our inference results are built upon a newly developed low-rank stochastic gradient descent estimator and its convergence result, which are also of independent interest.
△ Less
Submitted 18 April, 2025; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Gaussian random projections of convex cones: approximate kinematic formulae and applications
Authors:
Qiyang Han,
Huachen Ren
Abstract:
Understanding the stochastic behavior of random projections of geometric sets constitutes a fundamental problem in high dimension probability that finds wide applications in diverse fields. This paper provides a kinematic description for the behavior of Gaussian random projections of closed convex cones, in analogy to that of randomly rotated cones studied in [ALMT14]. Formally, let $K$ be a close…
▽ More
Understanding the stochastic behavior of random projections of geometric sets constitutes a fundamental problem in high dimension probability that finds wide applications in diverse fields. This paper provides a kinematic description for the behavior of Gaussian random projections of closed convex cones, in analogy to that of randomly rotated cones studied in [ALMT14]. Formally, let $K$ be a closed convex cone in $\mathbb{R}^n$, and $G\in \mathbb{R}^{m\times n}$ be a Gaussian matrix with i.i.d. $\mathcal{N}(0,1)$ entries. We show that $GK\equiv \{Gμ: μ\in K\}$ behaves like a randomly rotated cone in $\mathbb{R}^m$ with statistical dimension $\min\{δ(K),m\}$, in the following kinematic sense: for any fixed closed convex cone $L$ in $\mathbb{R}^m$, \begin{align*} &δ(L)+δ(K)\ll m\, \Rightarrow\, L\cap GK = \{0\} \hbox{ with high probability},\\ &δ(L)+δ(K)\gg m\, \Rightarrow\, L\cap GK \neq \{0\} \hbox{ with high probability}. \end{align*} A similar kinematic description is obtained for $G^{-1}L\equiv \{μ\in \mathbb{R}^n: Gμ\in L\}$.
The practical usefulness and broad applicability of the prescribed approximate kinematic formulae are demonstrated in a number of distinct problems arising from statistical learning, mathematical programming and asymptotic geometric analysis. In particular, we prove (i) new phase transitions of the existence of cone constrained maximum likelihood estimators in logistic regression, (ii) new phase transitions of the cost optimum of deterministic conic programs with random constraints, and (iii) a local version of the Gaussian Dvoretzky-Milman theorem that describes almost deterministic, low-dimensional behaviors of subspace sections of randomly projected convex sets.
△ Less
Submitted 11 December, 2022;
originally announced December 2022.
-
Continuity of weak solutions to an elliptic problem on $p$-fractional Laplacian
Authors:
Wei Chen,
Qi Han,
Guoping Zhan
Abstract:
In this paper we study an elliptic variational problem regarding the $p$-fractional Laplacian in $\mathbb{R}^N$ on the basis of recent result \cite{Ha1}, which generalizes the nice work \cite{AT,AP,XZR1}, and then give some sufficient conditions under which some weak solutions to the above elliptic variational problem are continuous in $\mathbb{R}^N$. In the final appendix we correct the proofs of…
▽ More
In this paper we study an elliptic variational problem regarding the $p$-fractional Laplacian in $\mathbb{R}^N$ on the basis of recent result \cite{Ha1}, which generalizes the nice work \cite{AT,AP,XZR1}, and then give some sufficient conditions under which some weak solutions to the above elliptic variational problem are continuous in $\mathbb{R}^N$. In the final appendix we correct the proofs of both \cite[Lemma 10]{PXZ1} and \cite[Lemma A.6]{PXZ} for $1<p<2$.
△ Less
Submitted 8 August, 2022;
originally announced August 2022.
-
Exact bounds for some quadratic empirical processes with applications
Authors:
Qiyang Han
Abstract:
Let $Z_1,\ldots,Z_n$ be i.i.d. isotropic random vectors in $\mathbb{R}^p$, and $T \subset \mathbb{R}^p$ be a compact set. A classical line of empirical process theory characterizes the size of the suprema of the quadratic process \begin{align*} \sup_{t \in T} \bigg| \frac{1}{n}\sum_{i=1}^n \langle Z_i,t \rangle^2-\lVert t \rVert^2 \bigg|, \end{align*} via a single parameter known as the Gaussian w…
▽ More
Let $Z_1,\ldots,Z_n$ be i.i.d. isotropic random vectors in $\mathbb{R}^p$, and $T \subset \mathbb{R}^p$ be a compact set. A classical line of empirical process theory characterizes the size of the suprema of the quadratic process \begin{align*} \sup_{t \in T} \bigg| \frac{1}{n}\sum_{i=1}^n \langle Z_i,t \rangle^2-\lVert t \rVert^2 \bigg|, \end{align*} via a single parameter known as the Gaussian width of $T$.
This paper introduces an improved bound for the suprema of this quadratic process for standard Gaussian vectors $\{Z_i\}$ that can be exactly attained for certain choices of $T$, and is thus referred to as an exact bound. Our exact bound is expressed via a collection of (stochastic) Gaussian widths over spherical sections of $T$ that serves as a natural multi-scale analogue to the Gaussian width of $T$. Compared to the classical bounds for the quadratic process, our new bounds not only determine the optimal constants in the classical bounds that can be attained for some $T$, but also precisely capture certain subtle phase transitional behavior of the quadratic process beyond the reach of the classical bounds.
To illustrate the utility of our results, we obtain tight versions of the Gaussian Dvoretzky-Milman theorem for random projection, and the Koltchinskii-Lounici theorem for covariance estimation, both with optimal constants. Moreover, our bounds recover the celebrated BBP phase transitional behavior of the top eigenvalue of the sample covariance and its generalization to the sample covariance error.
The proof of our results exploits recently sharpened Gaussian comparison inequalities. The technical scope of our method of proof is further demonstrated in obtaining an exact bound for a two-sided Chevet inequality.
△ Less
Submitted 22 July, 2024; v1 submitted 27 July, 2022;
originally announced July 2022.
-
Universality of regularized regression estimators in high dimensions
Authors:
Qiyang Han,
Yandi Shen
Abstract:
The Convex Gaussian Min-Max Theorem (CGMT) has emerged as a prominent theoretical tool for analyzing the precise stochastic behavior of various statistical estimators in the so-called high dimensional proportional regime, where the sample size and the signal dimension are of the same order. However, a well recognized limitation of the existing CGMT machinery rests in its stringent requirement on t…
▽ More
The Convex Gaussian Min-Max Theorem (CGMT) has emerged as a prominent theoretical tool for analyzing the precise stochastic behavior of various statistical estimators in the so-called high dimensional proportional regime, where the sample size and the signal dimension are of the same order. However, a well recognized limitation of the existing CGMT machinery rests in its stringent requirement on the exact Gaussianity of the design matrix, therefore rendering the obtained precise high dimensional asymptotics largely a specific Gaussian theory in various important statistical models.
This paper provides a structural universality framework for a broad class of regularized regression estimators that is particularly compatible with the CGMT machinery. In particular, we show that with a good enough $\ell_\infty$ bound for the regression estimator $\hatμ_A$, any `structural property' that can be detected via the CGMT for $\hatμ_G$ (under a standard Gaussian design $G$) also holds for $\hatμ_A$ under a general design $A$ with independent entries. As a proof of concept, we demonstrate our new universality framework in three key examples of regularized regression estimators: the Ridge, Lasso and regularized robust regression estimators, where new universality properties of risk asymptotics and/or distributions of regression estimators and other related quantities are proved. As a major statistical implication of the Lasso universality results, we validate inference procedures using the degrees-of-freedom adjusted debiased Lasso under general design and error distributions. We also provide a counterexample, showing that universality properties for regularized regression estimators do not extend to general isotropic designs.
△ Less
Submitted 27 June, 2022; v1 submitted 16 June, 2022;
originally announced June 2022.
-
Studying the mixed transmission in a community with age heterogeneity: COVID-19 as a case study
Authors:
Xiaoying Wang,
Qing Han,
Jude Dzevela Kong
Abstract:
COVID-19 has been prevalent worldwide for about 2 years now and has brought unprecedented challenges to our society. Before vaccines were available, the main disease intervention strategies were non-pharmaceutical. Starting December 2020, in Ontario, Canada, vaccines were approved for administering to vulnerable individuals and gradually expanded to all individuals above the age of 12. As the vacc…
▽ More
COVID-19 has been prevalent worldwide for about 2 years now and has brought unprecedented challenges to our society. Before vaccines were available, the main disease intervention strategies were non-pharmaceutical. Starting December 2020, in Ontario, Canada, vaccines were approved for administering to vulnerable individuals and gradually expanded to all individuals above the age of 12. As the vaccine coverage reached a satisfactory level among the eligible population, normal social activities resumed and schools reopened starting September 2021. However, when schools reopen for in-person learning, children under the age of 12 are unvaccinated and are at higher risks of contracting the virus. We propose an age-stratified model based on the age and vaccine eligibility of the individuals. We fit our model to the data in Ontario, Canada and obtain a good fitting result. The results show that a relaxed between-group contact rate may trigger future epidemic waves more easily than an increased within-group contact rate. An increasing mixed contact rate of the older group quickly amplifies the daily incidence numbers for both groups whereas an increasing mixed contact rate of the younger group mainly leads to future waves in the younger group alone. The results indicate the importance of accelerating vaccine rollout for younger individuals in mitigating disease spread.
△ Less
Submitted 7 February, 2022;
originally announced February 2022.
-
Noisy linear inverse problems under convex constraints: Exact risk asymptotics in high dimensions
Authors:
Qiyang Han
Abstract:
In the standard Gaussian linear measurement model $Y=Xμ_0+ξ\in \mathbb{R}^m$ with a fixed noise level $σ>0$, we consider the problem of estimating the unknown signal $μ_0$ under a convex constraint $μ_0 \in K$, where $K$ is a closed convex set in $\mathbb{R}^n$. We show that the risk of the natural convex constrained least squares estimator (LSE) $\hatμ(σ)$ can be characterized exactly in high dim…
▽ More
In the standard Gaussian linear measurement model $Y=Xμ_0+ξ\in \mathbb{R}^m$ with a fixed noise level $σ>0$, we consider the problem of estimating the unknown signal $μ_0$ under a convex constraint $μ_0 \in K$, where $K$ is a closed convex set in $\mathbb{R}^n$. We show that the risk of the natural convex constrained least squares estimator (LSE) $\hatμ(σ)$ can be characterized exactly in high dimensional limits, by that of the convex constrained LSE $\hatμ_K^{\mathsf{seq}}$ in the corresponding Gaussian sequence model at a different noise level. The characterization holds (uniformly) for risks in the maximal regime that ranges from constant order all the way down to essentially the parametric rate, as long as certain necessary non-degeneracy condition is satisfied for $\hatμ(σ)$.
The precise risk characterization reveals a fundamental difference between noiseless (or low noise limit) and noisy linear inverse problems in terms of the sample complexity for signal recovery. A concrete example is given by the isotonic regression problem: While exact recovery of a general monotone signal requires $m\gg n^{1/3}$ samples in the noiseless setting, consistent signal recovery in the noisy setting requires as few as $m\gg \log n$ samples. Such a discrepancy occurs when the low and high noise risk behavior of $\hatμ_K^{\mathsf{seq}}$ differ significantly. In statistical languages, this occurs when $\hatμ_K^{\mathsf{seq}}$ estimates $0$ at a faster `adaptation rate' than the slower `worst-case rate' for general signals. Several other examples, including non-negative least squares and generalized Lasso (in constrained forms), are also worked out to demonstrate the concrete applicability of the theory in problems of different types.
△ Less
Submitted 20 January, 2022;
originally announced January 2022.
-
Nonparametric, tuning-free estimation of S-shaped functions
Authors:
Oliver Y. Feng,
Yining Chen,
Qiyang Han,
Raymond J. Carroll,
Richard J. Samworth
Abstract:
We consider the nonparametric estimation of an S-shaped regression function. The least squares estimator provides a very natural, tuning-free approach, but results in a non-convex optimisation problem, since the inflection point is unknown. We show that the estimator may nevertheless be regarded as a projection onto a finite union of convex cones, which allows us to propose a mixed primal-dual bas…
▽ More
We consider the nonparametric estimation of an S-shaped regression function. The least squares estimator provides a very natural, tuning-free approach, but results in a non-convex optimisation problem, since the inflection point is unknown. We show that the estimator may nevertheless be regarded as a projection onto a finite union of convex cones, which allows us to propose a mixed primal-dual bases algorithm for its efficient, sequential computation. After developing a projection framework that demonstrates the consistency and robustness to misspecification of the estimator, our main theoretical results provide sharp oracle inequalities that yield worst-case and adaptive risk bounds for the estimation of the regression function, as well as a rate of convergence for the estimation of the inflection point. These results reveal not only that the estimator achieves the minimax optimal rate of convergence for both the estimation of the regression function and its inflection point (up to a logarithmic factor in the latter case), but also that it is able to achieve an almost-parametric rate when the true regression function is piecewise affine with not too many affine pieces. Simulations and a real data application to air pollution modelling also confirm the desirable finite-sample properties of the estimator, and our algorithm is implemented in the R package Sshaped.
△ Less
Submitted 15 July, 2021;
originally announced July 2021.
-
Generalized kernel distance covariance in high dimensions: non-null CLTs and power universality
Authors:
Qiyang Han,
Yandi Shen
Abstract:
Distance covariance is a popular dependence measure for two random vectors $X$ and $Y$ of possibly different dimensions and types. Recent years have witnessed concentrated efforts in the literature to understand the distributional properties of the sample distance covariance in a high-dimensional setting, with an exclusive emphasis on the null case that $X$ and $Y$ are independent. This paper deri…
▽ More
Distance covariance is a popular dependence measure for two random vectors $X$ and $Y$ of possibly different dimensions and types. Recent years have witnessed concentrated efforts in the literature to understand the distributional properties of the sample distance covariance in a high-dimensional setting, with an exclusive emphasis on the null case that $X$ and $Y$ are independent. This paper derives the first non-null central limit theorem for the sample distance covariance, and the more general sample (Hilbert-Schmidt) kernel distance covariance in high dimensions, primarily in the Gaussian case. The new non-null central limit theorem yields an asymptotically exact first-order power formula for the widely used generalized kernel distance correlation test of independence between $X$ and $Y$. The power formula in particular unveils an interesting universality phenomenon: the power of the generalized kernel distance correlation test is completely determined by $n\cdot \text{dcor}^2(X,Y)/\sqrt{2}$ in the high dimensional limit, regardless of a wide range of choices of the kernels and bandwidth parameters. Furthermore, this separation rate is also shown to be optimal in a minimax sense. The key step in the proof of the non-null central limit theorem is a precise expansion of the mean and variance of the sample distance covariance in high dimensions, which shows, among other things, that the non-null Gaussian approximation of the sample distance covariance involves a rather subtle interplay between the dimension-to-sample ratio and the dependence between $X$ and $Y$.
△ Less
Submitted 1 August, 2024; v1 submitted 14 June, 2021;
originally announced June 2021.
-
On generalized Fermat Diophantine functional and partial differential equations in $\mathbf{C}^2$
Authors:
Wei Chen,
Qi Han,
Qiong Wang
Abstract:
In this paper, we characterize meromorphic solutions $f(z_1,z_2),g(z_1,z_2)$ to the generalized Fermat Diophantine functional equations $h(z_1,z_2)f^m+k(z_1,z_2)g^n=1$ in $\mathbf{C}^2$ for integers $m,n\geq2$ and nonzero meromorphic functions $h(z_1,z_2),k(z_1,z_2)$ in $\mathbf{C}^2$. Meromorphic solutions to associated partial differential equations are also studied.
In this paper, we characterize meromorphic solutions $f(z_1,z_2),g(z_1,z_2)$ to the generalized Fermat Diophantine functional equations $h(z_1,z_2)f^m+k(z_1,z_2)g^n=1$ in $\mathbf{C}^2$ for integers $m,n\geq2$ and nonzero meromorphic functions $h(z_1,z_2),k(z_1,z_2)$ in $\mathbf{C}^2$. Meromorphic solutions to associated partial differential equations are also studied.
△ Less
Submitted 2 June, 2021;
originally announced June 2021.
-
The Shuffle Variant of a Diophantine equation of Miyazaki and Togbé
Authors:
Elif Kızıldere,
Gökhan Soydan,
Qing Han,
Pingzhi Yuan
Abstract:
In 2012, T. Miyazaki and A. Togbé gave all of the solutions of the Diophantine equations $(2am-1)^x+(2m)^y=(2am+1)^z$ and $b^x+2^y=(b+2)^z$ in positive integers $x,y,z,$ $a>1$ and $b\ge 5$ odd. In this paper, we propose a similar problem (which we call the shuffle variant of a Diophantine equation of Miyazaki and Togbé). Here we first prove that the Diophantine equation…
▽ More
In 2012, T. Miyazaki and A. Togbé gave all of the solutions of the Diophantine equations $(2am-1)^x+(2m)^y=(2am+1)^z$ and $b^x+2^y=(b+2)^z$ in positive integers $x,y,z,$ $a>1$ and $b\ge 5$ odd. In this paper, we propose a similar problem (which we call the shuffle variant of a Diophantine equation of Miyazaki and Togbé). Here we first prove that the Diophantine equation $(2am+1)^x+(2m)^y=(2am-1)^z$ has only the solutions $(a, m, x, y, z)=(2, 1, 2, 1, 3)$ and $(2,1,1,2,2)$ in positive integers $a>1,m,x,y,z$. Then using this result, we show that the Diophantine equation $b^x+2^y=(b-2)^z$ has only the solutions $(b,x, y, z)=(5, 2, 1, 3)$ and $(5,1,2,2)$ in positive integers $x,y,z$ and $b$ odd.
△ Less
Submitted 21 May, 2021;
originally announced May 2021.
-
Geodesics and isometric immersions in kirigami
Authors:
Qing Han,
Marta Lewicka,
L. Mahadevan
Abstract:
Kirigami is the art of cutting paper to make it articulated and deployable, allowing for it to be shaped into complex two and three-dimensional geometries. The mechanical response of a kirigami sheet when it is pulled at its ends is enabled and limited by the presence of cuts that serve to guide the possible non-planar deformations. Inspired by the geometry of this art form, we ask two questions:…
▽ More
Kirigami is the art of cutting paper to make it articulated and deployable, allowing for it to be shaped into complex two and three-dimensional geometries. The mechanical response of a kirigami sheet when it is pulled at its ends is enabled and limited by the presence of cuts that serve to guide the possible non-planar deformations. Inspired by the geometry of this art form, we ask two questions: (i) What is the shortest path between points at which forces are applied? (ii) What is the nature of the ultimate shape of the sheet when it is strongly stretched?
Mathematically, these questions are related to the nature and form of geodesics in the Euclidean plane with linear obstructions (cuts), and the nature and form of isometric immersions of the sheet with cuts when it can be folded on itself. We provide a constructive proof that the geodesic connecting any two points in the plane is piecewise polygonal. We then prove that the family of polygonal geodesics can be simultaneously rectified into a straight line by flat-folding the sheet so that its configuration is a (non-unique) piecewise affine planar isometric immersion.
△ Less
Submitted 18 April, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Novel multi-step predictor-corrector schemes for backward stochastic differential equations
Authors:
Qiang Han,
Shaolin Ji
Abstract:
Novel multi-step predictor-corrector numerical schemes have been derived for approximating decoupled forward-backward stochastic differential equations (FBSDEs). The stability and high order rate of convergence of the schemes are rigorously proved. We also present a sufficient and necessary condition for the stability of the schemes. Numerical experiments are given to illustrate the stability and…
▽ More
Novel multi-step predictor-corrector numerical schemes have been derived for approximating decoupled forward-backward stochastic differential equations (FBSDEs). The stability and high order rate of convergence of the schemes are rigorously proved. We also present a sufficient and necessary condition for the stability of the schemes. Numerical experiments are given to illustrate the stability and convergence rates of the proposed methods.
△ Less
Submitted 11 February, 2021;
originally announced February 2021.
-
Multiplier U-processes: sharp bounds and applications
Authors:
Qiyang Han
Abstract:
The theory for multiplier empirical processes has been one of the central topics in the development of the classical theory of empirical processes, due to its wide applicability to various statistical problems. In this paper, we develop theory and tools for studying multiplier $U$-processes, a natural higher-order generalization of the multiplier empirical processes. To this end, we develop a mult…
▽ More
The theory for multiplier empirical processes has been one of the central topics in the development of the classical theory of empirical processes, due to its wide applicability to various statistical problems. In this paper, we develop theory and tools for studying multiplier $U$-processes, a natural higher-order generalization of the multiplier empirical processes. To this end, we develop a multiplier inequality that quantifies the moduli of continuity of the multiplier $U$-process in terms of that of the (decoupled) symmetrized $U$-process. The new inequality finds a variety of applications including (i) multiplier and bootstrap central limit theorems for $U$-processes, (ii) general theory for bootstrap $M$-estimators based on $U$-statistics, and (iii) theory for $M$-estimation under general complex sampling designs, again based on $U$-statistics.
△ Less
Submitted 10 February, 2021;
originally announced February 2021.
-
Contiguity under high dimensional Gaussianity with applications to covariance testing
Authors:
Qiyang Han,
Tiefeng Jiang,
Yandi Shen
Abstract:
Le Cam's third/contiguity lemma is a fundamental probabilistic tool to compute the limiting distribution of a given statistic $T_n$ under a non-null sequence of probability measures $\{Q_n\}$, provided its limiting distribution under a null sequence $\{P_n\}$ is available, and the log likelihood ratio $\{\log (dQ_n/dP_n)\}$ has a distributional limit. Despite its wide-spread applications to low-di…
▽ More
Le Cam's third/contiguity lemma is a fundamental probabilistic tool to compute the limiting distribution of a given statistic $T_n$ under a non-null sequence of probability measures $\{Q_n\}$, provided its limiting distribution under a null sequence $\{P_n\}$ is available, and the log likelihood ratio $\{\log (dQ_n/dP_n)\}$ has a distributional limit. Despite its wide-spread applications to low-dimensional statistical problems, the stringent requirement of Le Cam's third/contiguity lemma on the distributional limit of the log likelihood ratio makes it challenging, or even impossible to use in many modern high-dimensional statistical problems.
This paper provides a non-asymptotic analogue of Le Cam's third/contiguity lemma under high dimensional normal populations. Our contiguity method is particularly compatible with sufficiently regular statistics $T_n$: the regularity of $T_n$ effectively reduces both the problems of (i) obtaining a null (Gaussian) limit distribution and of (ii) verifying our new quantitative contiguity condition, to those of derivative calculations and moment bounding exercises. More important, our method bypasses the need to understand the precise behavior of the log likelihood ratio, and therefore possibly works even when it necessarily fails to stabilize -- a regime beyond the reach of classical contiguity methods.
As a demonstration of the scope of our new contiguity method, we obtain asymptotically exact power formulae for a number of widely used high-dimensional covariance tests, including the likelihood ratio tests and trace tests, that hold uniformly over all possible alternative covariance under mild growth conditions on the dimension-to-sample ratio. These new results go much beyond the scope of previous available case-specific techniques, and exhibit new phenomenon regarding the behavior of these important class of covariance tests.
△ Less
Submitted 14 November, 2022; v1 submitted 26 January, 2021;
originally announced January 2021.
-
The Loewner-Nirenberg Problem in Cones
Authors:
Qing Han,
Xumin Jiang,
Weiming Shen
Abstract:
We study asymptotic behaviors of solutions to the Loewner-Nirenberg problem in finite cones and establish optimal asymptotic expansions in terms of the corresponding solutions in infinite cones. The spherical domains over which cones are formed are allowed to have singularities. An elliptic operator on such spherical domains with coefficients singular on boundary play an important role. Due to the…
▽ More
We study asymptotic behaviors of solutions to the Loewner-Nirenberg problem in finite cones and establish optimal asymptotic expansions in terms of the corresponding solutions in infinite cones. The spherical domains over which cones are formed are allowed to have singularities. An elliptic operator on such spherical domains with coefficients singular on boundary play an important role. Due to the singularity of the spherical domains, extra cares are needed for the study of the global regularity of the eigenfunctions and solutions of the associated singular Dirichlet problem.
△ Less
Submitted 12 December, 2020;
originally announced December 2020.
-
Anisotropic Dynamical Horizons Arising in Gravitational Collapse
Authors:
Xinliang An,
Qing Han
Abstract:
For the study of $3+1$ dimensional Einstein vacuum equations (EVEs), substantial progress has been made recently on the problem of trapped surface formation. However, very limited knowledge of existence and associated properties is acquired on the boundary of the emerged trapped region, i.e., the apparent horizon, which is composed of marginally outer trapped surfaces (MOTSs) and is of great physi…
▽ More
For the study of $3+1$ dimensional Einstein vacuum equations (EVEs), substantial progress has been made recently on the problem of trapped surface formation. However, very limited knowledge of existence and associated properties is acquired on the boundary of the emerged trapped region, i.e., the apparent horizon, which is composed of marginally outer trapped surfaces (MOTSs) and is of great physical importance. In this paper, concerning this emerged apparent horizon we prove a folklore conjecture relating to both cosmic censorship and black hole thermodynamics. In a framework set up by Christodoulou and under a general anisotropic condition introduced by Klainerman, Luk and Rodnianski, for $3+1$ EVEs we prove that in the process of gravitational collapse, a smooth and spacelike apparent horizon (dynamical horizon) emerges from general (both isotropic and anisotropic) initial data. This dynamical horizon censors singularities formed in gravitational collapse from non-trapped local observers near the center, and it also enables the extension of black hole thermodynamical theory along the apparent horizon to anisotropic scenarios. Our analysis builds on scale-critical hyperbolic method and non-perturbative elliptic techniques. New observations and equation structures are exploited. Geometrically, we furthermore construct explicit finger-type single and multi-valley anisotropic apparent horizons. They are the first mathematical examples of the anisotropic MOTS and the anisotropic apparent horizon formed in dynamics, which have potential applications in geometric analysis, black hole mechanics, numerical relativity and gravitational wave phenomenology.
△ Less
Submitted 4 May, 2021; v1 submitted 23 October, 2020;
originally announced October 2020.
-
High dimensional asymptotics of likelihood ratio tests in the Gaussian sequence model under convex constraints
Authors:
Qiyang Han,
Bodhisattva Sen,
Yandi Shen
Abstract:
In the Gaussian sequence model $Y=μ+ξ$, we study the likelihood ratio test (LRT) for testing $H_0: μ=μ_0$ versus $H_1: μ\in K$, where $μ_0 \in K$, and $K$ is a closed convex set in $\mathbb{R}^n$. In particular, we show that under the null hypothesis, normal approximation holds for the log-likelihood ratio statistic for a general pair $(μ_0,K)$, in the high dimensional regime where the estimation…
▽ More
In the Gaussian sequence model $Y=μ+ξ$, we study the likelihood ratio test (LRT) for testing $H_0: μ=μ_0$ versus $H_1: μ\in K$, where $μ_0 \in K$, and $K$ is a closed convex set in $\mathbb{R}^n$. In particular, we show that under the null hypothesis, normal approximation holds for the log-likelihood ratio statistic for a general pair $(μ_0,K)$, in the high dimensional regime where the estimation error of the associated least squares estimator diverges in an appropriate sense. The normal approximation further leads to a precise characterization of the power behavior of the LRT in the high dimensional regime. These characterizations show that the power behavior of the LRT is in general non-uniform with respect to the Euclidean metric, and illustrate the conservative nature of existing minimax optimality and sub-optimality results for the LRT. A variety of examples, including testing in the orthant/circular cone, isotonic regression, Lasso, and testing parametric assumptions versus shape-constrained alternatives, are worked out to demonstrate the versatility of the developed theory.
△ Less
Submitted 20 June, 2021; v1 submitted 7 October, 2020;
originally announced October 2020.
-
Inference for local parameters in convexity constrained models
Authors:
Hang Deng,
Qiyang Han,
Bodhisattva Sen
Abstract:
We consider the problem of inference for local parameters of a convex regression function $f_0: [0,1] \to \mathbb{R}$ based on observations from a standard nonparametric regression model, using the convex least squares estimator (LSE) $\widehat{f}_n$. For $x_0 \in (0,1)$, the local parameters include the pointwise function value $f_0(x_0)$, the pointwise derivative $f_0'(x_0)$, and the anti-mode (…
▽ More
We consider the problem of inference for local parameters of a convex regression function $f_0: [0,1] \to \mathbb{R}$ based on observations from a standard nonparametric regression model, using the convex least squares estimator (LSE) $\widehat{f}_n$. For $x_0 \in (0,1)$, the local parameters include the pointwise function value $f_0(x_0)$, the pointwise derivative $f_0'(x_0)$, and the anti-mode (i.e., the smallest minimizer) of $f_0$. The existing limiting distribution of the estimation error $(\widehat{f}_n(x_0) - f_0(x_0), \widehat{f}_n'(x_0) - f_0'(x_0) )$ depends on the unknown second derivative $f_0''(x_0)$, and is therefore not directly applicable for inference. To circumvent this impasse, we show that the following locally normalized errors (LNEs) enjoy pivotal limiting behavior: Let $[\widehat{u}(x_0), \widehat{v}(x_0)]$ be the maximal interval containing $x_0$ where $\widehat{f}_n$ is linear. Then, under standard conditions, $$\binom{ \sqrt{n(\widehat{v}(x_0)-\widehat{u}(x_0))}(\widehat{f}_n(x_0)-f_0(x_0)) }{ \sqrt{n(\widehat{v}(x_0)-\widehat{u}(x_0))^3}(\widehat{f}_n'(x_0)-f_0'(x_0))} \rightsquigarrow σ\cdot \binom{\mathbb{L}^{(0)}_2}{\mathbb{L}^{(1)}_2},$$ where $n$ is the sample size, $σ$ is the standard deviation of the errors, and $\mathbb{L}^{(0)}_2, \mathbb{L}^{(1)}_2$ are universal random variables. This asymptotically pivotal LNE theory instantly yields a simple tuning-free procedure for constructing CIs with asymptotically exact coverage and optimal length for $f_0(x_0)$ and $f_0'(x_0)$. We also construct an asymptotically pivotal LNE for the anti-mode of $f_0$, and its limiting distribution does not even depend on $σ$. These asymptotically pivotal LNE theories are further extended to other convexity/concavity constrained models (e.g., log-concave density estimation) for which a limit distribution theory is available for problem-specific estimators.
△ Less
Submitted 18 June, 2020;
originally announced June 2020.
-
On a phase transition in general order spline regression
Authors:
Yandi Shen,
Qiyang Han,
Fang Han
Abstract:
In the Gaussian sequence model $Y= θ_0 + \varepsilon$ in $\mathbb{R}^n$, we study the fundamental limit of approximating the signal $θ_0$ by a class $Θ(d,d_0,k)$ of (generalized) splines with free knots. Here $d$ is the degree of the spline, $d_0$ is the order of differentiability at each inner knot, and $k$ is the maximal number of pieces. We show that, given any integer $d\geq 0$ and…
▽ More
In the Gaussian sequence model $Y= θ_0 + \varepsilon$ in $\mathbb{R}^n$, we study the fundamental limit of approximating the signal $θ_0$ by a class $Θ(d,d_0,k)$ of (generalized) splines with free knots. Here $d$ is the degree of the spline, $d_0$ is the order of differentiability at each inner knot, and $k$ is the maximal number of pieces. We show that, given any integer $d\geq 0$ and $d_0\in\{-1,0,\ldots,d-1\}$, the minimax rate of estimation over $Θ(d,d_0,k)$ exhibits the following phase transition: \begin{equation*} \begin{aligned} \inf_{\widetildeθ}\sup_{θ\inΘ(d,d_0, k)}\mathbb{E}_θ\|\widetildeθ - θ\|^2 \asymp_d \begin{cases} k\log\log(16n/k), & 2\leq k\leq k_0,\\ k\log(en/k), & k \geq k_0+1. \end{cases} \end{aligned} \end{equation*} The transition boundary $k_0$, which takes the form $\lfloor{(d+1)/(d-d_0)\rfloor} + 1$, demonstrates the critical role of the regularity parameter $d_0$ in the separation between a faster $\log \log(16n)$ and a slower $\log(en)$ rate. We further show that, once encouraging an additional '$d$-monotonicity' shape constraint (including monotonicity for $d = 0$ and convexity for $d=1$), the above phase transition is eliminated and the faster $k\log\log(16n/k)$ rate can be achieved for all $k$. These results provide theoretical support for developing $\ell_0$-penalized (shape-constrained) spline regression procedures as useful alternatives to $\ell_1$- and $\ell_2$-penalized ones.
△ Less
Submitted 6 May, 2020; v1 submitted 22 April, 2020;
originally announced April 2020.
-
Confidence intervals for multiple isotonic regression and other monotone models
Authors:
Hang Deng,
Qiyang Han,
Cun-Hui Zhang
Abstract:
We consider the problem of constructing pointwise confidence intervals in the multiple isotonic regression model. Recently, [HZ19] obtained a pointwise limit distribution theory for the so-called block max-min and min-max estimators [FLN17] in this model, but inference remains a difficult problem due to the nuisance parameter in the limit distribution that involves multiple unknown partial derivat…
▽ More
We consider the problem of constructing pointwise confidence intervals in the multiple isotonic regression model. Recently, [HZ19] obtained a pointwise limit distribution theory for the so-called block max-min and min-max estimators [FLN17] in this model, but inference remains a difficult problem due to the nuisance parameter in the limit distribution that involves multiple unknown partial derivatives of the true regression function.
In this paper, we show that this difficult nuisance parameter can be effectively eliminated by taking advantage of information beyond point estimates in the block max-min and min-max estimators. Formally, let $\hat{u}(x_0)$ (resp. $\hat{v}(x_0)$) be the maximizing lower-left (resp. minimizing upper-right) vertex in the block max-min (resp. min-max) estimator, and $\hat{f}_n$ be the average of the block max-min and min-max estimators. If all (first-order) partial derivatives of $f_0$ are non-vanishing at $x_0$, then the following pivotal limit distribution theory holds: $$ \sqrt{n_{\hat{u},\hat{v}}(x_0)}\big(\hat{f}_n(x_0)-f_0(x_0)\big)\rightsquigarrow σ\cdot \mathbb{L}_{1_d}. $$ Here $n_{\hat{u},\hat{v}}(x_0)$ is the number of design points in the block $[\hat{u}(x_0),\hat{v}(x_0)]$, $σ$ is the standard deviation of the errors, and $\mathbb{L}_{1_d}$ is a universal limit distribution free of nuisance parameters. This immediately yields confidence intervals for $f_0(x_0)$ with asymptotically exact confidence level and oracle length. Notably, the construction of the confidence intervals, even new in the univariate setting, requires no more efforts than performing an isotonic regression for once using the block max-min and min-max estimators, and can be easily adapted to other common monotone models. Extensive simulations are carried out to support our theory.
△ Less
Submitted 30 September, 2020; v1 submitted 20 January, 2020;
originally announced January 2020.
-
Berry-Esseen bounds for Chernoff-type non-standard asymptotics in isotonic regression
Authors:
Qiyang Han,
Kengo Kato
Abstract:
A Chernoff-type distribution is a nonnormal distribution defined by the slope at zero of the greatest convex minorant of a two-sided Brownian motion with a polynomial drift. While a Chernoff-type distribution is known to appear as the distributional limit in many non-regular statistical estimation problems, the accuracy of Chernoff-type approximations has remained largely unknown. In the present p…
▽ More
A Chernoff-type distribution is a nonnormal distribution defined by the slope at zero of the greatest convex minorant of a two-sided Brownian motion with a polynomial drift. While a Chernoff-type distribution is known to appear as the distributional limit in many non-regular statistical estimation problems, the accuracy of Chernoff-type approximations has remained largely unknown. In the present paper, we tackle this problem and derive Berry-Esseen bounds for Chernoff-type limit distributions in the canonical non-regular statistical estimation problem of isotonic (or monotone) regression. The derived Berry-Esseen bounds match those of the oracle local average estimator with optimal bandwidth in each scenario of possibly different Chernoff-type asymptotics, up to multiplicative logarithmic factors. Our method of proof differs from standard techniques on Berry-Esseen bounds, and relies on new localization techniques in isotonic regression and an anti-concentration inequality for the supremum of a Brownian motion with a Lipschitz drift.
△ Less
Submitted 22 June, 2021; v1 submitted 21 October, 2019;
originally announced October 2019.
-
Singular solutions to the Yamabe equation with prescribed asymptotics
Authors:
Qing Han,
Yichao Li
Abstract:
We study positive solutions of the Yamabe equation with isolated singularity and prove the existence of solutions with prescribed asymptotic expansions near singular points and an arbitrarily high order of approximation.
We study positive solutions of the Yamabe equation with isolated singularity and prove the existence of solutions with prescribed asymptotic expansions near singular points and an arbitrarily high order of approximation.
△ Less
Submitted 22 September, 2019;
originally announced September 2019.
-
Asymptotic expansions of solutions of the Yamabe equation and the $σ_k$-Yamabe equation near isolated singular points
Authors:
Qing Han,
Xiaoxiao Li,
Yichao Li
Abstract:
We study asymptotic behaviors of positive solutions to the Yamabe equation and the $σ$k-Yamabe equation near isolated singular points and establish expansions up to arbitrary orders. Such results generalize an earlier pioneering work by Caffarelli, Gidas, and Spruck, and a work by Korevaar, Mazzeo, Pacard, and Schoen, on the Yamabe equation, and a work by Han, Li, and Teixeira on the $σ_k$-Yamabe…
▽ More
We study asymptotic behaviors of positive solutions to the Yamabe equation and the $σ$k-Yamabe equation near isolated singular points and establish expansions up to arbitrary orders. Such results generalize an earlier pioneering work by Caffarelli, Gidas, and Spruck, and a work by Korevaar, Mazzeo, Pacard, and Schoen, on the Yamabe equation, and a work by Han, Li, and Teixeira on the $σ_k$-Yamabe equation. The study is based on a combination of classification of global singular solutions and an analysis of linearized operators at these global singular solutions. Such linearized equations are uniformly elliptic near singular points for $1 \leq k \leq n/2$ and become degenerate for $n/2 < k \leq n$. In a significant portion of the paper, we establish a degree 1 expansion for the $σ_k$-Yamabe equation for $n/2 < k < n$, generalizing a similar result for $k = 1$ by Korevaar, Mazzeo, Pacard, and Schoen and for $2 \leq k \leq n/2$ by Han, Li, and Teixeira.
△ Less
Submitted 16 September, 2019;
originally announced September 2019.
-
Elliptic variational problems with mixed nonlinearities
Authors:
Qi Han
Abstract:
In this paper, we study the existence and multiplicity results of nontrivial positive solutions to a quasilinear elliptic equation in $\RN$, when $N\geq2$, as \begin{equation} \Lp u+u^{p-1}=λ\hspace{0.2mm}k(x)u^{r-1}-h(x)u^{q-1}.\nonumber \end{equation} Here, $h(x),k(x)>0$ are Lebesgue measurable functions, $1<p<q<\infty$, $p<r<\min\{p^*,q\}$ if $p<N$ while $p<r<q$ if $p\geq N$, and $λ>0$ is a par…
▽ More
In this paper, we study the existence and multiplicity results of nontrivial positive solutions to a quasilinear elliptic equation in $\RN$, when $N\geq2$, as \begin{equation} \Lp u+u^{p-1}=λ\hspace{0.2mm}k(x)u^{r-1}-h(x)u^{q-1}.\nonumber \end{equation} Here, $h(x),k(x)>0$ are Lebesgue measurable functions, $1<p<q<\infty$, $p<r<\min\{p^*,q\}$ if $p<N$ while $p<r<q$ if $p\geq N$, and $λ>0$ is a parameter.
△ Less
Submitted 10 June, 2019;
originally announced June 2019.