Search | arXiv e-print repository

arXiv:2504.19450 [pdf, other]

Signal detection from spiked noise via asymmetrization

Authors: Zhigang Bao, Kha Man Cheong, Yuji Li

Abstract: The signal plus noise model $H=S+Y$ is a fundamental model in signal detection when a low rank signal $S$ is polluted by noise $Y$. In the high-dimensional setting, one often uses the leading singular values and corresponding singular vectors of $H$ to conduct the statistical inference of the signal $S$. In particular, when $Y$ consists of iid random entries, the singular values of $S$ can be estimated from those of $H$ as long as the signal $S$ is strong enough. However, when the $Y$ entries are heteroscedastic or correlated, this standard approach may fail. In this work, we consider a situation that can easily arise with heteroscedastic noise but is particularly difficult to address using the singular value approach, namely, when the noise $Y$ itself may create spiked singular values. It has been a recurring question how to distinguish the signal $S$ from the spikes in $Y$, as this seems impossible by examining the leading singular values of $H$. Inspired by the work \cite{CCF21}, we turn to study the eigenvalues of an asymmetrized model when two samples $H_1=S+Y_1$ and $H_2=S+Y_2$ are available. We show that by looking into the leading eigenvalues (in magnitude) of the asymmetrized model $H_1H_2^*$, one can easily detect $S$. Unlike \cite{CCF21}, first, we show that even if the spikes from $Y$ are much larger than the strength of $S$, and thus the operator norm of $Y$ is much larger than that of $S$, the detection is still effective. Second, we establish the precise detection threshold. Third, we do not require any structural assumption on the singular vectors of $S$. Finally, we derive precise limiting behaviour of the leading eigenvalues of the asymmetrized model. Based on the limiting results, we propose a completely data-based approach for the detection of $S$.

Submitted 27 April, 2025; originally announced April 2025.

arXiv:2503.18922 [pdf, ps, other]

Law of fractional logarithm for random matrices

Authors: Zhigang Bao, Giorgio Cipolloni, László Erdős, Joscha Henheik, Oleksii Kolupaiev

Abstract: We prove the Paquette-Zeitouni law of fractional logarithm (LFL) for the extreme eigenvalues [arXiv:1505.05627] in full generality, and thereby verify a conjecture from [arXiv:1505.05627]. Our result holds for any Wigner minor process and both symmetry classes, in particular for the GOE minor process, while [arXiv:1505.05627] and the recent full resolution of LFL by Baslingker et al. [arXiv:2410.11836] cover only the GUE case, which is determinantal. Lacking the possibility for a direct comparison with the Gaussian case, we develop a robust and natural method for both key parts of the proof. On one hand, we rely on a powerful martingale technique to describe precisely the strong correlation between the largest eigenvalue of an $N\times N$ Wigner matrix and its $(N-k)\times (N-k)$ minor if $k\ll N^{2/3}$. On the other hand, we use dynamical methods to show that this correlation is weak if $k\gg N^{2/3}$.

Submitted 24 March, 2025; originally announced March 2025.

MSC Class: 60B20; 60G55; 82C10

arXiv:2503.06549 [pdf, other]

Decorrelation transition in the Wigner minor process

Authors: Zhigang Bao, Giorgio Cipolloni, László Erdős, Joscha Henheik, Oleksii Kolupaiev

Abstract: We consider the Wigner minor process, i.e. the eigenvalues of an $N\times N$ Wigner matrix $H^{(N)}$ together with the eigenvalues of all its $n\times n$ minors, $H^{(n)}$, $n\le N$. The top eigenvalues of $H^{(N)}$ and those of its immediate minor $H^{(N-1)}$ are very strongly correlated, but this correlation becomes weaker for smaller minors $H^{(N-k)}$ as $k$ increases. For the GUE minor process the critical transition regime around $k\sim N^{2/3}$ was analyzed by Forrester and Nagao (J. Stat. Mech.: Theory and Experiment, 2011) providing an explicit formula for the nontrivial joint correlation function. We prove that this formula is universal, i.e. it holds for the Wigner minor process. Moreover, we give a complete analysis of the sub- and supercritical regimes both for eigenvalues and for the corresponding eigenvector overlaps, thus we prove the decorrelation transition in full generality.

Submitted 7 April, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

Comments: 33 pages, 3 figures; v1->v2->v3: minor updates

MSC Class: 60B20; 60G55; 82C10

arXiv:2411.11341 [pdf, other]

Ultra high order cumulants and quantitative CLT for polynomials in Random Matrices

Authors: Zhigang Bao, Daniel Munoz George

Abstract: From the study of the high order freeness of random matrices, it is known that the order $r$ cumulant of the trace of a polynomial of $N$-dimensional GUE/GOE is of order $N^{2-r}$ if $r$ is fixed. In this work, we extend the study along three directions. First, we also consider generally distributed Wigner matrices with subexponential entries. Second, we include deterministic matrices in the discussion and consider arbitrary polynomials in random matrices and deterministic matrices. Third, and more importantly, we consider the ultra high order cumulants in the sense that $r$ is arbitrary, i.e., could be $N$ dependent. Our main results are the upper bounds of the ultra high order cumulants, for which not only the $N$-dependence but also the $r$-dependence become significant. These results are then used to derive three types of quantitative CLT for the trace of any given self-adjoint polynomial in these random matrix variables: a CLT with a Cramér type correction, a Berry-Esseen bound, and a concentration inequality which captures both the Gaussian tail in the small deviation regime and $M$-dependent tail in the large deviation regime, where $M$ is the degree of the polynomial. In contrast to the second order freeness which implies the CLT for linear eigenvalue statistics of polynomials in random matrices, our study on the ultra high order cumulants leads to the quantitative versions of the CLT.

Submitted 18 November, 2024; originally announced November 2024.

arXiv:2409.01819 [pdf, other]

Phase transition for the bottom singular vector of rectangular random matrices

Authors: Zhigang Bao, Jaehun Lee, Xiaocong Xu

Abstract: In this paper, we consider the rectangular random matrix $X=(x_{ij})\in \mathbb{R}^{N\times n}$ whose entries are iid with tail $\mathbb{P}(|x_{ij}|>t)\sim t^{-\alpha}$ for some $\alpha>0$. We consider the regime $N(n)/n\to \mathsf{a}>1$ as $n$ tends to infinity. Our main interest lies in the right singular vector corresponding to the smallest singular value, which we will refer to as the "bottom singular vector", denoted by $\mathfrak{u}$. In this paper, we prove the following phase transition regarding the localization length of $\mathfrak{u}$: when $\alpha<2$ the localization length is $O(n/\log n)$; when $\alpha>2$ the localization length is of order $n$. Similar results hold for all right singular vectors around the smallest singular value. The variational definition of the bottom singular vector suggests that the mechanism for this localization-delocalization transition when $\alpha$ goes across $2$ is intrinsically different from the one for the top singular vector when $\alpha$ goes across $4$.

Submitted 17 September, 2024; v1 submitted 3 September, 2024; originally announced September 2024.

Comments: minor update

arXiv:2403.12542 [pdf, ps, other]

Attitude Tracking of Uncertain Flexible Spacecraft Systems Subject to Unknown External Disturbances

Authors: Zean Bao, Maobin Lu, Fang Deng, Jie Chen

Abstract: In this paper, we investigate the attitude tracking problem of uncertain flexible spacecraft systems subject to external disturbances. In sharp contrast to existing results, the dynamics of flexible spacecraft systems and external disturbances are allowed to be unknown. To deal with the challenges posed by these unknown factors, we develop a class of nonlinear internal models which converts the attitude tracking problem of uncertain flexible spacecraft systems into a regulation problem of an augmented system. Furthermore, to overcome the difficulties caused by the unmeasurable modal variable, the uncertainty introduced by the internal model, and the cross-coupling of the uncertainties with the system state, we design an auxiliary dynamic system for auxiliary stabilization, a dynamic compensator for dynamic compensation, and a linearly parameterized transformation for adaptive regulation in sequence. By introducing a series of coordinate and input transformations, we propose an adaptive dynamic control law to achieve regulation of the augmented system, thus leading to the solution to the attitude tracking problem. In addition, we analyze the convergence of the estimated parameter to its true value under the persistently exciting condition. Finally, the effectiveness of the developed approach is verified by its application to the attitude manoeuvre of a flexible spacecraft system in the presence of external disturbances.

Submitted 19 March, 2024; originally announced March 2024.

Comments: 8 pages, 2 figures, submitted to TAC on 6 Dec. 2023

arXiv:2312.05911 [pdf, ps, other]

A leave-one-out approach to approximate message passing

Authors: Zhigang Bao, Qiyang Han, Xiaocong Xu

Abstract: Approximate message passing (AMP) has emerged both as a popular class of iterative algorithms and as a powerful analytic tool in a wide range of statistical estimation problems and statistical physics models. A well established line of AMP theory proves Gaussian approximations for the empirical distributions of the AMP iterate in the high dimensional limit, under the GOE random matrix model and its variants. This paper provides a non-asymptotic, leave-one-out representation for the AMP iterate that holds under a broad class of Gaussian random matrix models with general variance profiles. In contrast to the typical AMP theory that describes the empirical distributions of the AMP iterate via a low dimensional state evolution, our leave-one-out representation yields an intrinsically high dimensional state evolution formula which provides non-asymptotic characterizations for the possibly heterogeneous, entrywise behavior of the AMP iterate under the prescribed random matrix models. To exemplify some distinct features of our AMP theory in applications, we analyze, in the context of regularized linear estimation, the precise stochastic behavior of the Ridge estimator for independent and non-identically distributed observations whose covariates exhibit general variance profiles. We find that its finite-sample distribution is characterized via a weighted Ridge estimator in a heterogeneous Gaussian sequence model. Notably, in contrast to the i.i.d. sampling scenario, the effective noise and regularization are now full dimensional vectors determined via a high dimensional system of equations. Our leave-one-out method of proof differs significantly from the widely adopted conditioning approach for rotational invariant ensembles, and relies instead on an inductive method that utilizes almost solely integration-by-parts and concentration techniques.

Submitted 25 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

arXiv:2308.09581 [pdf, ps, other]

Phase transition for the smallest eigenvalue of covariance matrices

Authors: Zhigang Bao, Jaehun Lee, Xiaocong Xu

Abstract: In this paper, we study the smallest non-zero eigenvalue of the sample covariance matrices $\mathcal{S}(Y)=YY^*$, where $Y=(y_{ij})$ is an $M\times N$ matrix with iid mean $0$ variance $N^{-1}$ entries. We prove a phase transition for its distribution, induced by the fatness of the tail of $y_{ij}$'s. More specifically, we assume that $y_{ij}$ is symmetrically distributed with tail probability $\mathbb{P}(|\sqrt{N}y_{ij}|\geq x)\sim x^{-\alpha}$ when $x\to \infty$, for some $\alpha\in (2,4)$. We show the following conclusions: (i) When $\alpha>\frac83$, the smallest eigenvalue follows the Tracy-Widom law on scale $N^{-\frac23}$; (ii) When $2<\alpha<\frac83$, the smallest eigenvalue follows the Gaussian law on scale $N^{-\frac{\alpha}{4}}$; (iii) When $\alpha=\frac83$, the distribution is given by an interpolation between Tracy-Widom and Gaussian; (iv) In case $\alpha\leq \frac{10}{3}$, in addition to the left edge of the MP law, a deterministic shift of order $N^{1-\frac{\alpha}{2}}$ shall be subtracted from the smallest eigenvalue, in both the Tracy-Widom law and the Gaussian law. Overall, our proof strategy is inspired by \cite{ALY}, which was originally done for the bulk regime of the Lévy Wigner matrices. In addition to various technical complications arising from the bulk-to-edge extension, two ingredients are needed for our derivation: an intermediate left edge local law based on a simple but effective matrix minor argument, and a mesoscopic CLT for the linear spectral statistic with asymptotic expansion for its expectation.

Submitted 8 November, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

Comments: Typos in equations (1.13) and (2.3) have been corrected

arXiv:2303.01810 [pdf, other]

DC-based Security Constraints Formulation: A Perspective of Primal-Dual Interior Point Method

Authors: Zhiyuan Bao, Zechun Hu, Asad Mujeeb

Abstract: The DC network security constraints have been extensively studied in numerous power system problems, such as optimal power flow (OPF), security-constrained economic dispatch (SCED), and security-constrained unit commitment (SCUC). Linear shift factors, i.e., power transfer distribution factors (PTDFs), are widely applied to replace DC power flow constraints. However, the PTDF matrix is extremely dense, making it difficult to solve security-constrained optimization problems. This paper analyzes the computational inefficiency of PTDF-based security constraints from the sparse structure perspective of the primal-dual interior point method (IPM). Additionally, a matrix transformation method is proposed for restoring the sparsity of the linear system during IPM iterations. It turns out that the transformation method is equivalent to solving the original optimization problem expressed in pure voltage angle, which preserves the sparsity structure but introduces additional variables and constraints proportional to one to two times the total number of buses. The regular B-$\theta$ formulation is also a variant of the proposed transformation. Numerical studies show that sparsity rather than the size of variables and constraints is the key factor impacting the speed of solving convex quadratic problems (QP), i.e., OPF and SCED problems. In contrast, sparsity is less desirable when solving a mixed integer problem (MIP), such as the SCUC problem, where reoptimization techniques are significantly more critical and the dual simplex method is typically employed rather than IPM.

Submitted 3 March, 2023; originally announced March 2023.

arXiv:2212.11634 [pdf, ps, other]

Extreme eigenvalues of Log-concave Ensemble

Authors: Zhigang Bao, Xiaocong Xu

Abstract: In this paper, we consider the log-concave ensemble of random matrices, a class of covariance-type matrices $XX^*$ with isotropic log-concave $X$-columns. A main example is the covariance estimator of the uniform measure on an isotropic convex body. Non-asymptotic estimates and first order asymptotic limits for the extreme eigenvalues have been obtained in the literature. In this paper, with the recent advancements on log-concave measures \cite{chen, KL22}, we take a step further to locate the eigenvalues with a nearly optimal precision, namely, the spectral rigidity of this ensemble is derived. Based on the spectral rigidity and an additional "unconditional" assumption, we further derive the Tracy-Widom law for the extreme eigenvalues of $XX^*$, and the Gaussian law for the extreme eigenvalues in case strong spikes are present.

Submitted 22 December, 2022; originally announced December 2022.

arXiv:2207.06107 [pdf, other]

Spectral Statistics of Sample Block Correlation Matrices

Authors: Zhigang Bao, Jiang Hu, Xiaocong Xu, Xiaozhuo Zhang

Abstract: A fundamental concept in multivariate statistics, the sample correlation matrix, is often used to infer the correlation/dependence structure among random variables, when the population mean and covariance are unknown. A natural block extension of it, the {\it sample block correlation matrix}, is proposed to take on the same role, when random variables are generalized to random sub-vectors. In this paper, we establish a spectral theory of the sample block correlation matrices and apply it to the group independence test and related problems, under the high-dimensional setting. More specifically, we consider a random vector of dimension $p$, consisting of $k$ sub-vectors of dimension $p_t$'s, where $p_t$'s can vary from $1$ to order $p$. Our primary goal is to investigate the dependence of the $k$ sub-vectors. We construct a random matrix model called sample block correlation matrix based on $n$ samples for this purpose. The spectral statistics of the sample block correlation matrix include the classical Wilks' statistic and Schott's statistic as special cases. It turns out that the spectral statistics do not depend on the unknown population mean and covariance. Further, under the null hypothesis that the sub-vectors are independent, the limiting behavior of the spectral statistics can be described with the aid of the Free Probability Theory. Specifically, under three different settings of possibly $n$-dependent $k$ and $p_t$'s, we show that the empirical spectral distribution of the sample block correlation matrix converges to the free Poisson binomial distribution, free Poisson distribution (Marchenko-Pastur law) and free Gaussian distribution (semicircle law), respectively. We then further derive the CLTs for the linear spectral statistics of the block correlation matrix under general setting.

Submitted 7 September, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

arXiv:2112.00329 [pdf, other]

Non-splitting Neyman-Pearson Classifiers

Authors: Jingming Wang, Lucy Xia, Zhigang Bao, Xin Tong

Abstract: The Neyman-Pearson (NP) binary classification paradigm constrains the more severe type of error (e.g., the type I error) under a preferred level while minimizing the other (e.g., the type II error). This paradigm is suitable for applications such as severe disease diagnosis, fraud detection, among others. A series of NP classifiers have been developed to guarantee the type I error control with high probability. However, these existing classifiers involve a sample splitting step: a mixture of class 0 and class 1 observations to construct a scoring function and some left-out class 0 observations to construct a threshold. This splitting enables classifier construction built upon independence, but it amounts to insufficient use of data for training and a potentially higher type II error. Leveraging a canonical linear discriminant analysis model, we derive a quantitative CLT for a certain functional of quadratic forms of the inverse of sample and population covariance matrices, and based on this result, develop for the first time NP classifiers without splitting the training sample. Numerical experiments have confirmed the advantages of our new non-splitting parametric strategy.

Submitted 4 June, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

arXiv:2108.03306 [pdf, ps, other]

A non-commutative Nullstellensatz

Authors: Zhengheng Bao, Zinovy Reichstein

Abstract: Let $K$ be a field and $D$ be a finite-dimensional central division algebra over $K$. We prove a variant of the Nullstellensatz for $2$-sided ideals in the ring of polynomial maps $D^n \to D$. In the case where $D = K$ is commutative, our main result reduces to the $K$-Nullstellensatz of Laksov and Adkins-Gianni-Tognoli. In the case where $K = \mathbb R$ is the field of real numbers and $D$ is the algebra of Hamilton quaternions, it reduces to the quaternionic Nullstellensatz recently proved by Alon and Paran.

Submitted 6 August, 2021; originally announced August 2021.

Comments: 7 pages

MSC Class: 16S36; 14A25

arXiv:2103.05402 [pdf, ps, other]

Quantitative CLT for linear eigenvalue statistics of Wigner matrices

Authors: Zhigang Bao, Yukun He

Abstract: In this article, we establish a near-optimal convergence rate for the CLT of linear eigenvalue statistics of Wigner matrices, in Kolmogorov-Smirnov distance. For all test functions $f\in C^5(\mathbb R)$, we show that the convergence rate is either $N^{-1/2+\varepsilon}$ or $N^{-1+\varepsilon}$, depending on the first Chebyshev coefficient of $f$ and the third moment of the diagonal matrix entries. The condition that distinguishes these two rates is necessary and sufficient. For a general class of test functions, we further identify matching lower bounds for the convergence rates. In addition, we identify an explicit, non-universal contribution in the linear eigenvalue statistics, which is responsible for the slow rate $N^{-1/2+\varepsilon}$ for non-Gaussian ensembles. By removing this non-universal part, we show that the shifted linear eigenvalue statistics have the unified convergence rate $N^{-1+\varepsilon}$ for all test functions.

Submitted 19 March, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

Comments: Minor updates

MSC Class: 60B20; 60F05

arXiv:2009.13143 [pdf, ps, other]

Eigenvector distribution in the critical regime of BBP transition

Authors: Zhigang Bao, Dong Wang

Abstract: In this paper, we study the random matrix model of Gaussian Unitary Ensemble (GUE) with fixed-rank (aka spiked) external source. We will focus on the critical regime of the Baik-Ben Arous-Péché (BBP) phase transition and establish the distribution of the eigenvectors associated with the leading eigenvalues. The distribution is given in terms of a determinantal point process with extended Airy kernel. Our result can be regarded as an eigenvector counterpart of the BBP eigenvalue phase transition (arXiv:math/0403022). The derivation of the distribution makes use of the recently re-discovered eigenvector-eigenvalue identity, together with the determinantal point process representation of the GUE minor process with external source.

Submitted 26 April, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

Comments: 54 pages, 19 figures, some computational details elaborated

MSC Class: 60B20(primary) 15A18 (secondary)

arXiv:2008.11903 [pdf, ps, other]

Statistical inference for principal components of spiked covariance matrices

Authors: Zhigang Bao, Xiucai Ding, Jingming Wang, Ke Wang

Abstract: In this paper, we study the asymptotic behavior of the extreme eigenvalues and eigenvectors of the high dimensional spiked sample covariance matrices, in the supercritical case when a reliable detection of spikes is possible. In particular, we derive the joint distribution of the extreme eigenvalues and the generalized components of the associated eigenvectors, i.e., the projections of the eigenvectors onto an arbitrary given direction, assuming that the dimension and sample size are comparably large. In general, the joint distribution is given in terms of linear combinations of finitely many Gaussian and Chi-square variables, with parameters depending on the projection direction and the spikes. Our assumption on the spikes is fully general. First, the strengths of spikes are only required to be slightly above the critical threshold and no upper bound on the strengths is needed. Second, multiple spikes, i.e., spikes with the same strength, are allowed. Third, no structural assumption is imposed on the spikes. Thanks to the general setting, we can then apply the results to various high dimensional statistical hypothesis testing problems involving both the eigenvalues and eigenvectors. Specifically, we propose accurate and powerful statistics to conduct hypothesis testing on the principal components. These statistics are data-dependent and adaptive to the underlying true spikes. Numerical simulations also confirm the accuracy and power of our proposed statistics and illustrate significantly better performance compared to the existing methods in the literature. In particular, our methods are accurate and powerful even when either the spikes are small or the dimension is large.

Submitted 3 September, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

Comments: This work largely extends our previous work arXiv:1907.12251, and the latter shall be regarded as a very special case of the current work. In addition, various statistical applications are discussed

arXiv:2008.07061 [pdf, ps, other]

doi 10.1017/fms.2021.38

Equipartition principle for Wigner matrices

Authors: Zhigang Bao, László Erdős, Kevin Schnelli

Abstract: We prove that the energy of any eigenvector of a sum of several independent large Wigner matrices is equally distributed among these matrices with very high precision. This shows a particularly strong microcanonical form of the equipartition principle for quantum systems whose components are modelled by Wigner matrices.

Submitted 16 August, 2020; originally announced August 2020.

Journal ref: Forum of Mathematics, Sigma 9 (2021) e44

arXiv:2001.07661 [pdf, ps, other]

Central limit theorem for mesoscopic eigenvalue statistics of the free sum of matrices

Authors: Zhigang Bao, Kevin Schnelli, Yuanyuan Xu

Abstract: We consider random matrices of the form $H_N=A_N+U_N B_N U^*_N$, where $A_N$, $B_N$ are two $N$ by $N$ deterministic Hermitian matrices and $U_N$ is a Haar distributed random unitary matrix. We establish a universal Central Limit Theorem for the linear eigenvalue statistics of $H_N$ on all mesoscopic scales inside the regular bulk of the spectrum. The proof is based on studying the characteristic function of the linear eigenvalue statistics, and consists of two main steps: (1) generating Ward identities using the left-translation-invariance of the Haar measure, along with a local law for the resolvent of $H_N$ and analytic subordination properties of the free additive convolution, allow us to derive an explicit formula for the derivative of the characteristic function; (2) a local law for two-point product functions of resolvents is derived using a partial randomness decomposition of the Haar measure. We also prove the corresponding results for orthogonal conjugations.

Submitted 19 August, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

MSC Class: 15B52; 60B20

arXiv:1911.04151 [pdf, other]

On Cramér-von Mises statistic for the spectral distribution of random matrices

Authors: Zhigang Bao, Yukun He

Abstract: Let $F_N$ and $F$ be the empirical and limiting spectral distributions of an $N\times N$ Wigner matrix. The Cramér-von Mises (CvM) statistic is a classical goodness-of-fit statistic that characterizes the distance between $F_N$ and $F$ in $\ell^2$-norm. In this paper, we consider a mesoscopic approximation of the CvM statistic for Wigner matrices, and derive its limiting distribution. In the appendix, we also give the limiting distribution of the CvM statistic (without approximation) for the toy model CUE.

Submitted 24 July, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

arXiv:1907.13099 [pdf, other]

Equivalence of pth moment stability between stochastic differential delay equations and their numerical methods

Authors: Zhenyu Bao, Jingwen Tang, Yan Shen, Wei Liu

Abstract: In this paper, a general theorem on the equivalence of $p$th moment stability between stochastic differential delay equations (SDDEs) and their numerical methods is proved under the assumptions that the numerical methods are strongly convergent and have bounded $p$th moments in finite time. The truncated Euler-Maruyama (EM) method is studied as an example to illustrate that the theorem indeed covers a large range of SDDEs. Alongside the investigation of the truncated EM method, the requirements on the step size of the method are significantly relaxed compared with the work where the method was initially proposed.

Submitted 28 July, 2019; originally announced July 2019.

Comments: 29 pages, 1 figure. arXiv admin note: text overlap with arXiv:1703.09565, arXiv:1804.07635 by other authors

arXiv:1907.12251 [pdf, ps, other]

Principal components of spiked covariance matrices in the supercritical regime

Authors: Zhigang Bao, Xiucai Ding, Jingming Wang, Ke Wang

Abstract: In this paper, we study the asymptotic behavior of the extreme eigenvalues and eigenvectors of the spiked covariance matrices, in the supercritical regime. Specifically, we derive the joint distribution of the extreme eigenvalues and the generalized components of their associated eigenvectors in this regime.

Submitted 28 August, 2020; v1 submitted 29 July, 2019; originally announced July 2019.

Comments: This paper has been included in arXiv:2008.11903 as a special case. Hence, this paper will not be published separately

arXiv:1809.10476 [pdf, other]

Singular vector and singular subspace distribution for the matrix denoising model

Authors: Zhigang Bao, Xiucai Ding, Ke Wang

Abstract: In this paper, we study the matrix denoising model $Y=S+X$, where $S$ is a low-rank deterministic signal matrix and $X$ is a random noise matrix, and both are $M\times n$. In the scenario that $M$ and $n$ are comparably large and the signals are supercritical, we study the fluctuation of the outlier singular vectors of $Y$. More specifically, we derive the limiting distribution of angles between the principal singular vectors of $Y$ and their deterministic counterparts, the singular vectors of $S$. Further, we also derive the distribution of the distance between the subspace spanned by the principal singular vectors of $Y$ and that spanned by the singular vectors of $S$. It turns out that the limiting distributions depend on the structure of the singular vectors of $S$ and the distribution of $X$, and thus they are non-universal.

Submitted 7 July, 2020; v1 submitted 27 September, 2018; originally announced September 2018.

Comments: Final version. Accepted by the Annals of Statistics

arXiv:1804.11199 [pdf, ps, other]

On the support of the free additive convolution

Authors: Zhigang Bao, Laszlo Erdos, Kevin Schnelli

Abstract: We consider the free additive convolution of two probability measures $μ$ and $ν$ on the real line and show that $μ\boxplusν$ is supported on a single interval if $μ$ and $ν$ each has single interval support. Moreover, the density of $μ\boxplusν$ is proven to vanish as a square root near the edges of its support if both $μ$ and $ν$ have power law behavior with exponents between $-1$ and $1$ near their edges. In particular, these results show the ubiquity of the conditions in our recent work on optimal local law at the spectral edges for addition of random matrices [4].

Submitted 27 October, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

MSC Class: 46L54; 60B20; 30A99

arXiv:1712.00892 [pdf, other]

Tracy-Widom limit for Kendall's tau

Authors: Zhigang Bao

Abstract: In this paper, we study a high-dimensional random matrix model from nonparametric statistics called the Kendall rank correlation matrix, which is a natural multivariate extension of the Kendall rank correlation coefficient. We establish the Tracy-Widom law for its largest eigenvalue. It is the first Tracy-Widom law for a nonparametric random matrix model, and also the first Tracy-Widom law for a high-dimensional U-statistic.

Submitted 14 May, 2020; v1 submitted 3 December, 2017; originally announced December 2017.

Comments: supplementary material is attached to the end of the paper for the convenience of the reader

arXiv:1708.01597 [pdf, ps, other]

Spectral rigidity for addition of random matrices at the regular edge

Authors: Zhigang Bao, Laszlo Erdos, Kevin Schnelli

Abstract: We consider the sum of two large Hermitian matrices $A$ and $B$ with a Haar unitary conjugation bringing them into a general relative position. We prove that the eigenvalue density on the scale slightly above the local eigenvalue spacing is asymptotically given by the free convolution of the laws of $A$ and $B$ as the dimension of the matrix increases. This implies optimal rigidity of the eigenvalues and optimal rate of convergence in Voiculescu's theorem. Our previous works [3,4] established these results in the bulk spectrum; the current paper completely settles the problem at the spectral edges provided they have the typical square-root behavior. The key element of our proof is to compensate the deterioration of the stability of the subordination equations by sharp error estimates that properly account for the local density near the edge. Our results also hold if the Haar unitary matrix is replaced by the Haar orthogonal matrix.

Submitted 14 May, 2020; v1 submitted 4 August, 2017; originally announced August 2017.

arXiv:1704.02408 [pdf, other]

Canonical correlation coefficients of high-dimensional Gaussian vectors: finite rank case

Authors: Zhigang Bao, Jiang Hu, Guangming Pan, Wang Zhou

Abstract: Consider a Gaussian vector $\mathbf{z}=(\mathbf{x}',\mathbf{y}')'$, consisting of two sub-vectors $\mathbf{x}$ and $\mathbf{y}$ with dimensions $p$ and $q$ respectively, where both $p$ and $q$ are proportional to the sample size $n$. Denote by $Σ_{\mathbf{u}\mathbf{v}}$ the population cross-covariance matrix of random vectors $\mathbf{u}$ and $\mathbf{v}$, and denote by $S_{\mathbf{u}\mathbf{v}}$ the sample counterpart. The canonical correlation coefficients between $\mathbf{x}$ and $\mathbf{y}$ are known as the square roots of the nonzero eigenvalues of the canonical correlation matrix $Σ_{\mathbf{x}\mathbf{x}}^{-1}Σ_{\mathbf{x}\mathbf{y}}Σ_{\mathbf{y}\mathbf{y}}^{-1}Σ_{\mathbf{y}\mathbf{x}}$. In this paper, we focus on the case that $Σ_{\mathbf{x}\mathbf{y}}$ is of finite rank $k$, i.e. there are $k$ nonzero canonical correlation coefficients, whose squares are denoted by $r_1\geq\cdots\geq r_k>0$. We study the sample counterparts of $r_i,i=1,\ldots,k$, i.e. the largest $k$ eigenvalues of the sample canonical correlation matrix $S_{\mathbf{x}\mathbf{x}}^{-1}S_{\mathbf{x}\mathbf{y}}S_{\mathbf{y}\mathbf{y}}^{-1}S_{\mathbf{y}\mathbf{x}}$, denoted by $λ_1\geq\cdots\geq λ_k$. We show that there exists a threshold $r_c\in(0,1)$, such that for each $i\in\{1,\ldots,k\}$, when $r_i\leq r_c$, $λ_i$ converges almost surely to the right edge of the limiting spectral distribution of the sample canonical correlation matrix, denoted by $d_{+}$. When $r_i>r_c$, $λ_i$ possesses an almost sure limit in $(d_{+},1]$. We also obtain the limiting distribution of $λ_i$'s under appropriate normalization. Specifically, $λ_i$ possesses Gaussian type fluctuation if $r_i>r_c$, and follows Tracy-Widom distribution if $r_i<r_c$. Some applications of our results are also discussed.

Submitted 6 June, 2017; v1 submitted 7 April, 2017; originally announced April 2017.

Comments: This is an extended version of the previous work arXiv:1407.7194v2. In the current work, we have included the result on the fluctuations, and the limit part has also been reorganized

arXiv:1612.05920 [pdf, ps, other]

Local single ring theorem on optimal scale

Authors: Zhigang Bao, László Erdős, Kevin Schnelli

Abstract: Let $U$ and $V$ be two independent $N$ by $N$ random matrices that are distributed according to Haar measure on $U(N)$. Let $Σ$ be a non-negative deterministic $N$ by $N$ matrix. The single ring theorem [26] asserts that the empirical eigenvalue distribution of the matrix $X:= UΣV^*$ converges weakly, in the limit of large $N$, to a deterministic measure which is supported on a single ring centered at the origin in $\mathbb{C}$. Within the bulk regime, i.e. in the interior of the single ring, we establish the convergence of the empirical eigenvalue distribution on the optimal local scale of order $N^{-1/2+\varepsilon}$ and establish the optimal convergence rate. The same results hold true when $U$ and $V$ are Haar distributed on $O(N)$.

Submitted 1 March, 2019; v1 submitted 18 December, 2016; originally announced December 2016.

Comments: A gap in the proof of Lemma 5.5 has been fixed

MSC Class: 46L54; 60B20

arXiv:1606.03076 [pdf, ps, other]

Convergence Rate for Spectral Distribution of Addition of Random Matrices

Authors: Zhigang Bao, Laszlo Erdos, Kevin Schnelli

Abstract: Let $A$ and $B$ be two $N$ by $N$ deterministic Hermitian matrices and let $U$ be an $N$ by $N$ Haar distributed unitary matrix. It is well known that the spectral distribution of the sum $H=A+UBU^*$ converges weakly to the free additive convolution of the spectral distributions of $A$ and $B$, as $N$ tends to infinity. We establish the optimal convergence rate ${\frac{1}{N}}$ in the bulk of the spectrum.

Submitted 9 June, 2016; originally announced June 2016.

arXiv:1509.07080 [pdf, ps, other]

doi 10.1007/s00220-016-2805-6

Local law of addition of random matrices on optimal scale

Authors: Zhigang Bao, Laszlo Erdos, Kevin Schnelli

Abstract: The eigenvalue distribution of the sum of two large Hermitian matrices, when one of them is conjugated by a Haar distributed unitary matrix, is asymptotically given by the free convolution of their spectral distributions. We prove that this convergence also holds locally in the bulk of the spectrum, down to the optimal scales larger than the eigenvalue spacing. The corresponding eigenvectors are fully delocalized. Similar results hold for the sum of two real symmetric matrices, when one is conjugated by a Haar orthogonal matrix.

Submitted 15 January, 2016; v1 submitted 23 September, 2015; originally announced September 2015.

Comments: More details on the continuity argument in Sections 7 and 8 are added

MSC Class: 46L54; 60B20

arXiv:1508.05905 [pdf, ps, other]

Local Stability of the Free Additive Convolution

Authors: Zhigang Bao, Laszlo Erdos, Kevin Schnelli

Abstract: We prove that the system of subordination equations, defining the free additive convolution of two probability measures, is stable away from the edges of the support and blow-up singularities by showing that the recent smoothness condition of Kargin is always satisfied. As an application, we consider the local spectral statistics of the random matrix ensemble $A+UBU^*$, where $U$ is a Haar distributed random unitary or orthogonal matrix, and $A$ and $B$ are deterministic matrices. In the bulk regime, we prove that the empirical spectral distribution of $A+UBU^*$ concentrates around the free additive convolution of the spectral distributions of $A$ and $B$ on scales down to $N^{-2/3}$.

Submitted 2 January, 2016; v1 submitted 24 August, 2015; originally announced August 2015.

Comments: Third version: More details added to Lemma 6.3 and proof of Theorem 2.8

MSC Class: 46L54; 60B20

arXiv:1503.07510 [pdf, ps, other]

Delocalization for a class of random block band matrices

Authors: Zhigang Bao, Laszlo Erdos

Abstract: We consider $N\times N$ Hermitian random matrices $H$ consisting of blocks of size $M\geq N^{6/7}$. The matrix elements are i.i.d. within the blocks, close to a Gaussian in the four moment matching sense, but their distribution varies from block to block to form a block-band structure, with an essential band width $M$. We show that the entries of the Green's function $G(z)=(H-z)^{-1}$ satisfy the local semicircle law with spectral parameter $z=E+\mathbf{i}η$ down to the real axis for any $η\gg N^{-1}$, using a combination of the supersymmetry method inspired by \cite{Sh2014} and the Green's function comparison strategy. Previous estimates were valid only for $η\gg M^{-1}$. The new estimate also implies that the eigenvectors in the middle of the spectrum are fully delocalized.

Submitted 25 March, 2015; originally announced March 2015.

Comments: 81 pages

arXiv:1410.5082 [pdf, other]

Test of Independence for High-dimensional Random Vectors Based on Block Correlation Matrices

Authors: Zhigang Bao, Jiang Hu, Guangming Pan, Wang Zhou

Abstract: In this paper, we are concerned with the independence test for $k$ high-dimensional sub-vectors of a normal vector, with fixed positive integer $k$. A natural high-dimensional extension of the classical sample correlation matrix, namely block correlation matrix, is raised for this purpose. We then construct the so-called Schott type statistic as our test statistic, which turns out to be a particular linear spectral statistic of the block correlation matrix. Interestingly, the limiting behavior of the Schott type statistic can be figured out with the aid of the Free Probability Theory and the Random Matrix Theory. Specifically, we will bring the so-called real second order freeness for Haar distributed orthogonal matrices, derived in \cite{MP2013}, into the framework of this high-dimensional testing problem. Our test does not require the sample size to be larger than the total or any partial sum of the dimensions of the $k$ sub-vectors. Simulated results show the effect of the Schott type statistic, in contrast to those of the statistics proposed in \cite{JY2013} and \cite{JBZ2013}, is satisfactory. Real data analysis is also used to illustrate our method.

Submitted 19 October, 2014; originally announced October 2014.

arXiv:1407.7194 [pdf, other]

Canonical correlation coefficients of high-dimensional normal vectors: finite rank case

Authors: Zhigang Bao, Jiang Hu, Guangming Pan, Wang Zhou

Abstract: Consider a normal vector $\mathbf{z}=(\mathbf{x}',\mathbf{y}')'$, consisting of two sub-vectors $\mathbf{x}$ and $\mathbf{y}$ with dimensions $p$ and $q$ respectively. With $n$ independent observations of $\mathbf{z}$ at hand, we study the correlation between $\mathbf{x}$ and $\mathbf{y}$, from the perspective of the Canonical Correlation Analysis, under the high-dimensional setting: both $p$ and $q$ are proportional to the sample size $n$. In this paper, we focus on the case that $Σ_{\mathbf{x}\mathbf{y}}$ is of finite rank $k$, i.e. there are $k$ nonzero canonical correlation coefficients, whose squares are denoted by $r_1\geq\cdots\geq r_k>0$. Under the additional assumptions $(p+q)/n\to y\in (0,1)$ and $p/q\not\to 1$, we study the sample counterparts of $r_i,i=1,\ldots,k$, i.e. the largest $k$ eigenvalues of the sample canonical correlation matrix $S_{\mathbf{x}\mathbf{x}}^{-1}S_{\mathbf{x}\mathbf{y}}S_{\mathbf{y}\mathbf{y}}^{-1}S_{\mathbf{y}\mathbf{x}}$, namely $λ_1\geq\cdots\geq λ_k$. We show that there exists a threshold $r_c\in(0,1)$, such that for each $i\in\{1,\ldots,k\}$, when $r_i\leq r_c$, $λ_i$ converges almost surely to the right edge of the limiting spectral distribution of the sample canonical correlation matrix, denoted by $d_r$. When $r_i>r_c$, $λ_i$ possesses an almost sure limit in $(d_r,1]$, from which we can recover $r_i$ in turn, thus providing an estimate of the latter in the high-dimensional scenario.

Submitted 5 August, 2014; v1 submitted 27 July, 2014; originally announced July 2014.

Comments: Some typos were corrected

arXiv:1312.5119 [pdf, ps, other]

doi 10.1214/15-AOS1353

Spectral statistics of large dimensional Spearman's rank correlation matrix and its application

Authors: Zhigang Bao, Liang-Ching Lin, Guangming Pan, Wang Zhou

Abstract: Let $\mathbf{Q}=(Q_1,\ldots,Q_n)$ be a random vector drawn from the uniform distribution on the set of all $n!$ permutations of $\{1,2,\ldots,n\}$. Let $\mathbf{Z}=(Z_1,\ldots,Z_n)$, where $Z_j$ is the mean zero variance one random variable obtained by centralizing and normalizing $Q_j$, $j=1,\ldots,n$. Assume that $\mathbf {X}_i,i=1,\ldots ,p$ are i.i.d. copies of $\frac{1}{\sqrt{p}}\mathbf{Z}$ and $X=X_{p,n}$ is the $p\times n$ random matrix with $\mathbf{X}_i$ as its $i$th row. Then $S_n=XX^*$ is called the $p\times n$ Spearman's rank correlation matrix which can be regarded as a high dimensional extension of the classical nonparametric statistic Spearman's rank correlation coefficient between two independent random variables. In this paper, we establish a CLT for the linear spectral statistics of this nonparametric random matrix model in the scenario of high dimension, namely, $p=p(n)$ and $p/n\to c\in(0,\infty)$ as $n\to\infty$. We propose a novel evaluation scheme to estimate the core quantity in Anderson and Zeitouni's cumulant method in [Ann. Statist. 36 (2008) 2553-2576] to bypass the so-called joint cumulant summability. In addition, we raise a two-step comparison approach to obtain the explicit formulae for the mean and covariance functions in the CLT. Relying on this CLT, we then construct a distribution-free statistic to test complete independence for components of random vectors. Owing to the nonparametric property, we can use this test on generally distributed random variables including the heavy-tailed ones.

Submitted 17 November, 2015; v1 submitted 18 December, 2013; originally announced December 2013.

Comments: Published at http://dx.doi.org/10.1214/15-AOS1353 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOS-AOS1353

Journal ref: Annals of Statistics 2015, Vol. 43, No. 6, 2588-2623

arXiv:1304.5690 [pdf, ps, other]

doi 10.1214/14-AOS1281

Universality for the largest eigenvalue of sample covariance matrices with general population

Authors: Zhigang Bao, Guangming Pan, Wang Zhou

Abstract: This paper is aimed at deriving the universality of the largest eigenvalue of a class of high-dimensional real or complex sample covariance matrices of the form $\mathcal{W}_N=Σ^{1/2}XX^*Σ^{1/2}$. Here, $X=(x_{ij})_{M,N}$ is an $M\times N$ random matrix with independent entries $x_{ij},1\leq i\leq M,1\leq j\leq N$ such that $\mathbb{E}x_{ij}=0$, $\mathbb{E}|x_{ij}|^2=1/N$. On dimensionality, we assume that $M=M(N)$ and $N/M\rightarrow d\in(0,\infty)$ as $N\rightarrow\infty$. For a class of general deterministic positive-definite $M\times M$ matrices $Σ$, under some additional assumptions on the distribution of $x_{ij}$'s, we show that the limiting behavior of the largest eigenvalue of $\mathcal{W}_N$ is universal, via pursuing a Green function comparison strategy raised in [Probab. Theory Related Fields 154 (2012) 341-407, Adv. Math. 229 (2012) 1435-1515] by Erdős, Yau and Yin for Wigner matrices and extended by Pillai and Yin [Ann. Appl. Probab. 24 (2014) 935-1001] to sample covariance matrices in the null case ($Σ=I$). Consequently, in the standard complex case ($\mathbb{E}x_{ij}^2=0$), combining this universality property and the results known for Gaussian matrices obtained by El Karoui in [Ann. Probab. 35 (2007) 663-714] (nonsingular case) and Onatski in [Ann. Appl. Probab. 18 (2008) 470-490] (singular case), we show that after an appropriate normalization the largest eigenvalue of $\mathcal{W}_N$ converges weakly to the type 2 Tracy-Widom distribution $\mathrm{TW}_2$. Moreover, in the real case, we show that when $Σ$ is spiked with a fixed number of subcritical spikes, the type 1 Tracy-Widom limit $\mathrm{TW}_1$ holds for the normalized largest eigenvalue of $\mathcal {W}_N$, which extends a result of Féral and Péché in [J. Math. Phys. 50 (2009) 073302] to the scenario of nondiagonal $Σ$ and more generally distributed $X$.

Submitted 5 March, 2015; v1 submitted 21 April, 2013; originally announced April 2013.

Comments: Published at http://dx.doi.org/10.1214/14-AOS1281 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOS-AOS1281

Journal ref: Annals of Statistics 2015, Vol. 43, No. 1, 382-421

arXiv:1211.2507 [pdf, ps, other]

Universality for a global property of the eigenvectors of Wigner matrices

Authors: Zhigang Bao, Guangming Pan, Wang Zhou

Abstract: Let $M_n$ be an $n\times n$ real (resp. complex) Wigner matrix and $U_nΛ_n U_n^*$ be its spectral decomposition. Set $(y_1,y_2,\ldots,y_n)^T=U_n^*x$, where $x=(x_1,x_2,\ldots,x_n)^T$ is a real (resp. complex) unit vector. Under the assumption that the elements of $M_n$ have 4 matching moments with those of GOE (resp. GUE), we show that the process $X_n(t)=\sqrt{\frac{βn}{2}}\sum_{i=1}^{\lfloor nt\rfloor}(|y_i|^2-\frac1n)$ converges weakly to the Brownian bridge for any $x$ such that $||x||_\infty\rightarrow 0$ as $n\rightarrow \infty$, where $β=1$ for the real case and $β=2$ for the complex case. Such a result indicates that the orthogonal (resp. unitary) matrices with columns being the eigenvectors of Wigner matrices are asymptotically Haar distributed on the orthogonal (resp. unitary) group from a certain perspective.

Submitted 26 October, 2013; v1 submitted 11 November, 2012; originally announced November 2012.

Comments: typos corrected

arXiv:1208.5823 [pdf, ps, other]

doi 10.3150/14-BEJ615

The logarithmic law of random determinant

Authors: Zhigang Bao, Guangming Pan, Wang Zhou

Abstract: Consider the square random matrix $A_n=(a_{ij})_{n,n}$, where $\{a_{ij}:=a_{ij}^{(n)},i,j=1,\ldots,n\}$ is a collection of independent real random variables with means zero and variances one. Under the additional moment condition \[\sup_n\max_{1\leq i,j\leq n}\mathbb{E}a_{ij}^4<\infty,\] we prove Girko's logarithmic law of $\det A_n$ in the sense that as $n\rightarrow\infty$ \begin{eqnarray*}\frac{\log|\det A_n|-(1/2)\log(n-1)!}{\sqrt{(1/2)\log n}}\stackrel{d}{ \longrightarrow}N(0,1).\end{eqnarray*}

Submitted 28 July, 2015; v1 submitted 28 August, 2012; originally announced August 2012.

Comments: Published at http://dx.doi.org/10.3150/14-BEJ615 in the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

Report number: IMS-BEJ-BEJ615

Journal ref: Bernoulli 2015, Vol. 21, No. 3, 1600-1628
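
The normalization in the logarithmic law above can be checked by simulation. The sketch below is illustrative only: it assumes standard Gaussian entries (one admissible choice with mean zero, variance one, and bounded fourth moment) and uses the identity $\log (n-1)! = \log Γ(n)$. The function name `normalized_log_det` is hypothetical. Since the scaling grows only like $\sqrt{\log n}$, agreement with $N(0,1)$ at moderate $n$ should be expected to be rough.

```python
import math
import numpy as np

def normalized_log_det(n, rng):
    """Compute (log|det A_n| - (1/2) log (n-1)!) / sqrt((1/2) log n) for Gaussian A_n."""
    a = rng.standard_normal((n, n))            # assumed entry distribution: N(0, 1)
    _, log_abs_det = np.linalg.slogdet(a)      # stable log|det|, avoids overflow
    center = 0.5 * math.lgamma(n)              # lgamma(n) = log((n-1)!)
    scale = math.sqrt(0.5 * math.log(n))
    return (log_abs_det - center) / scale

rng = np.random.default_rng(0)
samples = np.array([normalized_log_det(200, rng) for _ in range(300)])
```

A histogram of `samples` can then be compared against the standard normal density.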

arXiv:1206.0508 [pdf, ps, other]

doi 10.1007/s10955-012-0663-y

Central limit theorem for partial linear eigenvalue statistics of Wigner matrices

Authors: Zhigang Bao, Guangming Pan, Wang Zhou

Abstract: In this paper, we study the complex Wigner matrices $M_n=\frac{1}{\sqrt{n}}W_n$ whose eigenvalues are typically in the interval $[-2,2]$. Let $λ_1\leq λ_2\leq\cdots\leq λ_n$ be the ordered eigenvalues of $M_n$. Under the assumption of four matching moments with the Gaussian Unitary Ensemble (GUE), for a test function $f$ that is four times continuously differentiable on an open interval including $[-2,2]$, we establish central limit theorems for two types of partial linear statistics of the eigenvalues. The first type is defined with a threshold $u$ in the bulk of the Wigner semicircle law as $\mathcal{A}_n[f; u]=\sum_{l=1}^nf(λ_l)\mathbf{1}_{\{λ_l\leq u\}}$. The second is $\mathcal{B}_n[f; k]=\sum_{l=1}^{k}f(λ_l)$ with a positive integer $k=k_n$ such that $k/n\rightarrow y\in (0,1)$ as $n$ tends to infinity. Moreover, we derive a weak convergence result for a partial sum process constructed from $\mathcal{B}_n[f; \lfloor nt\rfloor]$.

Submitted 3 June, 2012; originally announced June 2012.

Comments: 39 pages

MSC Class: 15B52; 60F05; 60F17

arXiv:1110.5208 [pdf, ps, other]

Tracy-Widom law for the extreme eigenvalues of sample correlation matrices

Authors: Zhigang Bao, Guangming Pan, Wang Zhou

Abstract: Let the sample correlation matrix be $W=YY^T$, where $Y=(y_{ij})_{p,n}$ with $y_{ij}=x_{ij}/\sqrt{\sum_{j=1}^nx_{ij}^2}$. We assume $\{x_{ij}: 1\leq i\leq p, 1\leq j\leq n\}$ to be a collection of independent, symmetrically distributed random variables with sub-exponential tails. Moreover, for any $i$, we assume $x_{ij}, 1\leq j\leq n$ to be identically distributed. We assume $0<p<n$ and $p/n\rightarrow y$ with some $y\in(0,1)$ as $p,n\rightarrow\infty$. In this paper, we provide the Tracy-Widom law ($TW_1$) for both the largest and smallest eigenvalues of $W$. If $x_{ij}$ are i.i.d. standard normal, we can derive the $TW_1$ for both the largest and smallest eigenvalues of the matrix $\mathcal{R}=RR^T$, where $R=(r_{ij})_{p,n}$ with $r_{ij}=(x_{ij}-\bar x_i)/\sqrt{\sum_{j=1}^n(x_{ij}-\bar x_i)^2}$, $\bar x_i=n^{-1}\sum_{j=1}^nx_{ij}$.

Submitted 31 October, 2011; v1 submitted 24 October, 2011; originally announced October 2011.

Comments: 35 pages, a major revision

MSC Class: 15B52; 62H25; 62H10
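
The row normalization that defines $W=YY^T$ in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: the entries $x_{ij}$ are taken to be standard normal (one symmetric, sub-exponential choice), the sizes $p=100$, $n=400$ are arbitrary, and the function name `sample_correlation_extremes` is hypothetical.

```python
import numpy as np

def sample_correlation_extremes(p, n, rng):
    """Build W = Y Y^T with unit-norm rows of Y; return W and its extreme eigenvalues."""
    x = rng.standard_normal((p, n))                        # assumed entry distribution
    y = x / np.sqrt((x ** 2).sum(axis=1, keepdims=True))   # y_ij = x_ij / sqrt(sum_j x_ij^2)
    w = y @ y.T
    eig = np.linalg.eigvalsh(w)                            # ascending eigenvalues of symmetric W
    return w, eig[0], eig[-1]

p, n = 100, 400
w, smallest, largest = sample_correlation_extremes(p, n, np.random.default_rng(0))
```

Each row of $Y$ has unit norm, so the diagonal of $W$ is identically one and its eigenvalues average to one; the Tracy-Widom statement concerns the fluctuations of `smallest` and `largest` after suitable centering and scaling, which this sketch does not attempt.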

arXiv:1104.3470 [pdf, ps, other]

On asymptotic expansion and CLT of linear eigenvalue statistics for sample covariance matrices when $N/M\rightarrow0$

Authors: Zhigang Bao

Abstract: We study the renormalized real sample covariance matrix $H=X^TX/\sqrt{MN}-\sqrt{M/N}$ with $N/M\rightarrow0$ as $N, M\rightarrow \infty$, and we always assume $M=M(N)$. Here $X=[X_{jk}]_{M\times N}$ is an $M\times N$ real random matrix with i.i.d. entries, and we assume $\mathbb{E}|X_{11}|^{5+δ}<\infty$ with some small positive $δ$. The Stieltjes transform $m_N(z)=N^{-1}\mathrm{Tr}(H-z)^{-1}$ and the linear eigenvalue statistics of $H$ are considered. We mainly focus on the asymptotic expansion of $\mathbb{E}\{m_N(z)\}$ in this paper. Then for some fine test function, a central limit theorem for the linear eigenvalue statistics of $H$ is established. We show that the variance of the limiting normal distribution coincides with the case of a real Wigner matrix with Gaussian entries.

Submitted 15 November, 2011; v1 submitted 18 April, 2011; originally announced April 2011.

Comments: 24 pages

arXiv:1104.3431 [pdf, ps, other]

Local Semicircle law and Gaussian fluctuation for Hermite $β$ ensemble

Authors: Zhigang Bao, Zhonggen Su

Abstract: Let $β>0$ and consider an $n$-point process $λ_1, λ_2,\ldots, λ_n$ from the Hermite $β$ ensemble on the real line $\mathbb{R}$. Dumitriu and Edelman discovered a tri-diagonal matrix model and established the global Wigner semicircle law for normalized empirical measures. In this paper we prove that the average number of states in a small interval in the bulk converges in probability when the length of the interval is larger than $\sqrt {\log n}$, i.e., the local semicircle law holds. The number of positive states in $(0,\infty)$ is proved to fluctuate normally around its mean $n/2$ with variance like $\log n/(π^2β)$. The proofs rely largely on the method introduced by Valkó and Virág for counting states in any interval and on the classical martingale argument.

Submitted 18 April, 2011; originally announced April 2011.

Comments: 14 pages

arXiv:math/0609557 [pdf, ps, other]

doi 10.1515/FORM.2011.052

Manifolds associated with $(Z_2)^n$-colored regular graphs

Authors: Zhiqiang Bao, Zhi Lü

Abstract: In this article we describe a canonical way to expand a certain kind of $(\mathbb Z_2)^{n+1}$-colored regular graphs into closed $n$-manifolds by adding cells determined by the edge-colorings inductively. We show that every closed combinatorial $n$-manifold can be obtained in this way. When $n\leq 3$, we give simple equivalent conditions for a colored graph to admit an expansion. In addition, we show that if a $(\mathbb Z_2)^{n+1}$-colored regular graph admits an $n$-skeletal expansion, then it is realizable as the moment graph of an $(n+1)$-dimensional closed $(\mathbb Z_2)^{n+1}$-manifold.

Submitted 12 February, 2008; v1 submitted 20 September, 2006; originally announced September 2006.

Comments: 20 pages with 9 figures, in AMS-LaTex, v4 added a new section on reconstructing a space with a $(Z_2)^n$-action for which its moment graph is a given colored graph

MSC Class: 57Q15; 57S17; 05C15; 05C25 (Primary) 20F65; 55N91 (Secondary)

Journal ref: Forum Math. 24(2012), 121-149

Showing 1–42 of 42 results for author: Bao, Z