-
Joint Probability Estimation of Many Binary Outcomes via Localized Adversarial Lasso
Authors:
Alexandre Belloni,
Yan Chen,
Matthew Harding
Abstract:
In this work we consider estimating the probability of many (possibly dependent) binary outcomes, which is at the core of many applications, e.g., multi-level treatments in causal inference, demand for bundles of products, etc. Without further conditions, the probability distribution of an M-dimensional binary vector is characterized by a number of coefficients that is exponential in M, which can lead to a high-dimensional problem even without the presence of covariates. Understanding the (in)dependence structure allows us to substantially improve the estimation, as it allows for an effective factorization of the probability distribution. In order to estimate the probability distribution of an M-dimensional binary vector, we leverage a Bahadur representation that connects the sparsity of its coefficients with independence across the components. We propose to use regularized and adversarial regularized estimators to obtain an estimator that is adaptive with respect to the dependence structure, which allows the rates of convergence to depend on this intrinsic (lower) dimension. These estimators are needed to handle several challenges within this setting, including estimating nuisance parameters, estimating covariates, and nonseparable moment conditions. Our main results consider the presence of (low-dimensional) covariates, for which we propose a locally penalized estimator. We provide pointwise rates of convergence, addressing several issues in the theoretical analyses as we strive for a computationally tractable formulation. We apply our results to the estimation of causal effects with multiple binary treatments and show how our estimators can improve the finite-sample performance when compared with non-adaptive estimators that try to estimate all the probabilities directly. We also provide simulations that are consistent with our theoretical findings.
Submitted 30 January, 2025; v1 submitted 19 October, 2024;
originally announced October 2024.
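The link the abstract draws between sparsity of the Bahadur coefficients and independence can be illustrated with a small sketch. It assumes the standard Bahadur expansion truncated at second order (pairwise correlations only); the function name `bahadur_pmf` and its arguments are hypothetical, and this is not the paper's estimator. When every pairwise coefficient is zero, the distribution factorizes into independent Bernoulli marginals.

```python
from itertools import product
from math import prod, sqrt


def bahadur_pmf(p, r2):
    """Second-order Bahadur pmf of a binary vector (illustrative sketch).

    p  : list of marginal success probabilities p_j, each in (0, 1)
    r2 : dict mapping index pairs (j, k), j < k, to pairwise correlations
    Higher-order terms are assumed to be zero. For large correlations the
    truncated expansion can produce negative values, so it is not always
    a valid distribution.
    """
    pmf = {}
    for y in product((0, 1), repeat=len(p)):
        # Probability under independence: product of Bernoulli marginals.
        indep = prod(pj if yj else 1.0 - pj for pj, yj in zip(p, y))
        # Standardized outcomes z_j = (y_j - p_j) / sqrt(p_j (1 - p_j)).
        z = [(yj - pj) / sqrt(pj * (1.0 - pj)) for pj, yj in zip(p, y)]
        # Correction factor 1 + sum_{j<k} r_jk z_j z_k.
        correction = 1.0 + sum(r * z[j] * z[k] for (j, k), r in r2.items())
        pmf[y] = indep * correction
    return pmf
```

Calling `bahadur_pmf([0.3, 0.5, 0.6], {(0, 1): 0.2})` needs only three marginals and one correlation rather than the 2^3 - 1 free cell probabilities of an unrestricted model, which is the kind of dimension reduction a sparse dependence structure permits.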
-
Order Determination of Large Dimensional Dynamic Factor Model
Authors:
Z. D. Bai,
Chen Wang,
Ya Xue,
Matthew Harding
Abstract:
Consider the following dynamic factor model: $\mathbf{R}_t=\sum_{i=0}^q \mathbf{\Lambda}_i \mathbf{f}_{t-i}+\mathbf{e}_t$, $t=1,\dots,T$, where $\mathbf{\Lambda}_i$ is an $n\times k$ loading matrix of full rank, $\{\mathbf{f}_t\}$ are i.i.d. $k\times1$-factors, and $\mathbf{e}_t$ are independent $n\times1$ white noises. Now, assuming that $n/T\to c>0$, we want to estimate the orders $k$ and $q$ respectively. Define a random matrix $$\mathbf{\Phi}_n(\tau)=\frac{1}{2T}\sum_{j=1}^T (\mathbf{R}_j \mathbf{R}_{j+\tau}^* + \mathbf{R}_{j+\tau} \mathbf{R}_j^*),$$ where $\tau\ge 0$ is an integer. When there are no factors, the matrix $\mathbf{\Phi}_n(\tau)$ reduces to $$\mathbf{M}_n(\tau) = \frac{1}{2T} \sum_{j=1}^T (\mathbf{e}_j \mathbf{e}_{j+\tau}^* + \mathbf{e}_{j+\tau} \mathbf{e}_j^*).$$ When $\tau=0$, $\mathbf{M}_n(\tau)$ reduces to the usual sample covariance matrix whose ESD tends to the well known MP law and $\mathbf{\Phi}_n(0)$ reduces to the standard spike model. Hence the number $k(q+1)$ can be estimated by the number of spiked eigenvalues of $\mathbf{\Phi}_n(0)$. To obtain separate estimates of $k$ and $q$, we have employed the spectral analysis of $\mathbf{M}_n(\tau)$ and established the spiked model analysis for $\mathbf{\Phi}_n(\tau)$.
Submitted 31 March, 2017; v1 submitted 8 November, 2015;
originally announced November 2015.
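The symmetrized matrix defined in the abstract above, and the idea of counting spiked eigenvalues at lag 0, can be sketched numerically. This is an illustration under stated assumptions, not the authors' procedure: the helper names (`sym_autocov`, `simulate_dfm`, `count_spikes`), the Gaussian loadings, factors and noise, and the ad hoc threshold of 1.5 times the Marchenko-Pastur upper edge $(1+\sqrt{n/T})^2$ are all choices made for the example. Separating $k$ from $q$ requires the lag $\tau\ge 1$ analysis, which is not shown.

```python
import numpy as np


def sym_autocov(R, tau):
    """Phi_n(tau) = (1/2T) sum_j (R_j R_{j+tau}^* + R_{j+tau} R_j^*).

    R is an n x (T + tau) array whose columns are the observations.
    """
    T = R.shape[1] - tau
    A = R[:, :T] @ R[:, tau:tau + T].conj().T  # sum_j R_j R_{j+tau}^*
    return (A + A.conj().T) / (2 * T)


def simulate_dfm(n, T, k, q, rng):
    """Draw R_t = sum_{i=0}^q Lambda_i f_{t-i} + e_t with Gaussian inputs."""
    loadings = rng.standard_normal((q + 1, n, k))
    F = rng.standard_normal((k, T + q))  # factors f_{1-q}, ..., f_T
    R = rng.standard_normal((n, T))      # white noise e_t
    for i in range(q + 1):
        # Column t of F[:, q-i : q-i+T] holds f_{t-i}.
        R = R + loadings[i] @ F[:, q - i:q - i + T]
    return R


def count_spikes(R, margin=1.5):
    """Count eigenvalues of Phi_n(0) above margin * MP upper edge.

    Assumes unit noise variance; the margin is a hypothetical tuning choice.
    """
    n, T = R.shape
    edge = (1.0 + np.sqrt(n / T)) ** 2
    eigvals = np.linalg.eigvalsh(sym_autocov(R, 0))
    return int(np.sum(eigvals > margin * edge))
```

For example, `count_spikes(simulate_dfm(100, 200, 2, 1, np.random.default_rng(0)))` counts the eigenvalues of the lag-0 matrix that stand clear of the noise bulk; with strong loadings such as these, that count is an estimate of $k(q+1)$.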
-
Strong limit of the extreme eigenvalues of a symmetrized auto-cross covariance matrix
Authors:
Chen Wang,
Baisuo Jin,
Z. D. Bai,
K. Krishnan Nair,
Matthew Harding
Abstract:
The auto-cross covariance matrix is defined as \[\mathbf{M}_n=\frac{1}{2T}\sum_{j=1}^T\bigl(\mathbf{e}_j\mathbf{e}_{j+\tau}^*+\mathbf{e}_{j+\tau}\mathbf{e}_j^*\bigr),\] where the $\mathbf{e}_j$'s are $n$-dimensional vectors of independent standard complex components with a common mean 0, variance $\sigma^2$, and uniformly bounded $(2+\eta)$th moments, and $\tau$ is the lag. Jin et al. [Ann. Appl. Probab. 24 (2014) 1199-1225] have proved that the LSD of $\mathbf{M}_n$ exists uniquely and nonrandomly, and is independent of $\tau$ for all $\tau\ge 1$. In addition, they gave an analytic expression of the LSD. As a continuation of Jin et al. [Ann. Appl. Probab. 24 (2014) 1199-1225], this paper proves that under the condition of uniformly bounded fourth moments, in any closed interval outside the support of the LSD, with probability 1 there will be no eigenvalues of $\mathbf{M}_n$ for all large $n$. As a consequence of the main theorem, the limits of the largest and smallest eigenvalues of $\mathbf{M}_n$ are also obtained.
Submitted 29 October, 2015; v1 submitted 8 December, 2013;
originally announced December 2013.