Skip to main content

Showing 1–50 of 143 results for author: Cai, T

Searching in archive math. Search in all archives.
.
  1. arXiv:2508.16879   

    math.AP

    Inverse problem for fractional Schrödinger equations with drift on closed Riemannian manifolds

    Authors: Tianyu Cai, Xi Chen

    Abstract: This paper is concerned about the inverse coefficient problems of variable-coefficient fractional Schrödinger equations with drift on connected closed Riemannian manifolds. We prove that the knowledge of the underlying equation on any non-empty open subset of the underlying manifold determines the Riemannian metric, the drift and the potential, simultaneously and uniquely, up to a gauge transforma… ▽ More

    Submitted 30 August, 2025; v1 submitted 22 August, 2025; originally announced August 2025.

    Comments: There is a gap in the proof

  2. arXiv:2507.09388  [pdf, ps, other

    math.ST stat.ME stat.ML

    Optimal Differentially Private Ranking from Pairwise Comparisons

    Authors: T. Tony Cai, Abhinav Chakraborty, Yichen Wang

    Abstract: Data privacy is a central concern in many applications involving ranking from incomplete and noisy pairwise comparisons, such as recommendation systems, educational assessments, and opinion surveys on sensitive topics. In this work, we propose differentially private algorithms for ranking based on pairwise comparisons. Specifically, we develop and analyze ranking methods under two privacy notions:… ▽ More

    Submitted 12 July, 2025; originally announced July 2025.

  3. arXiv:2504.08084  [pdf, other

    math.GR

    Generalized torsion in amalgams

    Authors: Tommy Wuxing Cai, Adam Clay

    Abstract: We give a condition sufficient to ensure that an amalgam of groups is generalized torsion-free. As applications, we construct a closed 3-manifold whose fundamental group is generalized torsion-free and non bi-orderable; a one-relator group which is generalized torsion-free and non bi-orderable; and a group which is generalized torsion-free and non left-orderable.

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: 50 pages, 1 figure,

    MSC Class: 05E16; 06F15; 20F60; 57M05

  4. arXiv:2412.18992  [pdf, other

    math.ST cs.LG

    Optimal Federated Learning for Functional Mean Estimation under Heterogeneous Privacy Constraints

    Authors: Tony Cai, Abhinav Chakraborty, Lasse Vuursteen

    Abstract: Federated learning (FL) is a distributed machine learning technique designed to preserve data privacy and security, and it has gained significant importance due to its broad range of applications. This paper addresses the problem of optimal functional mean estimation from discretely sampled data in a federated setting. We consider a heterogeneous framework where the number of individuals, measur… ▽ More

    Submitted 15 January, 2025; v1 submitted 25 December, 2024; originally announced December 2024.

    Comments: 54 pages: 25 page article and 29 pages of appendix

    MSC Class: 62G08; 62C20; 68P27; 62F30

  5. arXiv:2411.15660  [pdf, other

    math.ST cs.IT stat.ML

    Federated PCA and Estimation for Spiked Covariance Matrices: Optimal Rates and Efficient Algorithm

    Authors: Jingyang Li, T. Tony Cai, Dong Xia, Anru R. Zhang

    Abstract: Federated Learning (FL) has gained significant recent attention in machine learning for its enhanced privacy and data security, making it indispensable in fields such as healthcare, finance, and personalized services. This paper investigates federated PCA and estimation for spiked covariance matrices under distributed differential privacy constraints. We establish minimax rates of convergence, wit… ▽ More

    Submitted 23 November, 2024; originally announced November 2024.

  6. arXiv:2410.07454  [pdf, other

    stat.ME cs.LG math.ST

    Representation-Enhanced Neural Knowledge Integration with Application to Large-Scale Medical Ontology Learning

    Authors: Suqi Liu, Tianxi Cai, Xiaoou Li

    Abstract: A large-scale knowledge graph enhances reproducibility in biomedical data discovery by providing a standardized, integrated framework that ensures consistent interpretation across diverse datasets. It improves generalizability by connecting data from various sources, enabling broader applicability of findings across different populations and conditions. Generating reliable knowledge graph, leverag… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  7. arXiv:2406.20088  [pdf, other

    math.ST stat.ME stat.ML

    Minimax And Adaptive Transfer Learning for Nonparametric Classification under Distributed Differential Privacy Constraints

    Authors: Arnab Auddy, T. Tony Cai, Abhinav Chakraborty

    Abstract: This paper considers minimax and adaptive transfer learning for nonparametric classification under the posterior drift model with distributed differential privacy constraints. Our study is conducted within a heterogeneous framework, encompassing diverse sample sizes, varying privacy parameters, and data heterogeneity across different servers. We first establish the minimax misclassification rate,… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    MSC Class: 62G08; 62G20

  8. arXiv:2406.18876  [pdf, other

    math.GR math.GT

    Ordered bases, order-preserving automorphisms and bi-orderable link groups

    Authors: Tommy Wuxing Cai, Adam Clay, Dale Rolfsen

    Abstract: We give a new criterion which guarantees that a free group admits a bi-ordering that is invariant under a given automorphism. As an application, we show that the fundamental group of the "magic manifold" is bi-orderable, answering a question of Kin and Rolfsen.

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 2 figures

    MSC Class: 06F15; 20F60; 57M05; 57K30

  9. arXiv:2406.06755  [pdf, other

    math.ST cs.LG stat.ML

    Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints

    Authors: T. Tony Cai, Abhinav Chakraborty, Lasse Vuursteen

    Abstract: This paper studies federated learning for nonparametric regression in the context of distributed samples across different servers, each adhering to distinct differential privacy constraints. The setting we consider is heterogeneous, encompassing both varying sample sizes and differential privacy constraints across servers. Within this framework, both global and pointwise estimation are considered,… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 49 pages total, consisting of an article (24 pages) and a supplement (25 pages)

    MSC Class: 62G08; 62C20; 68P27; 62F30;

  10. arXiv:2406.06749  [pdf, other

    math.ST cs.LG stat.ML

    Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests

    Authors: T. Tony Cai, Abhinav Chakraborty, Lasse Vuursteen

    Abstract: Federated learning has attracted significant recent attention due to its applicability across a wide range of settings where data is collected and analyzed across disparate locations. In this paper, we study federated nonparametric goodness-of-fit testing in the white-noise-with-drift model under distributed differential privacy (DP) constraints. We first establish matching lower and upper bound… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 77 pages total; consisting of a main article (28 pages) and supplement (49 pages)

    MSC Class: 62G10; 62C20; 68P27; 62F30

  11. arXiv:2403.19410  [pdf, ps, other

    math.NT

    On the Exact Fourier Dimension of Sets of Well-Approximable Matrices

    Authors: Thomas Cai, Kyle Hambrook

    Abstract: We compute the exact Fourier dimension of the set of $Ψ$-well-approximable $m \times n$ matrices (and the set of $Ψ$-well-approximable numbers) in the homogeneous and inhomogeneous cases for any approximation function $Ψ$ satisfying $\sum_{q \in \mathbb{Z}^n} Ψ(q)^m < \infty$.

    Submitted 28 March, 2024; originally announced March 2024.

  12. arXiv:2401.12331  [pdf, other

    math.ST

    Transfer Learning for Functional Mean Estimation: Phase Transition and Adaptive Algorithms

    Authors: T. Tony Cai, Dongwoo Kim, Hongming Pu

    Abstract: This paper studies transfer learning for estimating the mean of random functions based on discretely sampled data, where, in addition to observations from the target distribution, auxiliary samples from similar but distinct source distributions are available. The paper considers both common and independent designs and establishes the minimax rates of convergence for both designs. The results revea… ▽ More

    Submitted 27 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    MSC Class: Primary 62J05; secondary 62G20

  13. arXiv:2401.03820  [pdf, other

    math.ST cs.IT stat.ME stat.ML

    Optimal Differentially Private PCA and Estimation for Spiked Covariance Matrices

    Authors: T. Tony Cai, Dong Xia, Mengyue Zha

    Abstract: Estimating a covariance matrix and its associated principal components is a fundamental problem in contemporary statistics. While optimal estimation procedures have been developed with well-understood properties, the increasing demand for privacy preservation introduces new complexities to this classical problem. In this paper, we study optimal differentially private Principal Component Analysis (… ▽ More

    Submitted 27 September, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  14. arXiv:2305.19997  [pdf, other

    stat.ML math.ST

    Knowledge Graph Embedding with Electronic Health Records Data via Latent Graphical Block Model

    Authors: Junwei Lu, Jin Yin, Tianxi Cai

    Abstract: Due to the increasing adoption of electronic health records (EHR), large scale EHRs have become another rich data source for translational clinical research. Despite its potential, deriving generalizable knowledge from EHR data remains challenging. First, EHR data are generated as part of clinical care with data elements too detailed and fragmented for research. Despite recent progress in mapping… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  15. arXiv:2305.17608  [pdf, other

    cs.LG cs.AI cs.CL math.OC stat.ML

    Reward Collapse in Aligning Large Language Models

    Authors: Ziang Song, Tianle Cai, Jason D. Lee, Weijie J. Su

    Abstract: The extraordinary capabilities of large language models (LLMs) such as ChatGPT and GPT-4 are in part unleashed by aligning them with reward models that are trained on human preferences, which are often represented as rankings of responses to prompts. In this paper, we document the phenomenon of \textit{reward collapse}, an empirical observation where the prevailing ranking-based approach results i… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  16. arXiv:2305.00164  [pdf, other

    math.ST stat.ME

    Estimation and inference for minimizer and minimum of convex functions: optimality, adaptivity and uncertainty principles

    Authors: T. Tony Cai, Ran Chen, Yuancheng Zhu

    Abstract: Optimal estimation and inference for both the minimizer and minimum of a convex regression function under the white noise and nonparametric regression models are studied in a nonasymptotic local minimax framework, where the performance of a procedure is evaluated at individual functions. Fully adaptive and computationally efficient algorithms are proposed and sharp minimax lower bounds are given f… ▽ More

    Submitted 9 March, 2024; v1 submitted 29 April, 2023; originally announced May 2023.

    Journal ref: Ann. Statist. 52(1): 392-411 (February 2024)

  17. arXiv:2303.07152  [pdf, ps, other

    math.ST cs.CR cs.LG stat.ME stat.ML

    Score Attack: A Lower Bound Technique for Optimal Differentially Private Learning

    Authors: T. Tony Cai, Yichen Wang, Linjun Zhang

    Abstract: Achieving optimal statistical performance while ensuring the privacy of personal data is a challenging yet crucial objective in modern data analysis. However, characterizing the optimality, particularly the minimax lower bound, under privacy constraints is technically difficult. To address this issue, we propose a novel approach called the score attack, which provides a lower bound on the differen… ▽ More

    Submitted 12 July, 2025; v1 submitted 13 March, 2023; originally announced March 2023.

  18. arXiv:2301.10392  [pdf, other

    stat.ME math.ST

    Statistical Inference and Large-scale Multiple Testing for High-dimensional Regression Models

    Authors: T. Tony Cai, Zijian Guo, Yin Xia

    Abstract: This paper presents a selective survey of recent developments in statistical inference and multiple testing for high-dimensional regression models, including linear and logistic regression. We examine the construction of confidence intervals and hypothesis tests for various low-dimensional objectives such as regression coefficients and linear and quadratic functionals. The key technique is to gene… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

  19. arXiv:2301.01381  [pdf, other

    stat.ME math.ST stat.ML

    Testing High-dimensional Multinomials with Applications to Text Analysis

    Authors: T. Tony Cai, Zheng Tracy Ke, Paxton Turner

    Abstract: Motivated by applications in text mining and discrete distribution inference, we investigate the testing for equality of probability mass functions of $K$ groups of high-dimensional multinomial distributions. A test statistic, which is shown to have an asymptotic standard normal distribution under the null, is proposed. The optimal detection boundary is established, and the proposed test is shown… ▽ More

    Submitted 24 November, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

  20. arXiv:2211.12612  [pdf, ps, other

    stat.ML cs.LG math.ST

    Transfer Learning for Contextual Multi-armed Bandits

    Authors: Changxiao Cai, T. Tony Cai, Hongzhe Li

    Abstract: Motivated by a range of applications, we study in this paper the problem of transfer learning for nonparametric contextual multi-armed bandits under the covariate shift model, where we have data collected on source bandits before the start of the target bandit learning. The minimax rate of convergence for the cumulative regret is established and a novel transfer learning algorithm that attains the… ▽ More

    Submitted 24 January, 2024; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted to the Annals of Statistics

  21. arXiv:2202.10007  [pdf, other

    stat.ME math.ST stat.AP

    Statistical Inference for Genetic Relatedness Based on High-Dimensional Logistic Regression

    Authors: Rong Ma, Zijian Guo, T. Tony Cai, Hongzhe Li

    Abstract: This paper studies the problem of statistical inference for genetic relatedness between binary traits based on individual-level genome-wide association data. Specifically, under the high-dimensional logistic regression models, we define parameters characterizing the cross-trait genetic correlation, the genetic covariance and the trait-specific genetic variance. A novel weighted debiasing method is… ▽ More

    Submitted 5 October, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

  22. arXiv:2201.06438  [pdf, other

    math.ST stat.ME stat.ML

    Matrix Reordering for Noisy Disordered Matrices: Optimality and Computationally Efficient Algorithms

    Authors: T. Tony Cai, Rong Ma

    Abstract: Motivated by applications in single-cell biology and metagenomics, we investigate the problem of matrix reordering based on a noisy disordered monotone Toeplitz matrix model. We establish the fundamental statistical limit for this problem in a decision-theoretic framework and demonstrate that a constrained least squares estimator achieves the optimal rate. However, due to its computational complex… ▽ More

    Submitted 13 August, 2023; v1 submitted 17 January, 2022; originally announced January 2022.

    Comments: accepted by IEEE Transactions on Information Theory

  23. arXiv:2201.03727  [pdf, ps, other

    stat.ME math.ST

    Estimation and Inference with Proxy Data and its Genetic Applications

    Authors: Sai Li, T. Tony Cai, Hongzhe Li

    Abstract: Existing high-dimensional statistical methods are largely established for analyzing individual-level data. In this work, we study estimation and inference for high-dimensional linear models where we only observe "proxy data", which include the marginal statistics and sample covariance matrix that are computed based on different sets of individuals. We develop a rate optimal method for estimation a… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

  24. arXiv:2112.09313  [pdf, other

    stat.ME math.ST stat.AP

    Federated Adaptive Causal Estimation (FACE) of Target Treatment Effects

    Authors: Larry Han, Jue Hou, Kelly Cho, Rui Duan, Tianxi Cai

    Abstract: Federated learning of causal estimands may greatly improve estimation efficiency by leveraging data from multiple study sites, but robustness to heterogeneity and model misspecifications is vital for ensuring validity. We develop a Federated Adaptive Causal Estimation (FACE) framework to incorporate heterogeneous data from multiple sites to provide treatment effect estimation and inference for a f… ▽ More

    Submitted 5 October, 2023; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: 59 pages

  25. arXiv:2111.02826  [pdf, other

    math.ST stat.ME

    Finding the Optimal Dynamic Treatment Regime Using Smooth Fisher Consistent Surrogate Loss

    Authors: Nilanjana Laha, Aaron Sonabend-W, Rajarshi Mukherjee, Tianxi Cai

    Abstract: Large health care data repositories such as electronic health records (EHR) open new opportunities to derive individualized treatment strategies for complicated diseases such as sepsis. In this paper, we consider the problem of estimating sequential treatment rules tailored to a patient's individual characteristics, often referred to as dynamic treatment regimes (DTRs). Our main objective is to fi… ▽ More

    Submitted 30 September, 2023; v1 submitted 3 November, 2021; originally announced November 2021.

    MSC Class: 62G20 ACM Class: G.3

  26. arXiv:2110.12336  [pdf, other

    stat.ME math.ST

    Efficient and Robust Semi-supervised Estimation of ATE with Partially Annotated Treatment and Response

    Authors: Jue Hou, Rajarshi Mukherjee, Tianxi Cai

    Abstract: A notable challenge of leveraging Electronic Health Records (EHR) for treatment effect assessment is the lack of precise information on important clinical variables, including the treatment received and the response. Both treatment information and response often cannot be accurately captured by readily available EHR features and require labor intensive manual chart review to precisely annotate, wh… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

  27. arXiv:2107.00179  [pdf

    math.ST cs.DC cs.LG stat.ML

    Distributed Nonparametric Function Estimation: Optimal Rate of Convergence and Cost of Adaptation

    Authors: T. Tony Cai, Hongji Wei

    Abstract: Distributed minimax estimation and distributed adaptive estimation under communication constraints for Gaussian sequence model and white noise model are studied. The minimax rate of convergence for distributed estimation over a given Besov class, which serves as a benchmark for the cost of adaptation, is established. We then quantify the exact communication cost for adaptation and construct an opt… ▽ More

    Submitted 30 June, 2021; originally announced July 2021.

    MSC Class: 62F30

  28. arXiv:2105.10360  [pdf, other

    stat.ML cs.LG math.ST stat.AP stat.ME

    Multi-source Learning via Completion of Block-wise Overlapping Noisy Matrices

    Authors: Doudou Zhou, Tianxi Cai, Junwei Lu

    Abstract: Matrix completion has attracted attention in many fields, including statistics, applied mathematics, and electrical engineering. Most of the works focus on the independent sampling models under which the observed entries are sampled independently. Motivated by applications in the integration of knowledge graphs derived from multi-source biomedical data such as those from Electronic Health Records… ▽ More

    Submitted 9 October, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

  29. arXiv:2105.07536  [pdf, other

    stat.ML cs.LG math.ST

    Theoretical Foundations of t-SNE for Visualizing High-Dimensional Clustered Data

    Authors: T. Tony Cai, Rong Ma

    Abstract: This paper investigates the theoretical foundations of the t-distributed stochastic neighbor embedding (t-SNE) algorithm, a popular nonlinear dimension reduction and data visualization method. A novel theoretical framework for the analysis of t-SNE based on the gradient descent approach is presented. For the early exaggeration stage of t-SNE, we show its asymptotic equivalence to power iterations… ▽ More

    Submitted 31 October, 2022; v1 submitted 16 May, 2021; originally announced May 2021.

    Comments: Accepted by Journal of Machine Learning Research

  30. arXiv:2105.01264  [pdf, other

    math.ST stat.ME stat.ML

    Surrogate Assisted Semi-supervised Inference for High Dimensional Risk Prediction

    Authors: Jue Hou, Zijian Guo, Tianxi Cai

    Abstract: Risk modeling with EHR data is challenging due to a lack of direct observations on the disease outcome, and the high dimensionality of the candidate predictors. In this paper, we develop a surrogate assisted semi-supervised-learning (SAS) approach to risk modeling with high dimensional predictors, leveraging a large unlabeled data on candidate predictors and surrogates of outcome, as well as a sma… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

  31. arXiv:2103.12846  [pdf, ps, other

    math.ST

    On the global identifiability of logistic regression models with misclassified outcomes

    Authors: Rui Duan, Yang Ning, Jiasheng Shi, Raymond J Carroll, Tianxi Cai, Yong Chen

    Abstract: In the last decade, the secondary use of large data from health systems, such as electronic health records, has demonstrated great promise in advancing biomedical discoveries and improving clinical decision making. However, there is an increasing concern about biases in association studies caused by misclassification in the binary outcomes derived from electronic health records. We revisit the cla… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

  32. arXiv:2102.08807  [pdf, other

    math.OC

    The Linearized Hellinger--Kantorovich Distance

    Authors: Tianji Cai, Junyi Cheng, Bernhard Schmitzer, Matthew Thorpe

    Abstract: In this paper we study the local linearization of the Hellinger--Kantorovich distance via its Riemannian structure. We give explicit expressions for the logarithmic and exponential map and identify a suitable notion of a Riemannian inner product. Samples can thus be represented as vectors in the tangent space of a suitable reference measure where the norm locally approximates the original metric.… ▽ More

    Submitted 24 September, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

  33. arXiv:2011.03900  [pdf, other

    stat.ML cs.CR cs.LG math.ST stat.ME

    The Cost of Privacy in Generalized Linear Models: Algorithms and Minimax Lower Bounds

    Authors: T. Tony Cai, Yichen Wang, Linjun Zhang

    Abstract: We propose differentially private algorithms for parameter estimation in both low-dimensional and high-dimensional sparse generalized linear models (GLMs) by constructing private versions of projected gradient descent. We show that the proposed algorithms are nearly rate-optimal by characterizing their statistical performance and establishing privacy-constrained minimax lower bounds for GLMs. The… ▽ More

    Submitted 5 December, 2020; v1 submitted 7 November, 2020; originally announced November 2020.

    Comments: 56 pages, 6 figures

  34. arXiv:2009.03294  [pdf, other

    cs.LG math.OC stat.ML

    GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training

    Authors: Tianle Cai, Shengjie Luo, Keyulu Xu, Di He, Tie-Yan Liu, Liwei Wang

    Abstract: Normalization is known to help the optimization of deep neural networks. Curiously, different architectures require specialized normalization methods. In this paper, we study what normalization is effective for Graph Neural Networks (GNNs). First, we adapt and evaluate the existing methods from other domains to GNNs. Faster convergence is achieved with InstanceNorm compared to BatchNorm and LayerN… ▽ More

    Submitted 11 June, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: ICML 2021, Code: https://github.com/lsj2408/GraphNorm

  35. arXiv:2008.12434  [pdf, ps, other

    math.ST math.PR

    On the Non-Asymptotic Concentration of Heteroskedastic Wishart-type Matrix

    Authors: T. Tony Cai, Rungang Han, Anru R. Zhang

    Abstract: This paper focuses on the non-asymptotic concentration of the heteroskedastic Wishart-type matrices. Suppose $Z$ is a $p_1$-by-$p_2$ random matrix and $Z_{ij} \sim N(0,σ_{ij}^2)$ independently, we prove the expected spectral norm of Wishart matrix deviations (i.e., $\mathbb{E} \left\|ZZ^\top - \mathbb{E} ZZ^\top\right\|$) is upper bounded by \begin{equation*} \begin{split} (1+ε)\left\{2σ_Cσ_R… ▽ More

    Submitted 16 February, 2022; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Electronic Journal of Probability, to appear

  36. arXiv:2006.02025  [pdf, ps, other

    math.QA math.CO

    Deformation of Cayley's hyperdeterminants

    Authors: Tommy Wuxing Cai, Naihuan Jing

    Abstract: We introduce a deformation of Cayley's second hyperdeterminant for even-dimensional hypermatrices. As an application, we formulate a generalization of the Jacobi-Trudi formula for Macdonald functions of rectangular shapes generalizing Matsumoto's formula for Jack functions.

    Submitted 5 June, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 9 pages, 0 figures

    MSC Class: Primary: 05E05; Secondary: 17B69; 05E10

    Journal ref: Elec. J. Combin. 27(2) (2020) P2.50

  37. arXiv:2002.07624  [pdf, other

    math.ST stat.ML

    Optimal Structured Principal Subspace Estimation: Metric Entropy and Minimax Rates

    Authors: T. Tony Cai, Hongzhe Li, Rong Ma

    Abstract: Driven by a wide range of applications, many principal subspace estimation problems have been studied individually under different structural constraints. This paper presents a unified framework for the statistical analysis of a general structured principal subspace estimation problem which includes as special cases non-negative PCA/SVD, sparse PCA/SVD, subspace constrained PCA/SVD, and spectral c… ▽ More

    Submitted 16 November, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

  38. arXiv:2001.08877  [pdf, other

    math.ST cs.DC cs.IT cs.LG stat.ML

    Distributed Gaussian Mean Estimation under Communication Constraints: Optimal Rates and Communication-Efficient Algorithms

    Authors: T. Tony Cai, Hongji Wei

    Abstract: We study distributed estimation of a Gaussian mean under communication constraints in a decision theoretical framework. Minimax rates of convergence, which characterize the tradeoff between the communication costs and statistical accuracy, are established in both the univariate and multivariate settings. Communication-efficient and statistically optimal procedures are developed. In the univariate… ▽ More

    Submitted 23 January, 2020; originally announced January 2020.

  39. arXiv:1912.03870  [pdf, ps, other

    math.QA hep-th math-ph quant-ph

    Correlation functions of charged free boson and fermion systems

    Authors: Naihuan Jing, Zhijun Li, Tommy Wuxing Cai

    Abstract: Using the idea of the quantum inverse scattering method, we introduce the operators $\mathbf{B}(x), \mathbf{C}(x)$ and $\mathbf{\tilde{B}}(x), \mathbf{\tilde{C}}(x)$ corresponding to the off-diagonal entries of the monodromy matrix $T$ for the phase model and $i$-boson model in terms of bc fermions and neutral fermions respectively, thus giving alternative treatment of the KP and BKP hierarchies.… ▽ More

    Submitted 16 June, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: 26 pages. Final version for J. Stat. Mech

    MSC Class: Primary: 17B37; Secondary: 58A17; 15A75; 15B33; 15A15; 05E05

    Journal ref: J. Stat. Mech. (2020), 083101, 27pp

  40. Optimal Estimation of Bacterial Growth Rates Based on Permuted Monotone Matrix

    Authors: Rong Ma, T. Tony Cai, Hongzhe Li

    Abstract: Motivated by the problem of estimating the bacterial growth rates for genome assemblies from shotgun metagenomic data, we consider the permuted monotone matrix model $Y=ΘΠ+Z$, where $Y\in \mathbb{R}^{n\times p}$ is observed, $Θ\in \mathbb{R}^{n\times p}$ is an unknown approximately rank-one signal matrix with monotone rows, $Π\in \mathbb{R}^{p\times p}$ is an unknown permutation matrix, and… ▽ More

    Submitted 26 August, 2020; v1 submitted 27 November, 2019; originally announced November 2019.

    Journal ref: Biometrika (2020)

  41. arXiv:1911.11345  [pdf, other

    stat.ME math.ST stat.ML

    High Dimensional M-Estimation with Missing Outcomes: A Semi-Parametric Framework

    Authors: Abhishek Chakrabortty, Jiarui Lu, T. Tony Cai, Hongzhe Li

    Abstract: We consider high dimensional $M$-estimation in settings where the response $Y$ is possibly missing at random and the covariates $\mathbf{X} \in \mathbb{R}^p$ can be high dimensional compared to the sample size $n$. The parameter of interest $\boldsymbolθ_0 \in \mathbb{R}^d$ is defined as the minimizer of the risk of a convex loss, under a fully non-parametric model, and $\boldsymbolθ_0$ itself is… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: 34 pages, 4 tables; (Supplement: 58 pages, 10 tables);

  42. Optimal Permutation Recovery in Permuted Monotone Matrix Model

    Authors: Rong Ma, T. Tony Cai, Hongzhe Li

    Abstract: Motivated by recent research on quantifying bacterial growth dynamics based on genome assemblies, we consider a permuted monotone matrix model $Y=ΘΠ+Z$, where the rows represent different samples, the columns represent contigs in genome assemblies and the elements represent log-read counts after preprocessing steps and Guanine-Cytosine (GC) adjustment. In this model, $Θ$ is an unknown mean matrix… ▽ More

    Submitted 13 July, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

    Journal ref: Journal of the American Statistical Association, 2020

  43. arXiv:1911.08176  [pdf, ps, other

    math.NT

    A Generalization of A Result of Gauss on Primitive Root

    Authors: Hao Zhong, Tianxin Cai

    Abstract: A primitive root modulo an integer $n$ is the generator of the multiplicative group of integers modulo $n$. Gauss proved that for any prime number $p$ greater than $3$, the sum of its primitive roots is congruent to $1$ modulo $p$ while its product is congruent to $μ(p-1)$ modulo $p$, where $μ$ is the Möbius function. In this paper, we will generalize these two interesting congruences and give the… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: 9 pages

  44. arXiv:1909.09851  [pdf, other

    math.ST cs.LG stat.ML

    Sparse Group Lasso: Optimal Sample Complexity, Convergence Rate, and Statistical Inference

    Authors: T. Tony Cai, Anru R. Zhang, Yuchen Zhou

    Abstract: We study sparse group Lasso for high-dimensional double sparse linear regression, where the parameter of interest is simultaneously element-wise and group-wise sparse. This problem is an important instance of the simultaneously structured model -- an actively studied topic in statistics and machine learning. In the noiseless case, matching upper and lower bounds on sample complexity are establishe… ▽ More

    Submitted 6 May, 2022; v1 submitted 21 September, 2019; originally announced September 2019.

    Comments: IEEE Transactions on Information Theory, to appear

  45. arXiv:1908.05598  [pdf, ps, other

    math.NT

    On the divisor problem with congruence conditions

    Authors: Lirui Jia, Wenguang Zhai, Tianxin Cai

    Abstract: Let $d(n; r_1, q_1, r_2, q_2)$ be the number of factorization $n=n_1n_2$ satisfying $n_i\equiv r_i\pmod{q_i}$ ($i=1,2$) and $Δ(x; r_1, q_1, r_2, q_2)$ be the error term of the summatory function of $d(n; r_1, q_1, r_2, q_2)$ with $x\geq (q_1q_2)^{1+\varepsilon}, 1\leq r_i\leq q_i$, and $(r_i, q_i)=1$ ($i=1, 2$). We study the power moments and sign changes of $Δ(x; r_1, q_1, r_2, q_2)$, and prove t… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1603.04977

  46. arXiv:1906.02903  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Transfer Learning for Nonparametric Classification: Minimax Rate and Adaptive Classifier

    Authors: T. Tony Cai, Hongji Wei

    Abstract: Human learners have the natural ability to use knowledge gained in one setting for learning in a different but related setting. This ability to transfer knowledge from one task to another is essential for effective learning. In this paper, we study transfer learning in the context of nonparametric classification based on observations from different distributions under the posterior drift model, wh… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  47. arXiv:1905.11675  [pdf, ps, other

    cs.LG math.OC stat.ML

    Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems

    Authors: Tianle Cai, Ruiqi Gao, Jikai Hou, Siyu Chen, Dong Wang, Di He, Zhihua Zhang, Liwei Wang

    Abstract: First-order methods such as stochastic gradient descent (SGD) are currently the standard algorithm for training deep neural networks. Second-order methods, despite their better convergence rate, are rarely used in practice due to the prohibitive computational cost in calculating the second-order information. In this paper, we propose a novel Gram-Gauss-Newton (GGN) algorithm to train deep neural n… ▽ More

    Submitted 25 September, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

  48. arXiv:1905.08757  [pdf, other

    math.ST math.PR

    Asymptotic Analysis for Extreme Eigenvalues of Principal Minors of Random Matrices

    Authors: T. Tony Cai, Tiefeng Jiang, Xiaoou Li

    Abstract: Consider a standard white Wishart matrix with parameters $n$ and $p$. Motivated by applications in high-dimensional statistics and signal processing, we perform asymptotic analysis on the maxima and minima of the eigenvalues of all the $m \times m$ principal minors, under the asymptotic regime that $n,p,m$ go to infinity. Asymptotic results concerning extreme eigenvalues of principal minors of rea… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

  49. arXiv:1904.12891  [pdf, other

    stat.ME math.ST

    Optimal Statistical Inference for Individualized Treatment Effects in High-dimensional Models

    Authors: Tianxi Cai, Tony Cai, Zijian Guo

    Abstract: The ability to predict individualized treatment effects (ITEs) based on a given patient's profile is essential for personalized medicine. We propose a hypothesis testing approach to choosing between two potential treatments for a given individual in the framework of high-dimensional linear models. The methodological novelty lies in the construction of a debiased estimator of the ITE and establishm… ▽ More

    Submitted 7 August, 2020; v1 submitted 29 April, 2019; originally announced April 2019.

  50. arXiv:1810.08316  [pdf, other

    math.ST stat.CO stat.ME stat.ML

    Heteroskedastic PCA: Algorithm, Optimality, and Applications

    Authors: Anru R. Zhang, T. Tony Cai, Yihong Wu

    Abstract: A general framework for principal component analysis (PCA) in the presence of heteroskedastic noise is introduced. We propose an algorithm called HeteroPCA, which involves iteratively imputing the diagonal entries of the sample covariance matrix to remove estimation bias due to heteroskedasticity. This procedure is computationally efficient and provably optimal under the generalized spiked covaria… ▽ More

    Submitted 1 April, 2021; v1 submitted 18 October, 2018; originally announced October 2018.