Showing 1–50 of 88 results for author: Gao, C

Searching in archive stat.
  1. arXiv:2507.03271  [pdf, ps, other]

    stat.ML cs.LG

    LILI clustering algorithm: Limit Inferior Leaf Interval Integrated into Causal Forest for Causal Interference

    Authors: Yiran Dong, Di Fan, Chuanhou Gao

    Abstract: Causal forest methods are powerful tools in causal inference. Similar to traditional random forest in machine learning, causal forest independently considers each causal tree. However, this independence consideration increases the likelihood that classification errors in one tree are repeated in others, potentially leading to significant bias in causal effect estimation. In this paper, we propose a…

    Submitted 3 July, 2025; originally announced July 2025.

  2. arXiv:2504.02723  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Computing High-dimensional Confidence Sets for Arbitrary Distributions

    Authors: Chao Gao, Liren Shan, Vaidehi Srinivas, Aravindan Vijayaraghavan

    Abstract: We study the problem of learning a high-density region of an arbitrary distribution over $\mathbb{R}^d$. Given a target coverage parameter $δ$, and sample access to an arbitrary distribution $D$, we want to output a confidence set $S \subset \mathbb{R}^d$ such that $S$ achieves $δ$ coverage of $D$, i.e., $\mathbb{P}_{y \sim D} \left[ y \in S \right] \ge δ$, and the volume of $S$ is as small as pos…

    Submitted 12 May, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

    Comments: Improves volume approximation factor from $\exp(\tilde{O}(d^{2/3}))$ to $\exp(\tilde{O}(d^{1/2}))$, along with other minor edits. To appear in COLT 2025

  3. arXiv:2503.06864  [pdf, other]

    stat.ME stat.AP

    Doubly robust omnibus sensitivity analysis of externally controlled trials with intercurrent events

    Authors: Chenyin Gao, Xiang Zhang, Shu Yang

    Abstract: Externally controlled trials are crucial in clinical development when randomized controlled trials are unethical or impractical. These trials consist of a full treatment arm with the experimental treatment and a full external control arm. However, they present significant challenges in learning the treatment effect due to the lack of randomization and a parallel control group. Besides baseline inc…

    Submitted 9 March, 2025; originally announced March 2025.

  4. arXiv:2502.16658  [pdf, other]

    cs.LG stat.ML

    Volume Optimality in Conformal Prediction with Structured Prediction Sets

    Authors: Chao Gao, Liren Shan, Vaidehi Srinivas, Aravindan Vijayaraghavan

    Abstract: Conformal Prediction is a widely studied technique to construct prediction sets of future observations. Most conformal prediction methods focus on achieving the necessary coverage guarantees, but do not provide formal guarantees on the size (volume) of the prediction sets. We first prove an impossibility of volume optimality where any distribution-free method can only find a trivial solution. We t…

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: 41 pages, 19 figures, 2 tables

  5. arXiv:2412.14497  [pdf, other]

    cs.LG cs.AI stat.ML

    Disentangled Graph Autoencoder for Treatment Effect Estimation

    Authors: Di Fan, Renlei Jiang, Yunhao Wen, Chuanhou Gao

    Abstract: Treatment effect estimation from observational data has attracted significant attention across various research fields. However, many widely used methods rely on the unconfoundedness assumption, which is often unrealistic due to the inability to observe all confounders, thereby overlooking the influence of latent confounders. To address this limitation, recent approaches have utilized auxiliary ne…

    Submitted 20 February, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: 22 pages, 6 figures

  6. arXiv:2412.12365  [pdf, other]

    stat.ME math.ST stat.AP stat.ML

    On the Role of Surrogates in Conformal Inference of Individual Causal Effects

    Authors: Chenyin Gao, Peter B. Gilbert, Larry Han

    Abstract: Learning the Individual Treatment Effect (ITE) is essential for personalized decision-making, yet causal inference has traditionally focused on aggregated treatment effects. While integrating conformal prediction with causal inference can provide valid uncertainty quantification for ITEs, the resulting prediction intervals are often excessively wide, limiting their practical utility. To address th…

    Submitted 21 January, 2025; v1 submitted 16 December, 2024; originally announced December 2024.

  7. arXiv:2412.11003  [pdf, other]

    cs.LG math.OC stat.ML

    Optimal Rates for Robust Stochastic Convex Optimization

    Authors: Changyu Gao, Andrew Lowy, Xingyu Zhou, Stephen J. Wright

    Abstract: Machine learning algorithms in high-dimensional settings are highly susceptible to the influence of even a small fraction of structured outliers, making robust optimization techniques essential. In particular, within the $ε$-contamination model, where an adversary can inspect and replace up to an $ε$-fraction of the samples, a fundamental open problem is determining the optimal rates for robust st…

    Submitted 23 April, 2025; v1 submitted 14 December, 2024; originally announced December 2024.

    Comments: The 6th annual Symposium on Foundations of Responsible Computing (FORC 2025)

  8. arXiv:2412.05421  [pdf, other]

    cs.LG cs.AI stat.ML

    KEDformer: Knowledge Extraction Seasonal Trend Decomposition for Long-term Sequence Prediction

    Authors: Zhenkai Qin, Baozhong Wei, Caifeng Gao, Jianyuan Ni

    Abstract: Time series forecasting is a critical task in domains such as energy, finance, and meteorology, where accurate long-term predictions are essential. While Transformer-based models have shown promise in capturing temporal dependencies, their application to extended sequences is limited by computational inefficiencies and limited generalization. In this study, we propose KEDformer, a knowledge extrac…

    Submitted 6 December, 2024; originally announced December 2024.

  9. arXiv:2412.03528  [pdf, other]

    stat.AP stat.ME

    The R.O.A.D. to clinical trial emulation

    Authors: Dimitris Bertsimas, Angelos G. Koulouras, Hiroshi Nagata, Carol Gao, Junki Mizusawa, Yukihide Kanemitsu, Georgios Antonios Margonis

    Abstract: Observational studies provide the only evidence on the effectiveness of interventions when randomized controlled trials (RCTs) are impractical due to cost, ethical concerns, or time constraints. While many methodologies aim to draw causal inferences from observational data, there is a growing trend to model observational study designs after RCTs, a strategy known as "target trial emulation." Despi…

    Submitted 4 December, 2024; originally announced December 2024.

  10. arXiv:2410.23610  [pdf, other]

    stat.ML cs.LG math.ST

    Global Convergence in Training Large-Scale Transformers

    Authors: Cheng Gao, Yuan Cao, Zihao Li, Yihan He, Mengdi Wang, Han Liu, Jason Matthew Klusowski, Jianqing Fan

    Abstract: Despite the widespread success of Transformers across various domains, their optimization guarantees in large-scale model settings are not well-understood. This paper rigorously analyzes the convergence properties of gradient flow in training Transformers with weight decay regularization. First, we construct the mean-field limit of large-scale Transformers, showing that as the model width and dept…

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: to be published in 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

    MSC Class: 35Q93

  11. arXiv:2410.22647  [pdf, ps, other]

    math.ST stat.ME

    Adaptive Robust Confidence Intervals

    Authors: Yuetian Luo, Chao Gao

    Abstract: This paper studies the construction of adaptive confidence intervals under Huber's contamination model when the contamination proportion is unknown. For the robust confidence interval of a Gaussian mean, we show that the optimal length of an adaptive interval must be exponentially wider than that of a non-adaptive one. An optimal construction is achieved through simultaneous uncertainty quantifica…

    Submitted 3 June, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

  12. arXiv:2410.18409  [pdf, other]

    stat.ME stat.AP

    Doubly protected estimation for survival outcomes utilizing external controls for randomized clinical trials

    Authors: Chenyin Gao, Shu Yang, Mingyang Shan, Wenyu Wendy Ye, Ilya Lipkovich, Douglas Faries

    Abstract: Censored survival data are common in clinical trials, but small control groups can pose challenges, particularly in rare diseases or where balanced randomization is impractical. Recent approaches leverage external controls from historical studies or real-world data to strengthen treatment evaluation for survival outcomes. However, using external controls directly may introduce biases due to data h…

    Submitted 14 May, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: accepted at ICML 2025

  13. arXiv:2410.05225  [pdf, other]

    cs.LG cs.RO stat.ML

    ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control

    Authors: Ehsan Futuhi, Shayan Karimi, Chao Gao, Martin Müller

    Abstract: We consider deep deterministic policy gradient (DDPG) in the context of reinforcement learning with sparse rewards. To enhance exploration, we introduce a search procedure, $εt$-greedy, which generates exploratory options for exploring less-visited states. We prove that search using $εt$-greedy has polynomial sample complexity under mild MDP assumptions. To more efficiently use the inform…

    Submitted 17 February, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: We have expanded the related work section with more detailed discussions and enhanced our experiments by incorporating additional data and analysis

  14. arXiv:2405.11377  [pdf, other]

    stat.ML cs.LG stat.ME

    Causal Customer Churn Analysis with Low-rank Tensor Block Hazard Model

    Authors: Chenyin Gao, Zhiming Zhang, Shu Yang

    Abstract: This study introduces an innovative method for analyzing the impact of various interventions on customer churn, using the potential outcomes framework. We present a new causal model, the tensorized latent factor block hazard model, which incorporates tensor completion methods for a principled causal analysis of customer churn. A crucial element of our approach is the formulation of a 1-bit tensor…

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in ICML, 2024

  15. arXiv:2402.01143  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning Network Representations with Disentangled Graph Auto-Encoder

    Authors: Di Fan, Chuanhou Gao

    Abstract: The (variational) graph auto-encoder is widely used to learn representations for graph-structured data. However, the formation of real-world graphs is a complicated and heterogeneous process influenced by latent factors. Existing encoders are fundamentally holistic, neglecting the entanglement of latent factors. This reduces the effectiveness of graph analysis tasks, while also making it more diff…

    Submitted 16 July, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 15 pages, 9 figures

  16. arXiv:2401.06350  [pdf, ps, other]

    math.ST stat.ME

    Optimal estimation of the null distribution in large-scale inference

    Authors: Subhodh Kotekal, Chao Gao

    Abstract: The advent of large-scale inference has spurred reexamination of conventional statistical thinking. In a Gaussian model for $n$ many $z$-scores with at most $k < \frac{n}{2}$ nonnulls, Efron suggests estimating the location and scale parameters of the null distribution. Placing no assumptions on the nonnull effects, the statistical task can be viewed as a robust estimation problem. However, the be…

    Submitted 14 January, 2025; v1 submitted 11 January, 2024; originally announced January 2024.

  17. arXiv:2312.09356  [pdf, other]

    math.ST stat.ME

    Sparsity meets correlation in Gaussian sequence model

    Authors: Subhodh Kotekal, Chao Gao

    Abstract: We study estimation of an $s$-sparse signal in the $p$-dimensional Gaussian sequence model with equicorrelated observations and derive the minimax rate. A new phenomenon emerges from correlation, namely the rate scales with respect to $p-2s$ and exhibits a phase transition at $p-2s \asymp \sqrt{p}$. Correlation is shown to be a blessing provided it is sufficiently strong, and the critical correlat…

    Submitted 21 January, 2025; v1 submitted 14 December, 2023; originally announced December 2023.

  18. arXiv:2310.04606  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Robust Transfer Learning with Unreliable Source Data

    Authors: Jianqing Fan, Cheng Gao, Jason M. Klusowski

    Abstract: This paper addresses challenges in robust transfer learning stemming from ambiguity in Bayes classifiers and weak transferable signals between the target and source distribution. We introduce a novel quantity called the "ambiguity level" that measures the discrepancy between the target and source regression functions, propose a simple transfer learning procedure, and establish a general theorem…

    Submitted 3 May, 2025; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted for publication in the Annals of Statistics

  19. arXiv:2309.07273  [pdf]

    stat.ME stat.AP

    Real Effect or Bias? Best Practices for Evaluating the Robustness of Real-World Evidence through Quantitative Sensitivity Analysis for Unmeasured Confounding

    Authors: Douglas Faries, Chenyin Gao, Xiang Zhang, Chad Hazlett, James Stamey, Shu Yang, Peng Ding, Mingyang Shan, Kristin Sheffield, Nancy Dreyer

    Abstract: The assumption of no unmeasured confounders is a critical but unverifiable assumption required for causal inference, yet quantitative sensitivity analyses to assess robustness of real-world evidence remain underutilized. The lack of use is likely in part due to complexity of implementation and often specific and restrictive data requirements for application of each method. With the advent…

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 16 pages, including 5 figures

    MSC Class: Primary 62

  20. arXiv:2308.15728  [pdf, ps, other]

    math.ST cs.CC cs.DS stat.ML

    Computational Lower Bounds for Graphon Estimation via Low-degree Polynomials

    Authors: Yuetian Luo, Chao Gao

    Abstract: Graphon estimation has been one of the most fundamental problems in network analysis and has received considerable attention in the past decade. From the statistical perspective, the minimax error rate of graphon estimation has been established by Gao et al (2015) for both stochastic block model and nonparametric graphon estimation. The statistical optimal estimators are based on constrained least…

    Submitted 12 August, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: Added low-degree upper bound

  21. arXiv:2307.00227  [pdf, other]

    stat.ML cs.LG

    Causal Structure Learning by Using Intersection of Markov Blankets

    Authors: Yiran Dong, Chuanhou Gao

    Abstract: In this paper, we introduce a novel causal structure learning algorithm called Endogenous and Exogenous Markov Blankets Intersection (EEMBI), which combines the properties of Bayesian networks and Structural Causal Models (SCM). Furthermore, we propose an extended version of EEMBI, namely EEMBI-PC, which integrates the last step of the PC algorithm into EEMBI.

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 41 pages, 13 figures

  22. arXiv:2306.16642  [pdf, other]

    stat.ME stat.AP

    Improving randomized controlled trial analysis via data-adaptive borrowing

    Authors: Chenyin Gao, Shu Yang, Mingyang Shan, Wenyu Ye, Ilya Lipkovich, Douglas Faries

    Abstract: In recent years, real-world external controls have grown in popularity as a tool to empower randomized placebo-controlled trials, particularly in rare diseases or cases where balanced randomization is unethical or impractical. However, as external controls are not always comparable to the trials, direct borrowing without scrutiny may heavily bias the treatment effect estimator. Our paper proposes…

    Submitted 12 November, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: accepted by Biometrika

  23. arXiv:2305.17801  [pdf, other]

    stat.ME stat.AP

    Pretest estimation in combining probability and non-probability samples

    Authors: Chenyin Gao, Shu Yang

    Abstract: Multiple heterogeneous data sources are becoming increasingly available for statistical analyses in the era of big data. As an important example in finite-population inference, we develop a unified framework of the test-and-pool approach to general parameter estimation by combining gold-standard probability and non-probability samples. We focus on the case when the study variable is observed in bo…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Accepted in Electronic Journal of Statistics

  24. arXiv:2304.09398  [pdf, ps, other]

    math.ST stat.ME stat.ML

    Minimax Signal Detection in Sparse Additive Models

    Authors: Subhodh Kotekal, Chao Gao

    Abstract: Sparse additive models are an attractive choice in circumstances calling for modelling flexibility in the face of high dimensionality. We study the signal detection problem and establish the minimax separation rate for the detection of a sparse additive signal. Our result is nonasymptotic and applicable to the general case where the univariate component functions belong to a generic reproducing ke…

    Submitted 1 October, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

  25. arXiv:2304.09010  [pdf, other]

    cs.LG stat.ME

    Causal Flow-based Variational Auto-Encoder for Disentangled Causal Representation Learning

    Authors: Di Fan, Yannian Kou, Chuanhou Gao

    Abstract: Disentangled representation learning aims to learn low-dimensional representations where each dimension corresponds to an underlying generative factor. While the Variational Auto-Encoder (VAE) is widely used for this purpose, most existing methods assume independence among factors, a simplification that does not hold in many real-world scenarios where factors are often interdependent and exhibit c…

    Submitted 30 December, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: 22 pages, 14 figures

  26. arXiv:2302.04972  [pdf, ps, other]

    cs.LG cs.CR math.OC stat.ML

    Differentially Private Optimization for Smooth Nonconvex ERM

    Authors: Changyu Gao, Stephen J. Wright

    Abstract: We develop simple differentially private optimization algorithms that move along directions of (expected) descent to find an approximate second-order solution for nonconvex ERM. We use line search, mini-batching, and a two-phase strategy to improve the speed and practicality of the algorithm. Numerical experiments demonstrate the effectiveness of these approaches.

    Submitted 9 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  27. arXiv:2209.12715  [pdf, other]

    cs.CV cs.LG stat.AP stat.ML

    Enhancing convolutional neural network generalizability via low-rank weight approximation

    Authors: Chenyin Gao, Shu Yang, Anru R. Zhang

    Abstract: Noise is ubiquitous during image acquisition. Sufficient denoising is often an important first step for image processing. In recent decades, deep neural networks (DNNs) have been widely used for image denoising. Most DNN-based image denoising methods require a large-scale dataset or focus on supervised settings, in which single/pairs of clean images or a set of noisy images are required. This pose…

    Submitted 1 August, 2024; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: accepted by IET Image Processing

  28. Soft calibration for selection bias problems under mixed-effects models

    Authors: Chenyin Gao, Shu Yang, Jae Kwang Kim

    Abstract: Calibration weighting has been widely used to correct selection biases in non-probability sampling, missing data, and causal inference. The main idea is to calibrate the biased sample to the benchmark by adjusting the subject weights. However, hard calibration can produce enormous weights when an exact calibration is enforced on a large set of extraneous covariates. This article proposes a soft ca…

    Submitted 22 February, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in Biometrika

  29. arXiv:2204.09532  [pdf, other]

    stat.ML cs.LG

    Gaussian mixture modeling of nodes in Bayesian network according to maximal parental cliques

    Authors: Yiran Dong, Chuanhou Gao

    Abstract: This paper uses a Gaussian mixture model instead of a linear Gaussian model to fit the distribution of every node in a Bayesian network. We explain why and how we use Gaussian mixture models in Bayesian networks. We also propose a new method, called the double iteration algorithm, to optimize the mixture model; the double iteration algorithm combines the expectation maximization algorithm and gradi…

    Submitted 16 May, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: 22 pages, 6 figures

  30. arXiv:2202.11276  [pdf, other]

    stat.ME stat.AP

    Nearest neighbor ratio imputation with incomplete multi-nomial outcome in survey sampling

    Authors: Chenyin Gao, Katherine Jenny Thompson, Shu Yang, Jae Kwang Kim

    Abstract: Nonresponse is a common problem in survey sampling. Appropriate treatment can be challenging, especially when dealing with detailed breakdowns of totals. Often, the nearest neighbor imputation method is used to handle such incomplete multinomial data. In this article, we investigate the nearest neighbor ratio imputation estimator, in which auxiliary variables are used to identify the closest donor…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted for publication in JRSS(A)

  31. arXiv:2111.08493  [pdf, other]

    stat.ML cs.LG

    ELBD: Efficient score algorithm for feature selection on latent variables of VAE

    Authors: Yiran Dong, Chuanhou Gao

    Abstract: In this paper, we develop the notion of evidence lower bound difference (ELBD), based on which an efficient score algorithm is presented to implement feature selection on latent variables of VAE and its variants. Further, we propose weak convergence approximation algorithms to optimize VAE related models through weighing the "more important" latent variables selected and accordingly increasing ev…

    Submitted 10 October, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: 16 pages, 7 figures

  32. arXiv:2110.12966  [pdf, ps, other]

    math.ST stat.ME

    Minimax rates for sparse signal detection under correlation

    Authors: Subhodh Kotekal, Chao Gao

    Abstract: We fully characterize the nonasymptotic minimax separation rate for sparse signal detection in the Gaussian sequence model with $p$ equicorrelated observations, generalizing a result of Collier, Comminges, and Tsybakov. As a consequence of the rate characterization, we find that strong correlation is a blessing, moderate correlation is a curse, and weak correlation is irrelevant. Moreover, the thr…

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 74 pages

  33. arXiv:2110.03874  [pdf, other]

    math.ST stat.ML

    Uncertainty quantification in the Bradley-Terry-Luce model

    Authors: Chao Gao, Yandi Shen, Anderson Y. Zhang

    Abstract: The Bradley-Terry-Luce (BTL) model is a benchmark model for pairwise comparisons between individuals. Despite recent progress on the first-order asymptotics of several popular procedures, the understanding of uncertainty quantification in the BTL model remains largely incomplete, especially when the underlying comparison graph is sparse. In this paper, we fill this gap by focusing on two estimator…

    Submitted 9 August, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

  34. arXiv:2109.13491  [pdf, ps, other]

    math.ST math.OC stat.ML

    Optimal Orthogonal Group Synchronization and Rotation Group Synchronization

    Authors: Chao Gao, Anderson Y. Zhang

    Abstract: We study the statistical estimation problem of orthogonal group synchronization and rotation group synchronization. The model is $Y_{ij} = Z_i^* Z_j^{*T} + σW_{ij}\in\mathbb{R}^{d\times d}$ where $W_{ij}$ is a Gaussian random matrix and $Z_i^*$ is either an orthogonal matrix or a rotation matrix, and each $Y_{ij}$ is observed independently with probability $p$. We analyze an iterative polar decomp…

    Submitted 25 April, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

  35. arXiv:2107.02847  [pdf, other]

    stat.ML cs.LG

    Transfer Learning in Information Criteria-based Feature Selection

    Authors: Shaohan Chen, Nikolaos V. Sahinidis, Chuanhou Gao

    Abstract: This paper investigates the effectiveness of transfer learning based on Mallows' Cp. We propose a procedure that combines transfer learning with Mallows' Cp (TLCp) and prove that it outperforms the conventional Mallows' Cp criterion in terms of accuracy and stability. Our theoretical results indicate that, for any sample size in the target domain, the proposed TLCp estimator performs better than t…

    Submitted 29 May, 2022; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: Accepted to the Journal of Machine Learning Research

    ACM Class: I.3; I.5

  36. arXiv:2106.15400  [pdf, other]

    cs.LG stat.ML

    Online Interaction Detection for Click-Through Rate Prediction

    Authors: Qiuqiang Lin, Chuanhou Gao

    Abstract: Click-Through Rate prediction aims to predict the ratio of clicks to impressions of a specific link. This is a challenging task since (1) there are usually categorical features, and the inputs will be extremely high-dimensional if one-hot encoding is applied, (2) not only the original features but also their interactions are important, (3) an effective prediction may rely on different features and…

    Submitted 27 June, 2021; originally announced June 2021.

    Comments: 11 pages, 4 figures, 1 supplement

  37. arXiv:2104.04714  [pdf, other]

    stat.ML cs.LG

    Random Intersection Chains

    Authors: Qiuqiang Lin, Chuanhou Gao

    Abstract: Interactions between several features sometimes play an important role in prediction tasks. But taking all the interactions into consideration will lead to an extremely heavy computational burden. For categorical features, the situation is more complicated since the input will be extremely high-dimensional and sparse if one-hot encoding is applied. Inspired by association rule mining, we propose a…

    Submitted 10 April, 2021; originally announced April 2021.

  38. arXiv:2101.08421  [pdf, other]

    math.ST stat.ML

    Optimal Full Ranking from Pairwise Comparisons

    Authors: Pinhan Chen, Chao Gao, Anderson Y. Zhang

    Abstract: We consider the problem of ranking $n$ players from partial pairwise comparison data under the Bradley-Terry-Luce model. For the first time in the literature, the minimax rate of this ranking problem is derived with respect to the Kendall's tau distance that measures the difference between two rank vectors by counting the number of inversions. The minimax rate of ranking exhibits a transition betw…

    Submitted 20 January, 2021; originally announced January 2021.

  39. arXiv:2101.02347  [pdf, other]

    math.ST math.OC stat.ML

    SDP Achieves Exact Minimax Optimality in Phase Synchronization

    Authors: Chao Gao, Anderson Y. Zhang

    Abstract: We study the phase synchronization problem with noisy measurements $Y=z^*z^{*H}+σW\in\mathbb{C}^{n\times n}$, where $z^*$ is an $n$-dimensional complex unit-modulus vector and $W$ is a complex-valued Gaussian random matrix. It is assumed that each entry $Y_{jk}$ is observed with probability $p$. We prove that an SDP relaxation of the MLE achieves the error bound $(1+o(1))\frac{σ^2}{2np}$ under a n…

    Submitted 17 March, 2022; v1 submitted 6 January, 2021; originally announced January 2021.

  40. arXiv:2009.03969  [pdf, ps, other]

    math.ST stat.ML

    Convergence Rates of Empirical Bayes Posterior Distributions: A Variational Perspective

    Authors: Fengshuo Zhang, Chao Gao

    Abstract: We study the convergence rates of empirical Bayes posterior distributions for nonparametric and high-dimensional inference. We show that as long as the hyperparameter set is discrete, the empirical Bayes posterior distribution induced by the maximum marginal likelihood estimator can be regarded as a variational approximation to a hierarchical Bayes posterior distribution. This connection between e…

    Submitted 8 September, 2020; originally announced September 2020.

  41. arXiv:2009.02528  [pdf, other]

    stat.AP eess.SP

    Structured Sparsity Modeling for Improved Multivariate Statistical Analysis based Fault Isolation

    Authors: Wei Chen, Jiusun Zeng, Xiaobin Xu, Shihua Luo, Chuanhou Gao

    Abstract: In order to improve the fault diagnosis capability of multivariate statistical methods, this article introduces a fault isolation framework based on structured sparsity modeling. The developed method relies on the reconstruction based contribution analysis and the process structure information can be incorporated into the reconstruction objective function in the form of structured sparsity regular…

    Submitted 21 December, 2020; v1 submitted 5 September, 2020; originally announced September 2020.

    Comments: 36 pages, 12 figures

  42. arXiv:2006.16485  [pdf, other]

    math.ST stat.ML

    Partial Recovery for Top-$k$ Ranking: Optimality of MLE and Sub-Optimality of Spectral Method

    Authors: Pinhan Chen, Chao Gao, Anderson Y. Zhang

    Abstract: Given partially observed pairwise comparison data generated by the Bradley-Terry-Luce (BTL) model, we study the problem of top-$k$ ranking. That is, to optimally identify the set of top-$k$ players. We derive the minimax rate with respect to a normalized Hamming loss. This provides the first result in the literature that characterizes the partial recovery error in terms of the proportion of mistak…

    Submitted 15 July, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

  43. arXiv:2005.12017  [pdf, other]

    stat.ME

    Estimating spatially varying health effects of wildland fire smoke using mobile health data

    Authors: Lili Wu, Chenyin Gao, Shu Yang, Brian J. Reich, Ana G. Rappold

    Abstract: Wildland fire smoke exposures are an increasing threat to public health, and thus there is a growing need for studying the effects of protective behaviors on reducing health outcomes. Emerging smartphone applications provide unprecedented opportunities to deliver health risk communication messages to a large number of individuals when and where they experience the exposure and subsequently study t…

    Submitted 6 July, 2024; v1 submitted 25 May, 2020; originally announced May 2020.

  44. arXiv:2005.10579  [pdf, other]

    stat.ME

    Elastic Integrative Analysis of Randomized Trial and Real-World Data for Treatment Heterogeneity Estimation

    Authors: Shu Yang, Chenyin Gao, Donglin Zeng, Xiaofei Wang

    Abstract: We propose a test-based elastic integrative analysis of the randomized trial and real-world data to estimate treatment effect heterogeneity with a vector of known effect modifiers. When the real-world data are not subject to bias, our approach combines the trial and real-world data for efficient estimation. Utilizing the trial design, we construct a test to decide whether or not to use real-world…

    Submitted 29 November, 2022; v1 submitted 21 May, 2020; originally announced May 2020.

  45. arXiv:2005.09912  [pdf, other]

    math.ST stat.ML

    Model Repair: Robust Recovery of Over-Parameterized Statistical Models

    Authors: Chao Gao, John Lafferty

    Abstract: A new type of robust estimation problem is introduced where the goal is to recover a statistical model that has been corrupted after it has been estimated from data. Methods are proposed for "repairing" the model using only the design and not the response values used to fit the model in a supervised learning setting. Theory is developed which reveals that two important ingredients are necessary fo…

    Submitted 20 May, 2020; originally announced May 2020.

  46. arXiv:2004.12908  [pdf, other]

    cs.AI cs.LG stat.ML

    Simple Lifelong Learning Machines

    Authors: Jayanta Dey, Joshua T. Vogelstein, Hayden S. Helm, Will LeVine, Ronak D. Mehta, Tyler M. Tomita, Haoyin Xu, Ali Geisa, Qingyang Wang, Gido M. van de Ven, Chenyu Gao, Bryan Tower, Jonathan Larson, Christopher M. White, Carey E. Priebe

    Abstract: In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain perf…

    Submitted 20 April, 2025; v1 submitted 27 April, 2020; originally announced April 2020.

  47. arXiv:2001.08290  [pdf, other]

    eess.AS cs.LG cs.NE cs.SD stat.ML

    Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture

    Authors: Haoran Miao, Gaofeng Cheng, Changfeng Gao, Pengyuan Zhang, Yonghong Yan

    Abstract: Recently, the Transformer has gained success in the automatic speech recognition (ASR) field. However, it is challenging to deploy a Transformer-based end-to-end (E2E) model for online speech recognition. In this paper, we propose the Transformer-based online CTC/attention E2E ASR architecture, which contains the chunk self-attention encoder (chunk-SAE) and the monotonic truncated attention (MTA) based se…

    Submitted 11 February, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: Accepted by ICASSP 2020

  48. arXiv:2001.05486  [pdf, other]

    physics.comp-ph cs.LG hep-ph stat.ML

    i-flow: High-dimensional Integration and Sampling with Normalizing Flows

    Authors: Christina Gao, Joshua Isaacson, Claudius Krause

    Abstract: In many fields of science, high-dimensional integration is required. Numerical methods have been developed to evaluate these complex integrals. We introduce the code i-flow, a python package that performs high-dimensional numerical integration utilizing normalizing flows. Normalizing flows are machine-learned, bijective mappings between two distributions. i-flow can also be used to sample random p…

    Submitted 17 August, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: 21 pages, 5 figures, 4 tables; v2: improved presentation and discussion, matches published version. Mach. Learn.: Sci. Technol (2020)

    Report number: FERMILAB-PUB-20-010-T

  49. arXiv:1911.05121  [pdf, other]

    cs.LG stat.ML

    Detecting Patterns of Physiological Response to Hemodynamic Stress via Unsupervised Deep Learning

    Authors: Chufan Gao, Fabian Falck, Mononito Goswami, Anthony Wertz, Michael R. Pinsky, Artur Dubrawski

    Abstract: Monitoring physiological responses to hemodynamic stress can help in determining appropriate treatment and ensuring good patient outcomes. Physicians' intuition suggests that the human body has a number of physiological response patterns to hemorrhage which escalate as blood loss continues, however the exact etiology and phenotypes of such responses are not well known or understood only at a coars…

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  50. arXiv:1911.01018  [pdf, ps, other]

    math.ST stat.CO stat.ME stat.ML

    Iterative Algorithm for Discrete Structure Recovery

    Authors: Chao Gao, Anderson Y. Zhang

    Abstract: We propose a general modeling and algorithmic framework for discrete structure recovery that can be applied to a wide range of problems. Under this framework, we are able to study the recovery of clustering labels, ranks of players, signs of regression coefficients, cyclic shifts, and even group elements from a unified perspective. A simple iterative algorithm is proposed for discrete structure re…

    Submitted 27 September, 2020; v1 submitted 3 November, 2019; originally announced November 2019.