Skip to main content

Showing 1–50 of 261 results for author: Huang, j

Searching in archive stat. Search in all archives.
.
  1. arXiv:2509.19633  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Mamba Modulation: On the Length Generalization of Mamba

    Authors: Peng Lu, Jerry Huang, Qiuhao Zeng, Xinyu Wang, Boxing Wang, Philippe Langlais, Yufei Cui

    Abstract: The quadratic complexity of the attention mechanism in Transformer models has motivated the development of alternative architectures with sub-quadratic scaling, such as state-space models. Among these, Mamba has emerged as a leading architecture, achieving state-of-the-art results across a range of language modeling tasks. However, Mamba's performance significantly deteriorates when applied to con… ▽ More

    Submitted 23 September, 2025; originally announced September 2025.

    Comments: Accepted to The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS) 2025. First two authors contributed equally

  2. arXiv:2508.16902  [pdf, ps, other

    stat.ME math.ST

    Efficient Semiparametric Inference for Distributed Data with Blockwise Missingness

    Authors: Jingyue Huang, Huiyuan Wang, Yuqing Lei, Yong Chen

    Abstract: We consider statistical inference for a finite-dimensional parameter in a regular semiparametric model under a distributed setting with blockwise missingness, where entire blocks of variables are unavailable at certain sites and sharing individual-level data is not allowed. To improve efficiency of the internal study, we propose a class of augmented one-step estimators that incorporate information… ▽ More

    Submitted 23 August, 2025; originally announced August 2025.

    MSC Class: 62F12; 62G10

  3. arXiv:2508.15928  [pdf, ps, other

    cs.LG stat.ML

    Transforming Causality: Transformer-Based Temporal Causal Discovery with Prior Knowledge Integration

    Authors: Jihua Huang, Yi Yao, Ajay Divakaran

    Abstract: We introduce a novel framework for temporal causal discovery and inference that addresses two key challenges: complex nonlinear dependencies and spurious correlations. Our approach employs a multi-layer Transformer-based time-series forecaster to capture long-range, nonlinear temporal relationships among variables. After training, we extract the underlying causal structure and associated time lags… ▽ More

    Submitted 21 August, 2025; originally announced August 2025.

  4. arXiv:2508.13174  [pdf, ps, other

    cs.AI cs.LG q-fin.CP stat.ML

    AlphaEval: A Comprehensive and Efficient Evaluation Framework for Formula Alpha Mining

    Authors: Hongjun Ding, Binqi Chen, Jinsheng Huang, Taian Guo, Zhengyang Mao, Guoyi Shao, Lutong Zou, Luchen Liu, Ming Zhang

    Abstract: Formula alpha mining, which generates predictive signals from financial data, is critical for quantitative investment. Although various algorithmic approaches-such as genetic programming, reinforcement learning, and large language models-have significantly expanded the capacity for alpha discovery, systematic evaluation remains a key challenge. Existing evaluation metrics predominantly include bac… ▽ More

    Submitted 10 August, 2025; originally announced August 2025.

    Comments: 12 pages, 5 figures

  5. arXiv:2508.11847  [pdf, ps, other

    stat.ML cs.LG

    Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings

    Authors: Jenny Y. Huang, Yunyi Shen, Dennis Wei, Tamara Broderick

    Abstract: We propose a method for evaluating the robustness of a widely used LLM ranking system -- the Bradley--Terry ranking system -- to dropping a worst-case very small fraction of evaluation data. Our approach is computationally fast and easy to adopt. When we apply our method to matchups from two popular human-preference platforms, Chatbot Arena and MT-Bench, we find that the Bradley--Terry rankings of… ▽ More

    Submitted 15 August, 2025; originally announced August 2025.

  6. arXiv:2508.00264  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Calibrated Language Models and How to Find Them with Label Smoothing

    Authors: Jerry Huang, Peng Lu, Qiuhao Zeng

    Abstract: Recent advances in natural language processing (NLP) have opened up greater opportunities to enable fine-tuned large language models (LLMs) to behave as more powerful interactive agents through improved instruction-following ability. However, understanding how this impacts confidence calibration for reliable model output has not been researched in full. In this work, we examine various open-source… ▽ More

    Submitted 31 July, 2025; originally announced August 2025.

    Comments: Accepted to the Forty-second International Conference on Machine Learning (ICML) 2025. First two authors contributed equally

  7. arXiv:2507.21442  [pdf

    stat.ME

    Detection of a Sparse Change in High-Dimensional Time Series

    Authors: Jingyan Huang

    Abstract: Consider the detection of a sparse change in high-dimensional time-series. We introduce Sparsity Likelihood-based (SL-based) score and the change-points detection procedure in multivariate normal model with general covariance structure. SL-based algorithm is proved to achieve that supremum of error probabilities converges to 0. We run the simulation studies for SL-based algorithm and also illustra… ▽ More

    Submitted 28 July, 2025; originally announced July 2025.

  8. arXiv:2507.14206  [pdf, ps, other

    eess.SP cs.AI cs.LG stat.ML

    A Comprehensive Benchmark for Electrocardiogram Time-Series

    Authors: Zhijiang Tang, Jiaxin Qi, Yuhua Zheng, Jianqiang Huang

    Abstract: Electrocardiogram~(ECG), a key bioelectrical time-series signal, is crucial for assessing cardiac health and diagnosing various diseases. Given its time-series format, ECG data is often incorporated into pre-training datasets for large-scale time-series model training. However, existing studies often overlook its unique characteristics and specialized downstream applications, which differ signific… ▽ More

    Submitted 14 July, 2025; originally announced July 2025.

    Comments: Accepted to ACM MM 2025

  9. arXiv:2505.11770  [pdf, ps, other

    cs.LG cs.AI cs.CL stat.ML

    Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors

    Authors: Jing Huang, Junyi Tao, Thomas Icard, Diyi Yang, Christopher Potts

    Abstract: Interpretability research now offers a variety of techniques for identifying abstract internal mechanisms in neural networks. Can such techniques be used to predict how models will behave on out-of-distribution examples? In this work, we provide a positive answer to this question. Through a diverse set of language modeling tasks--including symbol manipulation, knowledge retrieval, and instruction… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: ICML 2025

  10. arXiv:2505.07967  [pdf, ps, other

    stat.ML cs.LG

    Wasserstein Distributionally Robust Nonparametric Regression

    Authors: Changyu Liu, Yuling Jiao, Junhui Wang, Jian Huang

    Abstract: Distributionally robust optimization has become a powerful tool for prediction and decision-making under model uncertainty. By focusing on the local worst-case risk, it enhances robustness by identifying the most unfavorable distribution within a predefined ambiguity set. While extensive research has been conducted in parametric settings, studies on nonparametric frameworks remain limited. This pa… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 50 pages

    MSC Class: 62G05; 62G08; 68T07

  11. arXiv:2505.07180  [pdf, ps, other

    cs.LG stat.ML

    Causal View of Time Series Imputation: Some Identification Results on Missing Mechanism

    Authors: Ruichu Cai, Kaitao Zheng, Junxian Huang, Zijian Li, Zhengming Chen, Boyan Xu, Zhifeng Hao

    Abstract: Time series imputation is one of the most challenge problems and has broad applications in various fields like health care and the Internet of Things. Existing methods mainly aim to model the temporally latent dependencies and the generation process from the observed time series data. In real-world scenarios, different types of missing mechanisms, like MAR (Missing At Random), and MNAR (Missing No… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

  12. arXiv:2505.04992  [pdf, other

    stat.ML cs.LG stat.AP

    Boosting Statistic Learning with Synthetic Data from Pretrained Large Models

    Authors: Jialong Jiang, Wenkang Hu, Jian Huang, Yuling Jiao, Xu Liu

    Abstract: The rapid advancement of generative models, such as Stable Diffusion, raises a key question: how can synthetic data from these models enhance predictive modeling? While they can generate vast amounts of datasets, only a subset meaningfully improves performance. We propose a novel end-to-end framework that generates and systematically filters synthetic data through domain-specific statistical metho… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  13. arXiv:2505.00308  [pdf

    cs.CV cs.AI stat.AP

    AI-Assisted Decision-Making for Clinical Assessment of Auto-Segmented Contour Quality

    Authors: Biling Wang, Austen Maniscalco, Ti Bai, Siqiu Wang, Michael Dohopolski, Mu-Han Lin, Chenyang Shen, Dan Nguyen, Junzhou Huang, Steve Jiang, Xinlei Wang

    Abstract: Purpose: This study presents a Deep Learning (DL)-based quality assessment (QA) approach for evaluating auto-generated contours (auto-contours) in radiotherapy, with emphasis on Online Adaptive Radiotherapy (OART). Leveraging Bayesian Ordinal Classification (BOC) and calibrated uncertainty thresholds, the method enables confident QA predictions without relying on ground truth contours or extensive… ▽ More

    Submitted 11 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  14. arXiv:2504.11353  [pdf, other

    cs.LG stat.ML

    An Adaptive Dropout Approach for High-Dimensional Bayesian Optimization

    Authors: Jundi Huang, Dawei Zhan

    Abstract: Bayesian optimization (BO) is a widely used algorithm for solving expensive black-box optimization problems. However, its performance decreases significantly on high-dimensional problems due to the inherent high-dimensionality of the acquisition function. In the proposed algorithm, we adaptively dropout the variables of the acquisition function along the iterations. By gradually reducing the dimen… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  15. arXiv:2504.10540  [pdf, other

    stat.ML cs.AI cs.LG

    AB-Cache: Training-Free Acceleration of Diffusion Models via Adams-Bashforth Cached Feature Reuse

    Authors: Zichao Yu, Zhen Zou, Guojiang Shao, Chengwei Zhang, Shengze Xu, Jie Huang, Feng Zhao, Xiaodong Cun, Wenyi Zhang

    Abstract: Diffusion models have demonstrated remarkable success in generative tasks, yet their iterative denoising process results in slow inference, limiting their practicality. While existing acceleration methods exploit the well-known U-shaped similarity pattern between adjacent steps through caching mechanisms, they lack theoretical foundation and rely on simplistic computation reuse, often leading to p… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

  16. arXiv:2504.09567  [pdf, ps, other

    stat.ML cs.LG stat.ME

    From Conditional to Unconditional Independence: Testing Conditional Independence via Transport Maps

    Authors: Chenxuan He, Yuan Gao, Liping Zhu, Jian Huang

    Abstract: Testing conditional independence between two random vectors given a third is a fundamental and challenging problem in statistics, particularly in multivariate nonparametric settings due to the complexity of conditional structures. We propose a novel method for testing conditional independence by transforming it to an unconditional independence test problem. We achieve this by constructing two tran… ▽ More

    Submitted 24 July, 2025; v1 submitted 13 April, 2025; originally announced April 2025.

    Comments: 41 pages

    MSC Class: 62G05; 62G08; 68T07

  17. arXiv:2504.01031  [pdf, other

    stat.ML cs.LG

    Estimating Unbounded Density Ratios: Applications in Error Control under Covariate Shift

    Authors: Shuntuo Xu, Zhou Yu, Jian Huang

    Abstract: The density ratio is an important metric for evaluating the relative likelihood of two probability distributions, with extensive applications in statistics and machine learning. However, existing estimation theories for density ratios often depend on stringent regularity conditions, mainly focusing on density ratio functions with bounded domains and ranges. In this paper, we study density ratio es… ▽ More

    Submitted 29 March, 2025; originally announced April 2025.

    MSC Class: 62G05; 62G08; 68T07

  18. arXiv:2504.01030  [pdf, other

    stat.ML cs.LG

    Fair Sufficient Representation Learning

    Authors: Xueyu Zhou, Chun Yin IP, Jian Huang

    Abstract: The main objective of fair statistical modeling and machine learning is to minimize or eliminate biases that may arise from the data or the model itself, ensuring that predictions and decisions are not unjustly influenced by sensitive attributes such as race, gender, age, or other protected characteristics. In this paper, we introduce a Fair Sufficient Representation Learning (FSRL) method that ba… ▽ More

    Submitted 29 March, 2025; originally announced April 2025.

    Comments: 35 pages, 11 figures, and 6 tables (1 in the main text, 5 in the appendix)

    MSC Class: 62G05; 68T07

  19. arXiv:2503.21123  [pdf, other

    stat.AP

    De Novo Functional Protein Sequence Generation: Overcoming Data Scarcity through Regeneration and Large Models

    Authors: Chenyu Ren, Daihai He, Jian Huang

    Abstract: Proteins are essential components of all living organisms and play a critical role in cellular survival. They have a broad range of applications, from clinical treatments to material engineering. This versatility has spurred the development of protein design, with amino acid sequence design being a crucial step in the process. Recent advancements in deep generative models have shown promise for pr… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  20. arXiv:2503.16807  [pdf, other

    stat.ME

    Multi-View Orthogonal Projection Regression with Application in Multi-omics integration

    Authors: Zongrui Dai, Yvonne J. Huang, Gen Li

    Abstract: Multi-omics integration offers novel insights into complex biological mechanisms by utlizing the fused information from various omics datasets. However, the inherent within- and inter-modality correlations in multi-omics data present significant challenges for traditional variable selection methods, such as Lasso regression. These correlations can lead to multicollinearity, compromising the stabil… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  21. arXiv:2503.12784  [pdf, other

    stat.ME cs.LG stat.AP

    Causal Feature Learning in the Social Sciences

    Authors: Jingzhou Huang, Jiuyao Lu, Alexander Williams Tolbert

    Abstract: Variable selection poses a significant challenge in causal modeling, particularly within the social sciences, where constructs often rely on inter-related factors such as age, socioeconomic status, gender, and race. Indeed, it has been argued that such attributes must be modeled as macro-level abstractions of lower-level manipulable features, in order to preserve the modularity assumption essentia… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

  22. arXiv:2503.12155  [pdf, other

    stat.ME math.ST stat.AP

    On self-training of summary data with genetic applications

    Authors: Buxin Su, Jiaoyang Huang, Jin Jin, Bingxin Zhao

    Abstract: Prediction model training is often hindered by limited access to individual-level data due to privacy concerns and logistical challenges, particularly in biomedical research. Resampling-based self-training presents a promising approach for building prediction models using only summary-level data. These methods leverage summary statistics to sample pseudo datasets for model training and parameter o… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  23. arXiv:2503.09309  [pdf, ps, other

    cs.LG cs.AI cs.MA stat.ML

    Steering No-Regret Agents in MFGs under Model Uncertainty

    Authors: Leo Widmer, Jiawei Huang, Niao He

    Abstract: Incentive design is a popular framework for guiding agents' learning dynamics towards desired outcomes by providing additional payments beyond intrinsic rewards. However, most existing works focus on a finite, small set of agents or assume complete knowledge of the game, limiting their applicability to real-world scenarios involving large populations and model uncertainty. To address this gap, we… ▽ More

    Submitted 14 April, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

    Comments: AISTATS 2025; 34 Pages

  24. arXiv:2503.01728  [pdf, other

    cs.LG stat.ME

    DeepSuM: Deep Sufficient Modality Learning Framework

    Authors: Zhe Gao, Jian Huang, Ting Li, Xueqin Wang

    Abstract: Multimodal learning has become a pivotal approach in developing robust learning models with applications spanning multimedia, robotics, large language models, and healthcare. The efficiency of multimodal systems is a critical concern, given the varying costs and resource demands of different modalities. This underscores the necessity for effective modality selection to balance performance gains ag… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  25. arXiv:2502.20414  [pdf, other

    stat.ML cs.LG

    Transfer Learning through Enhanced Sufficient Representation: Enriching Source Domain Knowledge with Target Data

    Authors: Yeheng Ge, Xueyu Zhou, Jian Huang

    Abstract: Transfer learning is an important approach for addressing the challenges posed by limited data availability in various applications. It accomplishes this by transferring knowledge from well-established source domains to a less familiar target domain. However, traditional transfer learning methods often face difficulties due to rigid model assumptions and the need for a high degree of similarity be… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

    Comments: 44 pages

    MSC Class: 62G05; 68T07

  26. arXiv:2502.19255  [pdf, other

    cs.LG cs.AI stat.ML

    Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective

    Authors: Jiawei Huang, Bingcong Li, Christoph Dann, Niao He

    Abstract: Sample efficiency is critical for online Reinforcement Learning from Human Feedback (RLHF). While existing works investigate sample-efficient online exploration strategies, the potential of utilizing misspecified yet relevant reward models to accelerate learning remains underexplored. This paper studies how to transfer knowledge from those imperfect reward models in online RLHF. We start by identi… ▽ More

    Submitted 18 May, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: 36 Pages; ICML 2025

  27. arXiv:2502.16637  [pdf, other

    cs.LG cs.AI stat.ME

    Time Series Domain Adaptation via Latent Invariant Causal Mechanism

    Authors: Ruichu Cai, Junxian Huang, Zhenhui Yang, Zijian Li, Emadeldeen Eldele, Min Wu, Fuchun Sun

    Abstract: Time series domain adaptation aims to transfer the complex temporal dependence from the labeled source domain to the unlabeled target domain. Recent advances leverage the stable causal mechanism over observed variables to model the domain-invariant temporal dependence. However, modeling precise causal structures in high-dimensional data, such as videos, remains challenging. Additionally, direct ca… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  28. arXiv:2502.15655  [pdf, other

    math.ST math.PR stat.ML

    Local geometry of high-dimensional mixture models: Effective spectral theory and dynamical transitions

    Authors: Gerard Ben Arous, Reza Gheissari, Jiaoyang Huang, Aukosh Jagannath

    Abstract: We study the local geometry of empirical risks in high dimensions via the spectral theory of their Hessian and information matrices. We focus on settings where the data, $(Y_\ell)_{\ell =1}^n\in \mathbb R^d$, are i.i.d. draws of a $k$-component Gaussian mixture model, and the loss depends on the projection of the data into a fixed number of vectors, namely $\mathbf{x}^\top Y$, where… ▽ More

    Submitted 15 May, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: Figures added. 59 pages, 7 figures

  29. arXiv:2502.00172  [pdf, ps, other

    cs.LG cs.CC stat.ML

    Distribution-Specific Agnostic Conditional Classification With Halfspaces

    Authors: Jizhou Huang, Brendan Juba

    Abstract: We study ``selective'' or ``conditional'' classification problems under an agnostic setting. Classification tasks commonly focus on modeling the relationship between features and categories that captures the vast majority of data. In contrast to common machine learning frameworks, conditional classification intends to model such relationships only on a subset of the data defined by some selection… ▽ More

    Submitted 31 January, 2025; originally announced February 2025.

  30. arXiv:2412.20611  [pdf, other

    stat.ME math.ST

    Uncertainty of high-dimensional genetic data prediction with polygenic risk scores

    Authors: Haoxuan Fu, Jiaoyang Huang, Zirui Fan, Bingxin Zhao

    Abstract: In many predictive tasks, there are a large number of true predictors with weak signals, leading to substantial uncertainties in prediction outcomes. The polygenic risk score (PRS) is an example of such a scenario, where many genetic variants are used as predictors for complex traits, each contributing only a small amount of information. Although PRS has been a standard tool in genetic predictions… ▽ More

    Submitted 29 December, 2024; originally announced December 2024.

  31. arXiv:2412.14222  [pdf, ps, other

    cs.AI cs.CL cs.LG stat.OT

    A Survey on Large Language Model-based Agents for Statistics and Data Science

    Authors: Maojun Sun, Ruijian Han, Binyan Jiang, Houduo Qi, Defeng Sun, Yancheng Yuan, Jian Huang

    Abstract: In recent years, data science agents powered by Large Language Models (LLMs), known as "data agents," have shown significant potential to transform the traditional data analysis paradigm. This survey provides an overview of the evolution, capabilities, and applications of LLM-based data agents, highlighting their role in simplifying complex data tasks and lowering the entry barrier for users witho… ▽ More

    Submitted 14 September, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

  32. arXiv:2411.16663  [pdf, ps, other

    stat.ML cs.LG math.AC math.NA

    Gaussian Process Priors for Boundary Value Problems of Linear Partial Differential Equations

    Authors: Jianlei Huang, Marc Härkönen, Markus Lange-Hegermann, Bogdan Raiţă

    Abstract: Working with systems of partial differential equations (PDEs) is a fundamental task in computational science. Well-posed systems are addressed by numerical solvers or neural operators, whereas systems described by data are often addressed by PINNs or Gaussian processes. In this work, we propose Boundary Ehrenpreis--Palamodov Gaussian Processes (B-EPGPs), a novel probabilistic framework for constru… ▽ More

    Submitted 26 September, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

    Comments: 36 pages, 18 figures. Code available at $\href{https://github.com/Jimmy000207/Boundary-EPGP}{\text{this https URL}}$. The paper and all ancillary files are released under CC-BY

    MSC Class: 60G15; 13N10; 13P25; 60-08; 35G35

  33. arXiv:2411.01833  [pdf, other

    cs.LG cs.CV stat.ML

    OwMatch: Conditional Self-Labeling with Consistency for Open-World Semi-Supervised Learning

    Authors: Shengjie Niu, Lifan Lin, Jian Huang, Chao Wang

    Abstract: Semi-supervised learning (SSL) offers a robust framework for harnessing the potential of unannotated data. Traditionally, SSL mandates that all classes possess labeled instances. However, the emergence of open-world SSL (OwSSL) introduces a more practical challenge, wherein unlabeled data may encompass samples from unseen classes. This scenario leads to misclassification of unseen classes as known… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024 camera-ready (10 pages, 4 figures) with the appendices (10 pages, 7 figures)

  34. arXiv:2411.01487  [pdf, other

    stat.ML cs.CV cs.LG

    DSDE: Using Proportion Estimation to Improve Model Selection for Out-of-Distribution Detection

    Authors: Jingyao Geng, Yuan Zhang, Jiaqi Huang, Feng Xue, Falong Tan, Chuanlong Xie, Shumei Zhang

    Abstract: Model library is an effective tool for improving the performance of single-model Out-of-Distribution (OoD) detector, mainly through model selection and detector fusion. However, existing methods in the literature do not provide uncertainty quantification for model selection results. Additionally, the model ensemble process primarily focuses on controlling the True Positive Rate (TPR) while neglect… ▽ More

    Submitted 3 November, 2024; originally announced November 2024.

    Comments: 16 pages, 2 figures

  35. arXiv:2410.19226  [pdf, other

    stat.ME

    Deep Transformation Model

    Authors: Tong Wang, Shunqin Zhang, Sanguo Zhang, Jian Huang, Shuangge Ma

    Abstract: There has been a significant recent surge in deep neural network (DNN) techniques. Most of the existing DNN techniques have restricted model formats/assumptions. To overcome their limitations, we propose the nonparametric transformation model, which encompasses many popular models as special cases and hence is less sensitive to model mis-specification. This model also has the potential of accommod… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

  36. arXiv:2410.18021  [pdf, other

    stat.ME math.ST

    Deep Nonparametric Inference for Conditional Hazard Function

    Authors: Wen Su, Kin-Yat Liu, Guosheng Yin, Jian Huang, Xingqiu Zhao

    Abstract: We propose a novel deep learning approach to nonparametric statistical inference for the conditional hazard function of survival time with right-censored data. We use a deep neural network (DNN) to approximate the logarithm of a conditional hazard function given covariates and obtain a DNN likelihood-based estimator of the conditional hazard function. Such an estimation approach renders model flex… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

  37. arXiv:2408.14036  [pdf, ps, other

    stat.ME

    Robust subgroup-classifier learning and testing in change-plane regressions

    Authors: Xu Liu, Jian Huang, Yong Zhou, Xiao Zhang

    Abstract: Considered here are robust subgroup-classifier learning and testing in change-plane regressions with heavy-tailed errors, which can identify subgroups as a basis for making optimal recommendations for individualized treatment. A new subgroup classifier is proposed by smoothing the indicator function, which is learned by minimizing the smoothed Huber loss. Nonasymptotic properties and the Bahadur r… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  38. arXiv:2408.09008  [pdf, ps, other

    stat.ME stat.CO

    Approximations to worst-case data dropping: unmasking failure modes

    Authors: Jenny Y. Huang, David R. Burt, Yunyi Shen, Tin D. Nguyen, Tamara Broderick

    Abstract: A data analyst might worry about generalization if dropping a very small fraction of data points from a study could change its substantive conclusions. Checking this non-robustness directly poses a combinatorial optimization problem and is intractable even for simple models and moderate data sizes. Recently various authors have proposed a diverse set of approximations to detect this non-robustness… ▽ More

    Submitted 30 May, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: 71 pages

    Journal ref: Transactions on Machine Learning Research, July 2025

  39. arXiv:2407.10207  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Learning to Steer Markovian Agents under Model Uncertainty

    Authors: Jiawei Huang, Vinzenz Thoma, Zebang Shen, Heinrich H. Nax, Niao He

    Abstract: Designing incentives for an adapting population is a ubiquitous problem in a wide array of economic applications and beyond. In this work, we study how to design additional rewards to steer multi-agent systems towards desired policies \emph{without} prior knowledge of the agents' underlying learning dynamics. Motivated by the limitation of existing works, we consider a new and general category of… ▽ More

    Submitted 8 February, 2025; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: 35 Pages; ICLR 2025

  40. arXiv:2407.01015  [pdf, other

    stat.ML cs.LG

    Bayesian Entropy Neural Networks for Physics-Aware Prediction

    Authors: Rahul Rathnakumar, Jiayu Huang, Hao Yan, Yongming Liu

    Abstract: This paper addresses the need for deep learning models to integrate well-defined constraints into their outputs, driven by their application in surrogate models, learning with limited data and partial information, and scenarios requiring flexible model behavior to incorporate non-data sample information. We introduce Bayesian Entropy Neural Networks (BENN), a framework grounded in Maximum Entropy… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 15 pages

    ACM Class: I.5.1

  41. arXiv:2406.13197  [pdf, other

    stat.ME

    Representation Transfer Learning for Semiparametric Regression

    Authors: Baihua He, Huihang Liu, Xinyu Zhang, Jian Huang

    Abstract: We propose a transfer learning method that utilizes data representations in a semiparametric regression model. Our aim is to perform statistical inference on the parameter of primary interest in the target model while accounting for potential nonlinear effects of confounding variables. We leverage knowledge from source domains, assuming that the sample size of the source data is substantially larg… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 42 pages, 11 figures, 5 tables

    MSC Class: 62F99

  42. arXiv:2406.07525  [pdf

    econ.GN stat.AP

    Will Southeast Asia be the next global manufacturing hub? A multiway cointegration, causality, and dynamic connectedness analyses on factors influencing offshore decisions

    Authors: Haibo Wang, Lutfu S. Sua, Jun Huang, Jaime Ortiz, Bahram Alidaee

    Abstract: The COVID-19 pandemic has compelled multinational corporations to diversify their global supply chain risk and to relocate their factories to Southeast Asian countries beyond China. Such recent phenomena provide a good opportunity to understand the factors that influenced offshore decisions in the last two decades. We propose a new conceptual framework based on econometric approaches to examine th… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 30 pages

  43. arXiv:2406.03683  [pdf, other

    cs.LG stat.ML

    Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models

    Authors: Ding Huang, Ting Li, Jian Huang

    Abstract: We propose a Bayesian framework for fine-tuning large diffusion models with a novel network structure called Bayesian Power Steering (BPS). We clarify the meaning behind adaptation from a \textit{large probability space} to a \textit{small probability space} and explore the task of fine-tuning pre-trained models using learnable modules from a Bayesian perspective. BPS extracts task-specific knowle… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 25 pages, 26 figures, and 4 tables

    MSC Class: 62G05; 68T07

  44. arXiv:2405.18284  [pdf, other

    stat.ML cs.LG

    Adaptive debiased SGD in high-dimensional GLMs with streaming data

    Authors: Ruijian Han, Lan Luo, Yuanhang Luo, Yuanyuan Lin, Jian Huang

    Abstract: Online statistical inference facilitates real-time analysis of sequentially collected data, making it different from traditional methods that rely on static datasets. This paper introduces a novel approach to online inference in high-dimensional generalized linear models, where we update regression coefficient estimates and their standard errors upon each new data arrival. In contrast to existing… ▽ More

    Submitted 26 February, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 30 pages, 4 figures

  45. arXiv:2404.15760  [pdf, other

    cs.LG cs.AI stat.ML

    Debiasing Machine Unlearning with Counterfactual Examples

    Authors: Ziheng Chen, Jia Wang, Jun Zhuang, Abbavaram Gowtham Reddy, Fabrizio Silvestri, Jin Huang, Kaushiki Nag, Kun Kuang, Xin Ning, Gabriele Tolomei

    Abstract: The right to be forgotten (RTBF) seeks to safeguard individuals from the enduring effects of their historical actions by implementing machine-learning techniques. These techniques facilitate the deletion of previously acquired knowledge without requiring extensive model retraining. However, they often overlook a critical issue: unlearning processes bias. This bias emerges from two main sources: (1… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  46. arXiv:2404.00551  [pdf, other

    stat.ML cs.LG

    Convergence of Continuous Normalizing Flows for Learning Probability Distributions

    Authors: Yuan Gao, Jian Huang, Yuling Jiao, Shurong Zheng

    Abstract: Continuous normalizing flows (CNFs) are a generative method for learning probability distributions, which is based on ordinary differential equations. This method has shown remarkable empirical success across various applications, including large-scale image synthesis, protein structure prediction, and molecule generation. In this work, we study the theoretical properties of CNFs with linear inter… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 60 pages, 3 tables, and 3 figures

    MSC Class: 62G05; 68T07

  47. arXiv:2403.16283  [pdf, other

    stat.ME

    Sample Empirical Likelihood Methods for Causal Inference

    Authors: Jingyue Huang, Changbao Wu, Leilei Zeng

    Abstract: Causal inference is crucial for understanding the true impact of interventions, policies, or actions, enabling informed decision-making and providing insights into the underlying mechanisms that shape our world. In this paper, we establish a framework for the estimation and inference of average treatment effects using a two-sample empirical likelihood function. Two different approaches to incorpor… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  48. arXiv:2403.12367  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Learning covariate importance for matching in policy-relevant observational research

    Authors: Hongzhe Zhang, Jiasheng Shi, Jing Huang

    Abstract: Matching methods are widely used to reduce confounding effects in observational studies, but conventional approaches often treat all covariates as equally important, which can result in poor performance when covariates differ in their relevance to the study. We propose the Priority-Aware one-to-one Matching Algorithm (PAMA), a novel semi-supervised framework that learns a covariate importance meas… ▽ More

    Submitted 29 August, 2025; v1 submitted 18 March, 2024; originally announced March 2024.

  49. arXiv:2403.12243  [pdf, other

    stat.ME

    Unlocking the Power of Time-Since-Infection Models: Data Augmentation for Improved Instantaneous Reproduction Number Estimation

    Authors: Jiasheng Shi, Yizhao Zhou, Jing Huang

    Abstract: The Time Since Infection (TSI) models, which use disease surveillance data to model infectious diseases, have become increasingly popular due to their flexibility and capacity to address complex disease control questions. However, a notable limitation of TSI models is their primary reliance on incidence data. Even when hospitalization data are available, existing TSI models have not been crafted t… ▽ More

    Submitted 9 January, 2025; v1 submitted 18 March, 2024; originally announced March 2024.

  50. arXiv:2402.16661  [pdf, other

    stat.ML cs.LG stat.ME

    Penalized Generative Variable Selection

    Authors: Tong Wang, Jian Huang, Shuangge Ma

    Abstract: Deep networks are increasingly applied to a wide variety of data, including data with high-dimensional predictors. In such analysis, variable selection can be needed along with estimation/model building. Many of the existing deep network studies that incorporate variable selection have been limited to methodological and numerical developments. In this study, we consider modeling/estimation using t… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.