Skip to main content

Showing 1–50 of 427 results for author: Wu, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.03584  [pdf, ps, other

    stat.ME

    Small Area Estimation of Fertility in Low- and Middle-Income Countries

    Authors: Yunhan Wu, Jon Wakefield

    Abstract: Accurate fertility estimates at fine spatial resolution are essential for localized public health planning, particularly in low- and middle-income countries (LMICs). While national-level indicators such as age-specific fertility rates (ASFR) and total fertility rate (TFR) are often reported through official statistics, they lack the spatial granularity needed to guide targeted interventions. To ad… ▽ More

    Submitted 4 July, 2025; originally announced July 2025.

    Comments: 35 pages and 13 figures

  2. arXiv:2506.21991  [pdf

    stat.ME stat.AP

    Simulated Intervention on Cross-Sectional Nested Data: Development of a Multilevel NIRA Approach

    Authors: Yiming Wu, Fei Wang

    Abstract: With the rise of the network perspective, researchers have made numerous important discoveries over the past decade by constructing psychological networks. Unfortunately, most of these networks are based on cross-sectional data, which can only reveal associations between variables but not their directional or causal relationships. Recently, the development of the nodeIdentifyR algorithm (NIRA) tec… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  3. arXiv:2506.20406  [pdf, ps, other

    stat.ML cs.IT cs.LG stat.ME

    POLAR: A Pessimistic Model-based Policy Learning Algorithm for Dynamic Treatment Regimes

    Authors: Ruijia Zhang, Zhengling Qi, Yue Wu, Xiangyu Zhang, Yanxun Xu

    Abstract: Dynamic treatment regimes (DTRs) provide a principled framework for optimizing sequential decision-making in domains where decisions must adapt over time in response to individual trajectories, such as healthcare, education, and digital interventions. However, existing statistical methods often rely on strong positivity assumptions and lack robustness under partial data coverage, while offline rei… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  4. arXiv:2506.08438  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Learning to Lead: Incentivizing Strategic Agents in the Dark

    Authors: Yuchen Wu, Xinyi Zhong, Zhuoran Yang

    Abstract: We study an online learning version of the generalized principal-agent model, where a principal interacts repeatedly with a strategic agent possessing private types, private rewards, and taking unobservable actions. The agent is non-myopic, optimizing a discounted sum of future rewards and may strategically misreport types to manipulate the principal's learning. The principal, observing only her o… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 81 pages, 7 figures

  5. arXiv:2506.02524  [pdf, ps, other

    stat.ME stat.AP

    Variable Selection in Functional Linear Cox Model

    Authors: Yuanzhen Yue, Stella Self, Yichao Wu, Jiajia Zhang, Rahul Ghosal

    Abstract: Modern biomedical studies frequently collect complex, high-dimensional physiological signals using wearables and sensors along with time-to-event outcomes, making efficient variable selection methods crucial for interpretation and improving the accuracy of survival models. We propose a novel variable selection method for a functional linear Cox model with multiple functional and scalar covariates… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  6. arXiv:2505.19097  [pdf, ps, other

    cs.LG stat.ML

    Towards Robust Influence Functions with Flat Validation Minima

    Authors: Xichen Ye, Yifan Wu, Weizhong Zhang, Cheng Jin, Yifan Chen

    Abstract: The Influence Function (IF) is a widely used technique for assessing the impact of individual training samples on model predictions. However, existing IF methods often fail to provide reliable influence estimates in deep neural networks, particularly when applied to noisy training data. This issue does not stem from inaccuracies in parameter change estimation, which has been the primary focus of p… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: Accepted by ICML 2025. arXiv admin note: text overlap with arXiv:2310.00902 by other authors

  7. arXiv:2505.14999  [pdf, ps, other

    cs.LG cs.AI cs.CL stat.ML

    Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision

    Authors: Eric Hanchen Jiang, Haozheng Luo, Shengyuan Pang, Xiaomin Li, Zhenting Qi, Hengli Li, Cheng-Fu Yang, Zongyu Lin, Xinfeng Li, Hao Xu, Kai-Wei Chang, Ying Nian Wu

    Abstract: Mathematical reasoning presents a significant challenge for Large Language Models (LLMs), often requiring robust multi step logical consistency. While Chain of Thought (CoT) prompting elicits reasoning steps, it doesn't guarantee correctness, and improving reliability via extensive sampling is computationally costly. This paper introduces the Energy Outcome Reward Model (EORM), an effective, light… ▽ More

    Submitted 14 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  8. arXiv:2505.14806  [pdf, ps, other

    q-bio.NC cs.LG stat.ML

    Place Cells as Proximity-Preserving Embeddings: From Multi-Scale Random Walk to Straight-Forward Path Planning

    Authors: Minglu Zhao, Dehong Xu, Deqian Kong, Wen-Hao Zhang, Ying Nian Wu

    Abstract: The hippocampus enables spatial navigation through place cell populations forming cognitive maps. We propose proximity-preserving neural embeddings to encode multi-scale random walk transitions, where the inner product $\langle h(x, t), h(y, t) \rangle = q(y|x, t)$ represents normalized transition probabilities, with $h(x, t)$ as the embedding at location $x$ and $q(y|x, t)$ as the transition prob… ▽ More

    Submitted 2 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  9. arXiv:2505.12308  [pdf

    stat.ME math.ST

    A Hybrid Prior Bayesian Method for Combining Domestic Real-World Data and Overseas Data in Global Drug Development

    Authors: Keer Chen, Zengyue Zheng, Pengfei Zhu, Shuping Jiang, Nan Li, Jumin Deng, Pingyan Chen, Zhenyu Wu, Ying Wu

    Abstract: Background Hybrid clinical trial design integrates randomized controlled trials (RCTs) with real-world data (RWD) to enhance efficiency through dynamic incorporation of external data. Existing methods like the Meta-Analytic Predictive Prior (MAP) inadequately control data heterogeneity, adjust baseline discrepancies, or optimize dynamic borrowing proportions, introducing bias and limiting applicat… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: 10 figures

  10. arXiv:2505.11841  [pdf, ps, other

    stat.AP

    Framing Causal Questions in Sports Analytics: A Case Study of Crossing in Soccer

    Authors: Shomoita Alam, Erica E. M. Moodie, Lucas Y. Wu, Tim B. Swartz

    Abstract: Causal inference has become an accepted analytic framework in settings where experimentation is impossible, which is frequently the case in sports analytics, particularly for studying in-game tactics. However, subtle differences in implementation can lead to important differences in interpretation. In this work, we provide a case study to demonstrate the utility and the nuance of these approaches.… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: 22 pages, 2 figures

  11. arXiv:2505.05613  [pdf, other

    stat.ML cs.CR cs.IT cs.LG math.ST

    Optimal Regret of Bernoulli Bandits under Global Differential Privacy

    Authors: Achraf Azize, Yulian Wu, Junya Honda, Francesco Orabona, Shinji Ito, Debabrota Basu

    Abstract: As sequential learning algorithms are increasingly applied to real life, ensuring data privacy while maintaining their utilities emerges as a timely question. In this context, regret minimisation in stochastic bandits under $ε$-global Differential Privacy (DP) has been widely studied. Unlike bandits without DP, there is a significant gap between the best-known regret lower and upper bound in this… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  12. arXiv:2505.02220  [pdf, ps, other

    stat.ME

    Statistical method for pooling categorical biomarkers from multi-center matched/nested case-control studies

    Authors: Yujie Wu, Xiao Wu, Mitchell H. Gail, Regina G. Ziegler, Stephanie A. Smith-Warner, Molin Wang

    Abstract: Pooled analyses that aggregate data from multiple studies are becoming increasingly common in collaborative epidemiologic research in order to increase the size and diversity of the study population. However, biomarker measurements from different studies are subject to systematic measurement errors and directly pooling them for analyses may lead to biased estimates of the regression parameters. Th… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  13. arXiv:2505.02217  [pdf, other

    stat.ME

    Statistical methods for clustered competing risk data when the event types are only available in a training dataset

    Authors: Yujie Wu, Molin Wang

    Abstract: We develop methods to analyze clustered competing risks data when the event types are only available in a training dataset and are missing in the main study. We propose to estimate the exposure effects through the cause-specific proportional hazards frailty model where random effects are introduced into the model to account for the within-cluster correlation. We propose a weighted penalized partia… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  14. arXiv:2505.01467  [pdf, other

    stat.CO

    sae4health: An R Shiny Application for Small Area Estimation in Low- and Middle-Income Countries

    Authors: Yunhan Wu, Qianyu Dong, Jieyi Xu, Zehang Richard Li, Jon Wakefield

    Abstract: Accurate subnational estimation of health indicators is critical for public health planning, especially in low- and middle-income countries (LMICs), where data and tools are often limited. The sae4health R shiny app, built on the surveyPrev package, provides a user-friendly tool for prevalence mapping using small area estimation (SAE) methods. Both area- and unit-level models with spatial random e… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  15. arXiv:2504.16435  [pdf, other

    stat.AP

    Toward a Principled Workflow for Prevalence Mapping Using Household Survey Data

    Authors: Qianyu Dong, Yunhan Wu, Zehang Richard Li, Jon Wakefield

    Abstract: Understanding the prevalence of key demographic and health indicators in small geographic areas and domains is of global interest, especially in low- and middle-income countries (LMICs), where vital registration data is sparse and household surveys are the primary source of information. Recent advances in computation and the increasing availability of spatially detailed datasets have led to much p… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: 34 pages, 14 figures

  16. arXiv:2504.15582  [pdf, other

    cs.LG cs.DS stat.ML

    Smooth Calibration and Decision Making

    Authors: Jason Hartline, Yifan Wu, Yunran Yang

    Abstract: Calibration requires predictor outputs to be consistent with their Bayesian posteriors. For machine learning predictors that do not distinguish between small perturbations, calibration errors are continuous in predictions, e.g., smooth calibration error (Foster and Hart, 2018), Distance to Calibration (Blasiok et al., 2023a). On the contrary, decision-makers who use predictions make optimal decisi… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: In FORC 2025

  17. Bayesian Rao test for distributed target detection in interference and noise with limited training data

    Authors: Daipeng Xiao, Weijian Liu, Jun Liu, Yuntao Wu, Qinglei Du, Xiaoqiang Hua

    Abstract: This paper has studied the problem of detecting a range-spread target in interference and noise when the number of training data is limited. The interference is located within a certain subspace with an unknown coordinate, while the noise follows a Gaussian distribution with an unknown covariance matrix. We concentrate on the scenarios where the training data are limited and employ a Bayesian fram… ▽ More

    Submitted 5 May, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: 14 pages,18 figures. This manuscript has been accepted by SCIENCE CHINA Information Sciences

  18. arXiv:2503.11990  [pdf, ps, other

    stat.ME

    Testing Stochastic Block Models Based on Maximum Sampling Entry-Wise Deviations

    Authors: Yujia Wu, Wei Lan, Long Feng, Chih-Ling Tsai

    Abstract: The stochastic block model (SBM) has been widely used to analyze network data. Various goodness-of-fit tests have been proposed to assess the adequacy of model structures. To the best of our knowledge, however, none of the existing approaches are applicable for sparse networks in which the connection probability of any two communities is of order log n/n, and the number of communities is divergent… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  19. arXiv:2503.11709  [pdf, other

    cs.LG cs.AI stat.ML

    Conformal Prediction and Human Decision Making

    Authors: Jessica Hullman, Yifan Wu, Dawei Xie, Ziyang Guo, Andrew Gelman

    Abstract: Methods to quantify uncertainty in predictions from arbitrary models are in demand in high-stakes domains like medicine and finance. Conformal prediction has emerged as a popular method for producing a set of predictions with specified average coverage, in place of a single prediction and confidence value. However, the value of conformal prediction sets to assist human decisions remains elusive du… ▽ More

    Submitted 18 March, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

  20. arXiv:2502.17773  [pdf, other

    stat.ME cs.AI cs.LG

    Uncertainty Quantification for LLM-Based Survey Simulations

    Authors: Chengpiao Huang, Yuhang Wu, Kaizheng Wang

    Abstract: We investigate the use of large language models (LLMs) to simulate human responses to survey questions, and perform uncertainty quantification to gain reliable insights. Our approach converts imperfect LLM-simulated responses into confidence sets for population parameters of human responses, addressing the distribution shift between the simulated and real populations. A key innovation lies in dete… ▽ More

    Submitted 26 May, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: 33 pages, 7 figures, 10 tables

  21. arXiv:2502.16504  [pdf, other

    stat.ME

    Local Information for Global Network Estimation in Latent Space Models

    Authors: Lijia Wang, Xiao Han, Yanhui Wu, Y. X. Rachel Wang

    Abstract: In social networks, neighborhood is crucial for understanding individual behavior in response to environments, and thus it is essential to analyze an individual's local perspective within the global network. This paper studies how to utilize a partial information network centered around a given individual for global network estimation by fitting a general latent space model. Compared to the entire… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  22. arXiv:2502.16120  [pdf, other

    math.OC stat.ML

    A Fenchel-Young Loss Approach to Data-Driven Inverse Optimization

    Authors: Zhehao Li, Yanchen Wu, Xiaojie Mao

    Abstract: Data-driven inverse optimization seeks to estimate unknown parameters in an optimization model from observations of optimization solutions. Many existing methods are ineffective in handling noisy and suboptimal solution observations and also suffer from computational challenges. In this paper, we build a connection between inverse optimization and the Fenchel-Young (FY) loss originally designed fo… ▽ More

    Submitted 2 April, 2025; v1 submitted 22 February, 2025; originally announced February 2025.

  23. arXiv:2502.07946  [pdf, other

    stat.ME

    Small Area Estimation of Education Levels in Low- and Middle-Income Countries

    Authors: Yunhan Wu, Ameer Dharamshi, Jon Wakefield

    Abstract: Education is a key driver of social and economic mobility, yet disparities in attainment persist, particularly in low- and middle-income countries (LMICs). Existing indicators, such as mean years of schooling for adults aged 25 and older (MYS25) and expected years of schooling (EYS), offer a snapshot of an educational system, but lack either cohort-specific or temporal granularity. To address thes… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  24. arXiv:2502.03669  [pdf, ps, other

    cs.LG cs.AI cs.DM math.OC stat.ML

    Time to Rethink AI for Combinatorial Optimization: Classical Algorithms Remain Tough to Match

    Authors: Yikai Wu, Haoyu Zhao, Sanjeev Arora

    Abstract: This position paper argues that the machine learning community should fundamentally rethink how AI-inspired methods are developed and evaluated for combinatorial optimization (CO). We present comprehensive empirical benchmarks comparing various recent AI-inspired GPU-based methods with several classical CPU-based solvers on the Maximum Independent Set (MIS) problem. Strikingly, even on in-distribu… ▽ More

    Submitted 29 June, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: 28 pages, 6 figures, 98 tables

  25. arXiv:2502.03435  [pdf, other

    stat.ML cs.LG

    Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization

    Authors: Yu-Han Wu, Pierre Marion, Gérard Biau, Claire Boyer

    Abstract: Denoising score matching plays a pivotal role in the performance of diffusion-based generative models. However, the empirical optimal score--the exact solution to the denoising score matching--leads to memorization, where generated samples replicate the training data. Yet, in practice, only a moderate degree of memorization is observed, even without explicit regularization. In this paper, we inves… ▽ More

    Submitted 6 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  26. arXiv:2502.01567  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Latent Thought Models with Variational Bayes Inference-Time Computation

    Authors: Deqian Kong, Minglu Zhao, Dehong Xu, Bo Pang, Shu Wang, Edouardo Honig, Zhangzhang Si, Chuan Li, Jianwen Xie, Sirui Xie, Ying Nian Wu

    Abstract: We propose a novel class of language models, Latent Thought Models (LTMs), which incorporate explicit latent thought vectors that follow an explicit prior model in latent space. These latent thought vectors guide the autoregressive generation of ground tokens through a Transformer decoder. Training employs a dual-rate optimization process within the classical variational Bayes framework: fast lear… ▽ More

    Submitted 6 June, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

  27. arXiv:2501.07795  [pdf, other

    stat.CO stat.ML

    Black-box Optimization with Simultaneous Statistical Inference for Optimal Performance

    Authors: Teng Lian, Jian-Qiang Hu, Yuhang Wu, Zeyu Zheng

    Abstract: Black-box optimization is often encountered for decision-making in complex systems management, where the knowledge of system is limited. Under these circumstances, it is essential to balance the utilization of new information with computational efficiency. In practice, decision-makers often face the dual tasks of optimization and statistical inference for the optimal performance, in order to achie… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

  28. arXiv:2412.20586  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Testing and Improving the Robustness of Amortized Bayesian Inference for Cognitive Models

    Authors: Yufei Wu, Stefan Radev, Francis Tuerlinckx

    Abstract: Contaminant observations and outliers often cause problems when estimating the parameters of cognitive models, which are statistical models representing cognitive processes. In this study, we test and improve the robustness of parameter estimation using amortized Bayesian inference (ABI) with neural networks. To this end, we conduct systematic analyses on a toy example and analyze both synthetic a… ▽ More

    Submitted 29 December, 2024; originally announced December 2024.

  29. arXiv:2412.04767  [pdf, other

    cs.LG cs.DS stat.ML

    Towards counterfactual fairness through auxiliary variables

    Authors: Bowei Tian, Ziyao Wang, Shwai He, Wanghao Ye, Guoheng Sun, Yucong Dai, Yongkai Wu, Ang Li

    Abstract: The challenge of balancing fairness and predictive accuracy in machine learning models, especially when sensitive attributes such as race, gender, or age are considered, has motivated substantial research in recent years. Counterfactual fairness ensures that predictions remain consistent across counterfactual variations of sensitive attributes, which is a crucial concept in addressing societal bia… ▽ More

    Submitted 20 February, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: arXiv admin note: text overlap with arXiv:2307.08232 by other authors

    Journal ref: The International Conference on Learning Representations (ICLR 2025)

  30. arXiv:2411.17472  [pdf, other

    cs.CV cs.LG stat.ML

    Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory

    Authors: Eric Hanchen Jiang, Yasi Zhang, Zhi Zhang, Yixin Wan, Andrew Lizarraga, Shufan Li, Ying Nian Wu

    Abstract: Text-to-image (T2I) diffusion models have revolutionized generative modeling by producing high-fidelity, diverse, and visually realistic images from textual prompts. Despite these advances, existing models struggle with complex prompts involving multiple objects and attributes, often misaligning modifiers with their corresponding nouns or neglecting certain elements. Recent attention-based methods… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

  31. arXiv:2411.17154  [pdf, other

    q-bio.PE cs.LG stat.ML

    Emergenet: A Digital Twin of Sequence Evolution for Scalable Emergence Risk Assessment of Animal Influenza A Strains

    Authors: Kevin Yuanbo Wu, Jin Li, Aaron Esser-Kahn, Ishanu Chattopadhyay

    Abstract: Despite having triggered devastating pandemics in the past, our ability to quantitatively assess the emergence potential of individual strains of animal influenza viruses remains limited. This study introduces Emergenet, a tool to infer a digital twin of sequence evolution to chart how new variants might emerge in the wild. Our predictions based on Emergenets built only using 220,151 Hemagglutinni… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: 35 pages, 15 figures

  32. arXiv:2411.12578  [pdf, other

    stat.ME math.ST

    Robust Inference for High-dimensional Linear Models with Heavy-tailed Errors via Partial Gini Covariance

    Authors: Yilin Zhang, Songshan Yang, Yunan Wu, Lan Wang

    Abstract: This paper introduces the partial Gini covariance, a novel dependence measure that addresses the challenges of high-dimensional inference with heavy-tailed errors, often encountered in fields like finance, insurance, climate, and biology. Conventional high-dimensional regression inference methods suffer from inaccurate type I errors and reduced power in heavy-tailed contexts, limiting their effect… ▽ More

    Submitted 20 November, 2024; v1 submitted 19 November, 2024; originally announced November 2024.

  33. arXiv:2411.10596  [pdf, other

    q-bio.NC cs.AI cs.CV stat.ML

    A minimalistic representation model for head direction system

    Authors: Minglu Zhao, Dehong Xu, Deqian Kong, Wen-Hao Zhang, Ying Nian Wu

    Abstract: We present a minimalistic representation model for the head direction (HD) system, aiming to learn a high-dimensional representation of head direction that captures essential properties of HD cells. Our model is a representation of rotation group $U(1)$, and we study both the fully connected version and convolutional version. We demonstrate the emergence of Gaussian-like tuning profiles and a 2D c… ▽ More

    Submitted 2 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

    Comments: Proceedings of the Annual Meeting of the Cognitive Science Society (CogSci 2025)

  34. arXiv:2411.09686  [pdf, other

    stat.ML cs.LG

    Conditional regression for the Nonlinear Single-Variable Model

    Authors: Yantao Wu, Mauro Maggioni

    Abstract: Several statistical models for regression of a function $F$ on $\mathbb{R}^d$ without the statistical and computational curse of dimensionality exist, for example by imposing and exploiting geometric assumptions on the distribution of the data (e.g. that its support is low-dimensional), or strong smoothness assumptions on $F$, or a special structure $F$. Among the latter, compositional models assu… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

    Comments: 55 pages, 10 figures

    MSC Class: 62G08

  35. arXiv:2411.01371  [pdf, other

    cs.LG stat.ML

    Network Causal Effect Estimation In Graphical Models Of Contagion And Latent Confounding

    Authors: Yufeng Wu, Rohit Bhattacharya

    Abstract: A key question in many network studies is whether the observed correlations between units are primarily due to contagion or latent confounding. Here, we study this question using a segregated graph (Shpitser, 2015) representation of these mechanisms, and examine how uncertainty about the true underlying mechanism impacts downstream computation of network causal effects, particularly under full int… ▽ More

    Submitted 5 March, 2025; v1 submitted 2 November, 2024; originally announced November 2024.

    Comments: 27 pages, in proceedings of the 4th Conference on Causal Learning and Reasoning

  36. arXiv:2410.15241  [pdf, other

    cs.LG stat.ML

    Conditional Uncertainty Quantification for Tensorized Topological Neural Networks

    Authors: Yujia Wu, Bo Yang, Yang Zhao, Elynn Chen, Yuzhou Chen, Zheshi Zheng

    Abstract: Graph Neural Networks (GNNs) have become the de facto standard for analyzing graph-structured data, leveraging message-passing techniques to capture both structural and node feature information. However, recent studies have raised concerns about the statistical reliability of uncertainty estimates produced by GNNs. This paper addresses this crucial challenge by introducing a novel technique for qu… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.12007

  37. arXiv:2410.15239  [pdf, other

    cs.LG stat.ML

    Conditional Prediction ROC Bands for Graph Classification

    Authors: Yujia Wu, Bo Yang, Elynn Chen, Yuzhou Chen, Zheshi Zheng

    Abstract: Graph classification in medical imaging and drug discovery requires accuracy and robust uncertainty quantification. To address this need, we introduce Conditional Prediction ROC (CP-ROC) bands, offering uncertainty quantification for ROC curves and robustness to distributional shifts in test data. Although developed for Tensorized Graph Neural Networks (TGNNs), CP-ROC is adaptable to general Graph… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

  38. arXiv:2410.12730  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Counterfactual Generative Modeling with Variational Causal Inference

    Authors: Yulun Wu, Louie McConnell, Claudia Iriondo

    Abstract: Estimating an individual's counterfactual outcomes under interventions is a challenging task for traditional causal inference and supervised learning approaches when the outcome is high-dimensional (e.g. gene expressions, facial images) and covariates are relatively limited. In this case, to predict one's outcomes under counterfactual treatments, it is crucial to leverage individual information co… ▽ More

    Submitted 18 March, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Published as a conference paper at ICLR 2025

  39. arXiv:2410.11359  [pdf, other

    cs.LG cs.RO stat.ML

    DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting

    Authors: Eric Hanchen Jiang, Zhi Zhang, Dinghuai Zhang, Andrew Lizarraga, Chenheng Xu, Yasi Zhang, Siyan Zhao, Zhengjie Xu, Peiyu Yu, Yuer Tang, Deqian Kong, Ying Nian Wu

    Abstract: Advancements in reinforcement learning have led to the development of sophisticated models capable of learning complex decision-making tasks. However, efficiently integrating world models with decision transformers remains a challenge. In this paper, we introduce a novel approach that combines the Dreamer algorithm's ability to generate anticipatory trajectories with the adaptive learning strength… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  40. arXiv:2410.06281  [pdf, other

    stat.ME

    Sequential Design with Derived Win Statistics

    Authors: Baoshan Zhang, Yuan Wu

    Abstract: The Win Ratio has gained significant traction in cardiovascular trials as a novel method for analyzing composite endpoints (Pocock and others, 2012). Compared with conventional approaches based on time to the first event, the Win Ratio accommodates the varying priorities and types of outcomes among components, potentially offering greater statistical power by fully utilizing the information contai… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 36 pages, 1 figure, 5 tables

  41. arXiv:2410.04760  [pdf, other

    stat.ML cs.LG

    Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models

    Authors: Yuchen Wu, Yuxin Chen, Yuting Wei

    Abstract: Diffusion models play a pivotal role in contemporary generative modeling, claiming state-of-the-art performance across various domains. Despite their superior sample quality, mainstream diffusion-based stochastic samplers like DDPM often require a large number of score function evaluations, incurring considerably higher computational cost compared to single-step generators like generative adversar… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: 45 pages, 3 figures

  42. arXiv:2409.10835  [pdf, other

    stat.ME stat.AP

    BMRMM: An R Package for Bayesian Markov (Renewal) Mixed Models

    Authors: Yutong Wu, Abhra Sarkar

    Abstract: We introduce the BMRMM package implementing Bayesian inference for a class of Markov renewal mixed models which can characterize the stochastic dynamics of a collection of sequences, each comprising alternative instances of categorical states and associated continuous duration times, while being influenced by a set of exogenous factors as well as a 'random' individual. The default setting flexibly… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 20 pages, 11 figures

  43. arXiv:2409.08551  [pdf, other

    stat.ML cs.LG

    Think Twice Before You Act: Improving Inverse Problem Solving With MCMC

    Authors: Yaxuan Zhu, Zehao Dou, Haoxin Zheng, Yasi Zhang, Ying Nian Wu, Ruiqi Gao

    Abstract: Recent studies demonstrate that diffusion models can serve as a strong prior for solving inverse problems. A prominent example is Diffusion Posterior Sampling (DPS), which approximates the posterior distribution of data given the measure using Tweedie's formula. Despite the merits of being versatile in solving various inverse problems without re-training, the performance of DPS is hindered by the… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

  44. arXiv:2409.05276  [pdf, ps, other

    stat.ME

    An Eigengap Ratio Test for Determining the Number of Communities in Network Data

    Authors: Yujia Wu, Jingfei Zhang, Wei Lan, Chih-Ling Tsai

    Abstract: To characterize the community structure in network data, researchers have introduced various block-type models, including the stochastic block model, degree-corrected stochastic block model, mixed membership block model, degree-corrected mixed membership block model, and others. A critical step in applying these models effectively is determining the number of communities in the network. However, t… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

  45. arXiv:2409.04982  [pdf, other

    cs.CV math.PR stat.ML

    2DSig-Detect: a semi-supervised framework for anomaly detection on image data using 2D-signatures

    Authors: Xinheng Xie, Kureha Yamaguchi, Margaux Leblanc, Simon Malzard, Varun Chhabra, Victoria Nockles, Yue Wu

    Abstract: The rapid advancement of machine learning technologies raises questions about the security of machine learning models, with respect to both training-time (poisoning) and test-time (evasion, impersonation, and inversion) attacks. Models performing image-related tasks, e.g. detection, and classification, are vulnerable to adversarial attacks that can degrade their performance and produce undesirable… ▽ More

    Submitted 20 March, 2025; v1 submitted 8 September, 2024; originally announced September 2024.

    MSC Class: 60L99; 68T10

  46. arXiv:2409.03845  [pdf, other

    cs.LG stat.ML

    Latent Space Energy-based Neural ODEs

    Authors: Sheng Cheng, Deqian Kong, Jianwen Xie, Kookjin Lee, Ying Nian Wu, Yezhou Yang

    Abstract: This paper introduces novel deep dynamical models designed to represent continuous-time sequences. Our approach employs a neural emission model to generate each data point in the time series through a non-linear transformation of a latent state vector. The evolution of these latent states is implicitly defined by a neural ordinary differential equation (ODE), with the initial state drawn from an i… ▽ More

    Submitted 5 February, 2025; v1 submitted 5 September, 2024; originally announced September 2024.

  47. arXiv:2408.10609  [pdf, ps, other

    cs.LG q-bio.GN stat.ML

    PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis

    Authors: Yan Wu, Esther Wershof, Sebastian M Schmon, Marcel Nassar, Błażej Osiński, Ridvan Eksi, Zichao Yan, Rory Stark, Kun Zhang, Thore Graepel

    Abstract: We introduce a comprehensive framework for perturbation response modeling in single cells, aimed at standardizing benchmarking in this rapidly evolving field. Our approach includes a modular and user-friendly model development and evaluation platform, a collection of diverse perturbational datasets, and a set of metrics designed to fairly compare models and dissect their performance nuances. Throu… ▽ More

    Submitted 16 June, 2025; v1 submitted 20 August, 2024; originally announced August 2024.

    Comments: 10 pages plus 20 pages supplementary material. Code is available at https://github.com/altoslabs/perturbench

  48. arXiv:2408.09722  [pdf, other

    cs.LG stat.ML

    Towards Few-Shot Learning in the Open World: A Review and Beyond

    Authors: Hui Xue, Yuexuan An, Yongchun Qin, Wenqian Li, Yixin Wu, Yongjuan Che, Pengfei Fang, Minling Zhang

    Abstract: Human intelligence is characterized by our ability to absorb and apply knowledge from the world around us, especially in rapidly acquiring new concepts from minimal examples, underpinned by prior knowledge. Few-shot learning (FSL) aims to mimic this capacity by enabling significant generalizations and transferability. However, traditional FSL frameworks often rely on assumptions of clean, complete… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  49. arXiv:2408.07941  [pdf, other

    stat.ML cs.LG

    Robust Offline Active Learning on Graphs

    Authors: Yuanchen Wu, Yubai Yuan

    Abstract: We consider the problem of active learning on graphs, which has crucial applications in many real-world networks where labeling node responses is expensive. In this paper, we propose an offline active learning method that selects nodes to query by explicitly incorporating information from both the network structure and node covariates. Building on graph signal recovery theories and the random spec… ▽ More

    Submitted 6 November, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  50. arXiv:2406.19550  [pdf, other

    stat.ME math.ST

    Provably Efficient Posterior Sampling for Sparse Linear Regression via Measure Decomposition

    Authors: Andrea Montanari, Yuchen Wu

    Abstract: We consider the problem of sampling from the posterior distribution of a $d$-dimensional coefficient vector $\boldsymbolθ$, given linear observations $\boldsymbol{y} = \boldsymbol{X}\boldsymbolθ+\boldsymbol{\varepsilon}$. In general, such posteriors are multimodal, and therefore challenging to sample from. This observation has prompted the exploration of various heuristics that aim at approximatin… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 29 pages, 10 figures