Showing 1–50 of 264 results for author: Yang, S

Searching in archive stat.
  1. arXiv:2507.00173  [pdf, ps, other]

    stat.ME stat.AP

    Penalized FCI for Causal Structure Learning in a Sparse DAG for Biomarker Discovery in Parkinson's Disease

    Authors: Samhita Pal, Dhrubajyoti Ghosh, Shu Yang

    Abstract: Parkinson's disease (PD) is a progressive neurodegenerative disorder that lacks reliable early-stage biomarkers for diagnosis, prognosis, and therapeutic monitoring. While cerebrospinal fluid (CSF) biomarkers, such as alpha-synuclein seed amplification assays (alphaSyn-SAA), offer diagnostic potential, their clinical utility is limited by invasiveness and incomplete specificity. Plasma biomarkers…

    Submitted 30 June, 2025; originally announced July 2025.

  2. arXiv:2506.09325  [pdf, ps, other]

    stat.ME

    A Spectral Confounder Adjustment for Spatial Regression with Multiple Exposures and Outcomes

    Authors: Shih-Ni Prim, Yawen Guan, Shu Yang, Ana G Rappold, K. Lloyd Hill, Wei-Lun Tsai, Corinna Keeler, Brian J Reich

    Abstract: Unmeasured spatial confounding complicates exposure effect estimation in environmental health studies. This problem is exacerbated in studies with multiple health outcomes and environmental exposure variables, as the source and magnitude of confounding bias may differ across exposure/outcome pairs. We propose to mitigate the effects of spatial confounding in multivariate studies by projecting to t…

    Submitted 10 June, 2025; originally announced June 2025.

  3. arXiv:2506.03943  [pdf, ps, other]

    cs.LG stat.ML

    Lower Ricci Curvature for Hypergraphs

    Authors: Shiyi Yang, Can Chen, Didong Li

    Abstract: Networks with higher-order interactions, prevalent in biological, social, and information systems, are naturally represented as hypergraphs, yet their structural complexity poses fundamental challenges for geometric characterization. While curvature-based methods offer powerful insights in graph analysis, existing extensions to hypergraphs suffer from critical trade-offs: combinatorial approaches…

    Submitted 4 June, 2025; originally announced June 2025.

  4. arXiv:2505.21723  [pdf, ps, other]

    stat.CO cs.LG stat.ML

    Are Statistical Methods Obsolete in the Era of Deep Learning?

    Authors: Skyler Wu, Shihao Yang, S. C. Kou

    Abstract: In the era of AI, neural networks have become increasingly popular for modeling, inference, and prediction, largely due to their potential for universal approximation. With the proliferation of such deep learning models, a question arises: are leaner statistical methods still relevant? To shed insight on this question, we employ the mechanistic nonlinear ordinary differential equation (ODE) invers…

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 35 pages, 11 figures (main text)

  5. arXiv:2505.19491  [pdf, ps, other]

    cs.LG stat.ML

    Discounted Online Convex Optimization: Uniform Regret Across a Continuous Interval

    Authors: Wenhao Yang, Sifan Yang, Lijun Zhang

    Abstract: Reflecting the greater significance of recent history over the distant past in non-stationary environments, $λ$-discounted regret has been introduced in online convex optimization (OCO) to gracefully forget past data as new information arrives. When the discount factor $λ$ is given, online gradient descent with an appropriate step size achieves an $O(1/\sqrt{1-λ})$ discounted regret. However, the…

    Submitted 26 May, 2025; originally announced May 2025.

  6. arXiv:2505.09949  [pdf]

    cs.LG cs.CL stat.AP

    Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors

    Authors: Ahmed S. Abdelrahman, Mohamed Abdel-Aty, Samgyu Yang, Abdulrahman Faden

    Abstract: Understanding the factors contributing to traffic crashes and developing strategies to mitigate their severity is essential. Traditional statistical methods and machine learning models often struggle to capture the complex interactions between various factors and the unique characteristics of each crash. This research leverages large language model (LLM) to analyze freeway crash data and provide c…

    Submitted 15 May, 2025; originally announced May 2025.

  7. arXiv:2505.08092  [pdf, ps, other]

    stat.ME stat.ML

    Doubly Robust Fusion of Many Treatments for Policy Learning

    Authors: Ke Zhu, Jianing Chu, Ilya Lipkovich, Wenyu Ye, Shu Yang

    Abstract: Individualized treatment rules/recommendations (ITRs) aim to improve patient outcomes by tailoring treatments to the characteristics of each individual. However, when there are many treatment groups, existing methods face significant challenges due to data sparsity within treatment groups and highly unbalanced covariate distributions across groups. To address these challenges, we propose a novel c…

    Submitted 23 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

    Comments: Accepted by ICML 2025

  8. arXiv:2505.02675  [pdf, other]

    stat.ME stat.AP

    Attractor-Based Coevolving Dot Product Random Graph Model

    Authors: Shiwen Yang, Daniel L. Sussman

    Abstract: We introduce the attractor-based coevolving dot product random graph model (ABCDPRGM) to analyze time-series network data manifesting polarizing or flocking behavior. Graphs are generated based on latent positions under the random dot product graph regime. We assign group membership to each node. When evolving through time, the latent position of each node will change based on its current position…

    Submitted 5 May, 2025; originally announced May 2025.

  9. arXiv:2505.00217  [pdf, other]

    stat.ME

    Robust Estimation and Inference in Hybrid Controlled Trials for Binary Outcomes: A Case Study on Non-Small Cell Lung Cancer

    Authors: Jiajun Liu, Ke Zhu, Shu Yang, Xiaofei Wang

    Abstract: Hybrid controlled trials (HCTs), which augment randomized controlled trials (RCTs) with external controls (ECs), are increasingly receiving attention as a way to address limited power, slow accrual, and ethical concerns in clinical research. However, borrowing from ECs raises critical statistical challenges in estimation and inference, especially for binary outcomes where hidden bias is harder to…

    Submitted 30 April, 2025; originally announced May 2025.

  10. arXiv:2504.19924  [pdf, other]

    stat.ME

    Collaborative Inference for Sparse High-Dimensional Models with Non-Shared Data

    Authors: Yifan Gu, Hanfang Yang, Songshan Yang, Hui Zou

    Abstract: In modern data analysis, statistical efficiency improvement is expected via effective collaboration among multiple data holders with non-shared data. In this article, we propose a collaborative score-type test (CST) for testing linear hypotheses, which accommodates potentially high-dimensional nuisance parameters and a diverging number of constraints and target parameters. Through a careful decomp…

    Submitted 28 April, 2025; v1 submitted 28 April, 2025; originally announced April 2025.

  11. arXiv:2504.16172  [pdf, other]

    math.NA cs.AI cs.LG math.PR stat.ML

    Physics-Informed Inference Time Scaling via Simulation-Calibrated Scientific Machine Learning

    Authors: Zexi Fan, Yan Sun, Shihao Yang, Yiping Lu

    Abstract: High-dimensional partial differential equations (PDEs) pose significant computational challenges across fields ranging from quantum chemistry to economics and finance. Although scientific machine learning (SciML) techniques offer approximate solutions, they often suffer from bias and neglect crucial physical insights. Inspired by inference-time scaling strategies in language models, we propose Sim…

    Submitted 25 April, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

  12. arXiv:2504.08824  [pdf, other]

    cs.LG cs.AI cs.CV cs.HC stat.AP

    ColonScopeX: Leveraging Explainable Expert Systems with Multimodal Data for Improved Early Diagnosis of Colorectal Cancer

    Authors: Natalia Sikora, Robert L. Manschke, Alethea M. Tang, Peter Dunstan, Dean A. Harris, Su Yang

    Abstract: Colorectal cancer (CRC) ranks as the second leading cause of cancer-related deaths and the third most prevalent malignant tumour worldwide. Early detection of CRC remains problematic due to its non-specific and often embarrassing symptoms, which patients frequently overlook or hesitate to report to clinicians. Crucially, the stage at which CRC is diagnosed significantly impacts survivability, with…

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: Published to AAAI-25 Bridge Program

  13. arXiv:2504.07032  [pdf, other]

    stat.AP

    Restoring the Forecasting Power of Google Trends with Statistical Preprocessing

    Authors: Candice Djorno, Mauricio Santillana, Shihao Yang

    Abstract: Google Trends reports how frequently specific queries are searched on Google over time. It is widely used in research and industry to gain early insights into public interest. However, its data generation mechanism introduces missing values, sampling variability, noise, and trends. These issues arise from privacy thresholds mapping low search volumes to zeros, daily sampling variations causing dis…

    Submitted 9 April, 2025; originally announced April 2025.

  14. arXiv:2503.15967  [pdf, other]

    stat.ME

    Integrative Analysis of High-dimensional RCT and RWD Subject to Censoring and Hidden Confounding

    Authors: Xin Ye, Shu Yang, Xiaofei Wang, Yanyan Liu

    Abstract: In this study, we focus on estimating the heterogeneous treatment effect (HTE) for survival outcome. The outcome is subject to censoring and the number of covariates is high-dimensional. We utilize data from both the randomized controlled trial (RCT), considered as the gold standard, and real-world data (RWD), possibly affected by hidden confounding factors. To achieve a more efficient HTE estimat…

    Submitted 20 March, 2025; originally announced March 2025.

  15. arXiv:2503.15745  [pdf, ps, other]

    stat.ME stat.AP

    Statistical Inference for Heterogeneous Treatment Effect with Right-censored Data from Synthesizing Randomized Clinical Trials and Real-world Data

    Authors: Guangcai Mao, Shu Yang, Xiaofei Wang

    Abstract: The heterogeneous treatment effect plays a crucial role in precision medicine. There is evidence that real-world data, even subject to biases, can be employed as supplementary evidence for randomized clinical trials to improve the statistical efficiency of the heterogeneous treatment effect estimation. In this paper, for survival data with right censoring, we consider estimating the heterogeneous…

    Submitted 19 March, 2025; originally announced March 2025.

  16. arXiv:2503.06864  [pdf, other]

    stat.ME stat.AP

    Doubly robust omnibus sensitivity analysis of externally controlled trials with intercurrent events

    Authors: Chenyin Gao, Xiang Zhang, Shu Yang

    Abstract: Externally controlled trials are crucial in clinical development when randomized controlled trials are unethical or impractical. These trials consist of a full treatment arm with the experimental treatment and a full external control arm. However, they present significant challenges in learning the treatment effect due to the lack of randomization and a parallel control group. Besides baseline inc…

    Submitted 9 March, 2025; originally announced March 2025.

  17. arXiv:2502.18647  [pdf]

    stat.AP

    Predicting Long-term Urban Overheating and Their Mitigations from Nature Based Solutions Using Machine Learning and Field Measurements

    Authors: Jiwei Zou, Lin Wang, Senwen Yang, Michael Lacasse, Liangzhu Wang

    Abstract: Urban overheating, exacerbated by climate change, threatens public health and urban sustainability. Traditional approaches, such as numerical simulations and field measurements, face challenges due to uncertainties in input data. This study integrates field measurements with machine learning models to predict the duration and severity of future urban overheating events, focusing on the role of urb…

    Submitted 25 February, 2025; originally announced February 2025.

  18. arXiv:2502.07244  [pdf, other]

    cs.LG cs.AI stat.ML

    Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

    Authors: Jiecheng Lu, Shihao Yang

    Abstract: Autoregressive attention-based time series forecasting (TSF) has drawn increasing interest, with mechanisms like linear attention sometimes outperforming vanilla attention. However, deeper Transformer architectures frequently misalign with autoregressive objectives, obscuring the underlying VAR structure embedded within linear attention and hindering their ability to capture the data generative pr…

    Submitted 10 February, 2025; originally announced February 2025.

  19. arXiv:2501.18798  [pdf, other]

    stat.ME math.ST stat.ML

    Targeted Data Fusion for Causal Survival Analysis Under Distribution Shift

    Authors: Yi Liu, Alexander W. Levis, Ke Zhu, Shu Yang, Peter B. Gilbert, Larry Han

    Abstract: Causal inference across multiple data sources offers a promising avenue to enhance the generalizability and replicability of scientific findings. However, data integration methods for time-to-event outcomes, common in biomedical research, are underdeveloped. Existing approaches focus on binary or continuous outcomes but fail to address the unique challenges of survival analysis, such as censoring…

    Submitted 14 May, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

  20. arXiv:2501.16958  [pdf, other]

    stat.AP

    Estimating the Causal Effect of Redlining on Present-day Air Pollution

    Authors: Xiaodan Zhou, Shu Yang, Brian J Reich

    Abstract: Recent studies have shown associations between redlining policies (1935-1974) and present-day fine particulate matter (PM$_{2.5}$) and nitrogen dioxide (NO$_2$) air pollution concentrations. In this paper, we reevaluate these associations using spatial causal inference. Redlining policies enacted in the 1930s, so there is very limited documentation of pre-treatment covariates. Consequently, tradit…

    Submitted 14 March, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

  21. arXiv:2501.14107  [pdf, other]

    stat.ML cs.LG

    EFiGP: Eigen-Fourier Physics-Informed Gaussian Process for Inference of Dynamic Systems

    Authors: Jianhong Chen, Shihao Yang

    Abstract: Parameter estimation and trajectory reconstruction for data-driven dynamical systems governed by ordinary differential equations (ODEs) are essential tasks in fields such as biology, engineering, and physics. These inverse problems -- estimating ODE parameters from observational data -- are particularly challenging when the data are noisy, sparse, and the dynamics are nonlinear. We propose the Eig…

    Submitted 23 January, 2025; originally announced January 2025.

  22. arXiv:2501.08945  [pdf, other]

    stat.ME

    COADVISE: Covariate Adjustment with Variable Selection in Randomized Controlled Trials

    Authors: Yi Liu, Ke Zhu, Larry Han, Shu Yang

    Abstract: Adjusting for covariates in randomized controlled trials can enhance the credibility and efficiency of treatment effect estimation. However, handling numerous covariates and their complex (non-linear) transformations poses a challenge. Motivated by the case study of the Best Apnea Interventions for Research (BestAIR) trial data from the National Sleep Research Resource (NSRR), where the number of…

    Submitted 26 February, 2025; v1 submitted 15 January, 2025; originally announced January 2025.

  23. arXiv:2501.02128  [pdf, other]

    stat.AP

    Transfer Learning for Individualized Treatment Rules: Application to Sepsis Patients Data from eICU-CRD and MIMIC-III Databases

    Authors: Andong Wang, Kelly Wentzlof, Johnny Rajala, Miontranese Green, Yunshu Zhang, Shu Yang

    Abstract: Modern precision medicine aims to utilize real-world data to provide the best treatment for an individual patient. An individualized treatment rule (ITR) maps each patient's characteristics to a recommended treatment scheme that maximizes the expected outcome of the patient. A challenge precision medicine faces is population heterogeneity, as studies on treatment effects are often conducted on sou…

    Submitted 3 January, 2025; originally announced January 2025.

    Comments: 23 pages, 4 figures

  24. arXiv:2412.11575  [pdf, other]

    stat.ME q-fin.PM

    Cost-aware Portfolios in a Large Universe of Assets

    Authors: Qingliang Fan, Marcelo C. Medeiros, Hanming Yang, Songshan Yang

    Abstract: This paper considers the finite horizon portfolio rebalancing problem in terms of mean-variance optimization, where decisions are made based on current information on asset returns and transaction costs. The study's novelty is that the transaction costs are integrated within the optimization problem in a high-dimensional portfolio setting where the number of assets is larger than the sample size.…

    Submitted 16 December, 2024; originally announced December 2024.

  25. arXiv:2411.17766  [pdf, ps, other]

    cs.LG stat.ML

    Integrating Dual Prototypes for Task-Wise Adaption in Pre-Trained Model-Based Class-Incremental Learning

    Authors: Zhiming Xu, Suorong Yang, Baile Xu, Furao Shen, Jian Zhao

    Abstract: Class-incremental learning (CIL) aims to acquire new classes while conserving historical knowledge incrementally. Despite existing pre-trained model (PTM) based methods performing excellently in CIL, it is better to fine-tune them on downstream incremental tasks with massive patterns unknown to PTMs. However, using task streams for fine-tuning could lead to catastrophic forgetting that wi…

    Submitted 1 July, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: 10 pages, 9 figures, 2 tables

  26. arXiv:2411.12578  [pdf, other]

    stat.ME math.ST

    Robust Inference for High-dimensional Linear Models with Heavy-tailed Errors via Partial Gini Covariance

    Authors: Yilin Zhang, Songshan Yang, Yunan Wu, Lan Wang

    Abstract: This paper introduces the partial Gini covariance, a novel dependence measure that addresses the challenges of high-dimensional inference with heavy-tailed errors, often encountered in fields like finance, insurance, climate, and biology. Conventional high-dimensional regression inference methods suffer from inaccurate type I errors and reduced power in heavy-tailed contexts, limiting their effect…

    Submitted 20 November, 2024; v1 submitted 19 November, 2024; originally announced November 2024.

  27. arXiv:2411.12277  [pdf, other]

    stat.AP

    O-MAGIC: Online Change-Point Detection for Dynamic Systems

    Authors: Yan Sun, Yeping Wang, Zhaohui Li, Shihao Yang

    Abstract: The capture of changes in dynamic systems, especially ordinary differential equations (ODEs), is an important and challenging task, with multiple applications in biomedical research and other scientific areas. This article proposes a fast and mathematically rigorous online method, called ODE-informed MAnifold-constrained Gaussian process Inference for Change point detection (O-MAGIC), to detect cha…

    Submitted 19 November, 2024; originally announced November 2024.

  28. arXiv:2411.05852  [pdf, other]

    cs.LG stat.ML

    $\spadesuit$ SPADE $\spadesuit$ Split Peak Attention DEcomposition

    Authors: Malcolm Wolff, Kin G. Olivares, Boris Oreshkin, Sunny Ruan, Sitan Yang, Abhinav Katoch, Shankar Ramasubramanian, Youxin Zhang, Michael W. Mahoney, Dmitry Efimov, Vincent Quenneville-Bélair

    Abstract: Demand forecasting faces challenges induced by Peak Events (PEs) corresponding to special periods such as promotions and holidays. Peak events create significant spikes in demand followed by demand ramp down periods. Neural networks like MQCNN and MQT overreact to demand peaks by carrying over the elevated PE demand into subsequent Post-Peak-Event (PPE) periods, resulting in significantly over-bia…

    Submitted 21 January, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

    Journal ref: 31st Conference on Neural Information Processing In 38th Conference on Neural Information Processing Systems NIPS 2017, Time Series in the Age of Large Models Workshop, 2024

  29. arXiv:2410.21213  [pdf, other]

    stat.ME

    Spatial causal inference in the presence of preferential sampling to study the impacts of marine protected areas

    Authors: Dongjae Son, Brian J. Reich, Erin M. Schliep, Shu Yang, David A. Gill

    Abstract: Marine Protected Areas (MPAs) have been established globally to conserve marine resources. Given their maintenance costs and impact on commercial fishing, it is critical to evaluate their effectiveness to support future conservation. In this paper, we use data collected from the Australian coast to estimate the effect of MPAs on biodiversity. Environmental studies such as these are often observati…

    Submitted 28 October, 2024; originally announced October 2024.

  30. arXiv:2410.18409  [pdf, other]

    stat.ME stat.AP

    Doubly protected estimation for survival outcomes utilizing external controls for randomized clinical trials

    Authors: Chenyin Gao, Shu Yang, Mingyang Shan, Wenyu Wendy Ye, Ilya Lipkovich, Douglas Faries

    Abstract: Censored survival data are common in clinical trials, but small control groups can pose challenges, particularly in rare diseases or where balanced randomization is impractical. Recent approaches leverage external controls from historical studies or real-world data to strengthen treatment evaluation for survival outcomes. However, using external controls directly may introduce biases due to data h…

    Submitted 14 May, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: accepted at ICML 2025

  31. arXiv:2410.11713  [pdf, other]

    stat.ME

    Enhancing Statistical Validity and Power in Hybrid Controlled Trials: A Randomization Inference Approach with Conformal Selective Borrowing

    Authors: Ke Zhu, Shu Yang, Xiaofei Wang

    Abstract: External controls from historical trials or observational data can augment randomized controlled trials when large-scale randomization is impractical or unethical, such as in drug evaluation for rare diseases. However, non-randomized external controls can introduce biases, and existing Bayesian and frequentist methods may inflate the type I error rate, particularly in small-sample trials where ext…

    Submitted 7 May, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: Accepted by ICML 2025

  32. arXiv:2410.03937  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Clustering Alzheimer's Disease Subtypes via Similarity Learning and Graph Diffusion

    Authors: Tianyi Wei, Shu Yang, Davoud Ataee Tarzanagh, Jingxuan Bao, Jia Xu, Patryk Orzechowski, Joost B. Wagenaar, Qi Long, Li Shen

    Abstract: Alzheimer's disease (AD) is a complex neurodegenerative disorder that affects millions of people worldwide. Due to the heterogeneous nature of AD, its diagnosis and treatment pose critical challenges. Consequently, there is a growing research interest in identifying homogeneous AD subtypes that can assist in addressing these challenges in recent years. In this study, we aim to identify subtypes of…

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: ICIBM'23': International Conference on Intelligent Biology and Medicine, Tampa, FL, USA, July 16-19, 2023

  33. arXiv:2410.03159  [pdf, other]

    cs.LG cs.AI stat.ML

    WAVE: Weighted Autoregressive Varying Gate for Time Series Forecasting

    Authors: Jiecheng Lu, Xu Han, Yan Sun, Shihao Yang

    Abstract: We propose a Weighted Autoregressive Varying gatE (WAVE) attention mechanism equipped with both Autoregressive (AR) and Moving-average (MA) components. It can adapt to various attention mechanisms, enhancing and decoupling their ability to capture long-range and local temporal patterns in time series data. In this paper, we first demonstrate that, for the time series forecasting (TSF) task, the pr…

    Submitted 11 February, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

  34. arXiv:2410.01196  [pdf, other]

    stat.AP cs.LG stat.ML

    Expected Diverse Utility (EDU): Diverse Bayesian Optimization of Expensive Computer Simulators

    Authors: John Joshua Miller, Simon Mak, Benny Sun, Sai Ranjeet Narayanan, Suo Yang, Zongxuan Sun, Kenneth S. Kim, Chol-Bum Mike Kweon

    Abstract: The optimization of expensive black-box simulators arises in a myriad of modern scientific and engineering applications. Bayesian optimization provides an appealing solution, by leveraging a fitted surrogate model to guide the selection of subsequent simulator evaluations. In practice, however, the objective is often not to obtain a single good solution, but rather a "basket" of good solutions f…

    Submitted 2 February, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

  35. arXiv:2409.16463  [pdf, ps, other]

    stat.ME math.ST

    Double-Estimation-Friendly Inference for High-Dimensional Measurement Error Models with Non-Sparse Adaptability

    Authors: Shijie Cui, Xu Guo, Songshan Yang, Zhe Zhang

    Abstract: In this paper, we introduce an innovative testing procedure for assessing individual hypotheses in high-dimensional linear regression models with measurement errors. This method remains robust even when either the X-model or Y-model is misspecified. We develop a double robust score function that maintains a zero expectation if one of the models is incorrect, and we construct a corresponding score…

    Submitted 11 January, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

  36. arXiv:2409.09236  [pdf, other]

    stat.ME

    Off-Policy Evaluation with Irregularly-Spaced, Outcome-Dependent Observation Times

    Authors: Xin Chen, Wenbin Lu, Shu Yang, Dipankar Bandyopadhyay

    Abstract: While the classic off-policy evaluation (OPE) literature commonly assumes decision time points to be evenly spaced for simplicity, in many real-world scenarios, such as those involving user-initiated visits, decisions are made at irregularly-spaced and potentially outcome-dependent time points. For a more principled evaluation of the dynamic policies, this paper constructs a novel OPE framework, w…

    Submitted 13 September, 2024; originally announced September 2024.

  37. arXiv:2409.07391  [pdf, other]

    stat.ME

    Improve Sensitivity Analysis Synthesizing Randomized Clinical Trials With Limited Overlap

    Authors: Kuan Jiang, Wenjie Hu, Shu Yang, Xinxing Lai, Xiaohua Zhou

    Abstract: Randomized clinical trials are the gold standard when estimating the average treatment effect. However, they are usually not a random sample from the real-world population because of the inclusion/exclusion rules. Meanwhile, observational studies typically consist of representative samples from the real-world population. However, due to unmeasured confounding, sensitivity analysis is often used to…

    Submitted 10 December, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

  38. arXiv:2408.00799  [pdf, other]

    cs.IR cs.LG stat.ML

    Deep Uncertainty-Based Explore for Index Construction and Retrieval in Recommendation System

    Authors: Xin Jiang, Kaiqiang Wang, Yinlong Wang, Fengchang Lv, Taiyang Peng, Shuai Yang, Xianteng Wu, Pengye Zhang, Shuo Yuan, Yifan Zeng

    Abstract: In recommendation systems, the relevance and novelty of the final results are selected through a cascade system of Matching -> Ranking -> Strategy. The matching model serves as the starting point of the pipeline and determines the upper bound of the subsequent stages. Balancing the relevance and novelty of matching results is a crucial step in the design and optimization of recommendation systems,…

    Submitted 5 August, 2024; v1 submitted 21 July, 2024; originally announced August 2024.

    Comments: Accepted by CIKM 2024

  39. arXiv:2407.18488  [pdf, other]

    cs.LG cs.IT stat.ML

    Conversational Dueling Bandits in Generalized Linear Models

    Authors: Shuhua Yang, Hui Yuan, Xiaoying Zhang, Mengdi Wang, Hong Zhang, Huazheng Wang

    Abstract: Conversational recommendation systems elicit user preferences by interacting with users to obtain their feedback on recommended commodities. Such systems utilize a multi-armed bandit framework to learn user preferences in an online manner and have received great success in recent years. However, existing conversational bandit methods have several limitations. First, they only enable users to provi…

    Submitted 25 July, 2024; originally announced July 2024.

  40. arXiv:2407.15084  [pdf, other]

    stat.ME stat.AP

    High-dimensional log contrast models with measurement errors

    Authors: Wenxi Tan, Lingzhou Xue, Songshan Yang, Xiang Zhan

    Abstract: High-dimensional compositional data are frequently encountered in many fields of modern scientific research. In regression analysis of compositional data, the presence of covariate measurement errors poses grand challenges for existing statistical error-in-variable regression analysis methods since measurement error in one component of the composition has an impact on others. To simultaneously add…

    Submitted 21 July, 2024; originally announced July 2024.

  41. arXiv:2407.09522  [pdf, other]

    cs.DB cs.AI cs.LG stat.ML

    UQE: A Query Engine for Unstructured Databases

    Authors: Hanjun Dai, Bethany Yixin Wang, Xingchen Wan, Bo Dai, Sherry Yang, Azade Nova, Pengcheng Yin, Phitchaya Mangpo Phothilimthana, Charles Sutton, Dale Schuurmans

    Abstract: Analytics on structured data is a mature field with many successful methods. However, most real world data exists in unstructured form, such as images and conversations. We investigate the potential of Large Language Models (LLMs) to enable unstructured data analytics. In particular, we propose a new Universal Query Engine (UQE) that directly interrogates and draws insights from unstructured data…

    Submitted 16 November, 2024; v1 submitted 23 June, 2024; originally announced July 2024.

    Journal ref: NeurIPS 2024

  42. arXiv:2407.04142  [pdf, other]

    stat.ME

    Bayesian Structured Mediation Analysis With Unobserved Confounders

    Authors: Yuliang Xu, Shu Yang, Jian Kang

    Abstract: We explore methods to reduce the impact of unobserved confounders on the causal mediation analysis of high-dimensional mediators with spatially smooth structures, such as brain imaging data. The key approach is to incorporate the latent individual effects, which influence the structured mediators, as unobserved confounders in the outcome model, thereby potentially debiasing the mediation effects.…

    Submitted 4 July, 2024; originally announced July 2024.

  43. arXiv:2406.16221  [pdf, other]

    cs.LG cs.AI cs.GR econ.EM stat.ME

    F-FOMAML: GNN-Enhanced Meta-Learning for Peak Period Demand Forecasting with Proxy Data

    Authors: Zexing Xu, Linjun Zhang, Sitan Yang, Rasoul Etesami, Hanghang Tong, Huan Zhang, Jiawei Han

    Abstract: Demand prediction is a crucial task for e-commerce and physical retail businesses, especially during high-stake sales events. However, the limited availability of historical data from these peak periods poses a significant challenge for traditional forecasting methods. In this paper, we propose a novel approach that leverages strategically chosen proxy data reflective of potential sales patterns f…

    Submitted 23 June, 2024; originally announced June 2024.

    MSC Class: 68T07; 68T05; 62M10; 62M20; 90C90; 91B84

  44. arXiv:2406.13478  [pdf, other]

    stat.ME

    Semiparametric Localized Principal Stratification Analysis with Continuous Strata

    Authors: Yichi Zhang, Shu Yang

    Abstract: Principal stratification is essential for revealing causal mechanisms involving post-treatment intermediate variables, in real-world applications like surrogate marker evaluation. Principal stratification analysis with continuous intermediate variables is increasingly common but challenging due to the infinite principal strata and the nonidentifiability and nonregularity of principal causal effect…

    Submitted 29 January, 2025; v1 submitted 19 June, 2024; originally announced June 2024.

  45. arXiv:2406.04107  [pdf]

    stat.AP

    A Practical Analysis Procedure on Generalizing Comparative Effectiveness in the Randomized Clinical Trial to the Real-world Trial-eligible Population

    Authors: Kuan Jiang, Xin-xing Lai, Shu Yang, Ying Gao, Xiao-Hua Zhou

    Abstract: When evaluating the effectiveness of a drug, a Randomized Controlled Trial (RCT) is often considered the gold standard due to its perfect randomization. While RCT assures strong internal validity, its restricted external validity poses challenges in extending treatment effects to the broader real-world population due to possible heterogeneity in covariates. In this paper, we introduce a procedure…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 21 pages, 3 figures, 3 tables

  46. arXiv:2405.19320  [pdf, other]

    cs.LG cs.AI stat.ML

    Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

    Authors: Shicong Cen, Jincheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai

    Abstract: Reinforcement learning from human feedback (RLHF) has demonstrated great promise in aligning large language models (LLMs) with human preference. Depending on the availability of preference data, both online and offline RLHF are active areas of investigation. A key bottleneck is understanding how to incorporate uncertainty estimation in the reward function learned from the preference data for RLHF,…

    Submitted 18 February, 2025; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: ICLR 2025

  47. arXiv:2405.19206  [pdf, other]

    stat.ML cs.LG

    Matrix Manifold Neural Networks++

    Authors: Xuan Son Nguyen, Shuo Yang, Aymeric Histace

    Abstract: Deep neural networks (DNNs) on Riemannian manifolds have garnered increasing interest in various applied areas. For instance, DNNs on spherical and hyperbolic manifolds have been designed to solve a wide range of computer vision and natural language processing tasks. One of the key factors that contribute to the success of these networks is that spherical and hyperbolic manifolds have the rich alge…

    Submitted 29 May, 2024; originally announced May 2024.

  48. arXiv:2405.16161  [pdf, ps, other]

    stat.ME

    Inference for Optimal Linear Treatment Regimes in Personalized Decision-making

    Authors: Yuwen Cheng, Shu Yang

    Abstract: Personalized decision-making, tailored to individual characteristics, is gaining significant attention. The optimal treatment regime aims to provide the best-expected outcome in the entire population, known as the value function. One approach to determine this optimal regime is by maximizing the Augmented Inverse Probability Weighting (AIPW) estimator of the value function. However, the derived tr…

    Submitted 25 May, 2024; originally announced May 2024.

  49. arXiv:2405.14982  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    In-context Time Series Predictor

    Authors: Jiecheng Lu, Yan Sun, Shihao Yang

    Abstract: Recent Transformer-based large language models (LLMs) demonstrate in-context learning ability to perform various functions based solely on the provided context, without updating model parameters. To fully utilize the in-context capabilities in time series forecasting (TSF) problems, unlike previous Transformer-based or LLM-based time series forecasting methods, we reformulate "time series forecast…

    Submitted 23 May, 2024; originally announced May 2024.

  50. arXiv:2405.11377  [pdf, other]

    stat.ML cs.LG stat.ME

    Causal Customer Churn Analysis with Low-rank Tensor Block Hazard Model

    Authors: Chenyin Gao, Zhiming Zhang, Shu Yang

    Abstract: This study introduces an innovative method for analyzing the impact of various interventions on customer churn, using the potential outcomes framework. We present a new causal model, the tensorized latent factor block hazard model, which incorporates tensor completion methods for a principled causal analysis of customer churn. A crucial element of our approach is the formulation of a 1-bit tensor…

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in ICML, 2024