Skip to main content

Showing 1–50 of 212 results for author: Xie, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.23113  [pdf, ps, other

    stat.AP

    Watermark in the Classroom: A Conformal Framework for Adaptive AI Usage Detection

    Authors: Yangxinyu Xie, Xuyang Chen, Zhimei Ren, Weijie J. Su

    Abstract: As artificial intelligence tools become ubiquitous in education, maintaining academic integrity while accommodating pedagogically beneficial AI assistance presents unprecedented challenges. Current AI detection systems fail to control false positive rates (FPR) and suffer from bias against minority student groups, prompting institutional suspensions of these technologies. Watermarking techniques o… ▽ More

    Submitted 30 July, 2025; originally announced July 2025.

  2. arXiv:2506.18562  [pdf, ps, other

    stat.ME

    Multi-Rank Subspace Change-Point Detection for Monitoring Robotic Swarms

    Authors: Jonghyeok Lee, Yao Xie, Youngser Park, Jason Hindes, Ira Schwartz, Carey Priebe

    Abstract: We study the problem of real-time detection of covariance structure changes in high-dimensional streaming data, motivated by applications such as robotic swarm monitoring. Building upon the spiked covariance model, we propose the multi-rank Subspace-CUSUM procedure, which extends the classical CUSUM framework by tracking the top principal components to approximate a likelihood ratio. We provide a… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  3. arXiv:2505.18526  [pdf, ps, other

    stat.ML cs.LG

    Scalable Gaussian Processes with Low-Rank Deep Kernel Decomposition

    Authors: Yunqin Zhu, Henry Shaowu Yuchi, Yao Xie

    Abstract: Kernels are key to encoding prior beliefs and data structures in Gaussian process (GP) models. The design of expressive and scalable kernels has garnered significant research attention. Deep kernel learning enhances kernel flexibility by feeding inputs through a neural network before applying a standard parametric form. However, this approach remains limited by the choice of base kernels, inherits… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  4. arXiv:2505.16051  [pdf, ps, other

    stat.ML cs.LG

    PO-Flow: Flow-based Generative Models for Sampling Potential Outcomes and Counterfactuals

    Authors: Dongze Wu, David I. Inouye, Yao Xie

    Abstract: We propose PO-Flow, a novel continuous normalizing flow (CNF) framework for causal inference that jointly models potential outcomes and counterfactuals. Trained via flow matching, PO-Flow provides a unified framework for individualized potential outcome prediction, counterfactual predictions, and uncertainty-aware density learning. Among generative models, it is the first to enable density learnin… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  5. arXiv:2504.09706  [pdf, other

    stat.AP stat.ML

    Modeling Discrete Coating Degradation Events via Hawkes Processes

    Authors: Matthew Repasky, Henry Yuchi, Fritz Friedersdorf, Yao Xie

    Abstract: Forecasting the degradation of coated materials has long been a topic of critical interest in engineering, as it has enormous implications for both system maintenance and sustainable material use. Material degradation is affected by many factors, including the history of corrosion and characteristics of the environment, which can be measured by high-frequency sensors. However, the high volume of d… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

  6. arXiv:2504.09348   

    stat.ME cs.LG eess.SP

    Graph-Based Prediction Models for Data Debiasing

    Authors: Dongze Wu, Hanyang Jiang, Yao Xie

    Abstract: Bias in data collection, arising from both under-reporting and over-reporting, poses significant challenges in critical applications such as healthcare and public safety. In this work, we introduce Graph-based Over- and Under-reporting Debiasing (GROUD), a novel graph-based optimization framework that debiases reported data by jointly estimating the true incident counts and the associated reportin… ▽ More

    Submitted 18 April, 2025; v1 submitted 12 April, 2025; originally announced April 2025.

    Comments: We submitted this arXiv version by mistake. We have decided to update the original submission (arXiv:2307.07898) instead of submitting a separate article

  7. arXiv:2504.06364  [pdf, ps, other

    stat.ML cs.LG math.ST

    Deep spatio-temporal point processes: Advances and new directions

    Authors: Xiuyuan Cheng, Zheng Dong, Yao Xie

    Abstract: Spatio-temporal point processes (STPPs) model discrete events distributed in time and space, with important applications in areas such as criminology, seismology, epidemiology, and social networks. Traditional models often rely on parametric kernels, limiting their ability to capture heterogeneous, nonstationary dynamics. Recent innovations integrate deep neural architectures -- either by modeling… ▽ More

    Submitted 22 August, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: Annual Review of Statistics and Its Application, 2025

  8. arXiv:2502.13394  [pdf, ps, other

    cs.LG math.ST stat.ML

    Flow-based generative models as iterative algorithms in probability space

    Authors: Yao Xie, Xiuyuan Cheng

    Abstract: Generative AI (GenAI) has revolutionized data-driven modeling by enabling the synthesis of high-dimensional data across various applications, including image generation, language modeling, biomedical signal processing, and anomaly detection. Flow-based generative models provide a powerful framework for capturing complex probability distributions, offering exact likelihood estimation, efficient sam… ▽ More

    Submitted 6 September, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: IEEE Signal Processing Magazine, Special Issue on The Mathematics of Deep Learning, 2025

  9. arXiv:2502.05709  [pdf, other

    cs.LG stat.ML

    Flow-based Conformal Prediction for Multi-dimensional Time Series

    Authors: Junghwan Lee, Chen Xu, Yao Xie

    Abstract: Conformal prediction for time series presents two key challenges: (1) leveraging sequential correlations in features and non-conformity scores and (2) handling multi-dimensional outcomes. We propose a novel conformal prediction method to address these two key challenges by integrating Transformer and Normalizing Flow. Specifically, the Transformer encodes the historical context of time series, and… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

  10. arXiv:2501.16393  [pdf, other

    cs.LG cs.CR stat.ML

    Improving Network Threat Detection by Knowledge Graph, Large Language Model, and Imbalanced Learning

    Authors: Lili Zhang, Quanyan Zhu, Herman Ray, Ying Xie

    Abstract: Network threat detection has been challenging due to the complexities of attack activities and the limitation of historical threat data to learn from. To help enhance the existing practices of using analytics, machine learning, and artificial intelligence methods to detect the network threats, we propose an integrated modelling framework, where Knowledge Graph is used to analyze the users' activit… ▽ More

    Submitted 14 May, 2025; v1 submitted 26 January, 2025; originally announced January 2025.

    Comments: Accepted by "Combining AI and OR/MS for Better Trustworthy Decision Making" Bridge Program co-organized by AAAI and INFORMS as poster and demo

  11. arXiv:2412.20556  [pdf, ps, other

    stat.ML cs.LG math.OC

    Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces

    Authors: Linglingzhi Zhu, Yao Xie

    Abstract: We consider a minimax problem motivated by distributionally robust optimization (DRO) when the worst-case distribution is continuous, leading to significant computational challenges due to the infinite-dimensional nature of the optimization problem. Recent research has explored learning the worst-case distribution using neural network-based generative models to address these computational challeng… ▽ More

    Submitted 29 December, 2024; originally announced December 2024.

  12. arXiv:2412.16523  [pdf, other

    cs.LG cs.CY physics.soc-ph stat.ML

    Physics-Guided Fair Graph Sampling for Water Temperature Prediction in River Networks

    Authors: Erhu He, Declan Kutscher, Yiqun Xie, Jacob Zwart, Zhe Jiang, Huaxiu Yao, Xiaowei Jia

    Abstract: This work introduces a novel graph neural networks (GNNs)-based method to predict stream water temperature and reduce model bias across locations of different income and education levels. Traditional physics-based models often have limited accuracy because they are necessarily approximations of reality. Recently, there has been an increasing interest of using GNNs in modeling complex water dynamic… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

  13. arXiv:2412.15315  [pdf, other

    stat.ML cs.LG

    Enhancing Masked Time-Series Modeling via Dropping Patches

    Authors: Tianyu Qiu, Yi Xie, Yun Xiong, Hao Niu, Xiaofeng Gao

    Abstract: This paper explores how to enhance existing masked time-series modeling by randomly dropping sub-sequence level patches of time series. On this basis, a simple yet effective method named DropPatch is proposed, which has two remarkable advantages: 1) It improves the pre-training efficiency by a square-level advantage; 2) It provides additional advantages for modeling in scenarios such as in-domain,… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

  14. arXiv:2412.01098  [pdf, other

    stat.ML cs.LG

    Spatial Conformal Inference through Localized Quantile Regression

    Authors: Hanyang Jiang, Yao Xie

    Abstract: Reliable uncertainty quantification at unobserved spatial locations, especially in the presence of complex and heterogeneous datasets, remains a core challenge in spatial statistics. Traditional approaches like Kriging rely heavily on assumptions such as normality, which often break down in large-scale, diverse datasets, leading to unreliable prediction intervals. While machine learning methods ha… ▽ More

    Submitted 15 February, 2025; v1 submitted 1 December, 2024; originally announced December 2024.

  15. arXiv:2411.17099  [pdf, other

    stat.ML cs.LG

    Spatio-Temporal Conformal Prediction for Power Outage Data

    Authors: Hanyang Jiang, Yao Xie, Feng Qiu

    Abstract: In recent years, increasingly unpredictable and severe global weather patterns have frequently caused long-lasting power outages. Building resilience, the ability to withstand, adapt to, and recover from major disruptions, has become crucial for the power industry. To enable rapid recovery, accurately predicting future outage numbers is essential. Rather than relying on simple point estimates, we… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

  16. arXiv:2411.11203  [pdf, ps, other

    stat.ML cs.CL cs.CR cs.LG stat.ME

    Debiasing Watermarks for Large Language Models via Maximal Coupling

    Authors: Yangxinyu Xie, Xiang Li, Tanwi Mallick, Weijie J. Su, Ruixun Zhang

    Abstract: Watermarking language models is essential for distinguishing between human and machine-generated text and thus maintaining the integrity and trustworthiness of digital communication. We present a novel green/red list watermarking approach that partitions the token set into ``green'' and ``red'' lists, subtly increasing the generation probability for green tokens. To correct token distribution bias… ▽ More

    Submitted 12 June, 2025; v1 submitted 17 November, 2024; originally announced November 2024.

    Comments: To appear in Journal of the American Statistical Association (JASA)

  17. arXiv:2411.02694  [pdf, other

    stat.ML cs.LG math.ST

    Point processes with event time uncertainty

    Authors: Xiuyuan Cheng, Tingnan Gong, Yao Xie

    Abstract: Point processes are widely used statistical models for uncovering the temporal patterns in dependent event data. In many applications, the event time cannot be observed exactly, calling for the incorporation of time uncertainty into the modeling of point process data. In this work, we introduce a framework to model time-uncertain point processes possibly on a network. We start by deriving the form… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  18. arXiv:2410.17882  [pdf, ps, other

    cs.LG eess.SY stat.ML

    Identifiable Representation and Model Learning for Latent Dynamic Systems

    Authors: Congxi Zhang, Yongchun Xie

    Abstract: Learning identifiable representations and models from low-level observations is helpful for an intelligent spacecraft to complete downstream tasks reliably. For temporal observations, to ensure that the data generating process is provably inverted, most existing works either assume the noise variables in the dynamic mechanisms are (conditionally) independent or require that the interventions can d… ▽ More

    Submitted 4 December, 2024; v1 submitted 23 October, 2024; originally announced October 2024.

  19. arXiv:2410.02548  [pdf, ps, other

    stat.ML cs.LG

    Local Flow Matching Generative Models

    Authors: Chen Xu, Xiuyuan Cheng, Yao Xie

    Abstract: Flow Matching (FM) is a simulation-free method for learning a continuous and invertible flow to interpolate between two distributions, and in particular to generate data from noise. Inspired by the variational nature of the diffusion process as a gradient flow, we introduce a stepwise FM model called Local Flow Matching (LFM), which consecutively learns a sequence of FM sub-models, each matching a… ▽ More

    Submitted 11 July, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

  20. arXiv:2410.02078  [pdf, other

    stat.ML cs.CV cs.LG

    Posterior sampling via Langevin dynamics based on generative priors

    Authors: Vishal Purohit, Matthew Repasky, Jianfeng Lu, Qiang Qiu, Yao Xie, Xiuyuan Cheng

    Abstract: Posterior sampling in high-dimensional spaces using generative models holds significant promise for various applications, including but not limited to inverse problems and guided generation tasks. Despite many recent developments, generating diverse posterior samples remains a challenge, as existing methods require restarting the entire generative process for each new sample, making the procedure… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  21. arXiv:2409.20547  [pdf, other

    stat.ML cs.LG stat.CO

    Annealing Flow Generative Models Towards Sampling High-Dimensional and Multi-Modal Distributions

    Authors: Dongze Wu, Yao Xie

    Abstract: Sampling from high-dimensional, multi-modal distributions remains a fundamental challenge across domains such as statistical Bayesian inference and physics-based machine learning. In this paper, we propose Annealing Flow (AF), a method built on Continuous Normalizing Flow (CNF) for sampling from high-dimensional and multi-modal distributions. AF is trained with a dynamic Optimal Transport (OT) obj… ▽ More

    Submitted 27 May, 2025; v1 submitted 30 September, 2024; originally announced September 2024.

    Comments: This paper has been accepted to ICML 2025 and will appear in the Proceedings of Machine Learning Research (PMLR)

  22. arXiv:2409.15597  [pdf, other

    stat.ME math.ST

    Higher-criticism for sparse multi-stream change-point detection

    Authors: Tingnan Gong, Alon Kipnis, Yao Xie

    Abstract: We study a statistical procedure based on higher criticism (HC) to address the sparse multi-stream quickest change-point detection problem. Namely, we aim to detect a potential change in the distribution of multiple data streams at some unknown time. If a change occurs, only a few streams are affected, whereas the identity of the affected streams is unknown. The HC-based procedure involves testing… ▽ More

    Submitted 19 April, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: Authors are listed in alphabetical order

  23. arXiv:2409.10882  [pdf, ps, other

    stat.AP

    Spatio-Temporal-Network Point Processes for Modeling Crime Events with Landmarks

    Authors: Zheng Dong, Jorge Mateu, Yao Xie

    Abstract: Self-exciting point processes are widely used to model the contagious effects of crime events living within continuous geographic space, using their occurrence time and locations. However, in urban environments, most events are naturally constrained within the city's street network structure, and the contagious effects of crime are governed by such a network geography. Meanwhile, the complex distr… ▽ More

    Submitted 30 September, 2025; v1 submitted 17 September, 2024; originally announced September 2024.

  24. arXiv:2409.03986  [pdf, other

    cs.LG stat.ML

    An Efficient and Generalizable Symbolic Regression Method for Time Series Analysis

    Authors: Yi Xie, Tianyu Qiu, Yun Xiong, Xiuqi Huang, Xiaofeng Gao, Chao Chen

    Abstract: Time series analysis and prediction methods currently excel in quantitative analysis, offering accurate future predictions and diverse statistical indicators, but generally falling short in elucidating the underlying evolution patterns of time series. To gain a more comprehensive understanding and provide insightful explanations, we utilize symbolic regression techniques to derive explicit express… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  25. arXiv:2408.09672  [pdf, other

    cs.LG math.OC stat.ML

    Regularization for Adversarial Robust Learning

    Authors: Jie Wang, Rui Gao, Yao Xie

    Abstract: Despite the growing prevalence of artificial neural networks in real-world applications, their vulnerability to adversarial attacks remains a significant concern, which motivates us to investigate the robustness of machine learning models. While various heuristics aim to optimize the distributionally robust risk using the $\infty$-Wasserstein metric, such a notion of robustness frequently encounte… ▽ More

    Submitted 22 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

    Comments: 51 pages, 5 figures

  26. arXiv:2408.09258  [pdf, other

    stat.AP

    Atlanta Gun Violence Modeling via Nonstationary Spatio-temporal Point Processes

    Authors: Zheng Dong, Yao Xie

    Abstract: Analysis of gun violence in the United States has utilized various models based on spatiotemporal point processes. Previous studies have identified a contagion effect in gun violence, characterized by bursts of diffusion across urban environments, which can be effectively represented using the self-excitatory spatiotemporal Hawkes process. The Hawkes process and its variants have been successful i… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  27. arXiv:2408.07219  [pdf, other

    cs.LG stat.ME

    Causal Effect Estimation using identifiable Variational AutoEncoder with Latent Confounders and Post-Treatment Variables

    Authors: Yang Xie, Ziqi Xu, Debo Cheng, Jiuyong Li, Lin Liu, Yinghao Zhang, Zaiwen Feng

    Abstract: Estimating causal effects from observational data is challenging, especially in the presence of latent confounders. Much work has been done on addressing this challenge, but most of the existing research ignores the bias introduced by the post-treatment variables. In this paper, we propose a novel method of joint Variational AutoEncoder (VAE) and identifiable Variational AutoEncoder (iVAE) for lea… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  28. arXiv:2407.10976  [pdf, other

    cs.NI cs.LG eess.SP stat.AP

    Learning Cellular Network Connection Quality with Conformal

    Authors: Hanyang Jiang, Elizabeth Belding, Ellen Zegure, Yao Xie

    Abstract: In this paper, we address the problem of uncertainty quantification for cellular network speed. It is a well-known fact that the actual internet speed experienced by a mobile phone can fluctuate significantly, even when remaining in a single location. This high degree of variability underscores that mere point estimation of network speed is insufficient. Rather, it is advantageous to establish a p… ▽ More

    Submitted 4 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.05641

  29. arXiv:2407.09964  [pdf, other

    math.ST stat.ML

    TrIM: Transformed Iterative Mondrian Forests for Gradient-based Dimension Reduction and High-Dimensional Regression

    Authors: Ricardo Baptista, Eliza O'Reilly, Yangxinyu Xie

    Abstract: We propose a computationally efficient algorithm for gradient-based linear dimension reduction and high-dimensional regression. The algorithm initially computes a Mondrian forest and uses this estimator to identify a relevant feature subspace of the inputs from an estimate of the expected gradient outer product (EGOP) of the regression function. In addition, we introduce an iterative approach know… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 39 pages, 10 figures

  30. arXiv:2406.16136  [pdf, other

    stat.ME

    Distribution-Free Online Change Detection for Low-Rank Images

    Authors: Tingnan Gong, Seong-Hee Kim, Yao Xie

    Abstract: We present a distribution-free CUSUM procedure designed for online change detection in a time series of low-rank images, particularly when the change causes a mean shift. We represent images as matrix data and allow for temporal dependence, in addition to inherent spatial dependence, before and after the change. The marginal distributions are assumed to be general, not limited to any specific para… ▽ More

    Submitted 27 February, 2025; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: 30 pages, 7 figures

  31. arXiv:2406.06894  [pdf, other

    cs.LG stat.ML

    Nonlinear time-series embedding by monotone variational inequality

    Authors: Jonathan Y. Zhou, Yao Xie

    Abstract: In the wild, we often encounter collections of sequential data such as electrocardiograms, motion capture, genomes, and natural language, and sequences may be multichannel or symbolic with nonlinear dynamics. We introduce a new method to learn low-dimensional representations of nonlinear time series without supervision and can have provable recovery guarantees. The learned representation can be us… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  32. arXiv:2406.04859  [pdf, other

    physics.geo-ph cs.LG stat.CO

    Stochastic full waveform inversion with deep generative prior for uncertainty quantification

    Authors: Yuke Xie, Hervé Chauris, Nicolas Desassis

    Abstract: To obtain high-resolution images of subsurface structures from seismic data, seismic imaging techniques such as Full Waveform Inversion (FWI) serve as crucial tools. However, FWI involves solving a nonlinear and often non-unique inverse problem, presenting challenges such as local minima trapping and inadequate handling of inherent uncertainties. In addressing these challenges, we propose leveragi… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  33. arXiv:2405.16828  [pdf, ps, other

    cs.LG math.ST stat.ML

    Kernel-based Optimally Weighted Conformal Prediction Intervals

    Authors: Jonghyeok Lee, Chen Xu, Yao Xie

    Abstract: In this work, we present a novel conformal prediction method for time-series, which we call Kernel-based Optimally Weighted Conformal Prediction Intervals (KOWCPI). Specifically, KOWCPI adapts the classic Reweighted Nadaraya-Watson (RNW) estimator for quantile regression on dependent data and learns optimal data-adaptive weights. Theoretically, we tackle the challenge of establishing a conditional… ▽ More

    Submitted 31 May, 2025; v1 submitted 27 May, 2024; originally announced May 2024.

  34. arXiv:2405.15441  [pdf, ps, other

    stat.ML cs.CC cs.LG

    Statistical and Computational Guarantees of Kernel Max-Sliced Wasserstein Distances

    Authors: Jie Wang, March Boedihardjo, Yao Xie

    Abstract: Optimal transport has been very successful for various machine learning tasks; however, it is known to suffer from the curse of dimensionality. Hence, dimensionality reduction is desirable when applied to high-dimensional data with low-dimensional structures. The kernel max-sliced (KMS) Wasserstein distance is developed for this purpose by finding an optimal nonlinear mapping that reduces data int… ▽ More

    Submitted 18 July, 2025; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML-2025

  35. arXiv:2404.18838  [pdf, other

    math.NA stat.CO

    Accurate adaptive deep learning method for solving elliptic problems

    Authors: Jingyong Ying, Yaqi Xie, Jiao Li, Hongqiao Wang

    Abstract: Deep learning method is of great importance in solving partial differential equations. In this paper, inspired by the failure-informed idea proposed by Gao et.al. (SIAM Journal on Scientific Computing 45(4)(2023)) and as an improvement, a new accurate adaptive deep learning method is proposed for solving elliptic problems, including the interface problems and the convection-dominated problems. Bas… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  36. arXiv:2404.11509  [pdf, other

    stat.ML cs.LG

    VC Theory for Inventory Policies

    Authors: Yaqi Xie, Will Ma, Linwei Xin

    Abstract: Advances in computational power and AI have increased interest in reinforcement learning approaches to inventory management. This paper provides a theoretical foundation for these approaches and investigates the benefits of restricting to policy structures that are well-established by inventory theory. In particular, we prove generalization guarantees for learning several well-known classes of inv… ▽ More

    Submitted 7 July, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  37. arXiv:2404.03329  [pdf

    cs.LG eess.SP stat.ML

    COMPILED: Deep Metric Learning for Defect Classification of Threaded Pipe Connections using Multichannel Partially Observed Functional Data

    Authors: Juan Du, Yukun Xie, Chen Zhang

    Abstract: In modern manufacturing, most products are conforming. Few products are nonconforming with different defect types. The identification of defect types can help further root cause diagnosis of production lines. With the sensing technology development, process variables evolved as time changes, which can be collected in high resolution as multichannel functional data. These functional data have rich… ▽ More

    Submitted 8 December, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Submitted version to IISE Transactions

  38. arXiv:2403.14822  [pdf, other

    stat.ML cs.LG math.OC

    Non-Convex Robust Hypothesis Testing using Sinkhorn Uncertainty Sets

    Authors: Jie Wang, Rui Gao, Yao Xie

    Abstract: We present a new framework to address the non-convex robust hypothesis testing problem, wherein the goal is to seek the optimal detector that minimizes the maximum of worst-case type-I and type-II risk functions. The distributional uncertainty sets are constructed to center around the empirical distribution derived from samples based on Sinkhorn discrepancy. Given that the objective involves non-c… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 26 pages, 2 figures

  39. arXiv:2403.09042  [pdf, other

    stat.ME

    Recurrent Events Modeling Based on a Reflected Brownian Motion with Application to Hypoglycemia

    Authors: Yingfa Xie, Haoda Fu, Yuan Huang, Vladimir Pozdnyakov, Jun Yan

    Abstract: Patients with type 2 diabetes need to closely monitor blood sugar levels as their routine diabetes self-management. Although many treatment agents aim to tightly control blood sugar, hypoglycemia often stands as an adverse event. In practice, patients can observe hypoglycemic events more easily than hyperglycemic events due to the perception of neurogenic symptoms. We propose to model each patient… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  40. arXiv:2403.03850  [pdf, other

    stat.ML cs.LG

    Conformal prediction for multi-dimensional time series by ellipsoidal sets

    Authors: Chen Xu, Hanyang Jiang, Yao Xie

    Abstract: Conformal prediction (CP) has been a popular method for uncertainty quantification because it is distribution-free, model-agnostic, and theoretically sound. For forecasting problems in supervised learning, most CP methods focus on building prediction intervals for univariate responses. In this work, we develop a sequential CP method called $\texttt{MultiDimSPCI}$ that builds prediction… ▽ More

    Submitted 23 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by the Forty-first International Conference on Machine Learning (ICML 2024)

  41. arXiv:2401.15262  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Asymptotic Behavior of Adversarial Training Estimator under $\ell_\infty$-Perturbation

    Authors: Yiling Xie, Xiaoming Huo

    Abstract: Adversarial training has been proposed to protect machine learning models against adversarial attacks. This paper focuses on adversarial training under $\ell_\infty$-perturbation, which has recently attracted much research attention. The asymptotic behavior of the adversarial training estimator is investigated in the generalized linear model. The results imply that the asymptotic distribution of t… ▽ More

    Submitted 2 March, 2025; v1 submitted 26 January, 2024; originally announced January 2024.

  42. arXiv:2312.08324  [pdf, other

    stat.AP

    Bayesian Nonparametric Clustering with Feature Selection for Spatially Resolved Transcriptomics Data

    Authors: Bencong Zhu, Guanyu Hu, Yang Xie, Lin Xu, Xiaodan Fan, Qiwei Li

    Abstract: The advent of next-generation sequencing-based spatially resolved transcriptomics (SRT) techniques has reshaped genomic studies by enabling high-throughput gene expression profiling while preserving spatial and morphological context. Nevertheless, there are inherent challenges associated with these new high-dimensional spatial data, such as zero-inflation, over-dispersion, and heterogeneity. These… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  43. arXiv:2312.05404  [pdf, other

    cs.LG cs.AI stat.ME

    Disentangled Latent Representation Learning for Tackling the Confounding M-Bias Problem in Causal Inference

    Authors: Debo Cheng, Yang Xie, Ziqi Xu, Jiuyong Li, Lin Liu, Jixue Liu, Yinghao Zhang, Zaiwen Feng

    Abstract: In causal inference, it is a fundamental task to estimate the causal effect from observational data. However, latent confounders pose major challenges in causal inference in observational data, for example, confounding bias and M-bias. Recent data-driven causal effect estimators tackle the confounding bias problem via balanced representation learning, but assume no M-bias in the system, thus they… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 10 pages, 3 figures and 5 tables. Accepted by ICDM2023

  44. arXiv:2312.02959  [pdf, other

    stat.ML cs.CY cs.LG stat.AP

    Detecting algorithmic bias in medical-AI models using trees

    Authors: Jeffrey Smith, Andre Holder, Rishikesan Kamaleswaran, Yao Xie

    Abstract: With the growing prevalence of machine learning and artificial intelligence-based medical decision support systems, it is equally important to ensure that these systems provide patient outcomes in a fair and equitable fashion. This paper presents an innovative framework for detecting areas of algorithmic bias in medical-AI decision support systems. Our approach efficiently identifies potential bia… ▽ More

    Submitted 29 October, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 26 pages, 9 figures

  45. arXiv:2311.05641  [pdf, other

    stat.AP cs.LG

    Mobile Internet Quality Estimation using Self-Tuning Kernel Regression

    Authors: Hanyang Jiang, Henry Shaowu Yuchi, Elizabeth Belding, Ellen Zegura, Yao Xie

    Abstract: Modeling and estimation for spatial data are ubiquitous in real life, frequently appearing in weather forecasting, pollution detection, and agriculture. Spatial data analysis often involves processing datasets of enormous scale. In this work, we focus on large-scale internet-quality open datasets from Ookla. We look into estimating mobile (cellular) internet quality at the scale of a state in the… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  46. arXiv:2310.19787  [pdf

    stat.ME stat.AP stat.ML

    $e^{\text{RPCA}}$: Robust Principal Component Analysis for Exponential Family Distributions

    Authors: Xiaojun Zheng, Simon Mak, Liyan Xie, Yao Xie

    Abstract: Robust Principal Component Analysis (RPCA) is a widely used method for recovering low-rank structure from data matrices corrupted by significant and sparse outliers. These corruptions may arise from occlusions, malicious tampering, or other causes for anomalies, and the joint identification of such corruptions with low-rank background is critical for process monitoring and diagnosis. However, exis… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  47. arXiv:2310.19253  [pdf, other

    cs.LG stat.ME stat.ML

    Flow-based Distributionally Robust Optimization

    Authors: Chen Xu, Jonghyeok Lee, Xiuyuan Cheng, Yao Xie

    Abstract: We present a computationally efficient framework, called $\texttt{FlowDRO}$, for solving flow-based distributionally robust optimization (DRO) problems with Wasserstein uncertainty sets while aiming to find continuous worst-case distribution (also called the Least Favorable Distribution, LFD) and sample from it. The requirement for LFD to be continuous is so that the algorithm can be scalable to p… ▽ More

    Submitted 24 February, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: IEEE Journal on Selected Areas in Information Theory (JSAIT). Accepted. 2024

  48. arXiv:2310.17582  [pdf, other

    stat.ML cs.LG math.OC math.ST

    Convergence of flow-based generative models via proximal gradient descent in Wasserstein space

    Authors: Xiuyuan Cheng, Jianfeng Lu, Yixin Tan, Yao Xie

    Abstract: Flow-based generative models enjoy certain advantages in computing the data generation and the likelihood, and have recently shown competitive empirical performance. Compared to the accumulating theoretical studies on related score-based diffusion models, analysis of flow-based models, which are deterministic in both forward (data-to-noise) and reverse (noise-to-data) directions, remain sparse. In… ▽ More

    Submitted 3 July, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  49. arXiv:2310.03258  [pdf, other

    cs.LG stat.ME

    Assessing Electricity Service Unfairness with Transfer Counterfactual Learning

    Authors: Song Wei, Xiangrui Kong, Alinson Santos Xavier, Shixiang Zhu, Yao Xie, Feng Qiu

    Abstract: Energy justice is a growing area of interest in interdisciplinary energy research. However, identifying systematic biases in the energy sector remains challenging due to confounding variables, intricate heterogeneity in counterfactual effects, and limited data availability. First, this paper demonstrates how one can evaluate counterfactual unfairness in a power system by analyzing the average caus… ▽ More

    Submitted 24 January, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: The preliminary version titled "Detecting Electricity Service Equity Issues with Transfer Counterfactual Learning on Large-Scale Outage Datasets" is presented at NeurIPS 2023 Workshops on Causal Representation Learning (CRL) and Algorithmic Fairness through the Lens of Time (AFT); See v1

  50. arXiv:2309.08911  [pdf, ps, other

    cs.LG stat.ML

    Efficient Methods for Non-stationary Online Learning

    Authors: Peng Zhao, Yan-Feng Xie, Lijun Zhang, Zhi-Hua Zhou

    Abstract: Non-stationary online learning has drawn much attention in recent years. In particular, dynamic regret and adaptive regret are proposed as two principled performance measures for online convex optimization in non-stationary environments. To optimize them, a two-layer online ensemble is usually deployed due to the inherent uncertainty of non-stationarity, in which multiple base-learners are maintai… ▽ More

    Submitted 8 September, 2025; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: V3 changes: accepted by JMLR 2025 and improve the writing; V2/V1 changes: investigate interval dynamic regret and add two applications (online non-stochastic control and online PCA) and improve the presentation; preliminary version published at NeurIPS'22

    Journal ref: Journal of Machine Learning Research, 2025