Skip to main content

Showing 1–50 of 665 results for author: Wang, X

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.01932  [pdf, ps, other

    math.OC cs.LG math.NA stat.ML

    A first-order method for nonconvex-nonconcave minimax problems under a local Kurdyka-Łojasiewicz condition

    Authors: Zhaosong Lu, Xiangyuan Wang

    Abstract: We study a class of nonconvex-nonconcave minimax problems in which the inner maximization problem satisfies a local Kurdyka-Łojasiewicz (KL) condition that may vary with the outer minimization variable. In contrast to the global KL or Polyak-Łojasiewicz (PL) conditions commonly assumed in the literature -- which are significantly stronger and often too restrictive in practice -- this local KL cond… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 26 pages

    MSC Class: 90C26; 90C30; 90C47; 90C99; 65K05

  2. arXiv:2506.21894  [pdf, ps, other

    stat.ML cs.LG

    Thompson Sampling in Function Spaces via Neural Operators

    Authors: Rafael Oliveira, Xuesong Wang, Kian Ming A. Chai, Edwin V. Bonilla

    Abstract: We propose an extension of Thompson sampling to optimization problems over function spaces where the objective is a known functional of an unknown operator's output. We assume that functional evaluations are inexpensive, while queries to the operator (such as running a high-fidelity simulator) are costly. Our algorithm employs a sample-then-optimize approach using neural operator surrogates. This… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: Under review

  3. arXiv:2506.20596  [pdf, ps, other

    stat.ME stat.CO

    Inference for Error-Prone Count Data: Estimation under a Binomial Convolution Framework

    Authors: Yuqiu Yang, Christina Vu, Cornelis J. Potgieter, Xinlei Wang, Akihito Kamata

    Abstract: Measurement error in count data is common but underexplored in the literature, particularly in contexts where observed scores are bounded and arise from discrete scoring processes. Motivated by applications in oral reading fluency assessment, we propose a binomial convolution framework that extends binary misclassification models to settings where only the aggregate number of correct responses is… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 40 pages, 6 figures, 3 tables

  4. arXiv:2506.12352  [pdf, ps, other

    cs.AI cs.LG stat.ML

    Efficient Network Automatic Relevance Determination

    Authors: Hongwei Zhang, Ziqi Ye, Xinyuan Wang, Xin Guo, Zenglin Xu, Yuan Cheng, Zixin Hu, Yuan Qi

    Abstract: We propose Network Automatic Relevance Determination (NARD), an extension of ARD for linearly probabilistic models, to simultaneously model sparse relationships between inputs $X \in \mathbb R^{d \times N}$ and outputs $Y \in \mathbb R^{m \times N}$, while capturing the correlation structure among the $Y$. NARD employs a matrix normal prior which contains a sparsity-inducing parameter to identify… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  5. arXiv:2506.11232  [pdf, ps, other

    stat.ME

    Regularized Estimation of the Loading Matrix in Factor Models for High-Dimensional Time Series

    Authors: Xialu Liu, Xin Wang

    Abstract: High-dimensional data analysis using traditional models suffers from overparameterization. Two types of techniques are commonly used to reduce the number of parameters - regularization and dimension reduction. In this project, we combine them by imposing a sparse factor structure and propose a regularized estimator to further reduce the number of parameters in factor models. A challenge limiting t… ▽ More

    Submitted 7 July, 2025; v1 submitted 12 June, 2025; originally announced June 2025.

  6. arXiv:2506.07816  [pdf, ps, other

    stat.ML cs.LG math.PR

    Accelerating Constrained Sampling: A Large Deviations Approach

    Authors: Yingli Wang, Changwei Tu, Xiaoyu Wang, Lingjiong Zhu

    Abstract: The problem of sampling a target probability distribution on a constrained domain arises in many applications including machine learning. For constrained sampling, various Langevin algorithms such as projected Langevin Monte Carlo (PLMC) based on the discretization of reflected Langevin dynamics (RLD) and more generally skew-reflected non-reversible Langevin Monte Carlo (SRNLMC) based on the discr… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 40 pages, 7 figures

  7. arXiv:2506.01212  [pdf, ps, other

    cs.LG stat.ML

    Dynamic Modes as Time Representation for Spatiotemporal Forecasting

    Authors: Menglin Kong, Vincent Zhihao Zheng, Xudong Wang, Lijun Sun

    Abstract: This paper introduces a data-driven time embedding method for modeling long-range seasonal dependencies in spatiotemporal forecasting tasks. The proposed approach employs Dynamic Mode Decomposition (DMD) to extract temporal modes directly from observed data, eliminating the need for explicit timestamps or hand-crafted time features. These temporal modes serve as time representations that can be se… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  8. arXiv:2506.01162  [pdf, ps, other

    cs.DS cs.CR cs.LG stat.ML

    Nearly-Linear Time Private Hypothesis Selection with the Optimal Approximation Factor

    Authors: Maryam Aliakbarpour, Zhan Shi, Ria Stevens, Vincent X. Wang

    Abstract: Estimating the density of a distribution from its samples is a fundamental problem in statistics. Hypothesis selection addresses the setting where, in addition to a sample set, we are given $n$ candidate distributions -- referred to as hypotheses -- and the goal is to determine which one best describes the underlying data distribution. This problem is known to be solvable very efficiently, requiri… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: 33 pages

  9. arXiv:2506.00933  [pdf, ps, other

    stat.ML cs.LG

    Reconstruction and Prediction of Volterra Integral Equations Driven by Gaussian Noise

    Authors: Zhihao Xu, Saisai Ding, Zhikun Zhang, Xiangjun Wang

    Abstract: Integral equations are widely used in fields such as applied modeling, medical imaging, and system identification, providing a powerful framework for solving deterministic problems. While parameter identification for differential equations has been extensively studied, the focus on integral equations, particularly stochastic Volterra integral equations, remains limited. This research addresses the… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  10. arXiv:2506.00495  [pdf, other

    cs.LG cs.CL stat.ML

    FLoE: Fisher-Based Layer Selection for Efficient Sparse Adaptation of Low-Rank Experts

    Authors: Xinyi Wang, Lirong Gao, Haobo Wang, Yiming Zhang, Junbo Zhao

    Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods have emerged as a widely adopted strategy for adapting pre-trained Large Language Models (LLMs) to downstream tasks, significantly reducing memory and computational costs. However, most existing PEFT techniques uniformly deploy LoRA adapters across all layers, disregarding the intrinsic heterogeneity of layer contributions and task-specific rank requi… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: 17 pages, 9 figures

  11. arXiv:2506.00407  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Bias as a Virtue: Rethinking Generalization under Distribution Shifts

    Authors: Ruixuan Chen, Wentao Li, Jiahui Xiao, Yuchen Li, Yimin Tang, Xiaonan Wang

    Abstract: Machine learning models often degrade when deployed on data distributions different from their training data. Challenging conventional validation paradigms, we demonstrate that higher in-distribution (ID) bias can lead to better out-of-distribution (OOD) generalization. Our Adaptive Distribution Bridge (ADB) framework implements this insight by introducing controlled statistical diversity during t… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: 14 pages

  12. arXiv:2505.23968  [pdf, other

    cs.CR cs.AI cs.CY cs.LG stat.ML

    Confidential Guardian: Cryptographically Prohibiting the Abuse of Model Abstention

    Authors: Stephan Rabanser, Ali Shahin Shamsabadi, Olive Franzese, Xiao Wang, Adrian Weller, Nicolas Papernot

    Abstract: Cautious predictions -- where a machine learning model abstains when uncertain -- are crucial for limiting harmful errors in safety-critical applications. In this work, we identify a novel threat: a dishonest institution can exploit these mechanisms to discriminate or unjustly deny services under the guise of uncertainty. We demonstrate the practicality of this threat by introducing an uncertainty… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Proceedings of the 42nd International Conference on Machine Learning

  13. arXiv:2505.21285  [pdf, ps, other

    cs.LG stat.ML

    Learnable Kernel Density Estimation for Graphs

    Authors: Xudong Wang, Ziheng Sun, Chris Ding, Jicong Fan

    Abstract: This work proposes a framework LGKDE that learns kernel density estimation for graphs. The key challenge in graph density estimation lies in effectively capturing both structural patterns and semantic variations while maintaining theoretical guarantees. Combining graph kernels and kernel density estimation (KDE) is a standard approach to graph density estimation, but has unsatisfactory performance… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Under Review

    ACM Class: I.2; I.5.1; I.5.2

  14. arXiv:2505.19043  [pdf, ps, other

    cs.LG stat.ML

    Offline Clustering of Linear Bandits: Unlocking the Power of Clusters in Data-Limited Environments

    Authors: Jingyuan Liu, Zeyu Zhang, Xuchuang Wang, Xutong Liu, John C. S. Lui, Mohammad Hajiesmaili, Carlee Joe-Wong

    Abstract: Contextual linear multi-armed bandits are a learning framework for making a sequence of decisions, e.g., advertising recommendations for a sequence of arriving users. Recent works have shown that clustering these users based on the similarity of their learned preferences can significantly accelerate the learning. However, prior work has primarily focused on the online setting, which requires conti… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  15. arXiv:2505.17083  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Scale-invariant Attention

    Authors: Ben Anson, Xi Wang, Laurence Aitchison

    Abstract: One persistent challenge in LLM research is the development of attention mechanisms that are able to generalise from training on shorter contexts to inference on longer contexts. We propose two conditions that we expect all effective long context attention mechanisms to have: scale-invariant total attention, and scale-invariant attention sparsity. Under a Gaussian assumption, we show that a simple… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Preprint

  16. arXiv:2505.14421  [pdf, ps, other

    stat.ML cs.LG eess.SP eess.SY

    A system identification approach to clustering vector autoregressive time series

    Authors: Zuogong Yue, Xinyi Wang, Victor Solo

    Abstract: Clustering of time series based on their underlying dynamics is keeping attracting researchers due to its impacts on assisting complex system modelling. Most current time series clustering methods handle only scalar time series, treat them as white noise, or rely on domain knowledge for high-quality feature construction, where the autocorrelation pattern/feature is mostly ignored. Instead of relyi… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  17. arXiv:2505.12952  [pdf, ps, other

    cs.LG stat.ML

    LoD: Loss-difference OOD Detection by Intentionally Label-Noisifying Unlabeled Wild Data

    Authors: Chuanxing Geng, Qifei Li, Xinrui Wang, Dong Liang, Songcan Chen, Pong C. Yuen

    Abstract: Using unlabeled wild data containing both in-distribution (ID) and out-of-distribution (OOD) data to improve the safety and reliability of models has recently received increasing attention. Existing methods either design customized losses for labeled ID and unlabeled wild data then perform joint optimization, or first filter out OOD data from the latter then learn an OOD detector. While achieving… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI2025

  18. arXiv:2505.06804  [pdf, other

    cs.LG stat.ML

    Topology Guidance: Controlling the Outputs of Generative Models via Vector Field Topology

    Authors: Xiaohan Wang, Matthew Berger

    Abstract: For domains that involve numerical simulation, it can be computationally expensive to run an ensemble of simulations spanning a parameter space of interest to a user. To this end, an attractive surrogate for simulation is the generative modeling of fields produced by an ensemble, allowing one to synthesize fields in a computationally cheap, yet accurate, manner. However, for the purposes of visual… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

  19. arXiv:2505.06487  [pdf

    stat.AP math.OC

    Data Envelopment Analysis with Robust and Closest Targets:Integrating Full-Dimensional Efficient Facets for Risk-Resilient Benchmarking

    Authors: Xiuquan Huang, Xi Wang, Tao Zhang, Xiaocang Xu, Ali Emrouznejad

    Abstract: As the external environment become increasingly volatile and unpredictable, the selection of benchmarking targets in data envelopment analysis should account for their ability to consider risks; however, this aspect has not received sufficient attention. We propose a robust benchmarking target defined by the intersection of the maximum number of full-dimensional efficient facets, each representing… ▽ More

    Submitted 18 June, 2025; v1 submitted 9 May, 2025; originally announced May 2025.

  20. arXiv:2505.05364  [pdf

    stat.AP

    Machine learning bridging battery field data and laboratory data

    Authors: Yanbin Zhao, Hao Liu, Zhihua Deng, Tong Li, Haoyi Jiang, Zhenfei Ling, Xingkai Wang, Lei Zhang, Xiaoping Ouyang

    Abstract: Aiming at the dilemma that most laboratory data-driven diagnostic and prognostic methods cannot be applied to field batteries in passenger cars and energy storage systems, this paper proposes a method to bridge field data and laboratory data using machine learning. Only two field real impedances corresponding to a medium frequency and a high frequency are needed to predict laboratory real impedanc… ▽ More

    Submitted 13 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: 73 pages, 21 figures

  21. arXiv:2505.04773  [pdf, ps, other

    stat.AP stat.CO

    Estimating the Heritability of Longitudinal Rate-of-Change: Genetic Insights into PSA Velocity in Prostate Cancer-Free Individuals

    Authors: Pei Zhang, Xiaoyu Wang, Jianxin Shi, Paul S. Albert

    Abstract: Serum prostate-specific antigen (PSA) is widely used for prostate cancer screening. While the genetics of PSA levels has been studied to enhance screening accuracy, the genetic basis of PSA velocity, the rate of PSA change over time, remains unclear. The Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial, a large, randomized study with longitudinal PSA data (15,260 cancer-free m… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  22. Residual-based Alternative Partial Least Squares for Generalized Functional Linear Models

    Authors: Yue Wang, Xiao Wang, Joseph G. Ibrahim, Hongtu Zhu

    Abstract: Many biomedical studies collect high-dimensional medical imaging data to identify biomarkers for the detection, diagnosis, and treatment of human diseases. Consequently, it is crucial to develop accurate models that can predict a wide range of clinical outcomes (both discrete and continuous) based on imaging data. By treating imaging predictors as functional data, we propose a residual-based alter… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 40 pages, 1 figure; accepted in Statistica Sinica

  23. arXiv:2505.00308  [pdf

    cs.CV cs.AI stat.AP

    AI-Assisted Decision-Making for Clinical Assessment of Auto-Segmented Contour Quality

    Authors: Biling Wang, Austen Maniscalco, Ti Bai, Siqiu Wang, Michael Dohopolski, Mu-Han Lin, Chenyang Shen, Dan Nguyen, Junzhou Huang, Steve Jiang, Xinlei Wang

    Abstract: Purpose: This study presents a Deep Learning (DL)-based quality assessment (QA) approach for evaluating auto-generated contours (auto-contours) in radiotherapy, with emphasis on Online Adaptive Radiotherapy (OART). Leveraging Bayesian Ordinal Classification (BOC) and calibrated uncertainty thresholds, the method enables confident QA predictions without relying on ground truth contours or extensive… ▽ More

    Submitted 11 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  24. arXiv:2505.00217  [pdf, other

    stat.ME

    Robust Estimation and Inference in Hybrid Controlled Trials for Binary Outcomes: A Case Study on Non-Small Cell Lung Cancer

    Authors: Jiajun Liu, Ke Zhu, Shu Yang, Xiaofei Wang

    Abstract: Hybrid controlled trials (HCTs), which augment randomized controlled trials (RCTs) with external controls (ECs), are increasingly receiving attention as a way to address limited power, slow accrual, and ethical concerns in clinical research. However, borrowing from ECs raises critical statistical challenges in estimation and inference, especially for binary outcomes where hidden bias is harder to… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

  25. arXiv:2504.18835  [pdf

    stat.AP

    Machine learning accelerates fuel cell life testing

    Authors: Yanbin Zhao, Hao Liu, Zhihua Deng, Haoyi Jiang, Zhenfei Ling, Zhiyang Liu, Xingkai Wang, Tong Li, Xiaoping Ouyang

    Abstract: Accelerated life testing (ALT) can significantly reduce the economic, time, and labor costs of life testing in the process of equipment, device, and material research and development (R&D), and improve R&D efficiency. This paper proposes a performance characterization data prediction (PCDP) method and a life prediction-driven ALT (LP-ALT) method to accelerate the life test of polymer electrolyte m… ▽ More

    Submitted 7 May, 2025; v1 submitted 26 April, 2025; originally announced April 2025.

    Comments: 39 pages, 25 figures

  26. arXiv:2504.11320  [pdf, other

    cs.LG cs.AI cs.DC math.OC stat.ML

    Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints

    Authors: Ruicheng Ao, Gan Luo, David Simchi-Levi, Xinshang Wang

    Abstract: Large Language Models (LLMs) are indispensable in today's applications, but their inference procedure -- generating responses by processing text in segments and using a memory-heavy Key-Value (KV) cache -- demands significant computational resources, particularly under memory constraints. This paper formulates LLM inference optimization as a multi-stage online scheduling problem where sequential p… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 42 pages, 18 figures

  27. arXiv:2504.04622  [pdf, ps, other

    stat.ME

    Regularization and Selection in A Directed Network Model with Nodal Homophily and Nodal Effects

    Authors: Zhaoyu Xing, Y. X. Rachel Wang, Andrew T. A. Wood, Tao Zou

    Abstract: This article introduces a regularization and selection methods for directed networks with nodal homophily and nodal effects. The proposed approach not only preserves the statistical efficiency of the resulting estimator, but also ensures that the selection of nodal homophily and nodal effects is scalable with large-scale network data and multiple nodal features. In particular, we propose a directe… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: 26 pages

    MSC Class: 62J07; 62F12; 05C82

  28. arXiv:2503.18987  [pdf, other

    cs.LG cs.AI stat.ML

    Balanced Direction from Multifarious Choices: Arithmetic Meta-Learning for Domain Generalization

    Authors: Xiran Wang, Jian Zhang, Lei Qi, Yinghuan Shi

    Abstract: Domain generalization is proposed to address distribution shift, arising from statistical disparities between training source and unseen target domains. The widely used first-order meta-learning algorithms demonstrate strong performance for domain generalization by leveraging the gradient matching theory, which aims to establish balanced parameters across source domains to reduce overfitting to an… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

  29. arXiv:2503.15967  [pdf, other

    stat.ME

    Integrative Analysis of High-dimensional RCT and RWD Subject to Censoring and Hidden Confounding

    Authors: Xin Ye, Shu Yang, Xiaofei Wang, Yanyan Liu

    Abstract: In this study, we focus on estimating the heterogeneous treatment effect (HTE) for survival outcome. The outcome is subject to censoring and the number of covariates is high-dimensional. We utilize data from both the randomized controlled trial (RCT), considered as the gold standard, and real-world data (RWD), possibly affected by hidden confounding factors. To achieve a more efficient HTE estimat… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  30. arXiv:2503.15745  [pdf, ps, other

    stat.ME stat.AP

    Statistical Inference for Heterogeneous Treatment Effect with Right-censored Data from Synthesizing Randomized Clinical Trials and Real-world Data

    Authors: Guangcai Mao, Shu Yang, Xiaofei Wang

    Abstract: The heterogeneous treatment effect plays a crucial role in precision medicine. There is evidence that real-world data, even subject to biases, can be employed as supplementary evidence for randomized clinical trials to improve the statistical efficiency of the heterogeneous treatment effect estimation. In this paper, for survival data with right censoring, we consider estimating the heterogeneous… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  31. arXiv:2503.14512  [pdf

    q-bio.QM cs.LG stat.AP stat.ML

    Machine learning algorithms to predict stroke in China based on causal inference of time series analysis

    Authors: Qizhi Zheng, Ayang Zhao, Xinzhu Wang, Yanhong Bai, Zikun Wang, Xiuying Wang, Xianzhang Zeng, Guanghui Dong

    Abstract: Participants: This study employed a combination of Vector Autoregression (VAR) model and Graph Neural Networks (GNN) to systematically construct dynamic causal inference. Multiple classic classification algorithms were compared, including Random Forest, Logistic Regression, XGBoost, Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Gradient Boosting, and Multi Layer Perceptron (MLP). The SMO… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 17 pages

  32. arXiv:2503.07938  [pdf, other

    cs.LG cs.CV stat.ME

    CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement

    Authors: Chenrui Ma, Rongchang Zhao, Xi Xiao, Hongyang Xie, Tianyang Wang, Xiao Wang, Hao Zhang, Yanning Shen

    Abstract: While deep generative models have significantly advanced representation learning, they may inherit or amplify biases and fairness issues by encoding sensitive attributes alongside predictive features. Enforcing strict independence in disentanglement is often unrealistic when target and sensitive factors are naturally correlated. To address this challenge, we propose CAD-VAE (Correlation-Aware Dise… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  33. arXiv:2503.06199  [pdf, other

    stat.ME

    Bayesian Machine Learning for Estimating Optimal Dynamic Treatment Regimes with Ordinal Outcomes

    Authors: Xinru Wang, Tanujit Chakraborty, Bibhas Chakraborty

    Abstract: Dynamic treatment regimes (DTRs) are sequences of decision rules designed to tailor treatment based on patients' treatment history and evolving disease status. Ordinal outcomes frequently serve as primary endpoints in clinical trials and observational studies. However, constructing optimal DTRs for ordinal outcomes has been underexplored. This paper introduces a Bayesian machine learning (BML) fra… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

  34. arXiv:2503.02245  [pdf, other

    stat.AP

    Identification of Genetic Factors Associated with Corpus Callosum Morphology: Conditional Strong Independence Screening for Non-Euclidean Responses

    Authors: Zhe Gao, Jin Zhu, Yue Hu, Wenliang Pan, Xueqin Wang

    Abstract: The corpus callosum, the largest white matter structure in the brain, plays a critical role in interhemispheric communication. Variations in its morphology are associated with various neurological and psychological conditions, making it a key focus in neurogenetics. Age is known to influence the structure and morphology of the corpus callosum significantly, complicating the identification of speci… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  35. arXiv:2503.01728  [pdf, other

    cs.LG stat.ME

    DeepSuM: Deep Sufficient Modality Learning Framework

    Authors: Zhe Gao, Jian Huang, Ting Li, Xueqin Wang

    Abstract: Multimodal learning has become a pivotal approach in developing robust learning models with applications spanning multimedia, robotics, large language models, and healthcare. The efficiency of multimodal systems is a critical concern, given the varying costs and resource demands of different modalities. This underscores the necessity for effective modality selection to balance performance gains ag… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  36. arXiv:2502.16504  [pdf, other

    stat.ME

    Local Information for Global Network Estimation in Latent Space Models

    Authors: Lijia Wang, Xiao Han, Yanhui Wu, Y. X. Rachel Wang

    Abstract: In social networks, neighborhood is crucial for understanding individual behavior in response to environments, and thus it is essential to analyze an individual's local perspective within the global network. This paper studies how to utilize a partial information network centered around a given individual for global network estimation by fitting a general latent space model. Compared to the entire… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  37. arXiv:2502.16232  [pdf, other

    math.NA cs.LG stat.ML

    Flow-based Bayesian filtering for high-dimensional nonlinear stochastic dynamical systems

    Authors: Xintong Wang, Xiaofei Guan, Ling Guo, Hao Wu

    Abstract: Bayesian filtering for high-dimensional nonlinear stochastic dynamical systems is a fundamental yet challenging problem in many fields of science and engineering. Existing methods face significant obstacles: Gaussian-based filters struggle with non-Gaussian distributions, while sequential Monte Carlo methods are computationally intensive and prone to particle degeneracy in high dimensions. Althoug… ▽ More

    Submitted 5 March, 2025; v1 submitted 22 February, 2025; originally announced February 2025.

  38. arXiv:2502.09591  [pdf, ps, other

    cs.LG stat.ML

    Censor Dependent Variational Inference

    Authors: Chuanhui Liu, Xiao Wang

    Abstract: This paper provides a comprehensive analysis of variational inference in latent variable models for survival analysis, emphasizing the distinctive challenges associated with applying variational methods to survival data. We identify a critical weakness in the existing methodology, demonstrating how a poorly designed variational distribution may hinder the objective of survival analysis tasks - mod… ▽ More

    Submitted 3 June, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

  39. arXiv:2502.06397  [pdf, other

    stat.ME

    Factor Modelling for Biclustering Large-dimensional Matrix-valued Time Series

    Authors: Yong He, Xiaoyang Ma, Xingheng Wang, Yalin Wang

    Abstract: A novel unsupervised learning method is proposed in this paper for biclustering large-dimensional matrix-valued time series based on an entirely new latent two-way factor structure. Each block cluster is characterized by its own row and column cluster-specific factors in addition to some common matrix factors which impact on all the matrix time series. We first estimate the global loading spaces b… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  40. arXiv:2502.00470  [pdf, other

    math.OC cs.LG stat.ML

    Distributed Primal-Dual Algorithms: Unification, Connections, and Insights

    Authors: Runxiong Wu, Dong Liu, Xueqin Wang, Andi Wang

    Abstract: We study primal-dual algorithms for general empirical risk minimization problems in distributed settings, focusing on two prominent classes of algorithms. The first class is the communication-efficient distributed dual coordinate ascent (CoCoA), derived from the coordinate ascent method for solving the dual problem. The second class is the alternating direction method of multipliers (ADMM), includ… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: 15 pages, 4 figures, 1 table

  41. arXiv:2501.19239  [pdf, ps, other

    cs.LG stat.ML

    Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics

    Authors: Xingyu Wang, Mengfan Xu

    Abstract: We study decentralized multi-agent multi-armed bandits in fully heavy-tailed settings, where clients communicate over sparse random graphs with heavy-tailed degree distributions and observe heavy-tailed (homogeneous or heterogeneous) reward distributions with potentially infinite variance. The objective is to maximize system performance by pulling the globally optimal arm with the highest global r… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: 40 pages

  42. arXiv:2501.18060  [pdf, other

    stat.ME cs.LG stat.ML

    Noise-Adaptive Conformal Classification with Marginal Coverage

    Authors: Teresa Bortolotti, Y. X. Rachel Wang, Xin Tong, Alessandra Menafoglio, Simone Vantini, Matteo Sesia

    Abstract: Conformal inference provides a rigorous statistical framework for uncertainty quantification in machine learning, enabling well-calibrated prediction sets with precise coverage guarantees for any classification model. However, its reliance on the idealized assumption of perfect data exchangeability limits its effectiveness in the presence of real-world complications, such as low-quality labels --… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

  43. arXiv:2501.18049  [pdf, ps, other

    cs.LG math.OC stat.ML

    Joint Pricing and Resource Allocation: An Optimal Online-Learning Approach

    Authors: Jianyu Xu, Xuan Wang, Yu-Xiang Wang, Jiashuo Jiang

    Abstract: We study an online learning problem on dynamic pricing and resource allocation, where we make joint pricing and inventory decisions to maximize the overall net profit. We consider the stochastic dependence of demands on the price, which complicates the resource allocation process and introduces significant non-convexity and non-smoothness to the problem. To solve this problem, we develop an effici… ▽ More

    Submitted 21 May, 2025; v1 submitted 29 January, 2025; originally announced January 2025.

    MSC Class: 91B06; 90B22; 91B24; 90B50; 90B80; 62P20 ACM Class: I.2.6

  44. arXiv:2501.15555  [pdf, other

    cs.LG cs.AI cs.GR stat.ML

    Distributionally Robust Graph Out-of-Distribution Recommendation via Diffusion Model

    Authors: Chu Zhao, Enneng Yang, Yuliang Liang, Jianzhe Zhao, Guibing Guo, Xingwei Wang

    Abstract: The distributionally robust optimization (DRO)-based graph neural network methods improve recommendation systems' out-of-distribution (OOD) generalization by optimizing the model's worst-case performance. However, these studies fail to consider the impact of noisy samples in the training data, which results in diminished generalization capabilities and lower accuracy. Through experimental and theo… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

    Comments: 14 pages, Accepted by WWW'25

  45. arXiv:2501.15522  [pdf, other

    stat.ML cs.LG q-bio.QM

    Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths

    Authors: Yueyang Wang, Kejun Tang, Xili Wang, Xiaoliang Wan, Weiqing Ren, Chao Yang

    Abstract: The committor functions are central to investigating rare but important events in molecular simulations. It is known that computing the committor function suffers from the curse of dimensionality. Recently, using neural networks to estimate the committor function has gained attention due to its potential for high-dimensional problems. Training neural networks to approximate the committor function… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

  46. arXiv:2501.12212  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    Quantitative Error Bounds for Scaling Limits of Stochastic Iterative Algorithms

    Authors: Xiaoyu Wang, Mikolaj J. Kasprzak, Jeffrey Negrea, Solesne Bourguin, Jonathan H. Huggins

    Abstract: Stochastic iterative algorithms, including stochastic gradient descent (SGD) and stochastic gradient Langevin dynamics (SGLD), are widely utilized for optimization and sampling in large-scale and high-dimensional problems in machine learning, statistics, and engineering. Numerous works have bounded the parameter error in, and characterized the uncertainty of, these approximations. One common appro… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

    MSC Class: 60F17 (Primary) 60J60; 62-08; 68T05; 62E17 (Secondary)

  47. arXiv:2501.11743  [pdf, other

    cs.LG math.PR stat.CO

    Non-Reversible Langevin Algorithms for Constrained Sampling

    Authors: Hengrong Du, Qi Feng, Changwei Tu, Xiaoyu Wang, Lingjiong Zhu

    Abstract: We consider the constrained sampling problem where the goal is to sample from a target distribution on a constrained domain. We propose skew-reflected non-reversible Langevin dynamics (SRNLD), a continuous-time stochastic differential equation with skew-reflected boundary. We obtain non-asymptotic convergence rate of SRNLD to the target distribution in both total variation and 1-Wasserstein distan… ▽ More

    Submitted 14 April, 2025; v1 submitted 20 January, 2025; originally announced January 2025.

    Comments: 35 pages, 9 figures, typos corrected

  48. arXiv:2501.02846  [pdf, other

    stat.ME

    Bayesian analysis of nonlinear structured latent factor models using a Gaussian Process Prior

    Authors: Yimang Zhang, Xiaorui Wang, Jian Qing Shi

    Abstract: Factor analysis models are widely utilized in social and behavioral sciences, such as psychology, education, and marketing, to measure unobservable latent traits. In this article, we introduce a nonlinear structured latent factor analysis model which is more flexible to characterize the relationship between manifest variables and latent factors. The confirmatory identifiability of the latent facto… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

  49. arXiv:2412.20727  [pdf, other

    cs.LG stat.ML

    AverageTime: Enhance Long-Term Time Series Forecasting with Simple Averaging

    Authors: Gaoxiang Zhao, Li Zhou, Xiaoqiang Wang

    Abstract: Long-term time series forecasting focuses on leveraging historical data to predict future trends. The core challenge lies in effectively modeling dependencies both within sequences and channels. Convolutional Neural Networks and Linear models often excel in sequence modeling but frequently fall short in capturing complex channel dependencies. In contrast, Transformer-based models, with their atten… ▽ More

    Submitted 2 April, 2025; v1 submitted 30 December, 2024; originally announced December 2024.

  50. arXiv:2412.20471  [pdf, ps, other

    cs.GT cs.LG math.OC stat.ML

    On the Convergence of Min-Max Langevin Dynamics and Algorithm

    Authors: Yang Cai, Siddharth Mitra, Xiuyuan Wang, Andre Wibisono

    Abstract: We study zero-sum games in the space of probability distributions over the Euclidean space $\mathbb{R}^d$ with entropy regularization, in the setting when the interaction function between the players is smooth and strongly convex-strongly concave. We prove an exponential convergence guarantee for the mean-field min-max Langevin dynamics to compute the equilibrium distribution of the zero-sum game.… ▽ More

    Submitted 27 June, 2025; v1 submitted 29 December, 2024; originally announced December 2024.

    Comments: v3: Accepted for presentation at the Conference on Learning Theory (COLT) 2025. v2: Revised introduction and presentation of results