
Showing 1–50 of 141 results for author: Zhu, X

Searching in archive stat.
  1. arXiv:2504.14154  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    SConU: Selective Conformal Uncertainty in Large Language Models

    Authors: Zhiyuan Wang, Qingni Wang, Yue Zhang, Tianlong Chen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu

    Abstract: As large language models are increasingly utilized in real-world applications, guarantees of task-specific metrics are essential for their reliable deployment. Previous studies have introduced various criteria of conformal uncertainty grounded in split conformal prediction, which offer user-specified correctness coverage. However, existing frameworks often fail to identify uncertainty data outlier…

    Submitted 18 April, 2025; originally announced April 2025.

  2. arXiv:2503.11729  [pdf, other]

    stat.AP

    Analysis of Information Loss on Composition Measurement in Stiff Chemically Reacting Systems

    Authors: Yiming Lu, Xu Zhu, Long Zhang, Hua Zhou

    Abstract: Gas sampling methods have been crucial for the advancement of combustion science, enabling analysis of reaction kinetics and pollutant formation. However, the measured composition can deviate from the true one because of the potential residual reactions in the sampling probes. This study formulates the initial composition estimation in stiff chemically reacting systems as a Bayesian inference prob…

    Submitted 14 March, 2025; originally announced March 2025.

  3. arXiv:2503.02110  [pdf, other]

    stat.ML cs.LG

    Quantifying Overfitting along the Regularization Path for Two-Part-Code MDL in Supervised Classification

    Authors: Xiaohan Zhu, Nathan Srebro

    Abstract: We provide a complete characterization of the entire regularization curve of a modified two-part-code Minimum Description Length (MDL) learning rule for binary classification, based on an arbitrary prior or description language. Grunwald and Langford [2004] previously established the lack of asymptotic consistency, from an agnostic PAC (frequentist worst case) perspective, of the MDL rule with a p…

    Submitted 10 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  4. arXiv:2502.18611  [pdf, other]

    math.PR cs.LG stat.ML

    Tight Bounds on the Binomial CDF, and the Minimum of i.i.d Binomials, in terms of KL-Divergence

    Authors: Xiaohan Zhu, Mesrob I. Ohannessian, Nathan Srebro

    Abstract: We provide finite sample upper and lower bounds on the Binomial tail probability which are a direct application of Sanov's theorem. We then use these to obtain high probability upper and lower bounds on the minimum of i.i.d. Binomial random variables. Both bounds are finite sample, asymptotically tight, and expressed in terms of the KL-divergence.

    Submitted 25 February, 2025; originally announced February 2025.

  5. arXiv:2412.07635  [pdf, ps, other]

    stat.AP

    A novel Phase I clinical trial design with unequal cohort sizes

    Authors: Xiaojun Zhu

    Abstract: This paper introduces a new Phase I design aimed at enhancing the performance of existing methods, including algorithm-based, model-based, and model-assisted designs. The design, developed by integrating the concept of Fisher information, is easily operationalized. The new design addresses the issue of the classical designs' slow dosage escalation. Simulations demonstrate that the proposed design ma…

    Submitted 10 December, 2024; originally announced December 2024.

  6. Penalized Sparse Covariance Regression with High Dimensional Covariates

    Authors: Yuan Gao, Zhiyuan Zhang, Zhanrui Cai, Xuening Zhu, Tao Zou, Hansheng Wang

    Abstract: Covariance regression offers an effective way to model the large covariance matrix with the auxiliary similarity matrices. In this work, we propose a sparse covariance regression (SCR) approach to handle the potentially high-dimensional predictors (i.e., similarity matrices). Specifically, we use the penalization method to identify the informative predictors and estimate their associated coefficie…

    Submitted 5 October, 2024; originally announced October 2024.

    MSC Class: 62J99; 62P20

  7. arXiv:2410.00068  [pdf]

    eess.IV cs.LG stat.AP

    Denoising VAE as an Explainable Feature Reduction and Diagnostic Pipeline for Autism Based on Resting state fMRI

    Authors: Xinyuan Zheng, Orren Ravid, Robert A. J. Barry, Yoojean Kim, Qian Wang, Young-geun Kim, Xi Zhu, Xiaofu He

    Abstract: Autism spectrum disorders (ASDs) are developmental conditions characterized by restricted interests and difficulties in communication. The complexity of ASD has resulted in a deficiency of objective diagnostic biomarkers. Deep learning methods have gained recognition for addressing these challenges in neuroimaging analysis, but finding and interpreting such diagnostic biomarkers are still challeng…

    Submitted 27 March, 2025; v1 submitted 30 September, 2024; originally announced October 2024.

    ACM Class: J.3; I.4.9; I.4.10

  8. arXiv:2409.15307  [pdf, other]

    stat.CO physics.comp-ph

    An adaptive Gaussian process method for multi-modal Bayesian inverse problems

    Authors: Zhihang Xu, Xiaoyu Zhu, Daoji Li, Qifeng Liao

    Abstract: Inverse problems are prevalent in both scientific research and engineering applications. In the context of Bayesian inverse problems, sampling from the posterior distribution is particularly challenging when the forward models are computationally expensive. This challenge escalates further when the posterior distribution is multimodal. To address this, we propose a Gaussian process (GP) based meth…

    Submitted 4 September, 2024; originally announced September 2024.

  9. arXiv:2408.12353  [pdf, other]

    stat.ML cs.LG math.ST

    Distributed quasi-Newton robust estimation under differential privacy

    Authors: Chuhan Wang, Lixing Zhu, Xuehu Zhu

    Abstract: For distributed computing with Byzantine machines under Privacy Protection (PP) constraints, this paper develops a robust PP distributed quasi-Newton estimation, which only requires the node machines to transmit five vectors to the central processor with high asymptotic relative efficiency. Compared with the gradient descent strategy which requires more rounds of transmission and the Newton iterat…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 38 pages, 6 figures

  10. arXiv:2407.15388  [pdf, ps, other]

    stat.AP q-fin.RM

    A new paradigm of mortality modeling via individual vitality dynamics

    Authors: Xiaobai Zhu, Kenneth Q. Zhou, Zijia Wang

    Abstract: The significance of mortality modeling extends across multiple research areas, ranging from life insurance valuation to optimal lifetime decision-making. Existing approaches, such as mortality laws and factor-based models, often fall short in capturing the complexity of individual mortality, hindering their ability to address specific research needs. To overcome these limitations, this paper intro…

    Submitted 21 October, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: 45 pages

  11. arXiv:2406.03296  [pdf, other]

    stat.ME

    Multi-relational Network Autoregression Model with Latent Group Structures

    Authors: Yimeng Ren, Xuening Zhu, Ganggang Xu, Yanyuan Ma

    Abstract: Multi-relational networks among entities are frequently observed in the era of big data. Quantifying the effects of multiple networks has attracted significant research interest recently. In this work, we model multiple network effects through an autoregressive framework for tensor-valued time series. To characterize the potential heterogeneity of the networks and handle the high dimensionality o…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2212.02107

  12. arXiv:2405.17744  [pdf, other]

    stat.ME

    Factor Augmented Matrix Regression

    Authors: Elynn Chen, Jianqing Fan, Xiaonan Zhu

    Abstract: We introduce Factor-Augmented Matrix Regression (FAMAR) to address the growing applications of matrix-variate data and their associated challenges, particularly with high-dimensionality and covariate correlations. FAMAR encompasses two key algorithms. The first is a novel non-iterative approach that efficiently estimates the factors and loadings of t…

    Submitted 27 May, 2024; originally announced May 2024.

  13. arXiv:2405.07408  [pdf, other]

    stat.ME stat.AP

    Bayesian Spatially Clustered Compositional Regression: Linking intersectoral GDP contributions to Gini Coefficients

    Authors: Jingcheng Meng, Yimeng Ren, Xuening Zhu, Guanyu Hu

    Abstract: The Gini coefficient is a universally used measure of income inequality. Intersectoral GDP contributions reveal the economic development of different sectors of the national economy. Linking intersectoral GDP contributions to Gini coefficients will provide a better understanding of how the Gini coefficient is influenced by different industries. In this paper, a compositional regression with sp…

    Submitted 12 May, 2024; originally announced May 2024.

  14. arXiv:2404.18732  [pdf, other]

    stat.ME

    Two-way Homogeneity Pursuit for Quantile Network Vector Autoregression

    Authors: Wenyang Liu, Ganggang Xu, Jianqing Fan, Xuening Zhu

    Abstract: While the Vector Autoregression (VAR) model has received extensive attention for modelling complex time series, quantile VAR analysis remains relatively underexplored for high-dimensional time series data. To address this disparity, we introduce a two-way grouped network quantile (TGNQ) autoregression model for time series collected on large-scale networks, known for their significant heterogeneou…

    Submitted 29 April, 2024; originally announced April 2024.

  15. arXiv:2403.11163  [pdf, ps, other]

    stat.ME cs.LG math.ST stat.CO

    A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques

    Authors: Xuetong Li, Yuan Gao, Hong Chang, Danyang Huang, Yingying Ma, Rui Pan, Haobo Qi, Feifei Wang, Shuyuan Wu, Ke Xu, Jing Zhou, Xuening Zhu, Yingqiu Zhu, Hansheng Wang

    Abstract: This paper presents a selective review of statistical computation methods for massive data analysis. A large number of statistical methods for massive data computation have been developed rapidly in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first clas…

    Submitted 17 March, 2024; originally announced March 2024.

  16. arXiv:2403.07124  [pdf, other]

    stat.ME cs.SI

    Stochastic gradient descent-based inference for dynamic network models with attractors

    Authors: Hancong Pan, Xiaojing Zhu, Cantay Caliskan, Dino P. Christenson, Konstantinos Spiliopoulos, Dylan Walker, Eric D. Kolaczyk

    Abstract: In Coevolving Latent Space Networks with Attractors (CLSNA) models, nodes in a latent space represent social actors, and edges indicate their dynamic interactions. Attractors are added at the latent level to capture the notion of attractive and repulsive forces between nodes, borrowing from dynamical systems theory. However, CLSNA's reliance on MCMC estimation makes scaling difficult, and the requir…

    Submitted 15 December, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  17. arXiv:2403.03444  [pdf, other]

    cs.LG cs.AI math.NA stat.ML

    Uncertainty quantification for DeepONets with ensemble Kalman inversion

    Authors: Andrew Pensoneault, Xueyu Zhu

    Abstract: In recent years, operator learning, particularly the DeepONet, has received much attention for efficiently learning complex mappings between input and output functions across diverse fields. However, in practical scenarios with limited and noisy data, assessing the uncertainty in DeepONet predictions becomes essential, especially in mission-critical or safety-critical applications. Existing method…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 25 pages

    MSC Class: 65

  18. Estimation of the genetic Gaussian network using GWAS summary data

    Authors: Yihe Yang, Noah Lorincz-Comi, Xiaofeng Zhu

    Abstract: Genetic Gaussian network of multiple phenotypes constructed through the genetic correlation matrix is informative for understanding their biological dependencies. However, its interpretation may be challenging because the estimated genetic correlations are biased due to estimation errors and horizontal pleiotropy inherent in GWAS summary statistics. Here we introduce a novel approach called Estima…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 29 pages, 7 figures, 1 table

    MSC Class: 62H22; 62P10 ACM Class: A.0

    Journal ref: Biometrics, Volume 80, Issue 4, December 2024, ujae148

  19. arXiv:2311.08874  [pdf, other]

    cs.LG stat.AP stat.ML

    Human-in-the-loop: Towards Label Embeddings for Measuring Classification Difficulty

    Authors: Katharina Hechinger, Christoph Koller, Xiao Xiang Zhu, Göran Kauermann

    Abstract: Uncertainty in machine learning models is a timely and vast field of research. In supervised learning, uncertainty can already occur in the first stage of the training process, the annotation phase. This scenario is particularly evident when some instances cannot be definitively classified. In other words, there is inevitable ambiguity in the annotation step and hence, not necessarily a "ground tr…

    Submitted 27 May, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  20. arXiv:2310.18572  [pdf, ps, other]

    stat.AP

    Where to serve and return in Badminton Men's Double?

    Authors: Xuelin Zhu, Yu Sun, Yumin Zeng, Cong Xu

    Abstract: This study aims to analyze the service and return landing areas in badminton men's double, based on data extracted from 20 badminton matches. We find that most services land near the center-line, while returns tend to land in the crossing areas of the serving team's court. Using generalized logit models, we are able to predict the return landing area based on features of the service and return rou…

    Submitted 27 October, 2023; originally announced October 2023.

  21. Categorising the World into Local Climate Zones -- Towards Quantifying Labelling Uncertainty for Machine Learning Models

    Authors: Katharina Hechinger, Xiao Xiang Zhu, Göran Kauermann

    Abstract: Image classification is often prone to labelling uncertainty. To generate suitable training data, images are labelled according to evaluations of human experts. This can result in ambiguities, which will affect subsequent models. In this work, we aim to model the labelling uncertainty in the context of remote sensing and the classification of satellite images. We construct a multinomial mixture mo…

    Submitted 4 September, 2023; originally announced September 2023.

    Journal ref: Journal of the Royal Statistical Society Series C: Applied Statistics 73 (2024) 143-161

  22. arXiv:2308.01178  [pdf, other]

    stat.ME

    Model Selection for Exposure-Mediator Interaction

    Authors: Ruiyang Li, Xi Zhu, Seonjoo Lee

    Abstract: In mediation analysis, the exposure often influences the mediating effect, i.e., there is an interaction between exposure and mediator on the dependent variable. When the mediator is high-dimensional, it is necessary to identify non-zero mediators (M) and exposure-by-mediator (X-by-M) interactions. Although several high-dimensional mediation methods can naturally handle X-by-M interactions, resear…

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 15 pages, 3 figures

  23. arXiv:2307.07827  [pdf, other]

    stat.ME

    Corrected kernel principal component analysis for model structural change detection

    Authors: Luoyao Yu, Lixing Zhu, Ruoqing Zhu, Xuehu Zhu

    Abstract: This paper develops a method to detect model structural changes by applying a Corrected Kernel Principal Component Analysis (CKPCA) to construct the so-called central distribution deviation subspaces. This approach can efficiently identify the mean and distribution changes in these dimension reduction subspaces. We derive that the locations and number changes in the dimension reduction data subspa…

    Submitted 15 July, 2023; originally announced July 2023.

  24. arXiv:2307.07574  [pdf, other]

    stat.ME econ.EM stat.ML

    Sparsified Simultaneous Confidence Intervals for High-Dimensional Linear Models

    Authors: Xiaorui Zhu, Yichen Qin, Peng Wang

    Abstract: Statistical inference of the high-dimensional regression coefficients is challenging because the uncertainty introduced by the model selection procedure is hard to account for. A critical question remains unsettled; that is, is it possible and how to embed the inference of the model into the simultaneous inference of the coefficients? To this end, we propose a notion of simultaneous confidence int…

    Submitted 2 January, 2025; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 26 pages, 6 figures

    MSC Class: 62fxx

    Journal ref: Metrika, 2024

  25. Biomass Estimation and Uncertainty Quantification from Tree Height

    Authors: Qian Song, Conrad M Albrecht, Zhitong Xiong, Xiao Xiang Zhu

    Abstract: We propose a tree-level biomass estimation model approximating allometric equations by LiDAR data. Since tree crown diameter estimation is challenging from spaceborne LiDAR measurements, we develop a model to correlate tree height with biomass on the individual tree level employing a Gaussian process regressor. In order to validate the proposed model, a set of 8,342 samples on tree height, trunk…

    Submitted 17 May, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: in press, IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens

  26. arXiv:2305.08413  [pdf, other]

    cs.CV eess.IV stat.AP

    Artificial intelligence to advance Earth observation: A review of models, recent trends, and pathways forward

    Authors: Devis Tuia, Konrad Schindler, Begüm Demir, Xiao Xiang Zhu, Mrinalini Kochupillai, Sašo Džeroski, Jan N. van Rijn, Holger H. Hoos, Fabio Del Frate, Mihai Datcu, Volker Markl, Bertrand Le Saux, Rochelle Schneider, Gustau Camps-Valls

    Abstract: Earth observation (EO) is a prime instrument for monitoring land and ocean processes, studying the dynamics at work, and taking the pulse of our planet. This article gives a bird's eye view of the essential scientific tools and approaches informing and supporting the transition from raw EO data to usable EO-based information. The promises, as well as the current challenges of these developments, a…

    Submitted 16 September, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Journal ref: IEEE Geoscience and Remote Sensing Magazine, 2024

  27. arXiv:2305.02770  [pdf, other]

    cs.CY cs.CL stat.AP

    The Politics of Language Choice: How the Russian-Ukrainian War Influences Ukrainians' Language Use on Twitter

    Authors: Daniel Racek, Brittany I. Davidson, Paul W. Thurner, Xiao Xiang Zhu, Göran Kauermann

    Abstract: The use of language is innately political and often a vehicle of cultural identity as well as the basis for nation building. Here, we examine language choice and tweeting activity of Ukrainian citizens based on more than 4 million geo-tagged tweets from over 62,000 users before and during the Russian-Ukrainian War, from January 2020 to October 2022. Using statistical models, we disentangle sample…

    Submitted 6 June, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  28. arXiv:2304.06292  [pdf, ps, other]

    cs.LG stat.AP stat.ME

    Improved Naive Bayes with Mislabeled Data

    Authors: Qianhan Zeng, Yingqiu Zhu, Xuening Zhu, Feifei Wang, Weichen Zhao, Shuning Sun, Meng Su, Hansheng Wang

    Abstract: Labeling mistakes are frequently encountered in real-world applications. If not treated well, labeling mistakes can seriously degrade the classification performance of a model. To address this issue, we propose an improved Naive Bayes method for text classification. It is analytically simple and free of subjective judgements on the correct and incorrect labels. By specifying the generatin…

    Submitted 13 April, 2023; originally announced April 2023.

  29. arXiv:2304.06231  [pdf, other]

    stat.ME

    Subsampling and Jackknifing: A Practically Convenient Solution for Large Data Analysis with Limited Computational Resources

    Authors: Shuyuan Wu, Xuening Zhu, Hansheng Wang

    Abstract: Modern statistical analysis often encounters datasets with large sizes. For these datasets, conventional estimation methods can hardly be used immediately because practitioners often suffer from limited computational resources. In most cases, they do not have powerful computational resources (e.g., Hadoop or Spark). How to practically analyze large datasets with limited computational resources the…

    Submitted 12 April, 2023; originally announced April 2023.

  30. arXiv:2304.02269  [pdf, other]

    stat.ME

    Distributed Logistic Regression for Massive Data with Rare Events

    Authors: Xuetong Li, Xuening Zhu, Hansheng Wang

    Abstract: Large-scale rare events data are commonly encountered in practice. To tackle the massive rare events data, we propose a novel distributed estimation method for logistic regression in a distributed system. For a distributed framework, we face the following two challenges. The first challenge is how to distribute the data. In this regard, two different distribution strategies (i.e., the RANDOM strat…

    Submitted 5 April, 2023; originally announced April 2023.

  31. arXiv:2303.07392  [pdf, other]

    stat.ML cs.LG

    Efficient Bayesian Physics Informed Neural Networks for Inverse Problems via Ensemble Kalman Inversion

    Authors: Andrew Pensoneault, Xueyu Zhu

    Abstract: Bayesian Physics Informed Neural Networks (B-PINNs) have gained significant attention for inferring physical parameters and learning the forward solutions for problems based on partial differential equations. However, the overparameterized nature of neural networks poses a computational challenge for high-dimensional posterior inference. Existing inference approaches, such as particle-based or var…

    Submitted 13 March, 2023; originally announced March 2023.

  32. arXiv:2303.04745  [pdf, other]

    cs.LG stat.ML

    A General Theory of Correct, Incorrect, and Extrinsic Equivariance

    Authors: Dian Wang, Xupeng Zhu, Jung Yeon Park, Mingxi Jia, Guanang Su, Robert Platt, Robin Walters

    Abstract: Although equivariant machine learning has proven effective at many tasks, success depends heavily on the assumption that the ground truth function is symmetric over the entire domain matching the symmetry in an equivariant neural network. A missing piece in the equivariant learning literature is the analysis of equivariant networks when symmetry exists only partially in the domain. In this work, w…

    Submitted 28 October, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Published at NeurIPS 2023

  33. arXiv:2302.02768  [pdf, other]

    stat.ME

    Network Autoregression for Incomplete Matrix-Valued Time Series

    Authors: Xuening Zhu, Feifei Wang, Zeng Li, Yanyuan Ma

    Abstract: We study the dynamics of matrix-valued time series with observed network structures by proposing a matrix network autoregression model with row and column networks of the subjects. We incorporate covariate information and a low rank intercept matrix. We allow incomplete observations in the matrices and the missing mechanism can be covariate dependent. To estimate the model, a two-step estimation p…

    Submitted 6 February, 2023; originally announced February 2023.

  34. arXiv:2301.05130  [pdf, other]

    stat.ME econ.EM math.ST

    Unbiased estimation and asymptotically valid inference in multivariable Mendelian randomization with many weak instrumental variables

    Authors: Yihe Yang, Noah Lorincz-Comi, Xiaofeng Zhu

    Abstract: Mendelian randomization (MR) is an instrumental variable (IV) approach to infer causal relationships between exposures and outcomes with genome-wide association studies (GWAS) summary data. However, the multivariable inverse-variance weighting (IVW) approach, which serves as the foundation for most MR approaches, cannot yield unbiased causal effect estimates in the presence of many weak IVs. To ad…

    Submitted 10 February, 2024; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: We have observed potential competitors, so we reverted to the version prior to the fourth update (v4). However, this paper and https://www.biorxiv.org/content/10.1101/2023.01.10.523480v3.abstract have been merged, with the main content summarized in Supplemental Material 2

    MSC Class: 62F12 (Primary) 62J05; 62P10 (Secondary)

  35. arXiv:2212.02107  [pdf, other]

    stat.ME

    Matrix-valued Network Autoregression Model with Latent Group Structure

    Authors: Yimeng Ren, Xuening Zhu, Yanyuan Ma

    Abstract: Matrix-valued time series data are frequently observed in a broad range of areas and have attracted great attention recently. In this work, we model network effects for high dimensional matrix-valued time series data in a matrix autoregression framework. To characterize the potential heterogeneity of the subjects and handle the high dimensionality simultaneously, we assume that each subject has a…

    Submitted 5 December, 2022; originally announced December 2022.

  36. arXiv:2211.06039  [pdf, other]

    stat.ML cs.LG

    Online Linearized LASSO

    Authors: Shuoguang Yang, Yuhao Yan, Xiuneng Zhu, Qiang Sun

    Abstract: Sparse regression has been a popular approach to perform variable selection and enhance the prediction accuracy and interpretability of the resulting statistical model. Existing approaches focus on offline regularized regression, while the online scenario has rarely been studied. In this paper, we propose a novel online sparse linear regression framework for analyzing streaming data when data poin…

    Submitted 1 January, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

  37. arXiv:2210.16634  [pdf, other]

    stat.CO

    Distributed Estimation and Inference for Spatial Autoregression Model with Large Scale Networks

    Authors: Yimeng Ren, Zhe Li, Xuening Zhu, Yuan Gao, Hansheng Wang

    Abstract: The rapid growth of online network platforms generates large-scale network data and it poses great challenges for statistical analysis using the spatial autoregression (SAR) model. In this work, we develop a novel distributed estimation and statistical inference framework for the SAR model on a distributed system. We first propose a distributed network least squares approximation (DNLSA) method. T…

    Submitted 27 November, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

  38. arXiv:2210.03294  [pdf, other]

    cs.LG math.OC stat.ML

    Understanding Edge-of-Stability Training Dynamics with a Minimalist Example

    Authors: Xingyu Zhu, Zixuan Wang, Xiang Wang, Mo Zhou, Rong Ge

    Abstract: Recently, researchers observed that gradient descent for deep neural networks operates in an "edge-of-stability" (EoS) regime: the sharpness (maximum eigenvalue of the Hessian) is often larger than the stability threshold $2/η$ (where $η$ is the step size). Despite this, the loss oscillates and converges in the long run, and the sharpness at the end is just slightly below $2/η$. While many other wel…

    Submitted 21 February, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: 53 pages, 19 figures

    ACM Class: I.2.6

  39. arXiv:2209.12229  [pdf, other]

    stat.ME

    Simultaneous Estimation and Group Identification for Network Vector Autoregressive Model with Heterogeneous Nodes

    Authors: Xuening Zhu, Ganggang Xu, Jianqing Fan

    Abstract: Individuals or companies in a large social or financial network often display rather heterogeneous behaviors for various reasons. In this work, we propose a network vector autoregressive model with a latent group structure to model heterogeneous dynamic patterns observed from network nodes, for which group-wise network effects and time-invariant fixed-effects can be naturally incorporated. In our f…

    Submitted 11 August, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

  40. arXiv:2209.05474  [pdf, other]

    stat.ME

    Consistent Selection of the Number of Groups in Panel Models via Cross-Validation

    Authors: Zhe Li, Xuening Zhu, Changliang Zou

    Abstract: Group number selection is a key problem for group panel data modeling. In this work, we develop a cross-validation (CV) method to tackle this problem. Specifically, we split the panel data into two data folds on the time span, with group structure preserved for individuals. We first estimate the group memberships and parameters on one data fold, then we plug in the estimates and utilize the other…

    Submitted 16 May, 2025; v1 submitted 12 September, 2022; originally announced September 2022.

  41. Seismic fragility analysis using stochastic polynomial chaos expansions

    Authors: X. Zhu, M. Broccardo, B. Sudret

    Abstract: Within the performance-based earthquake engineering (PBEE) framework, the fragility model plays a pivotal role. Such a model represents the probability that the engineering demand parameter (EDP) exceeds a certain safety threshold given a set of selected intensity measures (IMs) that characterize the earthquake load. The state-of-the-art methods for fragility computation rely on full non-linear ti…

    Submitted 1 February, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Report number: RSUQ-2022-006B

    Journal ref: Probabilistic Engineering Mechanics, 2023,103413

  42. arXiv:2206.13004  [pdf, other]

    stat.ME

    Multiple change point detection in tensors

    Authors: Jiaqi Huang, Junhui Wang, Xuehu Zhu, Lixing Zhu

    Abstract: This paper proposes a criterion for detecting change structures in tensor data. To accommodate tensor structure with structural mode that is not suitable to be equally treated and summarized in a distance to measure the difference between any two adjacent tensors, we define a mode-based signal-screening Frobenius distance for the moving sums of slices of tensor data to handle both dense and sparse…

    Submitted 18 March, 2023; v1 submitted 26 June, 2022; originally announced June 2022.

  43. arXiv:2206.00165  [pdf, ps, other]

    cs.LG cs.DC stat.ML

    Byzantine-Robust Online and Offline Distributed Reinforcement Learning

    Authors: Yiding Chen, Xuezhou Zhang, Kaiqing Zhang, Mengdi Wang, Xiaojin Zhu

    Abstract: We consider a distributed reinforcement learning setting where multiple agents separately explore the environment and communicate their experiences through a central server. However, an $α$-fraction of agents are adversarial and can report arbitrary fake information. Critically, these adversarial agents can collude and their fake data can be of any size. We desire to robustly identify a near-optimal…

    Submitted 31 May, 2022; originally announced June 2022.

  44. arXiv:2204.08524  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    So2Sat POP -- A Curated Benchmark Data Set for Population Estimation from Space on a Continental Scale

    Authors: Sugandha Doda, Yuanyuan Wang, Matthias Kahl, Eike Jens Hoffmann, Kim Ouan, Hannes Taubenböck, Xiao Xiang Zhu

    Abstract: Obtaining a dynamic population distribution is key to many decision-making processes such as urban planning, disaster management and most importantly helping the government to better allocate socio-technical supply. For the aspiration of these objectives, good population data is essential. The traditional method of collecting population data through the census is expensive and tedious. In recent y…

    Submitted 10 November, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  45. arXiv:2202.10513  [pdf, other]

    stat.ME

    Quantifying Uncertainty for Temporal Motif Estimation in Graph Streams under Sampling

    Authors: Xiaojing Zhu, Eric D. Kolaczyk

    Abstract: Dynamic networks, a.k.a. graph streams, consist of a set of vertices and a collection of timestamped interaction events (i.e., temporal edges) between vertices. Temporal motifs are defined as classes of (small) isomorphic induced subgraphs on graph streams, considering both edge ordering and duration. As with motifs in static networks, temporal motifs are the fundamental building blocks for tempor…

    Submitted 21 February, 2022; originally announced February 2022.

  46. Stochastic polynomial chaos expansions to emulate stochastic simulators

    Authors: X. Zhu, B. Sudret

    Abstract: In the context of uncertainty quantification, computational models are required to be repeatedly evaluated. This task is intractable for costly numerical models. Such a problem turns out to be even more severe for stochastic simulators, the output of which is a random variable for a given set of input parameters. To alleviate the computational burden, surrogate models are usually constructed and e…

    Submitted 26 November, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Report number: RSUQ-2022-01C

    Journal ref: International Journal for Uncertainty Quantification 13(2), 31-52 (2023)

  47. arXiv:2111.01507  [pdf, other]

    stat.ME math.ST stat.CO

    An Asymptotic Analysis of Minibatch-Based Momentum Methods for Linear Regression Models

    Authors: Yuan Gao, Xuening Zhu, Haobo Qi, Guodong Li, Riquan Zhang, Hansheng Wang

    Abstract: Momentum methods have been shown to accelerate the convergence of the standard gradient descent algorithm in practice and theory. In particular, the minibatch-based gradient descent methods with momentum (MGDM) are widely used to solve large-scale optimization problems with massive datasets. Despite the success of the MGDM methods in practice, their theoretical properties are still underexplored.…

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 45 pages, 5 figures

  48. arXiv:2110.04991  [pdf, other]

    stat.ME

    Graphical Assistant Grouped Network Autoregression Model: a Bayesian Nonparametric Recourse

    Authors: Yimeng Ren, Xuening Zhu, Guanyu Hu

    Abstract: Vector autoregression model is ubiquitous in classical time series data analysis. With the rapid advance of social network sites, time series data over latent graph is becoming increasingly popular. In this paper, we develop a novel Bayesian grouped network autoregression model to simultaneously estimate group information (number of groups and group configurations) and group-wise parameters. Speci…

    Submitted 11 October, 2021; originally announced October 2021.

  49. arXiv:2110.00936  [pdf, ps, other]

    stat.ME stat.CO

    A Sequential Addressing Subsampling Method for Massive Data Analysis under Memory Constraint

    Authors: Rui Pan, Yingqiu Zhu, Baishan Guo, Xuening Zhu, Hansheng Wang

    Abstract: The emergence of massive data in recent years brings challenges to automatic statistical inference. This is particularly true if the data are too numerous to be read into memory as a whole. Accordingly, new sampling techniques are needed to sample data from a hard drive. In this paper, we propose a sequential addressing subsampling (SAS) method, that can sample data directly from the hard drive. T…

    Submitted 3 October, 2021; originally announced October 2021.

  50. arXiv:2109.13129  [pdf, other]

    stat.AP stat.ME

    Disentangling positive and negative partisanship in social media interactions using a coevolving latent space network with attractors model

    Authors: Xiaojing Zhu, Cantay Caliskan, Dino P. Christenson, Konstantinos Spiliopoulos, Dylan Walker, Eric D. Kolaczyk

    Abstract: We develop a broadly applicable class of coevolving latent space network with attractors (CLSNA) models, where nodes represent individual social actors assumed to lie in an unknown latent space, edges represent the presence of a specified interaction between actors, and attractors are added in the latent level to capture the notion of attractive and repulsive forces. We apply the CLSNA models to u…

    Submitted 13 August, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: revised version