Skip to main content

Showing 1–8 of 8 results for author: Shi, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.02919  [pdf, other

    stat.ML cs.GR cs.LG

    ConfEviSurrogate: A Conformalized Evidential Surrogate Model for Uncertainty Quantification

    Authors: Yuhan Duan, Xin Zhao, Neng Shi, Han-Wei Shen

    Abstract: Surrogate models, crucial for approximating complex simulation data across sciences, inherently carry uncertainties that range from simulation noise to model prediction errors. Without rigorous uncertainty quantification, predictions become unreliable and hence hinder analysis. While methods like Monte Carlo dropout and ensemble models exist, they are often costly, fail to isolate uncertainty type… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

  2. arXiv:2407.17720  [pdf, ps, other

    stat.CO physics.comp-ph

    Diffusion-Based Surrogate Modeling and Multi-Fidelity Calibration

    Authors: Naichen Shi, Hao Yan, Shenghan Guo, Raed Al Kontar

    Abstract: Physics simulations have become fundamental tools to study myriad engineering systems. As physics simulations often involve simplifications, their outputs should be calibrated using real-world data. In this paper, we present a diffusion-based surrogate (DBS) that calibrates multi-fidelity physics simulations with diffusion generative processes. DBS categorizes multi-fidelity physics simulations in… ▽ More

    Submitted 27 June, 2025; v1 submitted 24 July, 2024; originally announced July 2024.

    Journal ref: IEEE Transactions on Automation Science and Engineering, 2025

  3. Personalized Tucker Decomposition: Modeling Commonality and Peculiarity on Tensor Data

    Authors: Jiuyun Hu, Naichen Shi, Raed Al Kontar, Hao Yan

    Abstract: We propose personalized Tucker decomposition (perTucker) to address the limitations of traditional tensor decomposition methods in capturing heterogeneity across different datasets. perTucker decomposes tensor data into shared global components and personalized local components. We introduce a mode orthogonality assumption and develop a proximal gradient regularized block coordinate descent algori… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Journal ref: Technometrics 2025

  4. arXiv:2305.17744  [pdf, other

    stat.ME

    Heterogeneous Matrix Factorization: When Features Differ by Datasets

    Authors: Naichen Shi, Raed Al Kontar, Salar Fattahi

    Abstract: In myriad statistical applications, data are collected from related but heterogeneous sources. These sources share some commonalities while containing idiosyncratic characteristics. One of the most fundamental challenges in such scenarios is to recover the shared and source-specific factors. Despite the existence of a few heuristic approaches, a generic algorithm with theoretical guarantees has ye… ▽ More

    Submitted 27 March, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

  5. arXiv:2207.08041  [pdf, other

    cs.LG math.ST stat.ML

    Personalized PCA: Decoupling Shared and Unique Features

    Authors: Naichen Shi, Raed Al Kontar

    Abstract: In this paper, we tackle a significant challenge in PCA: heterogeneity. When data are collected from different sources with heterogeneous trends while still sharing some congruency, it is critical to extract shared knowledge while retaining the unique features of each source. To this end, we propose personalized PCA (PerPCA), which uses mutually orthogonal global and local principal components to… ▽ More

    Submitted 8 February, 2024; v1 submitted 16 July, 2022; originally announced July 2022.

    Report number: https://www.jmlr.org/papers/v25/22-0810.html

    Journal ref: Journal of Machine Learning Research 2024, 25(41):1-82

  6. Fed-ensemble: Improving Generalization through Model Ensembling in Federated Learning

    Authors: Naichen Shi, Fan Lai, Raed Al Kontar, Mosharaf Chowdhury

    Abstract: In this paper we propose Fed-ensemble: a simple approach that bringsmodel ensembling to federated learning (FL). Instead of aggregating localmodels to update a single global model, Fed-ensemble uses random permutations to update a group of K models and then obtains predictions through model averaging. Fed-ensemble can be readily utilized within established FL methods and does not impose a computat… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Journal ref: IEEE Transactions on Automation Science and Engineering (TASE), 2023

  7. arXiv:2006.13790  [pdf, ps, other

    stat.ME stat.CO

    Sequential Gibbs Sampling Algorithm for Cognitive Diagnosis Models with Many Attributes

    Authors: Juntao Wang, Ningzhong Shi, Xue Zhang, Gongjun Xu

    Abstract: Cognitive diagnosis models (CDMs) are useful statistical tools to provide rich information relevant for intervention and learning. As a popular approach to estimate and make inference of CDMs, the Markov chain Monte Carlo (MCMC) algorithm is widely used in practice. However, when the number of attributes, $K$, is large, the existing MCMC algorithm may become time-consuming, due to the fact that… ▽ More

    Submitted 13 February, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 43 pages, 3 figures

  8. arXiv:0705.4588  [pdf, ps, other

    stat.ME

    Variable Selection Incorporating Prior Constraint Information into Lasso

    Authors: Shurong Zheng, Guodong Song, Ning-Zhong Shi

    Abstract: We propose the variable selection procedure incorporating prior constraint information into lasso. The proposed procedure combines the sample and prior information, and selects significant variables for responses in a narrower region where the true parameters lie. It increases the efficiency to choose the true model correctly. The proposed procedure can be executed by many constrained quadratic… ▽ More

    Submitted 31 May, 2007; originally announced May 2007.

    Comments: 15 pages