Showing 1–50 of 85 results for author: Jiang, Z

  1. arXiv:2505.19136  [pdf, ps, other]

    stat.ML cs.LG

    Uncertainty Quantification for Physics-Informed Neural Networks with Extended Fiducial Inference

    Authors: Frank Shih, Zhenghao Jiang, Faming Liang

    Abstract: Uncertainty quantification (UQ) in scientific machine learning is increasingly critical as neural networks are widely adopted to tackle complex problems across diverse scientific disciplines. For physics-informed neural networks (PINNs), a prominent model in scientific machine learning, uncertainty is typically quantified using Bayesian or dropout methods. However, both approaches suffer from a fu… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  2. arXiv:2505.11749  [pdf, other]

    stat.ML cs.LG

    Missing Data Imputation by Reducing Mutual Information with Rectified Flows

    Authors: Jiahao Yu, Qizhen Ying, Leyang Wang, Ziyue Jiang, Song Liu

    Abstract: This paper introduces a novel iterative method for missing data imputation that sequentially reduces the mutual information between data and their corresponding missing mask. Inspired by GAN-based approaches, which train generators to decrease the predictability of missingness patterns, our method explicitly targets the reduction of mutual information. Specifically, our algorithm iteratively minim… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  3. arXiv:2505.05633  [pdf, ps, other]

    stat.ME stat.CO

    Tutorial on Bayesian Functional Regression Using Stan

    Authors: Ziren Jiang, Ciprian Crainiceanu, Erjia Cui

    Abstract: This manuscript provides step-by-step instructions for implementing Bayesian functional regression models using Stan. Extensive simulations indicate that the inferential performance of the methods is comparable to that of state-of-the-art frequentist approaches. However, Bayesian approaches allow for more flexible modeling and provide an alternative when frequentist methods are not available or ma… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  4. arXiv:2505.02020  [pdf, other]

    cs.LG cs.AI stat.ML

    Wide & Deep Learning for Node Classification

    Authors: Yancheng Chen, Wenguo Yang, Zhipeng Jiang

    Abstract: Wide & Deep, a simple yet effective learning architecture for recommendation systems developed by Google, has had a significant impact in both academia and industry due to its combination of the memorization ability of generalized linear models and the generalization ability of deep models. Graph convolutional networks (GCNs) remain dominant in node classification tasks; however, recent studies ha… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

    Comments: 16 pages, 6 figures, 13 tables

  5. arXiv:2505.00526  [pdf, other]

    econ.EM cs.LG stat.CO

    Pre-Training Estimators for Structural Models: Application to Consumer Search

    Authors: Yanhao 'Max' Wei, Zhenling Jiang

    Abstract: We explore pretraining estimators for structural econometric models. The estimator is "pretrained" in the sense that the bulk of the computational cost and researcher effort occur during the construction of the estimator. Subsequent applications of the estimator to different datasets require little computational cost or researcher effort. The estimation leverages a neural net to recognize the stru… ▽ More

    Submitted 19 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

    Comments: Originally posted on SSRN on June 7, 2024

    ACM Class: G.3; J.4; I.2

  6. arXiv:2504.01276  [pdf, other]

    eess.SP stat.OT

    Online Fault Detection and Classification of Chemical Process Systems Leveraging Statistical Process Control and Riemannian Geometric Analysis

    Authors: Alireza Miraliakbar, Fangyuan Ma, Zheyu Jiang

    Abstract: In this work, we study an integrated fault detection and classification framework called FARM for fast, accurate, and robust online chemical process monitoring. The FARM framework integrates the latest advancements in statistical process control (SPC) for monitoring nonparametric and heterogeneous data streams with novel data analysis approaches based on Riemannian geometry together in a hierarchi… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: Under review at Computers and Chemical Engineering

  7. arXiv:2503.22401  [pdf, other]

    cs.LG stat.ME

    Generative Reliability-Based Design Optimization Using In-Context Learning Capabilities of Large Language Models

    Authors: Zhonglin Jiang, Qian Tang, Zequn Wang

    Abstract: Large Language Models (LLMs) have demonstrated remarkable in-context learning capabilities, enabling flexible utilization of limited historical information to play pivotal roles in reasoning, problem-solving, and complex pattern recognition tasks. Inspired by the successful applications of LLMs in multiple domains, this paper proposes a generative design method by leveraging the in-context learnin… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

    Comments: 17 pages, 11 figures, 4 tables

  8. arXiv:2502.17723  [pdf, other]

    stat.ME

    Semiparametric estimation for multivariate Hawkes processes using dependent Dirichlet processes: An application to order flow data in financial markets

    Authors: Alex Ziyu Jiang, Abel Rodriguez

    Abstract: The order flow in high-frequency financial markets has been of particular research interest in recent years, as it provides insights into trading and order execution strategies and leads to better understanding of the supply-demand interplay and price formation. In this work, we propose a semiparametric multivariate Hawkes process model that relies on (mixtures of) dependent Dirichlet processes to… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  9. Estimating Parameters of Structural Models Using Neural Networks

    Authors: Yanhao Wei, Zhenling Jiang

    Abstract: We study an alternative use of machine learning. We train neural nets to provide the parameter estimate of a given (structural) econometric model, for example, discrete choice or consumer search. Training examples consist of datasets generated by the econometric model under a range of parameter values. The neural net takes the moments of a dataset as input and tries to recognize the parameter valu… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    ACM Class: G.3; J.4; I.2

    Journal ref: Marketing Science 44(1):102-128 (2024)

  10. arXiv:2501.16388  [pdf, other]

    cs.LG stat.AP

    Development and Validation of a Dynamic Kidney Failure Prediction Model based on Deep Learning: A Real-World Study with External Validation

    Authors: Jingying Ma, Jinwei Wang, Lanlan Lu, Yexiang Sun, Mengling Feng, Peng Shen, Zhiqin Jiang, Shenda Hong, Luxia Zhang

    Abstract: Background: Chronic kidney disease (CKD), a progressive disease with high morbidity and mortality, has become a significant global public health problem. At present, most of the models used for predicting the progression of CKD are static models. We aim to develop a dynamic kidney failure prediction model based on deep learning (KFDeep) for CKD patients, utilizing all available data on common clin… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

  11. arXiv:2501.06777  [pdf, ps, other]

    econ.EM stat.ML

    Identification and Estimation of Simultaneous Equation Models Using Higher-Order Cumulant Restrictions

    Authors: Ziyu Jiang

    Abstract: Identifying structural parameters in linear simultaneous equation models is a fundamental challenge in economics and related fields. Recent work leverages higher-order distributional moments, exploiting the fact that non-Gaussian data carry more structural information than the Gaussian framework. While many of these contributions still require zero-covariance assumptions for structural errors, thi… ▽ More

    Submitted 12 January, 2025; originally announced January 2025.

  12. arXiv:2412.16523  [pdf, other]

    cs.LG cs.CY physics.soc-ph stat.ML

    Physics-Guided Fair Graph Sampling for Water Temperature Prediction in River Networks

    Authors: Erhu He, Declan Kutscher, Yiqun Xie, Jacob Zwart, Zhe Jiang, Huaxiu Yao, Xiaowei Jia

    Abstract: This work introduces a novel graph neural network (GNN)-based method to predict stream water temperature and reduce model bias across locations of different income and education levels. Traditional physics-based models often have limited accuracy because they are necessarily approximations of reality. Recently, there has been an increasing interest in using GNNs in modeling complex water dynamic… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

  13. arXiv:2412.10658  [pdf, ps, other]

    stat.ME cs.AI cs.LG

    Combining Priors with Experience: Confidence Calibration Based on Binomial Process Modeling

    Authors: Jinzong Dong, Zhaohui Jiang, Dong Pan, Haoyang Yu

    Abstract: Confidence calibration of classification models is a technique to estimate the true posterior probability of the predicted class, which is critical for ensuring reliable decision-making in practical applications. Existing confidence calibration methods mostly use statistical techniques to estimate the calibration curve from data or fit a user-defined calibration function, but often overlook fully… ▽ More

    Submitted 18 February, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

    Comments: Accepted by AAAI-25

  14. arXiv:2412.08794  [pdf, other]

    cs.LG stat.ML

    Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning

    Authors: Prajwal Koirala, Zhanhong Jiang, Soumik Sarkar, Cody Fleming

    Abstract: In safe offline reinforcement learning (RL), the objective is to develop a policy that maximizes cumulative rewards while strictly adhering to safety constraints, utilizing only offline data. Traditional methods often face difficulties in balancing these constraints, leading to either diminished performance or increased safety risks. We address these issues with a novel approach that begins by lea… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  15. arXiv:2410.23412  [pdf, other]

    stat.ME q-bio.QM stat.ML

    BAMITA: Bayesian Multiple Imputation for Tensor Arrays

    Authors: Ziren Jiang, Gen Li, Eric F. Lock

    Abstract: Data increasingly take the form of a multi-way array, or tensor, in several biomedical domains. Such tensors are often incompletely observed. For example, we are motivated by longitudinal microbiome studies in which several timepoints are missing for several subjects. There is a growing literature on missing data imputation for tensors. However, existing methods give a point estimate for missing v… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 27 pages, 5 tables, 2 figures

  16. arXiv:2410.17864  [pdf, other]

    stat.ME stat.AP

    Longitudinal Causal Inference with Selective Eligibility

    Authors: Zhichao Jiang, Eli Ben-Michael, D. James Greiner, Ryan Halen, Kosuke Imai

    Abstract: Dropout poses a significant challenge to causal inference in longitudinal studies with time-varying treatments. However, existing research does not simultaneously address dropout and time-varying treatments. We examine selective eligibility, an important yet overlooked source of non-ignorable dropout in such settings. This problem arises when a unit's prior treatment history influences its eligibi… ▽ More

    Submitted 15 March, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

  17. arXiv:2407.03515  [pdf, other]

    stat.ML cs.LG

    Fast Calculation of Feature Contributions in Boosting Trees

    Authors: Zhongli Jiang, Min Zhang, Dabao Zhang

    Abstract: Recently, several fast algorithms have been proposed to decompose predicted value into Shapley values, enabling individualized feature contribution analysis in tree models. While such local decomposition offers valuable insights, it underscores the need for a global evaluation of feature contributions. Although coefficients of determination ($R^2$) allow for comparative assessment of individual fe… ▽ More

    Submitted 26 May, 2025; v1 submitted 3 July, 2024; originally announced July 2024.

  18. arXiv:2406.14380  [pdf, other]

    econ.EM cs.LG stat.ME

    Estimating Treatment Effects under Recommender Interference: A Structured Neural Networks Approach

    Authors: Ruohan Zhan, Shichao Han, Yuchen Hu, Zhenling Jiang

    Abstract: Recommender systems are essential for content-sharing platforms by curating personalized content. To evaluate updates to recommender systems targeting content creators, platforms frequently rely on creator-side randomized experiments. The treatment effect measures the change in outcomes when a new algorithm is implemented compared to the status quo. We show that the standard difference-in-means es… ▽ More

    Submitted 5 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  19. arXiv:2403.12108  [pdf, other]

    cs.AI econ.GN stat.AP stat.ME

    Does AI help humans make better decisions? A statistical evaluation framework for experimental and observational studies

    Authors: Eli Ben-Michael, D. James Greiner, Melody Huang, Kosuke Imai, Zhichao Jiang, Sooahn Shin

    Abstract: The use of Artificial Intelligence (AI), or more generally data-driven algorithms, has become ubiquitous in today's society. Yet, in many cases and especially when stakes are high, humans still make final decisions. The critical question, therefore, is whether AI helps humans make better decisions compared to a human-alone or AI-alone system. We introduce a new methodological framework to empirica… ▽ More

    Submitted 11 October, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  20. arXiv:2402.03192  [pdf, other]

    stat.ME

    Multiple testing using uniform filtering of ordered p-values

    Authors: Zhiwen Jiang, Stephan Morgenthaler

    Abstract: We investigate the multiplicity model with m values of some test statistic independently drawn from a mixture of no effect (null) and positive effect (alternative), where we seek to identify the alternative test results with a controlled error rate. We are interested in the case where the alternatives are rare. A number of multiple testing procedures filter the set of ordered p-values in order to… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 22 pages, 5 figures

  21. arXiv:2312.11927  [pdf, other]

    cs.LG cs.SI stat.ME

    Empowering Dual-Level Graph Self-Supervised Pretraining with Motif Discovery

    Authors: Pengwei Yan, Kaisong Song, Zhuoren Jiang, Yangyang Kang, Tianqianjin Lin, Changlong Sun, Xiaozhong Liu

    Abstract: While self-supervised graph pretraining techniques have shown promising results in various domains, their application still experiences challenges of limited topology learning, human knowledge dependency, and incompetent multi-level interactions. To address these issues, we propose a novel solution, Dual-level Graph self-supervised Pretraining with Motif discovery (DGPM), which introduces a unique… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 14 pages, 6 figures, accepted by AAAI'24

  22. arXiv:2312.05757  [pdf, ps, other]

    cs.LG cs.AI cs.DL cs.SI stat.ME

    Towards Human-like Perception: Learning Structural Causal Model in Heterogeneous Graph

    Authors: Tianqianjin Lin, Kaisong Song, Zhuoren Jiang, Yangyang Kang, Weikang Yuan, Xurui Li, Changlong Sun, Cui Huang, Xiaozhong Liu

    Abstract: Heterogeneous graph neural networks have become popular in various domains. However, their generalizability and interpretability are limited due to the discrepancy between their inherent inference flows and human reasoning logic or underlying causal relationships for the learning problem. This study introduces a novel solution, HG-SCM (Heterogeneous Graph as Structural Causal Model). It can mimic… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 28 pages, 10 figures, 6 tables, accepted by Information Processing & Management

    Journal ref: Information Processing & Management, 60 (2024) 1-21

  23. arXiv:2310.11620  [pdf, other]

    stat.ME

    Modified treatment policy effect estimation with weighted energy distance

    Authors: Ziren Jiang, Jared D. Huling

    Abstract: The causal effects of continuous treatments are often characterized through the average dose response function, which is challenging to estimate from observational data due to confounding and positivity violations. Modified treatment policies (MTPs) are an alternative approach that aims to assess the effect of a modification to observed treatment values and works under relaxed assumptions. Estimator… ▽ More

    Submitted 18 January, 2025; v1 submitted 17 October, 2023; originally announced October 2023.

  24. arXiv:2310.01508  [pdf, other]

    cs.LG stat.ML

    CODA: Temporal Domain Generalization via Concept Drift Simulator

    Authors: Chia-Yuan Chang, Yu-Neng Chuang, Zhimeng Jiang, Kwei-Herng Lai, Anxiao Jiang, Na Zou

    Abstract: In real-world applications, machine learning models often become obsolete due to shifts in the joint distribution arising from underlying temporal trends, a phenomenon known as the "concept drift". Existing works propose model-specific strategies to achieve temporal generalization in the near-future domain. However, the diverse characteristics of real-world datasets necessitate customized predicti… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  25. arXiv:2309.14658  [pdf, other]

    stat.CO stat.ME

    Improvements on Scalable Stochastic Bayesian Inference Methods for Multivariate Hawkes Process

    Authors: Alex Ziyu Jiang, Abel Rodríguez

    Abstract: Multivariate Hawkes Processes (MHPs) are a class of point processes that can account for complex temporal dynamics among event sequences. In this work, we study the accuracy and computational efficiency of three classes of algorithms which, while widely used in the context of Bayesian inference, have rarely been applied in the context of MHPs: stochastic gradient expectation-maximization, stochast… ▽ More

    Submitted 20 February, 2025; v1 submitted 26 September, 2023; originally announced September 2023.

  26. arXiv:2309.13270  [pdf, other]

    stat.ME stat.ML

    BARTSIMP: flexible spatial covariate modeling and prediction using Bayesian additive regression trees

    Authors: Alex Ziyu Jiang, Jon Wakefield

    Abstract: Prediction is a classic challenge in spatial statistics and the inclusion of spatial covariates can greatly improve predictive performance when incorporated into a model with latent spatial effects. It is desirable to develop flexible regression models that allow for nonlinearities and interactions in the covariate specification. Existing machine learning approaches that allow for spatial dependen… ▽ More

    Submitted 20 February, 2025; v1 submitted 23 September, 2023; originally announced September 2023.

  27. arXiv:2309.12425  [pdf, other]

    stat.ME math.ST

    Principal Stratification with Continuous Post-Treatment Variables: Nonparametric Identification and Semiparametric Estimation

    Authors: Sizhu Lu, Zhichao Jiang, Peng Ding

    Abstract: Post-treatment variables often complicate causal inference. They appear in many scientific problems, including noncompliance, truncation by death, mediation, and surrogate endpoint evaluation. Principal stratification is a strategy to address these challenges by adjusting for the potential values of the post-treatment variables, defined as the principal strata. It allows for characterizing treatme… ▽ More

    Submitted 3 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  28. arXiv:2306.13242  [pdf, other]

    stat.ML cs.AI cs.IT cs.LG

    Approximate Causal Effect Identification under Weak Confounding

    Authors: Ziwei Jiang, Lai Wei, Murat Kocaoglu

    Abstract: Causal effect estimation has been studied by many researchers when only observational data is available. Sound and complete algorithms have been developed for pointwise estimation of identifiable causal queries. For non-identifiable causal queries, researchers developed polynomial programs to estimate tight bounds on causal effect. However, these are computationally difficult to optimize for varia… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: Published in ICML 2023

  29. arXiv:2306.07918  [pdf, other]

    cs.LG stat.ML

    Causal Mediation Analysis with Multi-dimensional and Indirectly Observed Mediators

    Authors: Ziyang Jiang, Yiling Liu, Michael H. Klein, Ahmed Aloui, Yiman Ren, Keyu Li, Vahid Tarokh, David Carlson

    Abstract: Causal mediation analysis (CMA) is a powerful method to dissect the total effect of a treatment into direct and mediated effects within the potential outcome framework. This is important in many scientific applications to identify the underlying mechanisms of a treatment effect. However, in many scientific applications the mediator is unobserved, but there may exist related measurements. For examp… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 16 pages, 4 figures, 5 tables

  30. arXiv:2305.07642  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    The ASNR-MICCAI Brain Tumor Segmentation (BraTS) Challenge 2023: Intracranial Meningioma

    Authors: Dominic LaBella, Maruf Adewole, Michelle Alonso-Basanta, Talissa Altes, Syed Muhammad Anwar, Ujjwal Baid, Timothy Bergquist, Radhika Bhalerao, Sully Chen, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Devon Godfrey, Fathi Hilal, Ariana Familiar, Keyvan Farahani, Juan Eugenio Iglesias, Zhifan Jiang, Elaine Johanson, Anahita Fathi Kazerooni, Collin Kent, John Kirkpatrick, Florian Kofler, et al. (35 additional authors not shown)

    Abstract: Meningiomas are the most common primary intracranial tumor in adults and can be associated with significant morbidity and mortality. Radiologists, neurosurgeons, neuro-oncologists, and radiation oncologists rely on multiparametric MRI (mpMRI) for diagnosis, treatment planning, and longitudinal treatment monitoring; yet automated, objective, and quantitative tools for non-invasive assessment of men… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  31. arXiv:2304.06164  [pdf, other]

    stat.AP

    A Multi-Arm Two-Stage (MATS) Design for Proof-of-Concept and Dose Optimization in Early-Phase Oncology Trials

    Authors: Zhenghao Jiang, Gu Mi, Ji Lin, Christelle Lorenzato, Yuan Ji

    Abstract: The Project Optimus initiative by the FDA's Oncology Center of Excellence is widely viewed as a groundbreaking effort to change the $\textit{status quo}$ of conventional dose-finding strategies in oncology. Unlike in other therapeutic areas where multiple doses are evaluated thoroughly in dose ranging studies, early-phase oncology dose-finding studies are characterized by the practice of identifyi… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  32. arXiv:2302.13425  [pdf, other]

    cs.LG stat.ML

    A Survey on Uncertainty Quantification Methods for Deep Learning

    Authors: Wenchong He, Zhe Jiang, Tingsong Xiao, Zelin Xu, Yukun Li

    Abstract: Deep neural networks (DNNs) have achieved tremendous success in making accurate predictions for computer vision, natural language processing, as well as science and engineering domains. However, it is also well-recognized that DNNs sometimes make unexpected, incorrect, but overconfident predictions. This can cause serious consequences in high-stake applications, such as autonomous driving, medical… ▽ More

    Submitted 19 January, 2025; v1 submitted 26 February, 2023; originally announced February 2023.

    Comments: 39 pages, 13 figures

  33. arXiv:2302.02009  [pdf, other]

    cs.LG stat.ML

    Domain Adaptation via Rebalanced Sub-domain Alignment

    Authors: Yiling Liu, Juncheng Dong, Ziyang Jiang, Ahmed Aloui, Keyu Li, Hunter Klein, Vahid Tarokh, David Carlson

    Abstract: Unsupervised domain adaptation (UDA) is a technique used to transfer knowledge from a labeled source domain to a different but related unlabeled target domain. While many UDA methods have shown success in the past, they often assume that the source and target domains must have identical class label distributions, which can limit their effectiveness in real-world scenarios. To address this limitati… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 20 pages, 6 figures, 4 tables

  34. arXiv:2301.11351  [pdf, other]

    cs.LG stat.ML

    Estimating Causal Effects using a Multi-task Deep Ensemble

    Authors: Ziyang Jiang, Zhuoran Hou, Yiling Liu, Yiman Ren, Keyu Li, David Carlson

    Abstract: A number of methods have been proposed for causal effect estimation, yet few have demonstrated efficacy in handling data with complex structures, such as images. To fill this gap, we propose Causal Multi-task Deep Ensemble (CMDE), a novel framework that learns both shared and group-specific information from the study population. We provide proofs demonstrating equivalency of CMDE to a multi-task G… ▽ More

    Submitted 27 May, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: 18 pages, 7 figures, 3 tables, published at the 40th International Conference on Machine Learning (ICML 2023)

  35. Informing policy via dynamic models: Cholera in Haiti

    Authors: Jesse Wheeler, AnnaElaine Rosengart, Zhuoxun Jiang, Kevin Tan, Noah Treutle, Edward Ionides

    Abstract: Public health decisions must be made about when and how to implement interventions to control an infectious disease epidemic. These decisions should be informed by data on the epidemic as well as current understanding about the transmission dynamics. Such decisions can be posed as statistical questions about scientifically motivated dynamic models. Thus, we encounter the methodological task of bui… ▽ More

    Submitted 4 March, 2024; v1 submitted 21 January, 2023; originally announced January 2023.

    Comments: To be submitted to PLOS Comp Bio

  36. arXiv:2301.03246  [pdf, other]

    stat.ME

    An instrumental variable method for point processes: generalised Wald estimation based on deconvolution

    Authors: Zhichao Jiang, Shizhe Chen, Peng Ding

    Abstract: Point processes are probabilistic tools for modeling event data. While there exists a fast-growing literature studying the relationships between point processes, it remains unexplored how such relationships connect to causal effects. In the presence of unmeasured confounders, parameters from point process models do not necessarily have causal interpretations. We propose an instrumental variable me… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

  37. arXiv:2210.06728  [pdf, ps, other]

    stat.ML cs.DS cs.IT cs.LG stat.CO

    On the Efficient Implementation of High Accuracy Optimality of Profile Maximum Likelihood

    Authors: Moses Charikar, Zhihao Jiang, Kirankumar Shiragur, Aaron Sidford

    Abstract: We provide an efficient unified plug-in approach for estimating symmetric properties of distributions given $n$ independent samples. Our estimator is based on profile-maximum-likelihood (PML) and is sample optimal for estimating various symmetric properties when the estimation error $ε\gg n^{-1/3}$. This result improves upon the previous best accuracy threshold of $ε\gg n^{-1/4}$ achievable by pol… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  38. arXiv:2209.10105  [pdf, ps, other]

    cs.LG cs.DC stat.ML

    Distributed Online Non-convex Optimization with Composite Regret

    Authors: Zhanhong Jiang, Aditya Balu, Xian Yeow Lee, Young M. Lee, Chinmay Hegde, Soumik Sarkar

    Abstract: Regret has been widely adopted as the metric of choice for evaluating the performance of online optimization algorithms for distributed, multi-agent systems. However, data/model variations associated with agents can significantly impact decisions and require consensus among agents. Moreover, most existing works have focused on developing approaches for (either strongly or non-strongly) convex los… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 41 pages, presented at the Allerton Conference 2022

  39. arXiv:2208.11411  [pdf, other]

    q-bio.NC cond-mat.dis-nn cond-mat.stat-mech math-ph stat.ML

    Spectrum of non-Hermitian deep-Hebbian neural networks

    Authors: Zijian Jiang, Ziming Chen, Tianqi Hou, Haiping Huang

    Abstract: Neural networks with recurrent asymmetric couplings are important to understand how episodic memories are encoded in the brain. Here, we integrate the experimental observation of wide synaptic integration window into our model of sequence retrieval in the continuous time dynamics. The model with non-normal neuron-interactions is theoretically studied by deriving a random matrix theory of the Jacob… ▽ More

    Submitted 16 January, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: 65 pages, 12 figures, revised version for publication

    Journal ref: Phys. Rev. Research 5, 013090 (2023)

  40. arXiv:2206.10479  [pdf, other]

    stat.ML cs.LG stat.ME

    Policy Learning with Asymmetric Counterfactual Utilities

    Authors: Eli Ben-Michael, Kosuke Imai, Zhichao Jiang

    Abstract: Data-driven decision making plays an important role even in high stakes settings like medicine and public policy. Learning optimal policies from observed data requires a careful formulation of the utility function whose expected value is maximized across a population. Although researchers typically use utilities that depend on observed outcomes alone, in many settings the decision maker's utility… ▽ More

    Submitted 28 November, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

  41. arXiv:2205.07384  [pdf, other]

    cs.LG cs.AI stat.ML

    Incorporating Prior Knowledge into Neural Networks through an Implicit Composite Kernel

    Authors: Ziyang Jiang, Tongshu Zheng, Yiling Liu, David Carlson

    Abstract: It is challenging to guide neural network (NN) learning with prior knowledge. In contrast, many known properties, such as spatial smoothness or seasonality, are straightforward to model by choosing an appropriate kernel in a Gaussian process (GP). Many deep learning applications could be enhanced by modeling such known properties. For example, convolutional neural networks (CNNs) are frequently us… ▽ More

    Submitted 28 February, 2024; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: 27 pages, 13 figures, 5 tables, 3 algorithms, published in Transactions on Machine Learning Research (TMLR)

    ACM Class: I.5.1

  42. arXiv:2111.00743  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Towards the Generalization of Contrastive Self-Supervised Learning

    Authors: Weiran Huang, Mingyang Yi, Xuyang Zhao, Zihao Jiang

    Abstract: Recently, self-supervised learning has attracted great attention, since it only requires unlabeled data for model training. Contrastive learning is one popular method for self-supervised learning and has achieved promising empirical performance. However, the theoretical understanding of its generalization ability is still limited. To this end, we define a kind of $(σ,δ)$-measure to mathematically… ▽ More

    Submitted 2 March, 2023; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted by ICLR 2023

  43. arXiv:2109.11679  [pdf, other]

    stat.ML cs.LG stat.ME

    Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

    Authors: Eli Ben-Michael, D. James Greiner, Kosuke Imai, Zhichao Jiang

    Abstract: Algorithmic recommendations and decisions have become ubiquitous in today's society. Many of these data-driven policies, especially in the realm of public policy, are based on known, deterministic rules to ensure their transparency and interpretability. We examine a particular case of algorithmic pre-trial risk assessments in the US criminal justice system, which provide deterministic classificati…

    Submitted 31 March, 2025; v1 submitted 21 September, 2021; originally announced September 2021.

  44. arXiv:2106.11917  [pdf, other]

    stat.AP

    Model-based Pre-clinical Trials for Medical Devices Using Statistical Model Checking

    Authors: Haochen Yang, Jicheng Gu, Zhihao Jiang

    Abstract: Clinical trials are considered as the golden standard for medical device validation. However, many sacrifices have to be made during the design and conduction of the trials due to cost considerations and partial information, which may compromise the significance of the trial results. In this paper, we proposed a model-based pre-clinical trial framework using statistical model checking. Physiologic…

    Submitted 1 June, 2021; originally announced June 2021.

  45. arXiv:2012.02845  [pdf, other]

    cs.CY stat.AP stat.ME

    Experimental Evaluation of Algorithm-Assisted Human Decision-Making: Application to Pretrial Public Safety Assessment

    Authors: Kosuke Imai, Zhichao Jiang, James Greiner, Ryan Halen, Sooahn Shin

    Abstract: Despite an increasing reliance on fully-automated algorithmic decision-making in our day-to-day lives, human beings still make highly consequential decisions. As frequently seen in business, healthcare, and public policy, recommendations produced by algorithms are provided to human decision-makers to guide their decisions. While there exists a fast-growing literature evaluating the bias and fairne…

    Submitted 11 December, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

  46. arXiv:2012.01615  [pdf, other]

    stat.ME

    Multiply robust estimation of causal effects under principal ignorability

    Authors: Zhichao Jiang, Shu Yang, Peng Ding

    Abstract: Causal inference concerns not only the average effect of the treatment on the outcome but also the underlying mechanism through an intermediate variable of interest. Principal stratification characterizes such a mechanism by targeting subgroup causal effects within principal strata, which are defined by the joint potential values of an intermediate variable. Due to the fundamental problem of causa…

    Submitted 27 March, 2022; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: to appear in JRSSB

  47. arXiv:2011.07677  [pdf, other]

    stat.ME

    Statistical Inference and Power Analysis for Direct and Spillover Effects in Two-Stage Randomized Experiments

    Authors: Zhichao Jiang, Kosuke Imai, Anup Malani

    Abstract: Two-stage randomized experiments are becoming an increasingly popular experimental design for causal inference when the outcome of one unit may be affected by the treatment assignments of other units in the same cluster. In this paper, we provide a methodological framework for general tools of statistical inference and power analysis for two-stage randomized experiments. Under the randomization-ba…

    Submitted 20 October, 2022; v1 submitted 15 November, 2020; originally announced November 2020.

  48. arXiv:2010.11166  [pdf, other]

    cs.LG cs.DC stat.ML

    Decentralized Deep Learning using Momentum-Accelerated Consensus

    Authors: Aditya Balu, Zhanhong Jiang, Sin Yong Tan, Chinmay Hedge, Young M Lee, Soumik Sarkar

    Abstract: We consider the problem of decentralized deep learning where multiple agents collaborate to learn from a distributed dataset. While there exist several decentralized deep learning approaches, the majority consider a central parameter-server topology for aggregating the model parameters from the agents. However, such a topology may be inapplicable in networked systems such as ad-hoc mobile networks…

    Submitted 28 November, 2020; v1 submitted 21 October, 2020; originally announced October 2020.

  49. arXiv:2008.12442  [pdf, other]

    cs.LG stat.ML

    Semi-supervised Learning with the EM Algorithm: A Comparative Study between Unstructured and Structured Prediction

    Authors: Wenchong He, Zhe Jiang

    Abstract: Semi-supervised learning aims to learn prediction models from both labeled and unlabeled samples. There has been extensive research in this area. Among existing work, generative mixture models with Expectation-Maximization (EM) is a popular method due to clear statistical properties. However, existing literature on EM-based semi-supervised learning largely focuses on unstructured prediction, assum…

    Submitted 27 August, 2020; originally announced August 2020.

  50. arXiv:2008.04882  [pdf, other]

    cs.LG stat.ML

    Spatiotemporal Attention for Multivariate Time Series Prediction and Interpretation

    Authors: Tryambak Gangopadhyay, Sin Yong Tan, Zhanhong Jiang, Rui Meng, Soumik Sarkar

    Abstract: Multivariate time series modeling and prediction problems are abundant in many machine learning application domains. Accurate interpretation of such prediction outcomes from a machine learning model that explicitly captures temporal correlations can significantly benefit the domain experts. In this context, temporal attention has been successfully applied to isolate the important time steps for th…

    Submitted 26 October, 2020; v1 submitted 11 August, 2020; originally announced August 2020.