-
PCS-UQ: Uncertainty Quantification via the Predictability-Computability-Stability Framework
Authors:
Abhineet Agarwal,
Michael Xiao,
Rebecca Barter,
Omer Ronen,
Boyu Fan,
Bin Yu
Abstract:
As machine learning (ML) models are increasingly deployed in high-stakes domains, trustworthy uncertainty quantification (UQ) is critical for ensuring the safety and reliability of these models. Traditional UQ methods rely on specifying a true generative model and are not robust to misspecification. On the other hand, conformal inference allows for arbitrary ML models but does not consider model s…
▽ More
As machine learning (ML) models are increasingly deployed in high-stakes domains, trustworthy uncertainty quantification (UQ) is critical for ensuring the safety and reliability of these models. Traditional UQ methods rely on specifying a true generative model and are not robust to misspecification. On the other hand, conformal inference allows for arbitrary ML models but does not consider model selection, which leads to large interval sizes. We tackle these drawbacks by proposing a UQ method based on the predictability, computability, and stability (PCS) framework for veridical data science proposed by Yu and Kumbier. Specifically, PCS-UQ addresses model selection by using a prediction check to screen out unsuitable models. PCS-UQ then fits these screened algorithms across multiple bootstraps to assess inter-sample variability and algorithmic instability, enabling more reliable uncertainty estimates. Further, we propose a novel calibration scheme that improves local adaptivity of our prediction sets. Experiments across $17$ regression and $6$ classification datasets show that PCS-UQ achieves the desired coverage and reduces width over conformal approaches by $\approx 20\%$. Further, our local analysis shows PCS-UQ often achieves target coverage across subgroups while conformal methods fail to do so. For large deep-learning models, we propose computationally efficient approximation schemes that avoid the expensive multiple bootstrap trainings of PCS-UQ. Across three computer vision benchmarks, PCS-UQ reduces prediction set size over conformal methods by $20\%$. Theoretically, we show a modified PCS-UQ algorithm is a form of split conformal inference and achieves the desired coverage with exchangeable data.
△ Less
Submitted 13 May, 2025;
originally announced May 2025.
-
Cornering in the Water: An Investigation of Dolphin Swimming Performance
Authors:
Mingkai Xia,
Junhan Zhang,
Ningshan Wang,
Gabriel Antoniak,
Nicole West,
Ding Zhang,
Kenneth Alex Shorter
Abstract:
This article provides new insights into dolphin maneuver strategies in lap swimming tasks. However, most existing research focuses on straight-line swimming leaving the study of dolphins' corning strategies an open area. Challenges for directly analyzing dolphins' turning behavior include difficulties in motion tracking underwater and the inability to directly measure the propulsive forces. This p…
▽ More
This article provides new insights into dolphin maneuver strategies in lap swimming tasks. However, most existing research focuses on straight-line swimming leaving the study of dolphins' corning strategies an open area. Challenges for directly analyzing dolphins' turning behavior include difficulties in motion tracking underwater and the inability to directly measure the propulsive forces. This paper provides methodology and analyses of dolphins' swimming performance during lap swimming tasks. External camera detection and internal kinematics measured from wearable bio-tags are involved in this study to support accurate localization of the animals. A particle filter, which fuses the external and internal measurements, is implemented to provide accurate estimations of the trajectories, even when they swim deep below the water's surface. Thereafter, a hydrodynamic model is constructed to calculate the thrust power and energy cost of the animals. The energetic cost during lap swimming is calculated for the comparison between different corning behaviors. The results show that the implemented particle filter can provide precise and complete trajectories of the tested dolphins, providing fundamental for statistical study of the corning behavior. From the kinematic analysis, TT01 is the fastest lap swimmer, with the highest swimming speed for the whole lap while performing a sharp turn with small deceleration. TT02 performs greater energetic efficiency than TT01 by transferring more weight at high speed. TT03 shows the highest energetic efficiency by maintaining a slow underwater motion.
△ Less
Submitted 26 November, 2024;
originally announced November 2024.
-
A generalized e-value feature detection method with FDR control at multiple resolutions
Authors:
Chengyao Yu,
Ruixing Ming,
Min Xiao,
Zhanfeng Wang,
Bingyi Jing
Abstract:
Multiple resolutions occur in a range number of explanatory features due to existence of domain-specific structure, which results in groups for the features. Within this context, the simultaneous detection of significant features and groups aimed at a specific response with false discovery rate (FDR) control stands as a crucial issue, such as the spatial genome-wide association studies. Existing m…
▽ More
Multiple resolutions occur in a range number of explanatory features due to existence of domain-specific structure, which results in groups for the features. Within this context, the simultaneous detection of significant features and groups aimed at a specific response with false discovery rate (FDR) control stands as a crucial issue, such as the spatial genome-wide association studies. Existing methods typically require maintaining the same detection approach at different resolutions to achieve multilayer FDR control, which may be not efficient. For instance, it is unsuitable to apply knockoff method to detect features with high correlations, therefore, the efficiency of multilayer knockoff filter (MKF) is also not guaranteed. To tackle this problem, we introduce a novel method of derandomized flexible e-filter procedure (DFEFP) by developing generalized e-values. This method utilizes a wide variety of base detection procedures that operate effectively across various resolutions to provide stable and consistent results, while controlling the false discovery rate at multiple resolutions simultaneously. Furthermore, we investigate the statistical properties of the DFEFP, encompassing multilayer FDR control, stability guarantee, and solution correctness of algorithm. The DFEFP is initially exemplified to construct an e-value data splitting filter (eDS-filter). Subsequently, the eDS-filter in combination with the group knockoff filter (gKF) is used to develop more flexible methodology which referred to as the eDS+gKF-filter. Simulation studies demonstrate that the eDS+gKF-filter effectively controls FDR at multiple resolutions while either maintaining or enhancing power compared to MKF. The superiority of the eDS+gKF-filter is also demonstrated through the analysis of HIV mutation data.
△ Less
Submitted 21 February, 2025; v1 submitted 25 September, 2024;
originally announced September 2024.
-
A local squared Wasserstein-2 method for efficient reconstruction of models with uncertainty
Authors:
Mingtao Xia,
Qijing Shen
Abstract:
In this paper, we propose a local squared Wasserstein-2 (W_2) method to solve the inverse problem of reconstructing models with uncertain latent variables or parameters. A key advantage of our approach is that it does not require prior information on the distribution of the latent variables or parameters in the underlying models. Instead, our method can efficiently reconstruct the distributions of…
▽ More
In this paper, we propose a local squared Wasserstein-2 (W_2) method to solve the inverse problem of reconstructing models with uncertain latent variables or parameters. A key advantage of our approach is that it does not require prior information on the distribution of the latent variables or parameters in the underlying models. Instead, our method can efficiently reconstruct the distributions of the output associated with different inputs based on empirical distributions of observation data. We demonstrate the effectiveness of our proposed method across several uncertainty quantification (UQ) tasks, including linear regression with coefficient uncertainty, training neural networks with weight uncertainty, and reconstructing ordinary differential equations (ODEs) with a latent random variable.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
An efficient Wasserstein-distance approach for reconstructing jump-diffusion processes using parameterized neural networks
Authors:
Mingtao Xia,
Xiangting Li,
Qijing Shen,
Tom Chou
Abstract:
We analyze the Wasserstein distance ($W$-distance) between two probability distributions associated with two multidimensional jump-diffusion processes. Specifically, we analyze a temporally decoupled squared $W_2$-distance, which provides both upper and lower bounds associated with the discrepancies in the drift, diffusion, and jump amplitude functions between the two jump-diffusion processes. The…
▽ More
We analyze the Wasserstein distance ($W$-distance) between two probability distributions associated with two multidimensional jump-diffusion processes. Specifically, we analyze a temporally decoupled squared $W_2$-distance, which provides both upper and lower bounds associated with the discrepancies in the drift, diffusion, and jump amplitude functions between the two jump-diffusion processes. Then, we propose a temporally decoupled squared $W_2$-distance method for efficiently reconstructing unknown jump-diffusion processes from data using parameterized neural networks. We further show its performance can be enhanced by utilizing prior information on the drift function of the jump-diffusion process. The effectiveness of our proposed reconstruction method is demonstrated across several examples and applications.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Squared Wasserstein-2 Distance for Efficient Reconstruction of Stochastic Differential Equations
Authors:
Mingtao Xia,
Xiangting Li,
Qijing Shen,
Tom Chou
Abstract:
We provide an analysis of the squared Wasserstein-2 ($W_2$) distance between two probability distributions associated with two stochastic differential equations (SDEs). Based on this analysis, we propose the use of a squared $W_2$ distance-based loss functions in the \textit{reconstruction} of SDEs from noisy data. To demonstrate the practicality of our Wasserstein distance-based loss functions, w…
▽ More
We provide an analysis of the squared Wasserstein-2 ($W_2$) distance between two probability distributions associated with two stochastic differential equations (SDEs). Based on this analysis, we propose the use of a squared $W_2$ distance-based loss functions in the \textit{reconstruction} of SDEs from noisy data. To demonstrate the practicality of our Wasserstein distance-based loss functions, we performed numerical experiments that demonstrate the efficiency of our method in reconstructing SDEs that arise across a number of applications.
△ Less
Submitted 20 January, 2024;
originally announced January 2024.
-
Data-Driven Risk Measurement by SV-GARCH-EVT Model
Authors:
Minheng Xiao
Abstract:
This paper aims to more effectively manage and mitigate stock market risks by accurately characterizing financial market returns and volatility. We enhance the Stochastic Volatility (SV) model by incorporating fat-tailed distributions and leverage effects, estimating model parameters using Markov Chain Monte Carlo (MCMC) methods. By integrating extreme value theory (EVT) to fit the tail distributi…
▽ More
This paper aims to more effectively manage and mitigate stock market risks by accurately characterizing financial market returns and volatility. We enhance the Stochastic Volatility (SV) model by incorporating fat-tailed distributions and leverage effects, estimating model parameters using Markov Chain Monte Carlo (MCMC) methods. By integrating extreme value theory (EVT) to fit the tail distribution of standard residuals, we develop the SV-EVT-VaR-based dynamic model. Our empirical analysis, using daily S\&P 500 index data and simulated returns, shows that SV-EVT-based models outperform others in backtesting. These models effectively capture the fat-tailed properties of financial returns and the leverage effect, proving superior for out-of-sample data analysis.
△ Less
Submitted 27 December, 2024; v1 submitted 23 January, 2022;
originally announced January 2022.
-
Odds Ratios are far from "portable": A call to use realistic models for effect variation in meta-analysis
Authors:
Mengli Xiao,
Haitao Chu,
Stephen Cole,
Yong Chen,
Richard MacLehose,
David Richardson,
Sander Greenland
Abstract:
Objective: Recently Doi et al. argued that risk ratios should be replaced with odds ratios in clinical research. We disagreed, and empirically documented the lack of portability of odds ratios, while Doi et al. defended their position. In this response we highlight important errors in their position.
Study Design and Setting: We counter Doi et al.'s arguments by further examining the correlation…
▽ More
Objective: Recently Doi et al. argued that risk ratios should be replaced with odds ratios in clinical research. We disagreed, and empirically documented the lack of portability of odds ratios, while Doi et al. defended their position. In this response we highlight important errors in their position.
Study Design and Setting: We counter Doi et al.'s arguments by further examining the correlations of odds ratios, and risk ratios, with baseline risks in 20,198 meta-analyses from the Cochrane Database of Systematic Reviews.
Results: Doi et al.'s claim that odds ratios are portable is invalid because 1) their reasoning is circular: they assume a model under which the odds ratio is constant and show that under such a model the odds ratio is portable; 2) the method they advocate to convert odds ratios to risk ratios is biased; 3) their empirical example is readily-refuted by counter-examples of meta-analyses in which the risk ratio is portable but the odds ratio isn't; and 4) they fail to consider the causal determinants of meta-analytic inclusion criteria: Doi et al. mistakenly claim that variation in odds ratios with different baseline risks in meta-analyses is due to collider bias. Empirical comparison between the correlations of odds ratios, and risk ratios, with baseline risks show that the portability of odds ratios and risk ratios varies across settings.
Conclusion: The suggestion to replace risk ratios with odds ratios is based on circular reasoning and a confusion of mathematical and empirical results. It is especially misleading for meta-analyses and clinical guidance. Neither the odds ratio nor the risk ratio is universally portable. To address this lack of portability, we reinforce our suggestion to report variation in effect measures conditioning on modifying factors such as baseline risk; understanding such variation is essential to patient-centered practice.
△ Less
Submitted 7 June, 2021; v1 submitted 4 June, 2021;
originally announced June 2021.
-
Mining of high throughput screening database reveals AP-1 and autophagy pathways as potential targets for COVID-19 therapeutics
Authors:
Hu Zhu,
Catherine Z. Chen,
Srilatha Sakamuru,
Anton Simeonov,
Mathew D. Hall,
Menghang Xia,
Wei Zheng,
Ruili Huang
Abstract:
The recent global pandemic of Coronavirus Disease 2019 (COVID-19) caused by the new coronavirus SARS-CoV-2 presents an urgent need for new therapeutic candidates. Many efforts have been devoted to screening existing drug libraries with the hope to repurpose approved drugs as potential treatments for COVID-19. However, the antiviral mechanisms of action for the drugs found active in these phenotypi…
▽ More
The recent global pandemic of Coronavirus Disease 2019 (COVID-19) caused by the new coronavirus SARS-CoV-2 presents an urgent need for new therapeutic candidates. Many efforts have been devoted to screening existing drug libraries with the hope to repurpose approved drugs as potential treatments for COVID-19. However, the antiviral mechanisms of action for the drugs found active in these phenotypic screens are largely unknown. To deconvolute the viral targets for more effective anti-COVID-19 drug development, we mined our in-house database of approved drug screens against 994 assays and compared their activity profiles with the drug activity profile in a cytopathic effect (CPE) assay of SARS-CoV-2. We found that the autophagy and AP-1 signaling pathway activity profiles are significantly correlated with the anti-SARS-CoV-2 activity profile. In addition, a class of neurology/psychiatry drugs was found significantly enriched with anti-SARS-CoV-2 activity. Taken together, these results have provided new insights into SARS-CoV-2 infection and potential targets for COVID-19 therapeutics.
△ Less
Submitted 23 July, 2020;
originally announced July 2020.
-
Learning Based Hybrid Beamforming Design for Full-Duplex Millimeter Wave Systems
Authors:
Shaocheng Huang,
Yu Ye,
Ming Xiao
Abstract:
Millimeter Wave (mmWave) communications with full-duplex (FD) have the potential of increasing the spectral efficiency, relative to those with half-duplex. However, the residual self-interference (SI) from FD and high pathloss inherent to mmWave signals may degrade the system performance. Meanwhile, hybrid beamforming (HBF) is an efficient technology to enhance the channel gain and mitigate interf…
▽ More
Millimeter Wave (mmWave) communications with full-duplex (FD) have the potential of increasing the spectral efficiency, relative to those with half-duplex. However, the residual self-interference (SI) from FD and high pathloss inherent to mmWave signals may degrade the system performance. Meanwhile, hybrid beamforming (HBF) is an efficient technology to enhance the channel gain and mitigate interference with reasonable complexity. However, conventional HBF approaches for FD mmWave systems are based on optimization processes, which are either too complex or strongly rely on the quality of channel state information (CSI). We propose two learning schemes to design HBF for FD mmWave systems, i.e., extreme learning machine based HBF (ELM-HBF) and convolutional neural networks based HBF (CNN-HBF). Specifically, we first propose an alternating direction method of multipliers (ADMM) based algorithm to achieve SI cancellation beamforming, and then use a majorization-minimization (MM) based algorithm for joint transmitting and receiving HBF optimization. To train the learning networks, we simulate noisy channels as input, and select the hybrid beamformers calculated by proposed algorithms as targets. Results show that both learning based schemes can provide more robust HBF performance and achieve at least 22.1% higher spectral efficiency compared to orthogonal matching pursuit (OMP) algorithms. Besides, the online prediction time of proposed learning based schemes is almost 20 times faster than the OMP scheme. Furthermore, the training time of ELM-HBF is about 600 times faster than that of CNN-HBF with 64 transmitting and receiving antennas.
△ Less
Submitted 16 April, 2020;
originally announced April 2020.
-
Mobility-aware Content Preference Learning in Decentralized Caching Networks
Authors:
Yu Ye,
Ming Xiao,
Mikael Skoglund
Abstract:
Due to the drastic increase of mobile traffic, wireless caching is proposed to serve repeated requests for content download. To determine the caching scheme for decentralized caching networks, the content preference learning problem based on mobility prediction is studied. We first formulate preference prediction as a decentralized regularized multi-task learning (DRMTL) problem without considerin…
▽ More
Due to the drastic increase of mobile traffic, wireless caching is proposed to serve repeated requests for content download. To determine the caching scheme for decentralized caching networks, the content preference learning problem based on mobility prediction is studied. We first formulate preference prediction as a decentralized regularized multi-task learning (DRMTL) problem without considering the mobility of mobile terminals (MTs). The problem is solved by a hybrid Jacobian and Gauss-Seidel proximal multi-block alternating direction method (ADMM) based algorithm, which is proven to conditionally converge to the optimal solution with a rate $O(1/k)$. Then we use the tool of \textit{Markov renewal process} to predict the moving path and sojourn time for MTs, and integrate the mobility pattern with the DRMTL model by reweighting the training samples and introducing a transfer penalty in the objective. We solve the problem and prove that the developed algorithm has the same convergence property but with different conditions. Through simulation we show the convergence analysis on proposed algorithms. Our real trace driven experiments illustrate that the mobility-aware DRMTL model can provide a more accurate prediction on geography preference than DRMTL model. Besides, the hit ratio achieved by most popular proactive caching (MPC) policy with preference predicted by mobility-aware DRMTL outperforms the MPC with preference from DRMTL and random caching (RC) schemes.
△ Less
Submitted 22 August, 2019;
originally announced August 2019.
-
The FFBS Estimation of High Dimensional Panel Data Factor Stochastic Volatility Models
Authors:
Guobin Fang,
Huimin Ma,
Michelle Xia,
Bo Zhang
Abstract:
In this paper, We propose a new style panel data factor stochastic volatility model with observable factors and unobservable factors based on the multivariate stochastic volatility model, which is mainly composed of three parts, such as the mean equation, volatility equation and factor volatility evolution. The stochastic volatility equation is a 1-step forward prediction process with high dimensi…
▽ More
In this paper, We propose a new style panel data factor stochastic volatility model with observable factors and unobservable factors based on the multivariate stochastic volatility model, which is mainly composed of three parts, such as the mean equation, volatility equation and factor volatility evolution. The stochastic volatility equation is a 1-step forward prediction process with high dimensional parameters to be estimated. Using the Markov Chain Monte Carlo Simulation (MCMC) method, the Forward Filtering Backward Sampling (FFBS) algorithm of the stochastic volatility equation is mainly used to estimate the new model by Kalman Filter Recursive Algorithm (KFRA). The results of numeric simulation and latent factor estimation show that the algorithm possesses robustness and consistency for parameter estimation. This paper makes a comparative analysis of the observable and unobservable factors of internet finance and traditional financial listed companies in the Chinese stock market using the new model and its estimation method. The results show that the influence of observable factors is similar to the two types of listed companies, but the influence of unobservable factors is obviously different.
△ Less
Submitted 8 April, 2019; v1 submitted 29 January, 2019;
originally announced January 2019.