Skip to main content

Showing 1–50 of 124 results for author: Shi, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.06599  [pdf, ps, other

    cs.LG stat.ML

    Direct Prediction Set Minimization via Bilevel Conformal Classifier Training

    Authors: Yuanjie Shi, Hooman Shahrokhi, Xuesong Jia, Xiongzhi Chen, Janardhan Rao Doppa, Yan Yan

    Abstract: Conformal prediction (CP) is a promising uncertainty quantification framework which works as a wrapper around a black-box classifier to construct prediction sets (i.e., subset of candidate classes) with provable guarantees. However, standard calibration methods for CP tend to produce large prediction sets which makes them less useful in practice. This paper considers the problem of integrating con… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: Accepted for Publication at International Conference on Machine Learning (ICML), 2025

  2. arXiv:2504.20686  [pdf, ps, other

    stat.ME

    Inference of high-dimensional weak instrumental variable regression models without ridge-regularization

    Authors: Jiarong Ding, Xu Guo, Yanmei Shi, Yuxin Wang

    Abstract: Inference of instrumental variable regression models with many weak instruments attracts many attentions recently. To extend the classical Anderson-Rubin test to high-dimensional setting, many procedures adopt ridge-regularization. However, we show that it is not necessary to consider ridge-regularization. Actually we propose a new quadratic-type test statistic which does not involve tuning parame… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  3. arXiv:2504.08438  [pdf, ps, other

    cs.RO stat.ML

    Diffusion Models for Robotic Manipulation: A Survey

    Authors: Rosa Wolf, Yitian Shi, Sheng Liu, Rania Rayyes

    Abstract: Diffusion generative models have demonstrated remarkable success in visual domains such as image and video generation. They have also recently emerged as a promising approach in robotics, especially in robot manipulations. Diffusion models leverage a probabilistic framework, and they stand out with their ability to model multi-modal distributions and their robustness to high-dimensional input and… ▽ More

    Submitted 30 June, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: 28 pages, 2 figure, 9 tables

  4. arXiv:2504.06372  [pdf, other

    eess.SY stat.ME stat.ML

    A Metropolis-Adjusted Langevin Algorithm for Sampling Jeffreys Prior

    Authors: Yibo Shi, Braghadeesh Lakshminarayanan, Cristian R. Rojas

    Abstract: Inference and estimation are fundamental aspects of statistics, system identification and machine learning. For most inference problems, prior knowledge is available on the system to be modeled, and Bayesian analysis is a natural framework to impose such prior information in the form of a prior distribution. However, in many situations, coming out with a fully specified prior distribution is not e… ▽ More

    Submitted 15 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: 7 pages

  5. arXiv:2504.01432  [pdf, other

    stat.ME

    Adaptive adequacy testing of high-dimensional factor-augmented regression model

    Authors: Yanmei Shi, Leheng Cai, Xu Guo, Shurong Zheng

    Abstract: In this paper, we investigate the adequacy testing problem of high-dimensional factor-augmented regression model. Existing test procedures perform not well under dense alternatives. To address this critical issue, we introduce a novel quadratic-type test statistic which can efficiently detect dense alternative hypotheses. We further propose an adaptive test procedure to remain powerful under both… ▽ More

    Submitted 3 April, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

  6. arXiv:2503.18987  [pdf, other

    cs.LG cs.AI stat.ML

    Balanced Direction from Multifarious Choices: Arithmetic Meta-Learning for Domain Generalization

    Authors: Xiran Wang, Jian Zhang, Lei Qi, Yinghuan Shi

    Abstract: Domain generalization is proposed to address distribution shift, arising from statistical disparities between training source and unseen target domains. The widely used first-order meta-learning algorithms demonstrate strong performance for domain generalization by leveraging the gradient matching theory, which aims to establish balanced parameters across source domains to reduce overfitting to an… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

  7. arXiv:2503.17638  [pdf, other

    stat.AP

    Collective Wisdom: Policy Averaging with an Application to the Newsvendor Problem

    Authors: Xiangyu Cui, Nicholas G. Hall, Yun Shi, Tianyuan Su

    Abstract: We propose a Policy Averaging Approach (PAA) that synthesizes the strengths of existing approaches to create more reliable, flexible and justifiable policies for stochastic optimization problems. An important component of the PAA is risk diversification to reduce the randomness of policies. A second component emulates model averaging from statistics. A third component involves using cross-validati… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  8. arXiv:2503.00530  [pdf, other

    stat.ML cs.LG

    Trajectory Inference with Smooth Schrödinger Bridges

    Authors: Wanli Hong, Yuliang Shi, Jonathan Niles-Weed

    Abstract: Motivated by applications in trajectory inference and particle tracking, we introduce Smooth Schrödinger Bridges. Our proposal generalizes prior work by allowing the reference process in the Schrödinger Bridge problem to be a smooth Gaussian process, leading to more regular and interpretable trajectories in applications. Though naïvely smoothing the reference process leads to a computationally int… ▽ More

    Submitted 1 March, 2025; originally announced March 2025.

  9. arXiv:2501.18501  [pdf, other

    stat.ML cs.AI cs.LG

    Beyond Prior Limits: Addressing Distribution Misalignment in Particle Filtering

    Authors: Yiwei Shi, Jingyu Hu, Yu Zhang, Mengyue Yang, Weinan Zhang, Cunjia Liu, Weiru Liu

    Abstract: Particle filtering is a Bayesian inference method and a fundamental tool in state estimation for dynamic systems, but its effectiveness is often limited by the constraints of the initial prior distribution, a phenomenon we define as the Prior Boundary Phenomenon. This challenge arises when target states lie outside the prior's support, rendering traditional particle filtering methods inadequate fo… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  10. arXiv:2501.06529  [pdf, other

    stat.ME

    Estimation and inference of high-dimensional partially linear regression models with latent factors

    Authors: Yanmei Shi, Meiling Hao, Yanlin Tang, Xu Guo

    Abstract: In this paper, we introduce a novel high-dimensional Factor-Adjusted sparse Partially Linear regression Model (FAPLM), to integrate the linear effects of high-dimensional latent factors with the nonparametric effects of low-dimensional covariates. The proposed FAPLM combines the interpretability of linear models, the flexibility of nonparametric models, with the ability to effectively capture the… ▽ More

    Submitted 11 January, 2025; originally announced January 2025.

  11. arXiv:2501.02489  [pdf, other

    stat.ME math.ST

    High-dimensional inference for single-index model with latent factors

    Authors: Yanmei Shi, Meiling Hao, Yanlin Tang, Heng Lian, Xu Guo

    Abstract: Models with latent factors recently attract a lot of attention. However, most investigations focus on linear regression models and thus cannot capture nonlinearity. To address this issue, we propose a novel Factor Augmented Single-Index Model. We first address the concern whether it is necessary to consider the augmented part by introducing a score-type test statistic. Compared with previous test… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

  12. arXiv:2411.19908  [pdf, ps, other

    stat.ML cs.LG

    Another look at inference after prediction

    Authors: Jessica Gronsbell, Jianhui Gao, Yaqi Shi, Zachary R. McCaw, David Cheng

    Abstract: From structural biology to epidemiology, predictions from machine learning (ML) models are increasingly used to complement costly gold-standard data to enable faster, more affordable, and scalable scientific inquiry. In response, prediction-based (PB) inference has emerged to accommodate statistical analysis using a large volume of predictions together with a small amount of gold-standard data. Th… ▽ More

    Submitted 8 June, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

  13. arXiv:2410.04667  [pdf, other

    stat.ME stat.AP

    A Finite Mixture Hidden Markov Model for Intermittently Observed Disease Process with Heterogeneity and Partially Known Disease Type

    Authors: Yidan Shi, Leilei Zeng, Mary E. Thompson, Suzanne L. Tyas

    Abstract: Continuous-time multistate models are widely used for analyzing interval-censored data on disease progression over time. Sometimes, diseases manifest differently and what appears to be a coherent collection of symptoms is the expression of multiple distinct disease subtypes. To address this complexity, we propose a mixture hidden Markov model, where the observation process encompasses states repre… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 27 pages, 4 figures, 6 tables

  14. arXiv:2409.13655  [pdf, other

    cs.LG stat.AP

    Adaptive Mixture Importance Sampling for Automated Ads Auction Tuning

    Authors: Yimeng Jia, Kaushal Paneri, Rong Huang, Kailash Singh Maurya, Pavan Mallapragada, Yifan Shi

    Abstract: This paper introduces Adaptive Mixture Importance Sampling (AMIS) as a novel approach for optimizing key performance indicators (KPIs) in large-scale recommender systems, such as online ad auctions. Traditional importance sampling (IS) methods face challenges in dynamic environments, particularly in navigating through complexities of multi-modal landscapes and avoiding entrapment in local optima f… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: Accepted at the CONSEQUENCES '24 workshop, co-located with ACM RecSys '24

    MSC Class: 68T05; 65C05; 68Q87 ACM Class: G.3; I.2.6; I.6.8

  15. arXiv:2407.02681  [pdf, other

    cs.LG eess.IV math.OC stat.ML

    Uniform Transformation: Refining Latent Representation in Variational Autoencoders

    Authors: Ye Shi, C. S. George Lee

    Abstract: Irregular distribution in latent space causes posterior collapse, misalignment between posterior and prior, and ill-sampling problem in Variational Autoencoders (VAEs). In this paper, we introduce a novel adaptable three-stage Uniform Transformation (UT) module -- Gaussian Kernel Density Estimation (G-KDE) clustering, non-parametric Gaussian Mixture (GM) Modeling, and Probability Integral Transfor… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted by 2024 IEEE 20th International Conference on Automation Science and Engineering

  16. arXiv:2406.12171  [pdf, other

    stat.ME stat.AP

    Model Selection for Causal Modeling in Missing Exposure Problems

    Authors: Yuliang Shi, Yeying Zhu, Joel A. Dubin

    Abstract: In causal inference, properly selecting the propensity score (PS) model is an important topic and has been widely investigated in observational studies. There is also a large literature focusing on the missing data problem. However, there are very few studies investigating the model selection issue for causal inference when the exposure is missing at random (MAR). In this paper, we discuss how to… ▽ More

    Submitted 13 December, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  17. arXiv:2406.08668  [pdf, other

    stat.ME

    Causal Inference on Missing Exposure via Robust Estimation

    Authors: Yuliang Shi, Yeying Zhu, Joel A. Dubin

    Abstract: How to deal with missing data in observational studies is a common concern for causal inference. When the covariates are missing at random (MAR), multiple approaches have been provided to help solve the issue. However, if the exposure is MAR, few approaches are available and careful adjustments on both missingness and confounding issues are required to ensure a consistent estimate of the true caus… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  18. arXiv:2404.02986  [pdf, other

    cs.LG stat.ML

    Universal Functional Regression with Neural Operator Flows

    Authors: Yaozhong Shi, Angela F. Gao, Zachary E. Ross, Kamyar Azizzadenesheli

    Abstract: Regression on function spaces is typically limited to models with Gaussian process priors. We introduce the notion of universal functional regression, in which we aim to learn a prior distribution over non-Gaussian function spaces that remains mathematically tractable for functional regression. To do this, we develop Neural Operator Flows (OpFlow), an infinite-dimensional extension of normalizing… ▽ More

    Submitted 26 November, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  19. arXiv:2310.09583  [pdf, other

    cs.LG stat.ML

    Two Sides of The Same Coin: Bridging Deep Equilibrium Models and Neural ODEs via Homotopy Continuation

    Authors: Shutong Ding, Tianyu Cui, Jingya Wang, Ye Shi

    Abstract: Deep Equilibrium Models (DEQs) and Neural Ordinary Differential Equations (Neural ODEs) are two branches of implicit models that have achieved remarkable success owing to their superior performance and low memory consumption. While both are implicit models, DEQs and Neural ODEs are derived from different mathematical formulations. Inspired by homotopy continuation, we establish a connection betwee… ▽ More

    Submitted 21 December, 2023; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS2023

  20. arXiv:2310.04919  [pdf, other

    stat.ME cs.LG stat.ML

    The Conditional Prediction Function: A Novel Technique to Control False Discovery Rate for Complex Models

    Authors: Yushu Shi, Michael Martens

    Abstract: In modern scientific research, the objective is often to identify which variables are associated with an outcome among a large class of potential predictors. This goal can be achieved by selecting variables in a manner that controls the the false discovery rate (FDR), the proportion of irrelevant predictors among the selections. Knockoff filtering is a cutting-edge approach to variable selection t… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  21. arXiv:2309.09831  [pdf, other

    math.ST stat.ML

    Pivotal Estimation of Linear Discriminant Analysis in High Dimensions

    Authors: Ethan X. Fang, Yajun Mei, Yuyang Shi, Qunzhi Xu, Tuo Zhao

    Abstract: We consider the linear discriminant analysis problem in the high-dimensional settings. In this work, we propose PANDA(PivotAl liNear Discriminant Analysis), a tuning-insensitive method in the sense that it requires very little effort to tune the parameters. Moreover, we prove that PANDA achieves the optimal convergence rate in terms of both the estimation error and misclassification rate. Our theo… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  22. arXiv:2309.08109  [pdf, other

    stat.ME

    CAT: a conditional association test for microbiome data using a leave-out approach

    Authors: Yushu Shi, Liangliang Zhang, Kim-Anh Do, Robert R. Jenq, Christine B. Peterson

    Abstract: In microbiome analysis, researchers often seek to identify taxonomic features associated with an outcome of interest. However, microbiome features are intercorrelated and linked by phylogenetic relationships, making it challenging to assess the association between an individual feature and an outcome. Researchers have developed global tests for the association of microbiome profiles with outcomes… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  23. arXiv:2308.13737  [pdf, other

    stat.AP

    survivalContour: Visualizing predicted survival via colored contour plots

    Authors: Yushu Shi, Liangliang Zhang, Kim-Anh Do, Robert R. Jenq, Christine B. Peterson

    Abstract: Advances in survival analysis have facilitated unprecedented flexibility in data modeling, yet there remains a lack of tools for graphically illustrating the influence of continuous covariates on predicted survival outcomes. We propose the utilization of a colored contour plot to depict the predicted survival probabilities over time, and provide a Shiny app and R package as implementations of this… ▽ More

    Submitted 12 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

  24. arXiv:2308.13298  [pdf, other

    cs.LG eess.SP stat.ML

    Federated Linear Bandit Learning via Over-the-Air Computation

    Authors: Jiali Wang, Yuning Jiang, Xin Liu, Ting Wang, Yuanming Shi

    Abstract: In this paper, we investigate federated contextual linear bandit learning within a wireless system that comprises a server and multiple devices. Each device interacts with the environment, selects an action based on the received reward, and sends model updates to the server. The primary objective is to minimize cumulative regret across all devices within a finite time horizon. To reduce the commun… ▽ More

    Submitted 28 August, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

  25. arXiv:2308.12016  [pdf, ps, other

    stat.ML cs.LG

    MKL-$L_{0/1}$-SVM

    Authors: Bin Zhu, Yijie Shi

    Abstract: This paper presents a Multiple Kernel Learning (abbreviated as MKL) framework for the Support Vector Machine (SVM) with the $(0, 1)$ loss function. Some KKT-like first-order optimality conditions are provided and then exploited to develop a fast ADMM algorithm to solve the nonsmooth nonconvex optimization problem. Numerical experiments on real data sets show that the performance of our MKL-… ▽ More

    Submitted 3 September, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 26 pages in the JMLR template, 3 figures, and 2 tables, submitted to the Journal of Machine Learning Research, with minor text overlap with arXiv: 2303.04445 (conference version). arXiv admin note: text overlap with arXiv:2303.04445

  26. arXiv:2307.16360  [pdf, other

    cs.LG stat.ML

    Probabilistically robust conformal prediction

    Authors: Subhankar Ghosh, Yuanjie Shi, Taha Belkhouja, Yan Yan, Jana Doppa, Brian Jones

    Abstract: Conformal prediction (CP) is a framework to quantify uncertainty of machine learning classifiers including deep neural networks. Given a testing example and a trained classifier, CP produces a prediction set of candidate labels with a user-specified coverage (i.e., true class label is contained with high probability). Almost all the existing work on CP assumes clean testing data and there is not m… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, 2023

    Journal ref: Uncertainty in Artificial Intelligence. PMLR 216:681-690, 2023

  27. arXiv:2303.16852  [pdf, other

    stat.ML cs.LG

    Diffusion Schrödinger Bridge Matching

    Authors: Yuyang Shi, Valentin De Bortoli, Andrew Campbell, Arnaud Doucet

    Abstract: Solving transport problems, i.e. finding a map transporting one given distribution to another, has numerous applications in machine learning. Novel mass transport methods motivated by generative modeling have recently been proposed, e.g. Denoising Diffusion Models (DDMs) and Flow Matching Models (FMMs) implement such a transport through a Stochastic Differential Equation (SDE) or an Ordinary Diffe… ▽ More

    Submitted 11 December, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

  28. arXiv:2303.04445  [pdf, ps, other

    stat.ML cs.LG

    An ADMM Solver for the MKL-$L_{0/1}$-SVM

    Authors: Yijie Shi, Bin Zhu

    Abstract: We formulate the Multiple Kernel Learning (abbreviated as MKL) problem for the support vector machine with the infamous $(0,1)$-loss function. Some first-order optimality conditions are given and then exploited to develop a fast ADMM solver for the nonconvex and nonsmooth optimization problem. A simple numerical experiment on synthetic planar data shows that our MKL-$L_{0/1}$-SVM framework could b… ▽ More

    Submitted 30 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 2 tables. Submitted to the 62nd IEEE Conference on Decision and Control as a Regular paper, with a shortened version (arXiv version 1) submitted to the 3rd Chinese Conference on Predictive Control and Intelligent Decision (CPCID) as an Extended Abstract

  29. Bayesian Methods in Tensor Analysis

    Authors: Yiyao Shi, Weining Shen

    Abstract: Tensors, also known as multidimensional arrays, are useful data structures in machine learning and statistics. In recent years, Bayesian methods have emerged as a popular direction for analyzing tensor-valued data since they provide a convenient way to introduce sparsity into the model and conduct uncertainty quantification. In this article, we provide an overview of frequentist and Bayesian metho… ▽ More

    Submitted 5 June, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: 32 pages, 8 figures, 2 tables

    Journal ref: Statistics and Its Interface, Vol. 17, No. 2 (2024), pp. 249-274

  30. arXiv:2211.04725  [pdf, other

    stat.ME

    Single Parameter Inference of Non-sparse Logistic Regression Models

    Authors: Yanmei Shi, QiZhang

    Abstract: This paper infers a single parameter in non-sparse logistic regression models. By transforming the null hypothesis into a moment condition, we construct the test statistic and obtain the asymptotic null distribution. Numerical experiments show that our method performs well.

    Submitted 9 November, 2022; originally announced November 2022.

  31. arXiv:2211.03595  [pdf, other

    stat.ML cs.LG

    From Denoising Diffusions to Denoising Markov Models

    Authors: Joe Benton, Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet

    Abstract: Denoising diffusions are state-of-the-art generative models exhibiting remarkable empirical performance. They work by diffusing the data distribution into a Gaussian distribution and then learning to reverse this noising process to obtain synthetic datapoints. The denoising diffusion relies on approximations of the logarithmic derivatives of the noised data densities using score matching. Such mod… ▽ More

    Submitted 18 February, 2024; v1 submitted 7 November, 2022; originally announced November 2022.

  32. arXiv:2210.06226  [pdf, other

    stat.ML cs.LG

    Alpha-divergence Variational Inference Meets Importance Weighted Auto-Encoders: Methodology and Asymptotics

    Authors: Kamélia Daudel, Joe Benton, Yuyang Shi, Arnaud Doucet

    Abstract: Several algorithms involving the Variational Rényi (VR) bound have been proposed to minimize an alpha-divergence between a target posterior distribution and a variational distribution. Despite promising empirical results, those algorithms resort to biased stochastic gradient descent procedures and thus lack theoretical guarantees. In this paper, we formalize and study the VR-IWAE bound, a generali… ▽ More

    Submitted 19 July, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  33. arXiv:2210.04268  [pdf, other

    stat.ME math.ST

    A Locally Adaptive Shrinkage Approach to False Selection Rate Control in High-Dimensional Classification

    Authors: Bowen Gang, Yuantao Shi, Wenguang Sun

    Abstract: The uncertainty quantification and error control of classifiers are crucial in many high-consequence decision-making scenarios. We propose a selective classification framework that provides an indecision option for any observations that cannot be classified with confidence. The false selection rate (FSR), defined as the expected fraction of erroneous classifications among all definitive classifica… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  34. arXiv:2206.08994  [pdf, other

    stat.ML cs.CV cs.LG math.NA

    Robust Group Synchronization via Quadratic Programming

    Authors: Yunpeng Shi, Cole Wyeth, Gilad Lerman

    Abstract: We propose a novel quadratic programming formulation for estimating the corruption levels in group synchronization, and use these estimates to solve this problem. Our objective function exploits the cycle consistency of the group and we thus refer to our method as detection and estimation of structural consistency (DESC). This general framework can be extended to other algebraic and geometric stru… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to ICML 2022

    MSC Class: 90C26; 90C17; 68Q87; 65C20; 90-08; 60-08 ACM Class: G.1.6; I.4.0

  35. arXiv:2206.08871  [pdf, other

    cs.LG stat.ML

    How Robust is Unsupervised Representation Learning to Distribution Shift?

    Authors: Yuge Shi, Imant Daunhawer, Julia E. Vogt, Philip H. S. Torr, Amartya Sanyal

    Abstract: The robustness of machine learning algorithms to distributions shift is primarily discussed in the context of supervised learning (SL). As such, there is a lack of insight on the robustness of the representations learned from unsupervised methods, such as self-supervised learning (SSL) and auto-encoder based algorithms (AE), to distribution shift. We posit that the input-driven objectives of unsup… ▽ More

    Submitted 16 December, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  36. arXiv:2206.01704  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems

    Authors: Sahin Lale, Yuanyuan Shi, Guannan Qu, Kamyar Azizzadenesheli, Adam Wierman, Anima Anandkumar

    Abstract: Learning a dynamical system requires stabilizing the unknown dynamics to avoid state blow-ups. However, current reinforcement learning (RL) methods lack stabilization guarantees, which limits their applicability for the control of safety-critical systems. We propose a model-based RL framework with formal stability guarantees, Krasovskii Constrained RL (KCRL), that adopts Krasovskii's family of Lya… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  37. arXiv:2203.16505  [pdf, other

    cs.CV math.NA stat.ML

    Fast, Accurate and Memory-Efficient Partial Permutation Synchronization

    Authors: Shaohan Li, Yunpeng Shi, Gilad Lerman

    Abstract: Previous partial permutation synchronization (PPS) algorithms, which are commonly used for multi-object matching, often involve computation-intensive and memory-demanding matrix operations. These operations become intractable for large scale structure-from-motion datasets. For pure permutation synchronization, the recent Cycle-Edge Message Passing (CEMP) framework suggests a memory-efficient and f… ▽ More

    Submitted 31 March, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

    MSC Class: 90C26; 90C10; 90C17; 68Q87; 65C20

  38. arXiv:2202.13460  [pdf, other

    stat.ML cs.LG

    Conditional Simulation Using Diffusion Schrödinger Bridges

    Authors: Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet

    Abstract: Denoising diffusion models have recently emerged as a powerful class of generative models. They provide state-of-the-art results, not only for unconditional simulation, but also when used to solve conditional simulation problems arising in a wide range of inverse problems. A limitation of these models is that they are computationally intensive at generation time as they require simulating a diffus… ▽ More

    Submitted 26 June, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: 29 pages, 15 figures. UAI 2022 camera-ready version

  39. arXiv:2202.11455  [pdf, other

    cs.LG cs.CV math.ST stat.ML

    On PAC-Bayesian reconstruction guarantees for VAEs

    Authors: Badr-Eddine Chérief-Abdellatif, Yuyang Shi, Arnaud Doucet, Benjamin Guedj

    Abstract: Despite its wide use and empirical successes, the theoretical understanding and study of the behaviour and performance of the variational autoencoder (VAE) have only emerged in the past few years. We contribute to this recent line of work by analysing the VAE's reconstruction ability for unseen test data, leveraging arguments from the PAC-Bayes theory. We provide generalisation bounds on the theor… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 14 pages

    Journal ref: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022, Valencia, Spain. PMLR: Volume 151

  40. arXiv:2202.06383  [pdf, other

    cs.LG stat.AP

    Surgical Scheduling via Optimization and Machine Learning with Long-Tailed Data

    Authors: Yuan Shi, Saied Mahdian, Jose Blanchet, Peter Glynn, Andrew Y. Shin, David Scheinker

    Abstract: Using data from cardiovascular surgery patients with long and highly variable post-surgical lengths of stay (LOS), we develop a modeling framework to reduce recovery unit congestion. We estimate the LOS and its probability distribution using machine learning models, schedule procedures on a rolling basis using a variety of optimization models, and estimate performance with simulation. The machine… ▽ More

    Submitted 28 November, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

  41. arXiv:2112.07746  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning

    Authors: Kevin Huang, Sahin Lale, Ugo Rosolia, Yuanyuan Shi, Anima Anandkumar

    Abstract: Current state-of-the-art model-based reinforcement learning algorithms use trajectory sampling methods, such as the Cross-Entropy Method (CEM), for planning in continuous control settings. These zeroth-order optimizers require sampling a large number of trajectory rollouts to select an optimal action, which scales poorly for large prediction horizons or high dimensional action spaces. First-order… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  42. arXiv:2111.06985  [pdf, other

    stat.ME

    Nonparametric Bayesian Knockoff Generators for Feature Selection Under Complex Data Structure

    Authors: Michael J. Martens, Anjishnu Banerjee, Xinran Qi, Yushu Shi

    Abstract: The recent proliferation of high-dimensional data, such as electronic health records and genetics data, offers new opportunities to find novel predictors of outcomes. Presented with a large set of candidate features, interest often lies in selecting the ones most likely to be predictive of an outcome for further study. Controlling the false discovery rate (FDR) at a specified level is often desire… ▽ More

    Submitted 22 September, 2024; v1 submitted 12 November, 2021; originally announced November 2021.

  43. arXiv:2111.01395  [pdf, other

    cs.LG cs.CR stat.ML

    Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds

    Authors: Yujia Huang, Huan Zhang, Yuanyuan Shi, J Zico Kolter, Anima Anandkumar

    Abstract: Certified robustness is a desirable property for deep neural networks in safety-critical applications, and popular training algorithms can certify robustness of a neural network by computing a global bound on its Lipschitz constant. However, such a bound is often loose: it tends to over-regularize the neural network and degrade its natural accuracy. A tighter Lipschitz bound may provide a better t… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021

  44. arXiv:2110.13549  [pdf, other

    stat.ML cs.LG stat.CO

    Online Variational Filtering and Parameter Learning

    Authors: Andrew Campbell, Yuyang Shi, Tom Rainforth, Arnaud Doucet

    Abstract: We present a variational method for online state estimation and parameter learning in state-space models (SSMs), a ubiquitous class of latent variable models for sequential data. As per standard batch variational techniques, we use stochastic gradients to simultaneously optimize a lower bound on the log evidence with respect to both model parameters and a variational approximation of the states' p… ▽ More

    Submitted 14 June, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: 27 pages, 6 figures. NeurIPS 2021 (Oral); updated references

  45. arXiv:2110.07818  [pdf, other

    q-bio.QM stat.ME

    A novel framework to quantify uncertainty in peptide-tandem mass spectrum matches with application to nanobody peptide identification

    Authors: Chris McKennan, Zhe Sang, Yi Shi

    Abstract: Nanobodies are small antibody fragments derived from camelids that selectively bind to antigens. These proteins have marked physicochemical properties that support advanced therapeutics, including treatments for SARS-CoV-2. To realize their potential, bottom-up proteomics via liquid chromatography-tandem mass spectrometry (LC-MS/MS) has been proposed to identify antigen-specific nanobodies at the… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: 19 pages, 7 figures in the main text; 59 pages, 15 figures including supplement

  46. arXiv:2109.08139  [pdf, ps, other

    eess.SP cs.LG cs.NI stat.ML

    Adversarial Attacks against Deep Learning Based Power Control in Wireless Communications

    Authors: Brian Kim, Yi Shi, Yalin E. Sagduyu, Tugba Erpek, Sennur Ulukus

    Abstract: We consider adversarial machine learning based attacks on power allocation where the base station (BS) allocates its transmit power to multiple orthogonal subcarriers by using a deep neural network (DNN) to serve multiple user equipments (UEs). The DNN that corresponds to a regression model is trained with channel gains as the input and returns transmit powers as the output. While the BS allocates… ▽ More

    Submitted 12 October, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

  47. arXiv:2104.12953  [pdf, other

    cs.LG cs.AI stat.ML

    Exploring Uncertainty in Deep Learning for Construction of Prediction Intervals

    Authors: Yuandu Lai, Yucheng Shi, Yahong Han, Yunfeng Shao, Meiyu Qi, Bingshuai Li

    Abstract: Deep learning has achieved impressive performance on many tasks in recent years. However, it has been found that it is still not enough for deep neural networks to provide only point estimates. For high-risk tasks, we need to assess the reliability of the model predictions. This requires us to quantify the uncertainty of model prediction and construct prediction intervals. In this paper, We explor… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

  48. arXiv:2104.09937  [pdf, other

    cs.LG stat.ML

    Gradient Matching for Domain Generalization

    Authors: Yuge Shi, Jeffrey Seely, Philip H. S. Torr, N. Siddharth, Awni Hannun, Nicolas Usunier, Gabriel Synnaeve

    Abstract: Machine learning systems typically assume that the distributions of training and test sets match closely. However, a critical requirement of such systems in the real world is their ability to generalize to unseen domains. Here, we propose an inter-domain gradient matching objective that targets domain generalization by maximizing the inner product between gradients from different domains. Since di… ▽ More

    Submitted 13 July, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

  49. arXiv:2103.13221  [pdf, other

    stat.ME

    Mixed Effects Envelope Models

    Authors: Yuyang Shi, Linquan Ma, Lan Liu

    Abstract: When multiple measures are collected repeatedly over time, redundancy typically exists among responses. The envelope method was recently proposed to reduce the dimension of responses without loss of information in regression with multivariate responses. It can gain substantial efficiency over the standard least squares estimator. In this paper, we generalize the envelope method to mixed effects mo… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

  50. arXiv:2011.04868  [pdf, other

    cs.LG math.OC stat.ML

    Neural Network Compression Via Sparse Optimization

    Authors: Tianyi Chen, Bo Ji, Yixin Shi, Tianyu Ding, Biyi Fang, Sheng Yi, Xiao Tu

    Abstract: The compression of deep neural networks (DNNs) to reduce inference cost becomes increasingly important to meet realistic deployment requirements of various applications. There have been a significant amount of work regarding network compression, while most of them are heuristic rule-based or typically not friendly to be incorporated into varying scenarios. On the other hand, sparse optimization yi… ▽ More

    Submitted 11 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.