Skip to main content

Showing 1–9 of 9 results for author: Xia, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.21199  [pdf, other

    stat.ML cs.CR cs.LG

    Generate-then-Verify: Reconstructing Data from Limited Published Statistics

    Authors: Terrance Liu, Eileen Xiao, Pratiksha Thaker, Adam Smith, Zhiwei Steven Wu

    Abstract: We study the problem of reconstructing tabular data from aggregate statistics, in which the attacker aims to identify interesting claims about the sensitive data that can be verified with 100% certainty given the aggregates. Successful attempts in prior work have conducted studies in settings where the set of published statistics is rich enough that entire datasets can be reconstructed with certai… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  2. arXiv:2412.09482  [pdf, other

    stat.ME

    Inference under Staggered Adoption: Case Study of the Affordable Care Act

    Authors: Eric Xia, Yuling Yan, Martin J. Wainwright

    Abstract: Panel data consists of a collection of $N$ units that are observed over $T$ units of time. A policy or treatment is subject to staggered adoption if different units take on treatment at different times and remains treated (or never at all). Assessing the effectiveness of such a policy requires estimating the treatment effect, corresponding to the difference between outcomes for treated versus untr… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

  3. arXiv:2410.02015  [pdf, other

    math.ST stat.ME

    Instrumental variables: A non-asymptotic viewpoint

    Authors: Eric Xia, Martin J. Wainwright, Whitney Newey

    Abstract: We provide a non-asymptotic analysis of the linear instrumental variable estimator allowing for the presence of exogeneous covariates. In addition, we introduce a novel measure of the strength of an instrument that can be used to derive non-asymptotic confidence intervals. For strong instruments, these non-asymptotic intervals match the asymptotic ones exactly up to higher order corrections; for w… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  4. arXiv:2210.11377  [pdf, other

    stat.ML cs.LG math.OC math.ST

    Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces

    Authors: Eric Xia, Martin J. Wainwright

    Abstract: We present and analyze the Krylov-Bellman Boosting (KBB) algorithm for policy evaluation in general state spaces. It alternates between fitting the Bellman residual using non-parametric regression (as in boosting), and estimating the value function via the least-squares temporal difference (LSTD) procedure applied with a feature set that grows adaptively over time. By exploiting the connection to… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 40 pages, 7 figures

  5. arXiv:2201.08536  [pdf, other

    stat.ML cs.LG

    Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

    Authors: Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

    Abstract: Various algorithms for reinforcement learning (RL) exhibit dramatic variation in their convergence rates as a function of problem structure. Such problem-dependent behavior is not captured by worst-case analyses and has accordingly inspired a growing effort in obtaining instance-dependent guarantees and deriving instance-optimal algorithms for RL problems. This research has been carried out, howev… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  6. arXiv:2106.14352  [pdf, other

    stat.ML cs.LG

    Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

    Authors: Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

    Abstract: Various algorithms in reinforcement learning exhibit dramatic variability in their convergence rates and ultimate accuracy as a function of the problem structure. Such instance-specific behavior is not captured by existing global minimax bounds, which are worst-case in nature. We analyze the problem of estimating optimal $Q$-value functions for a discounted Markov decision process with discrete st… ▽ More

    Submitted 27 June, 2021; originally announced June 2021.

  7. arXiv:1905.09959  [pdf, other

    stat.ML cs.LG math.ST

    Posterior Distribution for the Number of Clusters in Dirichlet Process Mixture Models

    Authors: Chiao-Yu Yang, Eric Xia, Nhat Ho, Michael I. Jordan

    Abstract: Dirichlet process mixture models (DPMM) play a central role in Bayesian nonparametrics, with applications throughout statistics and machine learning. DPMMs are generally used in clustering problems where the number of clusters is not known in advance, and the posterior distribution is treated as providing inference for this number. Recently, however, it has been shown that the DPMM is inconsistent… ▽ More

    Submitted 18 October, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

    MSC Class: 62C10; 62G20; 62G99

  8. arXiv:1903.00197  [pdf

    q-bio.QM cs.LG stat.ML

    Outcome-Driven Clustering of Acute Coronary Syndrome Patients using Multi-Task Neural Network with Attention

    Authors: Eryu Xia, Xin Du, Jing Mei, Wen Sun, Suijun Tong, Zhiqing Kang, Jian Sheng, Jian Li, Changsheng Ma, Jianzeng Dong, Shaochun Li

    Abstract: Cluster analysis aims at separating patients into phenotypically heterogenous groups and defining therapeutically homogeneous patient subclasses. It is an important approach in data-driven disease classification and subtyping. Acute coronary syndrome (ACS) is a syndrome due to sudden decrease of coronary artery blood flow, where disease classification would help to inform therapeutic strategies an… ▽ More

    Submitted 27 March, 2019; v1 submitted 1 March, 2019; originally announced March 2019.

  9. arXiv:1707.09706  [pdf

    cs.AI stat.AP

    Developing Knowledge-enhanced Chronic Disease Risk Prediction Models from Regional EHR Repositories

    Authors: Jing Mei, Eryu Xia, Xiang Li, Guotong Xie

    Abstract: Precision medicine requires the precision disease risk prediction models. In literature, there have been a lot well-established (inter-)national risk models, but when applying them into the local population, the prediction performance becomes unsatisfactory. To address the localization issue, this paper exploits the way to develop knowledge-enhanced localized risk models. On the one hand, we tune… ▽ More

    Submitted 30 July, 2017; originally announced July 2017.