Skip to main content

Showing 1–13 of 13 results for author: Carone, M

Searching in archive math. Search in all archives.
.
  1. arXiv:2411.09017  [pdf, other

    stat.ME math.ST

    Debiased machine learning for counterfactual survival functionals based on left-truncated right-censored data

    Authors: Eric R. Morenz, Charles J. Wolock, Marco Carone

    Abstract: Learning causal effects of a binary exposure on time-to-event endpoints can be challenging because survival times may be partially observed due to censoring and systematically biased due to truncation. In this work, we present debiased machine learning-based nonparametric estimators of the joint distribution of a counterfactual survival time and baseline covariates for use when the observed data a… ▽ More

    Submitted 13 November, 2024; originally announced November 2024.

    Comments: The first two authors contributed equally to this work. 61 pages (36 main text, 25 supplement). 6 figures (6 main text, 0 supplement)

  2. arXiv:2411.02771  [pdf, ps, other

    stat.ME math.ST stat.ML

    Doubly robust inference via calibration

    Authors: Lars van der Laan, Alex Luedtke, Marco Carone

    Abstract: Doubly robust estimators are widely used for estimating average treatment effects and other linear summaries of regression functions. While consistency requires only one of two nuisance functions to be estimated consistently, asymptotic normality typically require sufficiently fast convergence of both. In this work, we correct this mismatch: we show that calibrating the nuisance estimators within… ▽ More

    Submitted 27 June, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

  3. arXiv:2409.19230  [pdf, other

    stat.ME math.ST

    Propensity Score Augmentation in Matching-based Estimation of Causal Effects

    Authors: Ernesto Ulloa-Pérez, Marco Carone, Alex Luedtke

    Abstract: When assessing the causal effect of a binary exposure using observational data, confounder imbalance across exposure arms must be addressed. Matching methods, including propensity score-based matching, can be used to deconfound the causal relationship of interest. They have been particularly popular in practice, at least in part due to their simplicity and interpretability. However, these methods… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

  4. arXiv:2409.09973  [pdf, other

    math.ST stat.ME stat.ML

    Towards a Unified Theory for Semiparametric Data Fusion with Individual-Level Data

    Authors: Ellen Graham, Marco Carone, Andrea Rotnitzky

    Abstract: We address the goal of conducting inference about a smooth finite-dimensional parameter by utilizing individual-level data from various independent sources. Recent advancements have led to the development of a comprehensive theory capable of handling scenarios where different data sources align with, possibly distinct subsets of, conditional distributions of a single factorization of the joint tar… ▽ More

    Submitted 24 February, 2025; v1 submitted 16 September, 2024; originally announced September 2024.

    Comments: 122 pages. Updated to simplify notation and include a supplemental section discussing the relationship between this work and arXiv:2111.14945

  5. arXiv:2307.12544  [pdf, other

    stat.ME math.ST stat.ML

    Adaptive debiased machine learning using data-driven model selection techniques

    Authors: Lars van der Laan, Marco Carone, Alex Luedtke, Mark van der Laan

    Abstract: Debiased machine learning estimators for nonparametric inference of smooth functionals of the data-generating distribution can suffer from excessive variability and instability. For this reason, practitioners may resort to simpler models based on parametric or semiparametric assumptions. However, such simplifying assumptions may fail to hold, and estimates may then be biased due to model misspecif… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 32 pages + appendix

  6. arXiv:2004.03683  [pdf, other

    stat.ME math.ST stat.ML

    A general framework for inference on algorithm-agnostic variable importance

    Authors: Brian D. Williamson, Peter B. Gilbert, Noah R. Simon, Marco Carone

    Abstract: In many applications, it is of interest to assess the relative contribution of features (or subsets of features) toward the goal of predicting a response -- in other words, to gauge the variable importance of features. Most recent work on variable importance assessment has focused on describing the importance of features within the confines of a given prediction algorithm. However, such assessment… ▽ More

    Submitted 13 September, 2021; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: 69 total pages (35 in the main document, 34 supplementary), 23 figures (4 in the main document, 19 supplementary)

  7. arXiv:2003.01856  [pdf, other

    stat.ME math.ST

    Universal sieve-based strategies for efficient estimation using machine learning tools

    Authors: Hongxiang Qiu, Alex Luedtke, Marco Carone

    Abstract: Suppose that we wish to estimate a finite-dimensional summary of one or more function-valued features of an underlying data-generating mechanism under a nonparametric model. One approach to estimation is by plugging in flexible estimates of these features. Unfortunately, in general, such estimators may not be asymptotically efficient, which often makes these estimators difficult to use as a basis… ▽ More

    Submitted 26 August, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 46 pages, 6 figures, submitted to Bernoulli

  8. arXiv:1810.09022  [pdf, other

    math.ST stat.ME

    Correcting an estimator of a multivariate monotone function with isotonic regression

    Authors: Ted Westling, Mark van der Laan, Marco Carone

    Abstract: In many problems, a sensible estimator of a possibly multivariate monotone function may itself fail to be monotone. We study the correction of such an estimator obtained via projection onto the space of functions monotone over a finite grid in the domain. We demonstrate that this corrected estimator has no worse supremal estimation error than the initial estimator, and that analogously corrected c… ▽ More

    Submitted 4 September, 2019; v1 submitted 21 October, 2018; originally announced October 2018.

  9. arXiv:1806.01928  [pdf, other

    math.ST

    A unified study of nonparametric inference for monotone functions

    Authors: Ted Westling, Marco Carone

    Abstract: The problem of nonparametric inference on a monotone function has been extensively studied in many particular cases. Estimators considered have often been of so-called Grenander type, being representable as the left derivative of the greatest convex minorant or least concave majorant of an estimator of a primitive function. In this paper, we provide general conditions for consistency and pointwise… ▽ More

    Submitted 29 November, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: Substantial revisions made to the manuscript

  10. arXiv:1608.08717  [pdf, other

    math.ST

    Toward computerized efficient estimation in infinite-dimensional models

    Authors: Marco Carone, Alexander R. Luedtke, Mark J. van der Laan

    Abstract: Despite the risk of misspecification they are tied to, parametric models continue to be used in statistical practice because they are accessible to all. In particular, efficient estimation procedures in parametric models are simple to describe and implement. Unfortunately, the same cannot be said of semiparametric and nonparametric models. While the latter often reflect the level of available scie… ▽ More

    Submitted 30 August, 2016; originally announced August 2016.

  11. arXiv:1511.08369  [pdf, other

    math.ST

    Second-Order Inference for the Mean of a Variable Missing at Random

    Authors: Iván Díaz, Marco Carone, Mark J. van der Laan

    Abstract: We present a second-order estimator of the mean of a variable subject to missingness, under the missing at random assumption. The estimator improves upon existing methods by using an approximate second-order expansion of the parameter functional, in addition to the first-order expansion employed by standard doubly robust methods. This results in weaker assumptions about the convergence rates neces… ▽ More

    Submitted 26 November, 2015; originally announced November 2015.

  12. arXiv:1510.04195  [pdf, other

    math.ST stat.ML

    An Omnibus Nonparametric Test of Equality in Distribution for Unknown Functions

    Authors: Alexander R. Luedtke, Marco Carone, Mark J. van der Laan

    Abstract: We present a novel family of nonparametric omnibus tests of the hypothesis that two unknown but estimable functions are equal in distribution when applied to the observed data structure. We developed these tests, which represent a generalization of the maximum mean discrepancy tests described in Gretton et al. [2006], using recent developments from the higher-order pathwise differentiability liter… ▽ More

    Submitted 13 June, 2017; v1 submitted 14 October, 2015; originally announced October 2015.

    MSC Class: 62G10

  13. Large-sample study of the kernel density estimators under multiplicative censoring

    Authors: Masoud Asgharian, Marco Carone, Vahid Fakoor

    Abstract: The multiplicative censoring model introduced in Vardi [Biometrika 76 (1989) 751--761] is an incomplete data problem whereby two independent samples from the lifetime distribution $G$, $\mathcal{X}_m=(X_1,...,X_m)$ and $\mathcal{Z}_n=(Z_1,...,Z_n)$, are observed subject to a form of coarsening. Specifically, sample $\mathcal{X}_m$ is fully observed while $\mathcal{Y}_n=(Y_1,...,Y_n)$ is observed i… ▽ More

    Submitted 29 May, 2012; originally announced May 2012.

    Comments: Published in at http://dx.doi.org/10.1214/11-AOS954 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS954

    Journal ref: Annals of Statistics 2012, Vol. 40, No. 1, 159-187