Skip to main content

Showing 1–5 of 5 results for author: Underwood, W G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.23870  [pdf, ps, other

    stat.ME math.ST

    Upgrading survival models with CARE

    Authors: William G. Underwood, Henry W. J. Reeve, Oliver Y. Feng, Samuel A. Lambert, Bhramar Mukherjee, Richard J. Samworth

    Abstract: Clinical risk prediction models are regularly updated as new data, often with additional covariates, become available. We propose CARE (Convex Aggregation of relative Risk Estimators) as a general approach for combining existing "external" estimators with a new data set in a time-to-event survival analysis setting. Our method initially employs the new data to fit a flexible family of reproducing k… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

    Comments: 79 pages, 12 figures

    MSC Class: 62N02 (Primary); 62G05; 62P10 (Secondary)

  2. arXiv:2310.09702  [pdf, other

    math.ST stat.ME stat.ML

    Inference with Mondrian Random Forests

    Authors: Matias D. Cattaneo, Jason M. Klusowski, William G. Underwood

    Abstract: Random forests are popular methods for regression and classification analysis, and many different variants have been proposed in recent years. One interesting example is the Mondrian random forest, in which the underlying constituent trees are constructed via a Mondrian process. We give precise bias and variance characterizations, along with a Berry-Esseen-type central limit theorem, for the Mondr… ▽ More

    Submitted 8 April, 2025; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: 64 pages, 1 figure, 6 tables

    MSC Class: 62G08 (Primary); 62G05; 62G20 (Secondary)

  3. arXiv:2210.00362  [pdf, other

    math.ST econ.EM stat.ME

    Yurinskii's Coupling for Martingales

    Authors: Matias D. Cattaneo, Ricardo P. Masini, William G. Underwood

    Abstract: Yurinskii's coupling is a popular theoretical tool for non-asymptotic distributional analysis in mathematical statistics and applied probability, offering a Gaussian strong approximation with an explicit error bound under easily verifiable conditions. Originally stated in $\ell^2$-norm for sums of independent random vectors, it has recently been extended both to the $\ell^p$-norm, for… ▽ More

    Submitted 23 September, 2024; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: 57 pages, 1 figure

    MSC Class: 62E20; 62G20; 60G42

  4. arXiv:2201.05967  [pdf, other

    math.ST stat.ME

    Uniform Inference for Kernel Density Estimators with Dyadic Data

    Authors: Matias D. Cattaneo, Yingjie Feng, William G. Underwood

    Abstract: Dyadic data is often encountered when quantities of interest are associated with the edges of a network. As such it plays an important role in statistics, econometrics and many other data science disciplines. We consider the problem of uniformly estimating a dyadic Lebesgue density function, focusing on nonparametric kernel-based estimators taking the form of dyadic empirical processes. Our main c… ▽ More

    Submitted 13 October, 2023; v1 submitted 15 January, 2022; originally announced January 2022.

    Comments: Article: 23 pages, 3 figures. Supplemental appendix: 72 pages, 3 figures

    MSC Class: 62G05; 62G07; 62M99 (Primary) 91D30; 90B15 (Secondary)

  5. arXiv:2004.01293  [pdf, other

    cs.SI cs.LG physics.soc-ph stat.ML

    Motif-Based Spectral Clustering of Weighted Directed Networks

    Authors: William George Underwood, Andrew Elliott, Mihai Cucuringu

    Abstract: Clustering is an essential technique for network analysis, with applications in a diverse range of fields. Although spectral clustering is a popular and effective method, it fails to consider higher-order structure and can perform poorly on directed networks. One approach is to capture and cluster higher-order structures using motif adjacency matrices. However, current formulations fail to take ed… ▽ More

    Submitted 10 September, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: 38 pages, 20 figures

    Journal ref: Applied Network Science 5, 62 (2020)