Skip to main content

Showing 1–22 of 22 results for author: Barrett, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.18038  [pdf, ps, other

    stat.ME stat.CO

    Assessing the impact of variance heterogeneity and misspecification in mixed-effects location-scale models

    Authors: Vincent Jeanselme, Marco Palma, Jessica K Barrett

    Abstract: Linear Mixed Model (LMM) is a common statistical approach to model the relation between exposure and outcome while capturing individual variability through random effects. However, this model assumes the homogeneity of the error term's variance. Breaking this assumption, known as homoscedasticity, can bias estimates and, consequently, may change a study's conclusions. If this assumption is unmet,… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2503.12270  [pdf, other

    stat.ME

    A Bayesian location-scale joint model for time-to-event and multivariate longitudinal data with association based on within-individual variability

    Authors: Marco Palma, Ruth H Keogh, Siobhán B Carr, Rhonda Szczesniak, David Taylor-Robinson, Angela M Wood, Graciela Muniz-Terrera, Jessica K Barrett

    Abstract: Within-individual variability of health indicators measured over time is becoming commonly used to inform about disease progression. Simple summary statistics (e.g. the standard deviation for each individual) are often used but they are not suited to account for time changes. In addition, when these summary statistics are used as covariates in a regression model for time-to-event outcomes, the est… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  3. arXiv:2411.00405  [pdf, other

    stat.ML cs.LG

    HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search

    Authors: Tuan Ngo Nguyen, Jay Barrett, Kwang-Sung Jun

    Abstract: We study the problem of estimating the \emph{value} of the largest mean among K distributions via samples from them (rather than estimating \emph{which} distribution has the largest mean), which arises from various machine learning tasks including Q-learning and Monte Carlo Tree Search (MCTS). While there have been a few proposed algorithms, their performance analyses have been limited to their bi… ▽ More

    Submitted 28 April, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: In Proceedings of the Artificial Intelligence and Statistics (AISTATS) 2025

  4. arXiv:2410.22534  [pdf, other

    stat.ME stat.CO

    Bayesian shared parameter joint models for heterogeneous populations

    Authors: Sida Chen, Danilo Alvares, Marco Palma, Jessica K. Barrett

    Abstract: Joint models (JMs) for longitudinal and time-to-event data are an important class of biostatistical models in health and medical research. When the study population consists of heterogeneous subgroups, the standard JM may be inadequate and lead to misleading results. Joint latent class models (JLCMs) and their variants have been proposed to incorporate latent class structures into JMs. JLCMs are u… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

  5. arXiv:2408.03463  [pdf, other

    stat.ME cs.AI

    Identifying treatment response subgroups in observational time-to-event data

    Authors: Vincent Jeanselme, Chang Ho Yoon, Fabian Falck, Brian Tom, Jessica Barrett

    Abstract: Identifying patient subgroups with different treatment responses is an important task to inform medical recommendations, guidelines, and the design of future clinical trials. Existing approaches for treatment effect estimation primarily rely on Randomised Controlled Trials (RCTs), which are often limited by insufficient power, multiple comparisons, and unbalanced covariates. In addition, RCTs tend… ▽ More

    Submitted 23 February, 2025; v1 submitted 6 August, 2024; originally announced August 2024.

    Comments: Preprint under review

  6. arXiv:2407.14311  [pdf, ps, other

    stat.ME stat.AP

    A Bayesian joint model of multiple longitudinal and categorical outcomes with application to multiple myeloma using permutation-based variable importance

    Authors: Danilo Alvares, Jessica K. Barrett, François Mercier, Jochen Schulze, Sean Yiu, Felipe Castro, Spyros Roumpanis, Yajing Zhu

    Abstract: Joint models have proven to be an effective approach for uncovering potentially hidden connections between various types of outcomes, mainly continuous, time-to-event, and binary. Typically, longitudinal continuous outcomes are characterized by linear mixed-effects models, survival outcomes are described by proportional hazards models, and the link between outcomes are captured by shared random ef… ▽ More

    Submitted 14 June, 2025; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: 29 pages, 5 figures

  7. arXiv:2405.20418  [pdf, other

    stat.AP stat.ME

    A Bayesian joint model of multiple nonlinear longitudinal and competing risks outcomes for dynamic prediction in multiple myeloma: joint estimation and corrected two-stage approaches

    Authors: Danilo Alvares, Jessica K. Barrett, François Mercier, Spyros Roumpanis, Sean Yiu, Felipe Castro, Jochen Schulze, Yajing Zhu

    Abstract: Predicting cancer-associated clinical events is challenging in oncology. In Multiple Myeloma (MM), a cancer of plasma cells, disease progression is determined by changes in biomarkers, such as serum concentration of the paraprotein secreted by plasma cells (M-protein). Therefore, the time-dependent behaviour of M-protein and the transition across lines of therapy (LoT) that may be a consequence of… ▽ More

    Submitted 11 November, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 40 pages, 13 figures

  8. arXiv:2308.12460  [pdf, other

    stat.ME stat.AP

    Bayesian blockwise inference for joint models of longitudinal and multistate processes

    Authors: Sida Chen, Danilo Alvares, Christopher Jackson, Jessica Barrett

    Abstract: Joint models (JM) for longitudinal and survival data have gained increasing interest and found applications in a wide range of clinical and biomedical settings. These models facilitate the understanding of the relationship between outcomes and enable individualized predictions. In many applications, more complex event processes arise, necessitating joint longitudinal and multistate models. However… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  9. arXiv:2305.06703  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Fine-Gray: Monotonic neural networks for competing risks

    Authors: Vincent Jeanselme, Chang Ho Yoon, Brian Tom, Jessica Barrett

    Abstract: Time-to-event modelling, known as survival analysis, differs from standard regression as it addresses censoring in patients who do not experience the event of interest. Despite competitive performances in tackling this problem, machine learning methods often ignore other competing risks that preclude the event of interest. This practice biases the survival estimation. Extensions to address this ch… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Presented at the Conference on Health, Inference, and Learning (CHIL) 2023

  10. arXiv:2304.04652  [pdf, other

    stat.ME

    A Framework for Understanding Selection Bias in Real-World Healthcare Data

    Authors: Ritoban Kundu, Xu Shi, Jean Morrison, Jessica Barrett, Bhramar Mukherjee

    Abstract: Using administrative patient-care data such as Electronic Health Records (EHR) and medical/ pharmaceutical claims for population-based scientific research has become increasingly common. With vast sample sizes leading to very small standard errors, researchers need to pay more attention to potential biases in the estimates of association parameters of interest, specifically to biases that do not d… ▽ More

    Submitted 17 August, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  11. arXiv:2302.04992  [pdf, other

    stat.AP

    Optimal risk-assessment scheduling for primary prevention of cardiovascular disease

    Authors: Francesca Gasperoni, Christopher H. Jackson, Angela M. Wood, Michael J. Sweeting, Paul J. Newcombe, David Stevens, Jessica K. Barrett

    Abstract: In this work, we introduce a personalised and age-specific Net Benefit function, composed of benefits and costs, to recommend optimal timing of risk assessments for cardiovascular disease prevention. We extend the 2-stage landmarking model to estimate patient-specific CVD risk profiles, adjusting for time-varying covariates. We apply our model to data from the Clinical Practice Research Datalink,… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  12. arXiv:1912.05258  [pdf, ps, other

    stat.ME stat.AP

    Sample Size Estimation using a Latent Variable Model for Mixed Outcome Co-Primary, Multiple Primary and Composite Endpoints

    Authors: Martina McMenamin, Jessica K. Barrett, Anna Berglind, James M. S. Wason

    Abstract: Mixed outcome endpoints that combine multiple continuous and discrete components to form co-primary, multiple primary or composite endpoints are often employed as primary outcome measures in clinical trials. There are many advantages to joint modelling the individual outcomes using a latent variable framework, however in order to make use of the model in practice we require techniques for sample s… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: 36 pages, 8 figures, 7 tables

  13. arXiv:1903.06676  [pdf, other

    stat.AP

    Selective recruitment designs for improving observational studies using electronic health records

    Authors: James E. Barrett, Aylin Cakiroglu, Catey Bunce, Anoop Shah, Spiros Denaxas

    Abstract: Large scale electronic health records (EHRs) present an opportunity to quickly identify suitable individuals in order to directly invite them to participate in an observational study. EHRs can contain data from millions of individuals, raising the question of how to optimally select a cohort of size n from a larger pool of size N. In this paper we propose a simple selective recruitment protocol th… ▽ More

    Submitted 13 February, 2019; originally announced March 2019.

  14. arXiv:1902.07037  [pdf, other

    stat.ME

    Employing latent variable models to improve efficiency in composite endpoint analysis

    Authors: Martina McMenamin, Jessica K. Barrett, Anna Berglind, James M. S. Wason

    Abstract: Composite endpoints that combine multiple outcomes on different scales are common in clinical trials, particularly in chronic conditions. In many of these cases, patients will have to cross a predefined responder threshold in each of the outcomes to be classed as a responder overall. One instance of this occurs in systemic lupus erythematosus (SLE), where the responder endpoint combines two contin… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

    Comments: 44 pages, 12 figures

  15. Mixed effects models for healthcare longitudinal data with an informative visiting process: a Monte Carlo simulation study

    Authors: Alessandro Gasparini, Keith R. Abrams, Jessica K. Barrett, Rupert W. Major, Michael J. Sweeting, Nigel J. Brunskill, Michael J. Crowther

    Abstract: Electronic health records are being increasingly used in medical research to answer more relevant and detailed clinical questions; however, they pose new and significant methodological challenges. For instance, observation times are likely correlated with the underlying disease severity: patients with worse conditions utilise health care more and may have worse biomarker values recorded. Tradition… ▽ More

    Submitted 25 July, 2019; v1 submitted 1 August, 2018; originally announced August 2018.

  16. Estimating the association between blood pressure variability and cardiovascular disease: An application using the ARIC Study

    Authors: Jessica K. Barrett, Raphael Huille, Richard Parker, Yuichiro Yano, Michael Griswold

    Abstract: The association between visit-to-visit systolic blood pressure variability and cardiovascular events has recently received a lot of attention in the cardiovascular literature. But blood pressure variability is usually estimated on a person-by-person basis, and is therefore subject to considerable measurement error. We demonstrate that hazard ratios estimated using this approach are subject to bias… ▽ More

    Submitted 23 January, 2019; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: 20 pages, 4 figures

  17. arXiv:1705.01730  [pdf, ps, other

    stat.AP cond-mat.dis-nn physics.data-an

    Replica analysis of overfitting in regression models for time-to-event data

    Authors: ACC Coolen, JE Barrett, P Paga, CJ Perez-Vicente

    Abstract: Overfitting, which happens when the number of parameters in a model is too large compared to the number of data points available for determining these parameters, is a serious and growing problem in survival analysis. While modern medicine presents us with data of unprecedented dimensionality, these data cannot yet be used effectively for clinical outcome prediction. Standard error measures in max… ▽ More

    Submitted 20 July, 2017; v1 submitted 4 May, 2017; originally announced May 2017.

    Comments: 37 pages, 9 figures

    MSC Class: 62

  18. arXiv:1509.01058  [pdf, other

    math.ST stat.ME

    Information-adaptive clinical trials with selective recruitment and binary outcomes

    Authors: James E. Barrett

    Abstract: Selective recruitment designs preferentially recruit individuals that are estimated to be statistically informative onto a clinical trial. Individuals that are expected to contribute less information have a lower probability of recruitment. Furthermore, in an information-adaptive design recruits are allocated to treatment arms in a manner that maximises information gain. The informativeness of an… ▽ More

    Submitted 30 May, 2017; v1 submitted 3 September, 2015; originally announced September 2015.

  19. arXiv:1502.03813  [pdf, other

    stat.AP math.ST

    Information-adaptive clinical trials: a selective recruitment design

    Authors: James E. Barrett

    Abstract: We propose a novel adaptive design for clinical trials with time-to-event outcomes and covariates (which may consist of or include biomarkers). Our method is based on the expected entropy of the posterior distribution of a proportional hazards model. The expected entropy is evaluated as a function of a patient's covariates, and the information gained due to a patient is defined as the decrease in… ▽ More

    Submitted 28 March, 2016; v1 submitted 12 February, 2015; originally announced February 2015.

  20. arXiv:1406.0812  [pdf, other

    math.ST stat.ME

    Covariate dimension reduction for survival data via the Gaussian process latent variable model

    Authors: James E. Barrett, Anthony C. C. Coolen

    Abstract: The analysis of high dimensional survival data is challenging, primarily due to the problem of overfitting which occurs when spurious relationships are inferred from data that subsequently fail to exist in test data. Here we propose a novel method of extracting a low dimensional representation of covariates in survival data by combining the popular Gaussian Process Latent Variable Model (GPLVM) wi… ▽ More

    Submitted 27 January, 2016; v1 submitted 3 June, 2014; originally announced June 2014.

  21. arXiv:1312.1591  [pdf, ps, other

    math.ST stat.ME

    Gaussian process regression for survival data with competing risks

    Authors: James E. Barrett, Anthony C. C. Coolen

    Abstract: We apply Gaussian process (GP) regression, which provides a powerful non-parametric probabilistic method of relating inputs to outputs, to survival data consisting of time-to-event and covariate measurements. In this context, the covariates are regarded as the `inputs' and the event times are the `outputs'. This allows for highly flexible inference of non-linear relationships between covariates an… ▽ More

    Submitted 5 September, 2014; v1 submitted 5 December, 2013; originally announced December 2013.

  22. arXiv:1307.0323  [pdf, ps, other

    stat.ML

    Dimensionality Detection and Integration of Multiple Data Sources via the GP-LVM

    Authors: James Barrett, Anthony C. C. Coolen

    Abstract: The Gaussian Process Latent Variable Model (GP-LVM) is a non-linear probabilistic method of embedding a high dimensional dataset in terms low dimensional `latent' variables. In this paper we illustrate that maximum a posteriori (MAP) estimation of the latent variables and hyperparameters can be used for model selection and hence we can determine the optimal number or latent variables and the most… ▽ More

    Submitted 1 July, 2013; originally announced July 2013.

    Comments: 15 pages, 3 figures