Skip to main content

Showing 1–14 of 14 results for author: Martín, G P

.
  1. arXiv:2504.18730  [pdf

    stat.ME

    A general sample size framework for developing or updating a clinical prediction model

    Authors: Richard D Riley, Rebecca Whittle, Mohsen Sadatsafavi, Glen P. Martin, Alexander Pate, Gary S. Collins, Joie Ensor

    Abstract: Aims: To propose a general sample size framework for developing or updating a clinical prediction model using any statistical or machine learning method, based on drawing samples from anticipated posterior distributions and targeting assurance in predictive performance. Methods: Users provide a reference model (eg, matching outcome incidence, predictor weights and c-statistic of previous models)… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: 5 Tables, 5 Figures, 7400 words

  2. arXiv:2504.06799  [pdf

    stat.ME

    Compatibility of Missing Data Handling Methods across the Stages of Producing Clinical Prediction Models

    Authors: Antonia Tsvetanova, Matthew Sperrin, David A. Jenkins, Niels Peek, Iain Buchan, Stephanie Hyland, Marcus Taylor, Angela Wood, Richard D. Riley, Glen P. Martin

    Abstract: Missing data is a challenge when developing, validating and deploying clinical prediction models (CPMs). Traditionally, decisions concerning missing data handling during CPM development and validation havent accounted for whether missingness is allowed at deployment. We hypothesised that the missing data approach used during model development should optimise model performance upon deployment, whil… ▽ More

    Submitted 7 May, 2025; v1 submitted 9 April, 2025; originally announced April 2025.

    Comments: 40 pages, 8 figures (10 supplementary figures)

  3. arXiv:2501.14482  [pdf

    stat.ME

    A decomposition of Fisher's information to inform sample size for developing fair and precise clinical prediction models -- Part 2: time-to-event outcomes

    Authors: Richard D Riley, Gary S Collins, Lucinda Archer, Rebecca Whittle, Amardeep Legha, Laura Kirton, Paula Dhiman, Mohsen Sadatsafavi, Nicola J Adderley, Joseph Alderman, Glen P Martin, Joie Ensor

    Abstract: Background: When developing a clinical prediction model using time-to-event data, previous research focuses on the sample size to minimise overfitting and precisely estimate the overall risk. However, instability of individual-level risk estimates may still be large. Methods: We propose a decomposition of Fisher's information matrix to examine and calculate the sample size required for developing… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: arXiv admin note: text overlap with arXiv:2407.09293

  4. arXiv:2412.04275  [pdf

    stat.ME

    Scoping review of methodology for aiding generalisability and transportability of clinical prediction models

    Authors: Kritchavat Ploddi, Matthew Sperrin, Glen P. Martin, Maurice M. O'Connell

    Abstract: Generalisability and transportability of clinical prediction models (CPMs) refer to their ability to maintain predictive performance when applied to new populations. While CPMs may show good generalisability or transportability to a specific new population, it is rare for a CPM to be developed using methods that prioritise good generalisability or transportability. There is an emerging literature… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

  5. arXiv:2411.17806  [pdf, other

    hep-th gr-qc

    Cosmic censorship in a (dual) collider

    Authors: Marc Aragonès Fontboté, David Mateos, Guillem Pérez Martín, Wilke van der Schee, Javier G. Subils

    Abstract: We investigate cosmic censorship in anti-de Sitter space in holographic models in which the ground state is described by a good singularity. These include supersymmetric truncations of string/M-theory, for which a positive-energy theorem holds. At the boundary, our solutions describe a boost-invariant fluid in which the temperature decreases monotonically with time. On the gravity side, they corre… ▽ More

    Submitted 9 January, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: 5 pages + appendices, 11 figures

  6. arXiv:2407.09293  [pdf

    stat.ME

    A decomposition of Fisher's information to inform sample size for developing fair and precise clinical prediction models -- part 1: binary outcomes

    Authors: Richard D Riley, Gary S Collins, Rebecca Whittle, Lucinda Archer, Kym IE Snell, Paula Dhiman, Laura Kirton, Amardeep Legha, Xiaoxuan Liu, Alastair Denniston, Frank E Harrell Jr, Laure Wynants, Glen P Martin, Joie Ensor

    Abstract: When developing a clinical prediction model, the sample size of the development dataset is a key consideration. Small sample sizes lead to greater concerns of overfitting, instability, poor performance and lack of fairness. Previous research has outlined minimum sample size calculations to minimise overfitting and precisely estimate the overall risk. However even when meeting these criteria, the u… ▽ More

    Submitted 24 January, 2025; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: 36 pages, 6 figures, 1 table

  7. How to develop, externally validate, and update multinomial prediction models

    Authors: Celina K Gehringer, Glen P Martin, Ben Van Calster, Kimme L Hyrich, Suzanne M M Verstappen, Jamie C Sergeant

    Abstract: Multinomial prediction models (MPMs) have a range of potential applications across healthcare where the primary outcome of interest has multiple nominal or ordinal categories. However, the application of MPMs is scarce, which may be due to the added methodological complexities that they bring. This article provides a guide of how to develop, externally validate, and update MPMs. Using a previously… ▽ More

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

  8. arXiv:2308.13394  [pdf

    stat.ME

    Calibration plots for multistate risk predictions models: an overview and simulation comparing novel approaches

    Authors: Alexander Pate, Matthew Sperrin, Richard D. Riley, Niels Peek, Tjeerd Van Staa, Jamie C. Sergeant, Mamas A. Mamas, Gregory Y. H. Lip, Martin O Flaherty, Michael Barrowman, Iain Buchan, Glen P. Martin

    Abstract: Introduction. There is currently no guidance on how to assess the calibration of multistate models used for risk prediction. We introduce several techniques that can be used to produce calibration plots for the transition probabilities of a multistate model, before assessing their performance in the presence of non-informative and informative censoring through a simulation. Methods. We studied p… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Pre-print for article currently under review

  9. arXiv:2207.12892  [pdf

    stat.ME stat.AP

    Minimum Sample Size for Developing a Multivariable Prediction Model using Multinomial Logistic Regression

    Authors: Alexander Pate, Richard D Riley, Gary S Collins, Maarten van Smeden, Ben Van Calster, Joie Ensor, Glen P Martin

    Abstract: Multinomial logistic regression models allow one to predict the risk of a categorical outcome with more than 2 categories. When developing such a model, researchers should ensure the number of participants (n) is appropriate relative to the number of events (E.k) and the number of predictor parameters (p.k) for each category k. We propose three criteria to determine the minimum n required in light… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  10. arXiv:2206.12295  [pdf

    stat.ME

    Imputation and Missing Indicators for handling missing data in the development and implementation of clinical prediction models: a simulation study

    Authors: Rose Sisk, Matthew Sperrin, Niels Peek, Maarten van Smeden, Glen P. Martin

    Abstract: Background: Existing guidelines for handling missing data are generally not consistent with the goals of prediction modelling, where missing data can occur at any stage of the model pipeline. Multiple imputation (MI), often heralded as the gold standard approach, can be challenging to apply in the clinic. Clearly, the outcome cannot be used to impute data at prediction time. Regression imputation… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 42 pages. Submitted to Statistical Methods in Medical Research in October 2021

  11. arXiv:2011.09815  [pdf

    stat.ME stat.AP stat.ML

    A scoping review of causal methods enabling predictions under hypothetical interventions

    Authors: Lijing Lin, Matthew Sperrin, David A. Jenkins, Glen P. Martin, Niels Peek

    Abstract: Background and Aims: The methods with which prediction models are usually developed mean that neither the parameters nor the predictions should be interpreted causally. However, when prediction models are used to support decision making, there is often a need for predicting outcomes under hypothetical interventions. We aimed to identify published methods for developing and validating prediction mo… ▽ More

    Submitted 12 January, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

    Journal ref: Diagnostic and Prognostic Research, 2021

  12. Towards a Framework for the Design, Implementation and Reporting of Methodology Scoping Reviews

    Authors: Glen P. Martin, David Jenkins, Lucy Bull, Rose Sisk, Lijing Lin, William Hulme, Anthony Wilson, Wenjuan Wang, Michael Barrowman, Camilla Sammut-Powell, Alexander Pate, Matthew Sperrin, Niels Peek

    Abstract: Background: In view of the growth of published papers, there is an increasing need for studies that summarise scientific research. An increasingly common review is a 'Methodology scoping review', which provides a summary of existing analytical methods, techniques and software, proposed or applied in research articles, which address an analytical problem or further an analytical approach. However,… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: 22 pages, 2 tables

    Journal ref: Journal of Clinical Epidemiology. (2020)

  13. Clinical Prediction Models to Predict the Risk of Multiple Binary Outcomes: a comparison of approaches

    Authors: Glen P. Martin, Matthew Sperrin, Kym I. E. Snell, Iain Buchan, Richard D. Riley

    Abstract: Clinical prediction models (CPMs) are used to predict clinically relevant outcomes or events. Typically, prognostic CPMs are derived to predict the risk of a single future outcome. However, with rising emphasis on the prediction of multi-morbidity, there is growing need for CPMs to simultaneously predict risks for each of multiple future outcomes. A common approach to multi-outcome risk prediction… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: 34 pages, 2 tables and 5 figures

  14. Examining the impact of data quality and completeness of electronic health records on predictions of patients risks of cardiovascular disease

    Authors: Yan Li, Matthew Sperrin, Glen P. Martin, Darren M Ashcroft, Tjeerd Pieter van Staa

    Abstract: The objective is to assess the extent of variation of data quality and completeness of electronic health records and impact on the robustness of risk predictions of incident cardiovascular disease (CVD) using a risk prediction tool that is based on routinely collected data (QRISK3). The study design is a longitudinal cohort study with a setting of 392 general practices (including 3.6 million patie… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: 2 tables 4 figures in the main manuscript, 1 table 3 figure in appendix. Online published in IJMI, license CC-BY-NC-ND

    MSC Class: 62N01

    Journal ref: https://doi.org/10.1016/j.ijmedinf.2019.104033