-
A computationally efficient framework for realistic epidemic modelling through Gaussian Markov random fields
Authors:
Angelos Alexopoulos,
Paul Birrell,
Daniela De Angelis
Abstract:
We tackle limitations of ordinary differential equation-driven Susceptible-Infectious-Removed (SIR) models and their extensions that have recently been employed for epidemic nowcasting and forecasting. In particular, we deal with challenges related to the extension of SIR-type models to account for the so-called environmental stochasticity, i.e., external factors, such as seasonal forcing, social cycles and vaccinations, that can dramatically affect outbreaks of infectious diseases. Typically, in SIR-type models environmental stochasticity is modelled through stochastic processes. However, this stochastic extension of epidemic models leads to models whose dimension is large and increases over time. Here we propose a Bayesian approach to build an efficient modelling and inferential framework for epidemic nowcasting and forecasting by using Gaussian Markov random fields to model the evolution of these stochastic processes over time and across population strata. Importantly, we also develop a bespoke and computationally efficient Markov chain Monte Carlo algorithm to estimate the large number of parameters and latent states of the proposed model. We test our approach on simulated data and we apply it to real data from the Covid-19 pandemic in the United Kingdom.
Submitted 6 May, 2025;
originally announced May 2025.
-
Estimating the duration of RT-PCR positivity for SARS-CoV-2 from doubly interval censored data with undetected infections
Authors:
Joshua Blake,
Paul Birrell,
A. Sarah Walker,
Koen B. Pouwels,
Thomas House,
Brian D. M. Tom,
Theodore Kypraios,
Daniela De Angelis
Abstract:
Monitoring the incidence of new infections during a pandemic is critical for an effective public health response. General population prevalence surveys for SARS-CoV-2 can provide high-quality data to estimate incidence. However, estimation relies on understanding the distribution of the duration that infections remain detectable. This study addresses this need using data from the Coronavirus Infection Survey (CIS), a long-term, longitudinal, general population survey conducted in the UK. Analyzing these data presents unique challenges, such as doubly interval censoring, undetected infections, and false negatives. We propose a Bayesian nonparametric survival analysis approach, estimating a discrete-time distribution of durations and integrating prior information derived from a complementary study. Our methodology is validated through a simulation study, including its resilience to model misspecification, and then applied to the CIS dataset. This results in the first estimate of the full duration distribution in a general population, as well as methodology that could be transferred to new contexts.
Submitted 7 February, 2025;
originally announced February 2025.
-
Real-time modelling of the SARS-CoV-2 pandemic in England 2020-2023: a challenging data integration
Authors:
Paul J Birrell,
Joshua Blake,
Joel Kandiah,
Angelos Alexopoulos,
Edwin van Leeuwen,
Koen Pouwels,
Sanmitra Ghosh,
Colin Starr,
Ann Sarah Walker,
Thomas A House,
Nigel Gay,
Thomas Finnie,
Nick Gent,
André Charlett,
Daniela De Angelis
Abstract:
A central pillar of the UK's response to the SARS-CoV-2 pandemic was the provision of up-to-the-moment nowcasts and short-term projections to monitor current trends in transmission and associated healthcare burden. Here we present a detailed deconstruction of one of the 'real-time' models that was a key contributor to this response, focussing on the model adaptations required over three pandemic years characterised by the imposition of lockdowns, mass vaccination campaigns and the emergence of new pandemic strains. The Bayesian model integrates an array of surveillance and other data sources including a novel approach to incorporating prevalence estimates from an unprecedented large-scale household survey. We present a full range of estimates of the epidemic history and the changing severity of the infection, quantify the impact of the vaccination programme and deconstruct contributing factors to the reproduction number. We further investigate the sensitivity of model-derived insights to the availability and timeliness of prevalence data, identifying its importance to the production of robust estimates.
Submitted 7 August, 2024;
originally announced August 2024.
-
Sample-efficient neural likelihood-free Bayesian inference of implicit HMMs
Authors:
Sanmitra Ghosh,
Paul J. Birrell,
Daniela De Angelis
Abstract:
Likelihood-free inference methods based on neural conditional density estimation were shown to drastically reduce the simulation burden in comparison to classical methods such as ABC. When applied in the context of any latent variable model, such as a Hidden Markov model (HMM), these methods are designed to only estimate the parameters, rather than the joint distribution of the parameters and the hidden states. Naive application of these methods to an HMM, ignoring the inference of this joint posterior distribution, will thus produce an inaccurate estimate of the posterior predictive distribution, in turn hampering the assessment of goodness-of-fit. To rectify this problem, we propose a novel, sample-efficient likelihood-free method for estimating the high-dimensional hidden states of an implicit HMM. Our approach relies on directly learning the intractable posterior distribution of the hidden states, using an autoregressive flow, by exploiting the Markov property. Upon evaluating our approach on some implicit HMMs, we found that the quality of the estimates retrieved using our method is comparable to what can be achieved using a much more computationally expensive SMC algorithm.
Submitted 2 May, 2024;
originally announced May 2024.
-
The Lifebelt Particle Filter for robust estimation from low-valued count data
Authors:
Alice Corbella,
Trevelyan J. McKinley,
Paul J. Birrell,
Daniela De Angelis,
Anne M. Presanis,
Gareth O. Roberts,
Simon E. F. Spencer
Abstract:
Particle filtering methods can be applied to estimation problems in discrete spaces on bounded domains, to sample from and marginalise over unknown hidden states. As in continuous settings, problems such as particle degradation can arise: proposed particles can be incompatible with the data, lying in low probability regions or outside the boundary constraints, and the discrete system could result in all particles having weights of zero. In this paper we introduce the Lifebelt Particle Filter (LBPF), a novel method for robust likelihood estimation in low-valued count problems. The LBPF combines a standard particle filter with one (or more) lifebelt particles which, by construction, lie within the boundaries of the discrete random variables, and therefore are compatible with the data. A mixture of resampled and non-resampled particles allows for the preservation of the lifebelt particle, which, together with the remaining particle swarm, provides samples from the filtering distribution, and can be used to generate unbiased estimates of the likelihood. The main benefit of the LBPF is that only one or a few well-chosen particles are sufficient to prevent particle collapse. Unlike other methods, there is no need to increase the number of particles, and therefore the computational effort, in regions of the parameter space that generate less likely hidden states. The LBPF can be used within a pseudo-marginal scheme to draw inferences on static parameters, $\boldsymbol{\theta}$, governing the system. We address here the estimation of a parameter governing probabilities of deaths and recoveries of hospitalised patients during an epidemic.
Submitted 4 December, 2024; v1 submitted 8 December, 2022;
originally announced December 2022.
-
An approximate diffusion process for environmental stochasticity in infectious disease transmission modelling
Authors:
Sanmitra Ghosh,
Paul J. Birrell,
Daniela De Angelis
Abstract:
Modelling the transmission dynamics of an infectious disease is a complex task. Not only is it difficult to accurately model the inherent non-stationarity and heterogeneity of transmission, but it is nearly impossible to describe, mechanistically, changes in extrinsic environmental factors including public behaviour and seasonal fluctuations. An elegant approach to capturing environmental stochasticity is to model the force of infection as a stochastic process. However, inference in this context requires solving a computationally expensive "missing data" problem, using data-augmentation techniques. We propose to model the time-varying transmission-potential as an approximate diffusion process using a path-wise series expansion of Brownian motion. This approximation replaces the "missing data" imputation step with the inference of the expansion coefficients: a simpler and computationally cheaper task. We illustrate the merit of this approach through two examples: modelling influenza using a canonical SIR model, and modelling the COVID-19 pandemic using a multi-type SEIR model.
Submitted 30 August, 2022;
originally announced August 2022.
-
Inferring Epidemics from Multiple Dependent Data via Pseudo-Marginal Methods
Authors:
Alice Corbella,
Anne M Presanis,
Paul J Birrell,
Daniela De Angelis
Abstract:
Health-policy planning requires evidence on the burden that epidemics place on healthcare systems. Multiple, often dependent, datasets provide a noisy and fragmented signal from the unobserved epidemic process including transmission and severity dynamics. This paper explores important challenges to the use of state-space models for epidemic inference when multiple dependent datasets are analysed. We propose a new semi-stochastic model that exploits deterministic approximations for large-scale transmission dynamics while retaining stochasticity in the occurrence and reporting of relatively rare severe events. This model is suitable for many real-time situations including large seasonal epidemics and pandemics. Within this context, we develop algorithms to provide exact parameter inference and test them via simulation. Finally, we apply our joint model and the proposed algorithm to several surveillance datasets on the 2017-18 influenza epidemic in England to reconstruct transmission dynamics and estimate the daily new influenza infections as well as severity indicators such as the case-hospitalisation risk and the hospital-intensive care risk.
Submitted 10 September, 2024; v1 submitted 19 April, 2022;
originally announced April 2022.
-
Trends in COVID-19 hospital outcomes in England before and after vaccine introduction, a cohort study
Authors:
Peter Kirwan,
Andre Charlett,
Paul Birrell,
Suzanne Elgohari,
Russell Hope,
Sema Mandal,
Daniela De Angelis,
Anne Presanis
Abstract:
Widespread vaccination campaigns have changed the landscape for COVID-19, vastly altering symptoms and reducing morbidity and mortality. We estimate trends in mortality by month of admission and vaccination status among those hospitalised with COVID-19 in England between March 2020 and September 2021, controlling for demographic factors and hospital load.
Among 259,727 hospitalised COVID-19 cases, 51,948 (20.0%) experienced mortality in hospital. Hospitalised fatality risk ranged from 40.3% (95% confidence interval 39.4-41.3%) in March 2020 to 8.1% (7.2-9.0%) in June 2021. Older individuals and those with multiple co-morbidities were more likely to die or else experienced longer stays prior to discharge. Compared to unvaccinated people, the hazard of hospitalised mortality was 0.71 (0.67-0.77) with a first vaccine dose, and 0.56 (0.52-0.61) with a second vaccine dose. Compared to hospital load at 0-20% of the busiest week, the hazard of hospitalised mortality during periods of peak load (90-100%), was 1.23 (1.12-1.34).
The prognosis for people hospitalised with COVID-19 in England has varied substantially throughout the pandemic and according to case-mix, vaccination, and hospital load. Our estimates provide an indication for demands on hospital resources, and the relationship between hospital burden and outcomes.
Submitted 3 August, 2022; v1 submitted 20 December, 2021;
originally announced December 2021.
-
HIV transmission in men who have sex with men in England: on track for elimination by 2030?
Authors:
Francesco Brizzi,
Paul J Birrell,
Peter Kirwan,
Dana Ogaz,
Alison E Brown,
Valerie C Delpech,
O Noel Gill,
Daniela De Angelis
Abstract:
Background: After a decade of a treatment as prevention (TasP) strategy based on progressive HIV testing scale-up and earlier treatment, a reduction in the estimated number of new infections in men-who-have-sex-with-men (MSM) in England had yet to be identified by 2010. To achieve internationally agreed targets for HIV control and elimination, test-and-treat prevention efforts have been dramatically intensified over the period 2010-2015, and, from 2016, further strengthened by pre-exposure prophylaxis (PrEP).
Methods: Application of a novel age-stratified back-calculation approach to data on new HIV diagnoses and CD4 count-at-diagnosis, enabled age-specific estimation of HIV incidence, undiagnosed infections and mean time-to-diagnosis across both the 2010-2015 and 2016-2018 periods. Estimated incidence trends were then extrapolated, to quantify the likelihood of achieving HIV elimination by 2030.
Findings: A fall in HIV incidence in MSM is estimated to have started in 2012/3, eighteen months before the observed fall in new diagnoses. A steep decrease from 2,770 annual infections (95% credible interval 2,490-3,040) in 2013 to 1,740 (1,500-2,010) in 2015 is estimated, followed by steady decline from 2016, reaching 854 (441-1,540) infections in 2018. A decline is consistently estimated in all age groups, with a fall particularly marked in the 24-35 age group, and slowest in the 45+ group. Comparable declines are estimated in the number of undiagnosed infections.
Interpretation: The peak and subsequent sharp decline in HIV incidence occurred prior to the phase-in of PrEP. Defining elimination as a public health threat to be < 50 new infections (1.1 infections per 10,000 at risk), 40% of incidence projections hit this threshold by 2030. In practice, targeted policies will be required, particularly among those aged 45+, where STIs are increasing most rapidly.
Submitted 1 October, 2020;
originally announced October 2020.
-
Evidence synthesis for stochastic epidemic models
Authors:
Paul J Birrell,
Daniela De Angelis,
Anne M Presanis
Abstract:
In recent years the role of epidemic models in informing public health policies has progressively grown. Models have become increasingly realistic and more complex, requiring the use of multiple data sources to estimate all quantities of interest. This review summarises the different types of stochastic epidemic models that use evidence synthesis and highlights current challenges.
Submitted 8 June, 2017;
originally announced June 2017.
-
Exploiting routinely collected severe case data to monitor and predict influenza outbreaks
Authors:
Alice Corbella,
Xu-Sheng Zhang,
Paul J. Birrell,
Nicky Boddington,
Anne M. Presanis,
Richard G. Pebody,
Daniela De Angelis
Abstract:
Influenza remains a significant burden on health systems. Effective responses rely on the timely understanding of the magnitude and the evolution of an outbreak. For monitoring purposes, data on severe cases of influenza in England are reported weekly to Public Health England. These data are both readily available and have the potential to provide valuable information to estimate and predict the key transmission features of seasonal and pandemic influenza. We propose an epidemic model that links the underlying unobserved influenza transmission process to data on severe influenza cases. Within a Bayesian framework, we infer retrospectively the parameters of the epidemic model for each seasonal outbreak from 2012 to 2015, including: the effective reproduction number; the initial susceptibility; the probability of admission to intensive care given infection; and the effect of school closure on transmission. The model is also implemented in real time to assess whether early forecasting of the number of admissions to intensive care is possible. Our model of admissions data allows reconstruction of the underlying transmission dynamics, revealing increased transmission during the 2013/14 season and a noticeable effect of the Christmas school holiday on disease spread during the 2012/13 and 2014/15 seasons. When information on the initial immunity of the population is available, forecasts of the number of admissions to intensive care can be substantially improved. Readily available severe case data can be effectively used to estimate epidemiological characteristics and to predict the evolution of an epidemic, crucially allowing real-time monitoring of the transmission and severity of the outbreak.
Submitted 13 November, 2017; v1 submitted 8 June, 2017;
originally announced June 2017.
-
Efficient real-time monitoring of an emerging influenza epidemic: how feasible?
Authors:
Paul J Birrell,
Lorenz Wernisch,
Brian D M Tom,
Leonhard Held,
Gareth O Roberts,
Richard G Pebody,
Daniela De Angelis
Abstract:
A prompt public health response to a new epidemic relies on the ability to monitor and predict its evolution in real time as data accumulate. The 2009 A/H1N1 outbreak in the UK revealed pandemic data as noisy, contaminated, potentially biased, and originating from multiple sources. This seriously challenges the capacity for real-time monitoring. Here we assess the feasibility of real-time inference based on such data by constructing an analytic tool combining an age-stratified SEIR transmission model with various observation models describing the data generation mechanisms. As batches of data become available, a sequential Monte Carlo (SMC) algorithm is developed to synthesise multiple imperfect data streams, iterate epidemic inferences and assess model adequacy amidst a rapidly evolving epidemic environment, substantially reducing computation time in comparison to standard MCMC, to ensure timely delivery of real-time epidemic assessments. In application to simulated data designed to mimic the 2009 A/H1N1 epidemic, SMC is shown to have additional benefits in terms of assessing predictive performance and coping with parameter non-identifiability.
Submitted 3 May, 2019; v1 submitted 18 August, 2016;
originally announced August 2016.
-
Synthesising evidence to estimate pandemic (2009) A/H1N1 influenza severity in 2009-2011
Authors:
Anne M. Presanis,
Richard G. Pebody,
Paul J. Birrell,
Brian D. M. Tom,
Helen K. Green,
Hayley Durnall,
Douglas Fleming,
Daniela De Angelis
Abstract:
Knowledge of the severity of an influenza outbreak is crucial for informing and monitoring appropriate public health responses, both during and after an epidemic. However, case-fatality, case-intensive care admission and case-hospitalisation risks are difficult to measure directly. Bayesian evidence synthesis methods have previously been employed to combine fragmented, under-ascertained and biased surveillance data coherently and consistently, to estimate case-severity risks in the first two waves of the 2009 A/H1N1 influenza pandemic experienced in England. We present in detail the complex probabilistic model underlying this evidence synthesis, and extend the analysis to also estimate severity in the third wave of the pandemic strain during the 2010/2011 influenza season. We adapt the model to account for changes in the surveillance data available over the three waves. We consider two approaches: (a) a two-stage approach using posterior distributions from the model for the first two waves to inform priors for the third wave model; and (b) a one-stage approach modelling all three waves simultaneously. Both approaches result in the same key conclusions: (1) that the age-distribution of the case-severity risks is "u"-shaped, with children and older adults having the highest severity; (2) that the age-distribution of the infection attack rate changes over waves, school-age children being most affected in the first two waves and the attack rate in adults over 25 increasing from the second to third waves; and (3) that when averaged over all age groups, case-severity appears to increase over the three waves. The extent to which the final conclusion is driven by the change in age-distribution of those infected over time is subject to discussion.
Submitted 3 February, 2015; v1 submitted 29 August, 2014;
originally announced August 2014.