-
Exact identifiability analysis for a class of partially observed near-linear stochastic differential equation models
Authors:
Alexander P Browning,
Michael J Chappell,
Hamid Rahkooy,
Torkel E Loman,
Ruth E Baker
Abstract:
Stochasticity plays a key role in many biological systems, necessitating the calibration of stochastic mathematical models to interpret associated data. For model parameters to be estimated reliably, it is typically the case that they must be structurally identifiable. Yet, while theory underlying structural identifiability analysis for deterministic differential equation models is highly developed, there are currently no tools for the general assessment of stochastic models. In this work, we extend the well-established differential algebra framework for structural identifiability analysis to linear and a class of near-linear, two-dimensional, partially observed stochastic differential equation (SDE) models. Our framework is based on a deterministic recurrence relation that describes the dynamics of the statistical moments of the system of SDEs. From this relation, we iteratively form a series of necessarily satisfied equations involving only the observed moments, from which we are able to establish structurally identifiable parameter combinations. We demonstrate our framework for a suite of linear (two- and $n$-dimensional) and non-linear (two-dimensional) models. Most importantly, we define the notion of structural identifiability for SDE models and establish the effect of the initial condition on identifiability. We conclude with a discussion on the applicability and limitations of our approach, and potential future research directions.
Submitted 24 March, 2025;
originally announced March 2025.
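As a rough illustration of the kind of deterministic moment recurrence the abstract above refers to, the sketch below uses a scalar Ornstein-Uhlenbeck SDE, dX = -theta X dt + sigma dW, rather than the partially observed two-dimensional systems studied in the paper. Applying Itô's formula to X^k gives dm_k/dt = -k theta m_k + (sigma^2 / 2) k (k - 1) m_{k-2} for the raw moments m_k = E[X^k]. The function name `ou_moment_rhs` and the parameter names `theta` and `sigma` are assumptions made for this example and do not come from the paper.

```python
def ou_moment_rhs(moments, theta, sigma):
    """Time derivatives of the raw moments m_0..m_K of dX = -theta*X dt + sigma dW.

    Illustrative scalar example only; not the authors' model or code.
    `moments[k]` holds m_k = E[X^k], with m_0 = 1.
    """
    rhs = []
    for k, m_k in enumerate(moments):
        # Drift contribution from the linear term -theta * X.
        drift = -k * theta * m_k
        # Ito correction couples m_k to m_{k-2}; it vanishes for k < 2.
        diffusion = 0.5 * sigma**2 * k * (k - 1) * moments[k - 2] if k >= 2 else 0.0
        rhs.append(drift + diffusion)
    return rhs
```

Because each equation involves only moments of equal or lower order, the system closes without approximation in this linear case, which is what makes a recurrence of this kind deterministic.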
-
Structural identifiability of linear-in-parameter parabolic PDEs through auxiliary elliptic operators
Authors:
Yurij Salmaniw,
Alexander P Browning
Abstract:
Parameter identifiability is often requisite to the effective application of mathematical models in the interpretation of biological data; however, theory applicable to the study of partial differential equations remains limited. We present a new approach to structural identifiability analysis of fully observed parabolic equations that are linear in their parameters. Our approach frames identifiability as an existence and uniqueness problem in a closely related elliptic equation and draws, for homogeneous equations, on the well-known Fredholm alternative to establish unconditional identifiability, and cases where specific choices of initial and boundary conditions lead to non-identifiability. While such cases are in some sense pathological, we demonstrate that this loss of structural identifiability has ramifications for practical identifiability, which is particularly important for spatial problems, where the initial condition is often limited by experimental constraints. For cases with nonlinear reaction terms, uniqueness of solutions to the auxiliary elliptic equation corresponds to identifiability, often leading to unconditional global identifiability under mild assumptions. We present analysis for a suite of simple scalar models with various boundary conditions that include linear (exponential) and nonlinear (logistic) source terms, and a special case of a two-species cell motility model. We conclude by discussing how this new perspective enables well-developed analysis tools to advance the developing theory underlying structural identifiability of partial differential equations.
Submitted 6 April, 2025; v1 submitted 26 November, 2024;
originally announced November 2024.
-
Phenotypic heterogeneity in temporally fluctuating environments
Authors:
Alexander P Browning,
Sara Hamis
Abstract:
Many biological systems regulate phenotypic heterogeneity as a fitness-maximising strategy in uncertain and dynamic environments. Analysis of such strategies is typically confined both to a discrete set of environmental conditions, and to a discrete (often binary) set of phenotypes specialised to each condition. In this work, we extend theory on both fronts to encapsulate both a discrete and continuous spectrum of phenotypes arising in response to two broad classes of environmental fluctuations that drive changes in the phenotype-dependent growth rates; specifically, stochastic environments that are temporally uncorrelated (specifically, white-noise processes) and correlated (specifically, Poisson and Ornstein-Uhlenbeck processes). For tractability, we restrict analysis to an exponential growth model, and consider biologically relevant simplifications that pertain to the relative timescale of phenotype switching. These assumptions yield a series of analytical and semi-analytical expressions that reveal environments in which both discrete and continuous phenotypic heterogeneity are evolutionarily advantageous.
Submitted 16 December, 2024; v1 submitted 5 November, 2024;
originally announced November 2024.
-
Framing global structural identifiability in terms of parameter symmetries
Authors:
Johannes G Borgqvist,
Alexander P Browning,
Fredrik Ohlsson,
Ruth E Baker
Abstract:
A key initial step in mechanistic modelling of dynamical systems using first-order ordinary differential equations is to conduct a global structural identifiability analysis. This entails deducing which parameter combinations can be estimated from certain observed outputs. The standard differential algebra approach answers this question by re-writing the model as a system of ordinary differential equations solely depending on the observed outputs. Over the last decades, alternative approaches for analysing global structural identifiability based on so-called full symmetries, which are Lie symmetries acting on independent and dependent variables as well as parameters, have been proposed. However, the link between the standard differential algebra approach and that using full symmetries remains elusive. In this work, we establish this link by introducing the notion of parameter symmetries, which are a special type of full symmetry that alter parameters while preserving the observed outputs. Our main result states that a parameter combination is structurally identifiable if and only if it is a differential invariant of all parameter symmetries of a given model. We show that the standard differential algebra approach is consistent with the concept of considering structural identifiability in terms of parameter symmetries. We present an alternative symmetry-based approach, referred to as the CaLinInv-recipe, for analysing structural identifiability using parameter symmetries. Lastly, we demonstrate our approach on a glucose-insulin model and an epidemiological model of tuberculosis.
Submitted 2 October, 2024;
originally announced October 2024.
-
Approximate solutions of a general stochastic velocity-jump model subject to discrete-time noisy observations
Authors:
Arianna Ceccarelli,
Alexander P. Browning,
Ruth E. Baker
Abstract:
Advances in experimental techniques allow the collection of high-resolution spatio-temporal data that track individual motile entities over time. These tracking data motivate the use of mathematical models to characterise the motion observed. In this paper, we aim to describe the solutions of velocity-jump models for single-agent motion in one spatial dimension, characterised by successive Markovian transitions within a finite network of n states, each with a specified velocity and a fixed rate of switching to every other state. In particular, we focus on obtaining the solutions of the model subject to noisy, discrete-time observations, with no direct access to the agent state. The lack of direct observation of the hidden state makes the problem of finding the exact distributions generally intractable. Therefore, we derive a series of approximations for the data distributions. We verify the accuracy of these approximations by comparing them to the empirical distributions generated through simulations of four example model structures. These comparisons confirm that the approximations are accurate given sufficiently infrequent state switching relative to the imaging frequency. The approximate distributions computed can be used to obtain fast forward predictions, to give guidelines on experimental design, and as likelihoods for inference and model selection.
Submitted 25 March, 2025; v1 submitted 28 June, 2024;
originally announced June 2024.
-
Reducing phenotype-structured PDE models of cancer evolution to systems of ODEs: a generalised moment dynamics approach
Authors:
Chiara Villa,
Philip K Maini,
Alexander P Browning,
Adrianne L Jenner,
Sara Hamis,
Tyler Cassidy
Abstract:
Intratumour phenotypic heterogeneity is nowadays understood to play a critical role in disease progression and treatment failure. Accordingly, there has been increasing interest in the development of mathematical models capable of capturing its role in cancer cell adaptation. This can be systematically achieved by means of models comprising phenotype-structured nonlocal partial differential equations, tracking the evolution of the phenotypic density distribution of the cell population, which may be compared to gene and protein expression distributions obtained experimentally. Nevertheless, given the high analytical and computational cost of solving these models, much is to be gained from reducing them to systems of ordinary differential equations for the moments of the distribution. We propose a generalised method of model-reduction, relying on the use of a moment generating function, Taylor series expansion and truncation closure, to reduce a nonlocal reaction-advection-diffusion equation, with general phenotypic drift and proliferation rate functions, to a system of moment equations up to arbitrary order. Our method extends previous results in the literature, which we address via two examples, by removing any a priori assumption on the shape of the distribution, and provides a flexible framework for mathematical modellers to account for the role of phenotypic heterogeneity in cancer adaptive dynamics, in a simpler mathematical framework.
Submitted 9 April, 2025; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Structural identifiability analysis of linear reaction-advection-diffusion processes in mathematical biology
Authors:
Alexander P Browning,
Maria Tască,
Carles Falcó,
Ruth E Baker
Abstract:
Effective application of mathematical models to interpret biological data and make accurate predictions often requires that model parameters are identifiable. Approaches to assess the so-called structural identifiability of models are well-established for ordinary differential equation models, yet there are no commonly adopted approaches that can be applied to assess the structural identifiability of the partial differential equation (PDE) models that are requisite to capture spatial features inherent to many phenomena. The differential algebra approach to structural identifiability has recently been demonstrated to be applicable to several specific PDE models. In this brief article, we present general methodology for performing structural identifiability analysis on partially observed reaction-advection-diffusion (RAD) PDE models that are linear in the unobserved quantities. We show that the differential algebra approach can always, in theory, be applied to such models. Moreover, despite the perceived complexity introduced by the addition of advection and diffusion terms, spatial analogues of non-spatial models cannot be less structurally identifiable than their non-spatial counterparts. We conclude by discussing future possibilities and the computational cost of performing structural identifiability analysis on more general PDE models.
Submitted 27 February, 2024; v1 submitted 26 September, 2023;
originally announced September 2023.
-
Smoothing in linear multicompartment biological processes subject to stochastic input
Authors:
Alexander P Browning,
Adrianne L Jenner,
Ruth E Baker,
Philip K Maini
Abstract:
Many physical and biological systems rely on the progression of material through multiple independent stages. In viral replication, for example, virions enter a cell to undergo a complex process comprising several disparate stages before the eventual accumulation and release of replicated virions. While such systems may have some control over the internal dynamics that make up this progression, a challenge for many is to regulate behaviour under what are often highly variable external environments acting as system inputs. In this work, we study a simple analogue of this problem through a linear multicompartment model subject to a stochastic input in the form of a mean-reverting Ornstein-Uhlenbeck process, a type of Gaussian process. By expressing the system as a multidimensional Gaussian process, we derive several closed-form analytical results relating to the covariances and autocorrelations of the system, quantifying the smoothing effect discrete compartments afford multicompartment systems. Semi-analytical results demonstrate that feedback and feedforward loops can enhance system robustness, and simulation results probe the intractable problem of the first passage time distribution, which has specific relevance to eventual cell lysis in the viral replication cycle. Finally, we demonstrate that the smoothing seen in the process is a consequence of the discreteness of the system, and does not manifest in systems with continuous transport. While we make progress through analysis of a simple linear problem, many of our insights are applicable more generally, and our work enables future analysis into multicompartment processes subject to stochastic inputs.
Submitted 2 April, 2024; v1 submitted 3 May, 2023;
originally announced May 2023.
-
Geometric analysis enables biological insight from complex non-identifiable models using simple surrogates
Authors:
Alexander P Browning,
Matthew J Simpson
Abstract:
An enduring challenge in computational biology is to balance data quality and quantity with model complexity. Tools such as identifiability analysis and information criteria have been developed to harmonise this juxtaposition, yet cannot always resolve the mismatch between available data and the granularity required in mathematical models to answer important biological questions. Often, it is only simple phenomenological models, such as the logistic and Gompertz growth models, that are identifiable from standard experimental measurements. To draw insights from the complex, non-identifiable models that incorporate key biological mechanisms of interest, we study the geometry of a map in parameter space from the complex model to a simple, identifiable, surrogate model. By studying how non-identifiable parameters in the complex model quantitatively relate to identifiable parameters in the surrogate, we introduce and exploit a layer of interpretation between the set of non-identifiable parameters and the goodness-of-fit metric or likelihood studied in typical identifiability analysis. We demonstrate our approach by analysing a hierarchy of mathematical models for multicellular tumour spheroid growth. Typical data from tumour spheroid experiments are limited and noisy, and corresponding mathematical models are very often made arbitrarily complex. Our geometric approach is able to predict non-identifiabilities, subset non-identifiable parameter spaces into identifiable parameter combinations that relate to individual data features, and overall provide additional biological insight from complex non-identifiable models.
Submitted 3 August, 2022;
originally announced August 2022.
-
Efficient inference and identifiability analysis for differential equation models with random parameters
Authors:
Alexander P. Browning,
Christopher Drovandi,
Ian W. Turner,
Adrianne L. Jenner,
Matthew J. Simpson
Abstract:
Heterogeneity is a dominant factor in the behaviour of many biological processes. Despite this, it is common for mathematical and statistical analyses to ignore biological heterogeneity as a source of variability in experimental data. Therefore, methods for exploring the identifiability of models that explicitly incorporate heterogeneity through variability in model parameters are relatively underdeveloped. We develop a new likelihood-based framework, based on moment matching, for inference and identifiability analysis of differential equation models that capture biological heterogeneity through parameters that vary according to probability distributions. As our novel method is based on an approximate likelihood function, it is highly flexible; we demonstrate identifiability analysis using both a frequentist approach based on profile likelihood, and a Bayesian approach based on Markov-chain Monte Carlo. Through three case studies, we demonstrate our method by providing a didactic guide to inference and identifiability analysis of hyperparameters that relate to the statistical moments of model parameters from independent observed data. Our approach has a computational cost comparable to analysis of models that neglect heterogeneity, a significant improvement over many existing alternatives. We demonstrate how analysis of random parameter models can aid better understanding of the sources of heterogeneity from biological data.
Submitted 27 October, 2022; v1 submitted 20 July, 2022;
originally announced July 2022.
-
Predicting radiotherapy patient outcomes with real-time clinical data using mathematical modelling
Authors:
Alexander P. Browning,
Thomas D. Lewin,
Ruth E. Baker,
Philip K. Maini,
Eduardo G. Moros,
Jimmy Caudell,
Helen M. Byrne,
Heiko Enderling
Abstract:
Longitudinal tumour volume data from head-and-neck cancer patients show that tumours of comparable pre-treatment size and stage may respond very differently to the same radiotherapy fractionation protocol. Mathematical models are often proposed to predict treatment outcome in this context, and have the potential to guide clinical decision-making and inform personalised fractionation protocols. Hindering effective use of models in this context is the sparsity of clinical measurements juxtaposed with the model complexity required to produce the full range of possible patient responses. In this work, we present a compartment model of tumour volume and tumour composition, which, despite relative simplicity, is capable of producing a wide range of patient responses. We then develop novel statistical methodology and leverage a cohort of existing clinical data to produce a predictive model of both tumour volume progression and the associated level of uncertainty that evolves throughout a patient's course of treatment. To capture inter-patient variability, all model parameters are patient specific, with a bootstrap particle filter-like Bayesian approach developed to model a set of training data as prior knowledge. We validate our approach against a subset of unseen data, and demonstrate both the predictive ability of our trained model and its limitations.
Submitted 13 December, 2023; v1 submitted 6 January, 2022;
originally announced January 2022.
-
Profile likelihood analysis for a stochastic model of diffusion in heterogeneous media
Authors:
Matthew J Simpson,
Alexander P Browning,
Christopher Drovandi,
Elliot J Carr,
Oliver J Maclaren,
Ruth E Baker
Abstract:
We compute profile likelihoods for a stochastic model of diffusive transport motivated by experimental observations of heat conduction in layered skin tissues. This process is modelled as a random walk in a layered one-dimensional material, where each layer has a distinct particle hopping rate. Particles are released at some location, and the duration of time taken for each particle to reach an absorbing boundary is recorded. To explore whether these data can be used to identify the hopping rates in each layer, we compute various profile likelihoods using two methods: first, an exact likelihood is evaluated using a relatively expensive Markov chain approach; and, second, we form an approximate likelihood by assuming the distribution of exit times is given by a Gamma distribution whose first two moments match the expected moments from the continuum limit description of the stochastic model. Using the exact and approximate likelihoods we construct various profile likelihoods for a range of problems. In cases where parameter values are not identifiable, we make progress by re-interpreting those data with a reduced model with a smaller number of layers.
Submitted 9 March, 2021; v1 submitted 6 November, 2020;
originally announced November 2020.
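The Gamma moment-matching step described in the abstract above can be sketched as follows. This is a minimal illustration, assuming the mean and variance of the exit time are already available (in the paper they come from the continuum limit description, which is not reproduced here). The function names `gamma_from_moments` and `gamma_log_likelihood` are hypothetical and are not taken from the authors' code.

```python
import math


def gamma_from_moments(mean, variance):
    """Return (shape, scale) of the Gamma distribution with the given mean and variance."""
    if mean <= 0 or variance <= 0:
        raise ValueError("mean and variance must be positive")
    # For Gamma(shape, scale): mean = shape * scale, variance = shape * scale**2.
    shape = mean**2 / variance
    scale = variance / mean
    return shape, scale


def gamma_log_likelihood(times, mean, variance):
    """Approximate log-likelihood of positive exit times under the moment-matched Gamma."""
    shape, scale = gamma_from_moments(mean, variance)
    # Sum of Gamma log-densities over independent observations.
    return sum(
        (shape - 1) * math.log(t) - t / scale
        - math.lgamma(shape) - shape * math.log(scale)
        for t in times
    )
```

In a profile likelihood calculation, a function like `gamma_log_likelihood` would be evaluated with `mean` and `variance` recomputed from the model for each candidate set of hopping rates, then maximised over the nuisance parameters.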