-
Approximation and inference methods for stochastic biochemical kinetics - a tutorial review
Authors:
David Schnoerr,
Guido Sanguinetti,
Ramon Grima
Abstract:
Stochastic fluctuations of molecule numbers are ubiquitous in biological systems. Important examples include gene expression and enzymatic processes in living cells. Such systems are typically modelled as chemical reaction networks whose dynamics are governed by the Chemical Master Equation. Despite its simple structure, no analytic solutions to the Chemical Master Equation are known for most syst…
▽ More
Stochastic fluctuations of molecule numbers are ubiquitous in biological systems. Important examples include gene expression and enzymatic processes in living cells. Such systems are typically modelled as chemical reaction networks whose dynamics are governed by the Chemical Master Equation. Despite its simple structure, no analytic solutions to the Chemical Master Equation are known for most systems. Moreover, stochastic simulations are computationally expensive, making systematic analysis and statistical inference a challenging task. Consequently, significant effort has been spent in recent decades on the development of efficient approximation and inference methods. This article gives an introduction to basic modelling concepts as well as an overview of state of the art methods. First, we motivate and introduce deterministic and stochastic methods for modelling chemical networks, and give an overview of simulation and exact solution methods. Next, we discuss several approximation methods, including the chemical Langevin equation, the system size expansion, moment closure approximations, time-scale separation approximations and hybrid methods. We discuss their various properties and review recent advances and remaining challenges for these methods. We present a comparison of several of these methods by means of a numerical case study and highlight some of their respective advantages and disadvantages. Finally, we discuss the problem of inference from experimental data in the Bayesian framework and review recent methods developed the literature. In summary, this review gives a self-contained introduction to modelling, approximations and inference methods for stochastic chemical kinetics.
△ Less
Submitted 12 January, 2017; v1 submitted 23 August, 2016;
originally announced August 2016.
-
Fast simulation of Brownian dynamics in a crowded environment
Authors:
Stephen Smith,
Ramon Grima
Abstract:
Brownian dynamics simulations are an increasingly popular tool for understanding spatially-distributed biochemical reaction systems. Recent improvements in our understanding of the cellular environment show that volume exclusion effects are fundamental to reaction networks inside cells. These systems are frequently studied by incorporating inert hard spheres (crowders) into three-dimensional Brown…
▽ More
Brownian dynamics simulations are an increasingly popular tool for understanding spatially-distributed biochemical reaction systems. Recent improvements in our understanding of the cellular environment show that volume exclusion effects are fundamental to reaction networks inside cells. These systems are frequently studied by incorporating inert hard spheres (crowders) into three-dimensional Brownian dynamics simulations, however these methods are extremely slow owing to the sheer number of possible collisions between particles. Here we propose a rigorous "crowder-free" method to dramatically increase simulation speed for crowded biochemical reaction systems by eliminating the need to explicitly simulate the crowders. We consider both the case where the reactive particles are point particles, and where they themselves occupy a volume. We use simulations of simple chemical reaction networks to confirm that our simplification is just as accurate as the original algorithm, and that it corresponds to a large speed increase.
△ Less
Submitted 28 May, 2016;
originally announced May 2016.
-
An exact solution to Brownian dynamics of a reversible bimolecular reaction in one dimension
Authors:
Stephen Smith,
Ramon Grima
Abstract:
Brownian dynamics is a popular fine-grained method for simulating systems of interacting particles, such as chemical reactions. Though the method is simple to simulate, it is generally assumed that the dynamics is impossible to solve exactly and analytically, aside from some trivial systems. We here give the first exact analytical solution to a non-trivial Brownian dynamics system: the reaction…
▽ More
Brownian dynamics is a popular fine-grained method for simulating systems of interacting particles, such as chemical reactions. Though the method is simple to simulate, it is generally assumed that the dynamics is impossible to solve exactly and analytically, aside from some trivial systems. We here give the first exact analytical solution to a non-trivial Brownian dynamics system: the reaction $A+B\xrightleftharpoons[]{}C$ in equilibrium in one-dimensional periodic space. The solution is a function of the particles' diffusion coefficients, radii, length of space and unbinding distance.
△ Less
Submitted 16 May, 2016;
originally announced May 2016.
-
Cox process representation and inference for stochastic reaction-diffusion processes
Authors:
David Schnoerr,
Ramon Grima,
Guido Sanguinetti
Abstract:
Complex behaviour in many systems arises from the stochastic interactions of spatially distributed particles or agents. Stochastic reaction-diffusion processes are widely used to model such behaviour in disciplines ranging from biology to the social sciences, yet they are notoriously difficult to simulate and calibrate to observational data. Here we use ideas from statistical physics and machine l…
▽ More
Complex behaviour in many systems arises from the stochastic interactions of spatially distributed particles or agents. Stochastic reaction-diffusion processes are widely used to model such behaviour in disciplines ranging from biology to the social sciences, yet they are notoriously difficult to simulate and calibrate to observational data. Here we use ideas from statistical physics and machine learning to provide a solution to the inverse problem of learning a stochastic reaction-diffusion process from data. Our solution relies on a non-trivial connection between stochastic reaction-diffusion processes and spatio-temporal Cox processes, a well-studied class of models from computational statistics. This connection leads to an efficient and flexible algorithm for parameter inference and model selection. Our approach shows excellent accuracy on numeric and real data examples from systems biology and epidemiology. Our work provides both insights into spatio-temporal stochastic systems, and a practical solution to a long-standing problem in computational modelling.
△ Less
Submitted 22 August, 2016; v1 submitted 8 January, 2016;
originally announced January 2016.
-
Molecular finite-size effects in stochastic models of equilibrium chemical systems
Authors:
Claudia Cianci,
Stephen Smith,
Ramon Grima
Abstract:
The reaction-diffusion master equation (RDME) is a standard modelling approach for understanding stochastic and spatial chemical kinetics. An inherent assumption is that molecules are point-like. Here we introduce the crowded reaction-diffusion master equation (cRDME) which takes into account volume exclusion effects on stochastic kinetics due to a finite molecular radius. We obtain an exact close…
▽ More
The reaction-diffusion master equation (RDME) is a standard modelling approach for understanding stochastic and spatial chemical kinetics. An inherent assumption is that molecules are point-like. Here we introduce the crowded reaction-diffusion master equation (cRDME) which takes into account volume exclusion effects on stochastic kinetics due to a finite molecular radius. We obtain an exact closed form solution of the RDME and of the cRDME for a general chemical system in equilibrium conditions. The difference between the two solutions increases with the ratio of molecular diameter to the compartment length scale. We show that an increase in molecular crowding can (i) lead to deviations from the classical inverse square root law for the noise-strength; (ii) flip the skewness of the probability distribution from right to left-skewed; (iii) shift the equilibrium of bimolecular reactions so that more product molecules are formed; (iv) strongly modulate the Fano factors and coefficients of variation. These crowding-induced effects are found to be particularly pronounced for chemical species not involved in chemical conservation laws.Finally we show that statistics obtained using the vRDME are in good agreement with those obtained from Brownian dynamics with excluded volume interactions.
△ Less
Submitted 27 January, 2016; v1 submitted 13 October, 2015;
originally announced October 2015.
-
Distribution approximations for the chemical master equation: comparison of the method of moments and the system size expansion
Authors:
Alexander Andreychenko,
Luca Bortolussi,
Ramon Grima,
Philipp Thomas,
Verena Wolf
Abstract:
The stochastic nature of chemical reactions involving randomly fluctuating population sizes has lead to a growing research interest in discrete-state stochastic models and their analysis. A widely-used approach is the description of the temporal evolution of the system in terms of a chemical master equation (CME). In this paper we study two approaches for approximating the underlying probability d…
▽ More
The stochastic nature of chemical reactions involving randomly fluctuating population sizes has lead to a growing research interest in discrete-state stochastic models and their analysis. A widely-used approach is the description of the temporal evolution of the system in terms of a chemical master equation (CME). In this paper we study two approaches for approximating the underlying probability distributions of the CME. The first approach is based on an integration of the statistical moments and the reconstruction of the distribution based on the maximum entropy principle. The second approach relies on an analytical approximation of the probability distribution of the CME using the system size expansion, considering higher-order terms than the linear noise approximation. We consider gene expression networks with unimodal and multimodal protein distributions to compare the accuracy of the two approaches. We find that both methods provide accurate approximations to the distributions of the CME while having different benefits and limitations in applications.
△ Less
Submitted 30 September, 2015;
originally announced September 2015.
-
The linear-noise approximation and the chemical master equation exactly agree up to second-order moments for a class of chemical systems
Authors:
Ramon Grima
Abstract:
It is well known that the linear-noise approximation (LNA) exactly agrees with the chemical master equation, up to second-order moments, for chemical systems composed of zero and first-order reactions. Here we show that this is also a property of the LNA for a subset of chemical systems with second-order reactions. This agreement is independent of the number of interacting molecules.
It is well known that the linear-noise approximation (LNA) exactly agrees with the chemical master equation, up to second-order moments, for chemical systems composed of zero and first-order reactions. Here we show that this is also a property of the LNA for a subset of chemical systems with second-order reactions. This agreement is independent of the number of interacting molecules.
△ Less
Submitted 2 September, 2015;
originally announced September 2015.
-
Approximate probability distributions of the master equation
Authors:
Philipp Thomas,
Ramon Grima
Abstract:
Master equations are common descriptions of mesoscopic systems. Analytical solutions to these equations can rarely be obtained. We here derive an analytical approximation of the time-dependent probability distribution of the master equation using orthogonal polynomials. The solution is given in two alternative formulations: a series with continuous and a series with discrete support both of which…
▽ More
Master equations are common descriptions of mesoscopic systems. Analytical solutions to these equations can rarely be obtained. We here derive an analytical approximation of the time-dependent probability distribution of the master equation using orthogonal polynomials. The solution is given in two alternative formulations: a series with continuous and a series with discrete support both of which can be systematically truncated. While both approximations satisfy the system size expansion of the master equation, the continuous distribution approximations become increasingly negative and tend to oscillations with increasing truncation order. In contrast, the discrete approximations rapidly converge to the underlying non-Gaussian distributions. The theory is shown to lead to particularly simple analytical expressions for the probability distributions of molecule numbers in metabolic reactions and gene expression systems.
△ Less
Submitted 2 October, 2015; v1 submitted 13 November, 2014;
originally announced November 2014.
-
System size expansion using Feynman rules and diagrams
Authors:
Philipp Thomas,
Christian Fleck,
Ramon Grima,
Nikola Popović
Abstract:
Few analytical methods exist for quantitative studies of large fluctuations in stochastic systems. In this article, we develop a simple diagrammatic approach to the Chemical Master Equation that allows us to calculate multi-time correlation functions which are accurate to a any desired order in van Kampen's system size expansion. Specifically, we present a set of Feynman rules from which this diag…
▽ More
Few analytical methods exist for quantitative studies of large fluctuations in stochastic systems. In this article, we develop a simple diagrammatic approach to the Chemical Master Equation that allows us to calculate multi-time correlation functions which are accurate to a any desired order in van Kampen's system size expansion. Specifically, we present a set of Feynman rules from which this diagrammatic perturbation expansion can be constructed algorithmically. We then apply the methodology to derive in closed form the leading order corrections to the linear noise approximation of the intrinsic noise power spectrum for general biochemical reaction networks. Finally, we illustrate our results by describing noise-induced oscillations in the Brusselator reaction scheme which are not captured by the common linear noise approximation.
△ Less
Submitted 4 September, 2014;
originally announced September 2014.
-
The complex chemical Langevin equation
Authors:
David Schnoerr,
Guido Sanguinetti,
Ramon Grima
Abstract:
The chemical Langevin equation (CLE) is a popular simulation method to probe the stochastic dynamics of chemical systems. The CLE's main disadvantage is its break down in finite time due to the problem of evaluating square roots of negative quantities whenever the molecule numbers become sufficiently small. We show that this issue is not a numerical integration problem, rather in many systems it i…
▽ More
The chemical Langevin equation (CLE) is a popular simulation method to probe the stochastic dynamics of chemical systems. The CLE's main disadvantage is its break down in finite time due to the problem of evaluating square roots of negative quantities whenever the molecule numbers become sufficiently small. We show that this issue is not a numerical integration problem, rather in many systems it is intrinsic to all representations of the CLE. Various methods of correcting the CLE have been proposed which avoid its break down. We show that these methods introduce undesirable artefacts in the CLE's predictions. In particular, for unimolecular systems, these correction methods lead to CLE predictions for the mean concentrations and variance of fluctuations which disagree with those of the chemical master equation. We show that, by extending the domain of the CLE to complex space, break down is eliminated, and the CLE's accuracy for unimolecular systems is restored. Although the molecule numbers are generally complex, we show that the "complex CLE" predicts real-valued quantities for the mean concentrations, the moments of intrinsic noise, power spectra and first passage times, hence admitting a physical interpretation. It is also shown to provide a more accurate approximation of the chemical master equation of simple biochemical circuits involving bimolecular reactions than the various corrected forms of the real-valued CLE, the linear-noise approximation and a commonly used two moment-closure approximation.
△ Less
Submitted 21 July, 2014; v1 submitted 10 June, 2014;
originally announced June 2014.
-
Rigorous elimination of fast stochastic variables from the linear noise approximation using projection operators
Authors:
Philipp Thomas,
Ramon Grima,
Arthur V. Straube
Abstract:
The linear noise approximation (LNA) offers a simple means by which one can study intrinsic noise in monostable biochemical networks. Using simple physical arguments, we have recently introduced the slow-scale LNA (ssLNA) which is a reduced version of the LNA under conditions of timescale separation. In this paper, we present the first rigorous derivation of the ssLNA using the projection operator…
▽ More
The linear noise approximation (LNA) offers a simple means by which one can study intrinsic noise in monostable biochemical networks. Using simple physical arguments, we have recently introduced the slow-scale LNA (ssLNA) which is a reduced version of the LNA under conditions of timescale separation. In this paper, we present the first rigorous derivation of the ssLNA using the projection operator technique and show that the ssLNA follows uniquely from the standard LNA under the same conditions of timescale separation as those required for the deterministic quasi-steady state approximation. We also show that the large molecule number limit of several common stochastic model reduction techniques under timescale separation conditions constitutes a special case of the ssLNA.
△ Less
Submitted 24 September, 2012;
originally announced September 2012.
-
Limitations of the stochastic quasi-steady-state approximation in open biochemical reaction networks
Authors:
Philipp Thomas,
Arthur V. Straube,
Ramon Grima
Abstract:
The application of the quasi-steady-state approximation to the Michaelis-Menten reaction embedded in large open chemical reaction networks is a popular model reduction technique in deterministic and stochastic simulations of biochemical reactions inside cells. It is frequently assumed that the predictions of the reduced master equations obtained using the stochastic quasi-steady-state approach are…
▽ More
The application of the quasi-steady-state approximation to the Michaelis-Menten reaction embedded in large open chemical reaction networks is a popular model reduction technique in deterministic and stochastic simulations of biochemical reactions inside cells. It is frequently assumed that the predictions of the reduced master equations obtained using the stochastic quasi-steady-state approach are in very good agreement with the predictions of the full master equations, provided the conditions for the validity of the deterministic quasi-steady-state approximation are fulfilled. We here use the linear-noise approximation to show that this assumption is not generally justified for the Michaelis-Menten reaction with substrate input, the simplest example of an open embedded enzyme reaction. The reduced master equation approach is found to considerably overestimate the size of intrinsic noise at low copy numbers of molecules. A simple formula is obtained for the relative error between the predictions of the reduced and full master equations for the variance of the substrate concentration fluctuations. The maximum error is reached when modeling moderately or highly efficient enzymes, in which case the error is approximately 30%. The theoretical predictions are validated by stochastic simulations using experimental parameter values for enzymes involved in proteolysis, gluconeogenesis and fermentation.
△ Less
Submitted 30 October, 2011;
originally announced October 2011.
-
Stochastic theory of large-scale enzyme-reaction networks: Finite copy number corrections to rate equation models
Authors:
Philipp Thomas,
Arthur V. Straube,
Ramon Grima
Abstract:
Chemical reactions inside cells occur in compartment volumes in the range of atto- to femtolitres. Physiological concentrations realized in such small volumes imply low copy numbers of interacting molecules with the consequence of considerable fluctuations in the concentrations. In contrast, rate equation models are based on the implicit assumption of infinitely large numbers of interacting molecu…
▽ More
Chemical reactions inside cells occur in compartment volumes in the range of atto- to femtolitres. Physiological concentrations realized in such small volumes imply low copy numbers of interacting molecules with the consequence of considerable fluctuations in the concentrations. In contrast, rate equation models are based on the implicit assumption of infinitely large numbers of interacting molecules, or equivalently, that reactions occur in infinite volumes at constant macroscopic concentrations. In this article we compute the finite-volume corrections (or equivalently the finite copy number corrections) to the solutions of the rate equations for chemical reaction networks composed of arbitrarily large numbers of enzyme-catalyzed reactions which are confined inside a small sub-cellular compartment. This is achieved by applying a mesoscopic version of the quasi-steady state assumption to the exact Fokker-Planck equation associated with the Poisson Representation of the chemical master equation. The procedure yields impressively simple and compact expressions for the finite-volume corrections. We prove that the predictions of the rate equations will always underestimate the actual steady-state substrate concentrations for an enzyme-reaction network confined in a small volume. In particular we show that the finite-volume corrections increase with decreasing sub-cellular volume, decreasing Michaelis-Menten constants and increasing enzyme saturation. The magnitude of the corrections depends sensitively on the topology of the network. The predictions of the theory are shown to be in excellent agreement with stochastic simulations for two types of networks typically associated with protein methylation and metabolism.
△ Less
Submitted 25 July, 2011;
originally announced July 2011.
-
How accurate are the non-linear chemical Fokker-Planck and chemical Langevin equations?
Authors:
Ramon Grima,
Philipp Thomas,
Arthur V. Straube
Abstract:
The chemical Fokker-Planck equation and the corresponding chemical Langevin equation are commonly used approximations of the chemical master equation. These equations are derived from an uncontrolled, second-order truncation of the Kramers-Moyal expansion of the chemical master equation and hence their accuracy remains to be clarified. We use the system-size expansion to show that chemical Fokker-…
▽ More
The chemical Fokker-Planck equation and the corresponding chemical Langevin equation are commonly used approximations of the chemical master equation. These equations are derived from an uncontrolled, second-order truncation of the Kramers-Moyal expansion of the chemical master equation and hence their accuracy remains to be clarified. We use the system-size expansion to show that chemical Fokker-Planck estimates of the mean concentrations and of the variance of the concentration fluctuations about the mean are accurate to order $Ω^{-3/2}$ for reaction systems which do not obey detailed balance and at least accurate to order $Ω^{-2}$ for systems obeying detailed balance, where $Ω$ is the characteristic size of the system. Hence the chemical Fokker-Planck equation turns out to be more accurate than the linear-noise approximation of the chemical master equation (the linear Fokker-Planck equation) which leads to mean concentration estimates accurate to order $Ω^{-1/2}$ and variance estimates accurate to order $Ω^{-3/2}$. This higher accuracy is particularly conspicuous for chemical systems realized in small volumes such as biochemical reactions inside cells. A formula is also obtained for the approximate size of the relative errors in the concentration and variance predictions of the chemical Fokker-Planck equation, where the relative error is defined as the difference between the predictions of the chemical Fokker-Planck equation and the master equation divided by the prediction of the master equation. For dimerization and enzyme-catalyzed reactions, the errors are typically less than few percent even when the steady-state is characterized by merely few tens of molecules.
△ Less
Submitted 28 July, 2011; v1 submitted 24 June, 2011;
originally announced June 2011.
-
Phase Transitions and superuniversality in the dynamics of a self-driven particle
Authors:
R. Grima
Abstract:
We study an active random walker model in which a particle's motion is determined by a self-generated field. The field encodes information about the particle's path history. This leads to either self-attractive or self-repelling behavior. For self-repelling behavior, we find a phase transition in the dynamics: when the coupling between the field and the walker exceeds a critical value, the parti…
▽ More
We study an active random walker model in which a particle's motion is determined by a self-generated field. The field encodes information about the particle's path history. This leads to either self-attractive or self-repelling behavior. For self-repelling behavior, we find a phase transition in the dynamics: when the coupling between the field and the walker exceeds a critical value, the particle's behavior changes from renormalized diffusion to one characterized by a diverging diffusion coefficient. The dynamical behavior for all cases is surprisingly independent of dimension and of the noise amplitude.
△ Less
Submitted 20 July, 2006;
originally announced July 2006.
-
Accurate discretization of advection-diffusion equations
Authors:
R. Grima,
T. J. Newman
Abstract:
We present an exact mathematical transformation which converts a wide class of advection-diffusion equations into a form allowing simple and direct spatial discretization in all dimensions, and thus the construction of accurate and more efficient numerical algorithms. These discretized forms can also be viewed as master equations which provides an alternative mesoscopic interpretation of advecti…
▽ More
We present an exact mathematical transformation which converts a wide class of advection-diffusion equations into a form allowing simple and direct spatial discretization in all dimensions, and thus the construction of accurate and more efficient numerical algorithms. These discretized forms can also be viewed as master equations which provides an alternative mesoscopic interpretation of advection-diffusion processes in terms of diffusion with spatially varying hopping rates.
△ Less
Submitted 13 December, 2004;
originally announced December 2004.