-
Integrating protein sequence embeddings with structure via graph-based deep learning for the prediction of single-residue properties
Authors:
Kevin Michalewicz,
Mauricio Barahona,
Barbara Bravi
Abstract:
Understanding the intertwined contributions of amino acid sequence and spatial structure is essential to explain protein behaviour. Here, we introduce INFUSSE (Integrated Network Framework Unifying Structure and Sequence Embeddings), a Deep Learning framework that combines sequence embeddings, generated by a Large Language Model (LLM), with graph-based representations of protein structures, integrated through a diffusive Graph Convolutional Network (diff-GCN), to predict single-residue properties within proteins. Our approach follows two steps. First, we fine-tune LLM sequence embeddings obtained from bidirectional transformers to make predictions from protein sequence alone. Second, we combine these enriched sequence representations with a geometric graph Laplacian within diff-GCN to refine the initial predictions. This approach leads to improved predictions while allowing us to systematically disentangle the contribution of sequence and structure. We illustrate our framework by applying it to the prediction of local residue flexibility (B-factors) of antibody-antigen complexes, and show that it provides improved performance compared to current Machine Learning (ML) approaches. The addition of structural information via geometric graphs is shown to enhance predictions especially for intrinsically disordered regions, protein-protein interaction sites, and highly variable amino acid positions.
Submitted 24 February, 2025;
originally announced February 2025.
-
ANTIPASTI: interpretable prediction of antibody binding affinity exploiting Normal Modes and Deep Learning
Authors:
Kevin Michalewicz,
Mauricio Barahona,
Barbara Bravi
Abstract:
The high binding affinity of antibodies towards their cognate targets is key to eliciting effective immune responses, as well as to the use of antibodies as research and therapeutic tools. Here, we propose ANTIPASTI, a Convolutional Neural Network model that achieves state-of-the-art performance in the prediction of antibody binding affinity using as input a representation of antibody-antigen structures in terms of Normal Mode correlation maps derived from Elastic Network Models. This representation captures not only structural features but energetic patterns of local and global residue fluctuations. The learnt representations are interpretable: they reveal similarities of binding patterns among antibodies targeting the same antigen type, and can be used to quantify the importance of antibody regions contributing to binding affinity. Our results show the importance of the antigen imprint in the Normal Mode landscape, and the dominance of cooperative effects and long-range correlations between antibody regions to determine binding affinity.
Submitted 2 October, 2024;
originally announced October 2024.
-
Moment-based parameter inference with error guarantees for stochastic reaction networks
Authors:
Zekai Li,
Mauricio Barahona,
Philipp Thomas
Abstract:
Inferring parameters of models of biochemical kinetics from single-cell data remains challenging because of the uncertainty arising from the intractability of the likelihood function of stochastic reaction networks. Such uncertainty falls beyond current error quantification measures, which focus on the effects of finite sample size and identifiability but lack theoretical guarantees when likelihood approximations are needed. Here, we propose a method for the inference of parameters of stochastic reaction networks that works for both steady-state and time-resolved data and is applicable to networks with non-linear and rational propensities. Our approach provides bounds on the parameters via convex optimisation over sets constrained by moment equations and moment matrices by taking observations to form moment intervals, which are then used to constrain parameters through convex sets. The bounds on the parameters contain the true parameters under the condition that the moment intervals contain the true moments, thus providing uncertainty quantification and error guarantees. Our approach does not need to predict moments and distributions for given parameters (i.e., it avoids solving or simulating the forward problem), and hence circumvents intractable likelihood computations or computationally expensive simulations. We demonstrate its use for uncertainty quantification, data integration and prediction of latent species statistics through synthetic data from common non-linear biochemical models including the Schlögl model and the toggle switch, a model of post-transcriptional regulation at steady state, and a birth-death model with time-dependent data.
Submitted 13 January, 2025; v1 submitted 25 June, 2024;
originally announced June 2024.
-
Interpretable statistical representations of neural population dynamics and geometry
Authors:
Adam Gosztolai,
Robert L. Peach,
Alexis Arnaudon,
Mauricio Barahona,
Pierre Vandergheynst
Abstract:
The dynamics of neuron populations commonly evolve on low-dimensional manifolds. Thus, we need methods that learn the dynamical processes over neural manifolds to infer interpretable and consistent latent representations. We introduce a representation learning method, MARBLE, that decomposes on-manifold dynamics into local flow fields and maps them into a common latent space using unsupervised geometric deep learning. In simulated non-linear dynamical systems, recurrent neural networks, and experimental single-neuron recordings from primates and rodents, we discover emergent low-dimensional latent representations that parametrise high-dimensional neural dynamics during gain modulation, decision-making, and changes in the internal state. These representations are consistent across neural networks and animals, enabling the robust comparison of cognitive computations. Extensive benchmarking demonstrates state-of-the-art within- and across-animal decoding accuracy of MARBLE compared with current representation learning approaches, with minimal user input. Our results suggest that manifold structure provides a powerful inductive bias to develop powerful decoding algorithms and assimilate data across experiments.
Submitted 24 September, 2024; v1 submitted 6 April, 2023;
originally announced April 2023.
-
Prediction of protein allosteric signalling pathways and functional residues through paths of optimised propensity
Authors:
Nan Wu,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
Allostery commonly refers to the mechanism that regulates protein activity through the binding of a molecule at a different, usually distal, site from the orthosteric site. The omnipresence of allosteric regulation in nature and its potential for drug design and screening render the study of allostery invaluable. Nevertheless, challenges remain as few computational methods are available to effectively predict allosteric sites, identify signalling pathways involved in allostery, or to aid with the design of suitable molecules targeting such sites. Recently, bond-to-bond propensity analysis has been shown successful at identifying allosteric sites for a large and diverse group of proteins from knowledge of the orthosteric sites and its ligands alone by using network analysis applied to energy-weighted atomistic protein graphs. To address the identification of signalling pathways, we propose here a method to compute and score paths of optimised propensity that link the orthosteric site with the identified allosteric sites, and identifies crucial residues that contribute to those paths. We showcase the approach with three well-studied allosteric proteins: h-Ras, caspase-1, and 3-phosphoinositide-dependent kinase-1 (PDK1). Key residues in both orthosteric and allosteric sites were identified and showed agreement with experimental results, and pivotal signalling residues along the pathway were also revealed, thus providing alternative targets for drug design. By using the computed path scores, we were also able to differentiate the activity of different allosteric modulators.
Submitted 14 July, 2022;
originally announced July 2022.
-
Computation of single-cell metabolite distributions using mixture models
Authors:
Mona K Tonn,
Philipp Thomas,
Mauricio Barahona,
Diego A Oyarzún
Abstract:
Metabolic heterogeneity is widely recognised as the next challenge in our understanding of non-genetic variation. A growing body of evidence suggests that metabolic heterogeneity may result from the inherent stochasticity of intracellular events. However, metabolism has been traditionally viewed as a purely deterministic process, on the basis that highly abundant metabolites tend to filter out stochastic phenomena. Here we bridge this gap with a general method for prediction of metabolite distributions across single cells. By exploiting the separation of time scales between enzyme expression and enzyme kinetics, our method produces estimates for metabolite distributions without the lengthy stochastic simulations that would be typically required for large metabolic models. The metabolite distributions take the form of Gaussian mixture models that are directly computable from single-cell expression data and standard deterministic models for metabolic pathways. The proposed mixture models provide a systematic method to predict the impact of biochemical parameters on metabolite distributions. Our method lays the groundwork for identifying the molecular processes that shape metabolic heterogeneity and its functional implications in disease.
Submitted 7 October, 2020;
originally announced October 2020.
-
Opportunities at the interface of network science and metabolic modelling
Authors:
Varshit Dusad,
Denise Thiel,
Mauricio Barahona,
Hector C. Keun,
Diego A. Oyarzún
Abstract:
Metabolism plays a central role in cell physiology because it provides the molecular machinery for growth. At the genome-scale, metabolism is made up of thousands of reactions interacting with one another. Untangling this complexity is key to understand how cells respond to genetic, environmental, or therapeutic perturbations. Here we discuss the roles of two complementary strategies for the analysis of genome-scale metabolic models: Flux Balance Analysis (FBA) and network science. While FBA estimates metabolic flux on the basis of an optimisation principle, network approaches reveal emergent properties of the global metabolic connectivity. We highlight how the integration of both approaches promises to deliver insights on the structure and function of metabolic systems with wide-ranging implications in discovery science, precision medicine and industrial biotechnology.
Submitted 17 December, 2020; v1 submitted 5 June, 2020;
originally announced June 2020.
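As a rough illustration of the optimisation principle behind Flux Balance Analysis mentioned in the abstract above, the sketch below maximises a biomass flux subject to steady-state mass balance, S v = 0, and flux bounds. The four-reaction network, its bounds and all variable names are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import numpy as np
from scipy.optimize import linprog

# Hypothetical toy network (an assumption, not from the paper):
#   v0: uptake -> A,  v1: A -> B,  v2: B -> biomass,  v3: A -> waste
S = np.array([
    [1.0, -1.0,  0.0, -1.0],   # mass balance of metabolite A
    [0.0,  1.0, -1.0,  0.0],   # mass balance of metabolite B
])

# Flux bounds: uptake is capped at 10 (arbitrary units), the rest are loose.
bounds = [(0.0, 10.0), (0.0, 1000.0), (0.0, 1000.0), (0.0, 1000.0)]

# linprog minimises its objective, so the biomass flux v2 is negated.
c = np.array([0.0, 0.0, -1.0, 0.0])

# Steady state: S @ v = 0 for every internal metabolite.
res = linprog(c, A_eq=S, b_eq=np.zeros(S.shape[0]), bounds=bounds, method="highs")
fluxes = res.x
biomass_flux = fluxes[2]
```

In this toy case the uptake cap limits the biomass flux, which is the kind of flux estimate that a network analysis of the same stoichiometry could then complement.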
-
HyperTraPS: Inferring probabilistic patterns of trait acquisition in evolutionary and disease progression pathways
Authors:
Sam F. Greenbury,
Mauricio Barahona,
Iain G. Johnston
Abstract:
The explosion of data throughout the biomedical sciences provides unprecedented opportunities to learn about the dynamics of evolution and disease progression, but harnessing these large and diverse datasets remains challenging. Here, we describe a highly generalisable statistical platform to infer the dynamic pathways by which many, potentially interacting, discrete traits are acquired or lost over time in biomedical systems. The platform uses HyperTraPS (hypercubic transition path sampling) to learn progression pathways from cross-sectional, longitudinal, or phylogenetically-linked data with unprecedented efficiency, readily distinguishing multiple competing pathways, and identifying the most parsimonious mechanisms underlying given observations. Its Bayesian structure quantifies uncertainty in pathway structure and allows interpretable predictions of behaviours, such as which symptom a patient will acquire next. We exploit the model's topology to provide visualisation tools for intuitive assessment of multiple, variable pathways. We apply the method to ovarian cancer progression and the evolution of multidrug resistance in tuberculosis, demonstrating its power to reveal previously undetected dynamic pathways.
Submitted 28 November, 2019;
originally announced December 2019.
-
An edge-based formulation of elastic network models
Authors:
Maxwell Hodges,
Sophia N Yaliraki,
Mauricio Barahona
Abstract:
We present an edge-based framework for the study of geometric elastic network models to model mechanical interactions in physical systems. We use a formulation in the edge space, instead of the usual node-centric approach, to characterise edge fluctuations of geometric networks defined in d-dimensional space and define the edge mechanical embeddedness, an edge mechanical susceptibility measuring the force felt on each edge given a force applied on the whole system. We further show that this formulation can be directly related to the infinitesimal rigidity of the network, which additionally permits three- and four-centre forces to be included in the network description. We exemplify the approach in protein systems, at both the residue and atomistic levels of description.
Submitted 14 November, 2019;
originally announced November 2019.
-
Allostery and cooperativity in multimeric proteins: bond-to-bond propensities in ATCase
Authors:
Maxwell Hodges,
Mauricio Barahona,
Sophia N. Yaliraki
Abstract:
Aspartate carbamoyltransferase (ATCase) is a large dodecameric enzyme with six active sites that exhibits allostery: its catalytic rate is modulated by the binding of various substrates at distal points from the active sites. A recently developed method, bond-to-bond propensity analysis, has proven capable of predicting allosteric sites in a wide range of proteins using an energy-weighted atomistic graph obtained from the protein structure and given knowledge only of the location of the active site. Bond-to-bond propensity establishes if energy fluctuations at given bonds have significant effects on any other bond in the protein, by considering their propagation through the protein graph. In this work, we use bond-to-bond propensity analysis to study different aspects of ATCase activity using three different protein structures and sources of fluctuations. First, we predict key residues and bonds involved in the transition between inactive (T) and active (R) states of ATCase by analysing allosteric substrate binding as a source of energy perturbations in the protein graph. Our computational results also indicate that the effect of multiple allosteric binding is non linear: a switching effect is observed after a particular number and arrangement of substrates is bound suggesting a form of long range communication between the distantly arranged allosteric sites. Second, cooperativity is explored by considering a bisubstrate analogue as the source of energy fluctuations at the active site, also leading to the identification of highly significant residues to the T-R transition that enhance cooperativity across active sites. Finally, the inactive (T) structure is shown to exhibit a strong, non linear communication between the allosteric sites and the interface between catalytic subunits, rather than the active site.
Submitted 27 September, 2019;
originally announced September 2019.
-
Stationary distributions of continuous-time Markov chains: a review of theory and truncation-based approximations
Authors:
Juan Kuntz,
Philipp Thomas,
Guy-Bart Stan,
Mauricio Barahona
Abstract:
Computing the stationary distributions of a continuous-time Markov chain (CTMC) involves solving a set of linear equations. In most cases of interest, the number of equations is infinite or too large, and the equations cannot be solved analytically or numerically. Several approximation schemes overcome this issue by truncating the state space to a manageable size. In this review, we first give a comprehensive theoretical account of the stationary distributions and their relation to the long-term behaviour of CTMCs that is readily accessible to non-experts and free of irreducibility assumptions made in standard texts. We then review truncation-based approximation schemes for CTMCs with infinite state spaces paying particular attention to the schemes' convergence and the errors they introduce, and we illustrate their performance with an example of a stochastic reaction network of relevance in biology and chemistry. We conclude by discussing computational trade-offs associated with error control and several open questions.
Submitted 24 August, 2020; v1 submitted 12 September, 2019;
originally announced September 2019.
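To make the idea of truncation concrete, here is a minimal sketch, not one of the specific schemes reviewed in the paper: a birth-death chain with constant birth rate and linear death rate is truncated to a finite set of states, and the stationary equations pi Q = 0 with sum(pi) = 1 are solved as a linear system. The function name, the rates and the truncation size are assumptions made for this example. For this particular chain the exact stationary distribution is Poisson with mean birth/death, which gives a reference for the truncation error.

```python
import numpy as np
from math import exp, factorial

def truncated_stationary(birth, death, n_states):
    """Stationary distribution of a birth-death CTMC truncated to {0, ..., n_states - 1}."""
    Q = np.zeros((n_states, n_states))
    for n in range(n_states):
        if n + 1 < n_states:
            Q[n, n + 1] = birth        # n -> n + 1 (dropped at the truncation boundary)
        if n > 0:
            Q[n, n - 1] = death * n    # n -> n - 1
        Q[n, n] = -Q[n].sum()          # each row of a rate matrix sums to zero
    # Solve pi Q = 0 with sum(pi) = 1 by replacing one balance
    # equation with the normalisation condition.
    A = Q.T.copy()
    A[-1, :] = 1.0
    b = np.zeros(n_states)
    b[-1] = 1.0
    return np.linalg.solve(A, b)

birth, death, n_states = 5.0, 1.0, 40
pi = truncated_stationary(birth, death, n_states)

# Exact stationary distribution of the untruncated chain: Poisson(birth / death).
poisson = np.array([exp(-birth / death) * (birth / death) ** n / factorial(n)
                    for n in range(n_states)])
max_error = np.abs(pi - poisson).max()
```

With 40 states the truncation discards a negligible tail of the Poisson distribution, so `max_error` is tiny here; shrinking `n_states` makes the error introduced by the truncation visible.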
-
Cellular memory enhances bacterial chemotactic navigation in rugged environments
Authors:
Adam Gosztolai,
Mauricio Barahona
Abstract:
The response of microbes to external signals is mediated by biochemical networks with intrinsic time scales. These time scales give rise to a memory that impacts cellular behaviour. Here we study theoretically the role of cellular memory in Escherichia coli chemotaxis. Using an agent-based model, we show that cells with memory navigating rugged chemoattractant landscapes can enhance their drift speed by extracting information from environmental correlations. Maximal advantage is achieved when the memory is comparable to the time scale of fluctuations as perceived during swimming. We derive an analytical approximation for the drift velocity in rugged landscapes that explains the enhanced velocity, and recovers standard Keller-Segel gradient-sensing results in the limits when memory and fluctuation time scales are well separated. Our numerics also show that cellular memory can induce bet-hedging at the population level resulting in long-lived multi-modal distributions in heterogeneous landscapes.
Submitted 13 February, 2020; v1 submitted 12 August, 2019;
originally announced August 2019.
-
Learning spatiotemporal signals using a recurrent spiking network that discretizes time
Authors:
Amadeus Maes,
Mauricio Barahona,
Claudia Clopath
Abstract:
Learning to produce spatiotemporal sequences is a common task that the brain has to solve. The same neural substrate may be used by the brain to produce different sequential behaviours. The way the brain learns and encodes such tasks remains unknown as current computational models do not typically use realistic biologically-plausible learning. Here, we propose a model where a spiking recurrent network of excitatory and inhibitory biophysical neurons drives a read-out layer: the dynamics of the driver recurrent network is trained to encode time which is then mapped through the read-out neurons to encode another dimension, such as space or a phase. Different spatiotemporal patterns can be learned and encoded through the synaptic weights to the read-out neurons that follow common Hebbian learning rules. We demonstrate that the model is able to learn spatiotemporal dynamics on time scales that are behaviourally relevant and we show that the learned sequences are robustly replayed during a regime of spontaneous activity.
Submitted 19 December, 2019; v1 submitted 20 July, 2019;
originally announced July 2019.
-
Stochastic modelling reveals mechanisms of metabolic heterogeneity
Authors:
Mona K. Tonn,
Philipp Thomas,
Mauricio Barahona,
Diego A Oyarzún
Abstract:
Phenotypic variation is a hallmark of cellular physiology. Metabolic heterogeneity, in particular, underpins single-cell phenomena such as microbial drug tolerance and growth variability. Much research has focussed on transcriptomic and proteomic heterogeneity, yet it remains unclear if such variation permeates to the metabolic state of a cell. Here we propose a stochastic model to show that complex forms of metabolic heterogeneity emerge from fluctuations in enzyme expression and catalysis. The analysis predicts clonal populations to split into two or more metabolically distinct subpopulations. We reveal mechanisms not seen in deterministic models, in which enzymes with unimodal expression distributions lead to metabolites with a bimodal or multimodal distribution across the population. Based on published data, the results suggest that metabolite heterogeneity may be more pervasive than previously thought. Our work casts light on links between gene expression and metabolism, and provides a theory to probe the sources of metabolite heterogeneity.
Submitted 29 January, 2019;
originally announced January 2019.
-
Collective search with finite perception: transient dynamics and search efficiency
Authors:
Adam Gosztolai,
Jose A. Carrillo,
Mauricio Barahona
Abstract:
Motile organisms often use finite spatial perception of their surroundings to navigate and search their habitats. Yet standard models of search are usually based on purely local sensory information. To model how a finite perceptual horizon affects ecological search, we propose a framework for optimal navigation that combines concepts from random walks and optimal control theory. We show that, while local strategies are optimal on asymptotically long and short search times, finite perception yields faster convergence and increased search efficiency over transient time scales relevant in biological systems. The benefit of the finite horizon can be maintained by the searchers tuning their response sensitivity to the length scale of the stimulant in the environment, and is enhanced when the agents interact as a result of increased consensus within subpopulations. Our framework sheds light on the role of spatial perception and transients in search movement and collective sensing of the environment.
Submitted 13 December, 2018; v1 submitted 17 September, 2018;
originally announced September 2018.
-
The exit time finite state projection scheme: bounding exit distributions and occupation measures of continuous-time Markov chains
Authors:
Juan Kuntz,
Philipp Thomas,
Guy-Bart Stan,
Mauricio Barahona
Abstract:
We introduce the exit time finite state projection (ETFSP) scheme, a truncation-based method that yields approximations to the exit distribution and occupation measure associated with the time of exit from a domain (i.e., the time of first passage to the complement of the domain) of time-homogeneous continuous-time Markov chains. We prove that: (i) the computed approximations bound the measures from below; (ii) the total variation distances between the approximations and the measures decrease monotonically as states are added to the truncation; and (iii) the scheme converges, in the sense that, as the truncation tends to the entire state space, the total variation distances tend to zero. Furthermore, we give a computable bound on the total variation distance between the exit distribution and its approximation, and we delineate the cases in which the bound is sharp. We also revisit the related finite state projection scheme and give a comprehensive account of its theoretical properties. We demonstrate the use of the ETFSP scheme by applying it to two biological examples: the computation of the first passage time associated with the expression of a gene, and the fixation times of competing species subject to demographic noise.
Submitted 25 January, 2019; v1 submitted 29 January, 2018;
originally announced January 2018.
-
GlnK facilitates the dynamic regulation of bacterial nitrogen assimilation
Authors:
Adam Gosztolai,
Jörg Schumacher,
Volker Behrends,
Jacob G Bundy,
Franziska Heydenreich,
Mark H Bennett,
Martin Buck,
Mauricio Barahona
Abstract:
Ammonium assimilation in E. coli is regulated by two paralogous proteins (GlnB and GlnK), which orchestrate interactions with regulators of gene expression, transport proteins and metabolic pathways. Yet how they conjointly modulate the activity of glutamine synthetase (GS), the key enzyme for nitrogen assimilation, is poorly understood. We combine experiments and theory to study the dynamic roles of GlnB and GlnK during nitrogen starvation and upshift. We measure time-resolved in vivo concentrations of metabolites, total and post-translationally modified proteins, and develop a concise biochemical model of GlnB and GlnK that incorporates competition for active and allosteric sites, as well as functional sequestration of GlnK. The model predicts the responses of GS, GlnB and GlnK under time-varying external ammonium level in the wild type and two genetic knock-outs. Our results show that GlnK is tightly regulated under nitrogen-rich conditions, yet it is expressed during ammonium run-out and starvation. This suggests a role for GlnK as a buffer of nitrogen shock after starvation, and provides a further functional link between nitrogen and carbon metabolisms.
Submitted 19 April, 2017;
originally announced April 2017.
-
Rigorous bounds on the stationary distributions of the chemical master equation via mathematical programming
Authors:
Juan Kuntz,
Philipp Thomas,
Guy-Bart Stan,
Mauricio Barahona
Abstract:
The stochastic dynamics of biochemical networks are usually modelled with the chemical master equation (CME). The stationary distributions of CMEs are seldom solvable analytically, and numerical methods typically produce estimates with uncontrolled errors. Here, we introduce mathematical programming approaches that yield approximations of these distributions with computable error bounds which enable the verification of their accuracy. First, we use semidefinite programming to compute increasingly tighter upper and lower bounds on the moments of the stationary distributions for networks with rational propensities. Second, we use these moment bounds to formulate linear programs that yield convergent upper and lower bounds on the stationary distributions themselves, their marginals and stationary averages. The bounds obtained also provide a computational test for the uniqueness of the distribution. In the unique case, the bounds form an approximation of the stationary distribution with a computable bound on its error. In the non-unique case, our approach yields converging approximations of the ergodic distributions. We illustrate our methodology through several biochemical examples taken from the literature: Schlögl's model for a chemical bifurcation, a two-dimensional toggle switch, a model for bursty gene expression, and a dimerisation model with multiple stationary distributions.
Submitted 25 June, 2019; v1 submitted 17 February, 2017;
originally announced February 2017.
-
Prediction of allosteric sites and mediating interactions through bond-to-bond propensities
Authors:
Benjamin R. C. Amor,
Michael T. Schaub,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
Allosteric regulation is central to many biochemical processes. Allosteric sites provide a target to fine-tune protein activity, yet we lack computational methods to predict them. Here, we present an efficient graph-theoretical approach for identifying allosteric sites and the mediating interactions that connect them to the active site. Using an atomistic graph with edges weighted by covalent and non-covalent bond energies, we obtain a bond-to-bond propensity that quantifies the effect of instantaneous bond fluctuations propagating through the protein. We use this propensity to detect the sites and communication pathways most strongly linked to the active site, assessing their significance through quantile regression and comparison against a reference set of 100 generic proteins. We exemplify our method in detail with three well-studied allosteric proteins: caspase-1, CheY, and h-Ras, correctly predicting the location of the allosteric site and identifying key allosteric interactions. Consistent prediction of allosteric sites is then attained in a further set of 17 proteins known to exhibit allostery. Because our propensity measure runs in almost linear time, it offers a scalable approach to high-throughput searches for candidate allosteric sites.
Submitted 31 May, 2016;
originally announced May 2016.
-
Stochastic models of gene transcription with upstream drives: exact solution and sample path characterization
Authors:
Justine Dattani,
Mauricio Barahona
Abstract:
Gene transcription is a highly stochastic and dynamic process. As a result, the mRNA copy number of a given gene is heterogeneous both between cells and across time. We present a framework to model gene transcription in populations of cells with time-varying (stochastic or deterministic) transcription and degradation rates. Such rates can be understood as upstream cellular drives representing the effect of different aspects of the cellular environment. We show that the full solution of the master equation contains two components: a model-specific, upstream effective drive, which encapsulates the effect of cellular drives (e.g., entrainment, periodicity or promoter randomness), and a downstream transcriptional Poissonian part, which is common to all models. Our analytical framework treats cell-to-cell and dynamic variability consistently, unifying several approaches in the literature. We apply the obtained solution to characterise different models of experimental relevance, and to explain the influence on gene transcription of synchrony, stationarity, ergodicity, as well as the effect of time-scales and other dynamic characteristics of drives. We also show how the solution can be applied to the analysis of noise sources in single-cell data, and to reduce the computational cost of stochastic simulations.
Submitted 8 January, 2017; v1 submitted 23 May, 2016;
originally announced May 2016.
-
Flux-dependent graphs for metabolic networks
Authors:
Mariano Beguerisse-Díaz,
Gabriel Bosque,
Diego Oyarzún,
Jesús Picó,
Mauricio Barahona
Abstract:
Cells adapt their metabolic fluxes in response to changes in the environment. We present a framework for the systematic construction of flux-based graphs derived from organism-wide metabolic networks. Our graphs encode the directionality of metabolic fluxes via edges that represent the flow of metabolites from source to target reactions. The methodology can be applied in the absence of a specific biological context by modelling fluxes probabilistically, or can be tailored to different environmental conditions by incorporating flux distributions computed through constraint-based approaches such as Flux Balance Analysis. We illustrate our approach on the central carbon metabolism of Escherichia coli and on a metabolic model of human hepatocytes. The flux-dependent graphs under various environmental conditions and genetic perturbations exhibit systemic changes in their topological and community structure, which capture the re-routing of metabolic fluxes and the varying importance of specific reactions and pathways. By integrating constraint-based models and tools from network science, our framework allows the study of context-specific metabolic responses at a system level beyond standard pathway descriptions.
Submitted 28 March, 2018; v1 submitted 5 May, 2016;
originally announced May 2016.
-
Flow-based network analysis of the Caenorhabditis elegans connectome
Authors:
Karol A. Bacik,
Michael T. Schaub,
Mariano Beguerisse-Díaz,
Yazan N. Billeh,
Mauricio Barahona
Abstract:
We exploit flow propagation on the directed neuronal network of the nematode Caenorhabditis elegans to reveal dynamically relevant features of its connectome. We find flow-based groupings of neurons at different levels of granularity, which we relate to functional and anatomical constituents of its nervous system. A systematic in silico evaluation of the full set of single and double neuron ablations is used to identify deletions that induce the most severe disruptions of the multi-resolution flow structure. Such ablations are linked to functionally relevant neurons, and suggest potential candidates for further in vivo investigation. In addition, we use the directional patterns of incoming and outgoing network flows at all scales to identify flow profiles for the neurons in the connectome, without pre-imposing a priori categories. The four flow roles identified are linked to signal propagation motivated by biological input-response scenarios.
Submitted 8 August, 2016; v1 submitted 2 November, 2015;
originally announced November 2015.
-
Emergence of slow-switching assemblies in structured neuronal networks
Authors:
Michael T. Schaub,
Yazan N. Billeh,
Costas A. Anastassiou,
Christof Koch,
Mauricio Barahona
Abstract:
Unraveling the interplay between connectivity and spatio-temporal dynamics in neuronal networks is a key step to advance our understanding of neuronal information processing. Here we investigate how particular features of network connectivity underpin the propensity of neural networks to generate slow-switching assembly (SSA) dynamics, i.e., sustained epochs of increased firing within assemblies of neurons which transition slowly between different assemblies throughout the network. We show that the emergence of SSA activity is linked to spectral properties of the asymmetric synaptic weight matrix. In particular, the leading eigenvalues that dictate the slow dynamics exhibit a gap with respect to the bulk of the spectrum, and the associated Schur vectors exhibit a measure of block-localization on groups of neurons, thus resulting in coherent dynamical activity on those groups. Through simple rate models, we gain analytical understanding of the origin and importance of the spectral gap, and use these insights to develop new network topologies with alternative connectivity paradigms which also display SSA activity. Specifically, SSA dynamics involving excitatory and inhibitory neurons can be achieved by modifying the connectivity patterns between both types of neurons. We also show that SSA activity can occur at multiple timescales reflecting a hierarchy in the connectivity, and demonstrate the emergence of SSA in small-world like networks. Our work provides a step towards understanding how network structure (uncovered through advancements in neuroanatomy and connectomics) can impact on spatio-temporal neural activity and constrain the resulting dynamics.
Submitted 20 July, 2015; v1 submitted 19 February, 2015;
originally announced February 2015.
-
Uncovering allosteric pathways in caspase-1 with Markov transient analysis and multiscale community detection
Authors:
B. Amor,
S. N. Yaliraki,
R. Woscholski,
M. Barahona
Abstract:
Allosteric regulation at distant sites is central to many cellular processes. In particular, allosteric sites in proteins are a major target to increase the range and selectivity of new drugs, and there is a need for methods capable of identifying intra-molecular signalling pathways leading to allosteric effects. Here, we use an atomistic graph-theoretical approach that exploits Markov transients to extract such pathways and exemplify our results in an important allosteric protein, caspase-1. Firstly, we use Markov Stability community detection to perform a multiscale analysis of the structure of caspase-1 which reveals that the active conformation has a weaker, less compartmentalised large-scale structure as compared to the inactive conformation, resulting in greater intra-protein coherence and signal propagation. We also carry out a full computational point mutagenesis and identify that only a few residues are critical to such structural coherence. Secondly, we characterise explicitly the transients of random walks originating at the active site and predict the location of a known allosteric site in this protein, quantifying the contribution of individual bonds to the communication pathway between the active and allosteric sites. Several of the bonds we find have been shown experimentally to be functionally critical, but we also predict a number of as yet unidentified bonds which may contribute to the pathway. Our approach offers a computationally inexpensive method for the identification of allosteric sites and communication pathways in proteins using a fully atomistic description.
Submitted 11 November, 2014;
originally announced November 2014.
-
Revealing cell assemblies at multiple levels of granularity
Authors:
Yazan N. Billeh,
Michael T. Schaub,
Costas A. Anastassiou,
Mauricio Barahona,
Christof Koch
Abstract:
Background: Current neuronal monitoring techniques, such as calcium imaging and multi-electrode arrays, enable recordings of spiking activity from hundreds of neurons simultaneously. Of primary importance in systems neuroscience is the identification of cell assemblies: groups of neurons that cooperate in some form within the recorded population.
New Method: We introduce a simple, integrated framework for the detection of cell-assemblies from spiking data without a priori assumptions about the size or number of groups present. We define a biophysically-inspired measure to extract a directed functional connectivity matrix between both excitatory and inhibitory neurons based on their spiking history. The resulting network representation is analyzed using the Markov Stability framework, a graph theoretical method for community detection across scales, to reveal groups of neurons that are significantly related in the recorded time-series at different levels of granularity.
Results and comparison with existing methods: Using synthetic spike-trains, including simulated data from leaky-integrate-and-fire networks, our method is able to identify important patterns in the data such as hierarchical structure that are missed by other standard methods. We further apply the method to experimental data from retinal ganglion cells of mouse and salamander, in which we identify cell-groups that correspond to known functional types, and to hippocampal recordings from rats exploring a linear track, where we detect place cells with high fidelity.
Conclusions: We present a versatile method to detect neural assemblies in spiking data applicable across a spectrum of relevant scales that contributes to understanding spatio-temporal information gathered from systems neuroscience experiments.
Submitted 8 November, 2014;
originally announced November 2014.
-
Finding role communities in directed networks using Role-Based Similarity, Markov Stability and the Relaxed Minimum Spanning Tree
Authors:
Mariano Beguerisse-Díaz,
Borislav Vangelov,
Mauricio Barahona
Abstract:
We present a framework to cluster nodes in directed networks according to their roles by combining Role-Based Similarity (RBS) and Markov Stability, two techniques based on flows. First we compute the RBS matrix, which contains the pairwise similarities between nodes according to the scaled number of in- and out-directed paths of different lengths. The weighted RBS similarity matrix is then transformed into an undirected similarity network using the Relaxed Minimum-Spanning Tree (RMST) algorithm, which uses the geometric structure of the RBS matrix to unblur the network, such that edges between nodes with high, direct RBS are preserved. Finally, we partition the RMST similarity network into role-communities of nodes at all scales using Markov Stability to find a robust set of roles in the network. We showcase our framework through a biological and a man-made network.
Submitted 6 September, 2013;
originally announced September 2013.
-
Toggling a Genetic Switch Using Reinforcement Learning
Authors:
Aivar Sootla,
Natalja Strelkowa,
Damien Ernst,
Mauricio Barahona,
Guy-Bart Stan
Abstract:
In this paper, we consider the problem of optimal exogenous control of gene regulatory networks. Our approach consists in adapting an established reinforcement learning algorithm called the fitted Q iteration. This algorithm infers the control law directly from the measurements of the system's response to external control inputs without the use of a mathematical model of the system. The measurement data set can either be collected from wet-lab experiments or artificially created by computer simulations of dynamical models of the system. The algorithm is applicable to a wide range of biological systems due to its ability to deal with nonlinear and stochastic system dynamics. To illustrate the application of the algorithm to a gene regulatory network, the regulation of the toggle switch system is considered. The control objective of this problem is to drive the concentrations of two specific proteins to a target region in the state space.
Submitted 25 February, 2015; v1 submitted 12 March, 2013;
originally announced March 2013.
-
Linear models of activation cascades: analytical solutions and coarse-graining of delayed signal transduction
Authors:
Mariano Beguerisse-Diaz,
Radhika Desikan,
Mauricio Barahona
Abstract:
Cellular signal transduction usually involves activation cascades, the sequential activation of a series of proteins following the reception of an input signal. Here we study the classic model of weakly activated cascades and obtain analytical solutions for a variety of inputs. We show that in the special but important case of optimal-gain cascades (i.e., when the deactivation rates are identical) the downstream output of the cascade can be represented exactly as a lumped nonlinear module containing an incomplete gamma function with real parameters that depend on the rates and length of the cascade, as well as parameters of the input signal. The expressions obtained can be applied to the non-identical case when the deactivation rates are random to capture the variability in the cascade outputs. We also show that cascades can be rearranged so that blocks with similar rates can be lumped and represented through our nonlinear modules. Our results can be used both to represent cascades in computational models of differential equations and to fit data efficiently, by reducing the number of equations and parameters involved. In particular, the length of the cascade appears as a real-valued parameter and can thus be fitted in the same manner as Hill coefficients. Finally, we show how the obtained nonlinear modules can be used instead of delay differential equations to model delays in signal transduction.
Submitted 14 September, 2016; v1 submitted 1 December, 2011;
originally announced December 2011.
-
Protein multi-scale organization through graph partitioning and robustness analysis: Application to the myosin-myosin light chain interaction
Authors:
Antoine Delmotte,
Edward W Tate,
Sophia N Yaliraki,
Mauricio Barahona
Abstract:
Despite the recognized importance of the multi-scale spatio-temporal organization of proteins, most computational tools can only access a limited spectrum of time and spatial scales, thereby ignoring the effects on protein behavior of the intricate coupling between the different scales. Starting from a physico-chemical atomistic network of interactions that encodes the structure of the protein, we introduce a methodology based on multi-scale graph partitioning that can uncover partitions and levels of organization of proteins that span the whole range of scales, revealing biological features occurring at different levels of organization and tracking their effect across scales. Additionally, we introduce a measure of robustness to quantify the relevance of the partitions through the generation of biochemically-motivated surrogate random graph models. We apply the method to four distinct conformations of myosin tail interacting protein, a protein from the molecular motor of the malaria parasite, and study properties that have been experimentally addressed such as the closing mechanism, the presence of conserved clusters, and the identification through computational mutational analysis of key residues for binding.
Submitted 20 September, 2011;
originally announced September 2011.
-
Squeeze-and-Breathe Evolutionary Monte Carlo Optimisation with Local Search Acceleration and its application to parameter fitting
Authors:
Mariano Beguerisse-Diaz,
Baojun Wang,
Radhika Desikan,
Mauricio Barahona
Abstract:
Motivation: Estimating parameters from data is a key stage of the modelling process, particularly in biological systems where many parameters need to be estimated from sparse and noisy data sets. Over the years, a variety of heuristics have been proposed to solve this complex optimisation problem, with good results in some cases yet with limitations in the biological setting.
Results: In this work, we develop an algorithm for model parameter fitting that combines ideas from evolutionary algorithms, sequential Monte Carlo and direct search optimisation. Our method performs well even when the order of magnitude and/or the range of the parameters is unknown. The method refines iteratively a sequence of parameter distributions through local optimisation combined with partial resampling from a historical prior defined over the support of all previous iterations. We exemplify our method with biological models using both simulated and real experimental data and estimate the parameters efficiently even in the absence of a priori knowledge about the parameters.
Submitted 4 November, 2011; v1 submitted 14 July, 2011;
originally announced July 2011.
-
Role-similarity based comparison of directed networks
Authors:
Kathryn Cooper,
Mauricio Barahona
Abstract:
The widespread relevance of complex networks is a valuable tool in the analysis of a broad range of systems. There is a demand for tools which enable the extraction of meaningful information and allow the comparison between different systems. We present a novel measure of similarity between nodes in different networks as a generalization of the concept of self-similarity. A similarity matrix is assembled as the distance between feature vectors that contain the in and out paths of all lengths for each node. Hence, nodes operating in a similar flow environment are considered similar regardless of network membership. We demonstrate that this method has the potential to be influential in tasks such as assigning identity or function to uncharacterized nodes. In addition an innovative application of graph partitioning to the raw results extends the concept to the comparison of networks in terms of their underlying role-structure.
△ Less
Submitted 29 March, 2011;
originally announced March 2011.
-
Transient dynamics around unstable periodic orbits in the generalized repressilator model
Authors:
Natalja Strelkowa,
Mauricio Barahona
Abstract:
We study the spatio-temporal dynamics of the generalized repressilator, a system of coupled repressing genes arranged in a directed ring topology, and give analytical conditions for the emergence of a cascade of unstable periodic orbits (UPOs) that lead to reachable long-lived oscillating transients. Such transients dominate the finite time horizon dynamics that is relevant in confined, noisy environments such as bacterial cells (see our previous work [Strelkowa and Barahona, 2010]) and are therefore of interest for bioengineering and synthetic biology. We show that the family of unstable orbits possesses spatial symmetries and can also be understood in terms of traveling wave solutions of kink-like topological defects. The long-lived oscillatory transients correspond to the propagation of quasistable two-kink configurations that unravel over a long time. We also assess the similarities between the generalized repressilator model and other unidirectionally coupled electronic systems, such as magnetic flux gates, which have been implemented experimentally.
△ Less
Submitted 17 December, 2010;
originally announced December 2010.
-
Role-based similarity in directed networks
Authors:
Kathryn Cooper,
Mauricio Barahona
Abstract:
The widespread relevance of increasingly complex networks requires methods to extract meaningful coarse-grained representations of such systems. For undirected graphs, standard community detection methods use criteria largely based on density of connections to provide such representations. We propose a method for grouping nodes in directed networks based on the role of the nodes in the network, understood in terms of patterns of incoming and outgoing flows. The role groupings are obtained through the clustering of a similarity matrix, formed by the distances between feature vectors that contain the number of in and out paths of all lengths for each node. Hence nodes operating in a similar flow environment are grouped together although they may not themselves be densely connected. Our method, which includes a scale factor that reveals robust groupings based on increasingly global structure, provides an alternative criterion to uncover structure in networks where there is an implicit flow transfer in the system. We illustrate its application to a variety of data from ecology, world trade and cellular metabolism.
△ Less
Submitted 13 December, 2010;
originally announced December 2010.
-
Switchable Genetic Oscillator Operating in Quasi-Stable Mode
Authors:
Natalja Strelkowa,
Mauricio Barahona
Abstract:
Ring topologies of repressing genes have qualitatively different long-term dynamics if the number of genes is odd (they oscillate) or even (they exhibit bistability). However, these attractors may not fully explain the observed behavior in transient and stochastic environments such as the cell. We show here that even repressilators possess quasi-stable, travelling-wave periodic solutions that are reachable, long-lived and robust to parameter changes. These solutions underlie the sustained oscillations observed in even rings in the stochastic regime, even if these circuits are expected to behave as switches. The existence of such solutions can also be exploited for control purposes: operation of the system around the quasi-stable orbit allows us to turn on and off the oscillations reliably and on demand. We illustrate these ideas with a simple protocol based on optical interference that can induce oscillations robustly both in the stochastic and deterministic regimes.
Submitted 19 November, 2009; v1 submitted 10 September, 2009;
originally announced September 2009.
-
A Dynamical Model of Lipoprotein Metabolism
Authors:
Elias August,
Kim H. Parker,
Mauricio Barahona
Abstract:
We present a dynamical model of lipoprotein metabolism derived by combining a cascading process in the bloodstream and cellular-level regulatory dynamics. We analyse the existence and stability of equilibria and show that this low-dimensional, nonlinear model exhibits bistability between a low and a high cholesterol state. A sensitivity analysis indicates that the intracellular concentration of cholesterol is robust to parametric variations while the plasma cholesterol can vary widely. We show how the dynamical response to time-dependent inputs can be used to diagnose the state of the system. We also establish the connection between parameters in the system and medical and genetic conditions.
Submitted 28 October, 2006;
originally announced October 2006.
-
Stochastic kinetics of viral capsid assembly based on detailed protein structures
Authors:
Martin Hemberg,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
We present a generic computational framework for the simulation of viral capsid assembly which is quantitative and specific. Starting from PDB files containing atomic coordinates, the algorithm builds a coarse-grained description of protein oligomers based on graph rigidity. These reduced protein descriptions are used in an extended Gillespie algorithm to investigate the stochastic kinetics of the assembly process. The association rates are obtained from a diffusive Smoluchowski equation for rapid coagulation, modified to account for water shielding and protein structure. The dissociation rates are derived by interpreting the splitting of oligomers as a process of graph partitioning akin to the escape from a multidimensional well. This modular framework is quantitative yet computationally tractable, with a small number of physically motivated parameters. The methodology is illustrated using two different viruses, which are shown to follow quantitatively different assembly pathways. We also show how in this model the quasi-stationary kinetics of assembly can be described as a Markovian cascading process in which only a few intermediates and a small proportion of pathways are present. The observed pathways and intermediates can be related a posteriori to structural and energetic properties of the capsid oligomers.
Submitted 28 October, 2006;
originally announced October 2006.
-
Perfect Sampling of the Master Equation for Gene Regulatory Networks
Authors:
Martin Hemberg,
Mauricio Barahona
Abstract:
We present a Perfect Sampling algorithm that can be applied to the Master Equation of Gene Regulatory Networks (GRNs). The method recasts Gillespie's Stochastic Simulation Algorithm (SSA) in the light of Markov Chain Monte Carlo methods and combines it with the Dominated Coupling From The Past (DCFTP) algorithm to provide guaranteed sampling from the stationary distribution. We show how the DCFTP-SSA can be generically applied to genetic networks with feedback formed by the interconnection of linear enzymatic reactions and nonlinear Monod- and Hill-type elements. We establish rigorous bounds on the error and convergence of the DCFTP-SSA, as compared to the standard SSA, through a set of increasingly complex examples. Once the building blocks for GRNs have been introduced, the algorithm is applied to study properly averaged dynamic properties of two experimentally relevant genetic networks: the toggle switch, a two-dimensional bistable system, and the repressilator, a six-dimensional genetic oscillator.
Submitted 10 April, 2007; v1 submitted 27 October, 2006;
originally announced October 2006.
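For readers unfamiliar with Gillespie's Stochastic Simulation Algorithm mentioned in the abstract above, here is a minimal sketch of the standard SSA for a single-species birth-death process (constant production, first-order degradation). It is the plain SSA only and does not implement the DCFTP perfect-sampling scheme; the function name, rate parameters, and the birth-death example are assumptions chosen for illustration.

```python
import random

def gillespie_birth_death(k_prod, k_deg, n0, t_end, seed=0):
    """Simulate production at rate k_prod and degradation at rate k_deg * n until t_end."""
    rng = random.Random(seed)
    t, n = 0.0, n0
    times, counts = [t], [n]
    while True:
        a_prod = k_prod          # propensity of the production reaction
        a_deg = k_deg * n        # propensity of the degradation reaction
        a_total = a_prod + a_deg
        if a_total == 0:
            break                # no reaction can fire
        # Waiting time to the next reaction is exponential with rate a_total.
        t += rng.expovariate(a_total)
        if t > t_end:
            break
        # Pick which reaction fires, with probability proportional to its propensity.
        if rng.random() * a_total < a_prod:
            n += 1
        else:
            n -= 1
        times.append(t)
        counts.append(n)
    return times, counts

times, counts = gillespie_birth_death(k_prod=10.0, k_deg=1.0, n0=0, t_end=5.0, seed=1)
```

Each trajectory is one sample path of the underlying Master Equation; estimating stationary properties from such runs normally requires discarding a burn-in period, which is the issue that perfect-sampling methods aim to avoid.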