-
From FAIR to CURE: Guidelines for Computational Models of Biological Systems
Authors:
Herbert M. Sauro,
Eran Agmon,
Michael L. Blinov,
John H. Gennari,
Joe Hellerstein,
Adel Heydarabadipour,
Peter Hunter,
Bartholomew E. Jardine,
Elebeoba May,
David P. Nickerson,
Lucian P. Smith,
Gary D Bader,
Frank Bergmann,
Patrick M. Boyle,
Andreas Drager,
James R. Faeder,
Song Feng,
Juliana Freire,
Fabian Frohlich,
James A. Glazier,
Thomas E. Gorochowski,
Tomas Helikar,
Stefan Hoops,
Princess Imoukhuede,
Sarah M. Keating
, et al. (26 additional authors not shown)
Abstract:
Guidelines for managing scientific data have been established under the FAIR principles requiring that data be Findable, Accessible, Interoperable, and Reusable. In many scientific disciplines, especially computational biology, both data and models are key to progress. For this reason, and recognizing that such models are a very special type of 'data', we argue that computational models, especiall…
▽ More
Guidelines for managing scientific data have been established under the FAIR principles requiring that data be Findable, Accessible, Interoperable, and Reusable. In many scientific disciplines, especially computational biology, both data and models are key to progress. For this reason, and recognizing that such models are a very special type of 'data', we argue that computational models, especially mechanistic models prevalent in medicine, physiology and systems biology, deserve a complementary set of guidelines. We propose the CURE principles, emphasizing that models should be Credible, Understandable, Reproducible, and Extensible. We delve into each principle, discussing verification, validation, and uncertainty quantification for model credibility; the clarity of model descriptions and annotations for understandability; adherence to standards and open science practices for reproducibility; and the use of open standards and modular code for extensibility and reuse. We outline recommended and baseline requirements for each aspect of CURE, aiming to enhance the impact and trustworthiness of computational models, particularly in biomedical applications where credibility is paramount. Our perspective underscores the need for a more disciplined approach to modeling, aligning with emerging trends such as Digital Twins and emphasizing the importance of data and modeling standards for interoperability and reuse. Finally, we emphasize that given the non-trivial effort required to implement the guidelines, the community moves to automate as many of the guidelines as possible.
△ Less
Submitted 21 February, 2025;
originally announced February 2025.
-
Beyond Boolean networks: new tools for the steady state analysis of multivalued networks
Authors:
J. García Galofre,
M. Pérez Millán,
A. Galarza Rial,
R. Laubenbacher,
A. Dickenstein
Abstract:
Boolean networks can be viewed as functions on the set of binary strings of a given length, described via logical rules. They were introduced as dynamic models into biology, in particular as logical models of intracellular regulatory networks involving genes, proteins, and metabolites. Since genes can have several modes of action depending on their expression levels, binary variables are often not…
▽ More
Boolean networks can be viewed as functions on the set of binary strings of a given length, described via logical rules. They were introduced as dynamic models into biology, in particular as logical models of intracellular regulatory networks involving genes, proteins, and metabolites. Since genes can have several modes of action depending on their expression levels, binary variables are often not sufficiently rich, requiring the use of multivalued networks instead.
In this paper, we explore the multivalued generalization of Boolean networks by writing the standard $(\wedge, \vee, \lnot)$ operations on $\{0, 1\}$ in terms of the operations $(\odot, \oplus, \neg)$ on $\big\{0,\frac{1}{m}, \frac{2}{m}, \dots, \frac{m-1}{m}, 1\big\}$ from multivalued logic. We recall the basic theory of this mathematical framework, and give a novel algorithm for computing the fixed points that in many cases has essentially the same complexity as in the binary case. Our approach provides a biologically intuitive representation of the network. Furthermore, it uses tools to compute lattice points in rational polytopes, tapping a rich area of algebraic combinatorics as a source for combinatorial algorithms for network analysis. An implementation of the algorithm is provided.
△ Less
Submitted 14 April, 2025; v1 submitted 25 April, 2024;
originally announced April 2024.
-
Control of Medical Digital Twins with Artificial Neural Networks
Authors:
Lucas Böttcher,
Luis L. Fonseca,
Reinhard C. Laubenbacher
Abstract:
The objective of personalized medicine is to tailor interventions to an individual patient's unique characteristics. A key technology for this purpose involves medical digital twins, computational models of human biology that can be personalized and dynamically updated to incorporate patient-specific data collected over time. Certain aspects of human biology, such as the immune system, are not eas…
▽ More
The objective of personalized medicine is to tailor interventions to an individual patient's unique characteristics. A key technology for this purpose involves medical digital twins, computational models of human biology that can be personalized and dynamically updated to incorporate patient-specific data collected over time. Certain aspects of human biology, such as the immune system, are not easily captured with physics-based models, such as differential equations. Instead, they are often multi-scale, stochastic, and hybrid. This poses a challenge to existing model-based control and optimization approaches that cannot be readily applied to such models. Recent advances in automatic differentiation and neural-network control methods hold promise in addressing complex control problems. However, the application of these approaches to biomedical systems is still in its early stages. This work introduces dynamics-informed neural-network controllers as an alternative approach to control of medical digital twins. As a first use case for this method, the focus is on agent-based models, a versatile and increasingly common modeling platform in biomedicine. The effectiveness of the proposed neural-network control method is illustrated and benchmarked against other methods with two widely-used agent-based model types. The relevance of the method introduced here extends beyond medical digital twins to other complex dynamical systems.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Surrogate Modeling and Control of Medical Digital Twins
Authors:
Luis L. Fonseca,
Lucas Böttcher,
Borna Mehrad,
Reinhard C. Laubenbacher
Abstract:
The vision of personalized medicine is to identify interventions that maintain or restore a person's health based on their individual biology. Medical digital twins, computational models that integrate a wide range of health-related data about a person and can be dynamically updated, are a key technology that can help guide medical decisions. Such medical digital twin models can be high-dimensiona…
▽ More
The vision of personalized medicine is to identify interventions that maintain or restore a person's health based on their individual biology. Medical digital twins, computational models that integrate a wide range of health-related data about a person and can be dynamically updated, are a key technology that can help guide medical decisions. Such medical digital twin models can be high-dimensional, multi-scale, and stochastic. To be practical for healthcare applications, they often need to be simplified into low-dimensional surrogate models that can be used for the optimal design of interventions. This paper introduces surrogate modeling algorithms for the purpose of optimal control applications. As a use case, we focus on agent-based models (ABMs), a common model type in biomedicine for which there are no readily available optimal control algorithms. By deriving surrogate models that are based on systems of ordinary differential equations, we show how optimal control methods can be employed to compute effective interventions, which can then be lifted back to a given ABM. The relevance of the methods introduced here extends beyond medical digital twins to other complex dynamical systems.
△ Less
Submitted 20 May, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Modular control of Boolean network models
Authors:
David Murrugarra,
Alan Veliz-Cuba,
Elena Dimitrova,
Claus Kadelka,
Matthew Wheeler,
Reinhard Laubenbacher
Abstract:
The concept of control is crucial for effectively understanding and applying biological network models. Key structural features relate to control functions through gene regulation, signaling, or metabolic mechanisms, and computational models need to encode these. Applications often focus on model-based control, such as in biomedicine or metabolic engineering. In a recent paper, the authors develop…
▽ More
The concept of control is crucial for effectively understanding and applying biological network models. Key structural features relate to control functions through gene regulation, signaling, or metabolic mechanisms, and computational models need to encode these. Applications often focus on model-based control, such as in biomedicine or metabolic engineering. In a recent paper, the authors developed a theoretical framework of modularity in Boolean networks, which lead to a canonical semidirect product decomposition of these systems. In this paper, we present an approach to model-based control that exploits this modular structure, as well as the canalizing features of the regulatory mechanisms. We show how to identify control strategies from the individual modules, and we present a criterion based on canalizing features of the regulatory rules to identify modules that do not contribute to network control and can be excluded. For even moderately sized networks, finding global control inputs is computationally challenging. Our modular approach leads to an efficient approach to solving this problem. We apply it to a published Boolean network model of blood cancer large granular lymphocyte (T-LGL) leukemia to identify a minimal control set that achieves a desired control objective.
△ Less
Submitted 4 November, 2024; v1 submitted 22 January, 2024;
originally announced January 2024.
-
Forum on immune digital twins: a meeting report
Authors:
Reinhard Laubenbacher,
Fred Adler,
Gary An,
Filippo Castiglione,
Stephen Eubank,
Luis L. Fonseca,
James Glazier,
Tomas Helikar,
Marti Jett-Tilton,
Denise Kirschner,
Paul Macklin,
Borna Mehrad,
Beth Moore,
Virginia Pasour,
Ilya Shmulevich,
Amber Smith,
Isabel Voigt,
Thomas E. Yankeelov,
Tjalf Ziemssen
Abstract:
Medical digital twins are computational models of human biology relevant to a given medical condition, which can be tailored to an individual patient, thereby predicting the course of disease and individualized treatments, an important goal of personalized medicine. The immune system, which has a central role in many diseases, is highly heterogeneous between individuals, and thus poses a major cha…
▽ More
Medical digital twins are computational models of human biology relevant to a given medical condition, which can be tailored to an individual patient, thereby predicting the course of disease and individualized treatments, an important goal of personalized medicine. The immune system, which has a central role in many diseases, is highly heterogeneous between individuals, and thus poses a major challenge for this technology. If medical digital twins are to faithfully capture the characteristics of a patient's immune system, we need to answer many questions, such as: What do we need to know about the immune system to build mathematical models that reflect features of an individual? What data do we need to collect across the different scales of immune system action? What are the right modeling paradigms to properly capture immune system complexity? In February 2023, an international group of experts convened in Lake Nona, FL for two days to discuss these and other questions related to digital twins of the immune system. The group consisted of clinicians, immunologists, biologists, and mathematical modelers, representative of the interdisciplinary nature of medical digital twin development. A video recording of the entire event is available. This paper presents a synopsis of the discussions, brief descriptions of ongoing digital twin projects at different stages of progress. It also proposes a 5-year action plan for further developing this technology. The main recommendations are to identify and pursue a small number of promising use cases, to develop stimulation-specific assays of immune function in a clinical setting, and to develop a database of existing computational immune models, as well as advanced modeling technology and infrastructure.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Stability of Linear Boolean Networks
Authors:
Karthik Chandrasekhar,
Claus Kadelka,
Reinhard Laubenbacher,
David Murrugarra
Abstract:
Stability is an important characteristic of network models that has implications for other desirable aspects such as controllability. The stability of a Boolean network depends on various factors, such as the topology of its wiring diagram and the type of the functions describing its dynamics. In this paper, we study the stability of linear Boolean networks by computing Derrida curves and quantify…
▽ More
Stability is an important characteristic of network models that has implications for other desirable aspects such as controllability. The stability of a Boolean network depends on various factors, such as the topology of its wiring diagram and the type of the functions describing its dynamics. In this paper, we study the stability of linear Boolean networks by computing Derrida curves and quantifying the number of attractors and cycle lengths imposed by their network topologies. Derrida curves are commonly used to measure the stability of Boolean networks and several parameters such as the average in-degree K and the output bias p can indicate if a network is stable, critical, or unstable. For random unbiased Boolean networks there is a critical connectivity value Kc=2 such that if K<Kc networks operate in the ordered regime, and if K>Kc networks operate in the chaotic regime. Here, we show that for linear networks, which are the least canalizing and most unstable, the phase transition from order to chaos already happens at an average in-degree of Kc=1. Consistently, we also show that unstable networks exhibit a large number of attractors with very long limit cycles while stable and critical networks exhibit fewer attractors with shorter limit cycles. Additionally, we present theoretical results to quantify important dynamical properties of linear networks. First, we present a formula for the proportion of attractor states in linear systems. Second, we show that the expected number of fixed points in linear systems is 2, while general Boolean networks possess on average one fixed point. Third, we present a formula to quantify the number of bijective linear Boolean networks and provide a lower bound for the percentage of this type of network.
△ Less
Submitted 9 May, 2023; v1 submitted 5 September, 2022;
originally announced September 2022.
-
Decomposition of Boolean networks: An approach to modularity of biological systems
Authors:
Claus Kadelka,
Reinhard Laubenbacher,
David Murrugarra,
Alan Veliz-Cuba,
Matthew Wheeler
Abstract:
This paper presents the foundation for a decomposition theory for Boolean networks, a type of discrete dynamical system that has found a wide range of applications in the life sciences, engineering, and physics. Given a Boolean network satisfying certain conditions, there is a unique collection of subnetworks so that the network can be reconstructed from these subnetworks by an extension operation…
▽ More
This paper presents the foundation for a decomposition theory for Boolean networks, a type of discrete dynamical system that has found a wide range of applications in the life sciences, engineering, and physics. Given a Boolean network satisfying certain conditions, there is a unique collection of subnetworks so that the network can be reconstructed from these subnetworks by an extension operation. The main result of the paper is that this structural decomposition induces a corresponding decomposition of the network dynamics. The theory is motivated by the search for a mathematical framework to formalize the hypothesis that biological systems are modular, widely accepted in the life sciences, but not well-defined and well-characterized. As an example of how dynamic modularity could be used for the efficient identification of phenotype control, the control strategies for the network can be found by identifying controls in its modules, one at a time.
△ Less
Submitted 8 June, 2022;
originally announced June 2022.
-
Collectively canalizing Boolean functions
Authors:
Claus Kadelka,
Benjamin Keilty,
Reinhard Laubenbacher
Abstract:
This paper studies the mathematical properties of collectively canalizing Boolean functions, a class of functions that has arisen from applications in systems biology. Boolean networks are an increasingly popular modeling framework for regulatory networks, and the class of functions studied here captures a key feature of biological network dynamics, namely that a subset of one or more variables, u…
▽ More
This paper studies the mathematical properties of collectively canalizing Boolean functions, a class of functions that has arisen from applications in systems biology. Boolean networks are an increasingly popular modeling framework for regulatory networks, and the class of functions studied here captures a key feature of biological network dynamics, namely that a subset of one or more variables, under certain conditions, can dominate the value of a Boolean function, to the exclusion of all others. These functions have rich mathematical properties to be explored. The paper shows how the number and type of such sets influence a function's behavior and define a new measure for the canalizing strength of any Boolean function. We further connect the concept of collective canalization with the well-studied concept of the average sensitivity of a Boolean function. The relationship between Boolean functions and the dynamics of the networks they form is important in a wide range of applications beyond biology, such as computer science, and has been studied with statistical and simulation-based methods. But the rich relationship between structure and dynamics remains largely unexplored, and this paper is intended as a contribution to its mathematical foundation.
△ Less
Submitted 6 June, 2023; v1 submitted 31 August, 2020;
originally announced August 2020.
-
The Dynamics of Canalizing Boolean Networks
Authors:
Elijah Paul,
Gleb Pogudin,
William Qin,
Reinhard Laubenbacher
Abstract:
Boolean networks are a popular modeling framework in computational biology to capture the dynamics of molecular networks, such as gene regulatory networks. It has been observed that many published models of such networks are defined by regulatory rules driving the dynamics that have certain so-called canalizing properties. In this paper, we investigate the dynamics of a random Boolean network with…
▽ More
Boolean networks are a popular modeling framework in computational biology to capture the dynamics of molecular networks, such as gene regulatory networks. It has been observed that many published models of such networks are defined by regulatory rules driving the dynamics that have certain so-called canalizing properties. In this paper, we investigate the dynamics of a random Boolean network with such properties using analytical methods and simulations.
From our simulations, we observe that Boolean networks with higher canalizing depth have generally fewer attractors, the attractors are smaller, and the basins are larger, with implications for the stability and robustness of the models. These properties are relevant to many biological applications. Moreover, our results show that, from the standpoint of the attractor structure, high canalizing depth, compared to relatively small positive canalizing depth, has a very modest impact on dynamics.
Motivated by these observations, we conduct mathematical study of the attractor structure of a random Boolean network of canalizing depth one (i.e., the smallest positive depth). For every positive integer $\ell$, we give an explicit formula for the limit of the expected number of attractors of length $\ell$ in an $n$-state random Boolean network as $n$ goes to infinity.
△ Less
Submitted 5 December, 2019; v1 submitted 31 January, 2019;
originally announced February 2019.
-
Addressing current challenges in cancer immunotherapy with mathematical and computational modeling
Authors:
Anna Konstorum,
Anthony T. Vella,
Adam J. Adler,
Reinhard Laubenbacher
Abstract:
The goal of cancer immunotherapy is to boost a patient's immune response to a tumor. Yet, the design of an effective immunotherapy is complicated by various factors, including a potentially immunosuppressive tumor microenvironment, immune-modulating effects of conventional treatments, and therapy-related toxicities. These complexities can be incorporated into mathematical and computational models…
▽ More
The goal of cancer immunotherapy is to boost a patient's immune response to a tumor. Yet, the design of an effective immunotherapy is complicated by various factors, including a potentially immunosuppressive tumor microenvironment, immune-modulating effects of conventional treatments, and therapy-related toxicities. These complexities can be incorporated into mathematical and computational models of cancer immunotherapy that can then be used to aid in rational therapy design. In this review, we survey modeling approaches under the umbrella of the major challenges facing immunotherapy development, which encompass tumor classification, optimal treatment scheduling, and combination therapy design. Although overlapping, each challenge has presented unique opportunities for modelers to make contributions using analytical and numerical analysis of model outcomes, as well as optimization algorithms. We discuss several examples of models that have grown in complexity as more biological information has become available, showcasing how model development is a dynamic process interlinked with the rapid advances in tumor-immune biology. We conclude the review with recommendations for modelers both with respect to methodology and biological direction that might help keep modelers at the forefront of cancer immunotherapy development.
△ Less
Submitted 6 June, 2017;
originally announced June 2017.
-
The Influence of Canalization on the Robustness of Boolean Networks
Authors:
Claus Kadelka,
Jack Kuipers,
Reinhard Laubenbacher
Abstract:
Time- and state-discrete dynamical systems are frequently used to model molecular networks. This paper provides a collection of mathematical and computational tools for the study of robustness in Boolean network models. The focus is on networks governed by $k$-canalizing functions, a recently introduced class of Boolean functions that contains the well-studied class of nested canalizing functions.…
▽ More
Time- and state-discrete dynamical systems are frequently used to model molecular networks. This paper provides a collection of mathematical and computational tools for the study of robustness in Boolean network models. The focus is on networks governed by $k$-canalizing functions, a recently introduced class of Boolean functions that contains the well-studied class of nested canalizing functions. The activities and sensitivity of a function quantify the impact of input changes on the function output. This paper generalizes the latter concept to $c$-sensitivity and provides formulas for the activities and $c$-sensitivity of general $k$-canalizing functions as well as canalizing functions with more precisely defined structure. A popular measure for the robustness of a network, the Derrida value, can be expressed as a weighted sum of the $c$-sensitivities of the governing canalizing functions, and can also be calculated for a stochastic extension of Boolean networks. These findings provide a computationally efficient way to obtain Derrida values of Boolean networks, deterministic or stochastic, that does not involve simulation.
△ Less
Submitted 18 November, 2016; v1 submitted 15 July, 2016;
originally announced July 2016.
-
Identification of control targets in Boolean molecular network models via computational algebra
Authors:
David Murrugarra,
Alan Veliz-Cuba,
Boris Aguilar,
Reinhard Laubenbacher
Abstract:
Motivation: Many problems in biomedicine and other areas of the life sciences can be characterized as control problems, with the goal of finding strategies to change a disease or otherwise undesirable state of a biological system into another, more desirable, state through an intervention, such as a drug or other therapeutic treatment. The identification of such strategies is typically based on a…
▽ More
Motivation: Many problems in biomedicine and other areas of the life sciences can be characterized as control problems, with the goal of finding strategies to change a disease or otherwise undesirable state of a biological system into another, more desirable, state through an intervention, such as a drug or other therapeutic treatment. The identification of such strategies is typically based on a mathematical model of the process to be altered through targeted control inputs. This paper focuses on processes at the molecular level that determine the state of an individual cell, involving signaling or gene regulation. The mathematical model type considered is that of Boolean networks. The potential control targets can be represented by a set of nodes and edges that can be manipulated to produce a desired effect on the system. Experimentally, node manipulation requires technology to completely repress or fully activate a particular gene product while edge manipulations only require a drug that inactivates the interaction between two gene products. Results: This paper presents a method for the identification of potential intervention targets in Boolean molecular network models using algebraic techniques. The approach exploits an algebraic representation of Boolean networks to encode the control candidates in the network wiring diagram as the solutions of a system of polynomials equations, and then uses computational algebra techniques to find such controllers. The control methods in this paper are validated through the identification of combinatorial interventions in the signaling pathways of previously reported control targets in two well studied systems, a p53-mdm2 network and a blood T cell lymphocyte granular leukemia survival signaling network.
△ Less
Submitted 3 October, 2016; v1 submitted 21 August, 2015;
originally announced August 2015.
-
Multistate Nested Canalizing Functions and Their Networks
Authors:
Claus Kadelka,
Yuan Li,
Jack Kuipers,
John O. Adeyeye,
Reinhard Laubenbacher
Abstract:
This paper provides a collection of mathematical and computational tools for the study of robustness in nonlinear gene regulatory networks, represented by time- and state-discrete dynamical systems taking on multiple states. The focus is on networks governed by nested canalizing functions (NCFs), first introduced in the Boolean context by S. Kauffman. After giving a general definition of NCFs we a…
▽ More
This paper provides a collection of mathematical and computational tools for the study of robustness in nonlinear gene regulatory networks, represented by time- and state-discrete dynamical systems taking on multiple states. The focus is on networks governed by nested canalizing functions (NCFs), first introduced in the Boolean context by S. Kauffman. After giving a general definition of NCFs we analyze the class of such functions. We derive a formula for the normalized average $c$-sensitivities of multistate NCFs, which enables the calculation of the Derrida plot, a popular measure of network stability. We also provide a unique canonical parametrized polynomial form of NCFs. This form has several consequences. We can easily generate NCFs for varying parameter choices, and derive a closed form formula for the number of such functions in a given number of variables, as well as an asymptotic formula. Finally, we compute the number of equivalence classes of NCFs under permutation of variables. Together, the results of the paper represent a useful mathematical framework for the study of NCFs and their dynamic networks.
△ Less
Submitted 29 August, 2016; v1 submitted 14 November, 2014;
originally announced November 2014.
-
A Network Biology Approach to Denitrification in Pseudomonas aeruginosa
Authors:
Seda Arat,
George S. Bullerjahn,
Reinhard Laubenbacher
Abstract:
Pseudomonas aeruginosa is a metabolically flexible member of the Gammaproteobacteria. Under anaerobic conditions and the presence of nitrate, P. aeruginosa can perform (complete) denitrification, a respiratory process of dissimilatory nitrate reduction to nitrogen gas via nitrite ($NO_{2}$), nitric oxide ($NO$) and nitrous oxide ($N_{2}O$). This study focuses on understanding the influence of envi…
▽ More
Pseudomonas aeruginosa is a metabolically flexible member of the Gammaproteobacteria. Under anaerobic conditions and the presence of nitrate, P. aeruginosa can perform (complete) denitrification, a respiratory process of dissimilatory nitrate reduction to nitrogen gas via nitrite ($NO_{2}$), nitric oxide ($NO$) and nitrous oxide ($N_{2}O$). This study focuses on understanding the influence of environmental conditions on bacterial denitrification performance, using a mathematical model of a metabolic network in P. aeruginosa. To our knowledge, this is the first mathematical model of denitrification for this bacterium. Analysis of the long-term behavior of the network under changing concentration levels of oxygen ($O_{2}$), nitrate ($NO_{3}$), and phosphate ($PO_{4}$) suggests that $PO_{4}$ concentration strongly affects denitrification performance. The model provides five predictions on denitrification activity of P. aeruginosa under various environmental conditions, and these predictions are either experimentally validated or supported by pertinent biological literature. One motivation for this study is to capture the effect of $PO_{4}$ on a denitrification metabolic network of P. aeruginosa in order to shed light on mechanisms for greenhouse gas $N_{2}O$ accumulation during seasonal oxygen depletion in aquatic environments such as Lake Erie (Laurentian Great Lakes, USA).
△ Less
Submitted 6 April, 2015; v1 submitted 16 December, 2013;
originally announced December 2013.
-
Dimension Reduction of Large AND-NOT Network Models
Authors:
Alan Veliz-Cuba,
Reinhard Laubenbacher,
Boris Aguilar
Abstract:
Boolean networks have been used successfully in modeling biological networks and provide a good framework for theoretical analysis. However, the analysis of large networks is not trivial. In order to simplify the analysis of such networks, several model reduction algorithms have been proposed; however, it is not clear if such algorithms scale well with respect to the number of nodes. The goal of t…
▽ More
Boolean networks have been used successfully in modeling biological networks and provide a good framework for theoretical analysis. However, the analysis of large networks is not trivial. In order to simplify the analysis of such networks, several model reduction algorithms have been proposed; however, it is not clear if such algorithms scale well with respect to the number of nodes. The goal of this paper is to propose and implement an algorithm for the reduction of AND-NOT network models for the purpose of steady state computation. Our method of network reduction is the use of "steady state approximations" that do not change the number of steady states. Our algorithm is designed to work at the wiring diagram level without the need to evaluate or simplify Boolean functions. Also, our implementation of the algorithm takes advantage of the sparsity typical of discrete models of biological systems. The main features of our algorithm are that it works at the wiring diagram level, it runs in polynomial time, and it preserves the number of steady states. We used our results to study AND-NOT network models of gene networks and showed that our algorithm greatly simplifies steady state analysis. Furthermore, our algorithm can handle sparse AND-NOT networks with up to 1000000 nodes.
△ Less
Submitted 27 November, 2013;
originally announced November 2013.
-
Stabilizing Gene Regulatory Networks Through Feedforward Loops
Authors:
Claus Kadelka,
David Murrugarra,
Reinhard Laubenbacher
Abstract:
The global dynamics of gene regulatory networks are known to show robustness to perturbations in the form of intrinsic and extrinsic noise, as well as mutations of individual genes. One molecular mechanism underlying this robustness has been identified as the action of so-called microRNAs that operate via feedforward loops. We present results of a computational study, using the modeling framework…
▽ More
The global dynamics of gene regulatory networks are known to show robustness to perturbations in the form of intrinsic and extrinsic noise, as well as mutations of individual genes. One molecular mechanism underlying this robustness has been identified as the action of so-called microRNAs that operate via feedforward loops. We present results of a computational study, using the modeling framework of stochastic Boolean networks, which explores the role that such network motifs play in stabilizing global dynamics. The paper introduces a new measure for the stability of stochastic networks. The results show that certain types of feedforward loops do indeed buffer the network against stochastic effects.
△ Less
Submitted 22 May, 2013; v1 submitted 30 April, 2013;
originally announced April 2013.
-
Multistate nested canalizing functions
Authors:
J. O. Adeyeye,
C. Kadelka,
R. Laubenbacher,
Y. Li
Abstract:
The concept of a nested canalizing Boolean function has been studied over the course of the last decade in the context of understanding the regulatory logic of molecular interaction networks, such as gene regulatory networks. Such functions appear preferentially in published models of such networks. Recently, this concept has been generalized to include multi-state functions, and a recursive formu…
▽ More
The concept of a nested canalizing Boolean function has been studied over the course of the last decade in the context of understanding the regulatory logic of molecular interaction networks, such as gene regulatory networks. Such functions appear preferentially in published models of such networks. Recently, this concept has been generalized to include multi-state functions, and a recursive formula has been derived for their number, as a function of the number of variables. This paper carries out a detailed analysis of the class of nested canalizing functions over an arbitrary finite field. Furthermore, the paper generalizes the concept further, and derives a closed formula for the number of such generalized functions. The paper also derives a closed formula for the number of equivalence classes under permutation of variables. This is motivated by the fact that two nested canalizing functions that differ by a permutation of the variables share many important properties with each other. The paper contributes to the effort of identifying a class of functions over finite fields that are of interest in biology and also have interesting mathematical properties.
△ Less
Submitted 12 April, 2013;
originally announced April 2013.
-
AND-NOT logic framework for steady state analysis of Boolean network models
Authors:
Alan Veliz-Cuba,
Kristina Buschur,
Rose Hamershock,
Ariel Kniss,
Esther Wolff,
Reinhard Laubenbacher
Abstract:
Finite dynamical systems (e.g. Boolean networks and logical models) have been used in modeling biological systems to focus attention on the qualitative features of the system, such as the wiring diagram. Since the analysis of such systems is hard, it is necessary to focus on subclasses that have the properties of being general enough for modeling and simple enough for theoretical analysis. In this…
▽ More
Finite dynamical systems (e.g. Boolean networks and logical models) have been used in modeling biological systems to focus attention on the qualitative features of the system, such as the wiring diagram. Since the analysis of such systems is hard, it is necessary to focus on subclasses that have the properties of being general enough for modeling and simple enough for theoretical analysis. In this paper we propose the class of AND-NOT networks for modeling biological systems and show that it provides several advantages. Some of the advantages include: Any finite dynamical system can be written as an AND-NOT network with similar dynamical properties. There is a one-to-one correspondence between AND-NOT networks, their wiring diagrams, and their dynamics. Results about AND-NOT networks can be stated at the wiring diagram level without losing any information. Results about AND-NOT networks are applicable to any Boolean network. We apply our results to a Boolean model of Th-cell differentiation.
△ Less
Submitted 23 November, 2012;
originally announced November 2012.
-
Boolean nested canalizing functions: a comprehensive analysis
Authors:
Yuan Li,
John O. Adeyeye,
David Murrugarra,
Boris Aguilar,
Reinhard Laubenbacher
Abstract:
Boolean network models of molecular regulatory networks have been used successfully in computational systems biology. The Boolean functions that appear in published models tend to have special properties, in particular the property of being nested canalizing, a concept inspired by the concept of canalization in evolutionary biology. It has been shown that networks comprised of nested canalizing fu…
▽ More
Boolean network models of molecular regulatory networks have been used successfully in computational systems biology. The Boolean functions that appear in published models tend to have special properties, in particular the property of being nested canalizing, a concept inspired by the concept of canalization in evolutionary biology. It has been shown that networks comprised of nested canalizing functions have dynamic properties that make them suitable for modeling molecular regulatory networks, namely a small number of (large) attractors, as well as relatively short limit cycles.
This paper contains a detailed analysis of this class of functions, based on a novel normal form as polynomial functions over the Boolean field. The concept of layer is introduced that stratifies variables into different classes depending on their level of dominance. Using this layer concept a closed form formula is derived for the number of nested canalizing functions with a given number of variables. Additional metrics considered include Hamming weight, the activity number of any variable, and the average sensitivity of the function. It is also shown that the average sensitivity of any nested canalizing function is between 0 and 2. This provides a rationale for why nested canalizing functions are stable, since a random Boolean function in n variables has average sensitivity n/2. The paper also contains experimental evidence that the layer number is an important factor in network stability.
△ Less
Submitted 17 January, 2013; v1 submitted 23 April, 2012;
originally announced April 2012.
-
Modeling Stochasticity and Variability in Gene Regulatory Networks
Authors:
David Murrugarra,
Alan Veliz-Cuba,
Boris Aguilar,
Seda Arat,
Reinhard Laubenbacher
Abstract:
Modeling stochasticity in gene regulatory networks is an important and complex problem in molecular systems biology. To elucidate intrinsic noise, several modeling strategies such as the Gillespie algorithm have been used successfully. This paper contributes an approach as an alternative to these classical settings. Within the discrete paradigm, where genes, proteins, and other molecular component…
▽ More
Modeling stochasticity in gene regulatory networks is an important and complex problem in molecular systems biology. To elucidate intrinsic noise, several modeling strategies such as the Gillespie algorithm have been used successfully. This paper contributes an approach as an alternative to these classical settings. Within the discrete paradigm, where genes, proteins, and other molecular components of gene regulatory networks are modeled as discrete variables and are assigned as logical rules describing their regulation through interactions with other components. Stochasticity is modeled at the biological function level under the assumption that even if the expression levels of the input nodes of an update rule guarantee activation or degradation there is a probability that the process will not occur due to stochastic effects. This approach allows a finer analysis of discrete models and provides a natural setup for cell population simulations to study cell-to-cell variability. We applied our methods to two of the most studied regulatory networks, the outcome of lambda phage infection of bacteria and the p53-mdm2 complex.
△ Less
Submitted 26 March, 2012; v1 submitted 26 January, 2012;
originally announced January 2012.
-
Structure and Dynamics of Polynomial Dynamical Systems
Authors:
Reinhard Laubenbacher,
David Murrugarra,
Alan Veliz-Cuba
Abstract:
Discrete models have a long tradition in engineering, including finite state machines, Boolean networks, Petri nets, and agent-based models. Of particular importance is the question of how the model structure constrains its dynamics. This paper discusses an algebraic framework to study such questions. The systems discussed here are given by mappings on an affine space over a finite field, whose co…
▽ More
Discrete models have a long tradition in engineering, including finite state machines, Boolean networks, Petri nets, and agent-based models. Of particular importance is the question of how the model structure constrains its dynamics. This paper discusses an algebraic framework to study such questions. The systems discussed here are given by mappings on an affine space over a finite field, whose coordinate functions are polynomials. They form a general class of models which can represent many discrete model types. Assigning to such a system its dependency graph, that is, the directed graph that indicates the variable dependencies, provides a mapping from systems to graphs. A basic property of this mapping is derived and used to prove that dynamical systems with an acyclic dependency graph can only have a unique fixed point in their phase space and no periodic orbits. This result is then applied to a published model of in vitro virus competition.
△ Less
Submitted 31 July, 2011;
originally announced August 2011.
-
The Number of Multistate Nested Canalyzing Functions
Authors:
David Murrugarra,
Reinhard Laubenbacher
Abstract:
Identifying features of molecular regulatory networks is an important problem in systems biology. It has been shown that the combinatorial logic of such networks can be captured in many cases by special functions called nested canalyzing in the context of discrete dynamic network models. It was also shown that the dynamics of networks constructed from such functions has very special properties tha…
▽ More
Identifying features of molecular regulatory networks is an important problem in systems biology. It has been shown that the combinatorial logic of such networks can be captured in many cases by special functions called nested canalyzing in the context of discrete dynamic network models. It was also shown that the dynamics of networks constructed from such functions has very special properties that are consistent with what is known about molecular networks, and that simplify analysis. It is important to know how restrictive this class of functions is, for instance for the purpose of network reverse-engineering. This paper contains a formula for the number of such functions and a comparison to the class of all functions. In particular, it is shown that, as the number of variables becomes large, the ratio of the number of nested canalyzing functions to the number of all functions converges to zero. This shows that the class of nested canalyzing functions is indeed very restrictive, indicating that molecular networks have very special properties. The principal tool used for this investigation is a description of these functions as polynomials and a parameterization of the class of all such polynomials in terms of relations on their coefficients.
△ Less
Submitted 6 August, 2011; v1 submitted 31 July, 2011;
originally announced August 2011.
-
Regulatory patterns in molecular interaction networks
Authors:
David Murrugarra,
Reinhard Laubenbacher
Abstract:
Understanding design principles of molecular interaction networks is an important goal of molecular systems biology. Some insights have been gained into features of their network topology through the discovery of graph theoretic patterns that constrain network dynamics. This paper contributes to the identification of patterns in the mechanisms that govern network dynamics. The control of nodes in…
▽ More
Understanding design principles of molecular interaction networks is an important goal of molecular systems biology. Some insights have been gained into features of their network topology through the discovery of graph theoretic patterns that constrain network dynamics. This paper contributes to the identification of patterns in the mechanisms that govern network dynamics. The control of nodes in gene regulatory, signaling, and metabolic networks is governed by a variety of biochemical mechanisms, with inputs from other network nodes that act additively or synergistically. This paper focuses on a certain type of logical rule that appears frequently as a regulatory pattern. Within the context of the multistate discrete model paradigm, a rule type is introduced that reduces to the concept of nested canalyzing function in the Boolean network case. It is shown that networks that employ this type of multivalued logic exhibit more robust dynamics than random networks, with few attractors and short limit cycles. It is also shown that the majority of regulatory functions in many published models of gene regulatory and signaling networks are nested canalyzing.
△ Less
Submitted 26 June, 2011; v1 submitted 17 February, 2011;
originally announced February 2011.
-
ADAM: Analysis of Discrete Models of Biological Systems Using Computer Algebra
Authors:
Franziska Hinkelmann,
Madison Brandon,
Bonny Guang,
Rustin McNeill,
Grigoriy Blekherman,
Alan Veliz-Cuba,
Reinhard Laubenbacher
Abstract:
Background: Many biological systems are modeled qualitatively with discrete models, such as probabilistic Boolean networks, logical models, Petri nets, and agent-based models, with the goal to gain a better understanding of the system. The computational complexity to analyze the complete dynamics of these models grows exponentially in the number of variables, which impedes working with complex mod…
▽ More
Background: Many biological systems are modeled qualitatively with discrete models, such as probabilistic Boolean networks, logical models, Petri nets, and agent-based models, with the goal to gain a better understanding of the system. The computational complexity to analyze the complete dynamics of these models grows exponentially in the number of variables, which impedes working with complex models. Although there exist sophisticated algorithms to determine the dynamics of discrete models, their implementations usually require labor-intensive formatting of the model formulation, and they are oftentimes not accessible to users without programming skills. Efficient analysis methods are needed that are accessible to modelers and easy to use. Method: By converting discrete models into algebraic models, tools from computational algebra can be used to analyze their dynamics. Specifically, we propose a method to identify attractors of a discrete model that is equivalent to solving a system of polynomial equations, a long-studied problem in computer algebra. Results: A method for efficiently identifying attractors, and the web-based tool Analysis of Dynamic Algebraic Models (ADAM), which provides this and other analysis methods for discrete models. ADAM converts several discrete model types automatically into polynomial dynamical systems and analyzes their dynamics using tools from computer algebra. Based on extensive experimentation with both discrete models arising in systems biology and randomly generated networks, we found that the algebraic algorithms presented in this manuscript are fast for systems with the structure maintained by most biological systems, namely sparseness, i.e., while the number of nodes in a biological network may be quite large, each node is affected only by a small number of other nodes, and robustness, i.e., small number of attractors.
△ Less
Submitted 8 May, 2011; v1 submitted 3 December, 2010;
originally announced December 2010.
-
A Mathematical Framework for Agent Based Models of Complex Biological Networks
Authors:
Franziska Hinkelmann,
David Murrugarra,
Abdul Salam Jarrah,
Reinhard Laubenbacher
Abstract:
Agent-based modeling and simulation is a useful method to study biological phenomena in a wide range of fields, from molecular biology to ecology. Since there is currently no agreed-upon standard way to specify such models it is not always easy to use published models. Also, since model descriptions are not usually given in mathematical terms, it is difficult to bring mathematical analysis tools t…
▽ More
Agent-based modeling and simulation is a useful method to study biological phenomena in a wide range of fields, from molecular biology to ecology. Since there is currently no agreed-upon standard way to specify such models it is not always easy to use published models. Also, since model descriptions are not usually given in mathematical terms, it is difficult to bring mathematical analysis tools to bear, so that models are typically studied through simulation. In order to address this issue, Grimm et al. proposed a protocol for model specification, the so-called ODD protocol, which provides a standard way to describe models. This paper proposes an addition to the ODD protocol which allows the description of an agent-based model as a dynamical system, which provides access to computational and theoretical tools for its analysis. The mathematical framework is that of algebraic models, that is, time-discrete dynamical systems with algebraic structure. It is shown by way of several examples how this mathematical specification can help with model analysis.
△ Less
Submitted 9 September, 2010; v1 submitted 2 June, 2010;
originally announced June 2010.
-
Boolean Models of Bistable Biological Systems
Authors:
Franziska Hinkelmann,
Reinhard Laubenbacher
Abstract:
This paper presents an algorithm for approximating certain types of dynamical systems given by a system of ordinary delay differential equations by a Boolean network model. Often Boolean models are much simpler to understand than complex differential equations models. The motivation for this work comes from mathematical systems biology. While Boolean mechanisms do not provide information about e…
▽ More
This paper presents an algorithm for approximating certain types of dynamical systems given by a system of ordinary delay differential equations by a Boolean network model. Often Boolean models are much simpler to understand than complex differential equations models. The motivation for this work comes from mathematical systems biology. While Boolean mechanisms do not provide information about exact concentration rates or time scales, they are often sufficient to capture steady states and other key dynamics. Due to their intuitive nature, such models are very appealing to researchers in the life sciences. This paper is focused on dynamical systems that exhibit bistability and are desc ribedby delay equations. It is shown that if a certain motif including a feedback loop is present in the wiring diagram of the system, the Boolean model captures the bistability of molecular switches. The method is appl ied to two examples from biology, the lac operon and the phage lambda lysis/lysogeny switch.
△ Less
Submitted 10 December, 2009;
originally announced December 2009.
-
Parameter estimation for Boolean models of biological networks
Authors:
Elena Dimitrova,
Luis David Garcia-Puente,
Franziska Hinkelmann,
Abdul S. Jarrah,
Reinhard Laubenbacher,
Brandilyn Stigler,
Michael Stillman,
Paola Vera-Licona
Abstract:
Boolean networks have long been used as models of molecular networks and play an increasingly important role in systems biology. This paper describes a software package, Polynome, offered as a web service, that helps users construct Boolean network models based on experimental data and biological input. The key feature is a discrete analog of parameter estimation for continuous models. With only…
▽ More
Boolean networks have long been used as models of molecular networks and play an increasingly important role in systems biology. This paper describes a software package, Polynome, offered as a web service, that helps users construct Boolean network models based on experimental data and biological input. The key feature is a discrete analog of parameter estimation for continuous models. With only experimental data as input, the software can be used as a tool for reverse-engineering of Boolean network models from experimental time course data.
△ Less
Submitted 20 August, 2009;
originally announced August 2009.
-
Inference of ecological interaction networks
Authors:
Paola Vera-Licona,
Reinhard Laubenbacher
Abstract:
The inference of the interactions between organisms in an ecosystem from observational data is an important problem in ecology. This paper presents a mathematical inference method, originally developed for the inference of biochemical networks in molecular biology, adapted for the inference of networks of ecological interactions. The method is applied to a network of invertebrate families (taxa)…
▽ More
The inference of the interactions between organisms in an ecosystem from observational data is an important problem in ecology. This paper presents a mathematical inference method, originally developed for the inference of biochemical networks in molecular biology, adapted for the inference of networks of ecological interactions. The method is applied to a network of invertebrate families (taxa) in a rice field.
△ Less
Submitted 9 May, 2008;
originally announced May 2008.
-
The Dynamics of Conjunctive and Disjunctive Boolean Networks
Authors:
Abdul Salam Jarrah,
Reinhard Laubenbacher,
Alan Veliz-Cuba
Abstract:
The relationship between the properties of a dynamical system and the structure of its defining equations has long been studied in many contexts. Here we study this problem for the class of conjunctive (resp. disjunctive) Boolean networks, that is, Boolean networks in which all Boolean functions are constructed with the AND (resp. OR) operator only. The main results of this paper describe networ…
▽ More
The relationship between the properties of a dynamical system and the structure of its defining equations has long been studied in many contexts. Here we study this problem for the class of conjunctive (resp. disjunctive) Boolean networks, that is, Boolean networks in which all Boolean functions are constructed with the AND (resp. OR) operator only. The main results of this paper describe network dynamics in terms of the structure of the network dependency graph (topology). For a given such network, all possible limit cycle lengths are computed and lower and upper bounds for the number of cycles of each length are given. In particular, the exact number of fixed points is obtained. The bounds are in terms of structural features of the dependency graph and its partially ordered set of strongly connected components. For networks with strongly connected dependency graph, the exact cycle structure is computed.
△ Less
Submitted 2 May, 2008;
originally announced May 2008.
-
Design of experiments and biochemical network inference
Authors:
Reinhard Laubenbacher,
Brandilyn Stigler
Abstract:
Design of experiments is a branch of statistics that aims to identify efficient procedures for planning experiments in order to optimize knowledge discovery. Network inference is a subfield of systems biology devoted to the identification of biochemical networks from experimental data. Common to both areas of research is their focus on the maximization of information gathered from experimentatio…
▽ More
Design of experiments is a branch of statistics that aims to identify efficient procedures for planning experiments in order to optimize knowledge discovery. Network inference is a subfield of systems biology devoted to the identification of biochemical networks from experimental data. Common to both areas of research is their focus on the maximization of information gathered from experimentation. The goal of this paper is to establish a connection between these two areas coming from the common use of polynomial models and techniques from computational algebra.
△ Less
Submitted 31 December, 2007;
originally announced January 2008.
-
Computer algebra in systems biology
Authors:
Reinhard Laubenbacher,
Bernd Sturmfels
Abstract:
Systems biology focuses on the study of entire biological systems rather than on their individual components. With the emergence of high-throughput data generation technologies for molecular biology and the development of advanced mathematical modeling techniques, this field promises to provide important new insights. At the same time, with the availability of increasingly powerful computers, co…
▽ More
Systems biology focuses on the study of entire biological systems rather than on their individual components. With the emergence of high-throughput data generation technologies for molecular biology and the development of advanced mathematical modeling techniques, this field promises to provide important new insights. At the same time, with the availability of increasingly powerful computers, computer algebra has developed into a useful tool for many applications. This article illustrates the use of computer algebra in systems biology by way of a well-known gene regulatory network, the Lac Operon in the bacterium E. coli.
△ Less
Submitted 18 December, 2008; v1 submitted 27 December, 2007;
originally announced December 2007.
-
The effect of negative feedback loops on the dynamics of Boolean networks
Authors:
Eduardo Sontag,
Alan Veliz-Cuba,
Reinhard Laubenbacher,
Abdul Salam Jarrah
Abstract:
Feedback loops in a dynamic network play an important role in determining the dynamics of that network. Through a computational study, in this paper we show that networks with fewer independent negative feedback loops tend to exhibit more regular behavior than those with more negative loops. To be precise, we study the relationship between the number of independent feedback loops and the number…
▽ More
Feedback loops in a dynamic network play an important role in determining the dynamics of that network. Through a computational study, in this paper we show that networks with fewer independent negative feedback loops tend to exhibit more regular behavior than those with more negative loops. To be precise, we study the relationship between the number of independent feedback loops and the number and length of the limit cycles in the phase space of dynamic Boolean networks. We show that, as the number of independent negative feedback loops increases, the number (length) of limit cycles tends to decrease (increase). These conclusions are consistent with the fact, for certain natural biological networks, that they on the one hand exhibit generally regular behavior and on the other hand show less negative feedback loops than randomized networks with the same numbers of nodes and connectivity.
△ Less
Submitted 20 November, 2007; v1 submitted 23 July, 2007;
originally announced July 2007.
-
Nested Canalyzing, unate cascade, and polynomial functions
Authors:
Abdul Salam Jarrah,
Blessilda Raposa,
Reinhard Laubenbacher
Abstract:
This paper focuses on the study of certain classes of Boolean functions that have appeared in several different contexts. Nested canalyzing functions have been studied recently in the context of Boolean network models of gene regulatory networks. In the same context, polynomial functions over finite fields have been used to develop network inference methods for gene regulatory networks. Finally,…
▽ More
This paper focuses on the study of certain classes of Boolean functions that have appeared in several different contexts. Nested canalyzing functions have been studied recently in the context of Boolean network models of gene regulatory networks. In the same context, polynomial functions over finite fields have been used to develop network inference methods for gene regulatory networks. Finally, unate cascade functions have been studied in the design of logic circuits and binary decision diagrams. This paper shows that the class of nested canalyzing functions is equal to that of unate cascade functions. Furthermore, it provides a description of nested canalyzing functions as a certain type of Boolean polynomial function. Using the polynomial framework one can show that the class of nested canalyzing functions, or, equivalently, the class of unate cascade functions, forms an algebraic variety which makes their analysis amenable to the use of techniques from algebraic geometry and computational algebra. As a corollary of the functional equivalence derived here, a formula in the literature for the number of unate cascade functions provides such a formula for the number of nested canalyzing functions.
△ Less
Submitted 25 July, 2007; v1 submitted 12 June, 2006;
originally announced June 2006.
-
Reverse-engineering of polynomial dynamical systems
Authors:
Abdul Salam Jarrah,
Reinhard Laubenbacher,
Brandilyn Stigler,
Michael Stillman
Abstract:
Multivariate polynomial dynamical systems over finite fields have been studied in several contexts, including engineering and mathematical biology. An important problem is to construct models of such systems from a partial specification of dynamic properties, e.g., from a collection of state transition measurements. Here, we consider static models, which are directed graphs that represent the ca…
▽ More
Multivariate polynomial dynamical systems over finite fields have been studied in several contexts, including engineering and mathematical biology. An important problem is to construct models of such systems from a partial specification of dynamic properties, e.g., from a collection of state transition measurements. Here, we consider static models, which are directed graphs that represent the causal relationships between system variables, so-called wiring diagrams. This paper contains an algorithm which computes all possible minimal wiring diagrams for a given set of state transition measurements. The paper also contains several statistical measures for model selection. The algorithm uses primary decomposition of monomial ideals as the principal tool. An application to the reverse-engineering of a gene regulatory network is included. The algorithm and the statistical measures are implemented in Macaulay2 and are available from the authors.
△ Less
Submitted 19 May, 2006;
originally announced May 2006.
-
Discretization of Time Series Data
Authors:
Elena S. Dimitrova,
John J. McGee,
Reinhard C. Laubenbacher
Abstract:
Data discretization, also known as binning, is a frequently used technique in computer science, statistics, and their applications to biological data analysis. We present a new method for the discretization of real-valued data into a finite number of discrete values. Novel aspects of the method are the incorporation of an information-theoretic criterion and a criterion to determine the optimal n…
▽ More
Data discretization, also known as binning, is a frequently used technique in computer science, statistics, and their applications to biological data analysis. We present a new method for the discretization of real-valued data into a finite number of discrete values. Novel aspects of the method are the incorporation of an information-theoretic criterion and a criterion to determine the optimal number of values. While the method can be used for data clustering, the motivation for its development is the need for a discretization algorithm for several multivariate time series of heterogeneous data, such as transcript, protein, and metabolite concentration measurements. As several modeling methods for biochemical networks employ discrete variable states, the method needs to preserve correlations between variables as well as the dynamic features of the time series. A C++ implementation of the algorithm is available from the authors at http://polymath.vbi.vt.edu/discretization .
△ Less
Submitted 29 August, 2005; v1 submitted 15 May, 2005;
originally announced May 2005.
-
A Computational Algebra Approach to the Reverse Engineering of Gene Regulatory Networks
Authors:
Reinhard Laubenbacher,
Brandilyn Stigler
Abstract:
This paper proposes a new method to reverse engineer gene regulatory networks from experimental data. The modeling framework used is time-discrete deterministic dynamical systems, with a finite set of states for each of the variables. The simplest examples of such models are Boolean networks, in which variables have only two possible states. The use of a larger number of possible states allows a…
▽ More
This paper proposes a new method to reverse engineer gene regulatory networks from experimental data. The modeling framework used is time-discrete deterministic dynamical systems, with a finite set of states for each of the variables. The simplest examples of such models are Boolean networks, in which variables have only two possible states. The use of a larger number of possible states allows a finer discretization of experimental data and more than one possible mode of action for the variables, depending on threshold values. Furthermore, with a suitable choice of state set, one can employ powerful tools from computational algebra, that underlie the reverse-engineering algorithm, avoiding costly enumeration strategies. To perform well, the algorithm requires wildtype together with perturbation time courses. This makes it suitable for small to meso-scale networks rather than networks on a genome-wide scale. The complexity of the algorithm is quadratic in the number of variables and cubic in the number of time points. The algorithm is validated on a recently published Boolean network model of segment polarity development in Drosophila melanogaster.
△ Less
Submitted 17 December, 2003;
originally announced December 2003.