-
Estimating Cellular Goals from High-Dimensional Biological Data
Authors:
Laurence Yang,
Michael A. Saunders,
Jean-Christophe Lachance,
Bernhard O. Palsson,
José Bento
Abstract:
Optimization-based models have been used to predict cellular behavior for over 25 years. The constraints in these models are derived from genome annotations, measured macro-molecular composition of cells, and by measuring the cell's growth rate and metabolism in different conditions. The cellular goal (the optimization problem that the cell is trying to solve) can be challenging to derive experime…
▽ More
Optimization-based models have been used to predict cellular behavior for over 25 years. The constraints in these models are derived from genome annotations, measured macro-molecular composition of cells, and by measuring the cell's growth rate and metabolism in different conditions. The cellular goal (the optimization problem that the cell is trying to solve) can be challenging to derive experimentally for many organisms, including human or mammalian cells, which have complex metabolic capabilities and are not well understood. Existing approaches to learning goals from data include (a) estimating a linear objective function, or (b) estimating linear constraints that model complex biochemical reactions and constrain the cell's operation. The latter approach is important because often the known/observed biochemical reactions are not enough to explain observations, and hence there is a need to extend automatically the model complexity by learning new chemical reactions. However, this leads to nonconvex optimization problems, and existing tools cannot scale to realistically large metabolic models. Hence, constraint estimation is still used sparingly despite its benefits for modeling cell metabolism, which is important for developing novel antimicrobials against pathogens, discovering cancer drug targets, and producing value-added chemicals. Here, we develop the first approach to estimating constraint reactions from data that can scale to realistically large metabolic models. Previous tools have been used on problems having less than 75 biochemical reactions and 60 metabolites, which limits real-life-size applications. We perform extensive experiments using 75 large-scale metabolic network models for different organisms (including bacteria, yeasts, and mammals) and show that our algorithm can recover cellular constraint reactions, even when some measurements are missing.
△ Less
Submitted 20 May, 2019; v1 submitted 11 July, 2018;
originally announced July 2018.
-
Creation and analysis of biochemical constraint-based models: the COBRA Toolbox v3.0
Authors:
Laurent Heirendt,
Sylvain Arreckx,
Thomas Pfau,
Sebastián N. Mendoza,
Anne Richelle,
Almut Heinken,
Hulda S. Haraldsdóttir,
Jacek Wachowiak,
Sarah M. Keating,
Vanja Vlasov,
Stefania Magnusdóttir,
Chiam Yu Ng,
German Preciat,
Alise Žagare,
Siu H. J. Chan,
Maike K. Aurich,
Catherine M. Clancy,
Jennifer Modamio,
John T. Sauls,
Alberto Noronha,
Aarash Bordbar,
Benjamin Cousins,
Diana C. El Assal,
Luis V. Valcarcel,
Iñigo Apaolaza
, et al. (30 additional authors not shown)
Abstract:
COnstraint-Based Reconstruction and Analysis (COBRA) provides a molecular mechanistic framework for integrative analysis of experimental data and quantitative prediction of physicochemically and biochemically feasible phenotypic states. The COBRA Toolbox is a comprehensive software suite of interoperable COBRA methods. It has found widespread applications in biology, biomedicine, and biotechnology…
▽ More
COnstraint-Based Reconstruction and Analysis (COBRA) provides a molecular mechanistic framework for integrative analysis of experimental data and quantitative prediction of physicochemically and biochemically feasible phenotypic states. The COBRA Toolbox is a comprehensive software suite of interoperable COBRA methods. It has found widespread applications in biology, biomedicine, and biotechnology because its functions can be flexibly combined to implement tailored COBRA protocols for any biochemical network. Version 3.0 includes new methods for quality controlled reconstruction, modelling, topological analysis, strain and experimental design, network visualisation as well as network integration of chemoinformatic, metabolomic, transcriptomic, proteomic, and thermochemical data. New multi-lingual code integration also enables an expansion in COBRA application scope via high-precision, high-performance, and nonlinear numerical optimisation solvers for multi-scale, multi-cellular and reaction kinetic modelling, respectively. This protocol can be adapted for the generation and analysis of a constraint-based model in a wide variety of molecular systems biology scenarios. This protocol is an update to the COBRA Toolbox 1.0 and 2.0. The COBRA Toolbox 3.0 provides an unparalleled depth of constraint-based reconstruction and analysis methods.
△ Less
Submitted 23 February, 2018; v1 submitted 11 October, 2017;
originally announced October 2017.
-
Reliable and efficient solution of genome-scale models of Metabolism and macromolecular Expression
Authors:
Ding Ma,
Laurence Yang,
Ronan M. T. Fleming,
Ines Thiele,
Bernhard O. Palsson,
Michael A. Saunders
Abstract:
Constraint-Based Reconstruction and Analysis (COBRA) is currently the only methodology that permits integrated modeling of Metabolism and macromolecular Expression (ME) at genome-scale. Linear optimization computes steady-state flux solutions to ME models, but flux values are spread over many orders of magnitude. Standard double-precision solvers may return inaccurate solutions or report that no s…
▽ More
Constraint-Based Reconstruction and Analysis (COBRA) is currently the only methodology that permits integrated modeling of Metabolism and macromolecular Expression (ME) at genome-scale. Linear optimization computes steady-state flux solutions to ME models, but flux values are spread over many orders of magnitude. Standard double-precision solvers may return inaccurate solutions or report that no solution exists. Exact simplex solvers are extremely slow and hence not practical for ME models that currently have 70,000 constraints and variables and will grow larger. We have developed a quadruple-precision version of our linear and nonlinear optimizer MINOS, and a solution procedure (DQQ) involving Double and Quad MINOS that achieves efficiency and reliability for ME models. DQQ enables extensive use of large, multiscale, linear and nonlinear models in systems biology and many other applications.
△ Less
Submitted 26 September, 2016; v1 submitted 31 May, 2016;
originally announced June 2016.
-
A variational principle for computing nonequilibrium fluxes and potentials in genome-scale biochemical networks
Authors:
Ronan M. T. Fleming,
Christopher M. Maes,
Michael A. Saunders,
Yinyu Ye,
Bernhard Ø. Palsson
Abstract:
We derive a convex optimization problem on a steady-state nonequilibrium network of biochemical reactions, with the property that energy conservation and the second law of thermodynamics both hold at the problem solution. This suggests a new variational principle for biochemical networks that can be implemented in a computationally tractable manner. We derive the Lagrange dual of the optimization…
▽ More
We derive a convex optimization problem on a steady-state nonequilibrium network of biochemical reactions, with the property that energy conservation and the second law of thermodynamics both hold at the problem solution. This suggests a new variational principle for biochemical networks that can be implemented in a computationally tractable manner. We derive the Lagrange dual of the optimization problem and use strong duality to demonstrate that a biochemical analogue of Tellegen's theorem holds at optimality. Each optimal flux is dependent on a free parameter that we relate to an elementary kinetic parameter when mass action kinetics is assumed.
△ Less
Submitted 19 September, 2011; v1 submitted 8 May, 2011;
originally announced May 2011.