-
The prebiotic emergence of biological evolution
Authors:
Charles D. Kocher,
Ken A. Dill
Abstract:
The origin of life must have been preceded by Darwin-like evolutionary dynamics that could propagate it. How did those adaptive dynamics arise? And from what prebiotic molecules? Using evolutionary invasion analysis, we develop a universal framework for describing any origin story for evolutionary dynamics. We find that cooperative autocatalysts, i.e., autocatalysts whose per-unit reproductive rate grows as their population increases, have the special property of being able to cross a barrier that separates their initial degradation-dominated state from a growth-dominated state with evolutionary dynamics. For some model parameters, this leap to persistent propagation is likely, not rare. We apply this analysis to the Foldcat Mechanism, wherein peptides fold and help catalyze the elongation of each other. Foldcats are found to exhibit cooperative autocatalysis and to be capable of emergent evolutionary dynamics.
Submitted 22 November, 2023;
originally announced November 2023.
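As a hedged illustration of the barrier described in the abstract above, the sketch below integrates a toy cooperative autocatalyst whose per-unit rate rises with its population. The functional form, the parameters `b`, `d`, and `K`, and the function names are assumptions chosen for illustration; they are not taken from the paper.

```python
def per_unit_rate(x, b=1.0, d=0.9, K=10.0):
    """Toy net per-unit rate: cooperative growth b*x (saturating at K) minus degradation d.

    All symbols here are hypothetical; the paper's own model may differ.
    """
    return b * x * (1.0 - x / K) - d


def simulate(x0, dt=0.01, steps=5000):
    """Forward-Euler integration of dx/dt = x * per_unit_rate(x)."""
    x = x0
    for _ in range(steps):
        x += dt * x * per_unit_rate(x)
        x = max(x, 0.0)  # populations cannot go negative
    return x


# With these parameters the per-unit rate is zero at x = 1 (the barrier)
# and at x = 9 (the growth-dominated steady state).
low = simulate(0.5)   # starts below the barrier: degradation wins
high = simulate(1.5)  # starts above the barrier: growth wins
print(low, high)
```

In this toy, a population that starts below the barrier decays toward zero, while one that starts above it settles near the upper fixed point. A non-cooperative autocatalyst (constant per-unit rate) would have no such barrier.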
-
Inferring a network from dynamical signals at its nodes
Authors:
Corey Weistuch,
Luca Agozzino,
Lilianne R. Mujica-Parodi,
Ken A. Dill
Abstract:
We give an approximate solution to the difficult inverse problem of inferring the topology of an unknown network from given time-dependent signals at the nodes. For example, we measure signals from individual neurons in the brain, and infer how they are inter-connected. We use Maximum Caliber as an inference principle. The combinatorial challenge of high-dimensional data is handled using two different approximations to the pairwise couplings. We show two proofs of principle: in a nonlinear genetic toggle switch circuit, and in a toy neural network.
Submitted 3 September, 2020; v1 submitted 5 April, 2020;
originally announced April 2020.
-
How did prebiotic polymers become informational foldamers?
Authors:
Elizaveta A Guseva,
Ronald N Zuckermann,
Ken A Dill
Abstract:
A mystery about the origins of life is which molecular structures, and what spontaneous processes, drove the autocatalytic transition from simple chemistry to biology. The HP lattice model of polymer sequence spaces leads to the prediction that random sequences of hydrophobic ($H$) and polar ($P$) monomers can collapse into relatively compact structures that expose hydrophobic surfaces and act as primitive versions of today's protein catalysts, elongating other such HP polymers as ribosomes do now. Such foldamer-catalysts form an autocatalytic set, growing short chains into longer chains that have particular sequences. The system has the capacity for multimodality: the ability to settle into multiple distinct quasi-stable states characterized by different groups of dominating polymers. This is a testable mechanism that we believe is relevant to the early origins of life.
Submitted 28 April, 2016;
originally announced April 2016.
-
Inferring transition rates on networks with incomplete knowledge
Authors:
Purushottam D. Dixit,
Abhinav Jain,
Gerhard Stock,
Ken A. Dill
Abstract:
Across many fields, a problem of interest is to predict the transition rates between nodes of a network, given limited stationary state and dynamical information. We give a solution using the principle of Maximum Caliber. We find the transition rate matrix by maximizing the path entropy of a random walker on the network, constrained to reproduce a stationary distribution and a few dynamical averages. A main finding here is that when constrained only by the mean jump rate, the rate $\omega_{ab}$ has a square-root dependence on $p_a$ and $p_b$, the stationary state populations at nodes $a$ and $b$: $\omega_{ab} \propto \sqrt{p_b/p_a}$. We give two examples of our approach. First, we show that this method correctly predicts the correlated rates in a biochemical network of two genes, where we know the exact results from prior simulation. Second, we show that it correctly predicts rates of peptide conformational transitions, when compared to molecular dynamics simulations. This method can be used to infer large numbers of rates on known networks where smaller numbers of steady-state node populations are known.
Submitted 6 April, 2015;
originally announced April 2015.
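The square-root rule quoted in the abstract above can be checked numerically. The sketch below builds rates proportional to $\sqrt{p_b/p_a}$ on a small, made-up three-node chain and confirms that the given populations are stationary under them. The function name `maxcal_rates`, the prefactor `scale` (which the mean jump rate would fix), and the example populations and edges are all assumptions for illustration.

```python
import math


def maxcal_rates(p, edges, scale=1.0):
    """Return rates w[a][b] = scale * sqrt(p[b] / p[a]) for each connected pair.

    `scale` is a hypothetical prefactor standing in for the mean-jump-rate constraint.
    """
    n = len(p)
    w = [[0.0] * n for _ in range(n)]
    for a, b in edges:
        w[a][b] = scale * math.sqrt(p[b] / p[a])
        w[b][a] = scale * math.sqrt(p[a] / p[b])
    return w


# Illustrative stationary populations on a 3-node chain 0 - 1 - 2.
p = [0.5, 0.3, 0.2]
edges = [(0, 1), (1, 2)]
w = maxcal_rates(p, edges)

# Net probability flow into each node; zero everywhere means p is stationary.
n = len(p)
net_flow = [
    sum(p[a] * w[a][b] for a in range(n)) - p[b] * sum(w[b])
    for b in range(n)
]
print(net_flow)
```

Stationarity holds here because $p_a \omega_{ab} = p_b \omega_{ba}$ (both are proportional to $\sqrt{p_a p_b}$), so the rule satisfies detailed balance on every edge.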
-
Simulated evolution of protein-protein interaction networks with realistic topology
Authors:
Jack Peterson,
Steve Presse,
Kristin S. Peterson,
Ken A. Dill
Abstract:
We model the evolution of eukaryotic protein-protein interaction (PPI) networks. In our model, PPI networks evolve by two known biological mechanisms: (1) Gene duplication, which is followed by rapid diversification of duplicate interactions. (2) Neofunctionalization, in which a mutation leads to a new interaction with some other protein. Since many interactions are due to simple surface compatibility, we hypothesize there is an increased likelihood of interacting with other proteins in the target protein's neighborhood. We find good agreement of the model on 10 different network properties compared to high-confidence experimental PPI networks in yeast, fruit flies, and humans. Key findings are: (1) PPI networks evolve modular structures, with no need to invoke particular selection pressures. (2) Proteins in cells have on average about 6 degrees of separation, similar to some social networks, such as human-communication and actor networks. (3) Unlike social networks, which have a shrinking diameter (degree of maximum separation) over time, PPI networks are predicted to grow in diameter. (4) The model indicates that evolutionarily old proteins should have higher connectivities and be more centrally embedded in their networks. This suggests a way in which present-day proteomics data could provide insights into biological evolution.
Submitted 6 January, 2015;
originally announced January 2015.
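The two growth mechanisms named in the abstract above can be sketched as a toy simulation loop. This is not the authors' model: the probabilities `p_keep` and `p_neo`, the two-protein seed network, and the way the neighborhood bias is implemented are all assumptions made only to show the general shape of a duplication-plus-neofunctionalization process.

```python
import random


def evolve_ppi(n_final, p_keep=0.4, p_neo=0.1, seed=0):
    """Grow a toy PPI network as an adjacency dict {protein: set(partners)}.

    p_keep: hypothetical chance a duplicate retains each parental interaction.
    p_neo:  hypothetical chance of one neofunctionalization event per step.
    """
    rng = random.Random(seed)
    adj = {0: {1}, 1: {0}}  # assumed seed network: two interacting proteins
    while len(adj) < n_final:
        # (1) Gene duplication, then divergence: the copy keeps only some edges.
        parent = rng.choice(sorted(adj))
        child = len(adj)
        adj[child] = set()
        for nb in sorted(adj[parent]):
            if rng.random() < p_keep:
                adj[child].add(nb)
                adj[nb].add(child)
        # (2) Neofunctionalization: a protein gains an interaction with a
        # target or, as a crude neighborhood bias, one of the target's partners.
        if rng.random() < p_neo:
            a = rng.choice(sorted(adj))
            target = rng.choice(sorted(adj))
            b = rng.choice([target] + sorted(adj[target]))
            if a != b:
                adj[a].add(b)
                adj[b].add(a)
    return adj


net = evolve_ppi(200)
degrees = [len(partners) for partners in net.values()]
print(len(net), max(degrees))
```

Network properties such as degree distribution, modularity, or diameter could then be measured on `net` and compared against experimental PPI data, which is the kind of comparison the abstract reports.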
-
Transition States in Protein Folding Kinetics: The Structural Interpretation of Phi-values
Authors:
Thomas R. Weikl,
Ken A. Dill
Abstract:
Phi-values are experimental measures of the effects of mutations on the folding kinetics of a protein. A central question is which structural information Phi-values contain about the transition state of folding. Traditionally, a Phi-value is interpreted as the 'nativeness' of a mutated residue in the transition state. However, this interpretation is often problematic because it assumes a linear relation between the nativeness of the residue and its free-energy contribution. We present here a better structural interpretation of Phi-values for mutations within a given helix. Our interpretation is based on a simple physical model that distinguishes between secondary and tertiary free-energy contributions of helical residues. From a linear fit of our model to the experimental data, we obtain two structural parameters: the extent of helix formation in the transition state, and the nativeness of tertiary interactions in the transition state. We apply our model to all proteins with well-characterized helices for which more than 10 Phi-values are available: protein A, CI2, and protein L. The model captures nonclassical Phi-values <0 or >1 in these helices, and explains how different mutations at a given site can lead to different Phi-values.
Submitted 30 May, 2006;
originally announced May 2006.
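For readers unfamiliar with the quantity discussed in the abstract above, the sketch below computes a Phi-value under its conventional definition: the mutation-induced change in the folding activation free energy divided by the change in native-state stability. The function name, argument names, units, and example numbers are assumptions for illustration; the structural model of the paper itself is not reproduced here.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol K)


def phi_value(k_wt, k_mut, ddg_native, temperature=298.0):
    """Conventional Phi-value from folding rates and stability change.

    k_wt, k_mut: folding rate constants of wild type and mutant (same units).
    ddg_native:  change in folding stability on mutation in kcal/mol,
                 positive when the mutation destabilizes the native state.
    """
    # Change in activation free energy implied by the change in folding rate.
    ddg_barrier = R_KCAL * temperature * math.log(k_wt / k_mut)
    return ddg_barrier / ddg_native


# Mutation that halves the folding rate and destabilizes by 1 kcal/mol.
classical = phi_value(k_wt=100.0, k_mut=50.0, ddg_native=1.0)

# Destabilizing mutation that nevertheless speeds up folding: Phi < 0.
nonclassical = phi_value(k_wt=100.0, k_mut=200.0, ddg_native=1.0)
print(classical, nonclassical)
```

The first case gives a Phi-value of roughly 0.41, inside the classical range between 0 and 1. The second gives a negative, nonclassical value of the kind the abstract says the model captures.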
-
Phi-values in protein folding kinetics have energetic and structural components
Authors:
Claudia Merlo,
Ken A. Dill,
Thomas R. Weikl
Abstract:
Phi-values are experimental measures of how the kinetics of protein folding is changed by single-site mutations. Phi-values measure energetic quantities, but are often interpreted in terms of the structures of the transition state ensemble. Here we describe a simple analytical model of the folding kinetics in terms of the formation of protein substructures. The model shows that Phi-values have both structural and energetic components. In addition, it provides a natural and general interpretation of "nonclassical" Phi-values (i.e., less than zero, or greater than one). The model reproduces the Phi-values for 20 single-residue mutations in the alpha-helix of the protein CI2, including several nonclassical Phi-values, in good agreement with experiments.
Submitted 14 July, 2005;
originally announced July 2005.
-
Cooperativity in two-state protein folding kinetics
Authors:
Thomas R. Weikl,
Matteo Palassini,
Ken A. Dill
Abstract:
We present a solvable model that predicts the folding kinetics of two-state proteins from their native structures. The model is based on conditional chain entropies. It assumes that folding processes are dominated by small-loop closure events that can be inferred from native structures. For CI2, the src SH3 domain, TNfn3, and protein L, the model reproduces two-state kinetics, and it accurately predicts the average Phi-values for secondary structures. The barrier to folding is the formation of predominantly local structures such as helices and hairpins, which are needed to bring nonlocal pairs of amino acids into contact.
Submitted 6 November, 2003;
originally announced November 2003.
-
Symmetry and designability for lattice protein models
Authors:
Tairan Wang,
Jonathan Miller,
Ned S. Wingreen,
Chao Tang,
Ken A. Dill
Abstract:
Native protein folds often have a high degree of symmetry. We study the relationship between the symmetries of native proteins and their designabilities -- how many different sequences encode a given native structure. Using a two-dimensional lattice protein model based on hydrophobicity, we find that those native structures that are encoded by the largest number of different sequences have high symmetry. However, only certain symmetries are enhanced, e.g., x/y-mirror symmetry and $180^\circ$ rotation, while others are suppressed. If it takes a large number of mutations to destabilize the native state of a protein, then, by definition, the state is highly designable. Hence, our findings imply that insensitivity to mutation implies high symmetry. It appears that the relationship between designability and symmetry arises because protein substructures are also designable. Native protein folds may therefore be symmetric because they are composed of repeated designable substructures.
Submitted 23 June, 2000;
originally announced June 2000.