-
Frustration, dynamics and catalysis
Authors:
R. Gonzalo Parra,
Diego U. Ferreiro
Abstract:
The controlled dissipation of chemical potentials is the fundamental way cells make a living. Enzyme-mediated catalysis allows the various transformations to proceed at biologically relevant rates with remarkable precision and efficiency. Theory, experiments and computational studies coincide to show that local frustration is a useful concept to relate protein dynamics with catalytic power. Local frustration gives rise to the asperities of the energy landscapes that can harness the thermal fluctuations to guide the functional protein motions. We review here recent advances into these relationships from various fields of protein science. The biologically relevant dynamics is tuned by the evolution of protein sequences that modulate the local frustration patterns to near optimal values.
Submitted 7 July, 2025; v1 submitted 1 May, 2025;
originally announced May 2025.
-
Frustration In Physiology And Molecular Medicine
Authors:
R. Gonzalo Parra,
Elizabeth A. Komives,
Peter G. Wolynes,
Diego U. Ferreiro
Abstract:
Molecules provide the ultimate language in terms of which physiology and pathology must be understood. Myriads of proteins participate in elaborate networks of interactions and perform chemical activities coordinating the life of cells. To perform these often amazing tasks, proteins must move, and we must think of them as dynamic ensembles of three-dimensional structures formed first by folding the polypeptide chains so as to minimize the conflicts between the interactions of their constituent amino acids. It is apparent, however, that even when completely folded, not all conflicting interactions have been resolved, so the structure remains "locally frustrated". Over the last decades it has become clearer that this local frustration is not just a random accident but plays an essential part in the inner workings of protein molecules. We review here the physical origins of the frustration concept and survey evidence that local frustration is important for protein physiology, protein-protein recognition, catalysis and allostery. We also highlight examples showing how alterations in the local frustration patterns can be linked to distinct pathologies. Finally, we explore how the impact of frustration extends to higher-order levels of organization, including gene regulatory networks and the neural networks of the brain.
Submitted 6 February, 2025;
originally announced February 2025.
-
Inferring repeat protein energetics from evolutionary information
Authors:
Rocío Espada,
R. Gonzalo Parra,
Thierry Mora,
Aleksandra M. Walczak,
Diego U. Ferreiro
Abstract:
Natural protein sequences contain a record of their history. A common constraint in a given protein family is the ability to fold to specific structures, and it has been shown possible to infer the main native ensemble by analyzing covariations in extant sequences. Still, many natural proteins that fold into the same structural topology show different stabilization energies, and these are often related to their physiological behavior. We propose a description for the energetic variation given by sequence modifications in repeat proteins, systems for which the overall problem is simplified by their inherent symmetry. We explicitly account for single amino acid and pair-wise interactions and treat higher order correlations with a single term. We show that the resulting force field can be interpreted with structural detail. We trace the variations in the energetic scores of natural proteins and relate them to their experimental characterization. The resulting energetic force field allows the prediction of the folding free energy change for several mutants, and can be used to generate synthetic sequences that are statistically indistinguishable from the natural counterparts.
Submitted 15 March, 2017; v1 submitted 9 March, 2017;
originally announced March 2017.
-
Protein Repeats from First Principles
Authors:
Pablo Turjanski,
R. Gonzalo Parra,
Rocío Espada,
Verónica Becher,
Diego U. Ferreiro
Abstract:
Some natural proteins display recurrent structural patterns. Despite being highly similar at the tertiary structure level, repetitions within a single repeat protein can be extremely variable at the sequence level. We propose a mathematical definition of a repeat and investigate the occurrences of these in different protein families. We found that long stretches of perfect repetitions are infrequent in individual natural proteins, even for those which are known to fold into structures of recurrent structural motifs. We found that natural repeat proteins are indeed repetitive in their families, exhibiting abundant stretches of 6 amino acids or longer that are perfect repetitions in the reference family. We provide a systematic quantification of this repetitiveness, and show that this form of repetitiveness is not exclusive to repeat proteins, but also occurs in globular domains. A by-product of this work is a fast classifier of proteins into families, which yields a likelihood value for a given protein belonging to a given family.
Submitted 8 October, 2015;
originally announced October 2015.
-
Capturing coevolutionary signals in repeat proteins
Authors:
Rocío Espada,
R. Gonzalo Parra,
Thierry Mora,
Aleksandra M. Walczak,
Diego Ferreiro
Abstract:
The analysis of correlations of amino acid occurrences in globular proteins has led to the development of statistical tools that can identify native contacts -- portions of the chains that come to close distance in folded structural ensembles. Here we introduce a statistical coupling analysis for repeat proteins -- natural systems for which the identification of domains remains challenging. We show that the inherent translational symmetry of repeat protein sequences introduces a strong bias in the pair correlations at precisely the length scale of the repeat-unit. Equalizing for this bias reveals true co-evolutionary signals from which local native-contacts can be identified. Importantly, parameter values obtained for all other interactions are not significantly affected by the equalization. We quantify the robustness of the procedure and assign confidence levels to the interactions, identifying the minimum number of sequences needed to extract evolutionary information in several repeat protein families. The overall procedure can be used to reconstruct the interactions at long distances, identifying the characteristics of the strongest couplings in each family, and can be applied to any system that appears translationally symmetric.
Submitted 25 July, 2014;
originally announced July 2014.
-
Detecting Repetitions and Periodicities in Proteins by Tiling the Structural Space
Authors:
R. Gonzalo Parra,
Rocío Espada,
Ignacio E. Sánchez,
Manfred J. Sippl,
Diego U. Ferreiro
Abstract:
The notion of energy landscapes provides conceptual tools for understanding the complexities of protein folding and function. Energy Landscape Theory indicates that it is much easier to find sequences that satisfy the "Principle of Minimal Frustration" when the folded structure is symmetric (Wolynes, P. G. Symmetry and the Energy Landscapes of Biomolecules. Proc. Natl. Acad. Sci. U.S.A. 1996, 93, 14249-14255). Similarly, repeats and structural mosaics may be fundamentally related to landscapes with multiple embedded funnels. Here we present analytical tools to detect and compare structural repetitions in protein molecules. By an exhaustive analysis of the distribution of structural repeats using a robust metric we define those portions of a protein molecule that best describe the overall structure as a tessellation of basic units. The patterns produced by such tessellations provide intuitive representations of the repeating regions and their association towards higher order arrangements. We find that some protein architectures can be described as nearly periodic, while in others clear separations between repetitions exist. Since the method is independent of amino acid sequence information we can identify structural units that can be encoded by a variety of distinct amino acid sequences.
Submitted 12 June, 2013;
originally announced June 2013.