-
PLUMED Tutorials: a collaborative, community-driven learning ecosystem
Authors:
Gareth A. Tribello,
Massimiliano Bonomi,
Giovanni Bussi,
Carlo Camilloni,
Blake I. Armstrong,
Andrea Arsiccio,
Simone Aureli,
Federico Ballabio,
Mattia Bernetti,
Luigi Bonati,
Samuel G. H. Brookes,
Z. Faidon Brotzakis,
Riccardo Capelli,
Michele Ceriotti,
Kam-Tung Chan,
Pilar Cossio,
Siva Dasetty,
Davide Donadio,
Bernd Ensing,
Andrew L. Ferguson,
Guillaume Fraux,
Julian D. Gale,
Francesco Luigi Gervasio,
Toni Giorgino,
Nicholas S. M. Herringer
, et al. (38 additional authors not shown)
Abstract:
In computational physics, chemistry, and biology, the implementation of new techniques in a shared and open source software lowers barriers to entry and promotes rapid scientific progress. However, effectively training new software users presents several challenges. Common methods like direct knowledge transfer and in-person workshops are limited in reach and comprehensiveness. Furthermore, while…
▽ More
In computational physics, chemistry, and biology, the implementation of new techniques in a shared and open source software lowers barriers to entry and promotes rapid scientific progress. However, effectively training new software users presents several challenges. Common methods like direct knowledge transfer and in-person workshops are limited in reach and comprehensiveness. Furthermore, while the COVID-19 pandemic highlighted the benefits of online training, traditional online tutorials can quickly become outdated and may not cover all the software's functionalities. To address these issues, here we introduce ``PLUMED Tutorials'', a collaborative model for developing, sharing, and updating online tutorials. This initiative utilizes repository management and continuous integration to ensure compatibility with software updates. Moreover, the tutorials are interconnected to form a structured learning path and are enriched with automatic annotations to provide broader context. This paper illustrates the development, features, and advantages of PLUMED Tutorials, aiming to foster an open community for creating and sharing educational resources.
△ Less
Submitted 29 November, 2024;
originally announced December 2024.
-
A practical guide to the simultaneous determination of protein structure and dynamics using metainference
Authors:
Thomas Löhr,
Carlo Camilloni,
Massimiliano Bonomi,
Michele Vendruscolo
Abstract:
Accurate protein structural ensembles can be determined with metainference, a Bayesian inference method that integrates experimental information with prior knowledge of the system and deals with all sources of uncertainty and errors as well as with system heterogeneity. Furthermore, metainference can be implemented using the metadynamics approach, which enables the computational study of complex b…
▽ More
Accurate protein structural ensembles can be determined with metainference, a Bayesian inference method that integrates experimental information with prior knowledge of the system and deals with all sources of uncertainty and errors as well as with system heterogeneity. Furthermore, metainference can be implemented using the metadynamics approach, which enables the computational study of complex biological systems requiring extensive conformational sampling. In this chapter, we provide a step-by-step guide to perform and analyse metadynamic metainference simulations using the ISDB module of the open-source PLUMED library, as well as a series of practical tips to avoid common mistakes. Specifically, we will guide the reader in the process of learning how to model the structural ensemble of a small disordered peptide by combining state-of-the-art molecular mechanics force fields with nuclear magnetic resonance data, including chemical shifts, scalar couplings and residual dipolar couplings.
△ Less
Submitted 23 January, 2019;
originally announced January 2019.
-
A method for partitioning the information contained in a protein sequence between its structure and function
Authors:
A. Possenti,
M. Vendruscolo,
C. Camilloni,
G. Tiana
Abstract:
Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We developed a computational algorithm…
▽ More
Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We developed a computational algorithm to evaluate the amount of information necessary to specify the protein structure, keeping into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the 'information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize designed proteins sequences.
△ Less
Submitted 23 May, 2018;
originally announced May 2018.
-
An implementation of the maximum-caliber principle by replica-averaged time-resolved restrained simulations
Authors:
Riccardo Capelli,
Guido Tiana,
Carlo Camilloni
Abstract:
Inferential methods can be used to integrate experimental informations and molecular simulations. The maximum entropy principle provides a framework for using equilibrium experimental data and it has been shown that replica-averaged simulations, restrained using a static potential, are a practical and powerful implementation of such principle. Here we show that replica-averaged simulations restrai…
▽ More
Inferential methods can be used to integrate experimental informations and molecular simulations. The maximum entropy principle provides a framework for using equilibrium experimental data and it has been shown that replica-averaged simulations, restrained using a static potential, are a practical and powerful implementation of such principle. Here we show that replica-averaged simulations restrained using a time-dependent potential are equivalent to the principle of maximum caliber, the dynamic version of the principle of maximum entropy, and thus may allow to integrate time-resolved data in molecular dynamics simulations. We provide an analytical proof of the equivalence as well as a computational validation making use of simple models and synthetic data. Some limitations and possible solutions are also discussed.
△ Less
Submitted 24 April, 2018; v1 submitted 19 February, 2018;
originally announced February 2018.
-
Properties of low-dimensional collective variables in the molecular dynamics of biopolymers
Authors:
R. Meloni,
C. Camilloni,
G. Tiana
Abstract:
The description of the dynamics of a complex, high-dimensional system in terms of a low-dimensional set of collective variables Y can be fruitful if the low dimensional representation satisfies a Langevin equation with drift and diffusion coefficients which depend only on Y. We present a computational scheme to evaluate whether a given collective variable provides a faithful low-dimensional repres…
▽ More
The description of the dynamics of a complex, high-dimensional system in terms of a low-dimensional set of collective variables Y can be fruitful if the low dimensional representation satisfies a Langevin equation with drift and diffusion coefficients which depend only on Y. We present a computational scheme to evaluate whether a given collective variable provides a faithful low-dimensional representation of the dynamics of a high-dimensional system. The scheme is based on the framework of finite-difference Langevin-equation, similar to that used for molecular-dynamics simulations. This allows one to calculate the drift and diffusion coefficients in any point of the full-dimensional system. The width of the distribution of drift and diffusion coefficients in an ensemble of microscopic points at the same value of Y indicates to which extent the dynamics of Y is described by a simple Langevin equation. Using a simple protein model we show that collective variables often used to describe biopolymers display a non-negligible width both in the drift and in the diffusion coefficients. We also show that the associated effective force is compatible with the equilibrium free--energy calculated from a microscopic sampling, but results in markedly different dynamical properties.
△ Less
Submitted 28 November, 2016; v1 submitted 1 September, 2016;
originally announced September 2016.
-
Metainference: A Bayesian Inference Method for Heterogeneous Systems
Authors:
Massimiliano Bonomi,
Carlo Camilloni,
Andrea Cavalli,
Michele Vendruscolo
Abstract:
Modelling a complex system is almost invariably a challenging task. The incorporation of experimental observations can be used to improve the quality of a model, and thus to obtain better predictions about the behavior of the corresponding system. This approach, however, is affected by a variety of different errors, especially when a system populates simultaneously an ensemble of different states…
▽ More
Modelling a complex system is almost invariably a challenging task. The incorporation of experimental observations can be used to improve the quality of a model, and thus to obtain better predictions about the behavior of the corresponding system. This approach, however, is affected by a variety of different errors, especially when a system populates simultaneously an ensemble of different states and experimental data are measured as averages over such states. To address this problem we present a Bayesian inference method, called metainference, that is able to deal with errors in experimental measurements as well as with experimental measurements averaged over multiple states. To achieve this goal, metainference models a finite sample of the distribution of models using a replica approach, in the spirit of the replica-averaging modelling based on the maximum entropy principle. To illustrate the method we present its application to a heterogeneous model system and to the determination of an ensemble of structures corresponding to the thermal fluctuations of a protein molecule. Metainference thus provides an approach to model complex systems with heterogeneous components and interconverting between different states by taking into account all possible sources of errors.
△ Less
Submitted 23 November, 2015; v1 submitted 18 September, 2015;
originally announced September 2015.
-
PLUMED 2: New feathers for an old bird
Authors:
Gareth A. Tribello,
Massimiliano Bonomi,
Davide Branduardi,
Carlo Camilloni,
Giovanni Bussi
Abstract:
Enhancing sampling and analyzing simulations are central issues in molecular simulation. Recently, we introduced PLUMED, an open-source plug-in that provides some of the most popular molecular dynamics (MD) codes with implementations of a variety of different enhanced sampling algorithms and collective variables (CVs). The rapid changes in this field, in particular new directions in enhanced sampl…
▽ More
Enhancing sampling and analyzing simulations are central issues in molecular simulation. Recently, we introduced PLUMED, an open-source plug-in that provides some of the most popular molecular dynamics (MD) codes with implementations of a variety of different enhanced sampling algorithms and collective variables (CVs). The rapid changes in this field, in particular new directions in enhanced sampling and dimensionality reduction together with new hardwares, require a code that is more flexible and more efficient. We therefore present PLUMED 2 here - a complete rewrite of the code in an object-oriented programming language (C++). This new version introduces greater flexibility and greater modularity, which both extends its core capabilities and makes it far easier to add new methods and CVs. It also has a simpler interface with the MD engines and provides a single software library containing both tools and core facilities. Ultimately, the new code better serves the ever-growing community of users and contributors in coping with the new challenges arising in the field.
△ Less
Submitted 3 October, 2013;
originally announced October 2013.
-
Ratcheted molecular-dynamics simulations identify efficiently the transition state of protein folding
Authors:
Guido Tiana,
Carlo Camilloni
Abstract:
The atomistic characterization of the transition state is a fundamental step to improve the understanding of the folding mechanism and the function of proteins. From a computational point of view, the identification of the conformations that build out the transition state is particularly cumbersome, mainly because of the large computational cost of generating a statistically-sound set of folding t…
▽ More
The atomistic characterization of the transition state is a fundamental step to improve the understanding of the folding mechanism and the function of proteins. From a computational point of view, the identification of the conformations that build out the transition state is particularly cumbersome, mainly because of the large computational cost of generating a statistically-sound set of folding trajectories. Here we show that a biasing algorithm, based on the physics of the ratchet-and-pawl, can be used to identify efficiently the transition state. The basic idea is that the algorithmic ratchet exerts a force on the protein when it is climbing the free-energy barrier, while it is inactive when it is descending. The transition state can be identified as the point of the trajectory where the ratchet changes regime. Besides discussing this strategy in general terms, we test it within a protein model whose transition state can be studied independently by plain molecular dynamics simulations. Finally, we show its power in explicit-solvent simulations, obtaining and characterizing a set of transition--state conformations for ACBP and CI2.
△ Less
Submitted 5 July, 2012;
originally announced July 2012.
-
Lymphotactin: how a protein can adopt two folds
Authors:
Carlo Camilloni,
Ludovico Sutto
Abstract:
Metamorphic proteins like Lymphotactin are a notable exception of the empirical principle that structured natural proteins possess a unique three dimensional structure. In particular, the human chemokine lymphotactin protein (Ltn) exists in two distinct conformations (one monomeric and one dimeric) under physiological conditions. In this work we use a Ca Go-model to show how this very peculiar b…
▽ More
Metamorphic proteins like Lymphotactin are a notable exception of the empirical principle that structured natural proteins possess a unique three dimensional structure. In particular, the human chemokine lymphotactin protein (Ltn) exists in two distinct conformations (one monomeric and one dimeric) under physiological conditions. In this work we use a Ca Go-model to show how this very peculiar behavior can be reproduced. From the study of the thermodynamics and of the kinetics we characterize the interconversion mechanism. In particular, this takes place through the docking of the two chains living in a third monomeric, partially unfolded, state which shows a residual structure involving a set of local contacts common to the two native conformations. The main feature of two-fold proteins appears to be the sharing of a common set of local contacts between the two distinct folds as confirmed by the study of two designed two-fold proteins. Metamorphic proteins may be more common than expected.
△ Less
Submitted 25 September, 2009;
originally announced September 2009.
-
Atomic-detailed milestones along the folding trajectory of protein G
Authors:
C. Camilloni,
G. Tiana,
R. A. Broglia
Abstract:
The high computational cost of carrying out molecular dynamics simulations of even small-size proteins is a major obstacle in the study, at atomic detail and in explicit solvent, of the physical mechanism which is at the basis of the folding of proteins. Making use of a biasing algorithm, based on the principle of the ratchet-and-pawl, we have been able to calculate eight folding trajectories (t…
▽ More
The high computational cost of carrying out molecular dynamics simulations of even small-size proteins is a major obstacle in the study, at atomic detail and in explicit solvent, of the physical mechanism which is at the basis of the folding of proteins. Making use of a biasing algorithm, based on the principle of the ratchet-and-pawl, we have been able to calculate eight folding trajectories (to an RMSD between 1.2A and 2.5A) of the B1 domain of protein G in explicit solvent without the need of high-performance computing. The simulations show that in the denatured state there is a complex network of cause-effect relationships among contacts, which results in a rather hierarchical folding mechanism. The network displays few local and nonlocal native contacts which are cause of most of the others, in agreement with the NOE signals obtained in mildly-denatured conditions. Also nonnative contacts play an active role in the folding kinetics. The set of conformations corresponding to the transition state display phi-values with a correlation coefficient of 0.69 with the experimental ones. They are structurally quite homogeneous and topologically native-like, although some of the side chains and most of the hydrogen bonds are not in place.
△ Less
Submitted 18 May, 2009;
originally announced May 2009.
-
PLUMED: a portable plugin for free-energy calculations with molecular dynamics
Authors:
M. Bonomi,
D. Branduardi,
G. Bussi,
C. Camilloni,
D. Provasi,
P. Raiteri,
D. Donadio,
F. Marinelli,
F. Pietrucci,
R. A. Broglia,
M. Parrinello
Abstract:
Here we present a program aimed at free-energy calculations in molecular systems. It consists of a series of routines that can be interfaced with the most popular classical molecular dynamics (MD) codes through a simple patching procedure. This leaves the possibility for the user to exploit many different MD engines depending on the system simulated and on the computational resources available.…
▽ More
Here we present a program aimed at free-energy calculations in molecular systems. It consists of a series of routines that can be interfaced with the most popular classical molecular dynamics (MD) codes through a simple patching procedure. This leaves the possibility for the user to exploit many different MD engines depending on the system simulated and on the computational resources available. Free-energy calculations can be performed as a function of many collective variables, with a particular focus on biological problems, and using state-of-the-art methods such as metadynamics, umbrella sampling and Jarzynski-equation based steered MD. The present software, written in ANSI-C language, can be easily interfaced with both fortran and C/C++ codes.
△ Less
Submitted 30 April, 2009; v1 submitted 5 February, 2009;
originally announced February 2009.
-
Metadynamic sampling of the free energy landscapes of proteins coupled with a Monte Carlo algorithm
Authors:
F. Marini,
C. Camilloni,
D. Provasi,
R. A. Broglia,
G. Tiana
Abstract:
Metadynamics is a powerful computational tool to obtain the free energy landscape of complex systems. The Monte Carlo algorithm has proven useful to calculate thermodynamic quantities associated with simplified models of proteins, and thus to gain an ever-increasing understanding on the general principles underlying the mechanism of protein folding. We show that it is possible to couple metadyna…
▽ More
Metadynamics is a powerful computational tool to obtain the free energy landscape of complex systems. The Monte Carlo algorithm has proven useful to calculate thermodynamic quantities associated with simplified models of proteins, and thus to gain an ever-increasing understanding on the general principles underlying the mechanism of protein folding. We show that it is possible to couple metadynamics and Monte Carlo algorithms to obtain the free energy of model proteins in a way which is computationally very economical.
△ Less
Submitted 3 October, 2007;
originally announced October 2007.
-
Exploring the Protein G Helix Free Energy Surface by Solute Tempering Metadynamics
Authors:
C. Camilloni,
D. Provasi,
G. Tiana,
R. A. Broglia
Abstract:
The free-energy landscape of the alpha-helix of protein G is studied by means of metadynamics coupled with a solute tempering algorithm. Metadynamics allows to overcome large energy barriers, whereas solute tempering improves the sampling with an affordable computational effort. From the sampled free-energy surface we are able to reproduce a number of experimental observations, such as the fact…
▽ More
The free-energy landscape of the alpha-helix of protein G is studied by means of metadynamics coupled with a solute tempering algorithm. Metadynamics allows to overcome large energy barriers, whereas solute tempering improves the sampling with an affordable computational effort. From the sampled free-energy surface we are able to reproduce a number of experimental observations, such as the fact that the lowest minimum corresponds to a globular conformation displaying some degree of beta-structure, that the helical state is metastable and involves only 65% of the chain. The calculations also show that the system populates consistently a pi-helix state and that the hydrophobic staple motif is present only in the free-energy minimum associated with the helices, and contributes to their stabilization. The use of metadynamics coupled with solute tempering results then particularly suitable to provide the thermodynamics of a short peptide, and its computational efficiency is promising to deal with larger proteins.
△ Less
Submitted 9 July, 2007;
originally announced July 2007.