-
Extending fragment-based free energy calculations with library Monte Carlo simulation: Annealing in interaction space
Authors:
Steven Lettieri,
Artem B. Mamonov,
Daniel M. Zuckerman
Abstract:
Pre-calculated libraries of molecular fragment configurations have previously been used as a basis for both equilibrium sampling (via "library-based Monte Carlo") and for obtaining absolute free energies using a polymer-growth formalism. Here, we combine the two approaches to extend the size of systems for which free energies can be calculated. We study a series of all-atom poly-alanine systems in…
▽ More
Pre-calculated libraries of molecular fragment configurations have previously been used as a basis for both equilibrium sampling (via "library-based Monte Carlo") and for obtaining absolute free energies using a polymer-growth formalism. Here, we combine the two approaches to extend the size of systems for which free energies can be calculated. We study a series of all-atom poly-alanine systems in a simple dielectric "solvent" and find that precise free energies can be obtained rapidly. For instance, for 12 residues, less than an hour of single-processor is required. The combined approach is formally equivalent to the "annealed importance sampling" algorithm; instead of annealing by decreasing temperature, however, interactions among fragments are gradually added as the molecule is "grown." We discuss implications for future binding affinity calculations in which a ligand is grown into a binding site.
△ Less
Submitted 10 September, 2010; v1 submitted 21 June, 2010;
originally announced June 2010.
-
Thermal Motions of the E. Coli Glucose-Galactose Binding Protein Studied Using Well-Sampled Semi-Atomistic Simulations
Authors:
Derek J. Cashman,
Artem B. Mamonov,
Divesh Bhatt,
Daniel M. Zuckerman
Abstract:
The E. coli glucose-galactose chemosensory receptor is a 309 residue, 32 kDa protein consisting of two distinct structural domains. In this computational study, we studied the protein's thermal fluctuations, including both the large scale interdomain movements that contribute to the receptor's mechanism of action, as well as smaller scale motions, using two different computational methods. We em…
▽ More
The E. coli glucose-galactose chemosensory receptor is a 309 residue, 32 kDa protein consisting of two distinct structural domains. In this computational study, we studied the protein's thermal fluctuations, including both the large scale interdomain movements that contribute to the receptor's mechanism of action, as well as smaller scale motions, using two different computational methods. We employ extremely fast, "semi-atomistic" Library-Based Monte Carlo (LBMC) simulations, which include all backbone atoms but "implicit" side chains. Our results were compared with previous experiments and an all-atom Langevin dynamics simulation. Both LBMC and Langevin dynamics simulations were performed using both the apo and glucose-bound form of the protein, with LBMC exhibiting significantly larger fluctuations. The LBMC simulations are also in general agreement with the disulfide trapping experiments of Careaga & Falke (JMB, 1992; Biophys. J., 1992), which indicate that distant residues in the crystal structure (i.e. beta carbons separated by 10 to 20 angstroms) form spontaneous transient contacts in solution. Our simulations illustrate several possible "mechanisms" (configurational pathways) for these fluctuations. We also observe several discrepancies between our calculations and experiment. Nevertheless, we believe that our semi-atomistic approach could be used to study the fluctuations in other proteins, perhaps for ensemble docking, or other analyses of protein flexibility in virtual screening studies.
△ Less
Submitted 27 October, 2009;
originally announced October 2009.
-
Efficient equilibrium sampling of all-atom peptides using library-based Monte Carlo
Authors:
Ying Ding,
Artem B. Mamonov,
Daniel M. Zuckerman
Abstract:
We applied our previously developed library-based Monte Carlo (LBMC) to equilibrium sampling of several implicitly solvated all-atom peptides. LBMC can perform equilibrium sampling of molecules using the pre-calculated statistical libraries of molecular-fragment configurations and energies. For this study, we employed residue-based fragments distributed according to the Boltzmann factor of the O…
▽ More
We applied our previously developed library-based Monte Carlo (LBMC) to equilibrium sampling of several implicitly solvated all-atom peptides. LBMC can perform equilibrium sampling of molecules using the pre-calculated statistical libraries of molecular-fragment configurations and energies. For this study, we employed residue-based fragments distributed according to the Boltzmann factor of the OPLS-AA forcefield describing the individual fragments. Two solvent models were employed: a simple uniform dielectric and the Generalized Born/Surface Area (GBSA) model. The efficiency of LBMC was compared to standard Langevin dynamics (LD) using three different statistical tools. The statistical analyses indicate that LBMC is more than 100 times faster than LD not only for the simple solvent model but also for GBSA.
△ Less
Submitted 26 January, 2010; v1 submitted 13 October, 2009;
originally announced October 2009.
-
Rapid sampling of all-atom peptides using a library-based polymer-growth approach
Authors:
A. B. Mamonov,
X. Zhang,
D. M. Zuckerman
Abstract:
We adapted existing polymer growth strategies for equilibrium sampling of peptides described by modern atomistic forcefields with implicit solvent. The main novel feature of our approach is the use of pre-calculated statistical libraries of molecular fragments. A molecule is sampled by combining fragment configurations -- of single residues in this study -- which are stored in the libraries. Ens…
▽ More
We adapted existing polymer growth strategies for equilibrium sampling of peptides described by modern atomistic forcefields with implicit solvent. The main novel feature of our approach is the use of pre-calculated statistical libraries of molecular fragments. A molecule is sampled by combining fragment configurations -- of single residues in this study -- which are stored in the libraries. Ensembles generated from the independent libraries are reweighted to conform with the Boltzmann factor distribution of the forcefield describing the full molecule. In this way, high-quality equilibrium sampling of small peptides (4-8 residues) typically requires less than one hour of single-processor wallclock time and can be significantly faster than Langevin simulations. Furthermore, approximate but clash-free ensembles can be generated for larger peptides (e.g., 16 residues) in less than a minute of single-processor computing. We also describe an application to free energy calculation, a "multi-resolution" implementation of the growth procedure and application to fragment assembly protein-structure prediction protocols.
△ Less
Submitted 4 March, 2010; v1 submitted 13 October, 2009;
originally announced October 2009.
-
Absolute free energies estimated by combining pre-calculated molecular fragment libraries
Authors:
Xin Zhang,
Artem B. Mamonov,
Daniel M. Zuckerman
Abstract:
The absolute free energy -- or partition function, equivalently -- of a molecule can be estimated computationally using a suitable reference system. Here, we demonstrate a practical method for staging such calculations by growing a molecule based on a series of fragments. Significant computer time is saved by pre-calculating fragment configurations and interactions for re-use in a variety of mol…
▽ More
The absolute free energy -- or partition function, equivalently -- of a molecule can be estimated computationally using a suitable reference system. Here, we demonstrate a practical method for staging such calculations by growing a molecule based on a series of fragments. Significant computer time is saved by pre-calculating fragment configurations and interactions for re-use in a variety of molecules. We employ such fragment libraries and interaction tables for amino acids and capping groups to estimate free energies for small peptides. Equilibrium ensembles for the molecules are generated at no additional computational cost, and are used to check our results by comparison to standard dynamics simulation.
△ Less
Submitted 30 January, 2009; v1 submitted 25 January, 2009;
originally announced January 2009.
-
A library-based Monte Carlo technique enables rapid equilibrium sampling of a protein model with atomistic components
Authors:
Artem B. Mamonov,
Divesh Bhatt,
Derek J. Cashman,
Daniel M. Zuckerman
Abstract:
There is significant interest in rapid protein simulations because of the time-scale limitations of all-atom methods. Exploiting the low cost and great availability of computer memory, we report a Monte Carlo technique for incorporating fully flexible atomistic protein components (e.g., peptide planes) into protein models without compromising sampling speed or statistical rigor. Building on exis…
▽ More
There is significant interest in rapid protein simulations because of the time-scale limitations of all-atom methods. Exploiting the low cost and great availability of computer memory, we report a Monte Carlo technique for incorporating fully flexible atomistic protein components (e.g., peptide planes) into protein models without compromising sampling speed or statistical rigor. Building on existing approximate methods (e.g., Rosetta), the technique uses pre-generated statistical libraries of all-atom components which are swapped with the corresponding protein components during a simulation. The simple model we study consists of the three all-atom backbone residues -- Ala, Gly, and Pro -- with structure-based (Go-like) interactions. For the five different proteins considered in this study, LBMC can generate at least 30 statistically independent configurations in about a month of single CPU time. Minimal additional cost is required to add residue-specific interactions.
△ Less
Submitted 4 December, 2008; v1 submitted 22 September, 2008;
originally announced September 2008.