-
Generating stable molecules using imitation and reinforcement learning
Authors:
Søren Ager Meldgaard,
Jonas Köhler,
Henrik Lund Mortensen,
Mads-Peter V. Christiansen,
Frank Noé,
Bjørk Hammer
Abstract:
Chemical space is routinely explored by machine learning methods to discover interesting molecules before time-consuming experimental synthesis is attempted. However, these methods often rely on a graph representation, ignoring the 3D information necessary for determining the stability of the molecules. We propose a reinforcement learning approach for generating molecules in Cartesian coordinates, allowing for quantum chemical prediction of the stability. To improve sample efficiency, we learn basic chemical rules from imitation learning on the GDB-11 database to create an initial model applicable to all stoichiometries. We then deploy multiple copies of the model, each conditioned on a specific stoichiometry, in a reinforcement learning setting. The models correctly identify low-energy molecules in the database and produce novel isomers not found in the training set. Finally, we apply the model to larger molecules to show how reinforcement learning further refines the imitation learning model in domains far from the training data.
Submitted 11 July, 2021;
originally announced July 2021.
-
Machine Learning Implicit Solvation for Molecular Dynamics
Authors:
Yaoyi Chen,
Andreas Krämer,
Nicholas E. Charron,
Brooke E. Husic,
Cecilia Clementi,
Frank Noé
Abstract:
Accurate modeling of the solvent environment for biological molecules is crucial for computational biology and drug design. A popular approach to achieve long simulation time scales for large system sizes is to incorporate the effect of the solvent in a mean-field fashion with implicit solvent models. However, a challenge with existing implicit solvent models is that they often lack accuracy or certain physical properties compared to explicit solvent models, as the many-body effects of the neglected solvent molecules are difficult to model as a mean field. Here, we leverage machine learning (ML) and multi-scale coarse graining (CG) in order to learn implicit solvent models that can approximate the energetic and thermodynamic properties of a given explicit solvent model with arbitrary accuracy, given enough training data. Following the previous ML-CG models CGnet and CGSchNet, we introduce ISSNet, a graph neural network, to model the implicit solvent potential of mean force. ISSNet can learn from explicit solvent simulation data and be readily applied to MD simulations. We compare the solute conformational distributions under different solvation treatments for two peptide systems. The results indicate that ISSNet models can outperform widely used generalized Born and surface area models in reproducing the thermodynamics of small protein systems with respect to explicit solvent. The success of this novel method demonstrates the potential benefit of applying machine learning methods in accurate modeling of solvent effects for in silico research and biomedical applications.
Submitted 26 August, 2021; v1 submitted 14 June, 2021;
originally announced June 2021.
-
Permutation-Invariant Variational Autoencoder for Graph-Level Representation Learning
Authors:
Robin Winter,
Frank Noé,
Djork-Arné Clevert
Abstract:
Recently, there has been great success in applying deep neural networks to graph-structured data. Most work, however, focuses either on node- or graph-level supervised learning, such as node, link or graph classification, or on node-level unsupervised learning (e.g. node clustering). Despite its wide range of possible applications, graph-level unsupervised learning has not received much attention yet. This might be mainly attributed to the high representation complexity of graphs, which can be represented by n! equivalent adjacency matrices, where n is the number of nodes. In this work we address this issue by proposing a permutation-invariant variational autoencoder for graph-structured data. Our proposed model indirectly learns to match the node ordering of input and output graph, without imposing a particular node ordering or performing expensive graph matching. We demonstrate the effectiveness of our proposed model on various graph reconstruction and generation tasks and evaluate the expressive power of extracted representations for downstream graph-level classification and regression.
Submitted 14 December, 2021; v1 submitted 20 April, 2021;
originally announced April 2021.
-
Symmetric and antisymmetric kernels for machine learning problems in quantum physics and chemistry
Authors:
Stefan Klus,
Patrick Gelß,
Feliks Nüske,
Frank Noé
Abstract:
We derive symmetric and antisymmetric kernels by symmetrizing and antisymmetrizing conventional kernels and analyze their properties. In particular, we compute the feature space dimensions of the resulting polynomial kernels, prove that the reproducing kernel Hilbert spaces induced by symmetric and antisymmetric Gaussian kernels are dense in the space of symmetric and antisymmetric functions, and propose a Slater determinant representation of the antisymmetric Gaussian kernel, which allows for an efficient evaluation even if the state space is high-dimensional. Furthermore, we show that by exploiting symmetries or antisymmetries the size of the training data set can be significantly reduced. The results are illustrated with guiding examples and simple quantum physics and chemistry applications.
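As a rough illustration of the antisymmetrization idea and of why a determinant form can be evaluated efficiently, the following sketch antisymmetrizes a Gaussian kernel over permutations of scalar particle coordinates and compares a brute-force sum over all permutations with a single determinant of pairwise Gaussian factors. The 1/d! normalization, the restriction to one scalar coordinate per particle, and all function names are assumptions made for this example; they are not taken from the paper.

```python
import itertools
import math

import numpy as np


def perm_sign(perm):
    """Sign of a permutation (tuple of indices), computed from its inversion count."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1.0 if inversions % 2 else 1.0


def gaussian_kernel(x, y, sigma=1.0):
    """Conventional Gaussian kernel on vectors x and y."""
    return math.exp(-np.sum((x - y) ** 2) / (2.0 * sigma**2))


def antisym_kernel_bruteforce(x, y, sigma=1.0):
    """Signed sum over all d! permutations of x (assumed 1/d! normalization)."""
    d = len(x)
    total = 0.0
    for perm in itertools.permutations(range(d)):
        total += perm_sign(perm) * gaussian_kernel(x[list(perm)], y, sigma)
    return total / math.factorial(d)


def antisym_kernel_det(x, y, sigma=1.0):
    """Same quantity via one d x d determinant of pairwise Gaussian factors.

    The Gaussian kernel factorizes over coordinates, so the signed sum over
    permutations is exactly the Leibniz expansion of this determinant.
    """
    d = len(x)
    pairwise = np.exp(-((x[:, None] - y[None, :]) ** 2) / (2.0 * sigma**2))
    return np.linalg.det(pairwise) / math.factorial(d)
```

The brute-force version costs on the order of d! kernel evaluations, whereas the determinant costs on the order of d^3 operations, which is the kind of saving a determinant representation makes possible in higher dimensions.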
Submitted 26 June, 2021; v1 submitted 31 March, 2021;
originally announced March 2021.
-
Parameterized Hypercomplex Graph Neural Networks for Graph Classification
Authors:
Tuan Le,
Marco Bertolini,
Frank Noé,
Djork-Arné Clevert
Abstract:
Despite recent advances in representation learning in hypercomplex (HC) space, this subject is still vastly unexplored in the context of graphs. Motivated by the complex and quaternion algebras, which have been found in several contexts to enable effective representation learning that inherently incorporates a weight-sharing mechanism, we develop graph neural networks that leverage the properties of hypercomplex feature transformation. In particular, in our proposed class of models, the multiplication rule specifying the algebra itself is inferred from the data during training. Given a fixed model architecture, we present empirical evidence that our proposed model incorporates a regularization effect, alleviating the risk of overfitting. We also show that for fixed model capacity, our proposed method outperforms its corresponding real-formulated GNN, providing additional confirmation for the enhanced expressivity of HC embeddings. Finally, we test our proposed hypercomplex GNN on several open graph benchmark datasets and show that our models reach state-of-the-art performance with a much lower memory footprint and 70% fewer parameters. Our implementations are available at https://github.com/bayer-science-for-a-better-life/phc-gnn.
Submitted 30 March, 2021;
originally announced March 2021.
-
Multiscale molecular kinetics by coupling Markov state models and reaction-diffusion dynamics
Authors:
Mauricio J. del Razo,
Manuel Dibak,
Christof Schütte,
Frank Noé
Abstract:
A novel approach to simulate simple protein-ligand systems at large time- and length-scales is to couple Markov state models (MSMs) of molecular kinetics with particle-based reaction-diffusion (RD) simulations, MSM/RD. Currently, MSM/RD lacks a mathematical framework to derive coupling schemes, is limited to isotropic ligands in a single conformational state, and lacks a multi-particle extension. In this work, we address these needs by developing a general MSM/RD framework by coarse-graining molecular dynamics into hybrid switching diffusion processes. Given enough data to parametrize the model, it is capable of modeling protein-protein interactions over large time- and length-scales, and it can be extended to handle multiple molecules. We derive the MSM/RD framework, and we implement and verify it for two protein-protein benchmark systems and one multiparticle implementation to model the formation of pentameric ring molecules. To enable reproducibility, we have published our code in the MSM/RD software package.
Submitted 9 December, 2021; v1 submitted 11 March, 2021;
originally announced March 2021.
-
Auto-Encoding Molecular Conformations
Authors:
Robin Winter,
Frank Noé,
Djork-Arné Clevert
Abstract:
In this work we introduce an autoencoder for molecular conformations. Our proposed model converts the discrete spatial arrangements of atoms in a given molecular graph (conformation) into and from a continuous fixed-sized latent representation. We demonstrate that in this latent representation, similar conformations cluster together while distinct conformations split apart. Moreover, by training a probabilistic model on a large dataset of molecular conformations, we demonstrate how our model can be used to generate diverse sets of energetically favorable conformations for a given molecule. Finally, we show that the continuous representation allows us to utilize optimization methods to find molecules that have conformations with favorable spatial properties.
Submitted 5 January, 2021;
originally announced January 2021.
-
TorchMD: A deep learning framework for molecular simulations
Authors:
Stefan Doerr,
Maciej Majewski,
Adrià Pérez,
Andreas Krämer,
Cecilia Clementi,
Frank Noé,
Toni Giorgino,
Gianni De Fabritiis
Abstract:
Molecular dynamics simulations provide a mechanistic description of molecules by relying on empirical potentials. The quality and transferability of such potentials can be improved by leveraging data-driven models derived with machine learning approaches. Here, we present TorchMD, a framework for molecular simulations with mixed classical and machine learning potentials. All force computations, including bond, angle, dihedral, Lennard-Jones and Coulomb interactions, are expressed as PyTorch arrays and operations. Moreover, TorchMD enables learning and simulating neural network potentials. We validate it using standard Amber all-atom simulations, learning an ab-initio potential, performing an end-to-end training and finally learning and simulating a coarse-grained model for protein folding. We believe that TorchMD provides a useful tool-set to support molecular simulations of machine learning potentials. Code and data are freely available at github.com/torchmd.
Submitted 22 December, 2020;
originally announced December 2020.
-
Temperature-steerable flows
Authors:
Manuel Dibak,
Leon Klein,
Frank Noé
Abstract:
Boltzmann generators approach the sampling problem in many-body physics by combining a normalizing flow and a statistical reweighting method to generate samples of a physical system's equilibrium density. The equilibrium distribution is usually defined by an energy function and a thermodynamic state, such as a given temperature. Here we propose temperature-steerable flows (TSF) which are able to generate a family of probability densities parametrized by a choosable temperature parameter. TSFs can be embedded in a generalized ensemble sampling framework such as parallel tempering in order to sample a physical system across thermodynamic states, such as multiple temperatures.
Submitted 1 December, 2020;
originally announced December 2020.
-
Training Invertible Linear Layers through Rank-One Perturbations
Authors:
Andreas Krämer,
Jonas Köhler,
Frank Noé
Abstract:
Many types of neural network layers rely on matrix properties such as invertibility or orthogonality. Retaining such properties during optimization with gradient-based stochastic optimizers is a challenging task, which is usually addressed by either reparameterization of the affected parameters or by directly optimizing on the manifold. This work presents a novel approach for training invertible linear layers. In lieu of directly optimizing the network parameters, we train rank-one perturbations and add them to the actual weight matrices infrequently. This P$^{4}$Inv update allows keeping track of inverses and determinants without ever explicitly computing them. We show how such invertible blocks improve the mixing and thus the mode separation of the resulting normalizing flows. Furthermore, we outline how the P$^4$ concept can be utilized to retain properties other than invertibility.
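The claim that inverses and determinants can be tracked without explicitly computing them rests on two standard identities for rank-one updates, the Sherman-Morrison formula and the matrix determinant lemma. The sketch below shows only those identities in NumPy. It is not the authors' P4Inv algorithm, and the function name and the singularity threshold are assumptions chosen for this example.

```python
import numpy as np


def rank_one_update(W, W_inv, det_W, u, v, tol=1e-8):
    """Return W + u v^T with its inverse and determinant, updated in O(n^2).

    Uses the Sherman-Morrison formula for the inverse and the matrix
    determinant lemma for the determinant, so neither np.linalg.inv nor
    np.linalg.det is called. `tol` is an illustrative threshold for
    rejecting updates that would make the matrix nearly singular.
    """
    W_inv_u = W_inv @ u          # W^{-1} u
    v_W_inv = v @ W_inv          # v^T W^{-1}
    scale = 1.0 + v @ W_inv_u    # 1 + v^T W^{-1} u
    if abs(scale) < tol:
        raise ValueError("update would make the matrix (nearly) singular")
    new_W = W + np.outer(u, v)
    new_inv = W_inv - np.outer(W_inv_u, v_W_inv) / scale
    new_det = det_W * scale
    return new_W, new_inv, new_det
```

Starting from a matrix whose inverse and determinant are known (for instance the identity), repeated calls keep all three quantities consistent, which is what allows a log-determinant term in a normalizing-flow likelihood to be maintained cheaply.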
Submitted 30 November, 2020; v1 submitted 14 October, 2020;
originally announced October 2020.
-
Convergence to the fixed-node limit in deep variational Monte Carlo
Authors:
Zeno Schätzle,
Jan Hermann,
Frank Noé
Abstract:
Variational quantum Monte Carlo (QMC) is an ab-initio method for solving the electronic Schrödinger equation that is exact in principle, but limited by the flexibility of the available ansatzes in practice. The recently introduced deep QMC approach, specifically two deep-neural-network ansatzes PauliNet and FermiNet, allows variational QMC to reach the accuracy of diffusion QMC, but little is understood about the convergence behavior of such ansatzes. Here, we analyze how deep variational QMC approaches the fixed-node limit with increasing network size. First, we demonstrate that a deep neural network can overcome the limitations of a small basis set and reach the mean-field complete-basis-set limit. Moving to electron correlation, we then perform an extensive hyperparameter scan of a deep Jastrow factor for LiH and H$_4$ and find that variational energies at the fixed-node limit can be obtained with a sufficiently large network. Finally, we benchmark mean-field and many-body ansatzes on H$_2$O, increasing the fraction of recovered fixed-node correlation energy of single-determinant Slater--Jastrow-type ansatzes by half an order of magnitude compared to previous variational QMC results, and demonstrate that a single-determinant Slater--Jastrow--backflow version of the ansatz overcomes the fixed-node limitations. This analysis helps in understanding the superb accuracy of deep variational ansatzes in comparison to the traditional trial wavefunctions at the respective level of theory, and will guide future improvements of the neural network architectures in deep QMC.
Submitted 25 March, 2021; v1 submitted 11 October, 2020;
originally announced October 2020.
-
Relevance of Rotationally Equivariant Convolutions for Predicting Molecular Properties
Authors:
Benjamin Kurt Miller,
Mario Geiger,
Tess E. Smidt,
Frank Noé
Abstract:
Equivariant neural networks (ENNs) are graph neural networks embedded in $\mathbb{R}^3$ and are well suited for predicting molecular properties. The ENN library e3nn has customizable convolutions, which can be designed to depend only on distances between points, or also on angular features, making them rotationally invariant, or equivariant, respectively. This paper studies the practical value of including angular dependencies for molecular property prediction directly via an ablation study with e3nn and the QM9 data set. We find that, for fixed network depth and parameter count, adding angular features decreased test error by an average of 23%. Meanwhile, increasing network depth decreased test error by only 4% on average, implying that rotationally equivariant layers are comparatively parameter efficient. We present an explanation of the accuracy improvement on the dipole moment, the target which benefited most from the introduction of angular features.
Submitted 24 November, 2020; v1 submitted 19 August, 2020;
originally announced August 2020.
-
Coarse Graining Molecular Dynamics with Graph Neural Networks
Authors:
Brooke E. Husic,
Nicholas E. Charron,
Dominik Lemm,
Jiang Wang,
Adrià Pérez,
Maciej Majewski,
Andreas Krämer,
Yaoyi Chen,
Simon Olsson,
Gianni De Fabritiis,
Frank Noé,
Cecilia Clementi
Abstract:
Coarse graining enables the investigation of molecular dynamics for larger systems and at longer timescales than is possible at atomic resolution. However, a coarse graining model must be formulated such that the conclusions we draw from it are consistent with the conclusions we would draw from a model at a finer level of detail. It has been proven that a force matching scheme defines a thermodynamically consistent coarse-grained model for an atomistic system in the variational limit. Wang et al. [ACS Cent. Sci. 5, 755 (2019)] demonstrated that the existence of such a variational limit enables the use of a supervised machine learning framework to generate a coarse-grained force field, which can then be used for simulation in the coarse-grained space. Their framework, however, requires the manual input of molecular features upon which to machine learn the force field. In the present contribution, we build upon the advance of Wang et al. and introduce a hybrid architecture for the machine learning of coarse-grained force fields that learns its own features via a subnetwork that leverages continuous filter convolutions on a graph neural network architecture. We demonstrate that this framework succeeds at reproducing the thermodynamics for small biomolecular systems. Since the learned molecular representations are inherently transferable, the architecture presented here sets the stage for the development of machine-learned, coarse-grained force fields that are transferable across molecular systems.
Submitted 6 November, 2020; v1 submitted 22 July, 2020;
originally announced July 2020.
-
Equivariant Flows: Exact Likelihood Generative Learning for Symmetric Densities
Authors:
Jonas Köhler,
Leon Klein,
Frank Noé
Abstract:
Normalizing flows are exact-likelihood generative neural networks which approximately transform samples from a simple prior distribution to samples of the probability distribution of interest. Recent work showed that such generative models can be utilized in statistical mechanics to sample equilibrium states of many-body systems in physics and chemistry. To scale and generalize these results, it is essential that the natural symmetries in the probability density -- in physics defined by the invariances of the target potential -- are built into the flow. We provide a theoretical sufficient criterion showing that the distribution generated by equivariant normalizing flows is invariant with respect to these symmetries by design. Furthermore, we propose building blocks for flows that preserve the symmetries usually found in physical/chemical many-body particle systems. Using benchmark systems motivated by molecular physics, we demonstrate that these symmetry-preserving flows can provide better generalization capabilities and sampling efficiency.
Submitted 26 October, 2020; v1 submitted 3 June, 2020;
originally announced June 2020.
-
Coupling particle-based reaction-diffusion simulations with reservoirs mediated by reaction-diffusion PDEs
Authors:
Margarita Kostré,
Christof Schütte,
Frank Noé,
Mauricio J. del Razo
Abstract:
Open biochemical systems of interacting molecules are ubiquitous in life-related processes. However, established computational methodologies, like molecular dynamics, are still mostly constrained to closed systems and timescales too small to be relevant for life processes. Alternatively, particle-based reaction-diffusion models are currently the most accurate and computationally feasible approach at these scales. Their efficiency lies in modeling entire molecules as particles that can diffuse and interact with each other. In this work, we develop modeling and numerical schemes for particle-based reaction-diffusion in an open setting, where the reservoirs are mediated by reaction-diffusion PDEs. We derive two important theoretical results. The first one is the mean-field for open systems of diffusing particles; the second one is the mean-field for a particle-based reaction-diffusion system with second-order reactions. We employ these two results to develop a numerical scheme that consistently couples particle-based reaction-diffusion processes with reaction-diffusion PDEs. This allows modeling open biochemical systems in contact with reservoirs that are time-dependent and spatially inhomogeneous, as in many relevant real-world applications.
Submitted 29 May, 2020;
originally announced June 2020.
-
Ensemble Learning of Coarse-Grained Molecular Dynamics Force Fields with a Kernel Approach
Authors:
Jiang Wang,
Stefan Chmiela,
Klaus-Robert Müller,
Frank Noé,
Cecilia Clementi
Abstract:
Gradient-domain machine learning (GDML) is an accurate and efficient approach to learn a molecular potential and associated force field based on the kernel ridge regression algorithm. Here, we demonstrate its application to learn an effective coarse-grained (CG) model from all-atom simulation data in a sample-efficient manner. The coarse-grained force field is learned by following the thermodynamic consistency principle, here by minimizing the error between the predicted coarse-grained force and the all-atom mean force in the coarse-grained coordinates. Solving this problem by GDML directly is impossible because coarse-graining requires averaging over many training data points, resulting in impractical memory requirements for storing the kernel matrices. In this work, we propose a data-efficient and memory-saving alternative. Using ensemble learning and stratified sampling, we propose a 2-layer training scheme that enables GDML to learn an effective coarse-grained model. We illustrate our method on a simple biomolecular system, alanine dipeptide, by reconstructing the free energy landscape of a coarse-grained variant of this molecule. Our novel GDML training scheme yields a smaller free energy error than neural networks when the training set is small, and a comparably high accuracy when the training set is sufficiently large.
Submitted 4 May, 2020;
originally announced May 2020.
-
Stochastic Normalizing Flows
Authors:
Hao Wu,
Jonas Köhler,
Frank Noé
Abstract:
The sampling of probability distributions specified up to a normalization constant is an important problem in both machine learning and statistical mechanics. While classical stochastic sampling methods such as Markov Chain Monte Carlo (MCMC) or Langevin Dynamics (LD) can suffer from slow mixing times, there is a growing interest in using normalizing flows in order to learn the transformation of a simple prior distribution to the given target distribution. Here we propose a generalized and combined approach to sample target densities: Stochastic Normalizing Flows (SNF) -- an arbitrary sequence of deterministic invertible functions and stochastic sampling blocks. We show that stochasticity overcomes expressivity limitations of normalizing flows resulting from the invertibility constraint, whereas trainable transformations between sampling steps improve efficiency of pure MCMC/LD along the flow. By invoking ideas from non-equilibrium statistical mechanics we derive an efficient training procedure by which both the sampler's and the flow's parameters can be optimized end-to-end, and by which we can compute exact importance weights without having to marginalize out the randomness of the stochastic blocks. We illustrate the representational power, sampling efficiency and asymptotic correctness of SNFs on several benchmarks including applications to sampling molecular systems in equilibrium.
Submitted 26 October, 2020; v1 submitted 16 February, 2020;
originally announced February 2020.
-
Deep learning Markov and Koopman models with physical constraints
Authors:
Andreas Mardt,
Luca Pasquali,
Frank Noé,
Hao Wu
Abstract:
The long-timescale behavior of complex dynamical systems can be described by linear Markov or Koopman models in a suitable latent space. Recent variational approaches allow the latent space representation and the linear dynamical model to be optimized via unsupervised machine learning methods. Incorporation of physical constraints such as time-reversibility or stochasticity into the dynamical mode…
▽ More
The long-timescale behavior of complex dynamical systems can be described by linear Markov or Koopman models in a suitable latent space. Recent variational approaches allow the latent space representation and the linear dynamical model to be optimized via unsupervised machine learning methods. Incorporation of physical constraints such as time-reversibility or stochasticity into the dynamical model has been established for a linear, but not for arbitrarily nonlinear (deep learning) representations of the latent space. Here we develop theory and methods for deep learning Markov and Koopman models that can bear such physical constraints. We prove that the model is an universal approximator for reversible Markov processes and that it can be optimized with either maximum likelihood or the variational approach of Markov processes (VAMP). We demonstrate that the model performs equally well for equilibrium and systematically better for biased data compared to existing approaches, thus providing a tool to study the long-timescale processes of dynamical systems.
Submitted 16 December, 2019;
originally announced December 2019.
-
Neural Mode Jump Monte Carlo
Authors:
Luigi Sbailò,
Manuel Dibak,
Frank Noé
Abstract:
Markov chain Monte Carlo methods are a powerful tool for sampling equilibrium configurations in complex systems. One problem these methods often face is slow convergence over large energy barriers. In this work, we propose a novel method which increases convergence in systems composed of many metastable states. This method aims to connect metastable regions directly using generative neural networks in order to propose new configurations in the Markov chain and optimizes the acceptance probability of large jumps between modes in configuration space. We provide a comprehensive theory and demonstrate the method on example systems.
Submitted 11 December, 2019;
originally announced December 2019.
-
Machine learning for protein folding and dynamics
Authors:
Frank Noé,
Gianni De Fabritiis,
Cecilia Clementi
Abstract:
Many aspects of the study of protein folding and dynamics have been affected by the recent advances in machine learning. Methods for the prediction of protein structures from their sequences are now heavily based on machine learning tools. The way simulations are performed to explore the energy landscape of protein systems is also changing as force fields are beginning to be designed by means of machine learning methods. These methods are also used to extract the essential information from large simulation datasets and to enhance the sampling of rare events such as folding/unfolding transitions. While significant challenges still need to be tackled, we expect these methods to play an important role in the study of protein folding and dynamics in the near future. We discuss here the recent advances on all these fronts and the questions that need to be addressed for machine learning approaches to become mainstream in protein simulation.
Submitted 21 November, 2019;
originally announced November 2019.
-
Machine learning for molecular simulation
Authors:
Frank Noé,
Alexandre Tkatchenko,
Klaus-Robert Müller,
Cecilia Clementi
Abstract:
Machine learning (ML) is transforming all areas of science. The complex and time-consuming calculations in molecular simulations are particularly suitable for a machine learning revolution and have already been profoundly impacted by the application of existing ML methods. Here we review recent ML methods for molecular simulation, with particular focus on (deep) neural networks for the prediction of quantum-mechanical energies and forces, coarse-grained molecular dynamics, the extraction of free energy surfaces and kinetics and generative network approaches to sample molecular equilibrium structures and compute thermodynamics. To explain these methods and illustrate open methodological problems, we review some important principles of molecular physics and describe how they can be incorporated into machine learning structures. Finally, we identify and describe a list of open challenges for the interface between ML and molecular simulation.
Submitted 7 November, 2019;
originally announced November 2019.
-
Special Topic: Markov Models of Molecular Kinetics
Authors:
Frank Noé,
Edina Rosta
Abstract:
The Journal of Chemical Physics (JCP) article collection on Markov Models of Molecular Kinetics (MMMK) features recent advances developing and using Markov State Models (MSMs) in atomistic molecular simulations and related applications. This editorial provides a brief overview of the state of the art in the field and relates it to the articles in this JCP collection.
Submitted 2 November, 2019;
originally announced November 2019.
-
Generating valid Euclidean distance matrices
Authors:
Moritz Hoffmann,
Frank Noé
Abstract:
Generating point clouds, e.g., molecular structures, in arbitrary rotations, translations, and enumerations remains a challenging task. Meanwhile, neural networks utilizing symmetry invariant layers have been shown to be able to optimize their training objective in a data-efficient way. In this spirit, we present an architecture that makes it possible to produce valid Euclidean distance matrices, which by construction are already invariant under rotation and translation of the described object. Motivated by the goal to generate molecular structures in Cartesian space, we use this architecture to construct a Wasserstein GAN utilizing a permutation invariant critic network. This makes it possible to generate molecular structures in a one-shot fashion by producing Euclidean distance matrices which have a three-dimensional embedding.
Submitted 14 November, 2019; v1 submitted 7 October, 2019;
originally announced October 2019.
-
Equivariant Flows: sampling configurations for multi-body systems with symmetric energies
Authors:
Jonas Köhler,
Leon Klein,
Frank Noé
Abstract:
Flows are exact-likelihood generative neural networks that transform samples from a simple prior distribution to the samples of the probability distribution of interest. Boltzmann Generators (BG) combine flows and statistical mechanics to sample equilibrium states of strongly interacting many-body systems such as proteins with 1000 atoms. In order to scale and generalize these results, it is essential that the natural symmetries of the probability density - in physics defined by the invariances of the energy function - are built into the flow. Here we develop theoretical tools for constructing such equivariant flows and demonstrate that a BG that is equivariant with respect to rotations and particle permutations can generalize to sampling nontrivially new configurations where a nonequivariant BG cannot.
Submitted 1 October, 2019;
originally announced October 2019.
-
Deep neural network solution of the electronic Schrödinger equation
Authors:
Jan Hermann,
Zeno Schätzle,
Frank Noé
Abstract:
[New and updated results were published in Nature Chemistry, doi:10.1038/s41557-020-0544-y.] The electronic Schrödinger equation describes fundamental properties of molecules and materials, but can only be solved analytically for the hydrogen atom. The numerically exact full configuration-interaction method is exponentially expensive in the number of electrons. Quantum Monte Carlo is a possible way out: it scales well to large molecules, can be parallelized, and its accuracy has, as yet, only been limited by the flexibility of the used wave function ansatz. Here we propose PauliNet, a deep-learning wave function ansatz that achieves nearly exact solutions of the electronic Schrödinger equation. PauliNet has a multireference Hartree-Fock solution built in as a baseline, incorporates the physics of valid wave functions, and is trained using variational quantum Monte Carlo (VMC). PauliNet outperforms comparable state-of-the-art VMC ansatzes for atoms, diatomic molecules and a strongly-correlated hydrogen chain by a margin and is yet computationally efficient. We anticipate that thanks to the favourable scaling with system size, this method may become a new leading method for highly accurate electronic-structure calculations on medium-sized molecular systems.
Submitted 23 September, 2020; v1 submitted 16 September, 2019;
originally announced September 2019.
-
Hydrodynamic coupling for particle-based solvent-free membrane models
Authors:
Mohsen Sadeghi,
Frank Noé
Abstract:
The great challenge with biological membrane systems is the wide range of scales involved, from nanometers and picoseconds for individual lipids to micrometers and beyond milliseconds for cellular signalling processes. While solvent-free coarse-grained membrane models are convenient for large-scale simulations, and promise to provide insight into slow processes involving membranes, these models usually have unrealistic kinetics. One major obstacle is the lack of an equally convenient way of introducing hydrodynamic coupling without significantly increasing the computational cost of the model. To address this, we introduce a framework based on anisotropic Langevin dynamics, for which major in-plane and out-of-plane hydrodynamic effects are modeled via friction and diffusion tensors from analytical or semi-analytical solutions to Stokes hydrodynamic equations. Using this framework, we obtain accurate dispersion relations for planar membrane patches, both free-standing and in the vicinity of a wall. We also briefly discuss how non-equilibrium dynamics is affected by hydrodynamic interactions.
Submitted 17 March, 2021; v1 submitted 6 September, 2019;
originally announced September 2019.
-
Diffusion-influenced reaction rates in the presence of pair interactions
Authors:
Manuel Dibak,
Christoph Fröhner,
Frank Noé,
Felix Höfling
Abstract:
The kinetics of bimolecular reactions in solution depends, among other factors, on intermolecular forces such as steric repulsion or electrostatic interaction. Microscopically, a pair of molecules first has to meet by diffusion before the reaction can take place. In this work, we establish an extension of Doi's volume reaction model to molecules interacting via pair potentials, which is a key ingredient for interacting-particle-based reaction-diffusion (iPRD) simulations. As a central result, we relate model parameters and macroscopic reaction rate constants in this situation. We solve the corresponding reaction-diffusion equation in the steady state and derive semi-analytical expressions for the reaction rate constant and the local concentration profiles. Our results apply to the full spectrum from well-mixed to diffusion-limited kinetics. For limiting cases, we give explicit formulas, and we provide a computationally inexpensive numerical scheme for the general case, including the intermediate, diffusion-influenced regime. The obtained rate constants decompose uniquely into encounter and formation rates, and we discuss the effect of the potential on both subprocesses, exemplified for a soft harmonic repulsion and a Lennard-Jones potential. The analysis is complemented by extensive stochastic iPRD simulations, and we find excellent agreement with the theoretical predictions.
Submitted 21 August, 2019;
originally announced August 2019.
-
Deflation reveals dynamical structure in nondominant reaction coordinates
Authors:
Brooke E. Husic,
Frank Noé
Abstract:
The output of molecular dynamics simulations is high-dimensional, and the degrees of freedom among the atoms are related in intricate ways. Therefore, a variety of analysis frameworks have been introduced in order to distill complex motions into lower-dimensional representations that model the system dynamics. These dynamical models have been developed to optimally approximate the system's global kinetics. However, the separate aims of optimizing global kinetics and modeling a process of interest diverge when the process of interest is not the slowest process in the system. Here, we introduce deflation into state-of-the-art methods in molecular kinetics in order to preserve the use of variational optimization tools when the slowest dynamical mode is not the same as the one we seek to model and understand. First, we showcase deflation for a simple toy system and introduce the deflated variational approach to Markov processes (dVAMP). Using dVAMP, we show that nondominant reaction coordinates produced using deflation are more informative than their counterparts generated without deflation. Then, we examine a protein folding system in which the slowest dynamical mode is not folding. Following a dVAMP analysis, we show that deflation can be used to obscure this undesired slow process from a kinetic model, in this case a VAMPnet. The incorporation of deflation into current methods opens the door for enhanced sampling strategies and more flexible, targeted model building.
Submitted 9 July, 2019;
originally announced July 2019.
-
Kernel methods for detecting coherent structures in dynamical data
Authors:
Stefan Klus,
Brooke E. Husic,
Mattes Mollenhauer,
Frank Noé
Abstract:
We illustrate relationships between classical kernel-based dimensionality reduction techniques and eigendecompositions of empirical estimates of reproducing kernel Hilbert space (RKHS) operators associated with dynamical systems. In particular, we show that kernel canonical correlation analysis (CCA) can be interpreted in terms of kernel transfer operators and that it can be obtained by optimizing the variational approach for Markov processes (VAMP) score. As a result, we show that coherent sets of particle trajectories can be computed by kernel CCA. We demonstrate the efficiency of this approach with several examples, namely the well-known Bickley jet, ocean drifter data, and a molecular dynamics problem with a time-dependent potential. Finally, we propose a straightforward generalization of dynamic mode decomposition (DMD) called coherent mode decomposition (CMD). Our results provide a generic machine learning approach to the computation of coherent sets with an objective score that can be used for cross-validation and the comparison of different methods.
Submitted 7 October, 2019; v1 submitted 16 April, 2019;
originally announced April 2019.
-
Machine Learning for Molecular Dynamics on Long Timescales
Authors:
Frank Noé
Abstract:
Molecular Dynamics (MD) simulation is widely used to analyze the properties of molecules and materials. Most practical applications, such as comparison with experimental measurements, designing drug molecules, or optimizing materials, rely on statistical quantities, which may be prohibitively expensive to compute from direct long-time MD simulations. Classical Machine Learning (ML) techniques have already had a profound impact on the field, especially for learning low-dimensional models of the long-time dynamics and for devising more efficient sampling schemes for computing long-time statistics. Novel ML methods have the potential to revolutionize long-timescale MD and to obtain interpretable models. ML concepts such as statistical estimator theory, end-to-end learning, representation learning and active learning are highly interesting for the MD researcher and will help to develop new solutions to hard MD problems. With the aim of better connecting the MD and ML research areas and spawning new research on this interface, we define the learning problems in long-timescale MD, present successful approaches and outline some of the unsolved ML problems in this application field.
Submitted 18 December, 2018;
originally announced December 2018.
-
Machine Learning of coarse-grained Molecular Dynamics Force Fields
Authors:
Jiang Wang,
Simon Olsson,
Christoph Wehmeyer,
Adria Perez,
Nicholas E. Charron,
Gianni de Fabritiis,
Frank Noe,
Cecilia Clementi
Abstract:
Atomistic or ab-initio molecular dynamics simulations are widely used to predict thermodynamics and kinetics and relate them to molecular structure. A common approach to go beyond the time- and length-scales accessible with such computationally expensive simulations is the definition of coarse-grained molecular models. Existing coarse-graining approaches define an effective interaction potential to match defined properties of high-resolution models or experimental data. In this paper, we reformulate coarse-graining as a supervised machine learning problem. We use statistical learning theory to decompose the coarse-graining error and cross-validation to select and compare the performance of different models. We introduce CGnets, a deep learning approach, that learns coarse-grained free energy functions and can be trained by a force matching scheme. CGnets maintain all physically relevant invariances and allow one to incorporate prior physics knowledge to avoid sampling of unphysical structures. We show that CGnets can capture all-atom explicit-solvent free energy surfaces with models using only a few coarse-grained beads and no solvent, while classical coarse-graining methods fail to capture crucial features of the free energy surface. Thus, CGnets are able to capture multi-body terms that emerge from the dimensionality reduction.
Submitted 3 April, 2019; v1 submitted 4 December, 2018;
originally announced December 2018.
-
Boltzmann Generators -- Sampling Equilibrium States of Many-Body Systems with Deep Learning
Authors:
Frank Noé,
Simon Olsson,
Jonas Köhler,
Hao Wu
Abstract:
Computing equilibrium states in condensed-matter many-body systems, such as solvated proteins, is a long-standing challenge. Lacking methods for generating statistically independent equilibrium samples in "one shot", vast computational effort is invested for simulating these systems in small steps, e.g., using Molecular Dynamics. Combining deep learning and statistical mechanics, we here develop Boltzmann Generators, which are shown to generate unbiased one-shot equilibrium samples of representative condensed matter systems and proteins. Boltzmann Generators use neural networks to learn a coordinate transformation of the complex configurational equilibrium distribution to a distribution that can be easily sampled. Accurate computation of free energy differences and discovery of new configurations are demonstrated, providing a statistical mechanics tool that can avoid rare events during sampling without prior knowledge of reaction coordinates.
Submitted 12 July, 2019; v1 submitted 4 December, 2018;
originally announced December 2018.
-
Identification of kinetic order parameters for non-equilibrium dynamics
Authors:
Fabian Paul,
Hao Wu,
Maximilian Vossel,
Bert L. de Groot,
Frank Noé
Abstract:
A popular approach to analyze the dynamics of high-dimensional many-body systems, such as macromolecules, is to project the trajectories onto a space of slowly-varying collective variables, where subsequent analyses are made, such as clustering or estimation of free energy profiles or Markov state models (MSMs). However, existing "dynamical" dimension reduction methods, such as the time-lagged independent component analysis (TICA), are only valid if the dynamics obeys detailed balance (microscopic reversibility) and typically require long, equilibrated simulation trajectories. Here we develop a dimension reduction method for non-equilibrium dynamics based on the recently developed Variational Approach for Markov Processes (VAMP) by Wu and Noé. VAMP is illustrated by obtaining a low-dimensional description of a single file ion diffusion model and by identifying long-lived states from molecular dynamics simulations of the KcsA channel protein in an external electrochemical potential. This analysis provides detailed insights into the coupling of conformational dynamics, the configuration of the selectivity filter, and the conductance of the channel. We recommend VAMP as a replacement for the less general TICA method.
Submitted 19 March, 2019; v1 submitted 29 November, 2018;
originally announced November 2018.
-
The mechanism of RNA base fraying: molecular dynamics simulations analyzed with core-set Markov state models
Authors:
Giovanni Pinamonti,
Fabian Paul,
Frank Noé,
Alex Rodriguez,
Giovanni Bussi
Abstract:
The process of RNA base fraying (i.e. the transient opening of the termini of a helix) is involved in many aspects of RNA dynamics. We here use molecular dynamics simulations and Markov state models to characterize the kinetics of RNA fraying and its sequence and direction dependence. In particular, we first introduce a method for determining biomolecular dynamics employing core-set Markov state models constructed using an advanced clustering technique. The method is validated on previously reported simulations. We then use the method to analyze extensive trajectories for four different RNA model duplexes. Results obtained using D. E. Shaw Research and AMBER force fields are compared and discussed in detail, and show a non-trivial interplay between the stability of intermediate states and the overall fraying kinetics.
Submitted 7 March, 2019; v1 submitted 29 November, 2018;
originally announced November 2018.
-
Variational Selection of Features for Molecular Kinetics
Authors:
Martin K. Scherer,
Brooke E. Husic,
Moritz Hoffmann,
Fabian Paul,
Hao Wu,
Frank Noé
Abstract:
The modeling of atomistic biomolecular simulations using kinetic models such as Markov state models (MSMs) has had many notable algorithmic advances in recent years. The variational principle has opened the door for a nearly fully automated toolkit for selecting models that predict the long-time kinetics from molecular dynamics simulations. However, one yet-unoptimized step of the pipeline involves choosing the features, or collective variables, from which the model should be constructed. In order to build intuitive models, these collective variables are often sought to be interpretable and familiar features, such as torsional angles or contact distances in a protein structure. However, previous approaches for evaluating the chosen features rely on constructing a full MSM, which in turn requires additional hyperparameters to be chosen, and hence leads to a computationally expensive framework. Here, we present a method to optimize the feature choice directly, without requiring the construction of the final kinetic model. We demonstrate our rigorous preprocessing algorithm on a canonical set of twelve fast-folding protein simulations, and show that our procedure leads to more efficient model selection.
Submitted 25 April, 2019; v1 submitted 28 November, 2018;
originally announced November 2018.
-
Reversible Interacting-Particle Reaction Dynamics
Authors:
Christoph Fröhner,
Frank Noé
Abstract:
Interacting-Particle Reaction Dynamics (iPRD) simulates the spatiotemporal evolution of particles that experience interaction forces and can react with one another. The combination of interaction forces and reactions enables a wide range of complex reactive systems in biology and chemistry, but gives rise to new questions such as how to evolve the dynamical equations in a computationally efficient and statistically correct manner. Here we consider reversible reactions such as A + B <--> C with interacting particles and derive expressions for the microscopic iPRD simulation parameters such that desired values for the equilibrium constant and the dissociation rate are obtained in the dilute limit. We then introduce a Monte-Carlo algorithm that ensures detailed balance in the iPRD time-evolution (iPRD-DB). iPRD-DB guarantees the correct thermodynamics at all concentrations and maintains the desired kinetics in the dilute limit, where chemical rates are well-defined and kinetic measurement experiments usually operate. We show that in dense particle systems, the incorporation of detailed balance is essential to obtain physically realistic solutions. iPRD-DB is implemented in ReaDDy 2 (https://readdy.github.io).
Submitted 19 July, 2018;
originally announced July 2018.
-
Deep Generative Markov State Models
Authors:
Hao Wu,
Andreas Mardt,
Luca Pasquali,
Frank Noe
Abstract:
We propose a deep generative Markov State Model (DeepGenMSM) learning framework for inference of metastable dynamical systems and prediction of trajectories. After unsupervised training on time series data, the model contains (i) a probabilistic encoder that maps from high-dimensional configuration space to a small-sized vector indicating the membership to metastable (long-lived) states, (ii) a Markov chain that governs the transitions between metastable states and facilitates analysis of the long-time dynamics, and (iii) a generative part that samples the conditional distribution of configurations in the next time step. The model can be operated in a recursive fashion to generate trajectories to predict the system evolution from a defined starting state and propose new configurations. The DeepGenMSM is demonstrated to provide accurate estimates of the long-time kinetics and generate valid distributions for molecular dynamics (MD) benchmark systems. Remarkably, we show that DeepGenMSMs are able to make long time-steps in molecular configuration space and generate physically realistic structures in regions that were not seen in training data.
Submitted 11 January, 2019; v1 submitted 19 May, 2018;
originally announced May 2018.
-
Grand canonical diffusion-influenced reactions: a stochastic theory with applications to multiscale reaction-diffusion simulations
Authors:
Mauricio J. del Razo,
Hong Qian,
Frank Noé
Abstract:
Smoluchowski-type models for diffusion-influenced reactions (A+B -> C) can be formulated within two frameworks: the probabilistic-based approach for a pair A, B of reacting particles and the concentration-based approach for systems in contact with a bath that generates a concentration gradient of B particles that interact with A. Although these two approaches are mathematically similar, it is not straightforward to establish a precise mathematical relationship between them. Determining this relationship is essential to derive particle-based numerical methods that are quantitatively consistent with bulk concentration dynamics. In this work, we determine the relationship between the two approaches by introducing the grand canonical Smoluchowski master equation (GC-SME), which consists of a continuous-time Markov chain that models an arbitrary number of B particles, each one of them following Smoluchowski's probabilistic dynamics. We show that the GC-SME recovers the concentration-based approach by taking either the hydrodynamic or the large copy number limit. In addition, we show that the GC-SME provides a clear statistical mechanical interpretation of the concentration-based approach and yields an emergent chemical potential for nonequilibrium spatially inhomogeneous reaction processes. We further exploit the GC-SME robust framework to accurately derive multiscale/hybrid numerical methods that couple particle-based reaction-diffusion simulations with bulk concentration descriptions, as described in detail through two computational implementations.
Submitted 15 January, 2019; v1 submitted 20 April, 2018;
originally announced April 2018.
-
Optimal data-driven estimation of generalized Markov state models for non-equilibrium dynamics
Authors:
Péter Koltai,
Hao Wu,
Frank Noé,
Christof Schütte
Abstract:
There are multiple ways in which a stochastic system can be out of statistical equilibrium. It might be subject to time-varying forcing; or be in a transient phase on its way towards equilibrium; it might even be in equilibrium without us noticing it, due to insufficient observations; and it even might be a system failing to admit an equilibrium distribution at all. We review some of the approaches that model the effective statistical behavior of equilibrium and non-equilibrium dynamical systems, and show that both cases can be considered under the unified framework of optimal low-rank approximation of so-called transfer operators. Particular attention is given to the connection between these methods, Markov state models, and the concept of metastability, further to the estimation of such reduced order models from finite simulation data. We illustrate our considerations by numerical examples.
Submitted 12 January, 2018;
originally announced January 2018.
-
MSM/RD: Coupling Markov state models of molecular kinetics with reaction-diffusion simulations
Authors:
Manuel Dibak,
Mauricio J. del Razo,
David De Sancho,
Christof Schütte,
Frank Noé
Abstract:
Molecular dynamics (MD) simulations can model the interactions between macromolecules with high spatiotemporal resolution but at a high computational cost. By combining high-throughput MD with Markov state models (MSMs), it is now possible to obtain long-timescale behavior of small to intermediate biomolecules and complexes. To model the interactions of many molecules at large lengthscales, particle-based reaction-diffusion (RD) simulations are more suitable but lack molecular detail. Thus, coupling MSMs and RD simulations (MSM/RD) would be highly desirable, as they could efficiently produce simulations at large time- and lengthscales, while still conserving the characteristic features of the interactions observed at atomic detail. While such a coupling seems straightforward, fundamental questions are still open: Which definition of MSM states is suitable? Which protocol to merge and split RD particles in an association/dissociation reaction will conserve the correct bimolecular kinetics and thermodynamics? In this paper, we make the first step towards MSM/RD by laying out a general theory of coupling and proposing a first implementation for association/dissociation of a protein with a small ligand (A + B <--> C). Applications on a toy model and CO diffusion into the heme cavity of myoglobin are reported.
Submitted 12 June, 2018; v1 submitted 21 December, 2017;
originally announced December 2017.
-
Time-lagged autoencoders: Deep learning of slow collective variables for molecular kinetics
Authors:
Christoph Wehmeyer,
Frank Noé
Abstract:
Inspired by the success of deep learning techniques in the physical and chemical sciences, we apply a modification of an autoencoder type deep neural network to the task of dimension reduction of molecular dynamics data. We can show that our time-lagged autoencoder reliably finds low-dimensional embeddings for high-dimensional feature spaces which capture the slow dynamics of the underlying stochastic processes - beyond the capabilities of linear dimension reduction techniques.
Submitted 30 October, 2017;
originally announced October 2017.
-
An efficient multi-scale Green's Functions Reaction Dynamics scheme
Authors:
Luigi Sbailò,
Frank Noé
Abstract:
Molecular Dynamics - Green's Functions Reaction Dynamics (MD-GFRD) is a multiscale simulation method for particle dynamics or particle-based reaction-diffusion dynamics that is suited for systems involving low particle densities. Particles in a low-density region are just diffusing and not interacting. In this case one can avoid the costly integration of microscopic equations of motion, such as molecular dynamics (MD), and instead turn to an event-based scheme in which the times to the next particle interaction and the new particle positions at that time can be sampled. At high (local) concentrations, however, e.g. when particles are interacting in a nontrivial way, particle positions must still be updated with small time steps of the microscopic dynamical equations. The efficiency of a multi-scale simulation that uses these two schemes largely depends on the coupling between them and the decisions when to switch between the two scales. Here we present an efficient scheme for multi-scale MD-GFRD simulations. It has been shown that MD-GFRD schemes are more efficient than brute-force molecular dynamics simulations up to a molar concentration of $10^{2}\,\mu\mathrm{M}$. In this paper, we show that the choice of the propagation domains has a relevant impact on the computational performance. Domains are constructed using a local optimization of their sizes and a minimal domain size is proposed. The algorithm is shown to be more efficient than brute-force Brownian dynamics simulations up to a molar concentration of $10^{3}\,\mu\mathrm{M}$ and is up to an order of magnitude more efficient compared with previous MD-GFRD schemes.
Submitted 21 October, 2017;
originally announced October 2017.
-
Particle-based membrane model for mesoscopic simulation of cellular dynamics
Authors:
Mohsen Sadeghi,
Thomas R. Weikl,
Frank Noé
Abstract:
We present a simple and computationally efficient coarse-grained and solvent-free model for simulating lipid bilayer membranes. In order to be used in concert with particle-based reaction-diffusion simulations, the model is purely based on interacting and reacting particles, each representing a coarse patch of a lipid monolayer. Particle interactions include nearest-neighbor bond-stretching and angle-bending, and are parameterized so as to reproduce the local membrane mechanics given by the Helfrich energy density over a range of relevant curvatures. In-plane fluidity is implemented with Monte Carlo bond-flipping moves. The physical accuracy of the model is verified by five tests: (i) Power spectrum analysis of equilibrium thermal undulations is used to verify that the particle-based representation correctly captures the dynamics predicted by the continuum model of fluid membranes. (ii) It is verified that the input bending stiffness, against which the potential parameters are optimized, is accurately recovered. (iii) Isothermal area compressibility modulus of the membrane is calculated and is shown to be tunable to reproduce available values for different lipid bilayers, independent of the bending rigidity. (iv) Simulation of two-dimensional shear flow under a gravity force is employed to measure the effective in-plane viscosity of the membrane model, and show the possibility of modeling membranes with specified viscosities. (v) Interaction of the bilayer membrane with a spherical nanoparticle is modeled as a test case for large membrane deformations and budding involved in cellular processes such as endocytosis...
Submitted 29 January, 2018; v1 submitted 13 October, 2017;
originally announced October 2017.
-
VAMPnets: Deep learning of molecular kinetics
Authors:
Andreas Mardt,
Luca Pasquali,
Hao Wu,
Frank Noé
Abstract:
There is an increasing demand for computing the relevant structures, equilibria and long-timescale kinetics of biomolecular processes, such as protein-drug binding, from high-throughput molecular dynamics simulations. Current methods employ transformation of simulated coordinates into structural features, dimension reduction, clustering the dimension-reduced data, and estimation of a Markov state model or related model of the interconversion rates between molecular structures. This handcrafted approach demands a substantial amount of modeling expertise, as poor decisions at any step will lead to large modeling errors. Here we employ the variational approach for Markov processes (VAMP) to develop a deep learning framework for molecular kinetics using neural networks, dubbed VAMPnets. A VAMPnet encodes the entire mapping from molecular coordinates to Markov states, thus combining the whole data processing pipeline in a single end-to-end framework. Our method performs as well as or better than state-of-the-art Markov modeling methods and provides easily interpretable few-state kinetic models.
Submitted 20 December, 2017; v1 submitted 16 October, 2017;
originally announced October 2017.
-
Variational approach for learning Markov processes from time series data
Authors:
Hao Wu,
Frank Noé
Abstract:
Inference, prediction and control of complex dynamical systems from time series is important in many areas, including financial markets, power grid management, climate and weather modeling, or molecular dynamics. The analysis of such highly nonlinear dynamical systems is facilitated by the fact that we can often find a (generally nonlinear) transformation of the system coordinates to features in which the dynamics can be excellently approximated by a linear Markovian model. Moreover, the large number of system variables often change collectively on large time- and length-scales, facilitating a low-dimensional analysis in feature space. In this paper, we introduce a variational approach for Markov processes (VAMP) that allows us to find optimal feature mappings and optimal Markovian models of the dynamics from given time series data. The key insight is that the best linear model can be obtained from the top singular components of the Koopman operator. This leads to the definition of a family of score functions called VAMP-r which can be calculated from data, and can be employed to optimize a Markovian model. In addition, based on the relationship between the variational scores and approximation errors of Koopman operators, we propose a new VAMP-E score, which can be applied to cross-validation for hyper-parameter optimization and model selection in VAMP. VAMP is valid for both reversible and nonreversible processes and for stationary and non-stationary processes or realizations.
Submitted 15 August, 2019; v1 submitted 14 July, 2017;
originally announced July 2017.
-
Data-driven model reduction and transfer operator approximation
Authors:
Stefan Klus,
Feliks Nüske,
Péter Koltai,
Hao Wu,
Ioannis Kevrekidis,
Christof Schütte,
Frank Noé
Abstract:
In this review paper, we will present different data-driven dimension reduction techniques for dynamical systems that are based on transfer operator theory as well as methods to approximate transfer operators and their eigenvalues, eigenfunctions, and eigenmodes. The goal is to point out similarities and differences between methods developed independently by the dynamical systems, fluid dynamics, and molecular dynamics communities such as time-lagged independent component analysis (TICA), dynamic mode decomposition (DMD), and their respective generalizations. As a result, extensions and best practices developed for one particular method can be carried over to other related methods.
Submitted 18 September, 2017; v1 submitted 29 March, 2017;
originally announced March 2017.
-
Markov State Models from short non-Equilibrium Simulations - Analysis and Correction of Estimation Bias
Authors:
Feliks Nüske,
Hao Wu,
Jan-Hendrik Prinz,
Christoph Wehmeyer,
Cecilia Clementi,
Frank Noé
Abstract:
Many state-of-the-art methods for the thermodynamic and kinetic characterization of large and complex biomolecular systems by simulation rely on ensemble approaches, where data from large numbers of relatively short trajectories are integrated. In this context, Markov state models (MSMs) are extremely popular because they can be used to compute stationary quantities and long-time kinetics from ensembles of short simulations, provided that these short simulations are in "local equilibrium" within the MSM states. However, in the more than 15 years since the inception of MSMs, it has been debated, and not yet resolved, how deviations from local equilibrium can be detected, whether these deviations induce a practical bias in MSM estimation, and how to correct for them. In this paper, we address these issues: We systematically analyze the estimation of MSMs from short non-equilibrium simulations, and we provide an expression for the error between unbiased transition probabilities and the expected estimate from many short simulations. We show that the unbiased MSM estimate can be obtained even from relatively short non-equilibrium simulations in the limit of long lag times and good discretization. Further, we exploit observable operator model (OOM) theory to derive an unbiased estimator for the MSM transition matrix that corrects for the effect of starting out of equilibrium, even when short lag times are used. Finally, we show how the OOM framework can be used to estimate the exact eigenvalues or relaxation timescales of the system without estimating an MSM transition matrix, which allows us to practically assess the discretization quality of the MSM. Applications to model systems and molecular dynamics simulation data of alanine dipeptide are included for illustration. The improved MSM estimator is implemented in PyEMMA as of version 2.3.
Submitted 6 January, 2017;
originally announced January 2017.
-
Predicting the kinetics of RNA oligonucleotides using Markov state models
Authors:
Giovanni Pinamonti,
Jianbo Zhao,
David E. Condon,
Fabian Paul,
Frank Noé,
Douglas H. Turner,
Giovanni Bussi
Abstract:
Nowadays different experimental techniques, such as single molecule or relaxation experiments, can provide dynamic properties of biomolecular systems, but the amount of detail obtainable with these methods is often limited in terms of time or spatial resolution. Here we use state-of-the-art computational techniques, namely atomistic molecular dynamics and Markov state models, to provide insight into the rapid dynamics of short RNA oligonucleotides, in order to elucidate the kinetics of stacking interactions. Analysis of multiple microsecond-long simulations indicates that the main relaxation modes of such molecules can consist of transitions between alternative folded states, rather than between random coils and native structures. After properly removing structures that are artificially stabilized by known inaccuracies of the current RNA AMBER force field, the kinetic properties predicted are consistent with the timescales of previously reported relaxation experiments.
Submitted 22 December, 2016;
originally announced December 2016.
-
Variational Koopman models: slow collective variables and molecular kinetics from short off-equilibrium simulations
Authors:
Hao Wu,
Feliks Nüske,
Fabian Paul,
Stefan Klus,
Péter Koltai,
Frank Noé
Abstract:
Markov state models (MSMs) and Master equation models are popular approaches to approximate molecular kinetics, equilibria, metastable states, and reaction coordinates in terms of a state space discretization usually obtained by clustering. Recently, a powerful generalization of MSMs has been introduced, the variational approach (VA) of molecular kinetics and its special case the time-lagged independent component analysis (TICA), which allow us to approximate slow collective variables and molecular kinetics by linear combinations of smooth basis functions or order parameters. While it is known how to estimate MSMs from trajectories whose starting points are not sampled from an equilibrium ensemble, this has not yet been the case for TICA and the VA. Previous estimates from short trajectories have been strongly biased and thus not variationally optimal. Here, we employ Koopman operator theory and ideas from dynamic mode decomposition (DMD) to extend the VA and TICA to non-equilibrium data. The main insight is that the VA and TICA provide a coefficient matrix that we call Koopman model, as it approximates the underlying dynamical (Koopman) operator in conjunction with the basis set used. This Koopman model can be used to compute a stationary vector to reweight the data to equilibrium. From such a Koopman-reweighted sample, equilibrium expectation values and variationally optimal reversible Koopman models can be constructed even with short simulations. The Koopman model can be used to propagate densities, and its eigenvalue decomposition provides estimates of relaxation timescales and slow collective variables for dimension reduction. Koopman models are generalizations of Markov state models, TICA and the linear VA and allow molecular kinetics to be described without a cluster discretization.
Submitted 22 January, 2017; v1 submitted 20 October, 2016;
originally announced October 2016.
-
Spectral learning of dynamic systems from nonequilibrium data
Authors:
Hao Wu,
Frank Noé
Abstract:
Observable operator models (OOMs) and related models are among the most important and powerful tools for modeling and analyzing stochastic systems. They exactly describe dynamics of finite-rank systems and can be efficiently and consistently estimated through spectral learning under the assumption of identically distributed data. In this paper, we investigate the properties of spectral learning without this assumption, motivated by the requirements of analyzing systems with large time scales, and show that the equilibrium dynamics of a system can be extracted from nonequilibrium observation data by imposing an equilibrium constraint. In addition, we propose a binless extension of spectral learning for continuous data. In comparison with the other continuous-valued spectral algorithms, the binless algorithm can achieve consistent estimation of equilibrium dynamics with only linear complexity.
Submitted 20 June, 2017; v1 submitted 4 September, 2016;
originally announced September 2016.