BoostMD: Accelerating molecular sampling by leveraging ML force field features from previous time-steps
Authors:
Lars L. Schaaf,
Ilyes Batatia,
Christoph Brunken,
Thomas D. Barrett,
Jules Tilly
Abstract:
Simulating atomic-scale processes, such as protein dynamics and catalytic reactions, is crucial for advancements in biology, chemistry, and materials science. Machine learning force fields (MLFFs) have emerged as powerful tools that achieve near quantum mechanical accuracy, with promising generalization capabilities. However, their practical use is often limited by long inference times compared to classical force fields, especially when running extensive molecular dynamics (MD) simulations required for many biological applications. In this study, we introduce BoostMD, a surrogate model architecture designed to accelerate MD simulations. BoostMD leverages node features computed at previous time steps to predict energies and forces based on positional changes. This approach reduces the complexity of the learning task, allowing BoostMD to be both smaller and significantly faster than conventional MLFFs. During simulations, the computationally intensive reference MLFF is evaluated only every $N$ steps, while the lightweight BoostMD model handles the intermediate steps at a fraction of the computational cost. Our experiments demonstrate that BoostMD achieves an eight-fold speedup compared to the reference model and generalizes to unseen dipeptides. Furthermore, we find that BoostMD accurately samples the ground-truth Boltzmann distribution when running molecular dynamics. By combining efficient feature reuse with a streamlined architecture, BoostMD offers a robust solution for conducting large-scale, long-timescale molecular simulations, making high-accuracy ML-driven modeling more accessible and practical.
Submitted 21 December, 2024;
originally announced December 2024.
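The interleaving scheme described in the BoostMD abstract (an expensive reference model every $N$ steps, a cheap model in between that reuses cached information) can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the 1D anharmonic potential, the function names `reference_model`, `surrogate_model`, and `run_md`, and above all the surrogate itself, which is a first-order Taylor expansion around the last reference point rather than the learned network the paper describes.

```python
def reference_model(x):
    """Stand-in for the expensive reference MLFF (hypothetical).

    Toy potential V(x) = 0.5*x**2 + 0.1*x**4. Returns the force and a
    tuple of cached "features" for later reuse by the surrogate.
    """
    force = -x - 0.4 * x**3
    curvature = 1.0 + 1.2 * x**2  # second derivative of V
    return force, (x, force, curvature)


def surrogate_model(x, features):
    """Cheap stand-in for the lightweight model (hypothetical).

    Predicts the force from the positional change relative to the last
    reference evaluation, using a first-order Taylor expansion.
    """
    x_ref, force_ref, curvature_ref = features
    return force_ref - curvature_ref * (x - x_ref)


def run_md(x, v, n_steps, refresh_every, dt=0.01, mass=1.0):
    """Velocity Verlet where the reference model runs every `refresh_every` steps."""
    force, features = reference_model(x)
    ref_calls = 1
    for step in range(1, n_steps + 1):
        v += 0.5 * dt * force / mass   # first half kick
        x += dt * v                    # drift
        if step % refresh_every == 0:
            # Expensive call: refresh the force and the cached features.
            force, features = reference_model(x)
            ref_calls += 1
        else:
            # Cheap call: reuse the features from the last reference step.
            force = surrogate_model(x, features)
        v += 0.5 * dt * force / mass   # second half kick
    return x, v, ref_calls


if __name__ == "__main__":
    x_ref, _, calls_ref = run_md(1.0, 0.0, n_steps=1000, refresh_every=1)
    x_fast, _, calls_fast = run_md(1.0, 0.0, n_steps=1000, refresh_every=8)
    print(f"reference calls: {calls_ref} vs {calls_fast}")
    print(f"final position difference: {abs(x_ref - x_fast):.2e}")
```

In this sketch the number of expensive calls drops roughly by the factor `refresh_every`; whether the resulting trajectory remains accurate depends entirely on how good the cheap model is between refreshes, which is the question the paper addresses for its learned surrogate.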
Self-Parametrizing System-Focused Atomistic Models
Authors:
Christoph Brunken,
Markus Reiher
Abstract:
Computational studies of chemical reactions in complex environments such as proteins, nanostructures, or on surfaces require accurate and efficient atomistic models applicable to the nanometer scale. In general, an accurate parametrization of the atomistic entities will not be available for arbitrary system classes, but demands a fast automated system-focused parametrization procedure to be quickly applicable, reliable, flexible, and reproducible. Here, we develop and combine an automatically parametrizable quantum chemically derived molecular mechanics model with machine-learned corrections under autonomous uncertainty quantification and refinement. Our approach first generates an accurate, physically motivated model from a minimum energy structure and its corresponding Hessian matrix by a partial Hessian fitting procedure of the force constants. This model is then the starting point to generate a large number of configurations for which additional off-minimum reference data can be evaluated on the fly. A $Δ$-machine learning model is trained on these data to provide a correction to energies and forces including uncertainty estimates. During the procedure, the flexibility of the machine learning model is tailored to the amount of available training data. The parametrization of large systems is enabled by a fragmentation approach. Due to their modular nature, all model construction steps allow for model improvement in a rolling fashion. Our approach may also be employed for the generation of system-focused electrostatic molecular mechanics embedding environments in a quantum-mechanical/molecular-mechanical hybrid model for arbitrary atomistic structures at the nanoscale.
Submitted 15 February, 2020; v1 submitted 27 August, 2019;
originally announced August 2019.
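The combination of a Hessian-derived baseline model with a $Δ$-machine learning correction, as outlined in the abstract above, can be sketched on a one-dimensional toy problem. This is only an illustration under stated assumptions: the Morse potential stands in for the quantum chemical reference, the baseline is a single harmonic term whose force constant is taken from the reference curvature at the minimum, and the correction is a simple Gaussian-kernel smoother instead of the model used in the paper. The names `reference_energy`, `baseline_energy`, and `fit_delta_model` are hypothetical, and uncertainty estimation and fragmentation are omitted.

```python
import math

# Hypothetical Morse parameters for the toy reference potential.
D_E, A, R0 = 1.0, 1.5, 1.0


def reference_energy(r):
    """Stand-in for the quantum chemical reference: a Morse potential."""
    return D_E * (1.0 - math.exp(-A * (r - R0))) ** 2


def baseline_energy(r):
    """Harmonic model built from the reference curvature at the minimum."""
    k = 2.0 * D_E * A**2  # second derivative of the Morse potential at r = R0
    return 0.5 * k * (r - R0) ** 2


def fit_delta_model(train_r, width=0.03):
    """Learn the residual (reference minus baseline) with a kernel smoother."""
    residuals = [reference_energy(r) - baseline_energy(r) for r in train_r]

    def delta(r):
        # Gaussian-weighted average of the training residuals.
        weights = [math.exp(-0.5 * ((r - ri) / width) ** 2) for ri in train_r]
        total = sum(weights)
        return sum(w * d for w, d in zip(weights, residuals)) / total

    return delta


def corrected_energy(r, delta):
    """Baseline energy plus the learned correction."""
    return baseline_energy(r) + delta(r)


if __name__ == "__main__":
    # Off-minimum configurations on a grid from 0.7 to 1.5.
    train_r = [0.7 + 0.02 * i for i in range(41)]
    delta = fit_delta_model(train_r)
    r_test = 1.23  # not part of the training grid
    err_base = abs(baseline_energy(r_test) - reference_energy(r_test))
    err_corr = abs(corrected_energy(r_test, delta) - reference_energy(r_test))
    print(f"baseline error:  {err_base:.4f}")
    print(f"corrected error: {err_corr:.4f}")
```

The design point this is meant to show is that the correction only has to learn the difference between the physically motivated baseline and the reference, which is smaller and smoother than the full energy surface.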