-
Graph Convolutional Neural Networks for (QM)ML/MM Molecular Dynamics Simulations
Authors:
Albert Hofstetter,
Lennard Böselt,
Sereina Riniker
Abstract:
To accurately study chemical reactions in the condensed phase or within enzymes, both a quantum-mechanical description and sufficient configurational sampling is required to reach converged estimates. Here, quantum mechanics/molecular mechanics (QM/MM) molecular dynamics (MD) simulations play an important role, providing QM accuracy for the region of interest at a decreased computational cost. How…
▽ More
To accurately study chemical reactions in the condensed phase or within enzymes, both a quantum-mechanical description and sufficient configurational sampling is required to reach converged estimates. Here, quantum mechanics/molecular mechanics (QM/MM) molecular dynamics (MD) simulations play an important role, providing QM accuracy for the region of interest at a decreased computational cost. However, QM/MM simulations are still too expensive to study large systems on longer time scales. Recently, machine learning (ML) models have been proposed to replace the QM description. The main limitation of these models lies in the accurate description of long-range interactions present in condensed-phase systems. To overcome this issue, a recent workflow has been introduced combining a semi-empirical method (i.e. density functional tight binding (DFTB)) and a high-dimensional neural network potential (HDNNP) in a $Δ$-learning scheme. This approach has been shown to be capable of correctly incorporating long-range interactions within a cutoff of 1.4 nm. One of the promising alternative approaches to efficiently take long-range effects into account is the development of graph convolutional neural networks (GCNN) for the prediction of the potential-energy surface. In this work, we investigate the use of GCNN models -- with and without a $Δ$-learning scheme -- for (QM)ML/MM MD simulations. We show that the $Δ$-learning approach using a GCNN and DFTB and as baseline achieves competitive performance on our benchmarking set of solutes and chemical reactions in water. The method is additionally validated by performing prospective (QM)ML/MM MD simulations of retinoic acid in water and S-adenoslymethioniat interacting with cytosine in water. The results indicate that the $Δ$-learning GCNN model is a valuable alternative for (QM)ML/MM MD simulations of condensed-phase systems.
△ Less
Submitted 4 October, 2022; v1 submitted 29 June, 2022;
originally announced June 2022.
-
A Bayesian approach to NMR crystal structure determination
Authors:
Edgar A. Engel,
Andrea Anelli,
Albert Hofstetter,
Federico Paruzzo,
Lyndon Emsley,
Michele Ceriotti
Abstract:
Nuclear Magnetic Resonance (NMR) spectroscopy is particularly well-suited to determine the structure of molecules and materials in powdered form. Structure determination usually proceeds by finding the best match between experimentally observed NMR chemical shifts and those of candidate structures. Chemical shifts for the candidate configurations have traditionally been computed by electronic-stru…
▽ More
Nuclear Magnetic Resonance (NMR) spectroscopy is particularly well-suited to determine the structure of molecules and materials in powdered form. Structure determination usually proceeds by finding the best match between experimentally observed NMR chemical shifts and those of candidate structures. Chemical shifts for the candidate configurations have traditionally been computed by electronic-structure methods, and more recently predicted by machine learning. However, the reliability of the determination depends on the errors in the predicted shifts. Here we propose a Bayesian framework for determining the confidence in the identification of the experimental crystal structure, based on knowledge of the typical error in the electronic structure methods. We also extend the recently-developed ShiftML machine-learning model, including the evaluation of the uncertainty of its predictions. We demonstrate the approach on the determination of the structures of six organic molecular crystals. We critically assess the reliability of the structure determinations, facilitated by the introduction of a visualization of the of similarity between candidate configurations in terms of their chemical shifts and their structures. We also show that the commonly used values for the errors in calculated $^{13}$C shifts are underestimated, and that more accurate, self-consistently determined uncertainties make it possible to use $^{13}$C shifts to improve the accuracy of structure determinations.
△ Less
Submitted 12 November, 2019; v1 submitted 2 September, 2019;
originally announced September 2019.
-
Chemical Shifts in Molecular Solids by Machine Learning
Authors:
Federico M. Paruzzo,
Albert Hofstetter,
Félix Musil,
Sandip De,
Michele Ceriotti,
Lyndon Emsley
Abstract:
The calculation of chemical shifts in solids has enabled methods to determine crystal structures in powders. The dependence of chemical shifts on local atomic environments sets them among the most powerful tools for structure elucidation of powdered solids or amorphous materials. Unfortunately, this dependency comes with the cost of high accuracy first-principle calculations to qualitatively predi…
▽ More
The calculation of chemical shifts in solids has enabled methods to determine crystal structures in powders. The dependence of chemical shifts on local atomic environments sets them among the most powerful tools for structure elucidation of powdered solids or amorphous materials. Unfortunately, this dependency comes with the cost of high accuracy first-principle calculations to qualitatively predict chemical shifts in solids. Machine learning methods have recently emerged as a way to overcome the need for explicit high accuracy first-principle calculations. However, the vast chemical and combinatorial space spanned by molecular solids, together with the strong dependency of chemical shifts of atoms on their environment, poses a huge challenge for any machine learning method. Here we propose a machine learning method based on local environments to accurately predict chemical shifts of different molecular solids and of different polymorphs within DFT accuracy (RMSE of 0.49 ppm ( 1 H), 4.3ppm ( 13 C), 13.3 ppm ( 15 N), and 17.7 ppm ( 17 O) with $R^2$ of 0.97 for 1 H, 0.99 for 13 C, 0.99 for 15 N, and 0.99 for 17 O). We also demonstrate that the trained model is able to correctly determine, based on the match between experimentally-measured and ML-predicted shifts, structures of cocaine and the drug 4-[4-(2-adamantylcarbamoyl)-5-tert-butylpyrazol-1-yl]benzoic acid in an chemical shift based NMR crystallography approach.
△ Less
Submitted 29 May, 2018;
originally announced May 2018.
-
Interatomic potentials for ionic systems with density functional accuracy based on charge densities obtained by a neural network
Authors:
S. Alireza Ghasemi,
Albert Hofstetter,
Santanu Saha,
Stefan Goedecker
Abstract:
Based on an analysis of the short range chemical environment of each atom in a system, standard machine learning based approaches to the construction of interatomic potentials aim at determining directly the central quantity which is the total energy. This prevents for instance an accurate description of the energetics of systems where long range charge transfer is important as well as of ionized…
▽ More
Based on an analysis of the short range chemical environment of each atom in a system, standard machine learning based approaches to the construction of interatomic potentials aim at determining directly the central quantity which is the total energy. This prevents for instance an accurate description of the energetics of systems where long range charge transfer is important as well as of ionized systems. We propose therefore not to target directly with machine learning methods the total energy but an intermediate physical quantity namely the charge density, which then in turn allows to determine the total energy. By allowing the electronic charge to distribute itself in an optimal way over the system, we can describe not only neutral but also ionized systems with unprecedented accuracy. We demonstrate the power of our approach for both neutral and ionized NaCl clusters where charge redistribution plays a decisive role for the energetics. We are able to obtain chemical accuracy, i.e. errors of less than a milli Hartree per atom compared to the reference density functional results. The introduction of physically motivated quantities which are determined by the short range atomic environment via a neural network leads also to an increased stability of the machine learning process and transferability of the potential.
△ Less
Submitted 29 January, 2015;
originally announced January 2015.