-
Training atomic neural networks using fragment-based data generated in virtual reality
Authors:
Silvia Amabilino,
Lars A. Bratholm,
Simon J. Bennie,
Michael B. O'Connor,
David R. Glowacki
Abstract:
The ability to understand and engineer molecular structures relies on having accurate descriptions of the energy as a function of atomic coordinates. Here we outline a new paradigm for deriving energy functions of hyperdimensional molecular systems, which involves generating data for low-dimensional systems in virtual reality (VR) to then efficiently train atomic neural networks (ANNs). This gener…
▽ More
The ability to understand and engineer molecular structures relies on having accurate descriptions of the energy as a function of atomic coordinates. Here we outline a new paradigm for deriving energy functions of hyperdimensional molecular systems, which involves generating data for low-dimensional systems in virtual reality (VR) to then efficiently train atomic neural networks (ANNs). This generates high quality data for specific areas of interest within the hyperdimensional space that characterizes a molecule's potential energy surface (PES). We demonstrate the utility of this approach by gathering data within VR to train ANNs on chemical reactions involving fewer than 8 heavy atoms. This strategy enables us to predict the energies of much higher-dimensional systems, e.g. containing nearly 100 atoms. Training on datasets containing only 15K geometries, this approach generates mean absolute errors around 2 kcal/mol. This represents one of the first times that an ANN-PES for a large reactive radical has been generated using such a small dataset. Our results suggest VR enables the intelligent curation of high-quality data, which accelerates the learning process.
△ Less
Submitted 30 May, 2020;
originally announced July 2020.
-
Interactive molecular dynamics in virtual reality from quantum chemistry to drug binding: An open-source multi-person framework
Authors:
Michael O'Connor,
Simon J. Bennie,
Helen M. Deeks,
Alexander Jamieson-Binnie,
Alex J. Jones,
Robin J. Shannon,
Rebecca Walters,
Thomas J. Mitchell,
Adrian J. Mulholland,
David R. Glowacki
Abstract:
As molecular scientists have made progress in their ability to engineer nano-scale molecular structure, we are facing new challenges in our ability to engineer molecular dynamics (MD) and flexibility. Dynamics at the molecular scale differs from the familiar mechanics of everyday objects, because it involves a complicated, highly correlated, and three-dimensional many-body dynamical choreography w…
▽ More
As molecular scientists have made progress in their ability to engineer nano-scale molecular structure, we are facing new challenges in our ability to engineer molecular dynamics (MD) and flexibility. Dynamics at the molecular scale differs from the familiar mechanics of everyday objects, because it involves a complicated, highly correlated, and three-dimensional many-body dynamical choreography which is often non-intuitive even for highly trained researchers. We recently described how interactive molecular dynamics in virtual reality (iMD-VR) can help to meet this challenge, enabling researchers to manipulate real-time MD simulations of flexible structures in 3D. In this article, we outline efforts to extend immersive technologies to the molecular sciences, and we introduce 'Narupa', a flexible, open-source, multi-person iMD-VR software framework which enables groups of researchers to simultaneously cohabit real-time simulation environments to interactively visualize and manipulate the dynamics of molecular structures with atomic-level precision. We outline several application domains where iMD-VR is facilitating research, communication, and creative approaches within the molecular sciences, including training machines to learn reactive potential energy surfaces (PESs), biomolecular conformational sampling, protein-ligand binding, reaction discovery using 'on-the-fly' quantum chemistry, and transport dynamics in materials. We touch on iMD-VR's various cognitive and perceptual affordances, and how these provide research insight for molecular systems. By synergistically combining human spatial reasoning and design insight with computational automation, technologies like iMD-VR have the potential to improve our ability to understand, engineer, and communicate microscopic dynamical behavior, offering the potential to usher in a new paradigm for engineering molecules and nano-architectures.
△ Less
Submitted 1 May, 2019; v1 submitted 5 February, 2019;
originally announced February 2019.
-
Training neural nets to learn reactive potential energy surfaces using interactive quantum chemistry in virtual reality
Authors:
Silvia Amabilino,
Lars A. Bratholm,
Simon J. Bennie,
Alain C. Vaucher,
Markus Reiher,
David R. Glowacki
Abstract:
Whilst the primary bottleneck to a number of computational workflows was not so long ago limited by processing power, the rise of machine learning technologies has resulted in a paradigm shift which places increasing value on issues related to data curation - i.e., data size, quality, bias, format, and coverage. Increasingly, data-related issues are equally as important as the algorithmic methods…
▽ More
Whilst the primary bottleneck to a number of computational workflows was not so long ago limited by processing power, the rise of machine learning technologies has resulted in a paradigm shift which places increasing value on issues related to data curation - i.e., data size, quality, bias, format, and coverage. Increasingly, data-related issues are equally as important as the algorithmic methods used to process and learn from the data. Here we introduce an open source GPU-accelerated neural network (NN) framework for learning reactive potential energy surfaces (PESs), and investigate the use of real-time interactive ab initio molecular dynamics in virtual reality (iMD-VR) as a new strategy for rapidly sampling geometries along reaction pathways which can be used to train NNs to learn reactive PESs. Focussing on hydrogen abstraction reactions of CN radical with isopentane, we compare the performance of NNs trained using iMD-VR data versus NNs trained using a more traditional method, namely molecular dynamics (MD) constrained to sample a predefined grid of points along hydrogen abstraction reaction coordinates. Both the NN trained using iMD-VR data and the NN trained using the constrained MD data reproduce important qualitative features of the reactive PESs, such as a low and early barrier to abstraction. Quantitatively, learning is sensitive to the training dataset. Our results show that user-sampled structures obtained with the quantum chemical iMD-VR machinery enable better sampling in the vicinity of the minimum energy path (MEP). As a result, the NN trained on the iMD-VR data does very well predicting energies in the vicinity of the MEP, but less well predicting energies for 'off-path' structures. The NN trained on the constrained MD data does better in predicting energies for 'off-path' structures, given that it included a number of such structures in its training set.
△ Less
Submitted 22 January, 2019; v1 submitted 16 January, 2019;
originally announced January 2019.