-
A Descriptor Is All You Need: Accurate Machine Learning of Nonadiabatic Coupling Vectors
Authors:
Jakub Martinka,
Lina Zhang,
Yi-Fan Hou,
Mikołaj Martyka,
Jiří Pittner,
Mario Barbatti,
Pavlo O. Dral
Abstract:
Nonadiabatic couplings (NACs) play a crucial role in modeling photochemical and photophysical processes with methods such as the widely used fewest-switches surface hopping (FSSH). There is therefore a strong incentive to machine learn NACs for accelerating simulations. However, this is challenging due to NACs' vectorial, double-valued character and the singularity near a conical intersection seam…
▽ More
Nonadiabatic couplings (NACs) play a crucial role in modeling photochemical and photophysical processes with methods such as the widely used fewest-switches surface hopping (FSSH). There is therefore a strong incentive to machine learn NACs for accelerating simulations. However, this is challenging due to NACs' vectorial, double-valued character and the singularity near a conical intersection seam. For the first time, we design NAC-specific descriptors based on our domain expertise and show that they allow learning NACs with never-before-reported accuracy of $R^2$ exceeding 0.99. The key to success is also our new ML phase-correction procedure. We demonstrate the efficiency and robustness of our approach on a prototypical example of fully ML-driven FSSH simulations of fulvene targeting the SA-2-CASSCF(6,6) electronic structure level. This ML-FSSH dynamics leads to an accurate description of $S_1$ decay while reducing error bars by allowing the execution of a large ensemble of trajectories. Our implementations are available in open-source MLatom.
△ Less
Submitted 29 May, 2025;
originally announced May 2025.
-
Artificial Intelligence for Direct Prediction of Molecular Dynamics Across Chemical Space
Authors:
Fuchun Ge,
Pavlo O. Dral
Abstract:
Molecular dynamics (MD) is a powerful tool for exploring the behavior of atomistic systems, but its reliance on sequential numerical integration limits simulation efficiency. We present MDtrajNet-1, a foundational AI model that directly generates MD trajectories across chemical space, bypassing force calculations and integration. This approach accelerates simulations by up to two orders of magnitu…
▽ More
Molecular dynamics (MD) is a powerful tool for exploring the behavior of atomistic systems, but its reliance on sequential numerical integration limits simulation efficiency. We present MDtrajNet-1, a foundational AI model that directly generates MD trajectories across chemical space, bypassing force calculations and integration. This approach accelerates simulations by up to two orders of magnitude compared to traditional MD, even those enhanced by machine-learning interatomic potentials. MDtrajNet-1 combines equivariant neural networks with a Transformer-based architecture to achieve strong accuracy and transferability in predicting long-time trajectories for both known and unseen systems. Remarkably, the errors of the trajectories generated by MDtrajNet-1 for various molecular systems are close to those of the conventional ab initio MD. The model's flexible design supports diverse application scenarios, including different statistical ensembles, boundary conditions, and interaction types. By overcoming the intrinsic speed barrier of conventional MD, MDtrajNet-1 opens new frontiers in efficient and scalable atomistic simulations.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
Aitomia: Your Intelligent Assistant for AI-Driven Atomistic and Quantum Chemical Simulations
Authors:
Jinming Hu,
Hassan Nawaz,
Yuting Rui,
Lijie Chi,
Arif Ullah,
Pavlo O. Dral
Abstract:
We have developed Aitomia - a platform powered by AI to assist in performing AI-driven atomistic and quantum chemical (QC) simulations. This evolving intelligent assistant platform is equipped with chatbots and AI agents to help experts and guide non-experts in setting up and running the atomistic simulations, monitoring their computation status, analyzing the simulation results, and summarizing t…
▽ More
We have developed Aitomia - a platform powered by AI to assist in performing AI-driven atomistic and quantum chemical (QC) simulations. This evolving intelligent assistant platform is equipped with chatbots and AI agents to help experts and guide non-experts in setting up and running the atomistic simulations, monitoring their computation status, analyzing the simulation results, and summarizing them for the user in text and graphical forms. We achieve these goals by exploiting open-source large language models (LLMs, original and fine-tuned), rule-based agents, and a retrieval-augmented generation (RAG) system. Aitomia leverages the versatility of our MLatom ecosystem, supporting AI-enhanced computational chemistry tasks ranging from ground- to excited-state calculations such as geometry optimizations, thermochemistry, and spectra calculations. Aitomia is the first intelligent assistant publicly accessible online on a cloud computing platform for atomistic simulations of broad scope (Aitomistic Hub at https://aitomistic.xyz), while it may also be deployed locally as described at http://mlatom.com/aitomia. Aitomia is expected to lower the barrier to performing atomistic simulations, democratizing simulations, and accelerating research and development in the relevant fields.
△ Less
Submitted 2 July, 2025; v1 submitted 12 May, 2025;
originally announced May 2025.
-
All-in-one foundational models learning across quantum chemical levels
Authors:
Yuxinxin Chen,
Pavlo O. Dral
Abstract:
Machine learning (ML) potentials typically target a single quantum chemical (QC) level while the ML models developed for multi-fidelity learning have not been shown to provide scalable solutions for foundational models. Here we introduce the all-in-one (AIO) ANI model architecture based on multimodal learning which can learn an arbitrary number of QC levels. Our all-in-one learning approach offers…
▽ More
Machine learning (ML) potentials typically target a single quantum chemical (QC) level while the ML models developed for multi-fidelity learning have not been shown to provide scalable solutions for foundational models. Here we introduce the all-in-one (AIO) ANI model architecture based on multimodal learning which can learn an arbitrary number of QC levels. Our all-in-one learning approach offers a more general and easier-to-use alternative to transfer learning. We use it to train the AIO-ANI-UIP foundational model with the generalization capability comparable to semi-empirical GFN2-xTB and DFT with a double-zeta basis set for organic molecules. We show that the AIO-ANI model can learn across different QC levels ranging from semi-empirical to density functional theory to coupled cluster. We also use AIO models to design the foundational model Δ-AIO-ANI based on Δ-learning with increased accuracy and robustness compared to AIO-ANI-UIP. The code and the foundational models are available at https://github.com/dralgroup/aio-ani; they will be integrated into the universal and updatable AI-enhanced QM (UAIQM) library and made available in the MLatom package so that they can be used online at the XACS cloud computing platform (see https://github.com/dralgroup/mlatom for updates).
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
Molecular Quantum Chemical Data Sets and Databases for Machine Learning Potentials
Authors:
Arif Ullah,
Yuxinxin Chen,
Pavlo O. Dral
Abstract:
The field of computational chemistry is increasingly leveraging machine learning (ML) potentials to predict molecular properties with high accuracy and efficiency, providing a viable alternative to traditional quantum mechanical (QM) methods, which are often computationally intensive. Central to the success of ML models is the quality and comprehensiveness of the data sets on which they are traine…
▽ More
The field of computational chemistry is increasingly leveraging machine learning (ML) potentials to predict molecular properties with high accuracy and efficiency, providing a viable alternative to traditional quantum mechanical (QM) methods, which are often computationally intensive. Central to the success of ML models is the quality and comprehensiveness of the data sets on which they are trained. Quantum chemistry data sets and databases, comprising extensive information on molecular structures, energies, forces, and other properties derived from QM calculations, are crucial for developing robust and generalizable ML potentials. In this review, we provide an overview of the current landscape of quantum chemical data sets and databases. We examine key characteristics and functionalities of prominent resources, including the types of information they store, the level of electronic structure theory employed, the diversity of chemical space covered, and the methodologies used for data creation. Additionally, an updatable resource is provided to track new data sets and databases at https://github.com/Arif-PhyChem/datasets_and_databases_4_MLPs. Looking forward, we discuss the challenges associated with the rapid growth of quantum chemical data sets and databases, emphasizing the need for updatable and accessible resources to ensure the long-term utility of them. We also address the importance of data format standardization and the ongoing efforts to align with the FAIR principles to enhance data interoperability and reusability. Drawing inspiration from established materials databases, we advocate for the development of user-friendly and sustainable platforms for these data sets and databases.
△ Less
Submitted 13 October, 2024; v1 submitted 21 August, 2024;
originally announced August 2024.
-
A simple approach to rotationally invariant machine learning of avector quantity
Authors:
Jakub Martinka,
Marek Pederzoli,
Mario Barbatti,
Pavlo O. Dral,
Jiří Pittner
Abstract:
Unlike with the energy, which is a scalar property, machine learning (ML) predictions of vector or tensor properties poses the additional challenge of achieving proper invariance (covariance) with respect to molecular rotation. If the properties cannot be obtained by differentiation, other appropriate methods should be applied to retain the covariance. There have been several approaches suggested…
▽ More
Unlike with the energy, which is a scalar property, machine learning (ML) predictions of vector or tensor properties poses the additional challenge of achieving proper invariance (covariance) with respect to molecular rotation. If the properties cannot be obtained by differentiation, other appropriate methods should be applied to retain the covariance. There have been several approaches suggested to properly treat this issue. For nonadiabatic couplings and polarizabilities, for example, it was possible to construct virtual quantities from which the above tensorial properties are obtained by differentiation and thus guarantee the covariance. Here we propose a simpler alternative technique, which does not require construction of auxiliary properties or application of special equivariant ML techniques. We suggest a three-step approach, using the molecular tensor of inertia. In the first step, the molecule is rotated using the eigenvectors of this tensor to its principal axes. In the second step, the ML procedure predicts the vector property relative to this orientation, based on a training set where all vector properties were in this same coordinate system. As third step, it remains to transform the ML estimate of the vector property back to the original orientation. This rotate-predict-rotate (RPR) procedure should thus guarantee proper covariance of a vector property and is trivially extensible also to tensors such as polarizability. The PRP procedure has an advantage that the accurate models can be trained very fast for thousands of molecular configurations which might be beneficial where many trainings are required (e.g., in active learning). We have implemented the RPR technique, using the MLatom and Newton-X programs for ML and MD and performed its assessment on the dipole moment along MD trajectories of 1,2-dichloroethane.
△ Less
Submitted 23 July, 2024; v1 submitted 18 July, 2024;
originally announced July 2024.
-
Physics-Informed Neural Networks and Beyond: Enforcing Physical Constraints in Quantum Dissipative Dynamics
Authors:
Arif Ullah,
Yu Huang,
Ming Yang,
Pavlo O. Dral
Abstract:
Neural networks (NNs) accelerate simulations of quantum dissipative dynamics. Ensuring that these simulations adhere to fundamental physical laws is crucial, but has been largely ignored in the state-of-the-art NN approaches. We show that this may lead to implausible results measured by violation of the trace conservation. To recover the correct physical behavior, we develop physics-informed NNs (…
▽ More
Neural networks (NNs) accelerate simulations of quantum dissipative dynamics. Ensuring that these simulations adhere to fundamental physical laws is crucial, but has been largely ignored in the state-of-the-art NN approaches. We show that this may lead to implausible results measured by violation of the trace conservation. To recover the correct physical behavior, we develop physics-informed NNs (PINNs) that mitigate the violations to a good extend. Beyond that, we propose a novel uncertainty-aware approach that enforces perfect trace conservation by design, surpassing PINNs.
△ Less
Submitted 5 September, 2024; v1 submitted 22 April, 2024;
originally announced April 2024.
-
Physics-informed active learning for accelerating quantum chemical simulations
Authors:
Yi-Fan Hou,
Lina Zhang,
Quanhao Zhang,
Fuchun Ge,
Pavlo O. Dral
Abstract:
Quantum chemical simulations can be greatly accelerated by constructing machine learning potentials, which is often done using active learning (AL). The usefulness of the constructed potentials is often limited by the high effort required and their insufficient robustness in the simulations. Here we introduce the end-to-end AL for constructing robust data-efficient potentials with affordable inves…
▽ More
Quantum chemical simulations can be greatly accelerated by constructing machine learning potentials, which is often done using active learning (AL). The usefulness of the constructed potentials is often limited by the high effort required and their insufficient robustness in the simulations. Here we introduce the end-to-end AL for constructing robust data-efficient potentials with affordable investment of time and resources and minimum human interference. Our AL protocol is based on the physics-informed sampling of training points, automatic selection of initial data, uncertainty quantification, and convergence monitoring. The versatility of this protocol is shown in our implementation of quasi-classical molecular dynamics for simulating vibrational spectra, conformer search of a key biochemical molecule, and time-resolved mechanism of the Diels-Alder reactions. These investigations took us days instead of weeks of pure quantum chemical calculations on a high-performance computing cluster. The code in MLatom and tutorials are available at https://github.com/dralgroup/mlatom.
△ Less
Submitted 16 July, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
MLatom software ecosystem for surface hopping dynamics in Python with quantum mechanical and machine learning methods
Authors:
Lina Zhang,
Sebastian V. Pios,
Mikołaj Martyka,
Fuchun Ge,
Yi-Fan Hou,
Yuxinxin Chen,
Lipeng Chen,
Joanna Jankowska,
Mario Barbatti,
Pavlo O. Dral
Abstract:
We present an open-source MLatom@XACS software ecosystem for on-the-fly surface hopping nonadiabatic dynamics based on the Landau-Zener-Belyaev-Lebedev (LZBL) algorithm. The dynamics can be performed via Python API with a wide range of quantum mechanical (QM) and machine learning (ML) methods, including ab initio QM (CASSCF and ADC(2)), semi-empirical QM methods (e.g., AM1, PM3, OMx, and ODMx), an…
▽ More
We present an open-source MLatom@XACS software ecosystem for on-the-fly surface hopping nonadiabatic dynamics based on the Landau-Zener-Belyaev-Lebedev (LZBL) algorithm. The dynamics can be performed via Python API with a wide range of quantum mechanical (QM) and machine learning (ML) methods, including ab initio QM (CASSCF and ADC(2)), semi-empirical QM methods (e.g., AM1, PM3, OMx, and ODMx), and many types of machine learning potentials (e.g., KREG, ANI, and MACE). Combinations of QM and ML methods can also be used. While the user can build their own combinations, we provide AIQM1, which is based on Δ-learning and can be used out of the box. We showcase how AIQM1 reproduces the isomerization quantum yield of trans-azobenzene at a low cost. We provide example scripts that, in a dozen lines, enable the user to obtain the final population plots by simply providing the initial geometry of a molecule. Thus, those scripts perform geometry optimization, normal mode calculations, initial condition sampling, parallel trajectories propagation, population analysis, and final result plotting. Given the capabilities of MLatom to be used for training different ML models, this ecosystem can be seamlessly integrated into the protocols building ML models for nonadiabatic dynamics. In the future, a deeper and more efficient integration of MLatom with Newton-X will enable vast range of functionalities for surface hopping dynamics, such as fewest-switches surface hopping, to facilitate similar workflows via the Python API.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Tell machine learning potentials what they are needed for: Simulation-oriented training exemplified for glycine
Authors:
Fuchun Ge,
Ran Wang,
Chen Qu,
Peikun Zheng,
Apurba Nandi,
Riccardo Conte,
Paul L. Houston,
Joel M. Bowman,
Pavlo O. Dral
Abstract:
Machine learning potentials (MLPs) are widely applied as an efficient alternative way to represent potential energy surfaces (PES) in many chemical simulations. The MLPs are often evaluated with the root-mean-square errors on the test set drawn from the same distribution as the training data. Here, we systematically investigate the relationship between such test errors and the simulation accuracy…
▽ More
Machine learning potentials (MLPs) are widely applied as an efficient alternative way to represent potential energy surfaces (PES) in many chemical simulations. The MLPs are often evaluated with the root-mean-square errors on the test set drawn from the same distribution as the training data. Here, we systematically investigate the relationship between such test errors and the simulation accuracy with MLPs on an example of a full-dimensional, global PES for the glycine amino acid. Our results show that the errors in the test set do not unambiguously reflect the MLP performance in different simulation tasks such as relative conformer energies, barriers, vibrational levels, and zero-point vibrational energies. We also offer an easily accessible solution for improving the MLP quality in a simulation-oriented manner, yielding the most precise relative conformer energies and barriers. This solution also passed the stringent test by the diffusion Monte Carlo simulations.
△ Less
Submitted 7 April, 2024; v1 submitted 17 March, 2024;
originally announced March 2024.
-
AI-enhanced on-the-fly simulation of nonlinear time-resolved spectra
Authors:
Sebastian V. Pios,
Maxim F. Gelin,
Arif Ullah,
Pavlo O. Dral,
Lipeng Chen
Abstract:
Time-resolved spectroscopy is an important tool for unraveling the minute details of structural changes of molecules of biological and technological significance. The nonlinear femtosecond signals detected for such systems must be interpreted, but it is a challenging task for which theoretical simulations are often indispensable. Accurate simulations of transient-absorption or two-dimensional elec…
▽ More
Time-resolved spectroscopy is an important tool for unraveling the minute details of structural changes of molecules of biological and technological significance. The nonlinear femtosecond signals detected for such systems must be interpreted, but it is a challenging task for which theoretical simulations are often indispensable. Accurate simulations of transient-absorption or two-dimensional electronic spectra are, however, computationally very expensive, prohibiting the wider adoption of existing first-principles methods. Here, we report an AI-enhanced protocol to drastically reduce the computational cost of simulating nonlinear time-resolved electronic spectra which makes such simulations affordable for polyatomic molecules of increasing size. The protocol is based on doorway-window approach for the on-the-fly surface-hopping simulations. We show its applicability for the prototypical molecule of pyrazine for which it produces spectra with high precision with respect to ab initio reference while cutting the computational cost by at least 95% compared to pure first-principles simulations.
△ Less
Submitted 14 January, 2024;
originally announced January 2024.
-
MLatom 3: Platform for machine learning-enhanced computational chemistry simulations and workflows
Authors:
Pavlo O. Dral,
Fuchun Ge,
Yi-Fan Hou,
Peikun Zheng,
Yuxinxin Chen,
Mario Barbatti,
Olexandr Isayev,
Cheng Wang,
Bao-Xin Xue,
Max Pinheiro Jr,
Yuming Su,
Yiheng Dai,
Yangtao Chen,
Lina Zhang,
Shuang Zhang,
Arif Ullah,
Quanhao Zhang,
Yanchi Ou
Abstract:
Machine learning (ML) is increasingly becoming a common tool in computational chemistry. At the same time, the rapid development of ML methods requires a flexible software framework for designing custom workflows. MLatom 3 is a program package designed to leverage the power of ML to enhance typical computational chemistry simulations and to create complex workflows. This open-source package provid…
▽ More
Machine learning (ML) is increasingly becoming a common tool in computational chemistry. At the same time, the rapid development of ML methods requires a flexible software framework for designing custom workflows. MLatom 3 is a program package designed to leverage the power of ML to enhance typical computational chemistry simulations and to create complex workflows. This open-source package provides plenty of choice to the users who can run simulations with the command line options, input files, or with scripts using MLatom as a Python package, both on their computers and on the online XACS cloud computing at XACScloud.com. Computational chemists can calculate energies and thermochemical properties, optimize geometries, run molecular and quantum dynamics, and simulate (ro)vibrational, one-photon UV/vis absorption, and two-photon absorption spectra with ML, quantum mechanical, and combined models. The users can choose from an extensive library of methods containing pre-trained ML models and quantum mechanical approximations such as AIQM1 approaching coupled-cluster accuracy. The developers can build their own models using various ML algorithms. The great flexibility of MLatom is largely due to the extensive use of the interfaces to many state-of-the-art software packages and libraries.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Four-Dimensional-Spacetime Atomistic Artificial Intelligence Models
Authors:
Fuchun Ge,
Lina Zhang,
Yi-Fan Hou,
Yuxinxin Chen,
Arif Ullah,
Pavlo O. Dral
Abstract:
We demonstrate that AI can learn atomistic systems in the four-dimensional (4D) spacetime. For this, we introduce the 4D-spacetime GICnet model which for the given initial conditions - nuclear positions and velocities at time zero - can predict nuclear positions and velocities as a continuous function of time up to the distant future. Such models of molecules can be unrolled in the time dimension…
▽ More
We demonstrate that AI can learn atomistic systems in the four-dimensional (4D) spacetime. For this, we introduce the 4D-spacetime GICnet model which for the given initial conditions - nuclear positions and velocities at time zero - can predict nuclear positions and velocities as a continuous function of time up to the distant future. Such models of molecules can be unrolled in the time dimension to yield long-time high-resolution molecular dynamics trajectories with high efficiency and accuracy. 4D-spacetime models can make predictions for different times in any order and do not need a stepwise evaluation of forces and integration of the equations of motions at discretized time steps, which is a major advance over the traditional, cost-inefficient molecular dynamics. These models can be used to speed up dynamics, simulate vibrational spectra, and obtain deeper insight into nuclear motions as we demonstrate for a series of organic molecules.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
Energy-conserving molecular dynamics is not energy conserving
Authors:
Lina Zhang,
Yi-Fan Hou,
Fuchun Ge,
Pavlo O. Dral
Abstract:
Molecular dynamics (MD) is a widely-used tool for simulating the molecular and materials properties. It is a common wisdom that molecular dynamics simulations should obey physical laws and, hence, lots of effort is put into ensuring that molecular dynamics simulations are energy conserving. The emergence of machine learning (ML) potentials for MD leads to a growing realization that monitoring cons…
▽ More
Molecular dynamics (MD) is a widely-used tool for simulating the molecular and materials properties. It is a common wisdom that molecular dynamics simulations should obey physical laws and, hence, lots of effort is put into ensuring that molecular dynamics simulations are energy conserving. The emergence of machine learning (ML) potentials for MD leads to a growing realization that monitoring conservation of energy during simulations is of low utility because the dynamics is often unphysically dissociative. Other ML methods for MD are not based on a potential and provide only forces or trajectories which are reasonable but not necessarily energy-conserving. Here we propose to clearly distinguish between the simulation-energy and true-energy conservation and highlight that the simulations should focus on decreasing the degree of true-energy non-conservation. We introduce very simple, new criteria for evaluating the quality of molecular dynamics estimating the degree of true-energy non-conservation and we demonstrate their practical utility on an example of infrared spectra simulations. These criteria are more important and intuitive than simply evaluating the quality of the ML potential energies and forces as is commonly done and can be applied universally, e.g., even for trajectories with unknown or discontinuous potential energy. Such an approach introduces new standards for evaluating MD by focusing on the true-energy conservation and can help in developing more accurate methods for simulating molecular and materials properties.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
MLQD: A package for machine learning-based quantum dissipative dynamics
Authors:
Arif Ullah,
Pavlo O. Dral
Abstract:
Machine learning has emerged as a promising paradigm to study the quantum dissipative dynamics of open quantum systems. To facilitate the use of our recently published ML-based approaches for quantum dissipative dynamics, here we present an open-source Python package MLQD (https://github.com/Arif-PhyChem/MLQD), which currently supports the three ML-based quantum dynamics approaches: (1) the recurs…
▽ More
Machine learning has emerged as a promising paradigm to study the quantum dissipative dynamics of open quantum systems. To facilitate the use of our recently published ML-based approaches for quantum dissipative dynamics, here we present an open-source Python package MLQD (https://github.com/Arif-PhyChem/MLQD), which currently supports the three ML-based quantum dynamics approaches: (1) the recursive dynamics with kernel ridge regression (KRR) method, (2) the non-recursive artificial-intelligence-based quantum dynamics (AIQD) approach and (3) the blazingly fast one-shot trajectory learning (OSTL) approach, where both AIQD and OSTL use the convolutional neural networks (CNN). This paper describes the features of the MLQD package, the technical details, optimization of hyperparameters, visualization of results, and the demonstration of the MLQD's applicability for two widely studied systems, namely the spin-boson model and the Fenna--Matthews--Olson (FMO) complex. To make MLQD more user-friendly and accessible, we have made it available on the XACS cloud computing platform (https://XACScloud.com) via the interface to the MLatom package (http://MLatom.com).
△ Less
Submitted 20 September, 2023; v1 submitted 28 February, 2023;
originally announced March 2023.
-
QD3SET-1: A Database with Quantum Dissipative Dynamics Data Sets
Authors:
Arif Ullah,
Luis E. Herrera Rodriguez,
Pavlo O. Dral,
Alexei A. Kananenka
Abstract:
Simulations of the dynamics of dissipative quantum systems utilize many methods such as physics-based quantum, semiclassical, and quantum-classical as well as machine learning-based approximations, development and testing of which requires diverse data sets. Here we present a new database QD3SET-1 containing eight data sets of quantum dynamical data for two systems of broad interest, spin-boson (S…
▽ More
Simulations of the dynamics of dissipative quantum systems utilize many methods such as physics-based quantum, semiclassical, and quantum-classical as well as machine learning-based approximations, development and testing of which requires diverse data sets. Here we present a new database QD3SET-1 containing eight data sets of quantum dynamical data for two systems of broad interest, spin-boson (SB) model and the Fenna--Matthews--Olson (FMO) complex, generated with two different methods solving the dynamics, approximate local thermalizing Lindblad master equation (LTLME) and highly accurate hierarchy equations of motion (HEOM). One data set was generated with the SB model which is a two-level quantum system coupled to a harmonic environment using HEOM for 1,000 model parameters. Seven data sets were collected for the FMO complex of different sizes(7- and 8-site monomer and 24-site trimer with LTLME and 8-site monomer with HEOM) for 500--879 model parameters. Our QD3SET-1 database contains both population and coherence dynamics data and part of it has been already used for machine learning-based quantum dynamics studies.
△ Less
Submitted 28 January, 2023;
originally announced January 2023.
-
Ultra-Fast Semi-Empirical Quantum Chemistry for High-Throughput Computational Campaigns with Sparrow
Authors:
Francesco Bosia,
Peikun Zheng,
Alain Vaucher,
Thomas Weymuth,
Pavlo O. Dral,
Markus Reiher
Abstract:
Semi-empirical quantum chemical approaches are known to compromise accuracy for feasibility of calculations on huge molecules. However, the need for ultrafast calculations in interactive quantum mechanical studies, high-throughput virtual screening, and for data-driven machine learning has shifted the emphasis towards calculation runtimes recently. This comes with new constraints for the software…
▽ More
Semi-empirical quantum chemical approaches are known to compromise accuracy for feasibility of calculations on huge molecules. However, the need for ultrafast calculations in interactive quantum mechanical studies, high-throughput virtual screening, and for data-driven machine learning has shifted the emphasis towards calculation runtimes recently. This comes with new constraints for the software implementation as many fast calculations would suffer from a large overhead of manual setup and other procedures that are comparatively fast when studying a single molecular structure, but which become prohibitively slow for high-throughput demands. In this work, we discuss the effect of various well-established semi-empirical approximations on calculation speed and relate this to data transfer rates from the raw-data source computer to the results visualization front end. For the former, we consider desktop computers, local high performance computing, as well as remote cloud services in order to elucidate the effect on interactive calculations, for web and cloud interfaces in local applications, and in world-wide interactive virtual sessions. The models discussed in this work have been implemented into our open-source software SCINE Sparrow.
△ Less
Submitted 10 April, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
A comparative study of different machine learning methods for dissipative quantum dynamics
Authors:
Luis E. Herrera Rodriguez,
Arif Ullah,
Kennet J. Rueda Espinosa,
Pavlo O. Dral,
Alexei A. Kananenka
Abstract:
It has been recently shown that supervised machine learning (ML) algorithms can accurately and efficiently predict the long-time populations dynamics of dissipative quantum systems given only short-time population dynamics. In the present article we benchmaked 22 ML models on their ability to predict long-time dynamics of a two-level quantum system linearly coupled to harmonic bath. The models inc…
▽ More
It has been recently shown that supervised machine learning (ML) algorithms can accurately and efficiently predict the long-time populations dynamics of dissipative quantum systems given only short-time population dynamics. In the present article we benchmaked 22 ML models on their ability to predict long-time dynamics of a two-level quantum system linearly coupled to harmonic bath. The models include uni- and bidirectional recurrent, convolutional, and fully-connected feed-forward artificial neural networks (ANNs) and kernel ridge regression (KRR) with linear and most commonly used nonlinear kernels. Our results suggest that KRR with nonlinear kernels can serve as inexpensive yet accurate way to simulate long-time dynamics in cases where the constant length of input trajectories is appropriate. Convolutional Gated Recurrent Unit model is found to be the most efficient ANN model.
△ Less
Submitted 5 July, 2022;
originally announced July 2022.
-
One-shot trajectory learning of open quantum systems dynamics
Authors:
Arif Ullah,
Pavlo O. Dral
Abstract:
Nonadiabatic quantum dynamics are important for understanding light-harvesting processes, but their propagation with traditional methods can be rather expensive. Here we present a one-shot trajectory learning approach that allows to directly make ultra-fast prediction of the entire trajectory of the reduced density matrix for a new set of such simulation parameters as temperature and reorganizatio…
▽ More
Nonadiabatic quantum dynamics are important for understanding light-harvesting processes, but their propagation with traditional methods can be rather expensive. Here we present a one-shot trajectory learning approach that allows to directly make ultra-fast prediction of the entire trajectory of the reduced density matrix for a new set of such simulation parameters as temperature and reorganization energy. The whole 10ps long propagation takes 70 milliseconds as we demonstrate on the comparatively large quantum system, the Fenna-Matthews-Olsen (FMO) complex. Our approach also significantly reduces time and memory requirements for training.
△ Less
Submitted 8 June, 2022; v1 submitted 26 April, 2022;
originally announced April 2022.
-
Structure-based Sampling and Self-correcting Machine Learning for Accurate Calculations of Potential Energy Surfaces and Vibrational Levels
Authors:
Pavlo O. Dral,
Alec Owens,
Sergei N. Yurchenko,
Walter Thiel
Abstract:
We present an efficient approach for generating highly accurate molecular potential energy surfaces (PESs) using self-correcting, kernel ridge regression (KRR) based machine learning (ML). We introduce structure-based sampling to automatically assign nuclear configurations from a pre-defined grid to the training and prediction sets, respectively. Accurate high-level \textit{ab initio} energies are…
▽ More
We present an efficient approach for generating highly accurate molecular potential energy surfaces (PESs) using self-correcting, kernel ridge regression (KRR) based machine learning (ML). We introduce structure-based sampling to automatically assign nuclear configurations from a pre-defined grid to the training and prediction sets, respectively. Accurate high-level \textit{ab initio} energies are required only for the points in the training set, while the energies for the remaining points are provided by the ML model with negligible computational cost. The proposed sampling procedure is shown to be superior to random sampling and also eliminates the need for training several ML models. Self-correcting machine learning has been implemented such that each additional layer corrects errors from the previous layer. The performance of our approach is demonstrated in a case study on a published high-level \textit{ab initio} PES of methyl chloride with 44,819 points. The ML model is trained on sets of different size and then used to predict the energies for tens of thousands of nuclear configurations within seconds. The resulting datasets are utilized in variational calculations of the vibrational energy levels of CH$_3$Cl. By using both structure-based sampling and self-correction, the size of the training set can be kept small (e.g. 10\% of the points) without any significant loss of accuracy. In \textit{ab initio} rovibrational spectroscopy, it is thus possible to reduce the number of computationally costly electronic structure calculations through structure-based sampling and self-correcting KKR-based machine learning by up to 90\%.
△ Less
Submitted 17 August, 2018;
originally announced August 2018.
-
Big Data meets Quantum Chemistry Approximations: The $Δ$-Machine Learning Approach
Authors:
Raghunathan Ramakrishnan,
Pavlo O. Dral,
Matthias Rupp,
O. Anatole von Lilienfeld
Abstract:
Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron…
▽ More
Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger molecular sets than used for training. For thermochemical properties of up to 16k constitutional isomers of C$_7$H$_{10}$O$_2$ we present numerical evidence that chemical accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qualitative relationship between molecular entropy and electron correlation. The transferability of our approach is demonstrated, using semi-empirical quantum chemistry and machine learning models trained on 1 and 10\% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy.
△ Less
Submitted 17 March, 2015;
originally announced March 2015.