Search | arXiv e-print repository

A Descriptor Is All You Need: Accurate Machine Learning of Nonadiabatic Coupling Vectors

Authors: Jakub Martinka, Lina Zhang, Yi-Fan Hou, Mikołaj Martyka, Jiří Pittner, Mario Barbatti, Pavlo O. Dral

Abstract: Nonadiabatic couplings (NACs) play a crucial role in modeling photochemical and photophysical processes with methods such as the widely used fewest-switches surface hopping (FSSH). There is therefore a strong incentive to machine learn NACs for accelerating simulations. However, this is challenging due to NACs' vectorial, double-valued character and the singularity near a conical intersection seam… ▽ More Nonadiabatic couplings (NACs) play a crucial role in modeling photochemical and photophysical processes with methods such as the widely used fewest-switches surface hopping (FSSH). There is therefore a strong incentive to machine learn NACs for accelerating simulations. However, this is challenging due to NACs' vectorial, double-valued character and the singularity near a conical intersection seam. For the first time, we design NAC-specific descriptors based on our domain expertise and show that they allow learning NACs with never-before-reported accuracy of $R^2$ exceeding 0.99. The key to success is also our new ML phase-correction procedure. We demonstrate the efficiency and robustness of our approach on a prototypical example of fully ML-driven FSSH simulations of fulvene targeting the SA-2-CASSCF(6,6) electronic structure level. This ML-FSSH dynamics leads to an accurate description of $S_1$ decay while reducing error bars by allowing the execution of a large ensemble of trajectories. Our implementations are available in open-source MLatom. △ Less

Submitted 29 May, 2025; originally announced May 2025.

arXiv:2505.16301 [pdf]

Artificial Intelligence for Direct Prediction of Molecular Dynamics Across Chemical Space

Authors: Fuchun Ge, Pavlo O. Dral

Abstract: Molecular dynamics (MD) is a powerful tool for exploring the behavior of atomistic systems, but its reliance on sequential numerical integration limits simulation efficiency. We present MDtrajNet-1, a foundational AI model that directly generates MD trajectories across chemical space, bypassing force calculations and integration. This approach accelerates simulations by up to two orders of magnitu… ▽ More Molecular dynamics (MD) is a powerful tool for exploring the behavior of atomistic systems, but its reliance on sequential numerical integration limits simulation efficiency. We present MDtrajNet-1, a foundational AI model that directly generates MD trajectories across chemical space, bypassing force calculations and integration. This approach accelerates simulations by up to two orders of magnitude compared to traditional MD, even those enhanced by machine-learning interatomic potentials. MDtrajNet-1 combines equivariant neural networks with a Transformer-based architecture to achieve strong accuracy and transferability in predicting long-time trajectories for both known and unseen systems. Remarkably, the errors of the trajectories generated by MDtrajNet-1 for various molecular systems are close to those of the conventional ab initio MD. The model's flexible design supports diverse application scenarios, including different statistical ensembles, boundary conditions, and interaction types. By overcoming the intrinsic speed barrier of conventional MD, MDtrajNet-1 opens new frontiers in efficient and scalable atomistic simulations. △ Less

Submitted 22 May, 2025; originally announced May 2025.

arXiv:2505.08195 [pdf]

Aitomia: Your Intelligent Assistant for AI-Driven Atomistic and Quantum Chemical Simulations

Authors: Jinming Hu, Hassan Nawaz, Yuting Rui, Lijie Chi, Arif Ullah, Pavlo O. Dral

Abstract: We have developed Aitomia - a platform powered by AI to assist in performing AI-driven atomistic and quantum chemical (QC) simulations. This evolving intelligent assistant platform is equipped with chatbots and AI agents to help experts and guide non-experts in setting up and running the atomistic simulations, monitoring their computation status, analyzing the simulation results, and summarizing t… ▽ More We have developed Aitomia - a platform powered by AI to assist in performing AI-driven atomistic and quantum chemical (QC) simulations. This evolving intelligent assistant platform is equipped with chatbots and AI agents to help experts and guide non-experts in setting up and running the atomistic simulations, monitoring their computation status, analyzing the simulation results, and summarizing them for the user in text and graphical forms. We achieve these goals by exploiting open-source large language models (LLMs, original and fine-tuned), rule-based agents, and a retrieval-augmented generation (RAG) system. Aitomia leverages the versatility of our MLatom ecosystem, supporting AI-enhanced computational chemistry tasks ranging from ground- to excited-state calculations such as geometry optimizations, thermochemistry, and spectra calculations. Aitomia is the first intelligent assistant publicly accessible online on a cloud computing platform for atomistic simulations of broad scope (Aitomistic Hub at https://aitomistic.xyz), while it may also be deployed locally as described at http://mlatom.com/aitomia. Aitomia is expected to lower the barrier to performing atomistic simulations, democratizing simulations, and accelerating research and development in the relevant fields. △ Less

Submitted 2 July, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

arXiv:2409.12015 [pdf]

All-in-one foundational models learning across quantum chemical levels

Authors: Yuxinxin Chen, Pavlo O. Dral

Abstract: Machine learning (ML) potentials typically target a single quantum chemical (QC) level while the ML models developed for multi-fidelity learning have not been shown to provide scalable solutions for foundational models. Here we introduce the all-in-one (AIO) ANI model architecture based on multimodal learning which can learn an arbitrary number of QC levels. Our all-in-one learning approach offers… ▽ More Machine learning (ML) potentials typically target a single quantum chemical (QC) level while the ML models developed for multi-fidelity learning have not been shown to provide scalable solutions for foundational models. Here we introduce the all-in-one (AIO) ANI model architecture based on multimodal learning which can learn an arbitrary number of QC levels. Our all-in-one learning approach offers a more general and easier-to-use alternative to transfer learning. We use it to train the AIO-ANI-UIP foundational model with the generalization capability comparable to semi-empirical GFN2-xTB and DFT with a double-zeta basis set for organic molecules. We show that the AIO-ANI model can learn across different QC levels ranging from semi-empirical to density functional theory to coupled cluster. We also use AIO models to design the foundational model Δ-AIO-ANI based on Δ-learning with increased accuracy and robustness compared to AIO-ANI-UIP. The code and the foundational models are available at https://github.com/dralgroup/aio-ani; they will be integrated into the universal and updatable AI-enhanced QM (UAIQM) library and made available in the MLatom package so that they can be used online at the XACS cloud computing platform (see https://github.com/dralgroup/mlatom for updates). △ Less

Submitted 18 September, 2024; originally announced September 2024.

arXiv:2408.12058 [pdf, ps, other]

doi 10.1088/2632-2153/ad8f13

Molecular Quantum Chemical Data Sets and Databases for Machine Learning Potentials

Authors: Arif Ullah, Yuxinxin Chen, Pavlo O. Dral

Abstract: The field of computational chemistry is increasingly leveraging machine learning (ML) potentials to predict molecular properties with high accuracy and efficiency, providing a viable alternative to traditional quantum mechanical (QM) methods, which are often computationally intensive. Central to the success of ML models is the quality and comprehensiveness of the data sets on which they are traine… ▽ More The field of computational chemistry is increasingly leveraging machine learning (ML) potentials to predict molecular properties with high accuracy and efficiency, providing a viable alternative to traditional quantum mechanical (QM) methods, which are often computationally intensive. Central to the success of ML models is the quality and comprehensiveness of the data sets on which they are trained. Quantum chemistry data sets and databases, comprising extensive information on molecular structures, energies, forces, and other properties derived from QM calculations, are crucial for developing robust and generalizable ML potentials. In this review, we provide an overview of the current landscape of quantum chemical data sets and databases. We examine key characteristics and functionalities of prominent resources, including the types of information they store, the level of electronic structure theory employed, the diversity of chemical space covered, and the methodologies used for data creation. Additionally, an updatable resource is provided to track new data sets and databases at https://github.com/Arif-PhyChem/datasets_and_databases_4_MLPs. Looking forward, we discuss the challenges associated with the rapid growth of quantum chemical data sets and databases, emphasizing the need for updatable and accessible resources to ensure the long-term utility of them. We also address the importance of data format standardization and the ongoing efforts to align with the FAIR principles to enhance data interoperability and reusability. Drawing inspiration from established materials databases, we advocate for the development of user-friendly and sustainable platforms for these data sets and databases. △ Less

Submitted 13 October, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

arXiv:2407.13468 [pdf, other]

A simple approach to rotationally invariant machine learning of avector quantity

Authors: Jakub Martinka, Marek Pederzoli, Mario Barbatti, Pavlo O. Dral, Jiří Pittner

Abstract: Unlike with the energy, which is a scalar property, machine learning (ML) predictions of vector or tensor properties poses the additional challenge of achieving proper invariance (covariance) with respect to molecular rotation. If the properties cannot be obtained by differentiation, other appropriate methods should be applied to retain the covariance. There have been several approaches suggested… ▽ More Unlike with the energy, which is a scalar property, machine learning (ML) predictions of vector or tensor properties poses the additional challenge of achieving proper invariance (covariance) with respect to molecular rotation. If the properties cannot be obtained by differentiation, other appropriate methods should be applied to retain the covariance. There have been several approaches suggested to properly treat this issue. For nonadiabatic couplings and polarizabilities, for example, it was possible to construct virtual quantities from which the above tensorial properties are obtained by differentiation and thus guarantee the covariance. Here we propose a simpler alternative technique, which does not require construction of auxiliary properties or application of special equivariant ML techniques. We suggest a three-step approach, using the molecular tensor of inertia. In the first step, the molecule is rotated using the eigenvectors of this tensor to its principal axes. In the second step, the ML procedure predicts the vector property relative to this orientation, based on a training set where all vector properties were in this same coordinate system. As third step, it remains to transform the ML estimate of the vector property back to the original orientation. This rotate-predict-rotate (RPR) procedure should thus guarantee proper covariance of a vector property and is trivially extensible also to tensors such as polarizability. The PRP procedure has an advantage that the accurate models can be trained very fast for thousands of molecular configurations which might be beneficial where many trainings are required (e.g., in active learning). We have implemented the RPR technique, using the MLatom and Newton-X programs for ML and MD and performed its assessment on the dipole moment along MD trajectories of 1,2-dichloroethane. △ Less

Submitted 23 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

Comments: typo corrected, reference added

arXiv:2404.14021 [pdf, other]

Physics-Informed Neural Networks and Beyond: Enforcing Physical Constraints in Quantum Dissipative Dynamics

Authors: Arif Ullah, Yu Huang, Ming Yang, Pavlo O. Dral

Abstract: Neural networks (NNs) accelerate simulations of quantum dissipative dynamics. Ensuring that these simulations adhere to fundamental physical laws is crucial, but has been largely ignored in the state-of-the-art NN approaches. We show that this may lead to implausible results measured by violation of the trace conservation. To recover the correct physical behavior, we develop physics-informed NNs (… ▽ More Neural networks (NNs) accelerate simulations of quantum dissipative dynamics. Ensuring that these simulations adhere to fundamental physical laws is crucial, but has been largely ignored in the state-of-the-art NN approaches. We show that this may lead to implausible results measured by violation of the trace conservation. To recover the correct physical behavior, we develop physics-informed NNs (PINNs) that mitigate the violations to a good extend. Beyond that, we propose a novel uncertainty-aware approach that enforces perfect trace conservation by design, surpassing PINNs. △ Less

Submitted 5 September, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

Comments: Two figures and 1 table in main text, one table and three figures in Supporting information

arXiv:2404.11811 [pdf]

doi 10.1021/acs.jctc.4c00821

Physics-informed active learning for accelerating quantum chemical simulations

Authors: Yi-Fan Hou, Lina Zhang, Quanhao Zhang, Fuchun Ge, Pavlo O. Dral

Abstract: Quantum chemical simulations can be greatly accelerated by constructing machine learning potentials, which is often done using active learning (AL). The usefulness of the constructed potentials is often limited by the high effort required and their insufficient robustness in the simulations. Here we introduce the end-to-end AL for constructing robust data-efficient potentials with affordable inves… ▽ More Quantum chemical simulations can be greatly accelerated by constructing machine learning potentials, which is often done using active learning (AL). The usefulness of the constructed potentials is often limited by the high effort required and their insufficient robustness in the simulations. Here we introduce the end-to-end AL for constructing robust data-efficient potentials with affordable investment of time and resources and minimum human interference. Our AL protocol is based on the physics-informed sampling of training points, automatic selection of initial data, uncertainty quantification, and convergence monitoring. The versatility of this protocol is shown in our implementation of quasi-classical molecular dynamics for simulating vibrational spectra, conformer search of a key biochemical molecule, and time-resolved mechanism of the Diels-Alder reactions. These investigations took us days instead of weeks of pure quantum chemical calculations on a high-performance computing cluster. The code in MLatom and tutorials are available at https://github.com/dralgroup/mlatom. △ Less

Submitted 16 July, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.06189 [pdf]

doi 10.1021/acs.jctc.4c00468

MLatom software ecosystem for surface hopping dynamics in Python with quantum mechanical and machine learning methods

Authors: Lina Zhang, Sebastian V. Pios, Mikołaj Martyka, Fuchun Ge, Yi-Fan Hou, Yuxinxin Chen, Lipeng Chen, Joanna Jankowska, Mario Barbatti, Pavlo O. Dral

Abstract: We present an open-source MLatom@XACS software ecosystem for on-the-fly surface hopping nonadiabatic dynamics based on the Landau-Zener-Belyaev-Lebedev (LZBL) algorithm. The dynamics can be performed via Python API with a wide range of quantum mechanical (QM) and machine learning (ML) methods, including ab initio QM (CASSCF and ADC(2)), semi-empirical QM methods (e.g., AM1, PM3, OMx, and ODMx), an… ▽ More We present an open-source MLatom@XACS software ecosystem for on-the-fly surface hopping nonadiabatic dynamics based on the Landau-Zener-Belyaev-Lebedev (LZBL) algorithm. The dynamics can be performed via Python API with a wide range of quantum mechanical (QM) and machine learning (ML) methods, including ab initio QM (CASSCF and ADC(2)), semi-empirical QM methods (e.g., AM1, PM3, OMx, and ODMx), and many types of machine learning potentials (e.g., KREG, ANI, and MACE). Combinations of QM and ML methods can also be used. While the user can build their own combinations, we provide AIQM1, which is based on Δ-learning and can be used out of the box. We showcase how AIQM1 reproduces the isomerization quantum yield of trans-azobenzene at a low cost. We provide example scripts that, in a dozen lines, enable the user to obtain the final population plots by simply providing the initial geometry of a molecule. Thus, those scripts perform geometry optimization, normal mode calculations, initial condition sampling, parallel trajectories propagation, population analysis, and final result plotting. Given the capabilities of MLatom to be used for training different ML models, this ecosystem can be seamlessly integrated into the protocols building ML models for nonadiabatic dynamics. In the future, a deeper and more efficient integration of MLatom with Newton-X will enable vast range of functionalities for surface hopping dynamics, such as fewest-switches surface hopping, to facilitate similar workflows via the Python API. △ Less

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2403.11216 [pdf]

doi 10.1021/acs.jpclett.4c00746

Tell machine learning potentials what they are needed for: Simulation-oriented training exemplified for glycine

Authors: Fuchun Ge, Ran Wang, Chen Qu, Peikun Zheng, Apurba Nandi, Riccardo Conte, Paul L. Houston, Joel M. Bowman, Pavlo O. Dral

Abstract: Machine learning potentials (MLPs) are widely applied as an efficient alternative way to represent potential energy surfaces (PES) in many chemical simulations. The MLPs are often evaluated with the root-mean-square errors on the test set drawn from the same distribution as the training data. Here, we systematically investigate the relationship between such test errors and the simulation accuracy… ▽ More Machine learning potentials (MLPs) are widely applied as an efficient alternative way to represent potential energy surfaces (PES) in many chemical simulations. The MLPs are often evaluated with the root-mean-square errors on the test set drawn from the same distribution as the training data. Here, we systematically investigate the relationship between such test errors and the simulation accuracy with MLPs on an example of a full-dimensional, global PES for the glycine amino acid. Our results show that the errors in the test set do not unambiguously reflect the MLP performance in different simulation tasks such as relative conformer energies, barriers, vibrational levels, and zero-point vibrational energies. We also offer an easily accessible solution for improving the MLP quality in a simulation-oriented manner, yielding the most precise relative conformer energies and barriers. This solution also passed the stringent test by the diffusion Monte Carlo simulations. △ Less

Submitted 7 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

arXiv:2401.07399 [pdf, other]

AI-enhanced on-the-fly simulation of nonlinear time-resolved spectra

Authors: Sebastian V. Pios, Maxim F. Gelin, Arif Ullah, Pavlo O. Dral, Lipeng Chen

Abstract: Time-resolved spectroscopy is an important tool for unraveling the minute details of structural changes of molecules of biological and technological significance. The nonlinear femtosecond signals detected for such systems must be interpreted, but it is a challenging task for which theoretical simulations are often indispensable. Accurate simulations of transient-absorption or two-dimensional elec… ▽ More Time-resolved spectroscopy is an important tool for unraveling the minute details of structural changes of molecules of biological and technological significance. The nonlinear femtosecond signals detected for such systems must be interpreted, but it is a challenging task for which theoretical simulations are often indispensable. Accurate simulations of transient-absorption or two-dimensional electronic spectra are, however, computationally very expensive, prohibiting the wider adoption of existing first-principles methods. Here, we report an AI-enhanced protocol to drastically reduce the computational cost of simulating nonlinear time-resolved electronic spectra which makes such simulations affordable for polyatomic molecules of increasing size. The protocol is based on doorway-window approach for the on-the-fly surface-hopping simulations. We show its applicability for the prototypical molecule of pyrazine for which it produces spectra with high precision with respect to ab initio reference while cutting the computational cost by at least 95% compared to pure first-principles simulations. △ Less

Submitted 14 January, 2024; originally announced January 2024.

arXiv:2310.20155 [pdf]

doi 10.1021/acs.jctc.3c01203

MLatom 3: Platform for machine learning-enhanced computational chemistry simulations and workflows

Authors: Pavlo O. Dral, Fuchun Ge, Yi-Fan Hou, Peikun Zheng, Yuxinxin Chen, Mario Barbatti, Olexandr Isayev, Cheng Wang, Bao-Xin Xue, Max Pinheiro Jr, Yuming Su, Yiheng Dai, Yangtao Chen, Lina Zhang, Shuang Zhang, Arif Ullah, Quanhao Zhang, Yanchi Ou

Abstract: Machine learning (ML) is increasingly becoming a common tool in computational chemistry. At the same time, the rapid development of ML methods requires a flexible software framework for designing custom workflows. MLatom 3 is a program package designed to leverage the power of ML to enhance typical computational chemistry simulations and to create complex workflows. This open-source package provid… ▽ More Machine learning (ML) is increasingly becoming a common tool in computational chemistry. At the same time, the rapid development of ML methods requires a flexible software framework for designing custom workflows. MLatom 3 is a program package designed to leverage the power of ML to enhance typical computational chemistry simulations and to create complex workflows. This open-source package provides plenty of choice to the users who can run simulations with the command line options, input files, or with scripts using MLatom as a Python package, both on their computers and on the online XACS cloud computing at XACScloud.com. Computational chemists can calculate energies and thermochemical properties, optimize geometries, run molecular and quantum dynamics, and simulate (ro)vibrational, one-photon UV/vis absorption, and two-photon absorption spectra with ML, quantum mechanical, and combined models. The users can choose from an extensive library of methods containing pre-trained ML models and quantum mechanical approximations such as AIQM1 approaching coupled-cluster accuracy. The developers can build their own models using various ML algorithms. The great flexibility of MLatom is largely due to the extensive use of the interfaces to many state-of-the-art software packages and libraries. △ Less

Submitted 30 October, 2023; originally announced October 2023.

arXiv:2308.11311 [pdf]

doi 10.1021/acs.jpclett.3c01592

Four-Dimensional-Spacetime Atomistic Artificial Intelligence Models

Authors: Fuchun Ge, Lina Zhang, Yi-Fan Hou, Yuxinxin Chen, Arif Ullah, Pavlo O. Dral

Abstract: We demonstrate that AI can learn atomistic systems in the four-dimensional (4D) spacetime. For this, we introduce the 4D-spacetime GICnet model which for the given initial conditions - nuclear positions and velocities at time zero - can predict nuclear positions and velocities as a continuous function of time up to the distant future. Such models of molecules can be unrolled in the time dimension… ▽ More We demonstrate that AI can learn atomistic systems in the four-dimensional (4D) spacetime. For this, we introduce the 4D-spacetime GICnet model which for the given initial conditions - nuclear positions and velocities at time zero - can predict nuclear positions and velocities as a continuous function of time up to the distant future. Such models of molecules can be unrolled in the time dimension to yield long-time high-resolution molecular dynamics trajectories with high efficiency and accuracy. 4D-spacetime models can make predictions for different times in any order and do not need a stepwise evaluation of forces and integration of the equations of motions at discretized time steps, which is a major advance over the traditional, cost-inefficient molecular dynamics. These models can be used to speed up dynamics, simulate vibrational spectra, and obtain deeper insight into nuclear motions as we demonstrate for a series of organic molecules. △ Less

Submitted 22 August, 2023; originally announced August 2023.

arXiv:2308.11305 [pdf]

doi 10.1039/D3CP03515H

Energy-conserving molecular dynamics is not energy conserving

Authors: Lina Zhang, Yi-Fan Hou, Fuchun Ge, Pavlo O. Dral

Abstract: Molecular dynamics (MD) is a widely-used tool for simulating the molecular and materials properties. It is a common wisdom that molecular dynamics simulations should obey physical laws and, hence, lots of effort is put into ensuring that molecular dynamics simulations are energy conserving. The emergence of machine learning (ML) potentials for MD leads to a growing realization that monitoring cons… ▽ More Molecular dynamics (MD) is a widely-used tool for simulating the molecular and materials properties. It is a common wisdom that molecular dynamics simulations should obey physical laws and, hence, lots of effort is put into ensuring that molecular dynamics simulations are energy conserving. The emergence of machine learning (ML) potentials for MD leads to a growing realization that monitoring conservation of energy during simulations is of low utility because the dynamics is often unphysically dissociative. Other ML methods for MD are not based on a potential and provide only forces or trajectories which are reasonable but not necessarily energy-conserving. Here we propose to clearly distinguish between the simulation-energy and true-energy conservation and highlight that the simulations should focus on decreasing the degree of true-energy non-conservation. We introduce very simple, new criteria for evaluating the quality of molecular dynamics estimating the degree of true-energy non-conservation and we demonstrate their practical utility on an example of infrared spectra simulations. These criteria are more important and intuitive than simply evaluating the quality of the ML potential energies and forces as is commonly done and can be applied universally, e.g., even for trajectories with unknown or discontinuous potential energy. Such an approach introduces new standards for evaluating MD by focusing on the true-energy conservation and can help in developing more accurate methods for simulating molecular and materials properties. △ Less

Submitted 22 August, 2023; originally announced August 2023.

arXiv:2303.01264 [pdf, other]

doi 10.1016/j.cpc.2023.108940

MLQD: A package for machine learning-based quantum dissipative dynamics

Authors: Arif Ullah, Pavlo O. Dral

Abstract: Machine learning has emerged as a promising paradigm to study the quantum dissipative dynamics of open quantum systems. To facilitate the use of our recently published ML-based approaches for quantum dissipative dynamics, here we present an open-source Python package MLQD (https://github.com/Arif-PhyChem/MLQD), which currently supports the three ML-based quantum dynamics approaches: (1) the recurs… ▽ More Machine learning has emerged as a promising paradigm to study the quantum dissipative dynamics of open quantum systems. To facilitate the use of our recently published ML-based approaches for quantum dissipative dynamics, here we present an open-source Python package MLQD (https://github.com/Arif-PhyChem/MLQD), which currently supports the three ML-based quantum dynamics approaches: (1) the recursive dynamics with kernel ridge regression (KRR) method, (2) the non-recursive artificial-intelligence-based quantum dynamics (AIQD) approach and (3) the blazingly fast one-shot trajectory learning (OSTL) approach, where both AIQD and OSTL use the convolutional neural networks (CNN). This paper describes the features of the MLQD package, the technical details, optimization of hyperparameters, visualization of results, and the demonstration of the MLQD's applicability for two widely studied systems, namely the spin-boson model and the Fenna--Matthews--Olson (FMO) complex. To make MLQD more user-friendly and accessible, we have made it available on the XACS cloud computing platform (https://XACScloud.com) via the interface to the MLatom package (http://MLatom.com). △ Less

Submitted 20 September, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

arXiv:2301.12096 [pdf, other]

doi 10.3389/fphy.2023.1223973

QD3SET-1: A Database with Quantum Dissipative Dynamics Data Sets

Authors: Arif Ullah, Luis E. Herrera Rodriguez, Pavlo O. Dral, Alexei A. Kananenka

Abstract: Simulations of the dynamics of dissipative quantum systems utilize many methods such as physics-based quantum, semiclassical, and quantum-classical as well as machine learning-based approximations, development and testing of which requires diverse data sets. Here we present a new database QD3SET-1 containing eight data sets of quantum dynamical data for two systems of broad interest, spin-boson (S… ▽ More Simulations of the dynamics of dissipative quantum systems utilize many methods such as physics-based quantum, semiclassical, and quantum-classical as well as machine learning-based approximations, development and testing of which requires diverse data sets. Here we present a new database QD3SET-1 containing eight data sets of quantum dynamical data for two systems of broad interest, spin-boson (SB) model and the Fenna--Matthews--Olson (FMO) complex, generated with two different methods solving the dynamics, approximate local thermalizing Lindblad master equation (LTLME) and highly accurate hierarchy equations of motion (HEOM). One data set was generated with the SB model which is a two-level quantum system coupled to a harmonic environment using HEOM for 1,000 model parameters. Seven data sets were collected for the FMO complex of different sizes(7- and 8-site monomer and 24-site trimer with LTLME and 8-site monomer with HEOM) for 500--879 model parameters. Our QD3SET-1 database contains both population and coherence dynamics data and part of it has been already used for machine learning-based quantum dynamics studies. △ Less

Submitted 28 January, 2023; originally announced January 2023.

Comments: 14 pages, 3 figures

arXiv:2211.14392 [pdf, other]

doi 10.1063/5.0136404

Ultra-Fast Semi-Empirical Quantum Chemistry for High-Throughput Computational Campaigns with Sparrow

Authors: Francesco Bosia, Peikun Zheng, Alain Vaucher, Thomas Weymuth, Pavlo O. Dral, Markus Reiher

Abstract: Semi-empirical quantum chemical approaches are known to compromise accuracy for feasibility of calculations on huge molecules. However, the need for ultrafast calculations in interactive quantum mechanical studies, high-throughput virtual screening, and for data-driven machine learning has shifted the emphasis towards calculation runtimes recently. This comes with new constraints for the software… ▽ More Semi-empirical quantum chemical approaches are known to compromise accuracy for feasibility of calculations on huge molecules. However, the need for ultrafast calculations in interactive quantum mechanical studies, high-throughput virtual screening, and for data-driven machine learning has shifted the emphasis towards calculation runtimes recently. This comes with new constraints for the software implementation as many fast calculations would suffer from a large overhead of manual setup and other procedures that are comparatively fast when studying a single molecular structure, but which become prohibitively slow for high-throughput demands. In this work, we discuss the effect of various well-established semi-empirical approximations on calculation speed and relate this to data transfer rates from the raw-data source computer to the results visualization front end. For the former, we consider desktop computers, local high performance computing, as well as remote cloud services in order to elucidate the effect on interactive calculations, for web and cloud interfaces in local applications, and in world-wide interactive virtual sessions. The models discussed in this work have been implemented into our open-source software SCINE Sparrow. △ Less

Submitted 10 April, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

Comments: 39 pages, 4 figures, 4 tables

Journal ref: J. Chem. Phys. 158, 054118 (2023)

arXiv:2207.02417 [pdf, other]

doi 10.1088/2632-2153/ac9a9d

A comparative study of different machine learning methods for dissipative quantum dynamics

Authors: Luis E. Herrera Rodriguez, Arif Ullah, Kennet J. Rueda Espinosa, Pavlo O. Dral, Alexei A. Kananenka

Abstract: It has been recently shown that supervised machine learning (ML) algorithms can accurately and efficiently predict the long-time populations dynamics of dissipative quantum systems given only short-time population dynamics. In the present article we benchmaked 22 ML models on their ability to predict long-time dynamics of a two-level quantum system linearly coupled to harmonic bath. The models inc… ▽ More It has been recently shown that supervised machine learning (ML) algorithms can accurately and efficiently predict the long-time populations dynamics of dissipative quantum systems given only short-time population dynamics. In the present article we benchmaked 22 ML models on their ability to predict long-time dynamics of a two-level quantum system linearly coupled to harmonic bath. The models include uni- and bidirectional recurrent, convolutional, and fully-connected feed-forward artificial neural networks (ANNs) and kernel ridge regression (KRR) with linear and most commonly used nonlinear kernels. Our results suggest that KRR with nonlinear kernels can serve as inexpensive yet accurate way to simulate long-time dynamics in cases where the constant length of input trajectories is appropriate. Convolutional Gated Recurrent Unit model is found to be the most efficient ANN model. △ Less

Submitted 5 July, 2022; originally announced July 2022.

Comments: 22 pages, 6 figures

arXiv:2204.12661 [pdf, other]

doi 10.1021/acs.jpclett.2c01242

One-shot trajectory learning of open quantum systems dynamics

Authors: Arif Ullah, Pavlo O. Dral

Abstract: Nonadiabatic quantum dynamics are important for understanding light-harvesting processes, but their propagation with traditional methods can be rather expensive. Here we present a one-shot trajectory learning approach that allows to directly make ultra-fast prediction of the entire trajectory of the reduced density matrix for a new set of such simulation parameters as temperature and reorganizatio… ▽ More Nonadiabatic quantum dynamics are important for understanding light-harvesting processes, but their propagation with traditional methods can be rather expensive. Here we present a one-shot trajectory learning approach that allows to directly make ultra-fast prediction of the entire trajectory of the reduced density matrix for a new set of such simulation parameters as temperature and reorganization energy. The whole 10ps long propagation takes 70 milliseconds as we demonstrate on the comparatively large quantum system, the Fenna-Matthews-Olsen (FMO) complex. Our approach also significantly reduces time and memory requirements for training. △ Less

Submitted 8 June, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

Journal ref: J. Phys. Chem. Lett. 13 (2022) 6037-6041

arXiv:1808.05806 [pdf, other]

doi 10.1063/1.4989536

Structure-based Sampling and Self-correcting Machine Learning for Accurate Calculations of Potential Energy Surfaces and Vibrational Levels

Authors: Pavlo O. Dral, Alec Owens, Sergei N. Yurchenko, Walter Thiel

Abstract: We present an efficient approach for generating highly accurate molecular potential energy surfaces (PESs) using self-correcting, kernel ridge regression (KRR) based machine learning (ML). We introduce structure-based sampling to automatically assign nuclear configurations from a pre-defined grid to the training and prediction sets, respectively. Accurate high-level \textit{ab initio} energies are… ▽ More We present an efficient approach for generating highly accurate molecular potential energy surfaces (PESs) using self-correcting, kernel ridge regression (KRR) based machine learning (ML). We introduce structure-based sampling to automatically assign nuclear configurations from a pre-defined grid to the training and prediction sets, respectively. Accurate high-level \textit{ab initio} energies are required only for the points in the training set, while the energies for the remaining points are provided by the ML model with negligible computational cost. The proposed sampling procedure is shown to be superior to random sampling and also eliminates the need for training several ML models. Self-correcting machine learning has been implemented such that each additional layer corrects errors from the previous layer. The performance of our approach is demonstrated in a case study on a published high-level \textit{ab initio} PES of methyl chloride with 44,819 points. The ML model is trained on sets of different size and then used to predict the energies for tens of thousands of nuclear configurations within seconds. The resulting datasets are utilized in variational calculations of the vibrational energy levels of CH$_3$Cl. By using both structure-based sampling and self-correction, the size of the training set can be kept small (e.g. 10\% of the points) without any significant loss of accuracy. In \textit{ab initio} rovibrational spectroscopy, it is thus possible to reduce the number of computationally costly electronic structure calculations through structure-based sampling and self-correcting KKR-based machine learning by up to 90\%. △ Less

Submitted 17 August, 2018; originally announced August 2018.

Journal ref: J. Chem. Phys. 146, 244108 (2017)

arXiv:1503.04987 [pdf, other]

doi 10.1021/acs.jctc.5b00099

Big Data meets Quantum Chemistry Approximations: The $Δ$-Machine Learning Approach

Authors: Raghunathan Ramakrishnan, Pavlo O. Dral, Matthias Rupp, O. Anatole von Lilienfeld

Abstract: Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron… ▽ More Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger molecular sets than used for training. For thermochemical properties of up to 16k constitutional isomers of C$_7$H$_{10}$O$_2$ we present numerical evidence that chemical accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qualitative relationship between molecular entropy and electron correlation. The transferability of our approach is demonstrated, using semi-empirical quantum chemistry and machine learning models trained on 1 and 10\% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy. △ Less

Submitted 17 March, 2015; originally announced March 2015.

Showing 1–21 of 21 results for author: Dral, P O