-
Implicit Delta Learning of High Fidelity Neural Network Potentials
Authors:
Stephan Thaler,
Cristian Gabellini,
Nikhil Shenoy,
Prudencio Tossou
Abstract:
Neural network potentials (NNPs) offer a fast and accurate alternative to ab-initio methods for molecular dynamics (MD) simulations but are hindered by the high cost of training data from high-fidelity Quantum Mechanics (QM) methods. Our work introduces the Implicit Delta Learning (IDLe) method, which reduces the need for high-fidelity QM data by leveraging cheaper semi-empirical QM computations w…
▽ More
Neural network potentials (NNPs) offer a fast and accurate alternative to ab-initio methods for molecular dynamics (MD) simulations but are hindered by the high cost of training data from high-fidelity Quantum Mechanics (QM) methods. Our work introduces the Implicit Delta Learning (IDLe) method, which reduces the need for high-fidelity QM data by leveraging cheaper semi-empirical QM computations without compromising NNP accuracy or inference cost. IDLe employs an end-to-end multi-task architecture with fidelity-specific heads that decode energies based on a shared latent representation of the input atomistic system. In various settings, IDLe achieves the same accuracy as single high-fidelity baselines while using up to 50x less high-fidelity data. This result could significantly reduce data generation cost and consequently enhance accuracy and generalization, and expand chemical coverage for NNPs, advancing MD simulations for material science and drug discovery. Additionally, we provide a novel set of 11 million semi-empirical QM calculations to support future multi-fidelity NNP modeling.
△ Less
Submitted 8 December, 2024;
originally announced December 2024.
-
OpenQDC: Open Quantum Data Commons
Authors:
Cristian Gabellini,
Nikhil Shenoy,
Stephan Thaler,
Semih Canturk,
Daniel McNeela,
Dominique Beaini,
Michael Bronstein,
Prudencio Tossou
Abstract:
Machine Learning Interatomic Potentials (MLIPs) are a highly promising alternative to force-fields for molecular dynamics (MD) simulations, offering precise and rapid energy and force calculations. However, Quantum-Mechanical (QM) datasets, crucial for MLIPs, are fragmented across various repositories, hindering accessibility and model development. We introduce the openQDC package, consolidating 3…
▽ More
Machine Learning Interatomic Potentials (MLIPs) are a highly promising alternative to force-fields for molecular dynamics (MD) simulations, offering precise and rapid energy and force calculations. However, Quantum-Mechanical (QM) datasets, crucial for MLIPs, are fragmented across various repositories, hindering accessibility and model development. We introduce the openQDC package, consolidating 37 QM datasets from over 250 quantum methods and 400 million geometries into a single, accessible resource. These datasets are meticulously preprocessed, and standardized for MLIP training, covering a wide range of chemical elements and interactions relevant in organic chemistry. OpenQDC includes tools for normalization and integration, easily accessible via Python. Experiments with well-known architectures like SchNet, TorchMD-Net, and DimeNet reveal challenges for those architectures and constitute a leaderboard to accelerate benchmarking and guide novel algorithms development. Continuously adding datasets to OpenQDC will democratize QM dataset access, foster more collaboration and innovation, enhance MLIP development, and support their adoption in the MD field.
△ Less
Submitted 29 November, 2024;
originally announced November 2024.
-
Role of Structural and Conformational Diversity for Machine Learning Potentials
Authors:
Nikhil Shenoy,
Prudencio Tossou,
Emmanuel Noutahi,
Hadrien Mary,
Dominique Beaini,
Jiarui Ding
Abstract:
In the field of Machine Learning Interatomic Potentials (MLIPs), understanding the intricate relationship between data biases, specifically conformational and structural diversity, and model generalization is critical in improving the quality of Quantum Mechanics (QM) data generation efforts. We investigate these dynamics through two distinct experiments: a fixed budget one, where the dataset size…
▽ More
In the field of Machine Learning Interatomic Potentials (MLIPs), understanding the intricate relationship between data biases, specifically conformational and structural diversity, and model generalization is critical in improving the quality of Quantum Mechanics (QM) data generation efforts. We investigate these dynamics through two distinct experiments: a fixed budget one, where the dataset size remains constant, and a fixed molecular set one, which focuses on fixed structural diversity while varying conformational diversity. Our results reveal nuanced patterns in generalization metrics. Notably, for optimal structural and conformational generalization, a careful balance between structural and conformational diversity is required, but existing QM datasets do not meet that trade-off. Additionally, our results highlight the limitation of the MLIP models at generalizing beyond their training distribution, emphasizing the importance of defining applicability domain during model deployment. These findings provide valuable insights and guidelines for QM data generation efforts.
△ Less
Submitted 30 October, 2023;
originally announced November 2023.
-
The scalar, vector, and tensor modes in gravitational wave turbulence simulations
Authors:
Axel Brandenburg,
Grigol Gogoberidze,
Tina Kahniashvili,
Sayan Mandal,
Alberto Roper Pol,
Nakul Shenoy
Abstract:
We study the gravitational wave (GW) signal sourced by primordial turbulence that is assumed to be present at cosmological phase transitions like the electroweak and quantum chromodynamics phase transitions. We consider various models of primordial turbulence, such as those with and without helicity, purely hydrodynamical turbulence induced by fluid motions, and magnetohydrodynamic turbulence whos…
▽ More
We study the gravitational wave (GW) signal sourced by primordial turbulence that is assumed to be present at cosmological phase transitions like the electroweak and quantum chromodynamics phase transitions. We consider various models of primordial turbulence, such as those with and without helicity, purely hydrodynamical turbulence induced by fluid motions, and magnetohydrodynamic turbulence whose energy can be dominated either by kinetic or magnetic energy, depending on the nature of the turbulence. We also study circularly polarized GWs generated by parity violating sources such as helical turbulence. Our ultimate goal is to determine the efficiency of GW production through different classes of turbulence. We find that the GW energy and strain tend to be large for acoustic or irrotational turbulence, even though its tensor mode amplitude is relatively small at most wave numbers. Only at very small wave numbers is the spectral tensor mode significant, which might explain the efficient GW production in that case.
△ Less
Submitted 24 June, 2021; v1 submitted 1 March, 2021;
originally announced March 2021.
-
Prediction Of Arrival Of Nodes In A Scale Free Network
Authors:
S. M. Vijay Mahantesh,
Sudarshan Iyengar,
M. Vijesh,
Shruthi Nayak,
Nikitha Shenoy
Abstract:
Most of the networks observed in real life obey power-law degree distribution. It is hypothesized that the emergence of such a degree distribution is due to preferential attachment of the nodes. Barabasi-Albert model is a generative procedure that uses preferential attachment based on degree and one can use this model to generate networks with power-law degree distribution. In this model, the netw…
▽ More
Most of the networks observed in real life obey power-law degree distribution. It is hypothesized that the emergence of such a degree distribution is due to preferential attachment of the nodes. Barabasi-Albert model is a generative procedure that uses preferential attachment based on degree and one can use this model to generate networks with power-law degree distribution. In this model, the network is assumed to grow one node every time step. After the evolution of such a network, it is impossible for one to predict the exact order of node arrivals. We present in this article, a novel strategy to partially predict the order of node arrivals in such an evolved network. We show that our proposed method outperforms other centrality measure based approaches. We bin the nodes and predict the order of node arrivals between the bins with an accuracy of above 80%.
△ Less
Submitted 24 November, 2011; v1 submitted 21 November, 2011;
originally announced November 2011.