-
FlashMD: long-stride, universal prediction of molecular dynamics
Authors:
Filippo Bigi,
Sanggyu Chong,
Agustinus Kristiadi,
Michele Ceriotti
Abstract:
Molecular dynamics (MD) provides insights into atomic-scale processes by integrating over time the equations that describe the motion of atoms under the action of interatomic forces. Machine learning models have substantially accelerated MD by providing inexpensive predictions of the forces, but they remain constrained to minuscule time integration steps, which are required by the fast time scale…
▽ More
Molecular dynamics (MD) provides insights into atomic-scale processes by integrating over time the equations that describe the motion of atoms under the action of interatomic forces. Machine learning models have substantially accelerated MD by providing inexpensive predictions of the forces, but they remain constrained to minuscule time integration steps, which are required by the fast time scale of atomic motion. In this work, we propose FlashMD, a method to predict the evolution of positions and momenta over strides that are between one and two orders of magnitude longer than typical MD time steps. We incorporate considerations on the mathematical and physical properties of Hamiltonian dynamics in the architecture, generalize the approach to allow the simulation of any thermodynamic ensemble, and carefully assess the possible failure modes of such a long-stride MD approach. We validate FlashMD's accuracy in reproducing equilibrium and time-dependent properties, using both system-specific and general-purpose models, extending the ability of MD simulation to reach the long time scales needed to model microscopic processes of high scientific and technological relevance.
△ Less
Submitted 25 May, 2025;
originally announced May 2025.
-
Representing spherical tensors with scalar-based machine-learning models
Authors:
Michelangelo Domina,
Filippo Bigi,
Paolo Pegolo,
Michele Ceriotti
Abstract:
Rotational symmetry plays a central role in physics, providing an elegant framework to describe how the properties of 3D objects -- from atoms to the macroscopic scale -- transform under the action of rigid rotations. Equivariant models of 3D point clouds are able to approximate structure-property relations in a way that is fully consistent with the structure of the rotation group, by combining in…
▽ More
Rotational symmetry plays a central role in physics, providing an elegant framework to describe how the properties of 3D objects -- from atoms to the macroscopic scale -- transform under the action of rigid rotations. Equivariant models of 3D point clouds are able to approximate structure-property relations in a way that is fully consistent with the structure of the rotation group, by combining intermediate representations that are themselves spherical tensors. The symmetry constraints however make this approach computationally demanding and cumbersome to implement, which motivates increasingly popular unconstrained architectures that learn approximate symmetries as part of the training process. In this work, we explore a third route to tackle this learning problem, where equivariant functions are expressed as the product of a scalar function of the point cloud coordinates and a small basis of tensors with the appropriate symmetry. We also propose approximations of the general expressions that, while lacking universal approximation properties, are fast, simple to implement, and accurate in practical settings.
△ Less
Submitted 8 May, 2025;
originally announced May 2025.
-
Roadmap on Advancements of the FHI-aims Software Package
Authors:
Joseph W. Abbott,
Carlos Mera Acosta,
Alaa Akkoush,
Alberto Ambrosetti,
Viktor Atalla,
Alexej Bagrets,
Jörg Behler,
Daniel Berger,
Björn Bieniek,
Jonas Björk,
Volker Blum,
Saeed Bohloul,
Connor L. Box,
Nicholas Boyer,
Danilo Simoes Brambila,
Gabriel A. Bramley,
Kyle R. Bryenton,
María Camarasa-Gómez,
Christian Carbogno,
Fabio Caruso,
Sucismita Chutia,
Michele Ceriotti,
Gábor Csányi,
William Dawson,
Francisco A. Delesma
, et al. (175 additional authors not shown)
Abstract:
Electronic-structure theory is the foundation of the description of materials including multiscale modeling of their properties and functions. Obviously, without sufficient accuracy at the base, reliable predictions are unlikely at any level that follows. The software package FHI-aims has proven to be a game changer for accurate free-energy calculations because of its scalability, numerical precis…
▽ More
Electronic-structure theory is the foundation of the description of materials including multiscale modeling of their properties and functions. Obviously, without sufficient accuracy at the base, reliable predictions are unlikely at any level that follows. The software package FHI-aims has proven to be a game changer for accurate free-energy calculations because of its scalability, numerical precision, and its efficient handling of density functional theory (DFT) with hybrid functionals and van der Waals interactions. It treats molecules, clusters, and extended systems (solids and liquids) on an equal footing. Besides DFT, FHI-aims also includes quantum-chemistry methods, descriptions for excited states and vibrations, and calculations of various types of transport. Recent advancements address the integration of FHI-aims into an increasing number of workflows and various artificial intelligence (AI) methods. This Roadmap describes the state-of-the-art of FHI-aims and advancements that are currently ongoing or planned.
△ Less
Submitted 30 April, 2025;
originally announced May 2025.
-
Reconstructions and Dynamics of $β$-Lithium Thiophosphate Surfaces
Authors:
Hanna Türk,
Davide Tisi,
Michele Ceriotti
Abstract:
Lithium thiophosphate (LPS) is a promising solid electrolyte for next-generation lithium-ion batteries due to its superior energy storage, high ionic conductivity, and low-flammability components. Despite its potential, the high reactivity of LPS with common contaminants such as atmospheric water, preparation solvents, and electrode materials poses significant challenges for commercialization. The…
▽ More
Lithium thiophosphate (LPS) is a promising solid electrolyte for next-generation lithium-ion batteries due to its superior energy storage, high ionic conductivity, and low-flammability components. Despite its potential, the high reactivity of LPS with common contaminants such as atmospheric water, preparation solvents, and electrode materials poses significant challenges for commercialization. The lack of understanding regarding the structure, morphology, and chemical behavior of LPS's surface slows down the search for solutions to these issues. Here, we utilize a machine learning interatomic potential to achieve a fundamental, atomistic understanding of the mechanical and chemical properties of the $β$-Li$_3$PS$_4$ surfaces. Employing molecular dynamics simulations, we identify relevant surface complexions formed by surface reconstructions, determine their surface energies and compute the Wulff shape of $β$-LPS. The most stable complexions exhibit properties distinctly different from the bulk, including amorphization, increased density, decreased conductivity and large deformation of the structure building blocks. We demonstrate that these surfaces are not static, but undergo significant dynamical activity which is clearly identified by an analysis featuring a time-averaged structural descriptor. Finally, we examine the changes of the electronic structure induced by the surface complexions, which provides us with details on changes in surface reactivity and active sites, underlining the importance to investigate surface complexions under realistic conditions.
△ Less
Submitted 15 April, 2025;
originally announced April 2025.
-
Exploring the design space of machine-learning models for quantum chemistry with a fully differentiable framework
Authors:
Divya Suman,
Jigyasa Nigam,
Sandra Saade,
Paolo Pegolo,
Hanna Tuerk,
Xing Zhang,
Garnet Kin-Lic Chan,
Michele Ceriotti
Abstract:
Traditional atomistic machine learning (ML) models serve as surrogates for quantum mechanical (QM) properties, predicting quantities such as dipole moments and polarizabilities, directly from compositions and geometries of atomic configurations. With the emergence of ML approaches to predict the "ingredients" of a QM calculation, such as the ground state charge density or the effective single-part…
▽ More
Traditional atomistic machine learning (ML) models serve as surrogates for quantum mechanical (QM) properties, predicting quantities such as dipole moments and polarizabilities, directly from compositions and geometries of atomic configurations. With the emergence of ML approaches to predict the "ingredients" of a QM calculation, such as the ground state charge density or the effective single-particle Hamiltonian, it has become possible to obtain multiple properties through analytical physics-based operations on these intermediate ML predictions. We present a framework to seamlessly integrate the prediction of an effective electronic Hamiltonian, for both molecular and condensed-phase systems, with PySCFAD, a differentiable QM workflow that facilitates its indirect training against functions of the Hamiltonian, such as electronic energy levels, dipole moments, polarizability, etc. We then use this framework to explore various possible choices within the design space of hybrid ML/QM models, examining the influence of incorporating multiple targets on model performance and learning a reduced-basis ML Hamiltonian that can reproduce targets computed from a much larger basis. Our benchmarks evaluate the accuracy and transferability of these hybrid models, compare them against predictions of atomic properties from their surrogate models, and provide indications to guide the design of the interface between the ML and QM components of the model.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
PET-MAD, a universal interatomic potential for advanced materials modeling
Authors:
Arslan Mazitov,
Filippo Bigi,
Matthias Kellner,
Paolo Pegolo,
Davide Tisi,
Guillaume Fraux,
Sergey Pozdnyakov,
Philip Loche,
Michele Ceriotti
Abstract:
Machine-learning interatomic potentials (MLIPs) have greatly extended the reach of atomic-scale simulations, offering the accuracy of first-principles calculations at a fraction of the effort. Leveraging large quantum mechanical databases and expressive architectures, recent "universal" models deliver qualitative accuracy across the periodic table but are often biased toward low-energy configurati…
▽ More
Machine-learning interatomic potentials (MLIPs) have greatly extended the reach of atomic-scale simulations, offering the accuracy of first-principles calculations at a fraction of the effort. Leveraging large quantum mechanical databases and expressive architectures, recent "universal" models deliver qualitative accuracy across the periodic table but are often biased toward low-energy configurations. We introduce PET-MAD, a generally applicable MLIP trained on a dataset combining stable inorganic and organic solids, systematically modified to enhance atomic diversity. Using a moderate but highly-consistent level of electronic-structure theory, we assess PET-MAD's accuracy on established benchmarks and advanced simulations of six materials. PET-MAD rivals state-of-the-art MLIPs for inorganic solids, while also being reliable for molecules, organic materials, and surfaces. It is stable and fast, enabling, out-of-the-box, the near-quantitative study of thermal and quantum mechanical fluctuations, functional properties, and phase transitions. It can be efficiently fine-tuned to deliver full quantum mechanical accuracy with a minimal number of targeted calculations.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
The dark side of the forces: assessing non-conservative force models for atomistic machine learning
Authors:
Filippo Bigi,
Marcel Langer,
Michele Ceriotti
Abstract:
The use of machine learning to estimate the energy of a group of atoms, and the forces that drive them to more stable configurations, has revolutionized the fields of computational chemistry and materials discovery. In this domain, rigorous enforcement of symmetry and conservation laws has traditionally been considered essential. For this reason, interatomic forces are usually computed as the deri…
▽ More
The use of machine learning to estimate the energy of a group of atoms, and the forces that drive them to more stable configurations, has revolutionized the fields of computational chemistry and materials discovery. In this domain, rigorous enforcement of symmetry and conservation laws has traditionally been considered essential. For this reason, interatomic forces are usually computed as the derivatives of the potential energy, ensuring energy conservation. Several recent works have questioned this physically constrained approach, suggesting that directly predicting the forces yields a better trade-off between accuracy and computational efficiency -- and that energy conservation can be learned during training. This work investigates the applicability of such non-conservative models in microscopic simulations. We identify and demonstrate several fundamental issues, from ill-defined convergence of geometry optimization to instability in various types of molecular dynamics. Contrary to the case of rotational symmetry, energy conservation is hard to learn, monitor, and correct for. The best approach to exploit the acceleration afforded by direct force prediction might be to use it in tandem with a conservative model, reducing -- rather than eliminating -- the additional cost of backpropagation, but avoiding the pathological behavior associated with non-conservative forces.
△ Less
Submitted 1 June, 2025; v1 submitted 16 December, 2024;
originally announced December 2024.
-
PLUMED Tutorials: a collaborative, community-driven learning ecosystem
Authors:
Gareth A. Tribello,
Massimiliano Bonomi,
Giovanni Bussi,
Carlo Camilloni,
Blake I. Armstrong,
Andrea Arsiccio,
Simone Aureli,
Federico Ballabio,
Mattia Bernetti,
Luigi Bonati,
Samuel G. H. Brookes,
Z. Faidon Brotzakis,
Riccardo Capelli,
Michele Ceriotti,
Kam-Tung Chan,
Pilar Cossio,
Siva Dasetty,
Davide Donadio,
Bernd Ensing,
Andrew L. Ferguson,
Guillaume Fraux,
Julian D. Gale,
Francesco Luigi Gervasio,
Toni Giorgino,
Nicholas S. M. Herringer
, et al. (38 additional authors not shown)
Abstract:
In computational physics, chemistry, and biology, the implementation of new techniques in a shared and open source software lowers barriers to entry and promotes rapid scientific progress. However, effectively training new software users presents several challenges. Common methods like direct knowledge transfer and in-person workshops are limited in reach and comprehensiveness. Furthermore, while…
▽ More
In computational physics, chemistry, and biology, the implementation of new techniques in a shared and open source software lowers barriers to entry and promotes rapid scientific progress. However, effectively training new software users presents several challenges. Common methods like direct knowledge transfer and in-person workshops are limited in reach and comprehensiveness. Furthermore, while the COVID-19 pandemic highlighted the benefits of online training, traditional online tutorials can quickly become outdated and may not cover all the software's functionalities. To address these issues, here we introduce ``PLUMED Tutorials'', a collaborative model for developing, sharing, and updating online tutorials. This initiative utilizes repository management and continuous integration to ensure compatibility with software updates. Moreover, the tutorials are interconnected to form a structured learning path and are enriched with automatic annotations to provide broader context. This paper illustrates the development, features, and advantages of PLUMED Tutorials, aiming to foster an open community for creating and sharing educational resources.
△ Less
Submitted 29 November, 2024;
originally announced December 2024.
-
Fast and flexible long-range models for atomistic machine learning
Authors:
Philip Loche,
Kevin K. Huguenin-Dumittan,
Melika Honarmand,
Qianjun Xu,
Egor Rumiantsev,
Wei Bin How,
Marcel F. Langer,
Michele Ceriotti
Abstract:
Most atomistic machine learning (ML) models rely on a locality ansatz, and decompose the energy into a sum of short-ranged, atom-centered contributions. This leads to clear limitations when trying to describe problems that are dominated by long-range physical effects - most notably electrostatics. Many approaches have been proposed to overcome these limitations, but efforts to make them efficient…
▽ More
Most atomistic machine learning (ML) models rely on a locality ansatz, and decompose the energy into a sum of short-ranged, atom-centered contributions. This leads to clear limitations when trying to describe problems that are dominated by long-range physical effects - most notably electrostatics. Many approaches have been proposed to overcome these limitations, but efforts to make them efficient and widely available are hampered by the need to incorporate an ad hoc implementation of methods to treat long-range interactions. We develop a framework aiming to bring some of the established algorithms to evaluate non-bonded interactions - including Ewald summation, classical particle-mesh Ewald (PME), and particle-particle/particle-mesh (P3M) Ewald - into atomistic ML. We provide a reference implementation for pyTorch as well as an experimental one for JAX. Beyond Coulomb and more general long-range potentials, we introduce purified descriptors which disregard the immediate neighborhood of each atom, and are more suitable for general long-ranged ML applications. Our implementations are fast, feature-rich, and modular: They provide an accurate evaluation of physical long-range forces that can be used in the construction of (semi)empirical baseline potentials; they exploit the availability of automatic differentiation to seamlessly combine long-range models with conventional, local ML schemes; and they are sufficiently flexible to implement more complex architectures that use physical interactions as building blocks. We benchmark and demonstrate our torch-pme and jax-pme libraries to perform molecular dynamics simulations, to train ML potentials, and to evaluate long-range equivariant descriptors of atomic structures.
△ Less
Submitted 24 March, 2025; v1 submitted 4 December, 2024;
originally announced December 2024.
-
Prediction rigidities for data-driven chemistry
Authors:
Sanggyu Chong,
Filippo Bigi,
Federico Grasselli,
Philip Loche,
Matthias Kellner,
Michele Ceriotti
Abstract:
The widespread application of machine learning (ML) to the chemical sciences is making it very important to understand how the ML models learn to correlate chemical structures with their properties, and what can be done to improve the training efficiency whilst guaranteeing interpretability and transferability. In this work, we demonstrate the wide utility of prediction rigidities, a family of met…
▽ More
The widespread application of machine learning (ML) to the chemical sciences is making it very important to understand how the ML models learn to correlate chemical structures with their properties, and what can be done to improve the training efficiency whilst guaranteeing interpretability and transferability. In this work, we demonstrate the wide utility of prediction rigidities, a family of metrics derived from the loss function, in understanding the robustness of ML model predictions. We show that the prediction rigidities allow the assessment of the model not only at the global level, but also on the local or the component-wise level at which the intermediate (e.g. atomic, body-ordered, or range-separated) predictions are made. We leverage these metrics to understand the learning behavior of different ML models, and to guide efficient dataset construction for model training. We finally implement the formalism for a ML model targeting a coarse-grained system to demonstrate the applicability of the prediction rigidities to an even broader class of atomistic modeling problems.
△ Less
Submitted 26 August, 2024;
originally announced August 2024.
-
Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Authors:
Beatriz Borges,
Negar Foroutan,
Deniz Bayazit,
Anna Sotnikova,
Syrielle Montariol,
Tanya Nazaretzky,
Mohammadreza Banaei,
Alireza Sakhaeirad,
Philippe Servant,
Seyed Parsa Neshaei,
Jibril Frej,
Angelika Romanou,
Gail Weiss,
Sepideh Mamooler,
Zeming Chen,
Simin Fan,
Silin Gao,
Mete Ismayilzada,
Debjit Paul,
Alexandre Schöpfer,
Andrej Janchevski,
Anja Tiede,
Clarence Linden,
Emanuele Troiani,
Francesco Salvi
, et al. (65 additional authors not shown)
Abstract:
AI assistants are being increasingly used by students enrolled in higher education institutions. While these tools provide opportunities for improved teaching and education, they also pose significant challenges for assessment and learning outcomes. We conceptualize these challenges through the lens of vulnerability, the potential for university assessments and learning outcomes to be impacted by…
▽ More
AI assistants are being increasingly used by students enrolled in higher education institutions. While these tools provide opportunities for improved teaching and education, they also pose significant challenges for assessment and learning outcomes. We conceptualize these challenges through the lens of vulnerability, the potential for university assessments and learning outcomes to be impacted by student use of generative AI. We investigate the potential scale of this vulnerability by measuring the degree to which AI assistants can complete assessment questions in standard university-level STEM courses. Specifically, we compile a novel dataset of textual assessment questions from 50 courses at EPFL and evaluate whether two AI assistants, GPT-3.5 and GPT-4 can adequately answer these questions. We use eight prompting strategies to produce responses and find that GPT-4 answers an average of 65.8% of questions correctly, and can even produce the correct answer across at least one prompting strategy for 85.1% of questions. When grouping courses in our dataset by degree program, these systems already pass non-project assessments of large numbers of core courses in various degree programs, posing risks to higher education accreditation that will be amplified as these models improve. Our results call for revising program-level assessment design in higher education in light of advances in generative AI.
△ Less
Submitted 27 November, 2024; v1 submitted 7 August, 2024;
originally announced August 2024.
-
Adaptive energy reference for machine-learning models of the electronic density of states
Authors:
Wei Bin How,
Sanggyu Chong,
Federico Grasselli,
Kevin K. Huguenin-Dumittan,
Michele Ceriotti
Abstract:
The electronic density of states (DOS) provides information regarding the distribution of electronic energy levels in a material, and can be used to approximate its optical and electronic properties and therefore guide computational material design. Given its usefulness and relative simplicity, it has been one of the first electronic properties used as target for machine-learning approaches going…
▽ More
The electronic density of states (DOS) provides information regarding the distribution of electronic energy levels in a material, and can be used to approximate its optical and electronic properties and therefore guide computational material design. Given its usefulness and relative simplicity, it has been one of the first electronic properties used as target for machine-learning approaches going beyond interatomic potentials. A subtle but important point, well-appreciated in the condensed matter community but usually overlooked in the construction of data-driven models, is that for bulk configurations the absolute energy reference of single-particle energy levels is ill-defined. Only energy differences matter, and quantities derived from the DOS are typically independent on the absolute alignment. We introduce an adaptive scheme that optimizes the energy reference of each structure as part of the training process, and show that it consistently improves the quality of ML models compared to traditional choices of energy reference, for different classes of materials and different model architectures. On a practical level, we trace the improved performance to the ability of this self-aligning scheme to match the most prominent features in the DOS. More broadly, we believe that this work highlights the importance of incorporating insights into the nature of the physical target into the definition of the architecture and of the appropriate figures of merit for machine-learning models, that translate in better transferability and overall performance.
△ Less
Submitted 24 January, 2025; v1 submitted 1 July, 2024;
originally announced July 2024.
-
Probing the effects of broken symmetries in machine learning
Authors:
Marcel F. Langer,
Sergey N. Pozdnyakov,
Michele Ceriotti
Abstract:
Symmetry is one of the most central concepts in physics, and it is no surprise that it has also been widely adopted as an inductive bias for machine-learning models applied to the physical sciences. This is especially true for models targeting the properties of matter at the atomic scale. Both established and state-of-the-art approaches, with almost no exceptions, are built to be exactly equivaria…
▽ More
Symmetry is one of the most central concepts in physics, and it is no surprise that it has also been widely adopted as an inductive bias for machine-learning models applied to the physical sciences. This is especially true for models targeting the properties of matter at the atomic scale. Both established and state-of-the-art approaches, with almost no exceptions, are built to be exactly equivariant to translations, permutations, and rotations of the atoms. Incorporating symmetries -- rotations in particular -- constrains the model design space and implies more complicated architectures that are often also computationally demanding. There are indications that non-symmetric models can easily learn symmetries from data, and that doing so can even be beneficial for the accuracy of the model. We put a model that obeys rotational invariance only approximately to the test, in realistic scenarios involving simulations of gas-phase, liquid, and solid water. We focus specifically on physical observables that are likely to be affected -- directly or indirectly -- by symmetry breaking, finding negligible consequences when the model is used in an interpolative, bulk, regime. Even for extrapolative gas-phase predictions, the model remains very stable, even though symmetry artifacts are noticeable. We also discuss strategies that can be used to systematically reduce the magnitude of symmetry breaking when it occurs, and assess their impact on the convergence of observables.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
i-PI 3.0: a flexible and efficient framework for advanced atomistic simulations
Authors:
Yair Litman,
Venkat Kapil,
Yotam M. Y. Feldman,
Davide Tisi,
Tomislav Begušić,
Karen Fidanyan,
Guillaume Fraux,
Jacob Higer,
Matthias Kellner,
Tao E. Li,
Eszter S. Pós,
Elia Stocco,
George Trenins,
Barak Hirshberg,
Mariana Rossi,
Michele Ceriotti
Abstract:
Atomic-scale simulations have progressed tremendously over the past decade, largely due to the availability of machine-learning interatomic potentials. These potentials combine the accuracy of electronic structure calculations with the ability to reach extensive length and time scales. The i-PI package facilitates integrating the latest developments in this field with advanced modeling techniques,…
▽ More
Atomic-scale simulations have progressed tremendously over the past decade, largely due to the availability of machine-learning interatomic potentials. These potentials combine the accuracy of electronic structure calculations with the ability to reach extensive length and time scales. The i-PI package facilitates integrating the latest developments in this field with advanced modeling techniques, thanks to a modular software architecture based on inter-process communication through a socket interface. The choice of Python for implementation facilitates rapid prototyping but can add computational overhead. In this new release, we carefully benchmarked and optimized i-PI for several common simulation scenarios, making such overhead negligible when i-PI is used to model systems up to tens of thousands of atoms using widely adopted machine learning interatomic potentials, such as Behler-Parinello, DeePMD and MACE neural networks. We also present the implementation of several new features, including an efficient algorithm to model bosonic and fermionic exchange, a framework for uncertainty quantification to be used in conjunction with machine-learning potentials, a communication infrastructure that allows deeper integration with electronic-driven simulations, and an approach to simulate coupled photon-nuclear dynamics in optical or plasmonic cavities.
△ Less
Submitted 10 July, 2024; v1 submitted 24 May, 2024;
originally announced May 2024.
-
A prediction rigidity formalism for low-cost uncertainties in trained neural networks
Authors:
Filippo Bigi,
Sanggyu Chong,
Michele Ceriotti,
Federico Grasselli
Abstract:
Regression methods are fundamental for scientific and technological applications. However, fitted models can be highly unreliable outside of their training domain, and hence the quantification of their uncertainty is crucial in many of their applications. Based on the solution of a constrained optimization problem, we propose "prediction rigidities" as a method to obtain uncertainties of arbitrary…
▽ More
Regression methods are fundamental for scientific and technological applications. However, fitted models can be highly unreliable outside of their training domain, and hence the quantification of their uncertainty is crucial in many of their applications. Based on the solution of a constrained optimization problem, we propose "prediction rigidities" as a method to obtain uncertainties of arbitrary pre-trained regressors. We establish a strong connection between our framework and Bayesian inference, and we develop a last-layer approximation that allows the new method to be applied to neural networks. This extension affords cheap uncertainties without any modification to the neural network itself or its training procedure. We show the effectiveness of our method on a wide range of regression tasks, ranging from simple toy models to applications in chemistry and meteorology.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Uncertainty quantification by direct propagation of shallow ensembles
Authors:
Matthias Kellner,
Michele Ceriotti
Abstract:
Statistical learning algorithms provide a generally-applicable framework to sidestep time-consuming experiments, or accurate physics-based modeling, but they introduce a further source of error on top of the intrinsic limitations of the experimental or theoretical setup. Uncertainty estimation is essential to quantify this error, and make application of data-centric approaches more trustworthy. To…
▽ More
Statistical learning algorithms provide a generally-applicable framework to sidestep time-consuming experiments, or accurate physics-based modeling, but they introduce a further source of error on top of the intrinsic limitations of the experimental or theoretical setup. Uncertainty estimation is essential to quantify this error, and make application of data-centric approaches more trustworthy. To ensure that uncertainty quantification is used widely, one should aim for algorithms that are reasonably accurate, but also easy to implement and apply. In particular, including uncertainty quantification on top of an existing architecture should be straightforward, and add minimal computational overhead. Furthermore, it should be easy to manipulate or combine multiple machine-learning predictions, propagating uncertainty over further modeling steps. We compare several well-established uncertainty quantification frameworks against these requirements, and propose a practical approach, which we dub direct propagation of shallow ensembles, that provides a good compromise between ease of use and accuracy. We present benchmarks for generic datasets, and an in-depth study of applications to the field of atomistic machine learning for chemistry and materials. These examples underscore the importance of using a formulation that allows propagating errors without making strong assumptions on the correlations between different predictions of the model.
△ Less
Submitted 16 May, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Thermal conductivity of Li$_3$PS$_4$ solid electrolytes with ab initio accuracy
Authors:
Davide Tisi,
Federico Grasselli,
Lorenzo Gigli,
Michele Ceriotti
Abstract:
The vast amount of computational studies on electrical conduction in solid-state electrolytes is not mirrored by comparable efforts addressing thermal conduction, which has been scarcely investigated despite its relevance to thermal management and (over)heating of batteries. The reason for this lies in the complexity of the calculations: on one hand, the diffusion of ionic charge carriers makes la…
▽ More
The vast amount of computational studies on electrical conduction in solid-state electrolytes is not mirrored by comparable efforts addressing thermal conduction, which has been scarcely investigated despite its relevance to thermal management and (over)heating of batteries. The reason for this lies in the complexity of the calculations: on one hand, the diffusion of ionic charge carriers makes lattice methods formally unsuitable, due to the lack of equilibrium atomic positions needed for normal-mode expansion. On the other hand, the prohibitive cost of large-scale molecular dynamics (MD) simulations of heat transport in large systems at ab initio levels has hindered the use of MD-based methods. In this paper, we leverage recently developed machine-learning potentials targeting different ab initio functionals (PBEsol, r$^2$SCAN, PBE0) and a state-of-the-art formulation of the Green-Kubo theory of heat transport in multicomponent systems to compute the thermal conductivity of a promising solid-state electrolyte, Li$_3$PS$_4$, in all its polymorphs ($α$, $β$, and $γ$). By comparing MD estimates with lattice methods on the low-temperature, nondiffusive $γ$-Li$_3$PS$_4$, we highlight strong anharmonicities and negligible nuclear quantum effects, hence further justifying MD-based methods even for nondiffusive phases. Finally, for the ion-conducting $α$ and $β$ phases, where the multicomponent Green-Kubo MD approach is mandatory, our simulations indicate a weak temperature dependence of the thermal conductivity, a glass-like behavior due to the effective local disorder characterizing these Li-diffusing phases.
△ Less
Submitted 17 June, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
-
Natural Aging and Vacancy Trapping in Al-6xxx
Authors:
Abhinav C. P. Jain,
M. Ceriotti,
W. A. Curtin
Abstract:
Undesirable natural aging (NA) in Al-6xxx delays subsequent artificial aging (AA) but the size, composition, and evolution of clustering are challenging to measure. Here, atomistic details of early-stage clustering in Al-1\%Mg-0.6\%Si during NA are studied computationally using a chemically-accurate neural-network potential. Feasible growth paths for the preferred $β''$ precipitates identify: domi…
▽ More
Undesirable natural aging (NA) in Al-6xxx delays subsequent artificial aging (AA) but the size, composition, and evolution of clustering are challenging to measure. Here, atomistic details of early-stage clustering in Al-1\%Mg-0.6\%Si during NA are studied computationally using a chemically-accurate neural-network potential. Feasible growth paths for the preferred $β''$ precipitates identify: dominant clusters differing from $β''$ motifs; spontaneous vacancy-interstitial formation creating 14-18 solute atom $β''$-like motifs; and lower-energy clusters requiring chemical re-arrangement to form $β''$ nuclei. Quasi-on-lattice kinetic Monte Carlo simulations reveal that 8-14 solute atom clusters form within 1000 s but that growth slows considerably due to vacancy trapping inside clusters, with trapping energies of 0.3-0.5 eV. These findings rationalize why cluster growth and alloy hardness saturate during NA, confirm the concept of ''vacancy prisons", and suggest why clusters must be dissolved during AA before formation of $β''$. This atomistic understanding of NA may enable design of strategies to mitigate negative effects of NA.
△ Less
Submitted 26 November, 2023;
originally announced November 2023.
-
Electronic excited states from physically-constrained machine learning
Authors:
Edoardo Cignoni,
Divya Suman,
Jigyasa Nigam,
Lorenzo Cupellini,
Benedetta Mennucci,
Michele Ceriotti
Abstract:
Data-driven techniques are increasingly used to replace electronic-structure calculations of matter. In this context, a relevant question is whether machine learning (ML) should be applied directly to predict the desired properties or be combined explicitly with physically-grounded operations. We present an example of an integrated modeling approach, in which a symmetry-adapted ML model of an effe…
▽ More
Data-driven techniques are increasingly used to replace electronic-structure calculations of matter. In this context, a relevant question is whether machine learning (ML) should be applied directly to predict the desired properties or be combined explicitly with physically-grounded operations. We present an example of an integrated modeling approach, in which a symmetry-adapted ML model of an effective Hamiltonian is trained to reproduce electronic excitations from a quantum-mechanical calculation. The resulting model can make predictions for molecules that are much larger and more complex than those that it is trained on, and allows for dramatic computational savings by indirectly targeting the outputs of well-converged calculations while using a parameterization corresponding to a minimal atom-centered basis. These results emphasize the merits of intertwining data-driven techniques with physical approximations, improving the transferability and interpretability of ML models without affecting their accuracy and computational efficiency, and providing a blueprint for developing ML-augmented electronic-structure methods.
△ Less
Submitted 7 November, 2023; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Mechanism of charge transport in lithium thiophosphate
Authors:
Lorenzo Gigli,
Davide Tisi,
Federico Grasselli,
Michele Ceriotti
Abstract:
Lithium ortho-thiophosphate (Li$_3$PS$_4$) has emerged as a promising candidate for solid-state-electrolyte batteries, thanks to its highly conductive phases, cheap components, and large electrochemical stability range. Nonetheless, the microscopic mechanisms of Li-ion transport in Li$_3$PS$_4$ are far to be fully understood, the role of PS$_4$ dynamics in charge transport still being controversia…
▽ More
Lithium ortho-thiophosphate (Li$_3$PS$_4$) has emerged as a promising candidate for solid-state-electrolyte batteries, thanks to its highly conductive phases, cheap components, and large electrochemical stability range. Nonetheless, the microscopic mechanisms of Li-ion transport in Li$_3$PS$_4$ are far to be fully understood, the role of PS$_4$ dynamics in charge transport still being controversial. In this work, we build machine learning potentials targeting state-of-the-art DFT references (PBEsol, r$^2$SCAN, and PBE0) to tackle this problem in all known phases of Li$_3$PS$_4$ ($α$, $β$ and $γ$), for large system sizes and timescales. We discuss the physical origin of the observed superionic behavior of Li$_3$PS$_4$: the activation of PS$_4$ flipping drives a structural transition to a highly conductive phase, characterized by an increase of Li-site availability and by a drastic reduction in the activation energy of Li-ion diffusion. We also rule out any paddle-wheel effects of PS$_4$ tetrahedra in the superionic phases -- previously claimed to enhance Li-ion diffusion -- due to the orders-of-magnitude difference between the rate of PS$_4$ flips and Li-ion hops at all temperatures below melting. We finally elucidate the role of inter-ionic dynamical correlations in charge transport, by highlighting the failure of the Nernst-Einstein approximation to estimate the electrical conductivity. Our results show a strong dependence on the target DFT reference, with PBE0 yielding the best quantitative agreement with experimental measurements not only for the electronic band-gap but also for the electrical conductivity of $β$- and $α$-Li$_3$PS$_4$.
△ Less
Submitted 10 January, 2024; v1 submitted 24 October, 2023;
originally announced October 2023.
-
Modeling the ferroelectric phase transition in barium titanate with DFT accuracy and converged sampling
Authors:
Lorenzo Gigli,
Alexander Goscinski,
Michele Ceriotti,
Gareth A. Tribello
Abstract:
The accurate description of the structural and thermodynamic properties of ferroelectrics has been one of the most remarkable achievements of Density Functional Theory (DFT). However, running large simulation cells with DFT is computationally demanding, while simulations of small cells are often plagued with non-physical effects that are a consequence of the system's finite size. To avoid these fi…
▽ More
The accurate description of the structural and thermodynamic properties of ferroelectrics has been one of the most remarkable achievements of Density Functional Theory (DFT). However, running large simulation cells with DFT is computationally demanding, while simulations of small cells are often plagued with non-physical effects that are a consequence of the system's finite size. To avoid these finite-size effects one is thus often forced to use empirical models that describe the physics of the material in terms of effective interaction terms, that are fitted using the results from DFT. In this study we use a machine-learning (ML) potential trained on DFT, in combination with accelerated sampling techniques, to converge the thermodynamic properties of Barium Titanate (BTO) with first-principles accuracy and a full atomistic description. Our results indicate that the predicted Curie temperature depends strongly on the choice of DFT functional and system size, because of emergent long-range directional correlations in the local dipole fluctuations. Our findings demonstrate how the combination of ML models and traditional bottom-up modeling allow one to investigate emergent phenomena with the accuracy of first-principles calculations over the large size and time scales afforded by empirical models.
△ Less
Submitted 13 June, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Surface segregation in high-entropy alloys from alchemical machine learning
Authors:
Arslan Mazitov,
Maximilian A. Springer,
Nataliya Lopanitsyna,
Guillaume Fraux,
Sandip De,
Michele Ceriotti
Abstract:
High-entropy alloys (HEAs), containing several metallic elements in near-equimolar proportions, have long been of interest for their unique mechanical properties. More recently, they have emerged as a promising platform for the development of novel heterogeneous catalysts, because of the large design space, and the synergistic effects between their components. In this work we use a machine-learnin…
▽ More
High-entropy alloys (HEAs), containing several metallic elements in near-equimolar proportions, have long been of interest for their unique mechanical properties. More recently, they have emerged as a promising platform for the development of novel heterogeneous catalysts, because of the large design space, and the synergistic effects between their components. In this work we use a machine-learning potential that can model simultaneously up to 25 transition metals to study the tendency of different elements to segregate at the surface of a HEA. We use as a starting point a potential that was previously developed using exclusively crystalline bulk phases, and show that, thanks to the physically-inspired functional form of the model, adding a much smaller number of defective configurations makes it capable of describing surface phenomena. We then present several computational studies of surface segregation, including both a simulation of a 25-element alloy, that provides a rough estimate of the relative surface propensity of the various elements, and targeted studies of CoCrFeMnNi and IrFeCoNiCu, which provide further validation of the model, and insights to guide the modeling and design of alloys for heterogeneous catalysis.
△ Less
Submitted 11 January, 2024; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Physics-inspired Equivariant Descriptors of Non-bonded Interactions
Authors:
Kevin K. Huguenin-Dumittan,
Philip Loche,
Ni Haoran,
Michele Ceriotti
Abstract:
One essential ingredient in many machine learning (ML) based methods for atomistic modeling of materials and molecules is the use of locality. While allowing better system-size scaling, this systematically neglects long-range (LR) effects, such as electrostatics or dispersion interaction. We present an extension of the long distance equivariant (LODE) framework that can handle diverse LR interacti…
▽ More
One essential ingredient in many machine learning (ML) based methods for atomistic modeling of materials and molecules is the use of locality. While allowing better system-size scaling, this systematically neglects long-range (LR) effects, such as electrostatics or dispersion interaction. We present an extension of the long distance equivariant (LODE) framework that can handle diverse LR interactions in a consistent way, and seamlessly integrates with preexisting methods by building new sets of atom centered features. We provide a direct physical interpretation of these using the multipole expansion, which allows for simpler and more efficient implementations. The framework is applied to simple toy systems as proof of concept, and a heterogeneous set of molecular dimers to push the method to its limits. By generalizing LODE to arbitrary asymptotic behaviors, we provide a coherent approach to treat arbitrary two- and many-body non-bonded interactions in the data-driven modeling of matter.
△ Less
Submitted 3 October, 2023; v1 submitted 25 August, 2023;
originally announced August 2023.
-
Robustness of Local Predictions in Atomistic Machine Learning Models
Authors:
Sanggyu Chong,
Federico Grasselli,
Chiheb Ben Mahmoud,
Joe D. Morrow,
Volker L. Deringer,
Michele Ceriotti
Abstract:
Machine learning (ML) models for molecules and materials commonly rely on a decomposition of the global target quantity into local, atom-centered contributions. This approach is convenient from a computational perspective, enabling large-scale ML-driven simulations with a linear-scaling cost, and also allow for the identification and post-hoc interpretation of contributions from individual chemica…
▽ More
Machine learning (ML) models for molecules and materials commonly rely on a decomposition of the global target quantity into local, atom-centered contributions. This approach is convenient from a computational perspective, enabling large-scale ML-driven simulations with a linear-scaling cost, and also allow for the identification and post-hoc interpretation of contributions from individual chemical environments and motifs to complicated macroscopic properties. However, even though there exist practical justifications for these decompositions, only the global quantity is rigorously defined, and thus it is unclear to what extent the atomistic terms predicted by the model can be trusted. Here, we introduce a quantitative metric, which we call the local prediction rigidity (LPR), that allows one to assess how robust the locally decomposed predictions of ML models are. We investigate the dependence of LPR on the aspects of model training, particularly the composition of training dataset, for a range of different problems from simple toy models to real chemical systems. We present strategies to systematically enhance the LPR, which can be used to improve the robustness, interpretability, and transferability of atomistic ML models.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
Probing the unfolded configurations of a $β$-hairpin using sketch-map
Authors:
Albert Ardevol,
Gareth A. Tribello,
Michele Ceriotti,
Michele Parrinello
Abstract:
This work examines the conformational ensemble involved in $β$-hairpin folding by means of advanced molecular dynamics simulations and dimensionality reduction. A fully atomistic description of the protein and the surrounding solvent molecules is used and this complex energy landscape is sampled by means of parallel tempering metadynamics simulations. The ensemble of configurations explored is ana…
▽ More
This work examines the conformational ensemble involved in $β$-hairpin folding by means of advanced molecular dynamics simulations and dimensionality reduction. A fully atomistic description of the protein and the surrounding solvent molecules is used and this complex energy landscape is sampled by means of parallel tempering metadynamics simulations. The ensemble of configurations explored is analysed using the recently proposed sketch-map algorithm. Further simulations allow us to probe how mutations affect the structures adopted by this protein. We find that many of the configurations adopted by a mutant are the same as those adopted by the wild type protein. Furthermore, certain mutations destabilize secondary structure containing configurations by preventing the formation of hydrogen bonds or by promoting the formation of new intramolecular contacts. Our analysis demonstrates that machine-learning techniques can be used to study the energy landscapes of complex molecules and that the visualizations that are generated in this way provide a natural basis for examining how the stabilities of particular configurations of the molecule are affected by factors such as temperature or structural mutations.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
Smooth, exact rotational symmetrization for deep learning on point clouds
Authors:
Sergey N. Pozdnyakov,
Michele Ceriotti
Abstract:
Point clouds are versatile representations of 3D objects and have found widespread application in science and engineering. Many successful deep-learning models have been proposed that use them as input. The domain of chemical and materials modeling is especially challenging because exact compliance with physical constraints is highly desirable for a model to be usable in practice. These constraint…
▽ More
Point clouds are versatile representations of 3D objects and have found widespread application in science and engineering. Many successful deep-learning models have been proposed that use them as input. The domain of chemical and materials modeling is especially challenging because exact compliance with physical constraints is highly desirable for a model to be usable in practice. These constraints include smoothness and invariance with respect to translations, rotations, and permutations of identical atoms. If these requirements are not rigorously fulfilled, atomistic simulations might lead to absurd outcomes even if the model has excellent accuracy. Consequently, dedicated architectures, which achieve invariance by restricting their design space, have been developed. General-purpose point-cloud models are more varied but often disregard rotational symmetry. We propose a general symmetrization method that adds rotational equivariance to any given model while preserving all the other requirements. Our approach simplifies the development of better atomic-scale machine-learning schemes by relaxing the constraints on the design space and making it possible to incorporate ideas that proved effective in other domains. We demonstrate this idea by introducing the Point Edge Transformer (PET) architecture, which is not intrinsically equivariant but achieves state-of-the-art performance on several benchmark datasets of molecules and solids. A-posteriori application of our general protocol makes PET exactly equivariant, with minimal changes to its accuracy.
△ Less
Submitted 6 February, 2024; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Wigner kernels: body-ordered equivariant machine learning without a basis
Authors:
Filippo Bigi,
Sergey N. Pozdnyakov,
Michele Ceriotti
Abstract:
Machine-learning models based on a point-cloud representation of a physical object are ubiquitous in scientific applications and particularly well-suited to the atomic-scale description of molecules and materials. Among the many different approaches that have been pursued, the description of local atomic environments in terms of their neighbor densities has been used widely and very succesfully. W…
▽ More
Machine-learning models based on a point-cloud representation of a physical object are ubiquitous in scientific applications and particularly well-suited to the atomic-scale description of molecules and materials. Among the many different approaches that have been pursued, the description of local atomic environments in terms of their neighbor densities has been used widely and very succesfully. We propose a novel density-based method which involves computing ``Wigner kernels''. These are fully equivariant and body-ordered kernels that can be computed iteratively with a cost that is independent of the radial-chemical basis and grows only linearly with the maximum body-order considered. This is in marked contrast to feature-space models, which comprise an exponentially-growing number of terms with increasing order of correlations. We present several examples of the accuracy of models based on Wigner kernels in chemical applications, for both scalar and tensorial targets, reaching state-of-the-art accuracy on the popular QM9 benchmark dataset, and we discuss the broader relevance of these ideas to equivariant geometric machine-learning.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Completeness of Atomic Structure Representations
Authors:
Jigyasa Nigam,
Sergey N. Pozdnyakov,
Kevin K. Huguenin-Dumittan,
Michele Ceriotti
Abstract:
In this paper, we address the challenge of obtaining a comprehensive and symmetric representation of point particle groups, such as atoms in a molecule, which is crucial in physics and theoretical chemistry. The problem has become even more important with the widespread adoption of machine-learning techniques in science, as it underpins the capacity of models to accurately reproduce physical relat…
▽ More
In this paper, we address the challenge of obtaining a comprehensive and symmetric representation of point particle groups, such as atoms in a molecule, which is crucial in physics and theoretical chemistry. The problem has become even more important with the widespread adoption of machine-learning techniques in science, as it underpins the capacity of models to accurately reproduce physical relationships while being consistent with fundamental symmetries and conservation laws. However, some of the descriptors that are commonly used to represent point clouds -- most notably those based on discretized correlations of the neighbor density, that underpin most of the existing ML models of matter at the atomic scale -- are unable to distinguish between special arrangements of particles in three dimensions. This makes it impossible to machine learn their properties. Atom-density correlations are provably complete in the limit in which they simultaneously describe the mutual relationship between all atoms, which is impractical. We present a novel approach to construct descriptors of \emph{finite} correlations based on the relative arrangement of particle triplets, which can be employed to create symmetry-adapted models with universal approximation capabilities, which have the resolution of the neighbor discretization as the sole convergence parameter. Our strategy is demonstrated on a class of atomic arrangements that are specifically built to defy a broad class of conventional symmetric descriptors, showcasing its potential for addressing their limitations.
△ Less
Submitted 30 December, 2023; v1 submitted 28 February, 2023;
originally announced February 2023.
-
Fast evaluation of spherical harmonics with sphericart
Authors:
Filippo Bigi,
Guillaume Fraux,
Nicholas J. Browning,
Michele Ceriotti
Abstract:
Spherical harmonics provide a smooth, orthogonal, and symmetry-adapted basis to expand functions on a sphere, and they are used routinely in physical and theoretical chemistry as well as in different fields of science and technology, from geology and atmospheric sciences to signal processing and computer graphics. More recently, they have become a key component of rotationally equivariant models i…
▽ More
Spherical harmonics provide a smooth, orthogonal, and symmetry-adapted basis to expand functions on a sphere, and they are used routinely in physical and theoretical chemistry as well as in different fields of science and technology, from geology and atmospheric sciences to signal processing and computer graphics. More recently, they have become a key component of rotationally equivariant models in geometric machine learning, including applications to atomic-scale modeling of molecules and materials. We present an elegant and efficient algorithm for the evaluation of the real-valued spherical harmonics. Our construction features many of the desirable properties of existing schemes and allows to compute Cartesian derivatives in a numerically stable and computationally efficient manner. To facilitate usage, we implement this algorithm in sphericart, a fast C++ library which also provides C bindings, a Python API, and a PyTorch implementation that includes a GPU kernel.
△ Less
Submitted 30 April, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Solar Sail Propulsion by 2050: An Enabling Capability for Heliophysics Missions
Authors:
Les Johnson,
Nathan Barnes,
Matteo Ceriotti,
Thomas Y. Chen,
Artur Davoyan,
Louis Friedman,
Darren Garber,
Roman Kezerashvili,
Ken Kobayashi,
Greg Matloff,
Colin McInnes,
Pat Mulligan,
Grover Swartzlander,
Slava G. Turyshev
Abstract:
Solar sails enable missions to observe the solar environment from unique vantage points, such as sustained observations away from the Sun-Earth line; sub-L1 station keeping; high inclination solar orbits; Earth polar-sitting and polar-viewing observatories; fast transit missions to study heliosphere to interstellar medium transition, as well as missions of interest across a broad user community. R…
▽ More
Solar sails enable missions to observe the solar environment from unique vantage points, such as sustained observations away from the Sun-Earth line; sub-L1 station keeping; high inclination solar orbits; Earth polar-sitting and polar-viewing observatories; fast transit missions to study heliosphere to interstellar medium transition, as well as missions of interest across a broad user community. Recent and planned demonstration missions make this technology ready for use on near-term science missions.
△ Less
Submitted 2 January, 2023;
originally announced January 2023.
-
Modeling high-entropy transition-metal alloys with alchemical compression
Authors:
Nataliya Lopanitsyna,
Guillaume Fraux,
Maximilian A. Springer,
Sandip De,
Michele Ceriotti
Abstract:
Alloys composed of several elements in roughly equimolar composition, often referred to as high-entropy alloys, have long been of interest for their thermodynamics and peculiar mechanical properties, and more recently for their potential application in catalysis. They are a considerable challenge to traditional atomistic modeling, and also to data-driven potentials that for the most part have memo…
▽ More
Alloys composed of several elements in roughly equimolar composition, often referred to as high-entropy alloys, have long been of interest for their thermodynamics and peculiar mechanical properties, and more recently for their potential application in catalysis. They are a considerable challenge to traditional atomistic modeling, and also to data-driven potentials that for the most part have memory footprint, computational effort and data requirements which scale poorly with the number of elements included. We apply a recently proposed scheme to compress chemical information in a lower-dimensional space, which reduces dramatically the cost of the model with negligible loss of accuracy, to build a potential that can describe 25 d-block transition metals. The model shows semi-quantitative accuracy for prototypical alloys, and is remarkably stable when extrapolating to structures outside its training set. We use this framework to study element segregation in a computational experiment that simulates an equimolar alloy of all 25 elements, mimicking the seminal experiments by Cantor et al., and use our observations on the short-range order relations between the elements to define a data-driven set of Hume-Rothery rules that can serve as guidance for alloy design. We conclude with a study of three prototypical alloys, CoCrFeMnNi, CoCrFeMoNi and IrPdPtRhRu, determining their stability and the short-range order behavior of their constituents.
△ Less
Submitted 7 April, 2023; v1 submitted 26 December, 2022;
originally announced December 2022.
-
A data-driven interpretation of the stability of molecular crystals
Authors:
Rose K. Cersonsky,
Maria Pakhnova,
Edgar A. Engel,
Michele Ceriotti
Abstract:
Due to the subtle balance of intermolecular interactions that govern structure-property relations, predicting the stability of crystal structures formed from molecular building blocks is a highly non-trivial scientific problem. A particularly active and fruitful approach involves classifying the different combinations of interacting chemical moieties, as understanding the relative energetics of di…
▽ More
Due to the subtle balance of intermolecular interactions that govern structure-property relations, predicting the stability of crystal structures formed from molecular building blocks is a highly non-trivial scientific problem. A particularly active and fruitful approach involves classifying the different combinations of interacting chemical moieties, as understanding the relative energetics of different interactions enables the design of molecular crystals and fine-tuning their stabilities. While this is usually performed based on the empirical observation of the most commonly encountered motifs in known crystal structures, we propose to apply a combination of supervised and unsupervised machine-learning techniques to automate the construction of an extensive library of molecular building blocks. We introduce a structural descriptor tailored to the prediction of the binding (lattice) energy and apply it to a curated dataset of organic crystals and exploit its atom-centered nature to obtain a data-driven assessment of the contribution of different chemical groups to the lattice energy of the crystal. We then interpret this library using a low-dimensional representation of the structure-energy landscape and discuss selected examples of the insights into crystal engineering that can be extracted from this analysis, providing a complete database to guide the design of molecular materials.
△ Less
Submitted 22 December, 2022; v1 submitted 21 September, 2022;
originally announced September 2022.
-
A smooth basis for atomistic machine learning
Authors:
Filippo Bigi,
Kevin Huguenin-Dumittan,
Michele Ceriotti,
David E. Manolopoulos
Abstract:
Machine learning frameworks based on correlations of interatomic positions begin with a discretized description of the density of other atoms in the neighbourhood of each atom in the system. Symmetry considerations support the use of spherical harmonics to expand the angular dependence of this density, but there is as yet no clear rationale to choose one radial basis over another. Here we investig…
▽ More
Machine learning frameworks based on correlations of interatomic positions begin with a discretized description of the density of other atoms in the neighbourhood of each atom in the system. Symmetry considerations support the use of spherical harmonics to expand the angular dependence of this density, but there is as yet no clear rationale to choose one radial basis over another. Here we investigate the basis that results from the solution of the Laplacian eigenvalue problem within a sphere around the atom of interest. We show that this generates the smoothest possible basis of a given size within the sphere, and that a tensor product of Laplacian eigenstates also provides the smoothest possible basis for expanding any higher-order correlation of the atomic density within the appropriate hypersphere. We consider several unsupervised metrics of the quality of a basis for a given dataset, and show that the Laplacian eigenstate basis has a performance that is much better than some widely used basis sets and is competitive with data-driven bases that numerically optimize each metric. In supervised machine learning tests, we find that the optimal function smoothness of the Laplacian eigenstates leads to comparable or better performance than can be obtained from a data-driven basis of a similar size that has been optimized to describe the atom-density correlation for the specific dataset. We conclude that the smoothness of the basis functions is a key and hitherto largely overlooked aspect of successful atomic density representations.
△ Less
Submitted 23 May, 2023; v1 submitted 5 September, 2022;
originally announced September 2022.
-
Beyond potentials: integrated machine-learning models for materials
Authors:
Michele Ceriotti
Abstract:
Over the past decade inter-atomic potentials based on machine-learning (ML) techniques have become an indispensable tool in the atomic-scale modeling of materials. Trained on energies and forces obtained from electronic-structure calculations, they inherit their predictive accuracy, and extend greatly the length and time scales that are accessible to explicit atomistic simulations. Inexpensive pre…
▽ More
Over the past decade inter-atomic potentials based on machine-learning (ML) techniques have become an indispensable tool in the atomic-scale modeling of materials. Trained on energies and forces obtained from electronic-structure calculations, they inherit their predictive accuracy, and extend greatly the length and time scales that are accessible to explicit atomistic simulations. Inexpensive predictions of the energetics of individual configurations have facilitated greatly the calculation of the thermodynamics of materials, including finite-temperature effects and disorder. More recently, machine-learning models have been closing the gap with first-principles calculations in another area: the prediction of arbitrarily complicated functional properties, from vibrational and optical spectroscopies to electronic excitations. The implementation of integrated machine-learning models, that combine energetic and functional predictions with statistical and dynamical sampling of atomic-scale properties is bringing the promise of predictive, uncompromising simulations of existing and novel materials closer to its full realisation.
△ Less
Submitted 12 August, 2022;
originally announced August 2022.
-
Electronic-structure properties from atom-centered predictions of the electron density
Authors:
Andrea Grisafi,
Alan M. Lewis,
Mariana Rossi,
Michele Ceriotti
Abstract:
The electron density of a molecule or material has recently received major attention as a target quantity of machine-learning models. A natural choice to construct a model that yields transferable and linear-scaling predictions is to represent the scalar field using a multi-centered atomic basis analogous to that routinely used in density fitting approximations. However, the non-orthogonality of t…
▽ More
The electron density of a molecule or material has recently received major attention as a target quantity of machine-learning models. A natural choice to construct a model that yields transferable and linear-scaling predictions is to represent the scalar field using a multi-centered atomic basis analogous to that routinely used in density fitting approximations. However, the non-orthogonality of the basis poses challenges for the learning exercise, as it requires accounting for all the atomic density components at once. We devise a gradient-based approach to directly minimize the loss function of the regression problem in an optimized and highly sparse feature space. In so doing, we overcome the limitations associated with adopting an atom-centered model to learn the electron density over arbitrarily complex datasets, obtaining extremely accurate predictions. The enhanced framework is tested on 32-molecule periodic cells of liquid water, presenting enough complexity to require an optimal balance between accuracy and computational efficiency. We show that starting from the predicted density a single Kohn-Sham diagonalization step can be performed to access total energy components that carry an error of just 0.1 meV/atom with respect to the reference density functional calculations. Finally, we test our method on the highly heterogeneous QM9 benchmark dataset, showing that a small fraction of the training data is enough to derive ground-state total energies within chemical accuracy.
△ Less
Submitted 28 June, 2022;
originally announced June 2022.
-
Predicting hot-electron free energies from ground-state data
Authors:
Chiheb Ben Mahmoud,
Federico Grasselli,
Michele Ceriotti
Abstract:
Machine-learning potentials are usually trained on the ground-state, Born-Oppenheimer energy surface, which depends exclusively on the atomic positions and not on the simulation temperature. This disregards the effect of thermally-excited electrons, that is important in metals, and essential to the description of warm dense matter. An accurate physical description of these effects requires that th…
▽ More
Machine-learning potentials are usually trained on the ground-state, Born-Oppenheimer energy surface, which depends exclusively on the atomic positions and not on the simulation temperature. This disregards the effect of thermally-excited electrons, that is important in metals, and essential to the description of warm dense matter. An accurate physical description of these effects requires that the nuclei move on a temperature-dependent electronic free energy. We propose a method to obtain machine-learning predictions of this free energy at an arbitrary electron temperature using exclusively training data from ground-state calculations, avoiding the need to train temperature-dependent potentials, and benchmark it on metallic liquid hydrogen at the conditions of the core of gas giants and brown dwarfs. This work demonstrates the advantages of hybrid schemes that use physical consideration to combine machine-learning predictions, providing a blueprint for the development of similar approaches that extend the reach of atomistic modelling by removing the barrier between physics and data-driven methodologies.
△ Less
Submitted 28 September, 2022; v1 submitted 11 May, 2022;
originally announced May 2022.
-
Comment on "Manifolds of quasi-constant SOAP and ACSF fingerprints and the resulting failure to machine learn four body interactions"
Authors:
Sergey N. Pozdnyakov,
Michael J. Willatt,
Albert P. Bartók,
Christoph Ortner,
Gábor Csányi,
Michele Ceriotti
Abstract:
The "quasi-constant" SOAP and ACSF fingerprint manifolds recently discovered by Parsaeifard and Goedecker are a direct consequence of the presence of degenerate pairs of configurations, a known shortcoming of all low-body-order atom-density correlation representations of molecular structures. Contrary to the configurations that are rigorously singular -- that we demonstrate can only occur in finit…
▽ More
The "quasi-constant" SOAP and ACSF fingerprint manifolds recently discovered by Parsaeifard and Goedecker are a direct consequence of the presence of degenerate pairs of configurations, a known shortcoming of all low-body-order atom-density correlation representations of molecular structures. Contrary to the configurations that are rigorously singular -- that we demonstrate can only occur in finite, discrete sets -- the continuous "quasi-constant" manifolds exhibit low, but non-zero, sensitivity to atomic displacements. Thus, it is possible to build interpolative machine-learning models of high-order interactions along the manifold, even though the numerical instabilities associated with proximity to the exact singularities affect the accuracy and transferability of such models, to an extent that depends on numerical details of the implementation.
△ Less
Submitted 2 May, 2022;
originally announced May 2022.
-
Unified theory of atom-centered representations and message-passing machine-learning schemes
Authors:
Jigyasa Nigam,
Sergey Pozdnyakov,
Guillaume Fraux,
Michele Ceriotti
Abstract:
Data-driven schemes that associate molecular and crystal structures with their microscopic properties share the need for a concise, effective description of the arrangement of their atomic constituents. Many types of models rely on descriptions of atom-centered environments, that are associated with an atomic property or with an atomic contribution to an extensive macroscopic quantity. Frameworks…
▽ More
Data-driven schemes that associate molecular and crystal structures with their microscopic properties share the need for a concise, effective description of the arrangement of their atomic constituents. Many types of models rely on descriptions of atom-centered environments, that are associated with an atomic property or with an atomic contribution to an extensive macroscopic quantity. Frameworks in this class can be understood in terms of atom-centered density correlations (ACDC), that are used as a basis for a body-ordered, symmetry-adapted expansion of the targets. Several other schemes, that gather information on the relationship between neighboring atoms using "message-passing" ideas, cannot be directly mapped to correlations centered around a single atom. We generalize the ACDC framework to include multi-centered information, generating representations that provide a complete linear basis to regress symmetric functions of atomic coordinates, and provides a coherent foundation to systematize our understanding of both atom-centered and message-passing, invariant and equivariant machine-learning schemes.
△ Less
Submitted 1 April, 2022; v1 submitted 3 February, 2022;
originally announced February 2022.
-
Incompleteness of graph neural networks for points clouds in three dimensions
Authors:
Sergey N. Pozdnyakov,
Michele Ceriotti
Abstract:
Graph neural networks (GNN) are very popular methods in machine learning and have been applied very successfully to the prediction of the properties of molecules and materials. First-order GNNs are well known to be incomplete, i.e., there exist graphs that are distinct but appear identical when seen through the lens of the GNN. More complicated schemes have thus been designed to increase their res…
▽ More
Graph neural networks (GNN) are very popular methods in machine learning and have been applied very successfully to the prediction of the properties of molecules and materials. First-order GNNs are well known to be incomplete, i.e., there exist graphs that are distinct but appear identical when seen through the lens of the GNN. More complicated schemes have thus been designed to increase their resolving power. Applications to molecules (and more generally, point clouds), however, add a geometric dimension to the problem. The most straightforward and prevalent approach to construct graph representation for molecules regards atoms as vertices in a graph and draws a bond between each pair of atoms within a chosen cutoff. Bonds can be decorated with the distance between atoms, and the resulting "distance graph NNs" (dGNN) have empirically demonstrated excellent resolving power and are widely used in chemical ML, with all known indistinguishable configurations being resolved in the fully-connected limit, which is equivalent to infinite or sufficiently large cutoff. Here we present a counterexample that proves that dGNNs are not complete even for the restricted case of fully-connected graphs induced by 3D atom clouds. We construct pairs of distinct point clouds whose associated graphs are, for any cutoff radius, equivalent based on a first-order Weisfeiler-Lehman test. This class of degenerate structures includes chemically-plausible configurations, both for isolated structures and for infinite structures that are periodic in 1, 2, and 3 dimensions. The existence of indistinguishable configurations sets an ultimate limit to the expressive power of some of the well-established GNN architectures for atomistic machine learning. Models that explicitly use angular or directional information in the description of atomic environments can resolve this class of degeneracies.
△ Less
Submitted 7 November, 2022; v1 submitted 18 January, 2022;
originally announced January 2022.
-
Thermodynamics and dielectric response of $\text{BaTiO}_3$ by data-driven modeling
Authors:
Lorenzo Gigli,
Max Veit,
Michele Kotiuga,
Giovanni Pizzi,
Nicola Marzari,
Michele Ceriotti
Abstract:
Modeling ferroelectric materials from first principles is one of the successes of density-functional theory, and the driver of much development effort, requiring an accurate description of the electronic processes and the thermodynamic equilibrium that drive the spontaneous symmetry breaking and the emergence of macroscopic polarization. We demonstrate the development and application of an integra…
▽ More
Modeling ferroelectric materials from first principles is one of the successes of density-functional theory, and the driver of much development effort, requiring an accurate description of the electronic processes and the thermodynamic equilibrium that drive the spontaneous symmetry breaking and the emergence of macroscopic polarization. We demonstrate the development and application of an integrated machine learning model that describes on the same footing structural, energetic and functional properties of barium titanate ($\text{BaTiO}_3$), a prototypical ferroelectric. The model uses ab initio calculations as reference and achieves accurate yet inexpensive predictions of energy and polarization on time and length scales that are not accessible to direct ab initio modeling. These predictions allow us to assess the microscopic mechanism of the ferroelectric transition. The presence of an order-disorder transition for the Ti off-centered states is the main driver of the ferroelectric transition, even though the coupling between symmetry breaking and cell distortions determines the presence of intermediate, partly-ordered phases. Moreover, we thoroughly probe the static and dynamical behavior of $\text{BaTiO}_3$ across its phase diagram, without the need to introduce a coarse-grained description of the ferroelectric transition. Finally, we apply the polarization model to calculate dielectric response properties of the material in a fully ab initio manner, again reproducing the correct qualitative experimental behaviour.
△ Less
Submitted 12 September, 2022; v1 submitted 9 November, 2021;
originally announced November 2021.
-
Ranking the Synthesizability of Hypothetical Zeolites with the Sorting Hat
Authors:
Benjamin A. Helfrecht,
Giovanni Pireddu,
Rocio Semino,
Scott M. Auerbach,
Michele Ceriotti
Abstract:
Zeolites are nanoporous alumino-silicate frameworks widely used as catalysts and adsorbents. Even though millions of distinct siliceous networks can be generated by computer-aided searches, no new hypothetical framework has yet been synthesized. The needle-in-a-haystack problem of finding promising candidates among large databases of predicted structures has intrigued materials scientists for deca…
▽ More
Zeolites are nanoporous alumino-silicate frameworks widely used as catalysts and adsorbents. Even though millions of distinct siliceous networks can be generated by computer-aided searches, no new hypothetical framework has yet been synthesized. The needle-in-a-haystack problem of finding promising candidates among large databases of predicted structures has intrigued materials scientists for decades; most work to date on the zeolite problem has been limited to intuitive structural descriptors. Here, we tackle this problem through a rigorous data science scheme-the "zeolite sorting hat"-that exploits interatomic correlations to produce a 95% real versus theoretical zeolites classification accuracy. The hypothetical frameworks that are grouped together with known zeolites are promising candidates for synthesis, that can be further ranked by estimating their thermodynamic stability. A critical analysis of the classifier reveals the decisive structural features. Further partitioning into compositional classes provides guidance in the design of synthetic strategies.
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
Equivariant representations for molecular Hamiltonians and N-center atomic-scale properties
Authors:
Jigyasa Nigam,
Michael Willatt,
Michele Ceriotti
Abstract:
Symmetry considerations are at the core of the major frameworks used to provide an effective mathematical representation of atomic configurations that is then used in machine-learning models to predict the properties associated with each structure. In most cases, the models rely on a description of atom-centered environments, and are suitable to learn atomic properties, or global observables that…
▽ More
Symmetry considerations are at the core of the major frameworks used to provide an effective mathematical representation of atomic configurations that is then used in machine-learning models to predict the properties associated with each structure. In most cases, the models rely on a description of atom-centered environments, and are suitable to learn atomic properties, or global observables that can be decomposed into atomic contributions. Many quantities that are relevant for quantum mechanical calculations, however -- most notably the single-particle Hamiltonian matrix when written in an atomic-orbital basis -- are not associated with a single center, but with two (or more) atoms in the structure. We discuss a family of structural descriptors that generalize the very successful atom-centered density correlation features to the N-centers case, and show in particular how this construction can be applied to efficiently learn the matrix elements of the (effective) single-particle Hamiltonian written in an atom-centered orbital basis. These N-centers features are fully equivariant -- not only in terms of translations and rotations, but also in terms of permutations of the indices associated with the atoms -- and are suitable to construct symmetry-adapted machine-learning models of new classes of properties of molecules and materials.
△ Less
Submitted 20 December, 2021; v1 submitted 24 September, 2021;
originally announced September 2021.
-
Local invertibility and sensitivity of atomic structure-feature mappings
Authors:
Sergey N. Pozdnyakov,
Liwei Zhang,
Christoph Ortner,
Gábor Csányi,
Michele Ceriotti
Abstract:
The increasingly common applications of machine-learning schemes to atomic-scale simulations have triggered efforts to better understand the mathematical properties of the mapping between the Cartesian coordinates of the atoms and the variety of representations that can be used to convert them into a finite set of symmetric descriptors or features. Here, we analyze the sensitivity of the mapping t…
▽ More
The increasingly common applications of machine-learning schemes to atomic-scale simulations have triggered efforts to better understand the mathematical properties of the mapping between the Cartesian coordinates of the atoms and the variety of representations that can be used to convert them into a finite set of symmetric descriptors or features. Here, we analyze the sensitivity of the mapping to atomic displacements, showing that the combination of symmetry and smoothness leads to mappings that have singular points at which the Jacobian has one or more null singular values (besides those corresponding to infinitesimal translations and rotations). This is in fact desirable, because it enforces physical symmetry constraints on the values predicted by regression models constructed using such representations. However, besides these symmetry-induced singularities, there are also spurious singular points, that we find to be linked to the incompleteness of the mapping, i.e. the fact that, for certain classes of representations, structurally distinct configurations are not guaranteed to be mapped onto different feature vectors. Additional singularities can be introduced by a too aggressive truncation of the infinite basis set that is used to discretize the representations.
△ Less
Submitted 23 September, 2021;
originally announced September 2021.
-
The importance of nuclear quantum effects for NMR crystallography
Authors:
Edgar A. Engel,
Venkat Kapil,
Michele Ceriotti
Abstract:
The resolving power of solid-state nuclear magnetic resonance (NMR) crystallography depends heavily on the accuracy of computational predictions of NMR chemical shieldings of candidate structures, which are usually taken to be local minima in the potential energy. To test the limits of this approximation, we systematically study the importance of finite-temperature and quantum nuclear fluctuations…
▽ More
The resolving power of solid-state nuclear magnetic resonance (NMR) crystallography depends heavily on the accuracy of computational predictions of NMR chemical shieldings of candidate structures, which are usually taken to be local minima in the potential energy. To test the limits of this approximation, we systematically study the importance of finite-temperature and quantum nuclear fluctuations for $^1$H, $^{13}$C, and $^{15}$N shieldings in polymorphs of three paradigmatic molecular crystals -- benzene, glycine, and succinic acid. The effect of quantum fluctuations is comparable to the typical errors of shielding predictions for static nuclei with respect to experiments, and their inclusion to improve the agreement with measurements, translating to more reliable assignment of the NMR spectra to the correct candidate structure. The use of integrated machine-learning models, trained on first-principles energies and shieldings, renders rigorous sampling of nuclear fluctuations affordable, setting a new standard for the calculations underlying NMR structure determinations.
△ Less
Submitted 9 January, 2022; v1 submitted 27 June, 2021;
originally announced June 2021.
-
Learning electron densities in the condensed phase
Authors:
Alan M. Lewis,
Andrea Grisafi,
Michele Ceriotti,
Mariana Rossi
Abstract:
We introduce a local machine-learning method for predicting the electron densities of periodic systems. The framework is based on a numerical, atom-centred auxiliary basis, which enables an accurate expansion of the all-electron density in a form suitable for learning isolated and periodic systems alike. We show that using this formulation the electron densities of metals, semiconductors and molec…
▽ More
We introduce a local machine-learning method for predicting the electron densities of periodic systems. The framework is based on a numerical, atom-centred auxiliary basis, which enables an accurate expansion of the all-electron density in a form suitable for learning isolated and periodic systems alike. We show that using this formulation the electron densities of metals, semiconductors and molecular crystals can all be accurately predicted using symmetry-adapted Gaussian process regression models, properly adjusted for the non-orthogonal nature of the basis. These predicted densities enable the efficient calculation of electronic properties which present errors on the order of tens of meV/atom when compared to ab initio density-functional calculations. We demonstrate the key power of this approach by using a model trained on ice unit cells containing only 4 water molecules to predict the electron densities of cells containing up to 512 molecules, and see no increase in the magnitude of the errors of derived electronic properties when increasing the system size. Indeed, we find that these extrapolated derived energies are more accurate than those predicted using a direct machine-learning model. Finally, on heterogeneous datasets SALTED can predict electron densities with errors below 4%.
△ Less
Submitted 9 November, 2021; v1 submitted 9 June, 2021;
originally announced June 2021.
-
Optimal radial basis for density-based atomic representations
Authors:
Alexander Goscinski,
Félix Musil,
Sergey Pozdnyakov,
Michele Ceriotti
Abstract:
The input of almost every machine learning algorithm targeting the properties of matter at the atomic scale involves a transformation of the list of Cartesian atomic coordinates into a more symmetric representation. Many of the most popular representations can be seen as an expansion of the symmetrized correlations of the atom density, and differ mainly by the choice of basis. Considerable effort…
▽ More
The input of almost every machine learning algorithm targeting the properties of matter at the atomic scale involves a transformation of the list of Cartesian atomic coordinates into a more symmetric representation. Many of the most popular representations can be seen as an expansion of the symmetrized correlations of the atom density, and differ mainly by the choice of basis. Considerable effort has been dedicated to the optimization of the basis set, typically driven by heuristic considerations on the behavior of the regression target. Here we take a different, unsupervised viewpoint, aiming to determine the basis that encodes in the most compact way possible the structural information that is relevant for the dataset at hand. For each training dataset and number of basis functions, one can determine a unique basis that is optimal in this sense, and can be computed at no additional cost with respect to the primitive basis by approximating it with splines. We demonstrate that this construction yields representations that are accurate and computationally efficient, particularly when constructing representations that correspond to high-body order correlations. We present examples that involve both molecular and condensed-phase machine-learning models.
△ Less
Submitted 10 January, 2022; v1 submitted 18 May, 2021;
originally announced May 2021.
-
Quantum vibronic effects on the electronic properties of solid and molecular carbon
Authors:
Arpan Kundu,
Marco Govoni,
Han Yang,
Michele Ceriotti,
Francois Gygi,
Giulia Galli
Abstract:
We study the effect of quantum vibronic coupling on the electronic properties of carbon allotropes, including molecules and solids, by combining path integral first principles molecular dynamics (FPMD) with a colored noise thermostat. In addition to avoiding several approximations commonly adopted in calculations of electron-phonon coupling, our approach only adds a moderate computational cost to…
▽ More
We study the effect of quantum vibronic coupling on the electronic properties of carbon allotropes, including molecules and solids, by combining path integral first principles molecular dynamics (FPMD) with a colored noise thermostat. In addition to avoiding several approximations commonly adopted in calculations of electron-phonon coupling, our approach only adds a moderate computational cost to FPMD simulations and hence it is applicable to large supercells, such as those required to describe amorphous solids. We predict the effect of electron-phonon coupling on the fundamental gap of amorphous carbon, and we show that in diamond the zero-phonon renormalization of the band gap is larger than previously reported.
△ Less
Submitted 22 April, 2021;
originally announced April 2021.
-
Modeling the Ga/As binary system across temperaturesand compositions from first principles
Authors:
Giulio imbalzano,
Michele Ceriotti
Abstract:
Materials composed of elements from the third and fifth columns of the periodic table display a very rich behavior, with the phase diagram usually containing a metallic liquid phase and a polar semiconducting solid. As a consequence, it is very hard to achieve transferable empirical models of interactions between the atoms that can reliably predict their behavior across the temperature and composi…
▽ More
Materials composed of elements from the third and fifth columns of the periodic table display a very rich behavior, with the phase diagram usually containing a metallic liquid phase and a polar semiconducting solid. As a consequence, it is very hard to achieve transferable empirical models of interactions between the atoms that can reliably predict their behavior across the temperature and composition range that is relevant to the study of the synthesis and properties of III/V nanostructures and devices. We present a machine-learning potential trained on density functional theory reference data that provides a general-purpose model for the Ga$_x$As$_{1-x}$ system. We provide a series of stringent tests that showcase the accuracy of the potential, and its applicability across the whole binary phase space, computing with ab initio accuracy a large number of finite-temperature properties as well as the location of phase boundaries. We also show how a committe model can be used to reliably determine the uncertainty induced by the limitations of the ML model on its predictions, to identify regions of phase space that are predicted with insufficient accuracy, and to iteratively refine the training set to achieve consistent, reliable modeling.
△ Less
Submitted 12 January, 2022; v1 submitted 16 March, 2021;
originally announced March 2021.
-
Efficient implementation of atom-density representations
Authors:
Félix Musil,
Max Veit,
Alexander Goscinski,
Guillaume Fraux,
Michael J. Willatt,
Markus Stricker,
Till Junge,
Michele Ceriotti
Abstract:
Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods. They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space. Recently, it has become clear t…
▽ More
Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods. They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space. Recently, it has become clear that many of the most effective representations share a fundamental formal connection: that they can all be expressed as a discretization of N-body correlation functions of the local atom density, suggesting the opportunity of standardizing and, more importantly, optimizing the calculation of such representations. We present an implementation, named librascal, whose modular design lends itself both to developing refinements to the density-based formalism and to rapid prototyping for new developments of rotationally equivariant atomistic representations. As an example, we discuss SOAP features, perhaps the most widely used member of this family of representations, to show how the expansion of the local density can be optimized for any choice of radial basis set. We discuss the representation in the context of a kernel ridge regression model, commonly used with SOAP features, and analyze how the computational effort scales for each of the individual steps of the calculation. By applying data reduction techniques in feature space, we show how to further reduce the total computational cost by at up to a factor of 4 or 5 without affecting the model's symmetry properties and without significantly impacting its accuracy.
△ Less
Submitted 21 January, 2021;
originally announced January 2021.
-
Physics-inspired structural representations for molecules and materials
Authors:
Felix Musil,
Andrea Grisafi,
Albert P. Bartók,
Christoph Ortner,
Gábor Csányi,
Michele Ceriotti
Abstract:
The first step in the construction of a regression model or a data-driven analysis, aiming to predict or elucidate the relationship between the atomic scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of atomic-scale representations has played, and continues to play, a central role in the succe…
▽ More
The first step in the construction of a regression model or a data-driven analysis, aiming to predict or elucidate the relationship between the atomic scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of atomic-scale representations has played, and continues to play, a central role in the success of machine-learning methods for chemistry and materials science. This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chemical descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks, and the ideas that lead to computationally efficient and universally applicable models. It emphasizes the link between properties, structures, their physical chemistry and their mathematical description, provides examples of recent applications to a diverse set of chemical and materials science problems, and outlines the open questions and the most promising research directions in the field.
△ Less
Submitted 4 August, 2021; v1 submitted 12 January, 2021;
originally announced January 2021.