-
Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling
Authors:
Javier E. Santos,
Agnese Marcato,
Roman Colman,
Nicholas Lubbers,
Yen Ting Lin
Abstract:
Generative diffusion models have achieved remarkable success in producing high-quality images. However, these models typically operate in continuous intensity spaces, diffusing independently across pixels and color channels. As a result, they are fundamentally ill-suited for applications involving inherently discrete quantities-such as particle counts or material units-that are constrained by stri…
▽ More
Generative diffusion models have achieved remarkable success in producing high-quality images. However, these models typically operate in continuous intensity spaces, diffusing independently across pixels and color channels. As a result, they are fundamentally ill-suited for applications involving inherently discrete quantities-such as particle counts or material units-that are constrained by strict conservation laws like mass conservation, limiting their applicability in scientific workflows. To address this limitation, we propose Discrete Spatial Diffusion (DSD), a framework based on a continuous-time, discrete-state jump stochastic process that operates directly in discrete spatial domains while strictly preserving particle counts in both forward and reverse diffusion processes. By using spatial diffusion to achieve particle conservation, we introduce stochasticity naturally through a discrete formulation. We demonstrate the expressive flexibility of DSD by performing image synthesis, class conditioning, and image inpainting across standard image benchmarks, while exactly conditioning total image intensity. We validate DSD on two challenging scientific applications: porous rock microstructures and lithium-ion battery electrodes, demonstrating its ability to generate structurally realistic samples under strict mass conservation constraints, with quantitative evaluation using state-of-the-art metrics for transport and electrochemical performance.
△ Less
Submitted 16 May, 2025; v1 submitted 3 May, 2025;
originally announced May 2025.
-
Machine learning interatomic potential for modeling uranium mononitride
Authors:
Lorena Alzate-Vargas,
Kashi N. Subedi,
Nicholas Lubbers,
Michael W. D Cooper,
Roxanne M. Tutchton,
Tammie Gibson,
Richard A. Messerly
Abstract:
Uranium mononitride (UN) is a promising accident tolerant fuel due to its high fissile density and high thermal conductivity. In this study, we developed the first machine learning interatomic potentials for reliable atomic-scale modeling of UN at finite temperatures. We constructed a training set using density functional theory (DFT) calculations that was enriched through an active learning proce…
▽ More
Uranium mononitride (UN) is a promising accident tolerant fuel due to its high fissile density and high thermal conductivity. In this study, we developed the first machine learning interatomic potentials for reliable atomic-scale modeling of UN at finite temperatures. We constructed a training set using density functional theory (DFT) calculations that was enriched through an active learning procedure and two neural network potentials were generated. We found that both potentials can reproduce some thermophysical properties of interest such as temperature dependent heat capacity. We also evaluated the energy of stoichiometric defect reactions and defect migration barriers and found close agreement with DFT values demonstrating that our potentials can be used for a description of defects in UN.
△ Less
Submitted 21 November, 2024;
originally announced November 2024.
-
Machine Learning Models Capture Plasmon Dynamics in Ag Nanoparticles
Authors:
Adela Habib,
Nicholas Lubbers,
Sergei Tretiak,
Benjamin Nebgen
Abstract:
Highly energetic electron-hole pairs (hot carriers) formed from plasmon decay in metallic nanostructures promise sustainable pathways for energy-harvesting devices. However, efficient collection before thermalization remains an obstacle for realization of their full energy generating potential. Addressing this challenge requires detailed understanding of physical processes from plasmon excitation…
▽ More
Highly energetic electron-hole pairs (hot carriers) formed from plasmon decay in metallic nanostructures promise sustainable pathways for energy-harvesting devices. However, efficient collection before thermalization remains an obstacle for realization of their full energy generating potential. Addressing this challenge requires detailed understanding of physical processes from plasmon excitation in metal to their collection in a molecule or a semiconductor, where atomistic theoretical investigation may be particularly beneficial. Unfortunately, first-principles theoretical modeling of these processes is extremely costly, limiting the analysis to systems with a few 100s of atoms. Recent advances in machine learned interatomic potentials suggest that dynamics can be accelerated with surrogate models which replace the full solution of the Schroedinger Equation. Here, we modify an existing neural network, Hierarchically Interacting Particle Neural Network (HIP-NN), to predict plasmon dynamics in Ag nanoparticles. We demonstrate the model's capability to accurately predict plasmon dynamics in large nanoparticles of up to 561 atoms not present in the training dataset. More importantly, with machine learning models we gain a speed-up of about 200 times as compared with the rt-TDDFT calculations when predicting important physical quantities such as dynamic dipole moments in Ag55 and about 4000 times for extended nanoparticles that are 10 times larger. This underscores the promise of future machine learning accelerated electron/nuclear dynamics simulations for understanding fundamental properties of plasmon-driven hot carrier devices.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Machine Learning of consistent thermodynamic models using automatic differentiation
Authors:
David Rosenberger,
Kipton Barros,
Timothy C. Germann,
Nicholas Lubbers
Abstract:
We propose a data-driven method to describe consistent equations of state (EOS) for arbitrary systems. Complex EOS are traditionally obtained by fitting suitable analytical expressions to thermophysical data. A key aspect of EOS are that the relationships between state variables are given by derivatives of the system free energy. In this work, we model the free energy with an artificial neural net…
▽ More
We propose a data-driven method to describe consistent equations of state (EOS) for arbitrary systems. Complex EOS are traditionally obtained by fitting suitable analytical expressions to thermophysical data. A key aspect of EOS are that the relationships between state variables are given by derivatives of the system free energy. In this work, we model the free energy with an artificial neural network, and utilize automatic differentiation to directly learn the derivatives of the free energy. We demonstrate this approach on two different systems, the analytic van der Waals EOS, and published data for the Lennard-Jones fluid, and show that it is advantageous over direct learning of thermodynamic properties (i.e. not as derivatives of the free energy, but as independent properties), in terms of both accuracy and the exact preservation of the Maxwell relations. Furthermore, the method implicitly provides the free energy of a system without explicit integration.
△ Less
Submitted 9 March, 2022; v1 submitted 10 August, 2021;
originally announced August 2021.
-
Mean-field theory of an asset exchange model with economic growth and wealth distribution
Authors:
W. Klein,
N. Lubbers,
Kang K. L. Liu,
T. Khouw,
Harvey Gould
Abstract:
We develop a mean-field theory of the growth, exchange and distribution (GED) model introduced by Kang et al. (preceding paper) that accurately describes the phase transition in the limit that the number of agents $N$ approaches infinity. The GED model is a generalization of the Yard-Sale model in which the additional wealth added by economic growth is nonuniformly distributed to the agents accord…
▽ More
We develop a mean-field theory of the growth, exchange and distribution (GED) model introduced by Kang et al. (preceding paper) that accurately describes the phase transition in the limit that the number of agents $N$ approaches infinity. The GED model is a generalization of the Yard-Sale model in which the additional wealth added by economic growth is nonuniformly distributed to the agents according to their wealth in a way determined by the parameter $λ$. The model was shown numerically to have a phase transition at $λ=1$ and be characterized by critical exponents and critical slowing down. Our mean-field treatment of the GED model correctly predicts the existence of the phase transition, critical slowing down, the values of the critical exponents, and introduces an energy whose probability satisfies the Boltzmann distribution for $λ< 1$, implying that the system is in thermodynamic equilibrium in the limit that $N \to \infty$. We show that the values of the critical exponents obtained by varying $λ$ for a fixed value of $N$ do not satisfy the usual scaling laws, but do satisfy scaling if a combination of parameters, which we refer to as the Ginzburg parameter, is much greater than one and is held constant. We discuss possible implications of our results for understanding economic systems and the subtle nature of the mean-field limit in systems with both additive and multiplicative noise.
△ Less
Submitted 24 June, 2021; v1 submitted 1 February, 2021;
originally announced February 2021.
-
Simulation of a generalized asset exchange model with economic growth and wealth distribution
Authors:
Kang K. L. Liu,
N. Lubbers,
W. Klein,
J. Tobochnik,
B. M. Boghosian,
Harvey Gould
Abstract:
The agent-based Yard-Sale model of wealth inequality is generalized to incorporate exponential economic growth and its distribution. The distribution of economic growth is nonuniform and is determined by the wealth of each agent and a parameter $λ$. Our numerical results indicate that the model has a critical point at $λ=1$ between a phase for $λ< 1$ with economic mobility and exponentially growin…
▽ More
The agent-based Yard-Sale model of wealth inequality is generalized to incorporate exponential economic growth and its distribution. The distribution of economic growth is nonuniform and is determined by the wealth of each agent and a parameter $λ$. Our numerical results indicate that the model has a critical point at $λ=1$ between a phase for $λ< 1$ with economic mobility and exponentially growing wealth of all agents and a non-stationary phase for $λ\geq 1$ with wealth condensation and no mobility. We define the energy of the system and show that the system can be considered to be in thermodynamic equilibrium for $λ< 1$. Our estimates of various critical exponents are consistent with a mean-field theory (see following paper). The exponents do not obey the usual scaling laws unless a combination of parameters that we refer to as the Ginzburg parameter is held fixed as the transition is approached. The model illustrates that both poorer and richer agents benefit from economic growth if its distribution does not favor the richer agents too strongly. This work and the accompanying theory paper contribute to understanding whether the methods of equilibrium statistical mechanics can be applied to economic systems.
△ Less
Submitted 24 August, 2021; v1 submitted 1 February, 2021;
originally announced February 2021.
-
Simple and efficient algorithms for training machine learning potentials to force data
Authors:
Justin S. Smith,
Nicholas Lubbers,
Aidan P. Thompson,
Kipton Barros
Abstract:
Abstract Machine learning models, trained on data from ab initio quantum simulations, are yielding molecular dynamics potentials with unprecedented accuracy. One limiting factor is the quantity of available training data, which can be expensive to obtain. A quantum simulation often provides all atomic forces, in addition to the total energy of the system. These forces provide much more information…
▽ More
Abstract Machine learning models, trained on data from ab initio quantum simulations, are yielding molecular dynamics potentials with unprecedented accuracy. One limiting factor is the quantity of available training data, which can be expensive to obtain. A quantum simulation often provides all atomic forces, in addition to the total energy of the system. These forces provide much more information than the energy alone. It may appear that training a model to this large quantity of force data would introduce significant computational costs. Actually, training to all available force data should only be a few times more expensive than training to energies alone. Here, we present a new algorithm for efficient force training, and benchmark its accuracy by training to forces from real-world datasets for organic chemistry and bulk aluminum.
△ Less
Submitted 9 June, 2020;
originally announced June 2020.
-
Automated discovery of a robust interatomic potential for aluminum
Authors:
Justin S. Smith,
Benjamin Nebgen,
Nithin Mathew,
Jie Chen,
Nicholas Lubbers,
Leonid Burakovsky,
Sergei Tretiak,
Hai Ah Nam,
Timothy Germann,
Saryu Fensin,
Kipton Barros
Abstract:
Accuracy of molecular dynamics simulations depends crucially on the interatomic potential used to generate forces. The gold standard would be first-principles quantum mechanics (QM) calculations, but these become prohibitively expensive at large simulation scales. Machine learning (ML) based potentials aim for faithful emulation of QM at drastically reduced computational cost. The accuracy and rob…
▽ More
Accuracy of molecular dynamics simulations depends crucially on the interatomic potential used to generate forces. The gold standard would be first-principles quantum mechanics (QM) calculations, but these become prohibitively expensive at large simulation scales. Machine learning (ML) based potentials aim for faithful emulation of QM at drastically reduced computational cost. The accuracy and robustness of an ML potential is primarily limited by the quality and diversity of the training dataset. Using the principles of active learning (AL), we present a highly automated approach to dataset construction. The strategy is to use the ML potential under development to sample new atomic configurations and, whenever a configuration is reached for which the ML uncertainty is sufficiently large, collect new QM data. Here, we seek to push the limits of automation, removing as much expert knowledge from the AL process as possible. All sampling is performed using MD simulations starting from an initially disordered configuration, and undergoing non-equilibrium dynamics as driven by time-varying applied temperatures. We demonstrate this approach by building an ML potential for aluminum (ANI-Al). After many AL iterations, ANI-Al teaches itself to predict properties like the radial distribution function in melt, liquid-solid coexistence curve, and crystal properties such as defect energies and barriers. To demonstrate transferability, we perform a 1.3M atom shock simulation, and show that ANI-Al predictions agree very well with DFT calculations on local atomic environments sampled from the nonequilibrium dynamics. Interestingly, the configurations appearing in shock appear to have been well sampled in the AL training dataset, in a way that we illustrate visually.
△ Less
Submitted 24 August, 2020; v1 submitted 10 March, 2020;
originally announced March 2020.
-
Machine Learned Hückel Theory: Interfacing Physics and Deep Neural Networks
Authors:
Tetiana Zubatyuk,
Ben Nebgen,
Nicholas Lubbers,
Justin S. Smith,
Roman Zubatyuk,
Guoqing Zhou,
Christopher Koh,
Kipton Barros,
Olexandr Isayev,
Sergei Tretiak
Abstract:
The Hückel Hamiltonian is an incredibly simple tight-binding model famed for its ability to capture qualitative physics phenomena arising from electron interactions in molecules and materials. Part of its simplicity arises from using only two types of empirically fit physics-motivated parameters: the first describes the orbital energies on each atom and the second describes electronic interactions…
▽ More
The Hückel Hamiltonian is an incredibly simple tight-binding model famed for its ability to capture qualitative physics phenomena arising from electron interactions in molecules and materials. Part of its simplicity arises from using only two types of empirically fit physics-motivated parameters: the first describes the orbital energies on each atom and the second describes electronic interactions and bonding between atoms. By replacing these traditionally static parameters with dynamically predicted values, we vastly increase the accuracy of the extended Hückel model. The dynamic values are generated with a deep neural network, which is trained to reproduce orbital energies and densities derived from density functional theory. The resulting model retains interpretability while the deep neural network parameterization is smooth, accurate, and reproduces insightful features of the original static parameterization. Finally, we demonstrate that the Hückel model, and not the deep neural network, is responsible for capturing intricate orbital interactions in two molecular case studies. Overall, this work shows the promise of utilizing machine learning to formulate simple, accurate, and dynamically parameterized physics models.
△ Less
Submitted 27 September, 2019;
originally announced September 2019.
-
Machine learning for molecular dynamics with strongly correlated electrons
Authors:
Hidemaro Suwa,
Justin S. Smith,
Nicholas Lubbers,
Cristian D. Batista,
Gia-Wei Chern,
Kipton Barros
Abstract:
We use machine learning to enable large-scale molecular dynamics (MD) of a correlated electron model under the Gutzwiller approximation scheme. This model exhibits a Mott transition as a function of on-site Coulomb repulsion $U$. The repeated solution of the Gutzwiller self-consistency equations would be prohibitively expensive for large-scale MD simulations. We show that machine learning models o…
▽ More
We use machine learning to enable large-scale molecular dynamics (MD) of a correlated electron model under the Gutzwiller approximation scheme. This model exhibits a Mott transition as a function of on-site Coulomb repulsion $U$. The repeated solution of the Gutzwiller self-consistency equations would be prohibitively expensive for large-scale MD simulations. We show that machine learning models of the Gutzwiller potential energy can be remarkably accurate. The models, which are trained with $N=33$ atoms, enable highly accurate MD simulations at much larger scales ($N\gtrsim10^{3}$). We investigate the physics of the smooth Mott crossover in the fluid phase.
△ Less
Submitted 10 April, 2019; v1 submitted 5 November, 2018;
originally announced November 2018.
-
Inferring low-dimensional microstructure representations using convolutional neural networks
Authors:
Nicholas Lubbers,
Turab Lookman,
Kipton Barros
Abstract:
We apply recent advances in machine learning and computer vision to a central problem in materials informatics: The statistical representation of microstructural images. We use activations in a pre-trained convolutional neural network to provide a high-dimensional characterization of a set of synthetic microstructural images. Next, we use manifold learning to obtain a low-dimensional embedding of…
▽ More
We apply recent advances in machine learning and computer vision to a central problem in materials informatics: The statistical representation of microstructural images. We use activations in a pre-trained convolutional neural network to provide a high-dimensional characterization of a set of synthetic microstructural images. Next, we use manifold learning to obtain a low-dimensional embedding of this statistical characterization. We show that the low-dimensional embedding extracts the parameters used to generate the images. According to a variety of metrics, the convolutional neural network method yields dramatically better embeddings than the analogous method derived from two-point correlations alone.
△ Less
Submitted 30 November, 2018; v1 submitted 8 November, 2016;
originally announced November 2016.