-
Distributed Equivariant Graph Neural Networks for Large-Scale Electronic Structure Prediction
Authors:
Manasa Kaniselvan,
Alexander Maeder,
Chen Hao Xia,
Alexandros Nikolaos Ziogas,
Mathieu Luisier
Abstract:
Equivariant Graph Neural Networks (eGNNs) trained on density-functional theory (DFT) data can potentially perform electronic structure prediction at unprecedented scales, enabling investigation of the electronic properties of materials with extended defects, interfaces, or exhibiting disordered phases. However, as interactions between atomic orbitals typically extend over 10+ angstroms, the graph representations required for this task tend to be densely connected, and the memory requirements to perform training and inference on these large structures can exceed the limits of modern GPUs. Here we present a distributed eGNN implementation which leverages direct GPU communication and introduce a partitioning strategy of the input graph to reduce the number of embedding exchanges between GPUs. Our implementation shows strong scaling up to 128 GPUs, and weak scaling up to 512 GPUs with 87% parallel efficiency for structures with 3,000 to 190,000 atoms on the Alps supercomputer.
Submitted 4 July, 2025;
originally announced July 2025.
-
Learning the Electronic Hamiltonian of Large Atomic Structures
Authors:
Chen Hao Xia,
Manasa Kaniselvan,
Alexandros Nikolaos Ziogas,
Marko Mladenović,
Rayen Mahjoub,
Alexander Maeder,
Mathieu Luisier
Abstract:
Graph neural networks (GNNs) have shown promise in learning the ground-state electronic properties of materials, subverting ab initio density functional theory (DFT) calculations when the underlying lattices can be represented as small and/or repeatable unit cells (i.e., molecules and periodic crystals). Realistic systems are, however, non-ideal and generally characterized by higher structural complexity. As such, they require large (10+ Angstroms) unit cells and thousands of atoms to be accurately described. At these scales, DFT becomes computationally prohibitive, making GNNs especially attractive. In this work, we present a strictly local equivariant GNN capable of learning the electronic Hamiltonian (H) of realistically extended materials. It incorporates an augmented partitioning approach that enables training on arbitrarily large structures while preserving local atomic environments beyond boundaries. We demonstrate its capabilities by predicting the electronic Hamiltonian of various systems with up to 3,000 nodes (atoms), 500,000+ edges, ~28 million orbital interactions (nonzero entries of H), and ≤0.53% error in the eigenvalue spectra. Our work expands the applicability of current electronic property prediction methods to some of the most challenging cases encountered in computational materials science, namely systems with disorder, interfaces, and defects.
Submitted 6 June, 2025; v1 submitted 31 January, 2025;
originally announced January 2025.
-
Electron-Electron Interactions in Device Simulation via Non-equilibrium Green's Functions and the GW Approximation
Authors:
Leonard Deuschle,
Jiang Cao,
Alexandros Nikolaos Ziogas,
Anders Winka,
Alexander Maeder,
Nicolas Vetsch,
Mathieu Luisier
Abstract:
The continuous scaling of metal-oxide-semiconductor field-effect transistors (MOSFETs) has led to device geometries where charged carriers are increasingly confined to ever smaller channel cross sections. This development is associated with reduced screening of long-range Coulomb interactions. To accurately predict the behavior of such ultra-scaled devices, electron-electron (e-e) interactions must be explicitly incorporated in their quantum transport simulation. In this paper, we present an ab initio atomistic simulation framework based on density functional theory, the non-equilibrium Green's function formalism, and the self-consistent GW approximation to perform this task. The implemented method is first validated with a carbon nanotube test structure before being applied to calculate the transfer characteristics of a silicon nanowire MOSFET in a gate-all-around configuration. As a consequence of e-e scattering, the energy and spatial distribution of the carrier and current densities both significantly change, while the on-current of the transistor decreases owing to the Coulomb repulsion between the electrons. Furthermore, we demonstrate how the resulting bandgap modulation of the nanowire channel as a function of the gate-to-source voltage could potentially improve the device performance. To the best of our knowledge, this study is the first one reporting large-scale atomistic quantum transport simulations of nano-devices under non-equilibrium conditions and in the presence of e-e interactions within the GW approximation.
Submitted 17 December, 2024;
originally announced December 2024.
-
Single Neuromorphic Memristor closely Emulates Multiple Synaptic Mechanisms for Energy Efficient Neural Networks
Authors:
Christoph Weilenmann,
Alexandros Ziogas,
Till Zellweger,
Kevin Portner,
Marko Mladenović,
Manasa Kaniselvan,
Timoleon Moraitis,
Mathieu Luisier,
Alexandros Emboras
Abstract:
Biological neural networks do not only include long-term memory and weight multiplication capabilities, as commonly assumed in artificial neural networks, but also more complex functions such as short-term memory, short-term plasticity, and meta-plasticity - all collocated within each synapse. Here, we demonstrate memristive nano-devices based on SrTiO3 that inherently emulate all these synaptic functions. These memristors operate in a non-filamentary, low conductance regime, which enables stable and energy efficient operation. They can act as multi-functional hardware synapses in a class of bio-inspired deep neural networks (DNN) that make use of both long- and short-term synaptic dynamics and are capable of meta-learning or "learning-to-learn". The resulting bio-inspired DNN is then trained to play the video game Atari Pong, a complex reinforcement learning task in a dynamic environment. Our analysis shows that the energy consumption of the DNN with multi-functional memristive synapses decreases by about two orders of magnitude as compared to a pure GPU implementation. Based on this finding, we infer that memristive devices with a better emulation of the synaptic functionalities do not only broaden the applicability of neuromorphic computing, but could also improve the performance and energy costs of certain artificial intelligence applications.
Submitted 26 February, 2024;
originally announced February 2024.