-
Machine Learning Interatomic Potentials: library for efficient training, model development and simulation of molecular systems
Authors:
Christoph Brunken,
Olivier Peltre,
Heloise Chomet,
Lucien Walewski,
Manus McAuliffe,
Valentin Heyraud,
Solal Attias,
Martin Maarand,
Yessine Khanfir,
Edan Toledo,
Fabio Falcioni,
Marie Bluntzer,
Silvia Acosta-Gutiérrez,
Jules Tilly
Abstract:
Machine Learning Interatomic Potentials (MLIP) are a novel in silico approach for molecular property prediction, creating an alternative to disrupt the accuracy/speed trade-off of empirical force fields and density functional theory (DFT). In this white paper, we present our MLIP library which was created with two core aims: (1) provide to industry experts without machine learning background a use…
▽ More
Machine Learning Interatomic Potentials (MLIP) are a novel in silico approach for molecular property prediction, creating an alternative to disrupt the accuracy/speed trade-off of empirical force fields and density functional theory (DFT). In this white paper, we present our MLIP library which was created with two core aims: (1) provide to industry experts without machine learning background a user-friendly and computationally efficient set of tools to experiment with MLIP models, (2) provide machine learning developers a framework to develop novel approaches fully integrated with molecular dynamics tools. The library includes in this release three model architectures (MACE, NequIP, and ViSNet), and two molecular dynamics (MD) wrappers (ASE, and JAX-MD), along with a set of pre-trained organics models. The seamless integration with JAX-MD, in particular, facilitates highly efficient MD simulations, bringing MLIP models significantly closer to industrial application. The library is available on GitHub and on PyPI under the Apache license 2.0.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
Universally applicable and tunable graph-based coarse-graining for Machine learning force fields
Authors:
Christoph Brunken,
Sebastien Boyer,
Mustafa Omar,
Martin Maarand,
Olivier Peltre,
Solal Attias,
Bakary N'tji Diallo,
Anastasia Markina,
Olaf Othersen,
Oliver Bent
Abstract:
Coarse-grained (CG) force field methods for molecular systems are a crucial tool to simulate large biological macromolecules and are therefore essential for characterisations of biomolecular systems. While state-of-the-art deep learning (DL)-based models for all-atom force fields have improved immensely over recent years, we observe and analyse significant limitations of the currently available ap…
▽ More
Coarse-grained (CG) force field methods for molecular systems are a crucial tool to simulate large biological macromolecules and are therefore essential for characterisations of biomolecular systems. While state-of-the-art deep learning (DL)-based models for all-atom force fields have improved immensely over recent years, we observe and analyse significant limitations of the currently available approaches for DL-based CG simulations. In this work, we present the first transferable DL-based CG force field approach (i.e., not specific to only one narrowly defined system type) applicable to a wide range of biosystems. To achieve this, our CG algorithm does not rely on hard-coded rules and is tuned to output coarse-grained systems optimised for minimal statistical noise in the ground truth CG forces, which results in significant improvement of model training. Our force field model is also the first CG variant that is based on the MACE architecture and is trained on a custom dataset created by a new approach based on the fragmentation of large biosystems covering protein, RNA and lipid chemistry. We demonstrate that our model can be applied in molecular dynamics simulations to obtain stable and qualitatively accurate trajectories for a variety of systems, while also discussing cases for which we observe limited reliability.
△ Less
Submitted 24 March, 2025;
originally announced April 2025.
-
Local Max-Entropy and Free Energy Principles, Belief Diffusions and their Singularities
Authors:
Olivier Peltre
Abstract:
A comprehensive picture of three Bethe-Kikuchi variational principles including their relationship to belief propagation (BP) algorithms on hypergraphs is given. The structure of BP equations is generalized to define continuous-time diffusions, solving localized versions of the max-entropy principle (A), the variational free energy principle (B), and a less usual equilibrium free energy principle…
▽ More
A comprehensive picture of three Bethe-Kikuchi variational principles including their relationship to belief propagation (BP) algorithms on hypergraphs is given. The structure of BP equations is generalized to define continuous-time diffusions, solving localized versions of the max-entropy principle (A), the variational free energy principle (B), and a less usual equilibrium free energy principle (C), Legendre dual to A. Both critical points of Bethe-Kikuchi functionals and stationary beliefs are shown to lie at the non-linear intersection of two constraint surfaces, enforcing energy conservation and marginal consistency respectively. The hypersurface of singular beliefs, accross which equilibria become unstable as the constraint surfaces meet tangentially, is described by polynomial equations in the convex polytope of consistent beliefs. This polynomial is expressed by a loop series expansion for graphs of binary variables.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
Equivariant Message Passing Neural Network for Crystal Material Discovery
Authors:
Astrid Klipfel,
Olivier Peltre,
Najwa Harrati,
Yaël Fregier,
Adlane Sayede,
Zied Bouraoui
Abstract:
Automatic material discovery with desired properties is a fundamental challenge for material sciences. Considerable attention has recently been devoted to generating stable crystal structures. While existing work has shown impressive success on supervised tasks such as property prediction, the progress on unsupervised tasks such as material generation is still hampered by the limited extent to whi…
▽ More
Automatic material discovery with desired properties is a fundamental challenge for material sciences. Considerable attention has recently been devoted to generating stable crystal structures. While existing work has shown impressive success on supervised tasks such as property prediction, the progress on unsupervised tasks such as material generation is still hampered by the limited extent to which the equivalent geometric representations of the same crystal are considered. To address this challenge, we propose EMPNN a periodic equivariant message-passing neural network that learns crystal lattice deformation in an unsupervised fashion. Our model equivalently acts on lattice according to the deformation action that must be performed, making it suitable for crystal generation, relaxation and optimisation. We present experimental evaluations that demonstrate the effectiveness of our approach.
△ Less
Submitted 1 February, 2023;
originally announced February 2023.
-
Local Max-Entropy and Free Energy Principles Solved by Belief Propagation
Authors:
Olivier Peltre
Abstract:
A statistical system is classically defined on a set of microstates $E$ by a global energy function $H : E \to \mathbb{R}$, yielding Gibbs probability measures (softmins) $ρ^β(H)$ for every inverse temperature $β= T^{-1}$. Gibbs states are simultaneously characterized by free energy principles and the max-entropy principle, with dual constraints on inverse temperature $β$ and mean energy…
▽ More
A statistical system is classically defined on a set of microstates $E$ by a global energy function $H : E \to \mathbb{R}$, yielding Gibbs probability measures (softmins) $ρ^β(H)$ for every inverse temperature $β= T^{-1}$. Gibbs states are simultaneously characterized by free energy principles and the max-entropy principle, with dual constraints on inverse temperature $β$ and mean energy ${\cal U}(β) = \mathbb{E}_{ρ^β}[H]$ respectively. The Legendre transform relates these diverse variational principles which are unfortunately not tractable in high dimension.
The global energy is generally given as a sum $H(x) = \sum_{\rm a \subset Ω} h_{\rm a}(x_{|\rm a})$ of local short-range interactions $h_{\rm a} : E_{\rm a} \to \mathbb{R}$ indexed by bounded subregions ${\rm a} \subset Ω$, and this local structure can be used to design good approximation schemes on thermodynamic functionals. We show that the generalized belief propagation (GBP) algorithm solves a collection of local variational principles, by converging to critical points of Bethe-Kikuchi approximations of the free energy $F(β)$, the Shannon entropy $S(\cal U)$, and the variational free energy ${\cal F}(β) = {\cal U} - β^{-1} S(\cal U)$, extending an initial correspondence by Yedidia et al. This local form of Legendre duality yields a possible degenerate relationship between mean energy ${\cal U}$ and $β$.
△ Less
Submitted 2 July, 2022;
originally announced July 2022.
-
Belief Propagation as Diffusion
Authors:
Olivier Peltre
Abstract:
We introduce novel belief propagation algorithms to estimate the marginals of a high dimensional probability distribution. They involve natural (co)homological constructions relevant for a localised description of statistical systems.
We introduce novel belief propagation algorithms to estimate the marginals of a high dimensional probability distribution. They involve natural (co)homological constructions relevant for a localised description of statistical systems.
△ Less
Submitted 26 July, 2021;
originally announced July 2021.
-
Extra-fine sheaves and interaction decompositions
Authors:
Daniel Bennequin,
Olivier Peltre,
Grégoire Sergeant-Perthuis,
Juan Pablo Vigneaux
Abstract:
We introduce an original notion of extra-fine sheaf on a topological space, and a variant (hyper-extra-fine) for which Čech cohomology in strictly positive degree vanishes. We provide a characterization of such sheaves when the topological space is a partially ordered set (poset) equipped with the Alexandrov topology. Then we further specialize our results to some sheaves of vector spaces and inje…
▽ More
We introduce an original notion of extra-fine sheaf on a topological space, and a variant (hyper-extra-fine) for which Čech cohomology in strictly positive degree vanishes. We provide a characterization of such sheaves when the topological space is a partially ordered set (poset) equipped with the Alexandrov topology. Then we further specialize our results to some sheaves of vector spaces and injective maps, where extra-fineness is (essentially) equivalent to the decomposition of the sheaf into a direct sum of subfunctors, known as interaction decomposition, and can be expressed by a sum-intersection condition. We use these results to compute the dimension of the space of global sections when the presheaves are freely generated over a functor of sets, generalizing classical counting formulae for the number of solutions of the linearized marginal problem (Kellerer and Matúš). We finish with a comparison theorem between the Čech cohomology associated to a covering and the topos cohomology of the poset with coefficients in the presheaf, which is also the cohomology of a cosimplicial local system over the nerve of the poset. For that, we give a detailed treatment of cosimplicial local systems on simplicial sets. The appendixes present presheaves, sheaves and Čech cohomology, and their application to the marginal problem.
△ Less
Submitted 18 December, 2020; v1 submitted 26 September, 2020;
originally announced September 2020.
-
Message-Passing Algorithms and Homology
Authors:
Olivier Peltre
Abstract:
This PhD thesis lays out algebraic and topological structures relevant for the study of probabilistic graphical models.
Marginal estimation algorithms are introduced as diffusion equations of the form $\dot u = δ\varphi$. They generalise the traditional belief propagation (BP) algorithm, and provide an alternative for contrastive divergence (CD) or Markov chain Monte Carlo (MCMC) algorithms, typ…
▽ More
This PhD thesis lays out algebraic and topological structures relevant for the study of probabilistic graphical models.
Marginal estimation algorithms are introduced as diffusion equations of the form $\dot u = δ\varphi$. They generalise the traditional belief propagation (BP) algorithm, and provide an alternative for contrastive divergence (CD) or Markov chain Monte Carlo (MCMC) algorithms, typically involved in estimating a free energy functional and its gradient w.r.t. model parameters.
We propose a new homological picture where parameters are a collections of local interaction potentials $(u_α) \in A_0$, for $α$ running over the factor nodes of a given region graph. The boundary operator $δ$ mapping heat fluxes $(\varphi_{αβ}) \in A_1$ to a subspace $δA_1 \subseteq A_0$ is the discrete analog of a divergence. The total energy $H = \sum_αu_α$ defining the global probability $p = e^{-H} / Z$ is in one-to-one correspondence with a homology class $[u] = u + δA_1$ of interaction potentials, so that total energy remains constant when $u$ evolves up to a boundary term $δ\varphi$.
Stationary states of diffusion are shown to lie at the intersection of a homology class of potentials with a non-linear constraint surface enforcing consistency of the local marginals estimates. This picture allows us to precise and complete a proof on the correspondence between stationary states of BP and critical points of a local free energy functional (obtained by Bethe-Kikuchi approximations) and to extend the uniqueness result for acyclic graphs (i.e. trees) to a wider class of hypergraphs. In general, bifurcations of equilibria are related to the spectral singularities of a local diffusion operator, yielding new explicit examples of the degeneracy phenomenon.
Work supervised by Pr. Daniel Bennequin
△ Less
Submitted 24 September, 2020;
originally announced September 2020.
-
Geomstats: A Python Package for Riemannian Geometry in Machine Learning
Authors:
Nina Miolane,
Alice Le Brigant,
Johan Mathe,
Benjamin Hou,
Nicolas Guigui,
Yann Thanwerdas,
Stefan Heyder,
Olivier Peltre,
Niklas Koep,
Hadi Zaatiti,
Hatem Hajri,
Yann Cabanes,
Thomas Gerald,
Paul Chauchat,
Christian Shewmake,
Bernhard Kainz,
Claire Donnat,
Susan Holmes,
Xavier Pennec
Abstract:
We introduce Geomstats, an open-source Python toolbox for computations and statistics on nonlinear manifolds, such as hyperbolic spaces, spaces of symmetric positive definite matrices, Lie groups of transformations, and many more. We provide object-oriented and extensively unit-tested implementations. Among others, manifolds come equipped with families of Riemannian metrics, with associated expone…
▽ More
We introduce Geomstats, an open-source Python toolbox for computations and statistics on nonlinear manifolds, such as hyperbolic spaces, spaces of symmetric positive definite matrices, Lie groups of transformations, and many more. We provide object-oriented and extensively unit-tested implementations. Among others, manifolds come equipped with families of Riemannian metrics, with associated exponential and logarithmic maps, geodesics and parallel transport. Statistics and learning algorithms provide methods for estimation, clustering and dimension reduction on manifolds. All associated operations are vectorized for batch computation and provide support for different execution backends, namely NumPy, PyTorch and TensorFlow, enabling GPU acceleration. This paper presents the package, compares it with related libraries and provides relevant code examples. We show that Geomstats provides reliable building blocks to foster research in differential geometry and statistics, and to democratize the use of Riemannian geometry in machine learning applications. The source code is freely available under the MIT license at \url{geomstats.ai}.
△ Less
Submitted 7 April, 2020;
originally announced April 2020.
-
A Homological Approach to Belief Propagation and Bethe Approximations
Authors:
Olivier Peltre
Abstract:
We introduce a differential complex of local observables given a decomposition of a global set of random variables into subsets. Its boundary operator allows us to define a transport equation equivalent to Belief Propagation. This definition reveals a set of conserved quantities under Belief Propagation and gives new insight on the relationship of its equilibria with the critical points of Bethe f…
▽ More
We introduce a differential complex of local observables given a decomposition of a global set of random variables into subsets. Its boundary operator allows us to define a transport equation equivalent to Belief Propagation. This definition reveals a set of conserved quantities under Belief Propagation and gives new insight on the relationship of its equilibria with the critical points of Bethe free energy.
△ Less
Submitted 14 March, 2019;
originally announced March 2019.