Skip to main content

Showing 1–12 of 12 results for author: Kaba, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.14665  [pdf, ps, other

    physics.chem-ph cs.AI cs.CE cs.LG physics.comp-ph

    Accurate and scalable exchange-correlation with deep learning

    Authors: Giulia Luise, Chin-Wei Huang, Thijs Vogels, Derk P. Kooi, Sebastian Ehlert, Stephanie Lanius, Klaas J. H. Giesbertz, Amir Karton, Deniz Gunceler, Megan Stanley, Wessel P. Bruinsma, Lin Huang, Xinran Wei, José Garrido Torres, Abylay Katbashev, Rodrigo Chavez Zavaleta, Bálint Máté, Sékou-Oumar Kaba, Roberto Sordillo, Yingrong Chen, David B. Williams-Young, Christopher M. Bishop, Jan Hermann, Rianne van den Berg, Paola Gori-Giorgi

    Abstract: Density Functional Theory (DFT) is the most widely used electronic structure method for predicting the properties of molecules and materials. Although DFT is, in principle, an exact reformulation of the Schrödinger equation, practical applications rely on approximations to the unknown exchange-correlation (XC) functional. Most existing XC functionals are constructed using a limited set of increasi… ▽ More

    Submitted 23 June, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

    Comments: Main: 13 pages plus references, 11 figures and tables. Supplementary information: 19 pages, 12 figures and tables. v2 update: fix rendering of figure 1 and part of figure 5 in Safari PDF viewer. v3 update: update author information and fix typo

  2. arXiv:2503.21985  [pdf, other

    cs.LG stat.ML

    Improving Equivariant Networks with Probabilistic Symmetry Breaking

    Authors: Hannah Lawrence, Vasco Portilheiro, Yan Zhang, Sékou-Oumar Kaba

    Abstract: Equivariance encodes known symmetries into neural networks, often enhancing generalization. However, equivariant networks cannot break symmetries: the output of an equivariant network must, by definition, have at least the same self-symmetries as the input. This poses an important problem, both (1) for prediction tasks on domains where self-symmetries are common, and (2) for generative models, whi… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: 28 pages, 7 figures

  3. arXiv:2503.10834  [pdf, other

    stat.ML cs.LG

    On the Identifiability of Causal Abstractions

    Authors: Xiusi Li, Sékou-Oumar Kaba, Siamak Ravanbakhsh

    Abstract: Causal representation learning (CRL) enhances machine learning models' robustness and generalizability by learning structural causal models associated with data-generating processes. We focus on a family of CRL methods that uses contrastive data pairs in the observable space, generated before and after a random, unknown intervention, to identify the latent causal model. (Brehmer et al., 2022) show… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 15 pages, 4 figures, published in AISTATS 2025

  4. arXiv:2502.03638  [pdf, other

    cond-mat.mtrl-sci cs.LG

    SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models

    Authors: Daniel Levy, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Qiang Zhu, Kin Long Kelvin Lee, Mikhail Galkin, Santiago Miret, Siamak Ravanbakhsh

    Abstract: Generating novel crystalline materials has the potential to lead to advancements in fields such as electronics, energy storage, and catalysis. The defining characteristic of crystals is their symmetry, which plays a central role in determining their physical properties. However, existing crystal generation methods either fail to generate materials that display the symmetries of real-world crystals… ▽ More

    Submitted 23 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: 24 pages, 10 figures, International Conference on Learning Representations (ICLR) 2025

  5. arXiv:2501.07773  [pdf, other

    cs.LG

    Symmetry-Aware Generative Modeling through Learned Canonicalization

    Authors: Kusha Sareen, Daniel Levy, Arnab Kumar Mondal, Sékou-Oumar Kaba, Tara Akhound-Sadegh, Siamak Ravanbakhsh

    Abstract: Generative modeling of symmetric densities has a range of applications in AI for science, from drug discovery to physics simulations. The existing generative modeling paradigm for invariant densities combines an invariant prior with an equivariant generative process. However, we observe that this technique is not necessary and has several drawbacks resulting from the limitations of equivariant net… ▽ More

    Submitted 3 February, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: NeurReps 2024 Workshop Version

  6. arXiv:2312.09016  [pdf, other

    cs.LG stat.ML

    Symmetry Breaking and Equivariant Neural Networks

    Authors: Sékou-Oumar Kaba, Siamak Ravanbakhsh

    Abstract: Using symmetry as an inductive bias in deep learning has been proven to be a principled approach for sample-efficient model design. However, the relationship between symmetry and the imperative for equivariance in neural networks is not always obvious. Here, we analyze a key limitation that arises in equivariant functions: their incapacity to break symmetry at the level of individual data samples.… ▽ More

    Submitted 22 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 14 pages, 2 figures, Symmetry and Geometry in Neural Representations

  7. arXiv:2310.01647  [pdf, other

    cs.LG

    Equivariant Adaptation of Large Pretrained Models

    Authors: Arnab Kumar Mondal, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Sai Rajeswar, Siamak Ravanbakhsh

    Abstract: Equivariant networks are specifically designed to ensure consistent behavior with respect to a set of input transformations, leading to higher sample efficiency and more accurate and robust predictions. However, redesigning each component of prevalent deep neural network architectures to achieve chosen equivariance is a difficult problem and can result in a computationally expensive network during… ▽ More

    Submitted 29 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 17 pages, 6 figures. Accepted to NeurIPS 2023

  8. arXiv:2309.03139  [pdf, other

    cs.LG

    Using Multiple Vector Channels Improves E(n)-Equivariant Graph Neural Networks

    Authors: Daniel Levy, Sékou-Oumar Kaba, Carmelo Gonzales, Santiago Miret, Siamak Ravanbakhsh

    Abstract: We present a natural extension to E(n)-equivariant graph neural networks that uses multiple equivariant vectors per node. We formulate the extension and show that it improves performance across different physical systems benchmark tasks, with minimal differences in runtime or number of parameters. The proposed multichannel EGNN outperforms the standard singlechannel EGNN on N-body charged particle… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  9. arXiv:2211.15420  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Equivariant Networks for Crystal Structures

    Authors: Sékou-Oumar Kaba, Siamak Ravanbakhsh

    Abstract: Supervised learning with deep models has tremendous potential for applications in materials science. Recently, graph neural networks have been used in this context, drawing direct inspiration from models for molecules. However, materials are typically much more structured than molecules, which is a feature that these models do not leverage. In this work, we introduce a class of models that are equ… ▽ More

    Submitted 15 January, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 10 pages, 4 figures + appendix

  10. arXiv:2211.06489  [pdf, other

    cs.LG cs.AI

    Equivariance with Learned Canonicalization Functions

    Authors: Sékou-Oumar Kaba, Arnab Kumar Mondal, Yan Zhang, Yoshua Bengio, Siamak Ravanbakhsh

    Abstract: Symmetry-based neural networks often constrain the architecture in order to achieve invariance or equivariance to a group of transformations. In this paper, we propose an alternative that avoids this architectural constraint by learning to produce canonical representations of the data. These canonicalization functions can readily be plugged into non-equivariant backbone architectures. We offer exp… ▽ More

    Submitted 7 July, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 21 pages, 5 figures

  11. arXiv:2111.14712  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Prediction of Large Magnetic Moment Materials With Graph Neural Networks and Random Forests

    Authors: Sékou-Oumar Kaba, Benjamin Groleau-Paré, Marc-Antoine Gauthier, André-Marie Tremblay, Simon Verret, Chloé Gauvin-Ndiaye

    Abstract: Magnetic materials are crucial components of many technologies that could drive the ecological transition, including electric motors, wind turbine generators and magnetic refrigeration systems. Discovering materials with large magnetic moments is therefore an increasing priority. Here, using state-of-the-art machine learning methods, we scan the Inorganic Crystal Structure Database (ICSD) of hundr… ▽ More

    Submitted 17 April, 2023; v1 submitted 29 November, 2021; originally announced November 2021.

    ACM Class: J.2

    Journal ref: Phys. Rev. Mater., 7:044407, Apr 2023

  12. arXiv:2011.09468  [pdf, other

    cs.LG math.DS stat.ML

    Gradient Starvation: A Learning Proclivity in Neural Networks

    Authors: Mohammad Pezeshki, Sékou-Oumar Kaba, Yoshua Bengio, Aaron Courville, Doina Precup, Guillaume Lajoie

    Abstract: We identify and formalize a fundamental gradient descent phenomenon resulting in a learning proclivity in over-parameterized neural networks. Gradient Starvation arises when cross-entropy loss is minimized by capturing only a subset of features relevant for the task, despite the presence of other predictive features that fail to be discovered. This work provides a theoretical explanation for the e… ▽ More

    Submitted 24 November, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: Proceeding of NeurIPS 2021