Skip to main content

Showing 1–50 of 51 results for author: Gómez-Bombarelli, R

.
  1. arXiv:2504.15370  [pdf, other

    physics.chem-ph cs.LG

    Transferable Learning of Reaction Pathways from Geometric Priors

    Authors: Juno Nam, Miguel Steiner, Max Misterka, Soojung Yang, Avni Singhal, Rafael Gómez-Bombarelli

    Abstract: Identifying minimum-energy paths (MEPs) is crucial for understanding chemical reaction mechanisms but remains computationally demanding. We introduce MEPIN, a scalable machine-learning method for efficiently predicting MEPs from reactant and product configurations, without relying on transition-state geometries or pre-optimized reaction paths during training. The task is defined as predicting devi… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 14 pages, 6 figures; Supporting Information in ancillary files

  2. arXiv:2504.08986  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    High-Throughput Transition-State Searches in Zeolite Nanopores

    Authors: Pau Ferri-Vicedo, Alexander J. Hoffman, Avni Singhal, Rafael Gómez-Bombarelli

    Abstract: Zeolites are important for industrial catalytic processes involving organic molecules. Understanding molecular reaction mechanisms within the confined nanoporous environment can guide the selection of pore topologies, material compositions, and process conditions to maximize activity and selectivity. However, experimental mechanistic studies are time- and resource-intensive, and traditional molecu… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: Main Paper; 12 Pages, 7 Figures. Method; 4 Pages, 1 Figure. Supplementary Information; 123 Pages, 59 Figures

  3. arXiv:2503.17870  [pdf, other

    cond-mat.mtrl-sci cond-mat.stat-mech cs.CE cs.LG

    Accelerating and enhancing thermodynamic simulations of electrochemical interfaces

    Authors: Xiaochen Du, Mengren Liu, Jiayu Peng, Hoje Chun, Alexander Hoffman, Bilge Yildiz, Lin Li, Martin Z. Bazant, Rafael Gómez-Bombarelli

    Abstract: Electrochemical interfaces are crucial in catalysis, energy storage, and corrosion, where their stability and reactivity depend on complex interactions between the electrode, adsorbates, and electrolyte. Predicting stable surface structures remains challenging, as traditional surface Pourbaix diagrams tend to either rely on expert knowledge or costly $\textit{ab initio}$ sampling, and neglect ther… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: 19 pages main text, 5 figures, supplementary information (SI) in ancillary files

  4. arXiv:2502.05970  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.CE physics.chem-ph

    Known Unknowns: Out-of-Distribution Property Prediction in Materials and Molecules

    Authors: Nofit Segal, Aviv Netanyahu, Kevin P. Greenman, Pulkit Agrawal, Rafael Gomez-Bombarelli

    Abstract: Discovery of high-performance materials and molecules requires identifying extremes with property values that fall outside the known distribution. Therefore, the ability to extrapolate to out-of-distribution (OOD) property values is critical for both solid-state materials and molecular design. Our objective is to train predictor models that extrapolate zero-shot to higher ranges than in the traini… ▽ More

    Submitted 9 February, 2025; originally announced February 2025.

    Comments: 10 Pages, 5 figures, supporting information

  5. arXiv:2411.17839  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Learning Mean First Passage Time: Chemical Short-Range Order and Kinetics of Diffusive Relaxation

    Authors: Hoje Chun, Hao Tang, Rafael Gomez-Bombarelli, Ju Li

    Abstract: Long-timescale processes pose significant challenges in atomistic simulations, particularly for phenomena such as diffusion and phase transitions. We present a deep reinforcement learning (DRL)-based computational framework, combined with a temporal difference (TD) learning method, to simulate long-timescale atomic processes of diffusive relaxation. We apply it to study the emergence of chemical s… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

  6. arXiv:2410.17518  [pdf, other

    physics.comp-ph cs.LG

    Univariate Conditional Variational Autoencoder for Morphogenic Patterns Design in Frontal Polymerization-Based Manufacturing

    Authors: Qibang Liu, Pengfei Cai, Diab Abueidda, Sagar Vyas, Seid Koric, Rafael Gomez-Bombarelli, Philippe Geubelle

    Abstract: Under some initial and boundary conditions, the rapid reaction-thermal diffusion process taking place during frontal polymerization (FP) destabilizes the planar mode of front propagation, leading to spatially varying, complex hierarchical patterns in thermoset polymeric materials. Although modern reaction-diffusion models can predict the patterns resulting from unstable FP, the inverse design of p… ▽ More

    Submitted 31 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

  7. arXiv:2410.08833  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci cs.AI

    Symmetry-Constrained Generation of Diverse Low-Bandgap Molecules with Monte Carlo Tree Search

    Authors: Akshay Subramanian, James Damewood, Juno Nam, Kevin P. Greenman, Avni P. Singhal, Rafael Gómez-Bombarelli

    Abstract: Organic optoelectronic materials are a promising avenue for next-generation electronic devices due to their solution processability, mechanical flexibility, and tunable electronic properties. In particular, near-infrared (NIR) sensitive molecules have unique applications in night-vision equipment and biomedical imaging. Molecular engineering has played a crucial role in developing non-fullerene ac… ▽ More

    Submitted 12 December, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

  8. arXiv:2410.07539  [pdf, other

    cond-mat.mtrl-sci cs.AI

    Efficient Generation of Molecular Clusters with Dual-Scale Equivariant Flow Matching

    Authors: Akshay Subramanian, Shuhui Qu, Cheol Woo Park, Sulin Liu, Janghwan Lee, Rafael Gómez-Bombarelli

    Abstract: Amorphous molecular solids offer a promising alternative to inorganic semiconductors, owing to their mechanical flexibility and solution processability. The packing structure of these materials plays a crucial role in determining their electronic and transport properties, which are key to enhancing the efficiency of devices like organic solar cells (OSCs). However, obtaining these optoelectronic p… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  9. arXiv:2410.06264  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Think While You Generate: Discrete Diffusion with Planned Denoising

    Authors: Sulin Liu, Juno Nam, Andrew Campbell, Hannes Stärk, Yilun Xu, Tommi Jaakkola, Rafael Gómez-Bombarelli

    Abstract: Discrete diffusion has achieved state-of-the-art performance, outperforming or approaching autoregressive models on standard benchmarks. In this work, we introduce Discrete Diffusion with Planned Denoising (DDPD), a novel framework that separates the generation process into two models: a planner and a denoiser. At inference time, the planner selects which positions to denoise next by identifying t… ▽ More

    Submitted 9 April, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

    Comments: ICLR 2025

  10. arXiv:2410.01464  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Flow Matching for Accelerated Simulation of Atomic Transport in Materials

    Authors: Juno Nam, Sulin Liu, Gavin Winter, KyuJung Jun, Soojung Yang, Rafael Gómez-Bombarelli

    Abstract: We introduce LiFlow, a generative framework to accelerate molecular dynamics (MD) simulations for crystalline materials that formulates the task as conditional generation of atomic displacements. The model uses flow matching, with a Propagator submodel to generate atomic displacements and a Corrector to locally correct unphysical geometries, and incorporates an adaptive prior based on the Maxwell-… ▽ More

    Submitted 24 February, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  11. arXiv:2409.13851  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Learning Ordering in Crystalline Materials with Symmetry-Aware Graph Neural Networks

    Authors: Jiayu Peng, James Damewood, Jessica Karaguesian, Jaclyn R. Lunger, Rafael Gómez-Bombarelli

    Abstract: Graph convolutional neural networks (GCNNs) have become a machine learning workhorse for screening the chemical space of crystalline materials in fields such as catalysis and energy storage, by predicting properties from structures. Multicomponent materials, however, present a unique challenge since they can exhibit chemical (dis)order, where a given lattice structure can encompass a variety of el… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  12. arXiv:2404.10746  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Interpolation and differentiation of alchemical degrees of freedom in machine learning interatomic potentials

    Authors: Juno Nam, Jiayu Peng, Rafael Gómez-Bombarelli

    Abstract: Machine learning interatomic potentials (MLIPs) have become a workhorse of modern atomistic simulations, and recently published universal MLIPs, pre-trained on large datasets, have demonstrated remarkable accuracy and generalizability. However, the computational cost of MLIPs limits their applicability to chemically disordered systems requiring large simulation cells or to sample-intensive statist… ▽ More

    Submitted 3 December, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  13. arXiv:2402.03753  [pdf, other

    cs.LG physics.comp-ph

    Enhanced sampling of robust molecular datasets with uncertainty-based collective variables

    Authors: Aik Rui Tan, Johannes C. B. Dietschreit, Rafael Gomez-Bombarelli

    Abstract: Generating a data set that is representative of the accessible configuration space of a molecular system is crucial for the robustness of machine learned interatomic potentials (MLIP). However, the complexity of molecular systems, characterized by intricate potential energy surfaces (PESs) with numerous local minima and energy barriers, presents a significant challenge. Traditional methods of data… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 13 pages, 4 figures, 10 pages of Supplementary Information

  14. arXiv:2402.01542  [pdf, other

    physics.chem-ph cs.LG q-bio.BM

    Learning Collective Variables with Synthetic Data Augmentation through Physics-Inspired Geodesic Interpolation

    Authors: Soojung Yang, Juno Nam, Johannes C. B. Dietschreit, Rafael Gómez-Bombarelli

    Abstract: In molecular dynamics simulations, rare events, such as protein folding, are typically studied using enhanced sampling techniques, most of which are based on the definition of a collective variable (CV) along which acceleration occurs. Obtaining an expressive CV is crucial, but often hindered by the lack of information about the particular event, e.g., the transition from unfolded to folded confor… ▽ More

    Submitted 19 July, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  15. arXiv:2307.01705  [pdf, other

    cond-mat.mtrl-sci

    Learning a reactive potential for silica-water through uncertainty attribution

    Authors: Swagata Roy, Johannes P. Dürholt, Thomas S. Asche, Federico Zipoli, Rafael Gómez-Bombarelli

    Abstract: The reactivity of silicates in an aqueous solution is relevant to various chemistries ranging from silicate minerals in geology, to the C-S-H phase in cement, nanoporous zeolite catalysts, or highly porous precipitated silica. While simulations of chemical reactions can provide insight at the molecular level, balancing accuracy and scale in reactive simulations in the condensed phase is a challeng… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 26 pages, 4 figures, 1 supplementary figure

  16. arXiv:2305.19930  [pdf, other

    cond-mat.mtrl-sci

    Atom-by-atom design of metal oxide catalysts for the oxygen evolution reaction with machine learning

    Authors: Jaclyn R. Lunger, Jessica Karaguesian, Hoje Chun, Jiayu Peng, Yitong Tseo, Chung Hsuan Shan, Byungchan Han, Yang Shao-Horn, Rafael Gomez-Bombarelli

    Abstract: Green hydrogen production is crucial for a sustainable future, but current catalysts for the oxygen evolution reaction (OER) suffer from slow kinetics, despite many efforts to produce optimal designs, particularly through the calculation of descriptors for activity. In this study, we develop a dataset of density functional theory calculations of bulk and surface perovskite oxides, and adsorption e… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  17. arXiv:2305.12896  [pdf, other

    physics.chem-ph

    Effect of framework composition and NH3 on the diffusion of Cu+ in Cu-CHA catalysts predicted by machine-learning accelerated molecular dynamics

    Authors: Reisel Millan, Estefania Bello-Jurado, Manual Moliner, Mercedes Boronat, Rafael Gomez-Bombarelli

    Abstract: Cu-exchanged zeolites rely on mobile solvated Cu+ cations for their catalytic activity, but the role of framework composition on transport is not fully understood. Ab initio molecular dynamics simulations can provide quantitative atomistic insight but are too computationally expensive to explore large length- and time-scales or diverse compositions. We report a machine-learning interatomic potenti… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  18. arXiv:2305.07251  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Machine-learning-accelerated simulations to enable automatic surface reconstruction

    Authors: Xiaochen Du, James K. Damewood, Jaclyn R. Lunger, Reisel Millan, Bilge Yildiz, Lin Li, Rafael Gómez-Bombarelli

    Abstract: Understanding material surfaces and interfaces is vital in applications like catalysis or electronics. By combining energies from electronic structure with statistical mechanics, ab initio simulations can in principle predict the structure of material surfaces as a function of thermodynamic variables. However, accurate energy simulations are prohibitive when coupled to the vast phase space that mu… ▽ More

    Submitted 21 November, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: 30 pages main, 15 figures/tables, 5 pages supplementary

    Journal ref: Nat Comput Sci 2023, 3, 1034

  19. arXiv:2305.01806  [pdf, other

    cond-mat.mtrl-sci physics.chem-ph

    Data-Driven, Physics-Informed Descriptors of Cation Ordering in Multicomponent Oxides

    Authors: Jiayu Peng, James Damewood, Rafael Gómez-Bombarelli

    Abstract: The structural tunability and compositional diversity of multicomponent perovskite oxides have enabled their various applications, including catalysis and electronics. The cation ordering in these oxides, ranging from disordered (i.e., high-entropy) to ordered (e.g., rocksalt), profoundly influences their properties. While computational design tools can typically predict properties associated with… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  20. arXiv:2305.01754  [pdf, other

    cs.LG physics.chem-ph

    Single-model uncertainty quantification in neural network potentials does not consistently outperform model ensembles

    Authors: Aik Rui Tan, Shingo Urata, Samuel Goldman, Johannes C. B. Dietschreit, Rafael Gómez-Bombarelli

    Abstract: Neural networks (NNs) often assign high confidence to their predictions, even for points far out-of-distribution, making uncertainty quantification (UQ) a challenge. When they are employed to model interatomic potentials in materials systems, this problem leads to unphysical structures that disrupt simulations, or to biased statistics and dynamics that do not reflect the true physics. Differentiab… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 27 pages, 4 figures, Supporting Information (22 pages)

  21. arXiv:2304.10676  [pdf, other

    physics.chem-ph

    Entropy and Energy Profiles of Chemical Reactions

    Authors: Johannes C. B. Dietschreit, Dennis J. Diestler, Rafael Gómez-Bombarelli

    Abstract: The description of chemical processes at the molecular level is often facilitated by use of reaction coordinates, or collective variables (CVs). The CV measures the progress of the reaction and allows the construction of profiles that track the evolution of a specific property as the reaction progresses. Whereas CVs are routinely used, especially alongside enhanced sampling techniques, links betwe… ▽ More

    Submitted 25 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 24 pages, 5 figures, 3 tables

  22. arXiv:2303.08272  [pdf, other

    physics.chem-ph cs.LG

    Automated patent extraction powers generative modeling in focused chemical spaces

    Authors: Akshay Subramanian, Kevin P. Greenman, Alexis Gervaix, Tzuhsiung Yang, Rafael Gómez-Bombarelli

    Abstract: Deep generative models have emerged as an exciting avenue for inverse molecular design, with progress coming from the interplay between training algorithms and molecular representations. One of the key challenges in their applicability to materials science and chemistry has been the lack of access to sizeable training datasets with property labels. Published patents contain the first disclosure of… ▽ More

    Submitted 24 July, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Digital Discovery (2023)

  23. arXiv:2303.01569  [pdf, other

    cs.LG q-bio.BM

    Chemically Transferable Generative Backmapping of Coarse-Grained Proteins

    Authors: Soojung Yang, Rafael Gómez-Bombarelli

    Abstract: Coarse-graining (CG) accelerates molecular simulations of protein dynamics by simulating sets of atoms as singular beads. Backmapping is the opposite operation of bringing lost atomistic details back from the CG representation. While machine learning (ML) has produced accurate and efficient CG simulations of proteins, fast and reliable backmapping remains a challenge. Rule-based methods produce po… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 18 pages

  24. arXiv:2302.11490  [pdf, other

    physics.chem-ph

    Mapping the space of photoswitchable ligands and photodruggable proteins with computational modeling

    Authors: Simon Axelrod, Eugene Shakhnovich, Rafael Gómez-Bombarelli

    Abstract: Light-activated drugs are a promising way to localize biological activity and minimize side effects. However, their development is complicated by the numerous photophysical and biological properties that must be simultaneously optimized. To accelerate the design of photoactive drugs, we describe a procedure that combines ligand-protein docking with chemical property prediction based on machine lea… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  25. arXiv:2301.08813  [pdf, other

    cond-mat.mtrl-sci

    Representations of Materials for Machine Learning

    Authors: James Damewood, Jessica Karaguesian, Jaclyn R. Lunger, Aik Rui Tan, Mingrou Xie, Jiayu Peng, Rafael Gómez-Bombarelli

    Abstract: High-throughput data generation methods and machine learning (ML) algorithms have given rise to a new era of computational materials science by learning relationships among composition, structure, and properties and by exploiting such relations for design. However, to build these connections, materials data must be translated into a numerical form, called a representation, that can be processed by… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: 20 pages, 5 figures, To Appear in Annual Review of Materials Research 53

  26. arXiv:2301.03480  [pdf, other

    physics.chem-ph cs.LG

    Differentiable Simulations for Enhanced Sampling of Rare Events

    Authors: Martin Šípka, Johannes C. B. Dietschreit, Lukáš Grajciar, Rafael Gómez-Bombarelli

    Abstract: Simulating rare events, such as the transformation of a reactant into a product in a chemical reaction typically requires enhanced sampling techniques that rely on heuristically chosen collective variables (CVs). We propose using differentiable simulations (DiffSim) for the discovery and enhanced sampling of chemical transformations without a need to resort to preselected CVs, using only a distanc… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

  27. arXiv:2211.05713  [pdf, other

    cond-mat.mtrl-sci

    Simulations with machine learning potentials identify the ion conduction mechanism mediating non-Arrhenius behavior in LGPS

    Authors: Gavin Winter, Rafael Gómez-Bombarelli

    Abstract: Li$_{10}$Ge(PS$_6$)$_2$ (LGPS) is a highly concentrated solid electrolyte, in which Coulombic repulsion between neighboring cations is hypothesized as the underlying reason for concerted ion hopping, a mechanism common among superionic conductors such as Li$_7$La$_3$Zr$_2$O$_{12}$ (LLZO) and Li$_{1.3}$Al$_{0.3}$Ti$_{1.7}$(PO$_4$)$_3$ (LATP). While first principles simulations using molecular dynam… ▽ More

    Submitted 27 November, 2022; v1 submitted 10 November, 2022; originally announced November 2022.

  28. arXiv:2210.07237  [pdf, other

    physics.comp-ph cs.LG physics.chem-ph

    Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations

    Authors: Xiang Fu, Zhenghao Wu, Wujie Wang, Tian Xie, Sinan Keten, Rafael Gomez-Bombarelli, Tommi Jaakkola

    Abstract: Molecular dynamics (MD) simulation techniques are widely used for various natural science applications. Increasingly, machine learning (ML) force field (FF) models begin to replace ab-initio simulations by predicting forces directly from atomic structures. Despite significant progress in this area, such techniques are primarily benchmarked by their force/energy prediction errors, even though the p… ▽ More

    Submitted 26 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 31 pages, 18 figures

    Journal ref: Transactions on Machine Learning Research, 2023

  29. arXiv:2209.07679  [pdf, other

    physics.chem-ph cs.LG physics.comp-ph

    Learning Pair Potentials using Differentiable Simulations

    Authors: Wujie Wang, Zhenghao Wu, Rafael Gómez-Bombarelli

    Abstract: Learning pair interactions from experimental or simulation data is of great interest for molecular simulations. We propose a general stochastic method for learning pair interactions from data using differentiable simulations (DiffSim). DiffSim defines a loss function based on structural observables, such as the radial distribution function, through molecular dynamics (MD) simulations. The interact… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 12 pages, 10 figures

  30. arXiv:2208.05039  [pdf

    cond-mat.mtrl-sci

    Examining graph neural networks for crystal structures: limitations and opportunities for capturing periodicity

    Authors: Sheng Gong, Tian Xie, Yang Shao-Horn, Rafael Gomez-Bombarelli, Jeffrey C. Grossman

    Abstract: Historically, materials informatics has relied on human-designed descriptors of materials structures. In recent years, graph neural networks (GNNs) have been proposed for learning representations of crystal structures from data end-to-end producing vectorial embeddings that are optimized for downstream prediction tasks. However, a systematic scheme is lacking to analyze and understand the limits o… ▽ More

    Submitted 27 March, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

  31. arXiv:2207.11592  [pdf, other

    physics.chem-ph cs.LG

    Thermal half-lives of azobenzene derivatives: virtual screening based on intersystem crossing using a machine learning potential

    Authors: Simon Axelrod, Eugene Shakhnovich, Rafael Gomez-Bombarelli

    Abstract: Molecular photoswitches are the foundation of light-activated drugs. A key photoswitch is azobenzene, which exhibits trans-cis isomerism in response to light. The thermal half-life of the cis isomer is of crucial importance, since it controls the duration of the light-induced biological effect. Here we introduce a computational tool for predicting the thermal half-lives of azobenzene derivatives.… ▽ More

    Submitted 12 January, 2023; v1 submitted 23 July, 2022; originally announced July 2022.

  32. arXiv:2206.02893  [pdf, other

    physics.chem-ph physics.comp-ph

    From Free-Energy Profiles to Activation Free Energies

    Authors: Johannes C. B. Dietschreit, Dennis J. Diestler, Andreas Hulm, Christian Ochsenfeld, Rafael Gómez-Bombarelli

    Abstract: Given a chemical reaction going from reactant (R) to the product (P) on a potential energy surface (PES) and a collective variable (CV) that discriminates between R and P, one can define a free-energy profile (FEP) as the logarithm of the marginal Boltzmann distribution of the CV. The FEP is not a true free energy, however, it is common to treat the FEP as the free-energy analog of the minimum ene… ▽ More

    Submitted 20 April, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 19 pages, 11 figures

    Journal ref: J. Chem. Phys. 157, 084113 (2022)

  33. arXiv:2201.12176  [pdf, other

    cs.LG physics.chem-ph physics.comp-ph

    Generative Coarse-Graining of Molecular Conformations

    Authors: Wujie Wang, Minkai Xu, Chen Cai, Benjamin Kurt Miller, Tess Smidt, Yusu Wang, Jian Tang, Rafael Gómez-Bombarelli

    Abstract: Coarse-graining (CG) of molecular simulations simplifies the particle representation by grouping selected atoms into pseudo-beads and drastically accelerates simulation. However, such CG procedure induces information losses, which makes accurate backmapping, i.e., restoring fine-grained (FG) coordinates from CG coordinates, a long-standing challenge. Inspired by the recent progress in generative m… ▽ More

    Submitted 16 June, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: 23 pages, 11 figures

    Journal ref: International Conference on Machine Learning (ICML), 2022

  34. arXiv:2111.07452  [pdf, other

    cond-mat.mtrl-sci

    Graph theory-based structural analysis on density anomaly of silica glass

    Authors: Aik Rui Tan, Shingo Urata, Masatsugu Yamada, Rafael Gómez-Bombarelli

    Abstract: Analyzing the atomic structure of glassy materials is a tremendous challenge both experimentally and computationally, and the lack of direct, detailed insights into glass structure hinders our ability to navigate structure-property relationships. For instance, the structural origin of the density anomaly in silica glasses - the negative thermal expansion coefficient - is still poorly understood. S… ▽ More

    Submitted 23 August, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

    Comments: 12 pages, 7 figures, supporting information

  35. arXiv:2108.04879  [pdf, other

    physics.chem-ph cs.LG

    Excited state, non-adiabatic dynamics of large photoswitchable molecules using a chemically transferable machine learning potential

    Authors: Simon Axelrod, Eugene Shakhnovich, Rafael Gómez-Bombarelli

    Abstract: Light-induced chemical processes are ubiquitous in nature and have widespread technological applications. For example, photoisomerization can allow a drug with a photo-switchable scaffold such as azobenzene to be activated with light. In principle, photoswitches with desired photophysical properties like high isomerization quantum yields can be identified through virtual screening with reactive si… ▽ More

    Submitted 16 March, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

  36. arXiv:2107.05109  [pdf, other

    cond-mat.stat-mech

    Sampling Lattices in Semi-Grand Canonical Ensemble with Autoregressive Machine Learning

    Authors: James Damewood, Daniel Schwalbe-Koda, Rafael Gomez-Bombarelli

    Abstract: Calculating thermodynamic potentials and observables efficiently and accurately is key for the application of statistical mechanics simulations to materials science. However, naive Monte Carlo approaches, on which such calculations are often dependent, struggle to scale to complex materials in many state-of-the-art disciplines such as the design of high entropy alloys or multicomponent catalysts.… ▽ More

    Submitted 13 July, 2021; v1 submitted 11 July, 2021; originally announced July 2021.

    Comments: 29 pages, 28 figures

  37. arXiv:2105.07246  [pdf, other

    cs.LG q-bio.BM

    An End-to-End Framework for Molecular Conformation Generation via Bilevel Programming

    Authors: Minkai Xu, Wujie Wang, Shitong Luo, Chence Shi, Yoshua Bengio, Rafael Gomez-Bombarelli, Jian Tang

    Abstract: Predicting molecular conformations (or 3D structures) from molecular graphs is a fundamental problem in many applications. Most existing approaches are usually divided into two steps by first predicting the distances between atoms and then generating a 3D structure through optimizing a distance geometry problem. However, the distances predicted with such two-stage approaches may not be able to con… ▽ More

    Submitted 2 June, 2021; v1 submitted 15 May, 2021; originally announced May 2021.

    Comments: Accepted by ICML 2021

  38. arXiv:2103.02565  [pdf, other

    cs.LG cs.CY q-bio.BM q-bio.QM stat.ML

    GLAMOUR: Graph Learning over Macromolecule Representations

    Authors: Somesh Mohapatra, Joyce An, Rafael Gómez-Bombarelli

    Abstract: The near-infinite chemical diversity of natural and artificial macromolecules arises from the vast range of possible component monomers, linkages, and polymers topologies. This enormous variety contributes to the ubiquity and indispensability of macromolecules but hinders the development of general machine learning methods with macromolecules as input. To address this, we developed GLAMOUR, a fram… ▽ More

    Submitted 23 August, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Main text: 4 pages, 2 figures; Appendix: 33 pages, 46 figures, 7 in-text tables, 4 supplementary tables

    ACM Class: J.2.4; J.3.1

  39. arXiv:2101.11588  [pdf, other

    cs.LG cond-mat.stat-mech physics.chem-ph

    Differentiable sampling of molecular geometries with uncertainty-based adversarial attacks

    Authors: Daniel Schwalbe-Koda, Aik Rui Tan, Rafael Gómez-Bombarelli

    Abstract: Neural network (NN) interatomic potentials provide fast prediction of potential energy surfaces, closely matching the accuracy of the electronic structure methods used to produce the training data. However, NN predictions are only reliable within well-learned training domains, and show volatile behavior when extrapolating. Uncertainty quantification approaches can flag atomic configurations for wh… ▽ More

    Submitted 28 March, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: 12 pages, 4 figures, supporting information

    Journal ref: Nat. Commun. 12, 5104 (2021)

  40. arXiv:2101.05339  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.LG

    Accelerating amorphous polymer electrolyte screening by learning to reduce errors in molecular dynamics simulated properties

    Authors: Tian Xie, Arthur France-Lanord, Yanming Wang, Jeffrey Lopez, Michael Austin Stolberg, Megan Hill, Graham Michael Leverick, Rafael Gomez-Bombarelli, Jeremiah A. Johnson, Yang Shao-Horn, Jeffrey C. Grossman

    Abstract: Polymer electrolytes are promising candidates for the next generation lithium-ion battery technology. Large scale screening of polymer electrolytes is hindered by the significant cost of molecular dynamics (MD) simulation in amorphous systems: the amorphous structure of polymers requires multiple, repeated sampling to reduce noise and the slow relaxation requires long simulation time for convergen… ▽ More

    Submitted 15 March, 2022; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 29 pages, 6 figures + supplementary information

    Journal ref: Nature communications 13.1 (2022): 1-10

  41. arXiv:2012.08452  [pdf, other

    cs.LG physics.chem-ph

    Molecular machine learning with conformer ensembles

    Authors: Simon Axelrod, Rafael Gomez-Bombarelli

    Abstract: Virtual screening can accelerate drug discovery by identifying promising candidates for experimental evaluation. Machine learning is a powerful method for screening, as it can learn complex structure-property relationships from experimental data and make rapid predictions over virtual libraries. Molecules inherently exist as a three-dimensional ensemble and their biological action typically occurs… ▽ More

    Submitted 18 February, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

  42. arXiv:2007.14144  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci

    Temperature-transferable coarse-graining of ionic liquids with dual graph convolutional neural networks

    Authors: Jurgis Ruza, Wujie Wang, Daniel Schwalbe-Koda, Simon Axelrod, William H. Harris, Rafael Gomez-Bombarelli

    Abstract: Computer simulations can provide mechanistic insight into ionic liquids (ILs) and predict the properties of experimentally unrealized ion combinations. However, ILs suffer from a particularly large disparity in the time scales of atomistic and ensemble motion. Coarse-grained models are therefore used in place of costly atomistic simulations, allowing simulation of longer time scales and larger sys… ▽ More

    Submitted 8 November, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: 9 pages, 6 figures, 2 supplementary material pages

    Journal ref: The Journal of Chemical Physics 153.16 (2020): 164501

  43. arXiv:2006.05531  [pdf, other

    physics.comp-ph cs.LG

    GEOM: Energy-annotated molecular conformations for property prediction and molecular generation

    Authors: Simon Axelrod, Rafael Gomez-Bombarelli

    Abstract: Machine learning (ML) outperforms traditional approaches in many molecular design tasks. ML models usually predict molecular properties from a 2D chemical graph or a single 3D structure, but neither of these representations accounts for the ensemble of 3D conformers that are accessible to a molecule. Property prediction could be improved by using conformer ensembles as input, but there is no large… ▽ More

    Submitted 9 February, 2022; v1 submitted 9 June, 2020; originally announced June 2020.

  44. arXiv:2003.00868  [pdf, other

    physics.comp-ph cs.LG physics.chem-ph physics.data-an stat.ML

    Differentiable Molecular Simulations for Control and Learning

    Authors: Wujie Wang, Simon Axelrod, Rafael Gómez-Bombarelli

    Abstract: Molecular dynamics simulations use statistical mechanics at the atomistic scale to enable both the elucidation of fundamental mechanisms and the engineering of matter for desired tasks. The behavior of molecular systems at the microscale is typically simulated with differential equations parameterized by a Hamiltonian, or energy function. The Hamiltonian describes the state of the system and its i… ▽ More

    Submitted 23 December, 2020; v1 submitted 26 February, 2020; originally announced March 2020.

    Comments: 14 pages, 6 figures

  45. arXiv:1907.01632  [pdf, other

    cs.LG physics.chem-ph stat.ML

    Generative Models for Automatic Chemical Design

    Authors: Daniel Schwalbe-Koda, Rafael Gómez-Bombarelli

    Abstract: Materials discovery is decisive for tackling urgent challenges related to energy, the environment, health care and many others. In chemistry, conventional methodologies for innovation usually rely on expensive and incremental strategies to optimize properties from molecular structures. On the other hand, inverse approaches map properties to structures, thus expediting the design of novel useful co… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

  46. arXiv:1812.02706  [pdf, other

    physics.chem-ph cs.LG stat.ML

    Coarse-Graining Auto-Encoders for Molecular Dynamics

    Authors: Wujie Wang, Rafael Gómez-Bombarelli

    Abstract: Molecular dynamics simulations provide theoretical insight into the microscopic behavior of materials in condensed phase and, as a predictive tool, enable computational design of new compounds. However, because of the large temporal and spatial scales involved in thermodynamic and kinetic phenomena in materials, atomistic simulations are often computationally unfeasible. Coarse-graining methods al… ▽ More

    Submitted 27 March, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: 8 pages, 6 figures

    Journal ref: npj Comput Mater 5, 125 (2019)

  47. Graph similarity drives zeolite diffusionless transformations and intergrowth

    Authors: Daniel Schwalbe-Koda, Zach Jensen, Elsa Olivetti, Rafael Gomez-Bombarelli

    Abstract: Predicting and directing polymorphic transformations is a critical challenge in zeolite synthesis. Although interzeolite transformations enable selective crystallization, their design lacks predictions to connect framework similarity and experimental observations. Here, computational and theoretical tools are combined to data-mine, analyze and explain interzeolite relations. It is observed that bu… ▽ More

    Submitted 10 March, 2021; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: Pre-peer review manuscript; 8 pages, 3 figures

  48. arXiv:1610.02415  [pdf, other

    cs.LG physics.chem-ph

    Automatic chemical design using a data-driven continuous representation of molecules

    Authors: Rafael Gómez-Bombarelli, Jennifer N. Wei, David Duvenaud, José Miguel Hernández-Lobato, Benjamín Sánchez-Lengeling, Dennis Sheberla, Jorge Aguilera-Iparraguirre, Timothy D. Hirzel, Ryan P. Adams, Alán Aspuru-Guzik

    Abstract: We report a method to convert discrete representations of molecules to and from a multidimensional continuous representation. This model allows us to generate new molecules for efficient exploration and optimization through open-ended spaces of chemical compounds. A deep neural network was trained on hundreds of thousands of existing chemical structures to construct three coupled functions: an enc… ▽ More

    Submitted 5 December, 2017; v1 submitted 7 October, 2016; originally announced October 2016.

    Comments: 26 pages, 8 figures

  49. arXiv:1511.06302  [pdf, other

    quant-ph physics.chem-ph

    Photocell Optimisation Using Dark State Protection

    Authors: Amir Fruchtman, Rafael Gómez-Bombarelli, Brendon W. Lovett, Erik M. Gauger

    Abstract: Conventional photocells suffer a fundamental efficiency threshold imposed by the principle of detailed balance, reflecting the fact that good absorbers must necessarily also be fast emitters. This limitation can be overcome by `parking' the energy of an absorbed photon in a dark state which neither absorbs nor emits light. Here we argue that suitable dark states occur naturally as a consequence of… ▽ More

    Submitted 5 August, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: 14 pages, 16 figures, comments are very welcome!

    Journal ref: Phys. Rev. Lett. 117, 203603 (2016)

  50. arXiv:1509.09292  [pdf, other

    cs.LG cs.NE stat.ML

    Convolutional Networks on Graphs for Learning Molecular Fingerprints

    Authors: David Duvenaud, Dougal Maclaurin, Jorge Aguilera-Iparraguirre, Rafael Gómez-Bombarelli, Timothy Hirzel, Alán Aspuru-Guzik, Ryan P. Adams

    Abstract: We introduce a convolutional neural network that operates directly on graphs. These networks allow end-to-end learning of prediction pipelines whose inputs are graphs of arbitrary size and shape. The architecture we present generalizes standard molecular feature extraction methods based on circular fingerprints. We show that these data-driven features are more interpretable, and have better predic… ▽ More

    Submitted 3 November, 2015; v1 submitted 30 September, 2015; originally announced September 2015.

    Comments: 9 pages, 5 figures. To appear in Neural Information Processing Systems (NIPS)