-
Optimal message passing for molecular prediction is simple, attentive and spatial
Authors:
Alma C. Castaneda-Leautaud,
Rommie E. Amaro
Abstract:
Strategies to improve the predicting performance of Message-Passing Neural-Networks for molecular property predictions can be achieved by simplifying how the message is passed and by using descriptors that capture multiple aspects of molecular graphs. In this work, we designed model architectures that achieved state-of-the-art performance, surpassing more complex models such as those pre-trained o…
▽ More
Strategies to improve the predicting performance of Message-Passing Neural-Networks for molecular property predictions can be achieved by simplifying how the message is passed and by using descriptors that capture multiple aspects of molecular graphs. In this work, we designed model architectures that achieved state-of-the-art performance, surpassing more complex models such as those pre-trained on external databases. We assessed dataset diversity to complement our performance results, finding that structural diversity influences the need for additional components in our MPNNs and feature sets.
In most datasets, our best architecture employs bidirectional message-passing with an attention mechanism, applied to a minimalist message formulation that excludes self-perception, highlighting that relatively simpler models, compared to classical MPNNs, yield higher class separability. In contrast, we found that convolution normalization factors do not benefit the predictive power in all the datasets tested. This was corroborated in both global and node-level outputs. Additionally, we analyzed the influence of both adding spatial features and working with 3D graphs, finding that 2D molecular graphs are sufficient when complemented with appropriately chosen 3D descriptors. This approach not only preserves predictive performance but also reduces computational cost by over 50%, making it particularly advantageous for high-throughput screening campaigns.
△ Less
Submitted 13 September, 2025;
originally announced September 2025.
-
Mining for Potent Inhibitors through Artificial Intelligence and Physics: A Unified Methodology for Ligand Based and Structure Based Drug Design
Authors:
Jie Li,
Oufan Zhang,
Yingze Wang,
Kunyang Sun,
Xingyi Guan,
Dorian Bagni,
Mojtaba Haghighatlari,
Fiona L. Kearns,
Conor Parks,
Rommie E. Amaro,
Teresa Head-Gordon
Abstract:
The viability of a new drug molecule is a time and resource intensive task that makes computer-aided assessments a vital approach to rapid drug discovery. Here we develop a machine learning algorithm, iMiner, that generates novel inhibitor molecules for target proteins by combining deep reinforcement learning with real-time 3D molecular docking using AutoDock Vina, thereby simultaneously creating…
▽ More
The viability of a new drug molecule is a time and resource intensive task that makes computer-aided assessments a vital approach to rapid drug discovery. Here we develop a machine learning algorithm, iMiner, that generates novel inhibitor molecules for target proteins by combining deep reinforcement learning with real-time 3D molecular docking using AutoDock Vina, thereby simultaneously creating chemical novelty while constraining molecules for shape and molecular compatibility with target active sites. Moreover, through the use of various types of reward functions, we can generate new molecules that are chemically similar to a target ligand, which can be grown from known protein bound fragments, as well as to create molecules that enforce interactions with target residues in the protein active site. The iMiner algorithm is embedded in a composite workflow that filters out Pan-assay interference compounds, Lipinski rule violations, and poor synthetic accessibility, with options for cross-validation against other docking scoring functions and automation of a molecular dynamics simulation to measure pose stability. Because our approach only relies on the structure of the target protein, iMiner can be easily adapted for future development of other inhibitors or small molecule therapeutics of any target protein.
△ Less
Submitted 10 January, 2024; v1 submitted 4 October, 2021;
originally announced October 2021.
-
An Open Source Mesh Generation Platform for Biophysical Modeling Using Realistic Cellular Geometries
Authors:
Christopher T. Lee,
Justin G. Laughlin,
John B. Moody,
Rommie E. Amaro,
J. Andrew McCammon,
Michael J. Holst,
Padmini Rangamani
Abstract:
Advances in imaging methods such as electron microscopy, tomography and other modalities are enabling high-resolution reconstructions of cellular and organelle geometries. Such advances pave the way for using these geometries for biophysical and mathematical modeling once these data can be represented as a geometric mesh, which, when carefully conditioned, enables the discretization and solution o…
▽ More
Advances in imaging methods such as electron microscopy, tomography and other modalities are enabling high-resolution reconstructions of cellular and organelle geometries. Such advances pave the way for using these geometries for biophysical and mathematical modeling once these data can be represented as a geometric mesh, which, when carefully conditioned, enables the discretization and solution of partial differential equations. In this study, we outline the steps for a naïve user to approach GAMer 2, a mesh generation code written in C++ designed to convert structural datasets to realistic geometric meshes, while preserving the underlying shapes. We present two example cases, 1) mesh generation at the subcellular scale as informed by electron tomography, and 2) meshing a protein with structure from x-ray crystallography. We further demonstrate that the meshes generated by GAMer are suitable for use with numerical methods. Together, this collection of libraries and tools simplifies the process of constructing realistic geometric meshes from structural biology data.
△ Less
Submitted 16 December, 2019; v1 submitted 10 September, 2019;
originally announced September 2019.
-
3D mesh processing using GAMer 2 to enable reaction-diffusion simulations in realistic cellular geometries
Authors:
Christopher T. Lee,
Justin G. Laughlin,
Nils Angliviel de La Beaumelle,
Rommie E. Amaro,
J. Andrew McCammon,
Ravi Ramamoorthi,
Michael J. Holst,
Padmini Rangamani
Abstract:
Recent advances in electron microscopy have enabled the imaging of single cells in 3D at nanometer length scale resolutions. An uncharted frontier for in silico biology is the ability to simulate cellular processes using these observed geometries. Enabling such simulations requires watertight meshing of electron micrograph images into 3D volume meshes, which can then form the basis of computer sim…
▽ More
Recent advances in electron microscopy have enabled the imaging of single cells in 3D at nanometer length scale resolutions. An uncharted frontier for in silico biology is the ability to simulate cellular processes using these observed geometries. Enabling such simulations requires watertight meshing of electron micrograph images into 3D volume meshes, which can then form the basis of computer simulations of such processes using numerical techniques such as the Finite Element Method. In this paper, we describe the use of our recently rewritten mesh processing software, GAMer 2, to bridge the gap between poorly conditioned meshes generated from segmented micrographs and boundary marked tetrahedral meshes which are compatible with simulation. We demonstrate the application of a workflow using GAMer 2 to a series of electron micrographs of neuronal dendrite morphology explored at three different length scales and show that the resulting meshes are suitable for finite element simulations. This work is an important step towards making physical simulations of biological processes in realistic geometries routine. Innovations in algorithms to reconstruct and simulate cellular length scale phenomena based on emerging structural data will enable realistic physical models and advance discovery at the interface of geometry and cellular processes. We posit that a new frontier at the intersection of computational technologies and single cell biology is now open.
△ Less
Submitted 17 December, 2019; v1 submitted 29 January, 2019;
originally announced January 2019.