Search | arXiv e-print repository

GEOM-Drugs Revisited: Toward More Chemically Accurate Benchmarks for 3D Molecule Generation

Authors: Filipp Nikitin, Ian Dunn, David Ryan Koes, Olexandr Isayev

Abstract: Deep generative models have shown significant promise in generating valid 3D molecular structures, with the GEOM-Drugs dataset serving as a key benchmark. However, current evaluation protocols suffer from critical flaws, including incorrect valency definitions, bugs in bond order calculations, and reliance on force fields inconsistent with the reference data. In this work, we revisit GEOM-Drugs an… ▽ More Deep generative models have shown significant promise in generating valid 3D molecular structures, with the GEOM-Drugs dataset serving as a key benchmark. However, current evaluation protocols suffer from critical flaws, including incorrect valency definitions, bugs in bond order calculations, and reliance on force fields inconsistent with the reference data. In this work, we revisit GEOM-Drugs and propose a corrected evaluation framework: we identify and fix issues in data preprocessing, construct chemically accurate valency tables, and introduce a GFN2-xTB-based geometry and energy benchmark. We retrain and re-evaluate several leading models under this framework, providing updated performance metrics and practical recommendations for future benchmarking. Our results underscore the need for chemically rigorous evaluation practices in 3D molecular generation. Our recommended evaluation methods and GEOM-Drugs processing scripts are available at https://github.com/isayevlab/geom-drugs-3dgen-evaluation. △ Less

Submitted 15 May, 2025; v1 submitted 30 April, 2025; originally announced May 2025.

arXiv:2504.13384 [pdf, other]

Formation of Magnetic Switchbacks via expanding Alfvén Waves

Authors: Trevor A. Bowen, Alfred Mallet, Corina I. Dunn, Jonathan Squire, Benjamin D. G. Chandran, Romain Meyrand, Nooshin Davis, Thierry Dudok de Wit, Stuart D. Bale, Samuel T. Badman, Nikos Sioulas

Abstract: Context. Large-amplitude inversions of the solar wind's interplanetary magnetic field have long been documented; however, observations from the Parker Solar Probe (PSP) mission have renewed interest in this phenomenon as such features, often termed switchbacks, may constrain both the sources of the solar wind as well as in-situ nonlinear dynamics and turbulent heating. Aims. We aim to show that ma… ▽ More Context. Large-amplitude inversions of the solar wind's interplanetary magnetic field have long been documented; however, observations from the Parker Solar Probe (PSP) mission have renewed interest in this phenomenon as such features, often termed switchbacks, may constrain both the sources of the solar wind as well as in-situ nonlinear dynamics and turbulent heating. Aims. We aim to show that magnetic field fluctuations in the solar wind are consistent with Alfvénic fluctuations that naturally form switchback inversions in the magnetic field through expansion effects. Methods. We examine PSP observations of the evolution of a single stream of solar wind in a radial scan from PSP's tenth perihelion encounter from approximately 15-50 solar radii. We study the growth and radial scaling of normalized fluctuation amplitudes in the magnetic field, $δB/B$, within the framework of spherical polarization. We compare heating rates computed via outer-scale decay from consideration of wave-action to proton heating rates empirically observed through considering adiabatic expansion. Results. We find that the magnetic field fluctuations are largely spherically polarized and that the normalized amplitudes of the magnetic field, $δB/B$, increases with amplitude. The growth of the magnetic field amplitude leads to switchback inversions in the magnetic field. While the amplitudes do not grow as fast as predicted by the conservation of wave action, the deviation from the expected scaling yields an effective heating rate, which is close to the empirically observed proton heating rate. Conclusions. The observed scaling of fluctuation amplitudes is largely consistent with a picture of expanding Alfvén waves that seed turbulence leading to dissipation. The expansion of the waves leads to the growth of wave-amplitudes, resulting in the formation of switchbacks. △ Less

Submitted 17 April, 2025; originally announced April 2025.

arXiv:2411.16644 [pdf, other]

Exploring Discrete Flow Matching for 3D De Novo Molecule Generation

Authors: Ian Dunn, David R. Koes

Abstract: Deep generative models that produce novel molecular structures have the potential to facilitate chemical discovery. Flow matching is a recently proposed generative modeling framework that has achieved impressive performance on a variety of tasks including those on biomolecular structures. The seminal flow matching framework was developed only for continuous data. However, de novo molecular design… ▽ More Deep generative models that produce novel molecular structures have the potential to facilitate chemical discovery. Flow matching is a recently proposed generative modeling framework that has achieved impressive performance on a variety of tasks including those on biomolecular structures. The seminal flow matching framework was developed only for continuous data. However, de novo molecular design tasks require generating discrete data such as atomic elements or sequences of amino acid residues. Several discrete flow matching methods have been proposed recently to address this gap. In this work we benchmark the performance of existing discrete flow matching methods for 3D de novo small molecule generation and provide explanations of their differing behavior. As a result we present FlowMol-CTMC, an open-source model that achieves state of the art performance for 3D de novo design with fewer learnable parameters than existing methods. Additionally, we propose the use of metrics that capture molecule quality beyond local chemical valency constraints and towards higher-order structural motifs. These metrics show that even though basic constraints are satisfied, the models tend to produce unusual and potentially problematic functional groups outside of the training data distribution. Code and trained models for reproducing this work are available at \url{https://github.com/dunni3/FlowMol}. △ Less

Submitted 25 November, 2024; originally announced November 2024.

Comments: Presented at the NeurIPS 2024 Machine Learning for Structural Biology Workshop

arXiv:2404.19739 [pdf, other]

Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation

Authors: Ian Dunn, David Ryan Koes

Abstract: Deep generative models that produce novel molecular structures have the potential to facilitate chemical discovery. Diffusion models currently achieve state of the art performance for 3D molecule generation. In this work, we explore the use of flow matching, a recently proposed generative modeling framework that generalizes diffusion models, for the task of de novo molecule generation. Flow matchi… ▽ More Deep generative models that produce novel molecular structures have the potential to facilitate chemical discovery. Diffusion models currently achieve state of the art performance for 3D molecule generation. In this work, we explore the use of flow matching, a recently proposed generative modeling framework that generalizes diffusion models, for the task of de novo molecule generation. Flow matching provides flexibility in model design; however, the framework is predicated on the assumption of continuously-valued data. 3D de novo molecule generation requires jointly sampling continuous and categorical variables such as atom position and atom type. We extend the flow matching framework to categorical data by constructing flows that are constrained to exist on a continuous representation of categorical data known as the probability simplex. We call this extension SimplexFlow. We explore the use of SimplexFlow for de novo molecule generation. However, we find that, in practice, a simpler approach that makes no accommodations for the categorical nature of the data yields equivalent or superior performance. As a result of these experiments, we present FlowMol, a flow matching model for 3D de novo generative model that achieves improved performance over prior flow matching methods, and we raise important questions about the design of prior distributions for achieving strong performance in flow matching models. Code and trained models for reproducing this work are available at https://github.com/dunni3/FlowMol △ Less

Submitted 30 April, 2024; originally announced April 2024.

arXiv:2311.13466 [pdf, other]

Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure

Authors: Ian Dunn, David Ryan Koes

Abstract: Diffusion generative models have emerged as a powerful framework for addressing problems in structural biology and structure-based drug design. These models operate directly on 3D molecular structures. Due to the unfavorable scaling of graph neural networks (GNNs) with graph size as well as the relatively slow inference speeds inherent to diffusion models, many existing molecular diffusion models… ▽ More Diffusion generative models have emerged as a powerful framework for addressing problems in structural biology and structure-based drug design. These models operate directly on 3D molecular structures. Due to the unfavorable scaling of graph neural networks (GNNs) with graph size as well as the relatively slow inference speeds inherent to diffusion models, many existing molecular diffusion models rely on coarse-grained representations of protein structure to make training and inference feasible. However, such coarse-grained representations discard essential information for modeling molecular interactions and impair the quality of generated structures. In this work, we present a novel GNN-based architecture for learning latent representations of molecular structure. When trained end-to-end with a diffusion model for de novo ligand design, our model achieves comparable performance to one with an all-atom protein representation while exhibiting a 3-fold reduction in inference time. △ Less

Submitted 8 May, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

Comments: This paper appeared as a spotlight paper at the NeurIPS 2023 Generative AI and Biology Workshop

arXiv:2203.02450 [pdf, other]

doi 10.1103/PhysRevB.105.224305

Cumulant methods for electron-phonon problems. II. The self-consistent cumulant expansion

Authors: Paul J. Robinson, Ian S. Dunn, David R. Reichman

Abstract: In this work we present a self-consistent cumulant expansion (SC-CE) and investigate its accuracy for the one-dimensional Holstein model with and without phonon dispersion. We show that for finite lattices sizes, the numerical integration of the SC-CE equations becomes unstable at long times. This defect is partially ameliorated when studying systems in the thermodynamic limit, enabling the demons… ▽ More In this work we present a self-consistent cumulant expansion (SC-CE) and investigate its accuracy for the one-dimensional Holstein model with and without phonon dispersion. We show that for finite lattices sizes, the numerical integration of the SC-CE equations becomes unstable at long times. This defect is partially ameliorated when studying systems in the thermodynamic limit, enabling the demonstration that the SC-CE corrects many deficits of the standard perturbative CE in the (non-dispersive) Holstein model. The natural phonon damping that arises in the more realistic dispersed Holstein model renders the SC-CE stable, allowing for a complete assessment of the method. Here we find that self-consistency dramatically corrects many of the failures found in the perturbative CE, but also introduces some unphysical features. Finally, we comment on the potential use of SC-CE as a tool for calculating Green's functions in generic many-body problems. △ Less

Submitted 8 June, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

Comments: 11 pages, 4 figures

Journal ref: Phys. Rev. B 105, 224305 (2022)

arXiv:2203.02449 [pdf, other]

doi 10.1103/PhysRevB.105.224304

Cumulant methods for electron-phonon problems. I. Perturbative expansions

Authors: Paul J. Robinson, Ian S. Dunn, David R. Reichman

Abstract: In this work we investigate the ability of the cumulant expansion (CE) to capture one-particle spectral information in electron-phonon coupled systems at both zero and finite temperatures. In particular, we present a comprehensive study of the second- and fourth-order CE for the one-dimensional Holstein model as compared with numerically exact methods. We investigate both finite sized systems as w… ▽ More In this work we investigate the ability of the cumulant expansion (CE) to capture one-particle spectral information in electron-phonon coupled systems at both zero and finite temperatures. In particular, we present a comprehensive study of the second- and fourth-order CE for the one-dimensional Holstein model as compared with numerically exact methods. We investigate both finite sized systems as well as the approach to the thermodynamic limit, drawing distinctions and connections between the behavior of systems in and away from the thermodynamic limit that enable a greater understanding of the ability of the CE to capture real-frequency information across the full range of wave vectors. We find that for zero electronic momentum, the spectral function is well described by the second-order CE at low and high temperatures. However, for non-zero electronic momenta, the CE is only accurate at high temperature. We analyze the fourth-order cumulant, and find that while it improves the description of the short-time dynamics encoded in the one-particle Green's function, it can introduce divergences in the time domain as well as unphysical negative spectral weight in the spectral function. When well-behaved, the fourth-order CE does provide notable accurate corrections to the second-order CE. Finally, we use our results to comment on the use of the CE as a tool for calculating transport behavior in the realistic ab initio modeling of materials. △ Less

Submitted 8 June, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

Comments: 16 pages, 7 figures

Journal ref: Phys. Rev. B 105, 224304 (2022)

arXiv:2106.00576 [pdf, other]

doi 10.1145/3460319.3464801

Exposing Previously Undetectable Faults in Deep Neural Networks

Authors: Isaac Dunn, Hadrien Pouget, Daniel Kroening, Tom Melham

Abstract: Existing methods for testing DNNs solve the oracle problem by constraining the raw features (e.g. image pixel values) to be within a small distance of a dataset example for which the desired DNN output is known. But this limits the kinds of faults these approaches are able to detect. In this paper, we introduce a novel DNN testing method that is able to find faults in DNNs that other methods canno… ▽ More Existing methods for testing DNNs solve the oracle problem by constraining the raw features (e.g. image pixel values) to be within a small distance of a dataset example for which the desired DNN output is known. But this limits the kinds of faults these approaches are able to detect. In this paper, we introduce a novel DNN testing method that is able to find faults in DNNs that other methods cannot. The crux is that, by leveraging generative machine learning, we can generate fresh test inputs that vary in their high-level features (for images, these include object shape, location, texture, and colour). We demonstrate that our approach is capable of detecting deliberately injected faults as well as new faults in state-of-the-art DNNs, and that in both cases, existing methods are unable to find these faults. △ Less

Submitted 1 June, 2021; originally announced June 2021.

Comments: Accepted to the ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2021)

ACM Class: I.2.6; D.2.5

arXiv:2001.11055 [pdf, other]

Evaluating Robustness to Context-Sensitive Feature Perturbations of Different Granularities

Authors: Isaac Dunn, Laura Hanu, Hadrien Pouget, Daniel Kroening, Tom Melham

Abstract: We cannot guarantee that training datasets are representative of the distribution of inputs that will be encountered during deployment. So we must have confidence that our models do not over-rely on this assumption. To this end, we introduce a new method that identifies context-sensitive feature perturbations (e.g. shape, location, texture, colour) to the inputs of image classifiers. We produce th… ▽ More We cannot guarantee that training datasets are representative of the distribution of inputs that will be encountered during deployment. So we must have confidence that our models do not over-rely on this assumption. To this end, we introduce a new method that identifies context-sensitive feature perturbations (e.g. shape, location, texture, colour) to the inputs of image classifiers. We produce these changes by performing small adjustments to the activation values of different layers of a trained generative neural network. Perturbing at layers earlier in the generator causes changes to coarser-grained features; perturbations further on cause finer-grained changes. Unsurprisingly, we find that state-of-the-art classifiers are not robust to any such changes. More surprisingly, when it comes to coarse-grained feature changes, we find that adversarial training against pixel-space perturbations is not just unhelpful: it is counterproductive. △ Less

Submitted 23 October, 2020; v1 submitted 29 January, 2020; originally announced January 2020.

arXiv:1905.02463 [pdf, other]

Adaptive Generation of Unrestricted Adversarial Inputs

Authors: Isaac Dunn, Hadrien Pouget, Tom Melham, Daniel Kroening

Abstract: Neural networks are vulnerable to adversarially-constructed perturbations of their inputs. Most research so far has considered perturbations of a fixed magnitude under some $l_p$ norm. Although studying these attacks is valuable, there has been increasing interest in the construction of (and robustness to) unrestricted attacks, which are not constrained to a small and rather artificial subset of a… ▽ More Neural networks are vulnerable to adversarially-constructed perturbations of their inputs. Most research so far has considered perturbations of a fixed magnitude under some $l_p$ norm. Although studying these attacks is valuable, there has been increasing interest in the construction of (and robustness to) unrestricted attacks, which are not constrained to a small and rather artificial subset of all possible adversarial inputs. We introduce a novel algorithm for generating such unrestricted adversarial inputs which, unlike prior work, is adaptive: it is able to tune its attacks to the classifier being targeted. It also offers a 400-2,000x speedup over the existing state of the art. We demonstrate our approach by generating unrestricted adversarial inputs that fool classifiers robust to perturbation-based attacks. We also show that, by virtue of being adaptive and unrestricted, our attack is able to defeat adversarial training against it. △ Less

Submitted 1 October, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

Comments: Updated to include new results

arXiv:1902.04594 [pdf, other]

doi 10.1063/1.5092616

Removing instabilities in the hierarchical equations of motion: exact and approximate projection approaches

Authors: Ian S. Dunn, Roel Tempelaar, David R. Reichman

Abstract: The hierarchical equations of motion (HEOM) provide a numerically exact approach for computing the reduced dynamics of a quantum system linearly coupled to a bath. We have found that HEOM contains temperature-dependent instabilities that grow exponentially in time. In the case of continuous-bath models, these instabilities may be delayed to later times by increasing the hierarchy dimension; howeve… ▽ More The hierarchical equations of motion (HEOM) provide a numerically exact approach for computing the reduced dynamics of a quantum system linearly coupled to a bath. We have found that HEOM contains temperature-dependent instabilities that grow exponentially in time. In the case of continuous-bath models, these instabilities may be delayed to later times by increasing the hierarchy dimension; however, for systems coupled to discrete, non-dispersive modes, increasing the hierarchy dimension does little to alleviate the problem. We show that these instabilities can also be removed completely at a potentially much lower cost via projection onto the space of stable eigenmodes; furthermore, we find that for discrete-bath models at zero temperature, the remaining projected dynamics computed with few hierarchy levels are essentially identical to the exact dynamics that otherwise might require an intractably large number of hierarchy levels for convergence. Recognizing that computation of the eigenmodes might be prohibitive, e.g. for large or strongly-coupled models, we present a Prony filtration algorithm that may be useful as an alternative for accomplishing this projection when diagonalization is too costly. We present results demonstrating the efficacy of HEOM projected via diagonalization and Prony filtration. We also discuss issues associated with the nonnormality of HEOM. △ Less

Submitted 10 May, 2019; v1 submitted 12 February, 2019; originally announced February 2019.

Comments: This article may be downloaded for personal use only. Any other use requires prior permission of the author and AIP Publishing. This article appeared in J. Chem. Phys. 150, 184109 (2019) and may be found at https://doi.org/10.1063/1.5092616

Journal ref: J. Chem. Phys. 150, 184109 (2019)

arXiv:1809.08040 [pdf, other]

doi 10.1063/1.5055826

A straightforward 2$ω$ technique for the measurement of the Thomson effect

Authors: Isaac Haïk Dunn, Ramzy Daou, Colin Atkinson

Abstract: We present a simplified, rapid, and accurate method for the measurement of the thermoelectric Thomson coefficient by the dynamical heating of a suspended wire by an alternating current. By applying a temperature gradient across the wire, we find that the response at the second harmonic of the excitation frequency is directly proportional to the Thomson coefficient. The absolute thermoelectric coef… ▽ More We present a simplified, rapid, and accurate method for the measurement of the thermoelectric Thomson coefficient by the dynamical heating of a suspended wire by an alternating current. By applying a temperature gradient across the wire, we find that the response at the second harmonic of the excitation frequency is directly proportional to the Thomson coefficient. The absolute thermoelectric coefficient of a single material can therefore be extracted with high precision by a phase sensitive detector. We test our method on platinum and nickel wires and develop both analytical and numerical models to determine the leading sources of error. △ Less

Submitted 24 February, 2019; v1 submitted 21 September, 2018; originally announced September 2018.

Comments: 10 pages, 7 figures and 1 table

Journal ref: Review of Scientific Instruments 90, 024902 (2019)

Showing 1–12 of 12 results for author: Dunn, I