Skip to main content

Showing 1–10 of 10 results for author: Segler, M

Searching in archive physics. Search in all archives.
.
  1. arXiv:2506.14492  [pdf, ps, other

    physics.chem-ph

    Accurate Chemistry Collection: Coupled cluster atomization energies for broad chemical space

    Authors: Sebastian Ehlert, Jan Hermann, Thijs Vogels, Victor Garcia Satorras, Stephanie Lanius, Marwin Segler, Derk P. Kooi, Kenji Takeda, Chin-Wei Huang, Giulia Luise, Rianne van den Berg, Paola Gori-Giorgi, Amir Karton

    Abstract: Accurate thermochemical data with sub-chemical accuracy (i.e., within $\pm$1 kcal mol$^{-1}$ from sufficiently accurate experimental or theoretical reference data) is essential for the development and improvement of computational chemistry methods. Challenging thermochemical properties such as heats of formation and total atomization energies (TAEs) are of particular interest because they rigorous… ▽ More

    Submitted 1 July, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

    Comments: 9 pages plus references, 6 figures, dataset on Zenodo; CHANGES: extended method specification

  2. arXiv:2503.06687  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.AI physics.bio-ph physics.chem-ph

    UniGenX: Unified Generation of Sequence and Structure with Autoregressive Diffusion

    Authors: Gongbo Zhang, Yanting Li, Renqian Luo, Pipi Hu, Zeru Zhao, Lingbo Li, Guoqing Liu, Zun Wang, Ran Bi, Kaiyuan Gao, Liya Guo, Yu Xie, Chang Liu, Jia Zhang, Tian Xie, Robert Pinsler, Claudio Zeni, Ziheng Lu, Yingce Xia, Marwin Segler, Maik Riechert, Li Yuan, Lei Chen, Haiguang Liu, Tao Qin

    Abstract: Unified generation of sequence and structure for scientific data (e.g., materials, molecules, proteins) is a critical task. Existing approaches primarily rely on either autoregressive sequence models or diffusion models, each offering distinct advantages and facing notable limitations. Autoregressive models, such as GPT, Llama, and Phi-4, have demonstrated remarkable success in natural language ge… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

  3. arXiv:2501.06669  [pdf, other

    cs.LG physics.chem-ph

    Challenging reaction prediction models to generalize to novel chemistry

    Authors: John Bradshaw, Anji Zhang, Babak Mahjour, David E. Graff, Marwin H. S. Segler, Connor W. Coley

    Abstract: Deep learning models for anticipating the products of organic reactions have found many use cases, including validating retrosynthetic pathways and constraining synthesis-based molecular design tools. Despite compelling performance on popular benchmark tasks, strange and erroneous predictions sometimes ensue when using these models in practice. The core issue is that common benchmarks test models… ▽ More

    Submitted 11 January, 2025; originally announced January 2025.

  4. arXiv:1906.05221  [pdf, other

    cs.LG physics.comp-ph stat.ML

    A Model to Search for Synthesizable Molecules

    Authors: John Bradshaw, Brooks Paige, Matt J. Kusner, Marwin H. S. Segler, José Miguel Hernández-Lobato

    Abstract: Deep generative models are able to suggest new organic molecules by generating strings, trees, and graphs representing their structure. While such models allow one to generate molecules with desirable properties, they give no guarantees that the molecules can actually be synthesized in practice. We propose a new molecule generation model, mirroring a more realistic real-world process, where (a) re… ▽ More

    Submitted 4 December, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: To appear in Advances in Neural Information Processing Systems 2019

  5. arXiv:1811.09621  [pdf, ps, other

    q-bio.QM cs.LG physics.chem-ph q-bio.BM

    GuacaMol: Benchmarking Models for De Novo Molecular Design

    Authors: Nathan Brown, Marco Fiscato, Marwin H. S. Segler, Alain C. Vaucher

    Abstract: De novo design seeks to generate molecules with required property profiles by virtual design-make-test cycles. With the emergence of deep learning and neural generative models in many application areas, models for molecular design based on neural networks appeared recently and show promising results. However, the new models have not been profiled on consistent tasks, and comparative studies to wel… ▽ More

    Submitted 26 February, 2019; v1 submitted 22 November, 2018; originally announced November 2018.

  6. arXiv:1805.10970  [pdf, other

    physics.chem-ph cs.LG stat.ML

    A Generative Model For Electron Paths

    Authors: John Bradshaw, Matt J. Kusner, Brooks Paige, Marwin H. S. Segler, José Miguel Hernández-Lobato

    Abstract: Chemical reactions can be described as the stepwise redistribution of electrons in molecules. As such, reactions are often depicted using `arrow-pushing' diagrams which show this movement as a sequence of arrows. We propose an electron path prediction model (ELECTRO) to learn these sequences directly from raw reaction data. Instead of predicting product molecules directly from reactant molecules i… ▽ More

    Submitted 20 March, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

  7. arXiv:1708.04202  [pdf, ps, other

    cs.AI cs.LG physics.chem-ph

    Learning to Plan Chemical Syntheses

    Authors: Marwin H. S. Segler, Mike Preuss, Mark P. Waller

    Abstract: From medicines to materials, small organic molecules are indispensable for human well-being. To plan their syntheses, chemists employ a problem solving technique called retrosynthesis. In retrosynthesis, target molecules are recursively transformed into increasingly simpler precursor compounds until a set of readily available starting materials is obtained. Computer-aided retrosynthesis would be a… ▽ More

    Submitted 14 August, 2017; originally announced August 2017.

    Journal ref: Nature 555 (2018), 604-610

  8. arXiv:1702.00020  [pdf, ps, other

    cs.AI cs.LG physics.chem-ph

    Towards "AlphaChem": Chemical Synthesis Planning with Tree Search and Deep Neural Network Policies

    Authors: Marwin Segler, Mike Preuß, Mark P. Waller

    Abstract: Retrosynthesis is a technique to plan the chemical synthesis of organic molecules, for example drugs, agro- and fine chemicals. In retrosynthesis, a search tree is built by analysing molecules recursively and dissecting them into simpler molecular building blocks until one obtains a set of known building blocks. The search space is intractably large, and it is difficult to determine the value of r… ▽ More

    Submitted 31 January, 2017; originally announced February 2017.

    Comments: 4 pages, 1 figure

  9. arXiv:1701.01329  [pdf, ps, other

    cs.NE cs.AI cs.LG physics.chem-ph stat.ML

    Generating Focussed Molecule Libraries for Drug Discovery with Recurrent Neural Networks

    Authors: Marwin H. S. Segler, Thierry Kogej, Christian Tyrchan, Mark P. Waller

    Abstract: In de novo drug design, computational strategies are used to generate novel molecules with good affinity to the desired biological target. In this work, we show that recurrent neural networks can be trained as generative models for molecular structures, similar to statistical language models in natural language processing. We demonstrate that the properties of the generated molecules correlate ver… ▽ More

    Submitted 5 January, 2017; originally announced January 2017.

    Comments: 17 pages, 17 figures

  10. arXiv:1608.07117  [pdf, ps, other

    cs.AI physics.chem-ph q-bio.MN

    Modelling Chemical Reasoning to Predict Reactions

    Authors: Marwin H. S. Segler, Mark P. Waller

    Abstract: The ability to reason beyond established knowledge allows Organic Chemists to solve synthetic problems and to invent novel transformations. Here, we propose a model which mimics chemical reasoning and formalises reaction prediction as finding missing links in a knowledge graph. We have constructed a knowledge graph containing 14.4 million molecules and 8.2 million binary reactions, which represent… ▽ More

    Submitted 25 August, 2016; originally announced August 2016.

    Comments: 17 pages, 8 figures

    Journal ref: Chem. Eur. J. 2017, 23, 6118-6128