Search | arXiv e-print repository

Language Models Enable Data-Augmented Synthesis Planning for Inorganic Materials

Authors: Thorben Prein, Elton Pan, Janik Jehkul, Steffen Weinmann, Elsa A. Olivetti, Jennifer L. M. Rupp

Abstract: Inorganic synthesis planning currently relies primarily on heuristic approaches or machine-learning models trained on limited datasets, which constrains its generality. We demonstrate that language models, without task-specific fine-tuning, can recall synthesis conditions. Off-the-shelf models, such as GPT-4.1, Gemini 2.0 Flash and Llama 4 Maverick, achieve a Top-1 precursor-prediction accuracy of… ▽ More Inorganic synthesis planning currently relies primarily on heuristic approaches or machine-learning models trained on limited datasets, which constrains its generality. We demonstrate that language models, without task-specific fine-tuning, can recall synthesis conditions. Off-the-shelf models, such as GPT-4.1, Gemini 2.0 Flash and Llama 4 Maverick, achieve a Top-1 precursor-prediction accuracy of up to 53.8 % and a Top-5 performance of 66.1 % on a held-out set of 1,000 reactions. They also predict calcination and sintering temperatures with mean absolute errors below 126 °C, matching specialized regression methods. Ensembling these language models further enhances predictive accuracy and reduces inference cost per prediction by up to 70 %. We subsequently employ language models to generate 28,548 synthetic reaction recipes, which we combine with literature-mined examples to pretrain a transformer-based model, SyntMTE. After fine-tuning on the combined dataset, SyntMTE reduces mean-absolute error in sintering temperature prediction to 73 °C and in calcination temperature to 98 °C. This strategy improves models by up to 8.7 % compared with baselines trained exclusively on experimental data. Finally, in a case study on Li7La3Zr2O12 solid-state electrolytes, we demonstrate that SyntMTE reproduces the experimentally observed dopant-dependent sintering trends. Our hybrid workflow enables scalable, data-efficient inorganic synthesis planning. △ Less

Submitted 14 June, 2025; originally announced June 2025.

arXiv:2502.04289 [pdf, other]

Retro-Rank-In: A Ranking-Based Approach for Inorganic Materials Synthesis Planning

Authors: Thorben Prein, Elton Pan, Sami Haddouti, Marco Lorenz, Janik Jehkul, Tymoteusz Wilk, Cansu Moran, Menelaos Panagiotis Fotiadis, Artur P. Toshev, Elsa Olivetti, Jennifer L. M. Rupp

Abstract: Retrosynthesis strategically plans the synthesis of a chemical target compound from simpler, readily available precursor compounds. This process is critical for synthesizing novel inorganic materials, yet traditional methods in inorganic chemistry continue to rely on trial-and-error experimentation. Emerging machine-learning approaches struggle to generalize to entirely new reactions due to their… ▽ More Retrosynthesis strategically plans the synthesis of a chemical target compound from simpler, readily available precursor compounds. This process is critical for synthesizing novel inorganic materials, yet traditional methods in inorganic chemistry continue to rely on trial-and-error experimentation. Emerging machine-learning approaches struggle to generalize to entirely new reactions due to their reliance on known precursors, as they frame retrosynthesis as a multi-label classification task. To address these limitations, we propose Retro-Rank-In, a novel framework that reformulates the retrosynthesis problem by embedding target and precursor materials into a shared latent space and learning a pairwise ranker on a bipartite graph of inorganic compounds. We evaluate Retro-Rank-In's generalizability on challenging retrosynthesis dataset splits designed to mitigate data duplicates and overlaps. For instance, for Cr2AlB2, it correctly predicts the verified precursor pair CrB + Al despite never seeing them in training, a capability absent in prior work. Extensive experiments show that Retro-Rank-In sets a new state-of-the-art, particularly in out-of-distribution generalization and candidate set ranking, offering a powerful tool for accelerating inorganic material synthesis. △ Less

Submitted 7 February, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

arXiv:2410.15012 [pdf]

Pathologist-like explainable AI for interpretable Gleason grading in prostate cancer

Authors: Gesa Mittmann, Sara Laiouar-Pedari, Hendrik A. Mehrtens, Sarah Haggenmüller, Tabea-Clara Bucher, Tirtha Chanda, Nadine T. Gaisa, Mathias Wagner, Gilbert Georg Klamminger, Tilman T. Rau, Christina Neppl, Eva Maria Compérat, Andreas Gocht, Monika Hämmerle, Niels J. Rupp, Jula Westhoff, Irene Krücken, Maximillian Seidl, Christian M. Schürch, Marcus Bauer, Wiebke Solass, Yu Chun Tam, Florian Weber, Rainer Grobholz, Jaroslaw Augustyniak , et al. (41 additional authors not shown)

Abstract: The aggressiveness of prostate cancer, the most common cancer in men worldwide, is primarily assessed based on histopathological data using the Gleason scoring system. While artificial intelligence (AI) has shown promise in accurately predicting Gleason scores, these predictions often lack inherent explainability, potentially leading to distrust in human-machine interactions. To address this issue… ▽ More The aggressiveness of prostate cancer, the most common cancer in men worldwide, is primarily assessed based on histopathological data using the Gleason scoring system. While artificial intelligence (AI) has shown promise in accurately predicting Gleason scores, these predictions often lack inherent explainability, potentially leading to distrust in human-machine interactions. To address this issue, we introduce a novel dataset of 1,015 tissue microarray core images, annotated by an international group of 54 pathologists. The annotations provide detailed localized pattern descriptions for Gleason grading in line with international guidelines. Utilizing this dataset, we develop an inherently explainable AI system based on a U-Net architecture that provides predictions leveraging pathologists' terminology. This approach circumvents post-hoc explainability methods while maintaining or exceeding the performance of methods trained directly for Gleason pattern segmentation (Dice score: 0.713 $\pm$ 0.003 trained on explanations vs. 0.691 $\pm$ 0.010 trained on Gleason patterns). By employing soft labels during training, we capture the intrinsic uncertainty in the data, yielding strong results in Gleason pattern segmentation even in the context of high interobserver variability. With the release of this dataset, we aim to encourage further research into segmentation in medical tasks with high levels of subjectivity and to advance the understanding of pathologists' reasoning processes. △ Less

Submitted 19 October, 2024; originally announced October 2024.

Comments: 58 pages, 15 figures (incl. supplementary)

arXiv:2309.12483 [pdf]

doi 10.1007/s13218-023-00823-8

PrivAgE: A Toolchain for Privacy-Preserving Distributed Aggregation on Edge-Devices

Authors: Johannes Liebenow, Timothy Imort, Yannick Fuchs, Marcel Heisel, Nadja Käding, Jan Rupp, Esfandiar Mohammadi

Abstract: Valuable insights, such as frequently visited environments in the wake of the COVID-19 pandemic, can oftentimes only be gained by analyzing sensitive data spread across edge-devices like smartphones. To facilitate such an analysis, we present a toolchain called PrivAgE for a distributed, privacy-preserving aggregation of local data by taking the limited resources of edge-devices into account. The… ▽ More Valuable insights, such as frequently visited environments in the wake of the COVID-19 pandemic, can oftentimes only be gained by analyzing sensitive data spread across edge-devices like smartphones. To facilitate such an analysis, we present a toolchain called PrivAgE for a distributed, privacy-preserving aggregation of local data by taking the limited resources of edge-devices into account. The distributed aggregation is based on secure summation and simultaneously satisfies the notion of differential privacy. In this way, other parties can neither learn the sensitive data of single clients nor a single client's influence on the final result. We perform an evaluation of the power consumption, the running time and the bandwidth overhead on real as well as simulated devices and demonstrate the flexibility of our toolchain by presenting an extension of the summation of histograms to distributed clustering. △ Less

Submitted 12 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

arXiv:2105.05093 [pdf]

Lattice contraction induced by resistive switching in chromium-doped V2O3: a hallmark of Mott physics

Authors: D. Babich, J. Tranchant, C. Adda, B. Corraze, M. -P. Besland, P. Warnicke, D. Bedau, P. Bertoncini, J. -Y. Mevellec, B. Humbert, J. Rupp, T. Hennen, D. Wouters, R. Llopis, L. Cario, E. Janod

Abstract: Since the beginnings of the electronic age, a quest for ever faster and smaller switches has been initiated, since this element is ubiquitous and foundational in any electronic circuit to regulate the flow of current. Mott insulators are promising candidates to meet this need as they undergo extremely fast resistive switching under electric field. However the mechanism of this transition is still… ▽ More Since the beginnings of the electronic age, a quest for ever faster and smaller switches has been initiated, since this element is ubiquitous and foundational in any electronic circuit to regulate the flow of current. Mott insulators are promising candidates to meet this need as they undergo extremely fast resistive switching under electric field. However the mechanism of this transition is still under debate. Our spatially-resolved μ-XRD imaging experiments carried out on the prototypal Mott insulator (V0.95Cr0.05)2O3 show that the resistive switching is associated with the creation of a conducting filamentary path consisting in an isostructural compressed phase without any chemical nor symmetry change. This clearly evidences that the resistive switching mechanism is inherited from the bandwidth-controlled Mott transition. This discovery might hence ease the development of a new branch of electronics dubbed Mottronics. △ Less

Submitted 20 October, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

Comments: 14 pages, 5 figures + Supplementary Materials (14 pages, 7 figures)

arXiv:2008.10105 [pdf, other]

Deep Bayes Factor Scoring for Authorship Verification

Authors: Benedikt Boenninghoff, Julian Rupp, Robert M. Nickel, Dorothea Kolossa

Abstract: The PAN 2020 authorship verification (AV) challenge focuses on a cross-topic/closed-set AV task over a collection of fanfiction texts. Fanfiction is a fan-written extension of a storyline in which a so-called fandom topic describes the principal subject of the document. The data provided in the PAN 2020 AV task is quite challenging because authors of texts across multiple/different fandom topics a… ▽ More The PAN 2020 authorship verification (AV) challenge focuses on a cross-topic/closed-set AV task over a collection of fanfiction texts. Fanfiction is a fan-written extension of a storyline in which a so-called fandom topic describes the principal subject of the document. The data provided in the PAN 2020 AV task is quite challenging because authors of texts across multiple/different fandom topics are included. In this work, we present a hierarchical fusion of two well-known approaches into a single end-to-end learning procedure: A deep metric learning framework at the bottom aims to learn a pseudo-metric that maps a document of variable length onto a fixed-sized feature vector. At the top, we incorporate a probabilistic layer to perform Bayes factor scoring in the learned metric space. We also provide text preprocessing strategies to deal with the cross-topic issue. △ Less

Submitted 23 August, 2020; originally announced August 2020.

Comments: CLEF 2020 Labs and Workshops, Notebook Papers, September 2020. CEUR-WS.org

Showing 1–6 of 6 results for author: Rupp, J