Showing 1–6 of 6 results for author: Schilling-Wilhelmi, M

  1. arXiv:2507.07456  [pdf, ps, other]

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    General purpose models for the chemical sciences

    Authors: Nawaf Alampara, Anagha Aneesh, Martiño Ríos-García, Adrian Mirza, Mara Schilling-Wilhelmi, Ali Asghar Aghajani, Meiling Sun, Gordan Prastalo, Kevin Maik Jablonka

    Abstract: Data-driven techniques have a large potential to transform and accelerate the chemical sciences. However, chemical sciences also pose the unique challenge of very diverse, small, fuzzy datasets that are difficult to leverage in conventional machine learning approaches completely. A new class of models, general-purpose models (GPMs) such as large language models, have shown the ability to solve tas…

    Submitted 10 July, 2025; originally announced July 2025.

  2. arXiv:2503.10837  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    Lessons from the trenches on evaluating machine-learning systems in materials science

    Authors: Nawaf Alampara, Mara Schilling-Wilhelmi, Kevin Maik Jablonka

    Abstract: Measurements are fundamental to knowledge creation in science, enabling consistent sharing of findings and serving as the foundation for scientific discovery. As machine learning systems increasingly transform scientific fields, the question of how to effectively evaluate these systems becomes crucial for ensuring reliable progress. In this review, we examine the current state and future directi…

    Submitted 6 May, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  3. arXiv:2411.16955  [pdf, other]

    cs.LG cond-mat.mtrl-sci

    Probing the limitations of multimodal language models for chemistry and materials research

    Authors: Nawaf Alampara, Mara Schilling-Wilhelmi, Martiño Ríos-García, Indrajeet Mandal, Pranav Khetarpal, Hargun Singh Grover, N. M. Anoop Krishnan, Kevin Maik Jablonka

    Abstract: Recent advancements in artificial intelligence have sparked interest in scientific assistants that could support researchers across the full spectrum of scientific workflows, from literature review to experimental design and data analysis. A key capability for such systems is the ability to process and reason about scientific information in both visual and textual forms - from interpreting spectro…

    Submitted 28 February, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

  4. arXiv:2411.15221  [pdf, other]

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

    Authors: Yoel Zimmermann, Adib Bazgir, Zartashia Afzal, Fariha Agbere, Qianxiang Ai, Nawaf Alampara, Alexander Al-Feghali, Mehrad Ansari, Dmytro Antypov, Amro Aswad, Jiaru Bai, Viktoriia Baibakova, Devi Dutta Biswajeet, Erik Bitzek, Joshua D. Bocarsly, Anna Borisova, Andres M Bran, L. Catherine Brinson, Marcel Moran Calderon, Alessandro Canalicchio, Victor Chen, Yuan Chiang, Defne Circi, Benjamin Charmes, Vikrant Chaudhary, et al. (119 additional authors not shown)

    Abstract: Here, we present the outcomes from the second Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry, which engaged participants across global hybrid locations, resulting in 34 team submissions. The submissions spanned seven key application areas and demonstrated the diverse utility of LLMs for applications in (1) molecular and material property prediction; (2) mo…

    Submitted 2 January, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: Updating author information, the submission remains largely unchanged. 98 pages total

  5. arXiv:2407.16867  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    From Text to Insight: Large Language Models for Materials Science Data Extraction

    Authors: Mara Schilling-Wilhelmi, Martiño Ríos-García, Sherjeel Shabih, María Victoria Gil, Santiago Miret, Christoph T. Koch, José A. Márquez, Kevin Maik Jablonka

    Abstract: The vast majority of materials science knowledge exists in unstructured natural language, yet structured data is crucial for innovative and systematic materials design. Traditionally, the field has relied on manual curation and partial automation for data extraction for specific use cases. The advent of large language models (LLMs) represents a significant shift, potentially enabling efficient ext…

    Submitted 2 December, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

  6. arXiv:2404.01475  [pdf, other]

    cs.LG cond-mat.mtrl-sci cs.AI physics.chem-ph

    Are large language models superhuman chemists?

    Authors: Adrian Mirza, Nawaf Alampara, Sreekanth Kunchapu, Martiño Ríos-García, Benedict Emoekabu, Aswanth Krishnan, Tanya Gupta, Mara Schilling-Wilhelmi, Macjonathan Okereke, Anagha Aneesh, Amir Mohammad Elahi, Mehrdad Asgari, Juliane Eberhardt, Hani M. Elbeheiry, María Victoria Gil, Maximilian Greiner, Caroline T. Holick, Christina Glaubitz, Tim Hoffmann, Abdelrahman Ibrahim, Lea C. Klepsch, Yannik Köster, Fabian Alexander Kreth, Jakob Meyer, Santiago Miret, et al. (10 additional authors not shown)

    Abstract: Large language models (LLMs) have gained widespread interest due to their ability to process human language and perform tasks on which they have not been explicitly trained. However, we possess only a limited systematic understanding of the chemical capabilities of LLMs, which would be required to improve models and mitigate potential harm. Here, we introduce "ChemBench," an automated framework…

    Submitted 1 November, 2024; v1 submitted 1 April, 2024; originally announced April 2024.