-
eLog analysis for accelerators: status and future outlook
Authors:
Antonin Sulc,
Thorsten Hellert,
Aaron Reed,
Adam Carpenter,
Alex Bien,
Chris Tennant,
Claudio Bisegni,
Daniel Lersch,
Daniel Ratner,
David Lawrence,
Diana McSpadden,
Hayden Hoschouer,
Jason St. John,
Thomas Britton
Abstract:
This work demonstrates electronic logbook (eLog) systems leveraging modern AI-driven information retrieval capabilities at the accelerator facilities of Fermilab, Jefferson Lab, Lawrence Berkeley National Laboratory (LBNL), SLAC National Accelerator Laboratory. We evaluate contemporary tools and methodologies for information retrieval with Retrieval Augmented Generation (RAGs), focusing on operati…
▽ More
This work demonstrates electronic logbook (eLog) systems leveraging modern AI-driven information retrieval capabilities at the accelerator facilities of Fermilab, Jefferson Lab, Lawrence Berkeley National Laboratory (LBNL), SLAC National Accelerator Laboratory. We evaluate contemporary tools and methodologies for information retrieval with Retrieval Augmented Generation (RAGs), focusing on operational insights and integration with existing accelerator control systems.
The study addresses challenges and proposes solutions for state-of-the-art eLog analysis through practical implementations, demonstrating applications and limitations. We present a framework for enhancing accelerator facility operations through improved information accessibility and knowledge management, which could potentially lead to more efficient operations.
△ Less
Submitted 15 June, 2025;
originally announced June 2025.
-
Towards Agentic AI on Particle Accelerators
Authors:
Antonin Sulc,
Thorsten Hellert,
Raimund Kammering,
Hayden Hoschouer,
Jason St. John
Abstract:
As particle accelerators grow in complexity, traditional control methods face increasing challenges in achieving optimal performance. This paper envisions a paradigm shift: a decentralized multi-agent framework for accelerator control, powered by Large Language Models (LLMs) and distributed among autonomous agents. We present a proposition of a self-improving decentralized system where intelligent…
▽ More
As particle accelerators grow in complexity, traditional control methods face increasing challenges in achieving optimal performance. This paper envisions a paradigm shift: a decentralized multi-agent framework for accelerator control, powered by Large Language Models (LLMs) and distributed among autonomous agents. We present a proposition of a self-improving decentralized system where intelligent agents handle high-level tasks and communication and each agent is specialized to control individual accelerator components.
This approach raises some questions: What are the future applications of AI in particle accelerators? How can we implement an autonomous complex system such as a particle accelerator where agents gradually improve through experience and human feedback? What are the implications of integrating a human-in-the-loop component for labeling operational data and providing expert guidance? We show three examples, where we demonstrate the viability of such architecture.
△ Less
Submitted 22 December, 2024; v1 submitted 10 September, 2024;
originally announced September 2024.
-
PhysBERT: A Text Embedding Model for Physics Scientific Literature
Authors:
Thorsten Hellert,
João Montenegro,
Andrea Pollastro
Abstract:
The specialized language and complex concepts in physics pose significant challenges for information extraction through Natural Language Processing (NLP). Central to effective NLP applications is the text embedding model, which converts text into dense vector representations for efficient information retrieval and semantic analysis. In this work, we introduce PhysBERT, the first physics-specific t…
▽ More
The specialized language and complex concepts in physics pose significant challenges for information extraction through Natural Language Processing (NLP). Central to effective NLP applications is the text embedding model, which converts text into dense vector representations for efficient information retrieval and semantic analysis. In this work, we introduce PhysBERT, the first physics-specific text embedding model. Pre-trained on a curated corpus of 1.2 million arXiv physics papers and fine-tuned with supervised data, PhysBERT outperforms leading general-purpose models on physics-specific tasks including the effectiveness in fine-tuning for specific physics subdomains.
△ Less
Submitted 18 August, 2024;
originally announced August 2024.
-
Towards Unlocking Insights from Logbooks Using AI
Authors:
Antonin Sulc,
Alex Bien,
Annika Eichler,
Daniel Ratner,
Florian Rehm,
Frank Mayet,
Gregor Hartmann,
Hayden Hoschouer,
Henrik Tuennermann,
Jan Kaiser,
Jason St. John,
Jennefer Maldonado,
Kyle Hazelwood,
Raimund Kammering,
Thorsten Hellert,
Tim Wilksen,
Verena Kain,
Wan-Lin Hu
Abstract:
Electronic logbooks contain valuable information about activities and events concerning their associated particle accelerator facilities. However, the highly technical nature of logbook entries can hinder their usability and automation. As natural language processing (NLP) continues advancing, it offers opportunities to address various challenges that logbooks present. This work explores jointly t…
▽ More
Electronic logbooks contain valuable information about activities and events concerning their associated particle accelerator facilities. However, the highly technical nature of logbook entries can hinder their usability and automation. As natural language processing (NLP) continues advancing, it offers opportunities to address various challenges that logbooks present. This work explores jointly testing a tailored Retrieval Augmented Generation (RAG) model for enhancing the usability of particle accelerator logbooks at institutes like DESY, BESSY, Fermilab, BNL, SLAC, LBNL, and CERN. The RAG model uses a corpus built on logbook contributions and aims to unlock insights from these logbooks by leveraging retrieval over facility datasets, including discussion about potential multimodal sources. Our goals are to increase the FAIR-ness (findability, accessibility, interoperability, and reusability) of logbooks by exploiting their information content to streamline everyday use, enable macro-analysis for root cause analysis, and facilitate problem-solving automation.
△ Less
Submitted 25 May, 2024;
originally announced June 2024.