-
Plasma State Monitoring and Disruption Characterization using Multimodal VAEs
Authors:
Yoeri Poels,
Alessandro Pau,
Christian Donner,
Giulio Romanelli,
Olivier Sauter,
Cristina Venturini,
Vlado Menkovski,
the TCV team,
the WPTE team
Abstract:
When a plasma disrupts in a tokamak, significant heat and electromagnetic loads are deposited onto the surrounding device components. These forces scale with plasma current and magnetic field strength, making disruptions one of the key challenges for future devices. Unfortunately, disruptions are not fully understood, with many different underlying causes that are difficult to anticipate. Data-dri…
▽ More
When a plasma disrupts in a tokamak, significant heat and electromagnetic loads are deposited onto the surrounding device components. These forces scale with plasma current and magnetic field strength, making disruptions one of the key challenges for future devices. Unfortunately, disruptions are not fully understood, with many different underlying causes that are difficult to anticipate. Data-driven models have shown success in predicting them, but they only provide limited interpretability. On the other hand, large-scale statistical analyses have been a great asset to understanding disruptive patterns. In this paper, we leverage data-driven methods to find an interpretable representation of the plasma state for disruption characterization. Specifically, we use a latent variable model to represent diagnostic measurements as a low-dimensional, latent representation. We build upon the Variational Autoencoder (VAE) framework, and extend it for (1) continuous projections of plasma trajectories; (2) a multimodal structure to separate operating regimes; and (3) separation with respect to disruptive regimes. Subsequently, we can identify continuous indicators for the disruption rate and the disruptivity based on statistical properties of measurement data. The proposed method is demonstrated using a dataset of approximately 1600 TCV discharges, selecting for flat-top disruptions or regular terminations. We evaluate the method with respect to (1) the identified disruption risk and its correlation with other plasma properties; (2) the ability to distinguish different types of disruptions; and (3) downstream analyses. For the latter, we conduct a demonstrative study on identifying parameters connected to disruptions using counterfactual-like analysis. Overall, the method can adequately identify distinct operating regimes characterized by varying proximity to disruptions in an interpretable manner.
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Robust Confinement State Classification with Uncertainty Quantification through Ensembled Data-Driven Methods
Authors:
Yoeri Poels,
Cristina Venturini,
Alessandro Pau,
Olivier Sauter,
Vlado Menkovski,
the TCV team,
the WPTE team
Abstract:
Maximizing fusion performance in tokamaks relies on high energy confinement, often achieved through distinct operating regimes. The automated labeling of these confinement states is crucial to enable large-scale analyses or for real-time control applications. While this task becomes difficult to automate near state transitions or in marginal scenarios, much success has been achieved with data-driv…
▽ More
Maximizing fusion performance in tokamaks relies on high energy confinement, often achieved through distinct operating regimes. The automated labeling of these confinement states is crucial to enable large-scale analyses or for real-time control applications. While this task becomes difficult to automate near state transitions or in marginal scenarios, much success has been achieved with data-driven models. However, these methods generally provide predictions as point estimates, and cannot adequately deal with missing and/or broken input signals. To enable wide-range applicability, we develop methods for confinement state classification with uncertainty quantification and model robustness. We focus on off-line analysis for TCV discharges, distinguishing L-mode, H-mode, and an in-between dithering phase (D). We propose ensembling data-driven methods on two axes: model formulations and feature sets. The former considers a dynamic formulation based on a recurrent Fourier Neural Operator-architecture and a static formulation based on gradient-boosted decision trees. These models are trained using multiple feature groupings categorized by diagnostic system or physical quantity. A dataset of 302 TCV discharges is fully labeled, and will be publicly released. We evaluate our method quantitatively using Cohen's kappa coefficient for predictive performance and the Expected Calibration Error for the uncertainty calibration. Furthermore, we discuss performance using a variety of common and alternative scenarios, the performance of individual components, out-of-distribution performance, cases of broken or missing signals, and evaluate conditionally-averaged behavior around different state transitions. Overall, the proposed method can distinguish L, D and H-mode with high performance, can cope with missing or broken signals, and provides meaningful uncertainty estimates.
△ Less
Submitted 24 February, 2025;
originally announced February 2025.
-
Learning Plasma Dynamics and Robust Rampdown Trajectories with Predict-First Experiments at TCV
Authors:
Allen M. Wang,
Alessandro Pau,
Cristina Rea,
Oswin So,
Charles Dawson,
Olivier Sauter,
Mark D. Boyer,
Anna Vu,
Cristian Galperti,
Chuchu Fan,
Antoine Merle,
Yoeri Poels,
Cristina Venturini,
Stefano Marchioni,
the TCV Team
Abstract:
The rampdown in tokamak operations is a difficult to simulate phase during which the plasma is often pushed towards multiple instability limits. To address this challenge, and reduce the risk of disrupting operations, we leverage recent advances in Scientific Machine Learning (SciML) to develop a neural state-space model (NSSM) that predicts plasma dynamics during Tokamak à Configuration Variable…
▽ More
The rampdown in tokamak operations is a difficult to simulate phase during which the plasma is often pushed towards multiple instability limits. To address this challenge, and reduce the risk of disrupting operations, we leverage recent advances in Scientific Machine Learning (SciML) to develop a neural state-space model (NSSM) that predicts plasma dynamics during Tokamak à Configuration Variable (TCV) rampdowns. By integrating simple physics structure and data-driven models, the NSSM efficiently learns plasma dynamics during the rampdown from a modest dataset of 311 pulses with only five pulses in the reactor relevant high performance regime. The NSSM is parallelized across uncertainties, and reinforcement learning (RL) is applied to design trajectories that avoid multiple instability limits with high probability. Experiments at TCV ramping down high performance plasmas show statistically significant improvements in current and energy at plasma termination, with improvements in speed through continuous re-training. A predict-first experiment, increasing plasma current by 20\% from baseline, demonstrates the NSSM's ability to make small extrapolations with sufficient accuracy to design trajectories that successfully terminate the pulse. The developed approach paves the way for designing tokamak controls with robustness to considerable uncertainty, and demonstrates the relevance of the SciML approach to learning plasma dynamics for rapidly developing robust trajectories and controls during the incremental campaigns of upcoming burning plasma tokamaks.
△ Less
Submitted 17 February, 2025;
originally announced February 2025.
-
Towards Transparent and Accurate Plasma State Monitoring at JET
Authors:
Andrin Bürli,
Alessandro Pau,
Thomas Koller,
Olivier Sauter,
JET Contributors
Abstract:
Controlling and monitoring plasma within a tokamak device is complex and challenging. Plasma off-normal events, such as disruptions, are hindering steady-state operation. For large devices, they can even endanger the machine's integrity and it represents in general one of the most serious concerns for the exploitation of the tokamak concept for future power plants. Effective plasma state monitorin…
▽ More
Controlling and monitoring plasma within a tokamak device is complex and challenging. Plasma off-normal events, such as disruptions, are hindering steady-state operation. For large devices, they can even endanger the machine's integrity and it represents in general one of the most serious concerns for the exploitation of the tokamak concept for future power plants. Effective plasma state monitoring carries the potential to enable an understanding of such phenomena and their evolution which is crucial for the successful operation of tokamaks. This paper presents the application of a transparent and data-driven methodology to monitor the plasma state in a tokamak. Compared to previous studies in the field, supervised and unsupervised learning techniques are combined. The dataset consisted of 520 expert-validated discharges from JET. The goal was to provide an interpretable plasma state representation for the JET operational space by leveraging multi-task learning for the first time in the context of plasma state monitoring. When evaluated as disruption predictors, a sequence-based approach showed significant improvements compared to the state-based models. The best resulting network achieved a promising cross-validated success rate when combined with a physical indicator and accounting for nearby instabilities. Qualitative evaluations of the learned latent space uncovered operational and disruptive regions as well as patterns related to learned dynamics and global feature importance. The applied methodology provides novel possibilities for the definition of triggers to switch between different control scenarios, data analysis, and learning as well as exploring latent dynamics for plasma state monitoring. It also showed promising quantitative and qualitative results with warning times suitable for avoidance purposes and distributions that are consistent with known physical mechanisms.
△ Less
Submitted 14 February, 2025;
originally announced February 2025.
-
Correlation of the L-mode density limit with edge collisionality
Authors:
Andrew Maris,
Cristina Rea,
Alessandro Pau,
Wenhui Hu,
Bingjia Xiao,
Robert Granetz,
Earl Marmar,
the EUROfusion Tokamak Exploitation team,
the Alcator C-Mod team,
the ASDEX Upgrade team,
the DIII-D team,
the EAST team,
the TCV team
Abstract:
The "density limit" is one of the fundamental bounds on tokamak operating space, and is commonly estimated via the empirical Greenwald scaling. This limit has garnered renewed interest in recent years as it has become clear that ITER and many tokamak pilot plant concepts must operate near or above the Greenwald limit to achieve their objectives. Evidence has also grown that the Greenwald scaling -…
▽ More
The "density limit" is one of the fundamental bounds on tokamak operating space, and is commonly estimated via the empirical Greenwald scaling. This limit has garnered renewed interest in recent years as it has become clear that ITER and many tokamak pilot plant concepts must operate near or above the Greenwald limit to achieve their objectives. Evidence has also grown that the Greenwald scaling - in its remarkable simplicity - may not capture the full complexity of the density limit. In this study, we assemble a multi-machine database to quantify the effectiveness of the Greenwald limit as a predictor of the L-mode density limit and compare it with data-driven approaches. We find that a boundary in the plasma edge involving dimensionless collisionality and pressure, $ν_{*\rm, edge}^{\rm limit} = 3.5 β_{T,{\rm edge}}^{-0.40}$, achieves significantly higher accuracy (false positive rate of 2.3% at a true positive rate of 95%) of predicting density limit disruptions than the Greenwald limit (false positive rate of 13.4% at a true positive rate of 95%) across a multi-machine dataset including metal- and carbon-wall tokamaks (AUG, C-Mod, DIII-D, and TCV). This two-parameter boundary succeeds at predicting L-mode density limits by robustly identifying the radiative state preceding the terminal MHD instability. This boundary can be applied for density limit avoidance in current devices and in ITER, where it can be measured and responded to in real time.
△ Less
Submitted 21 May, 2025; v1 submitted 26 June, 2024;
originally announced June 2024.
-
A machine-learning-based tool for last closed-flux surface reconstruction on tokamaks
Authors:
Chenguang Wan,
Zhi Yu,
Alessandro Pau,
Xiaojuan Liu,
Jiangang Li
Abstract:
Nuclear fusion represents one of the best alternatives for a sustainable source of clean energy. Tokamaks allow to confine fusion plasma with magnetic fields and one of the main challenges in the control of the magnetic configuration is the prediction/reconstruction of the Last Closed-Flux Surface (LCFS). The evolution in time of the LCFS is determined by the interaction of the actuator coils and…
▽ More
Nuclear fusion represents one of the best alternatives for a sustainable source of clean energy. Tokamaks allow to confine fusion plasma with magnetic fields and one of the main challenges in the control of the magnetic configuration is the prediction/reconstruction of the Last Closed-Flux Surface (LCFS). The evolution in time of the LCFS is determined by the interaction of the actuator coils and the internal tokamak plasma. This task requires real-time capable tools able to deal with high-dimensional data as well as with high resolution in time, where the interaction between a wide range of input actuator coils with internal plasma state responses add additional layer of complexity. In this work, we present the application of a novel state of the art machine learning model to the LCFS reconstruction in the Experimental Advanced Superconducting Tokamak (EAST) that learns automatically from the experimental data of EAST. This architecture allows not only offline simulation and testing of a particular control strategy, but can also be embedded in the real-time control system for online magnetic equilibrium reconstruction and prediction. In the real-time modeling test, our approach achieves very high accuracies, with over 99% average similarity in LCFS reconstruction of the entire discharge process.
△ Less
Submitted 20 October, 2022; v1 submitted 12 July, 2022;
originally announced July 2022.
-
First-principles density limit scaling in tokamaks based on edge turbulent transport and implications for ITER
Authors:
M. Giacomin,
A. Pau,
P. Ricci,
O. Sauter,
T. Eich
Abstract:
A first-principles scaling law, based on turbulent transport considerations, and a multi-machine database of density limit discharges from the ASDEX Upgrade, JET and TCV tokamaks, show that the increase of the boundary turbulent transport with the plasma collisionality sets the maximum density achievable in tokamaks. This scaling law shows a strong dependence on the heating power, therefore predic…
▽ More
A first-principles scaling law, based on turbulent transport considerations, and a multi-machine database of density limit discharges from the ASDEX Upgrade, JET and TCV tokamaks, show that the increase of the boundary turbulent transport with the plasma collisionality sets the maximum density achievable in tokamaks. This scaling law shows a strong dependence on the heating power, therefore predicting for ITER a significantly larger safety margin than the Greenwald empirical scaling (Greenwald et al, Nucl. Fusion, 28(12), 1988) in case of unintentional H-L transition.
△ Less
Submitted 6 April, 2022;
originally announced April 2022.
-
EAST discharge prediction without integrating simulation results
Authors:
Chenguang Wan,
Zhi Yu,
Alessandro Pau,
Xiaojuan Liu,
Jiangang Li
Abstract:
In this work, a purely data-driven discharge prediction model was developed and tested without integrating any data or results from simulations. The model was developed based on the experimental data from the Experimental Advanced Superconducting Tokamak (EAST) campaign 2010-2020 discharges and can predict the actual plasma current $I_{p}$, normalized beta $β_{n}$, toroidal beta $β_{t}$, beta polo…
▽ More
In this work, a purely data-driven discharge prediction model was developed and tested without integrating any data or results from simulations. The model was developed based on the experimental data from the Experimental Advanced Superconducting Tokamak (EAST) campaign 2010-2020 discharges and can predict the actual plasma current $I_{p}$, normalized beta $β_{n}$, toroidal beta $β_{t}$, beta poloidal $β_{p}$, electron density $n_{e}$, store energy $W_{mhd}$, loop voltage $V_{loop}$, elongation at plasma boundary $κ$, internal inductance $l_{i}$, q at magnetic axis $q_{0}$, and q at 95% flux surface $q_{95}$. The average similarities of all the selected key diagnostic signals between prediction results and the experimental data are greater than 90%, except for the $V_{loop}$ and $q_{0}$. Before a tokamak experiment, the values of actuator signals are set in the discharge proposal stage, with the model allowing to check the consistency of expected diagnostic signals. The model can give the estimated values of the diagnostic signals to check the reasonableness of the tokamak experimental proposal.
△ Less
Submitted 20 October, 2022; v1 submitted 1 October, 2021;
originally announced October 2021.
-
Plasma Confinement Mode Classification Using a Sequence-to-Sequence Neural Network With Attention
Authors:
Francisco Matos,
Vlado Menkovski,
Alessandro Pau,
Gino Marceca,
Frank Jenko
Abstract:
In a typical fusion experiment, the plasma can have several possible confinement modes. At the TCV tokamak, aside from the Low (L) and High (H) confinement modes, an additional mode, dithering (D), is frequently observed. Developing methods that automatically detect these modes is considered to be important for future tokamak operation. Previous work with deep learning methods, particularly convol…
▽ More
In a typical fusion experiment, the plasma can have several possible confinement modes. At the TCV tokamak, aside from the Low (L) and High (H) confinement modes, an additional mode, dithering (D), is frequently observed. Developing methods that automatically detect these modes is considered to be important for future tokamak operation. Previous work with deep learning methods, particularly convolutional recurrent neural networks (Conv-RNNs), indicates that they are a suitable approach. Nevertheless, those models are sensitive to noise in the temporal alignment of labels, and that model in particular is limited to making individual decisions taking into account only its own hidden state and its input at each time step. In this work, we propose an architecture for a sequence-to-sequence neural network model with attention which solves both of those issues. Using a carefully calibrated dataset, we compare the performance of a Conv-RNN with that of our proposed sequence-to-sequence model, and show two results: one, that the Conv-RNN can be improved upon with new data; two, that the sequence-to-sequence model can improve the results even further, achieving excellent scores on both train and test data.
△ Less
Submitted 2 November, 2020;
originally announced December 2020.
-
Classification of tokamak plasma confinement states with convolutional recurrent neural networks
Authors:
F. Matos,
V. Menkovski,
F. Felici,
A. Pau,
F. Jenko
Abstract:
During a tokamak discharge, the plasma can vary between different confinement regimes: Low (L), High (H) and, in some cases, a temporary (intermediate state), called Dithering (D). In addition, while the plasma is in H mode, Edge Localized Modes (ELMs) can occur. The automatic detection of changes between these states, and of ELMs, is important for tokamak operation. Motivated by this, and by rece…
▽ More
During a tokamak discharge, the plasma can vary between different confinement regimes: Low (L), High (H) and, in some cases, a temporary (intermediate state), called Dithering (D). In addition, while the plasma is in H mode, Edge Localized Modes (ELMs) can occur. The automatic detection of changes between these states, and of ELMs, is important for tokamak operation. Motivated by this, and by recent developments in Deep Learning (DL), we developed and compared two methods for automatic detection of the occurrence of L-D-H transitions and ELMs, applied on data from the TCV tokamak. These methods consist in a Convolutional Neural Network (CNN) and a Convolutional Long Short Term Memory Neural Network (Conv-LSTM). We measured our results with regards to ELMs using ROC curves and Youden's score index, and regarding state detection using Cohen's Kappa Index.
△ Less
Submitted 11 November, 2019;
originally announced November 2019.