Tackling the Accuracy-Interpretability Trade-off in a Hierarchy of Machine Learning Models for the Prediction of Extreme Heatwaves
Authors:
Alessandro Lovo,
Amaury Lancelin,
Corentin Herbert,
Freddy Bouchet
Abstract:
When performing predictions that use Machine Learning (ML), we are mainly interested in performance and interpretability. This generates a natural trade-off, where complex models generally have higher skills but are harder to explain and thus trust. Interpretability is particularly important in the climate community, where we aim at gaining a physical understanding of the underlying phenomena. Eve…
▽ More
When performing predictions that use Machine Learning (ML), we are mainly interested in performance and interpretability. This generates a natural trade-off, where complex models generally have higher skills but are harder to explain and thus trust. Interpretability is particularly important in the climate community, where we aim at gaining a physical understanding of the underlying phenomena. Even more so when the prediction concerns extreme weather events with high impact on society. In this paper, we perform probabilistic forecasts of extreme heatwaves over France, using a hierarchy of increasingly complex ML models, which allows us to find the best compromise between accuracy and interpretability. More precisely, we use models that range from a global Gaussian Approximation (GA) to deep Convolutional Neural Networks (CNNs), with the intermediate steps of a simple Intrinsically Interpretable Neural Network (IINN) and a model using the Scattering Transform (ScatNet). Our findings reveal that CNNs provide higher accuracy, but their black-box nature severely limits interpretability, even when using state-of-the-art Explainable Artificial Intelligence (XAI) tools. In contrast, ScatNet achieves similar performance to CNNs while providing greater transparency, identifying key scales and patterns in the data that drive predictions. This study underscores the potential of interpretability in ML models for climate science, demonstrating that simpler models can rival the performance of their more complex counterparts, all the while being much easier to understand. This gained interpretability is crucial for building trust in model predictions and uncovering new scientific insights, ultimately advancing our understanding and management of extreme weather events.
△ Less
Submitted 28 March, 2025; v1 submitted 1 October, 2024;
originally announced October 2024.
Gaussian Framework and Optimal Projection of Weather Fields for Prediction of Extreme Events
Authors:
Valeria Mascolo,
Alessandro Lovo,
Corentin Herbert,
Freddy Bouchet
Abstract:
Extreme events are the major weather-related hazard for humanity. It is then of crucial importance to have a good understanding of their statistics and to be able to forecast them. However, lack of sufficient data makes their study particularly challenging. In this work, we provide a simple framework for studying extreme events that tackles the lack of data issue by using the entire available data…
▽ More
Extreme events are the major weather-related hazard for humanity. It is then of crucial importance to have a good understanding of their statistics and to be able to forecast them. However, lack of sufficient data makes their study particularly challenging. In this work, we provide a simple framework for studying extreme events that tackles the lack of data issue by using the entire available dataset, rather than focusing on the extremes of the dataset. To do so, we make the assumption that the set of predictors and the observable used to define the extreme event follow a jointly Gaussian distribution. This naturally gives the notion of an optimal projection of the predictors for forecasting the event. We take as a case study extreme heatwaves over France, and we test our method on an 8000-year-long intermediate complexity climate model time series and on the ERA5 reanalysis dataset. For a-posteriori statistics, we observe and motivate the fact that composite maps of very extreme events look similar to less extreme ones. For prediction, we show that our method is competitive with off-the-shelf neural networks on the long dataset and outperforms them on reanalysis. The optimal projection pattern, which makes our forecast intrinsically interpretable, highlights the importance of soil moisture deficit and quasi-stationary Rossby waves as precursors to extreme heatwaves.
△ Less
Submitted 11 March, 2025; v1 submitted 31 May, 2024;
originally announced May 2024.