-
Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data
Authors:
Huan Zhang,
Justin Finkel,
Dorian S. Abbot,
Edwin P. Gerber,
Jonathan Weare
Abstract:
Blocking events are an important cause of extreme weather, especially long-lasting blocking events that trap weather systems in place. The duration of blocking events is, however, underestimated in climate models. Explainable Artificial Intelligence are a class of data analysis methods that can help identify physical causes of prolonged blocking events and diagnose model deficiencies. We demonstra…
▽ More
Blocking events are an important cause of extreme weather, especially long-lasting blocking events that trap weather systems in place. The duration of blocking events is, however, underestimated in climate models. Explainable Artificial Intelligence are a class of data analysis methods that can help identify physical causes of prolonged blocking events and diagnose model deficiencies. We demonstrate this approach on an idealized quasigeostrophic model developed by Marshall and Molteni (1993). We train a convolutional neural network (CNN), and subsequently, build a sparse predictive model for the persistence of Atlantic blocking, conditioned on an initial high-pressure anomaly. Shapley Additive ExPlanation (SHAP) analysis reveals that high-pressure anomalies in the American Southeast and North Atlantic, separated by a trough over Atlantic Canada, contribute significantly to prediction of sustained blocking events in the Atlantic region. This agrees with previous work that identified precursors in the same regions via wave train analysis. When we apply the same CNN to blockings in the ERA5 atmospheric reanalysis, there is insufficient data to accurately predict persistent blocks. We partially overcome this limitation by pre-training the CNN on the plentiful data of the Marshall-Molteni model, and then using Transfer Learning to achieve better predictions than direct training. SHAP analysis before and after transfer learning allows a comparison between the predictive features in the reanalysis and the quasigeostrophic model, quantifying dynamical biases in the idealized model. This work demonstrates the potential for machine learning methods to extract meaningful precursors of extreme weather events and achieve better prediction using limited observational data.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Bringing statistics to storylines: rare event sampling for sudden, transient extreme events
Authors:
Justin Finkel,
Paul A. O'Gorman
Abstract:
A leading goal for climate science and weather risk management is to accurately model both the physics and statistics of extreme events. These two goals are fundamentally at odds: the higher a computational model's resolution, the more expensive are the ensembles needed to capture accurate statistics in the tail of the distribution. Here, we focus on events that are localized in space and time, su…
▽ More
A leading goal for climate science and weather risk management is to accurately model both the physics and statistics of extreme events. These two goals are fundamentally at odds: the higher a computational model's resolution, the more expensive are the ensembles needed to capture accurate statistics in the tail of the distribution. Here, we focus on events that are localized in space and time, such as heavy precipitation events, which can start suddenly and decay rapidly. We advance a method for sampling such events more efficiently than straightforward climate model simulation. Our method combines elements of two recent approaches: adaptive multilevel splitting (AMS), a rare event algorithm that generates rigorous statistics at reduced cost, but that does not work well for sudden, transient extreme events; and "ensemble boosting" which generates physically plausible storylines of these events but not their statistics. We modify AMS by splitting trajectories well in advance of the event's onset following the approach of ensemble boosting, and this is shown to be critical for amplifying and diversifying simulated events in tests with the Lorenz-96 model. Early splitting requires a rejection step that reduces efficiency, but nevertheless we demonstrate improved sampling of extreme local events by a factor of order 10 relative to direct sampling in Lorenz-96. Our work makes progress on the challenge posed by fast dynamical timescales for rare event sampling, and it draws connections with existing methods in reliability engineering which, we believe, can be further exploited for weather risk assessment.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Predicting rare events using neural networks and short-trajectory data
Authors:
John Strahan,
Justin Finkel,
Aaron R. Dinner,
Jonathan Weare
Abstract:
Estimating the likelihood, timing, and nature of events is a major goal of modeling stochastic dynamical systems. When the event is rare in comparison with the timescales of simulation and/or measurement needed to resolve the elemental dynamics, accurate prediction from direct observations becomes challenging. In such cases a more effective approach is to cast statistics of interest as solutions t…
▽ More
Estimating the likelihood, timing, and nature of events is a major goal of modeling stochastic dynamical systems. When the event is rare in comparison with the timescales of simulation and/or measurement needed to resolve the elemental dynamics, accurate prediction from direct observations becomes challenging. In such cases a more effective approach is to cast statistics of interest as solutions to Feynman-Kac equations (partial differential equations). Here, we develop an approach to solve Feynman-Kac equations by training neural networks on short-trajectory data. Our approach is based on a Markov approximation but otherwise avoids assumptions about the underlying model and dynamics. This makes it applicable to treating complex computational models and observational data. We illustrate the advantages of our method using a low-dimensional model that facilitates visualization, and this analysis motivates an adaptive sampling strategy that allows on-the-fly identification of and addition of data to regions important for predicting the statistics of interest. Finally, we demonstrate that we can compute accurate statistics for a 75-dimensional model of sudden stratospheric warming. This system provides a stringent test bed for our method.
△ Less
Submitted 2 March, 2023; v1 submitted 2 August, 2022;
originally announced August 2022.
-
Revealing the statistics of extreme events hidden in short weather forecast data
Authors:
Justin Finkel,
Edwin P. Gerber,
Dorian S. Abbot,
Jonathan Weare
Abstract:
Extreme weather events have significant consequences, dominating the impact of climate on society. While high-resolution weather models can forecast many types of extreme events on synoptic timescales, long-term climatological risk assessment is an altogether different problem. A once-in-a-century event takes, on average, 100 years of simulation time to appear just once, far beyond the typical int…
▽ More
Extreme weather events have significant consequences, dominating the impact of climate on society. While high-resolution weather models can forecast many types of extreme events on synoptic timescales, long-term climatological risk assessment is an altogether different problem. A once-in-a-century event takes, on average, 100 years of simulation time to appear just once, far beyond the typical integration length of a weather forecast model. Therefore, this task is left to cheaper, but less accurate, low-resolution or statistical models. But there is untapped potential in weather model output: despite being short in duration, weather forecast ensembles are produced multiple times a week. Integrations are launched with independent perturbations, causing them to spread apart over time and broadly sample phase space. Collectively, these integrations add up to thousands of years of data. We establish methods to extract climatological information from these short weather simulations. Using ensemble hindcasts by the European Center for Medium-range Weather Forecasting (ECMWF) archived in the subseasonal-to-seasonal (S2S) database, we characterize sudden stratospheric warming (SSW) events with multi-centennial return times. Consistent results are found between alternative methods, including basic counting strategies and Markov state modeling. By carefully combining trajectories together, we obtain estimates of SSW frequencies and their seasonal distributions that are consistent with reanalysis-derived estimates for moderately rare events, but with much tighter uncertainty bounds, and which can be extended to events of unprecedented severity that have not yet been observed historically. These methods hold potential for assessing extreme events throughout the climate system, beyond this example of stratospheric extremes.
△ Less
Submitted 23 January, 2023; v1 submitted 10 June, 2022;
originally announced June 2022.
-
Snow topography on undeformed Arctic sea ice captured by an idealized "snow dune" model
Authors:
Predrag Popović,
Justin Finkel,
Mary C. Silber,
Dorian S. Abbot
Abstract:
Our ability to predict the future of Arctic sea ice is limited by ice's sensitivity to detailed surface conditions such as the distribution of snow and melt ponds. Snow on top of the ice decreases ice's thermal conductivity, increases its reflectivity (albedo), and provides a source of meltwater for melt ponds during summer that decrease the ice's albedo. In this paper, we develop a simple model o…
▽ More
Our ability to predict the future of Arctic sea ice is limited by ice's sensitivity to detailed surface conditions such as the distribution of snow and melt ponds. Snow on top of the ice decreases ice's thermal conductivity, increases its reflectivity (albedo), and provides a source of meltwater for melt ponds during summer that decrease the ice's albedo. In this paper, we develop a simple model of pre-melt snow topography that accurately describes snow cover of flat, undeformed Arctic sea ice on several study sites for which data was available. The model considers a surface that is a sum of randomly sized and placed "snow dunes" represented as Gaussian mounds. This model generalizes the "void model" of Popović et al. (2018) and, as such, accurately describes the statistics of melt pond geometry. We test this model against detailed LiDAR measurements of the pre-melt snow topography. We show that the model snow-depth distribution is statistically indistinguishable from the measurements on flat ice, while small disagreement exists if the ice is deformed. We then use this model to determine analytic expressions for the conductive heat flux through the ice and for melt pond coverage evolution during an early stage of pond formation. We also formulate a criterion for ice to remain pond-free throughout the summer. Results from our model could be directly included in large-scale models, thereby improving our understanding of energy balance on sea ice and allowing for more reliable predictions of Arctic sea ice in a future climate.
△ Less
Submitted 7 March, 2022;
originally announced March 2022.
-
Data-driven transition path analysis yields a statistical understanding of sudden stratospheric warming events in an idealized model
Authors:
Justin Finkel,
Robert J. Webber,
Edwin P. Gerber,
Dorian S. Abbot,
Jonathan Weare
Abstract:
Atmospheric regime transitions are highly impactful as drivers of extreme weather events, but pose two formidable modeling challenges: predicting the next event (weather forecasting), and characterizing the statistics of events of a given severity (the risk climatology). Each event has a different duration and spatial structure, making it hard to define an objective "average event." We argue here…
▽ More
Atmospheric regime transitions are highly impactful as drivers of extreme weather events, but pose two formidable modeling challenges: predicting the next event (weather forecasting), and characterizing the statistics of events of a given severity (the risk climatology). Each event has a different duration and spatial structure, making it hard to define an objective "average event." We argue here that transition path theory (TPT), a stochastic process framework, is an appropriate tool for the task. We demonstrate TPT's capacities on a wave-mean flow model of sudden stratospheric warmings (SSWs) developed by Holton and Mass (1976), which is idealized enough for transparent TPT analysis but complex enough to demonstrate computational scalability. Whereas a recent article (Finkel et al. 2021) studied near-term SSW predictability, the present article uses TPT to link predictability to long-term SSW frequency. This requires not only forecasting forward in time from an initial condition, but also \emph{backward in time} to assess the probability of the initial conditions themselves. TPT enables one to condition the dynamics on the regime transition occurring, and thus visualize its physical drivers with a vector field called the \emph{reactive current}. The reactive current shows that before an SSW, dissipation and stochastic forcing drive a slow decay of vortex strength at lower altitudes. The response of upper-level winds is late and sudden, occurring only after the transition is almost complete from a probabilistic point of view. This case study demonstrates that TPT quantities, visualized in a space of physically meaningful variables, can help one understand the dynamics of regime transitions.
△ Less
Submitted 19 October, 2022; v1 submitted 28 August, 2021;
originally announced August 2021.
-
Learning forecasts of rare stratospheric transitions from short simulations
Authors:
Justin Finkel,
Robert J. Webber,
Dorian S. Abbot,
Edwin P. Gerber,
Jonathan Weare
Abstract:
Rare events arising in nonlinear atmospheric dynamics remain hard to predict and attribute. We address the problem of forecasting rare events in a prototypical example, Sudden Stratospheric Warmings (SSWs). Approximately once every other winter, the boreal stratospheric polar vortex rapidly breaks down, shifting midlatitude surface weather patterns for months. We focus on two key quantities of int…
▽ More
Rare events arising in nonlinear atmospheric dynamics remain hard to predict and attribute. We address the problem of forecasting rare events in a prototypical example, Sudden Stratospheric Warmings (SSWs). Approximately once every other winter, the boreal stratospheric polar vortex rapidly breaks down, shifting midlatitude surface weather patterns for months. We focus on two key quantities of interest: the probability of an SSW occurring, and the expected lead time if it does occur, as functions of initial condition. These \emph{optimal forecasts} concretely measure the event's progress. Direct numerical simulation can estimate them in principle, but is prohibitively expensive in practice: each rare event requires a long integration to observe, and the cost of each integration grows with model complexity. We describe an alternative approach using integrations that are \emph{short} compared to the timescale of the warming event. We compute the probability and lead time efficiently by solving equations involving the transition operator, which encodes all information about the dynamics. We relate these optimal forecasts to a small number of interpretable physical variables, suggesting optimal measurements for forecasting. We illustrate the methodology on a prototype SSW model developed by Holton and Mass (1976) and modified by stochastic forcing. While highly idealized, this model captures the essential nonlinear dynamics of SSWs and exhibits the key forecasting challenge: the dramatic separation in timescales between a single event and the return time between successive events. Our methodology is designed to fully exploit high-dimensional data from models and observations, and has the potential to identify detailed predictors of many complex rare events in meteorology.
△ Less
Submitted 28 August, 2021; v1 submitted 15 February, 2021;
originally announced February 2021.
-
Path properties of atmospheric transitions: illustration with a low-order sudden stratospheric warming model
Authors:
Justin Finkel,
Dorian Abbot,
Jonathan Weare
Abstract:
Many rare weather events, including hurricanes, droughts, and floods, dramatically impact human life. To accurately forecast these events and characterize their climatology requires specialized mathematical techniques to fully leverage the limited data that are available. Here we describe \emph{transition path theory} (TPT), a framework originally developed for molecular simulation, and argue that…
▽ More
Many rare weather events, including hurricanes, droughts, and floods, dramatically impact human life. To accurately forecast these events and characterize their climatology requires specialized mathematical techniques to fully leverage the limited data that are available. Here we describe \emph{transition path theory} (TPT), a framework originally developed for molecular simulation, and argue that it is a useful paradigm for developing mechanistic understanding of rare climate events. TPT provides a method to calculate statistical properties of the paths into the event. As an initial demonstration of the utility of TPT, we analyze a low-order model of sudden stratospheric warming (SSW), a dramatic disturbance to the polar vortex which can induce extreme cold spells at the surface in the midlatitudes. SSW events pose a major challenge for seasonal weather prediction because of their rapid, complex onset and development. Climate models struggle to capture the long-term statistics of SSW, owing to their diversity and intermittent nature. We use a stochastically forced Holton-Mass-type model with two stable states, corresponding to radiative equilibrium and a vacillating SSW-like regime. In this stochastic bistable setting, from certain probabilistic forecasts TPT facilitates estimation of dominant transition pathways and return times of transitions. These "dynamical statistics" are obtained by solving partial differential equations in the model's phase space. With future application to more complex models, TPT and its constituent quantities promise to improve the predictability of extreme weather events, through both generation and principled evaluation of forecasts.
△ Less
Submitted 26 May, 2020;
originally announced May 2020.
-
Changing World Extreme Temperature Statistics
Authors:
J. M. Finkel,
J. I. Katz
Abstract:
We use the Global Historical Climatology Network--daily database to calculate a nonparametric statistic that describes the rate at which all-time daily high and low temperature records have been set in nine geographic regions (continents or major portions of continents) during periods mostly from the mid-20th Century to the present. This statistic was defined in our earlier work on temperature rec…
▽ More
We use the Global Historical Climatology Network--daily database to calculate a nonparametric statistic that describes the rate at which all-time daily high and low temperature records have been set in nine geographic regions (continents or major portions of continents) during periods mostly from the mid-20th Century to the present. This statistic was defined in our earlier work on temperature records in the 48 contiguous United States. In contrast to this earlier work, we find that in every region except North America all-time high records were set at a rate significantly (at least $3σ$) higher than in the null hypothesis of a stationary climate. Except in Antarctica, all-time low records were set at a rate significantly lower than in the null hypothesis. In Europe, North Africa and North Asia the rate of setting new all-time highs increased suddenly in the 1990's, suggesting a change in regional climate regime; in most other regions there was a steadier increase.
△ Less
Submitted 15 August, 2017;
originally announced August 2017.
-
Changing U.S. Extreme Temperature Statistics
Authors:
J. M. Finkel,
J. I. Katz
Abstract:
The rise in global mean temperature is an incomplete description of warming. For many purposes, including agriculture and human life, temperature extremes may be more important than temperature means and changes in local extremes may be more important than mean global changes. We define a nonparametric statistic to describe extreme temperature behavior by quantifying the frequency of local daily a…
▽ More
The rise in global mean temperature is an incomplete description of warming. For many purposes, including agriculture and human life, temperature extremes may be more important than temperature means and changes in local extremes may be more important than mean global changes. We define a nonparametric statistic to describe extreme temperature behavior by quantifying the frequency of local daily all-time highs and lows, normalized by their frequency in the null hypothesis of no climate change. We average this metric over 1218 weather stations in the 48 contiguous United States, and find significantly fewer all-time lows than for the null hypothesis of unchanging climate. Record highs, by contrast, exhibit no significant trend. The metric is evaluated by Monte Carlo simulation for stationary and warming temperature distributions, permitting comparison of the statistics of historic temperature records with those of modeled behavior.
△ Less
Submitted 18 January, 2017;
originally announced January 2017.