-
Reduced Cloud Cover Errors in a Hybrid AI-Climate Model Through Equation Discovery And Automatic Tuning
Authors:
Arthur Grundner,
Tom Beucler,
Julien Savre,
Axel Lauer,
Manuel Schlund,
Veronika Eyring
Abstract:
Climate models rely on parameterizations that account for the effects of small-scale processes on large-scale dynamics. Particularly cloud-related parameterizations remain a major source of uncertainty in climate projections. While hybrid Earth system models (ESMs) with machine learning-based parameterizations could improve current ESMs, deep learning approaches often lack interpretability, physical consistency, and computational efficiency. Furthermore, most data-driven parameterizations are trained in a stand-alone fashion and fail within ESMs, partly due to the difficulty of tuning the ESM to accommodate new, non-traditional schemes. In this study, we introduce a novel two-step pipeline for improving a climate model with data-driven parameterizations. First, we incorporate a physically consistent, data-driven cloud cover parameterization into the ICON global atmospheric model. The parameterization, a diagnostic equation derived from storm-resolving simulations via symbolic regression, retains the interpretability and efficiency of traditional parameterizations while improving fidelity. Second, we introduce an automated, gradient-free tuning procedure to recalibrate the new climate model with Earth observations. We employ the Nelder-Mead algorithm and progressively increase simulation length, making our approach simple, computationally efficient, and easily extendable to other ESMs. The tuned hybrid model significantly reduces some long-standing biases in cloud cover and radiative budgets, particularly over regions such as the Southern Ocean and the subtropical stratocumulus regions. Moreover, it remains robust under +4K surface warming. Our results highlight the potential of data-driven parameterizations when combined with model tuning. This framework offers an automatic, efficient and practical approach to enhancing climate projections without losing performance or interpretability.
Submitted 7 May, 2025;
originally announced May 2025.
-
From Winter Storm Thermodynamics to Wind Gust Extremes: Discovering Interpretable Equations from Data
Authors:
Frederick Iat-Hin Tam,
Fabien Augsburger,
Tom Beucler
Abstract:
Reliably identifying and understanding temporal precursors to extreme wind gusts is crucial for early warning and mitigation. This study proposes a simple data-driven approach to extract key predictors from a dataset of historical extreme European winter windstorms and derive simple equations linking these precursors to extreme gusts over land. A major challenge is the limited training data for extreme events, increasing the risk of model overfitting. Testing various mitigation strategies, we find that combining dimensionality reduction, careful cross-validation, feature selection, and a nonlinear transformation of maximum wind gusts informed by Generalized Extreme Value distributions successfully reduces overfitting. These measures yield interpretable equations that generalize across regions while maintaining satisfactory predictive skill. The discovered equations reveal the association between a steady drying low-troposphere before landfall and wind gust intensity in Northwestern Europe.
Submitted 11 April, 2025; v1 submitted 10 April, 2025;
originally announced April 2025.
-
Improving Predictions of Convective Storm Wind Gusts through Statistical Post-Processing of Neural Weather Models
Authors:
Antoine Leclerc,
Erwan Koch,
Monika Feldmann,
Daniele Nerini,
Tom Beucler
Abstract:
Issuing timely severe weather warnings helps mitigate potentially disastrous consequences. Recent advancements in Neural Weather Models (NWMs) offer a computationally inexpensive and fast approach for forecasting atmospheric environments on a 0.25° global grid. For thunderstorms, these environments can be empirically post-processed to predict wind gust distributions at specific locations. With the Pangu-Weather NWM, we apply a hierarchy of statistical and deep learning post-processing methods to forecast hourly wind gusts up to three days ahead. To ensure statistical robustness, we constrain our probabilistic forecasts using generalised extreme-value distributions across five regions in Switzerland. Using a convolutional neural network to post-process the predicted atmospheric environment's spatial patterns yields the best results, outperforming direct forecasting approaches across lead times and wind gust speeds. Our results confirm the added value of NWMs for extreme wind forecasting, especially for designing more responsive early-warning systems.
Submitted 31 March, 2025;
originally announced April 2025.
-
Distilling Machine Learning's Added Value: Pareto Fronts in Atmospheric Applications
Authors:
Tom Beucler,
Arthur Grundner,
Sara Shamekh,
Peter Ukkonen,
Matthew Chantry,
Ryan Lagerquist
Abstract:
The added value of machine learning for weather and climate applications is measurable through performance metrics, but explaining it remains challenging, particularly for large deep learning models. Inspired by climate model hierarchies, we propose that a full hierarchy of Pareto-optimal models, defined within an appropriately determined error-complexity plane, can guide model development and help understand the models' added value. We demonstrate the use of Pareto fronts in atmospheric physics through three sample applications, with hierarchies ranging from semi-empirical models with minimal parameters to deep learning algorithms. First, in cloud cover parameterization, we find that neural networks identify nonlinear relationships between cloud cover and its thermodynamic environment, and assimilate previously neglected features such as vertical gradients in relative humidity that improve the representation of low cloud cover. This added value is condensed into a ten-parameter equation that rivals deep learning models. Second, we establish a machine learning model hierarchy for emulating shortwave radiative transfer, distilling the importance of bidirectional vertical connectivity for accurately representing absorption and scattering, especially for multiple cloud layers. Third, we emphasize the importance of convective organization information when modeling the relationship between tropical precipitation and its surrounding environment. We discuss the added value of temporal memory when high-resolution spatial information is unavailable, with implications for precipitation parameterization. Therefore, by comparing data-driven models directly with existing schemes using Pareto optimality, we promote process understanding by hierarchically unveiling system complexity, with the hope of improving the trustworthiness of machine learning models in atmospheric applications.
Submitted 18 January, 2025; v1 submitted 4 August, 2024;
originally announced August 2024.
-
Lightning-Fast Convective Outlooks: Predicting Severe Convective Environments with Global AI-based Weather Models
Authors:
Monika Feldmann,
Tom Beucler,
Milton Gomez,
Olivia Martius
Abstract:
Severe convective storms are among the most dangerous weather phenomena and accurate forecasts mitigate their impacts. The recently released suite of AI-based weather models produces medium-range forecasts within seconds, with a skill similar to state-of-the-art operational forecasts for variables on single levels. However, predicting severe thunderstorm environments requires accurate combinations of dynamic and thermodynamic variables and the vertical structure of the atmosphere. Advancing the assessment of AI-models towards process-based evaluations lays the foundation for hazard-driven applications. We assess the forecast skill of three top-performing AI-models for convective parameters at lead-times of up to 10 days against reanalysis and ECMWF's operational numerical weather prediction model IFS. In a case study and seasonal analyses, we see the best performance by GraphCast and Pangu-Weather: these models match or even exceed the performance of IFS for instability and shear. This opens opportunities for fast and inexpensive predictions of severe weather environments.
Submitted 9 September, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
Simulating Atmospheric Processes in Earth System Models and Quantifying Uncertainties with Deep Learning Multi-Member and Stochastic Parameterizations
Authors:
Gunnar Behrens,
Tom Beucler,
Fernando Iglesias-Suarez,
Sungduk Yu,
Pierre Gentine,
Michael Pritchard,
Mierk Schwabe,
Veronika Eyring
Abstract:
Deep learning is a powerful tool to represent subgrid processes in climate models, but many application cases have so far used idealized settings and deterministic approaches. Here, we develop stochastic parameterizations with calibrated uncertainty quantification to learn subgrid convective and turbulent processes and surface radiative fluxes of a superparameterization (SP) embedded in an Earth System Model (ESM). We explore three methods to construct stochastic parameterizations: 1) a single Deep Neural Network (DNN) with Monte Carlo Dropout; 2) a multi-member parameterization; and 3) a Variational Encoder Decoder with latent space perturbation. We show that the multi-member (MM) parameterization improves the representation of convective processes, especially in the planetary boundary layer, compared to individual DNNs. The respective uncertainty quantification illustrates that methods 2) and 3) are advantageous compared to a dropout-based DNN parameterization regarding the spread of convective processes. Hybrid simulations with our best-performing MM parameterizations remain challenging and crash within the first days. Therefore, we develop a pragmatic partial coupling strategy relying on the SP for condensate emulation. Partial coupling reduces the computational efficiency of hybrid Earth-like simulations but enables model stability over 5 months with our MM parameterizations. However, our hybrid simulations exhibit biases in thermodynamic fields and differences in precipitation patterns. Nevertheless, the MM parameterizations enable improvements in reproducing tropical extreme precipitation compared to a traditional convection parameterization. Despite these challenges, our results indicate the potential of a new generation of MM machine learning parameterizations leveraging uncertainty quantification to improve the representation of stochasticity of subgrid effects.
Submitted 18 February, 2025; v1 submitted 5 February, 2024;
originally announced February 2024.
-
Identifying Three-Dimensional Radiative Patterns Associated with Early Tropical Cyclone Intensification
Authors:
Frederick Iat-Hin Tam,
Tom Beucler,
James H. Ruppert Jr
Abstract:
Cloud radiative feedback impacts early tropical cyclone (TC) intensification, but limitations in existing diagnostic frameworks make them unsuitable for studying asymmetric or transient radiative heating. We propose a linear Variational Encoder-Decoder (VED) to learn the hidden relationship between radiation and the surface intensification of realistic simulated TCs. Limiting VED model inputs enables using its uncertainty to identify periods when radiation has more importance for intensification. A close examination of the extracted 3D radiative structures suggests that longwave radiative forcing from inner core deep convection and shallow clouds both contribute to intensification, with the deep convection having the most impact overall. We find that deep convection downwind of the shallow clouds is critical to the intensification of Haiyan. Our work demonstrates that machine learning can discover thermodynamic-kinematic relationships without relying on axisymmetric or deterministic assumptions, paving the way towards the objective discovery of processes leading to TC intensification in realistic conditions.
Submitted 4 October, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Lessons Learned: Reproducibility, Replicability, and When to Stop
Authors:
Milton S. Gomez,
Tom Beucler
Abstract:
While extensive guidance exists for ensuring the reproducibility of one's own study, there is little discussion regarding the reproduction and replication of external studies within one's own research. To initiate this discussion, drawing lessons from our experience reproducing an operational product for predicting tropical cyclogenesis, we present a two-dimensional framework to offer guidance on reproduction and replication. Our framework, representing model fitting on one axis and its use in inference on the other, builds upon three key aspects: the dataset, the metrics, and the model itself. By assessing the trajectories of our studies on this 2D plane, we can better inform the claims made using our research. Additionally, we use this framework to contextualize the utility of benchmark datasets in the atmospheric sciences. Our two-dimensional framework provides a tool for researchers, especially early career researchers, to incorporate prior work in their own research and to inform the claims they can make in this context.
Submitted 9 January, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Stress-testing the coupled behavior of hybrid physics-machine learning climate simulations on an unseen, warmer climate
Authors:
Jerry Lin,
Mohamed Aziz Bhouri,
Tom Beucler,
Sungduk Yu,
Michael Pritchard
Abstract:
Accurate and computationally viable representations of clouds and turbulence are a long-standing challenge for climate model development. Traditional parameterizations that crudely but efficiently approximate these processes are a leading source of uncertainty in long-term projected warming and precipitation patterns. Machine Learning (ML)-based parameterizations have long been hailed as a promising alternative with the potential to yield higher accuracy at a fraction of the cost of more explicit simulations. However, these ML variants are often unpredictably unstable and inaccurate in coupled testing (i.e., in a downstream hybrid simulation task where they are dynamically interacting with the large-scale climate model). These issues are exacerbated in out-of-distribution climates. Certain design decisions such as "climate-invariant" feature transformation for moisture inputs, input vector expansion, and temporal history incorporation have been shown to improve coupled performance, but they may be insufficient for coupled out-of-distribution generalization. If feature selection and transformations can inoculate hybrid physics-ML climate models from non-physical, out-of-distribution extrapolation in a changing climate, there is far greater potential in extrapolating from observational data. Otherwise, training on multiple simulated climates becomes an inevitable necessity. While our results show generalization benefits from these design decisions, the obtained improvement does not sufficiently preclude the necessity of using multi-climate simulated training data.
Submitted 4 January, 2024;
originally announced January 2024.
-
Next-Generation Earth System Models: Towards Reliable Hybrid Models for Weather and Climate Applications
Authors:
Tom Beucler,
Erwan Koch,
Sven Kotlarski,
David Leutwyler,
Adrien Michel,
Jonathan Koh
Abstract:
We review how machine learning has transformed our ability to model the Earth system, and how we expect recent breakthroughs to benefit end-users in Switzerland in the near future. Drawing from our review, we identify three recommendations.
Recommendation 1: Develop Hybrid AI-Physical Models: Emphasize the integration of AI and physical modeling for improved reliability, especially for longer prediction horizons, acknowledging the delicate balance between knowledge-based and data-driven components required for optimal performance. Recommendation 2: Emphasize Robustness in AI Downscaling Approaches, favoring techniques that respect physical laws, preserve inter-variable dependencies and spatial structures, and accurately represent extremes at the local scale. Recommendation 3: Promote Inclusive Model Development: Ensure Earth System Model development is open and accessible to diverse stakeholders, enabling forecasters, the public, and AI/statistics experts to use, develop, and engage with the model and its predictions/projections.
Submitted 26 January, 2024; v1 submitted 22 November, 2023;
originally announced November 2023.
-
Navigating the Noise: Bringing Clarity to ML Parameterization Design with O(100) Ensembles
Authors:
Jerry Lin,
Sungduk Yu,
Liran Peng,
Tom Beucler,
Eliot Wong-Toi,
Zeyuan Hu,
Pierre Gentine,
Margarita Geleta,
Mike Pritchard
Abstract:
Machine-learning (ML) parameterizations of subgrid processes (here of turbulence, convection, and radiation) may one day replace conventional parameterizations by emulating high-resolution physics without the cost of explicit simulation. However, uncertainty about the relationship between offline and online performance (i.e., when integrated with a large-scale general circulation model (GCM)) hinders their development. Much of this uncertainty stems from limited sampling of the noisy, emergent effects of upstream ML design decisions on downstream online hybrid simulation. Our work rectifies the sampling issue via the construction of a semi-automated, end-to-end pipeline for $\mathcal{O}(100)$ size ensembles of hybrid simulations, revealing important nuances in how systematic reductions in offline error manifest in changes to online error and online stability. For example, removing dropout and switching from a Mean Squared Error (MSE) to a Mean Absolute Error (MAE) loss both reduce offline error, but they have opposite effects on online error and online stability. Other design decisions, like incorporating memory, converting moisture input from specific humidity to relative humidity, using batch normalization, and training on multiple climates do not come with any such compromises. Finally, we show that ensemble sizes of $\mathcal{O}(100)$ may be necessary to reliably detect causally relevant differences online. By enabling rapid online experimentation at scale, we can empirically settle debates regarding subgrid ML parameterization design that would have otherwise remained unresolved in the noise.
Submitted 17 December, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
ClimSim-Online: A Large Multi-scale Dataset and Framework for Hybrid ML-physics Climate Emulation
Authors:
Sungduk Yu,
Zeyuan Hu,
Akshay Subramaniam,
Walter Hannah,
Liran Peng,
Jerry Lin,
Mohamed Aziz Bhouri,
Ritwik Gupta,
Björn Lütjens,
Justus C. Will,
Gunnar Behrens,
Julius J. M. Busecke,
Nora Loose,
Charles I. Stern,
Tom Beucler,
Bryce Harrop,
Helge Heuer,
Benjamin R. Hillman,
Andrea Jenney,
Nana Liu,
Alistair White,
Tian Zheng,
Zhiming Kuang,
Fiaz Ahmed,
Elizabeth Barnes
, et al. (22 additional authors not shown)
Abstract:
Modern climate projections lack adequate spatial and temporal resolution due to computational constraints, leading to inaccuracies in representing critical processes like thunderstorms that occur on the sub-resolution scale. Hybrid methods combining physics with machine learning (ML) offer faster, higher fidelity climate simulations by outsourcing compute-hungry, high-resolution simulations to ML emulators. However, these hybrid ML-physics simulations require domain-specific data and workflows that have been inaccessible to many ML experts. As an extension of the ClimSim dataset (Yu et al., 2024), we present ClimSim-Online, which also includes an end-to-end workflow for developing hybrid ML-physics simulators. The ClimSim dataset includes 5.7 billion pairs of multivariate input/output vectors, capturing the influence of high-resolution, high-fidelity physics on a host climate simulator's macro-scale state. The dataset is global and spans ten years at a high sampling frequency. We provide a cross-platform, containerized pipeline to integrate ML models into operational climate simulators for hybrid testing. We also implement various ML baselines, alongside a hybrid baseline simulator, to highlight the ML challenges of building stable, skillful emulators. The data (https://huggingface.co/datasets/LEAP/ClimSim_high-res) and code (https://leap-stc.github.io/ClimSim and https://github.com/leap-stc/climsim-online) are publicly released to support the development of hybrid ML-physics and high-fidelity climate simulations.
Submitted 8 July, 2024; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Causally-informed deep learning to improve climate models and projections
Authors:
Fernando Iglesias-Suarez,
Pierre Gentine,
Breixo Solino-Fernandez,
Tom Beucler,
Michael Pritchard,
Jakob Runge,
Veronika Eyring
Abstract:
Climate models are essential to understand and project climate change, yet long-standing biases and uncertainties in their projections remain. This is largely associated with the representation of subgrid-scale processes, particularly clouds and convection. Deep learning can learn these subgrid-scale processes from computationally expensive storm-resolving models while retaining many features at a fraction of computational cost. Yet, climate simulations with embedded neural network parameterizations are still challenging and highly depend on the deep learning solution. This is likely associated with spurious non-physical correlations learned by the neural networks due to the complexity of the physical dynamical system. Here, we show that the combination of causality with deep learning helps remove spurious correlations and optimize the neural network algorithm. To resolve this, we apply a causal discovery method to unveil causal drivers in the set of input predictors of atmospheric subgrid-scale processes of a superparameterized climate model in which deep convection is explicitly resolved. The resulting causally-informed neural networks are coupled to the climate model, thereby replacing the superparameterization and radiation scheme. We show that the climate simulations with causally-informed neural network parameterizations retain many convection-related properties and accurately generate the climate of the original high-resolution climate model, while retaining similar generalization capabilities to unseen climates compared to the non-causal approach. The combination of causal discovery and deep learning is a new and promising approach that leads to stable and more trustworthy climate simulations and paves the way towards more physically-based causal deep learning approaches also in other scientific disciplines.
Submitted 20 March, 2024; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Data-Driven Equation Discovery of a Cloud Cover Parameterization
Authors:
Arthur Grundner,
Tom Beucler,
Pierre Gentine,
Veronika Eyring
Abstract:
A promising method for improving the representation of clouds in climate models, and hence climate projections, is to develop machine learning-based parameterizations using output from global storm-resolving models. While neural networks can achieve state-of-the-art performance within their training distribution, they can make unreliable predictions outside of it. Additionally, they often require post-hoc tools for interpretation. To avoid these limitations, we combine symbolic regression, sequential feature selection, and physical constraints in a hierarchical modeling framework. This framework allows us to discover new equations diagnosing cloud cover from coarse-grained variables of global storm-resolving model simulations. These analytical equations are interpretable by construction and easily transferable to other grids or climate models. Our best equation balances performance and complexity, achieving a performance comparable to that of neural networks ($R^2=0.94$) while remaining simple (with only 11 trainable parameters). It reproduces cloud cover distributions more accurately than the Xu-Randall scheme across all cloud regimes (Hellinger distances $<0.09$), and matches neural networks in condensate-rich regimes. When applied and fine-tuned to the ERA5 reanalysis, the equation exhibits superior transferability to new data compared to all other optimal cloud cover schemes. Our findings demonstrate the effectiveness of symbolic regression in discovering interpretable, physically-consistent, and nonlinear equations to parameterize cloud cover.
Submitted 19 February, 2024; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Selecting Robust Features for Machine Learning Applications using Multidata Causal Discovery
Authors:
Saranya Ganesh S.,
Tom Beucler,
Frederick Iat-Hin Tam,
Milton S. Gomez,
Jakob Runge,
Andreas Gerhardus
Abstract:
Robust feature selection is vital for creating reliable and interpretable Machine Learning (ML) models. When designing statistical prediction models in cases where domain knowledge is limited and underlying interactions are unknown, choosing the optimal set of features is often difficult. To mitigate this issue, we introduce a Multidata (M) causal feature selection approach that simultaneously processes an ensemble of time series datasets and produces a single set of causal drivers. This approach uses the causal discovery algorithms PC1 or PCMCI that are implemented in the Tigramite Python package. These algorithms utilize conditional independence tests to infer parts of the causal graph. Our causal feature selection approach filters out causally-spurious links before passing the remaining causal features as inputs to ML models (Multiple linear regression, Random Forest) that predict the targets. We apply our framework to the statistical intensity prediction of Western Pacific Tropical Cyclones (TC), for which it is often difficult to accurately choose drivers and their dimensionality reduction (time lags, vertical levels, and area-averaging). Using more stringent significance thresholds in the conditional independence tests helps eliminate spurious causal relationships, thus helping the ML model generalize better to unseen TC cases. M-PC1 with a reduced number of features outperforms M-PCMCI, non-causal ML, and other feature selection methods (lagged correlation, random), even slightly outperforming feature selection based on eXplainable Artificial Intelligence. The optimal causal drivers obtained from our causal feature selection help improve our understanding of underlying relationships and suggest new potential drivers of TC intensification.
Submitted 30 June, 2023; v1 submitted 11 April, 2023;
originally announced April 2023.
-
Physics-constrained deep learning postprocessing of temperature and humidity
Authors:
Francesco Zanetta,
Daniele Nerini,
Tom Beucler,
Mark A. Liniger
Abstract:
Weather forecasting centers currently rely on statistical postprocessing methods to minimize forecast error. This improves skill but can lead to predictions that violate physical principles or disregard dependencies between variables, which can be problematic for downstream applications and for the trustworthiness of postprocessing models, especially when they are based on new machine learning approaches. Building on recent advances in physics-informed machine learning, we propose to achieve physical consistency in deep learning-based postprocessing models by integrating meteorological expertise in the form of analytic equations. Applied to the post-processing of surface weather in Switzerland, we find that constraining a neural network to enforce thermodynamic state equations yields physically-consistent predictions of temperature and humidity without compromising performance. Our approach is especially advantageous when data is scarce, and our findings suggest that incorporating domain expertise into postprocessing models makes it possible to optimize weather forecast information while satisfying application-specific requirements.
Submitted 19 May, 2023; v1 submitted 7 December, 2022;
originally announced December 2022.
-
Understanding Extreme Precipitation Changes through Unsupervised Machine Learning
Authors:
Griffin Mooers,
Tom Beucler,
Mike Pritchard,
Stephan Mandt
Abstract:
Despite the importance of quantifying how the spatial patterns of extreme precipitation will change with warming, we lack tools to objectively analyze the storm-scale outputs of modern climate models. To address this gap, we develop an unsupervised machine learning framework to quantify how storm dynamics affect changes in precipitation extremes, without sacrificing spatial information. For the upper precipitation quantiles (above the 80th percentile), we find that the spatial patterns of extreme precipitation changes are dominated by spatial shifts in storm dynamical regimes rather than changes in how these storm regimes produce precipitation. Our study shows how unsupervised machine learning, paired with domain knowledge, may allow us to better understand the physics of the atmosphere and anticipate the changes associated with a warming world.
Submitted 1 December, 2023; v1 submitted 3 November, 2022;
originally announced November 2022.
-
Comparing Storm Resolving Models and Climates via Unsupervised Machine Learning
Authors:
Griffin Mooers,
Mike Pritchard,
Tom Beucler,
Prakhar Srivastava,
Harshini Mangipudi,
Liran Peng,
Pierre Gentine,
Stephan Mandt
Abstract:
Global Storm-Resolving Models (GSRMs) have gained widespread interest because of the unprecedented detail with which they resolve the global climate. However, it remains difficult to quantify objective differences in how GSRMs resolve complex atmospheric formations. This lack of comprehensive tools for comparing model similarities is a problem in many disparate fields that involve simulation tools for complex data. To address this challenge we develop methods to estimate distributional distances based on both nonlinear dimensionality reduction and vector quantization. Our approach automatically learns physically meaningful notions of similarity from low-dimensional latent data representations that the different models produce. This enables an intercomparison of nine GSRMs based on their high-dimensional simulation data (2D vertical velocity snapshots) and reveals that only six are similar in their representation of atmospheric dynamics. Furthermore, we uncover signatures of the convective response to global warming in a fully unsupervised way. Our study provides a path toward evaluating future high-resolution simulation data more objectively.
Submitted 2 December, 2023; v1 submitted 24 August, 2022;
originally announced August 2022.
-
Non-Linear Dimensionality Reduction with a Variational Encoder Decoder to Understand Convective Processes in Climate Models
Authors:
Gunnar Behrens,
Tom Beucler,
Pierre Gentine,
Fernando Iglesias-Suarez,
Michael Pritchard,
Veronika Eyring
Abstract:
Deep learning can accurately represent sub-grid-scale convective processes in climate models, learning from high resolution simulations. However, deep learning methods usually lack interpretability due to large internal dimensionality, resulting in reduced trustworthiness in these methods. Here, we use Variational Encoder Decoder structures (VED), a non-linear dimensionality reduction technique, to learn and understand convective processes in an aquaplanet superparameterized climate model simulation, where deep convective processes are simulated explicitly. We show that similar to previous deep learning studies based on feed-forward neural nets, the VED is capable of learning and accurately reproducing convective processes. In contrast to past work, we show this can be achieved by compressing the original information into only five latent nodes. As a result, the VED can be used to understand convective processes and delineate modes of convection through the exploration of its latent dimensions. A close investigation of the latent space enables the identification of different convective regimes: a) stable conditions are clearly distinguished from deep convection with low outgoing longwave radiation and strong precipitation; b) high optically thin cirrus-like clouds are separated from low optically thick cumulus clouds; and c) shallow convective processes are associated with large-scale moisture content and surface diabatic heating. Our results demonstrate that VEDs can accurately represent convective processes in climate models, while enabling interpretability and better understanding of sub-grid-scale physical processes, paving the way to increasingly interpretable machine learning parameterizations with promising generative properties.
Submitted 26 July, 2022; v1 submitted 19 April, 2022;
originally announced April 2022.
-
Deep Learning Based Cloud Cover Parameterization for ICON
Authors:
Arthur Grundner,
Tom Beucler,
Pierre Gentine,
Fernando Iglesias-Suarez,
Marco A. Giorgetta,
Veronika Eyring
Abstract:
A promising approach to improve cloud parameterizations within climate models and thus climate projections is to use deep learning in combination with training data from storm-resolving model (SRM) simulations. The ICOsahedral Non-hydrostatic (ICON) modeling framework permits simulations ranging from numerical weather prediction to climate projections, making it an ideal target to develop neural network (NN) based parameterizations for sub-grid scale processes. Within the ICON framework, we train NN based cloud cover parameterizations with coarse-grained data based on realistic regional and global ICON SRM simulations. We set up three different types of NNs that differ in the degree of vertical locality they assume for diagnosing cloud cover from coarse-grained atmospheric state variables. The NNs accurately estimate sub-grid scale cloud cover from coarse-grained data that has similar geographical characteristics as their training data. Additionally, globally trained NNs can reproduce sub-grid scale cloud cover of the regional SRM simulation. Using the game-theory based interpretability library SHapley Additive exPlanations, we identify an overemphasis on specific humidity and cloud ice as the reason why our column-based NN cannot perfectly generalize from the global to the regional coarse-grained SRM data. The interpretability tool also helps visualize similarities and differences in feature importance between regionally and globally trained column-based NNs, and reveals a local relationship between their cloud cover predictions and the thermodynamic environment. Our results show the potential of deep learning to derive accurate yet interpretable cloud cover parameterizations from global SRMs, and suggest that neighborhood-based models may be a good compromise between accuracy and generalizability.
Submitted 6 December, 2022; v1 submitted 21 December, 2021;
originally announced December 2021.
-
Climate-Invariant Machine Learning
Authors:
Tom Beucler,
Pierre Gentine,
Janni Yuval,
Ankitesh Gupta,
Liran Peng,
Jerry Lin,
Sungduk Yu,
Stephan Rasp,
Fiaz Ahmed,
Paul A. O'Gorman,
J. David Neelin,
Nicholas J. Lutsko,
Michael Pritchard
Abstract:
Projecting climate change is a generalization problem: we extrapolate the recent past using physical models across past, present, and future climates. Current climate models require representations of processes that occur at scales smaller than model grid size, which have been the main source of model projection uncertainty. Recent machine learning (ML) algorithms hold promise to improve such process representations, but tend to extrapolate poorly to climate regimes they were not trained on. To get the best of the physical and statistical worlds, we propose a new framework - termed "climate-invariant" ML - incorporating knowledge of climate processes into ML algorithms, and show that it can maintain high offline accuracy across a wide range of climate conditions and configurations in three distinct atmospheric models. Our results suggest that explicitly incorporating physical knowledge into data-driven models of Earth system processes can improve their consistency, data efficiency, and generalizability across climate regimes.
Submitted 17 January, 2024; v1 submitted 14 December, 2021;
originally announced December 2021.
-
Analyzing High-Resolution Clouds and Convection using Multi-Channel VAEs
Authors:
Harshini Mangipudi,
Griffin Mooers,
Mike Pritchard,
Tom Beucler,
Stephan Mandt
Abstract:
Understanding the details of small-scale convection and storm formation is crucial to accurately represent the larger-scale planetary dynamics. Presently, atmospheric scientists run high-resolution, storm-resolving simulations to capture these kilometer-scale weather details. However, because they contain abundant information, these simulations can be overwhelming to analyze using conventional approaches. This paper takes a data-driven approach and jointly embeds spatial arrays of vertical wind velocities, temperatures, and water vapor information as three "channels" of a VAE architecture. Our "multi-channel VAE" results in more interpretable and robust latent structures than earlier work analyzing vertical velocities in isolation. Analyzing and clustering the VAE's latent space identifies weather patterns and their geographical manifestations in a fully unsupervised fashion. Our approach shows that VAEs can play essential roles in analyzing high-dimensional simulation data and extracting critical weather and climate characteristics.
Submitted 1 December, 2021;
originally announced December 2021.
-
Assessing the Potential of Deep Learning for Emulating Cloud Superparameterization in Climate Models with Real-Geography Boundary Conditions
Authors:
Griffin Mooers,
Mike Pritchard,
Tom Beucler,
Jordan Ott,
Galen Yacalis,
Pierre Baldi,
Pierre Gentine
Abstract:
We explore the potential of feed-forward deep neural networks (DNNs) for emulating cloud superparameterization in realistic geography, using offline fits to data from the Super Parameterized Community Atmospheric Model. To identify the network architecture of greatest skill, we formally optimize hyperparameters using ~250 trials. Our DNN explains over 70 percent of the temporal variance at the 15-minute sampling scale throughout the mid-to-upper troposphere. Autocorrelation timescale analysis compared against DNN skill suggests the less good fit in the tropical, marine boundary layer is driven by neural network difficulty emulating fast, stochastic signals in convection. However, spectral analysis in the temporal domain indicates skillful emulation of signals on diurnal to synoptic scales. A close look at the diurnal cycle reveals correct emulation of land-sea contrasts and vertical structure in the heating and moistening fields, but some distortion of precipitation. Sensitivity tests targeting precipitation skill reveal complementary effects of adding positive constraints vs. hyperparameter tuning, motivating the use of both in the future. A first attempt to force an offline land model with DNN emulated atmospheric fields produces reassuring results further supporting neural network emulation viability in real-geography settings. Overall, the fit skill is competitive with recent attempts by sophisticated Residual and Convolutional Neural Network architectures trained on added information, including memory of past states. Our results confirm the parameterizability of superparameterized convection with continents through machine learning and we highlight advantages of casting this problem locally in space and time for accurate emulation and hopefully quick implementation of hybrid climate models.
Submitted 20 April, 2021; v1 submitted 24 October, 2020;
originally announced October 2020.
-
Generative Modeling for Atmospheric Convection
Authors:
Griffin Mooers,
Jens Tuyls,
Stephan Mandt,
Michael Pritchard,
Tom Beucler
Abstract:
While cloud-resolving models can explicitly simulate the details of small-scale storm formation and morphology, these details are often ignored by climate models for lack of computational resources. Here, we explore the potential of generative modeling to cheaply recreate small-scale storms by designing and implementing a Variational Autoencoder (VAE) that performs structural replication, dimensionality reduction, and clustering of high-resolution vertical velocity fields. Trained on ~6*10^6 samples spanning the globe, the VAE successfully reconstructs the spatial structure of convection, performs unsupervised clustering of convective organization regimes, and identifies anomalous storm activity, confirming the potential of generative modeling to power stochastic parameterizations of convection in climate models.
Submitted 24 October, 2020; v1 submitted 2 July, 2020;
originally announced July 2020.
-
Interpreting and Stabilizing Machine-learning Parametrizations of Convection
Authors:
Noah D. Brenowitz,
Tom Beucler,
Michael Pritchard,
Christopher S. Bretherton
Abstract:
Neural networks are a promising technique for parameterizing sub-grid-scale physics (e.g. moist atmospheric convection) in coarse-resolution climate models, but their lack of interpretability and reliability prevents widespread adoption. For instance, it is not fully understood why neural network parameterizations often cause dramatic instability when coupled to atmospheric fluid dynamics. This paper introduces tools for interpreting their behavior that are customized to the parameterization task. First, we assess the nonlinear sensitivity of a neural network to lower-tropospheric stability and the mid-tropospheric moisture, two widely-studied controls of moist convection. Second, we couple the linearized response functions of these neural networks to simplified gravity-wave dynamics, and analytically diagnose the corresponding phase speeds, growth rates, wavelengths, and spatial structures. To demonstrate their versatility, these techniques are tested on two sets of neural networks, one trained with a super-parametrized version of the Community Atmosphere Model (SPCAM) and the second with a near-global cloud-resolving model (GCRM). Even though the SPCAM simulation has a warmer climate than the cloud-resolving model, both neural networks predict stronger heating/drying in moist and unstable environments, which is consistent with observations. Moreover, the spectral analysis can predict that instability occurs when GCMs are coupled to networks that support gravity waves that are unstable and have phase speeds larger than 5 m/s. In contrast, standing unstable modes do not cause catastrophic instability. Using these tools, differences between the SPCAM- vs. GCRM- trained neural networks are analyzed, and strategies to incrementally improve both of their coupled online performance unveiled.
Submitted 14 March, 2020;
originally announced March 2020.
-
Quantifying Convective Aggregation using the Tropical Moist Margin's Length
Authors:
Tom Beucler,
David Leutwyler,
Julia Windmiller
Abstract:
On small scales, the tropical atmosphere tends to be either moist or very dry. This defines two states that, on large scales, are separated by a sharp margin, well-identified by the anti-mode of the bimodal tropical column water vapor distribution. Despite recent progress in understanding physical processes governing the spatio-temporal variability of tropical water vapor, the behavior of this margin remains elusive, and we lack a simple framework to understand the bimodality of tropical water vapor in observations. Motivated by the success of coarsening theory in explaining bimodal distributions, we leverage its methodology to relate the moisture field's spatial organization to its time-evolution. This results in a new diagnostic framework for the bimodality of tropical water vapor, from which we argue that the length of the margin separating moist from dry regions should evolve towards a minimum in equilibrium. As the spatial organization of moisture is closely related to the organization of tropical convection, we hereby introduce a new organization index (BLW) measuring the ratio of the margin's length to the circumference of a well-defined equilibrium shape. Using BLW, we assess the evolution of self-aggregation in idealized cloud-resolving simulations of radiative-convective equilibrium and contrast it to the time-evolution of the Atlantic Inter-Tropical Convergence Zone (ITCZ) in the ERA5 meteorological re-analysis product. We find that BLW successfully captures aspects of convective organization ignored by more traditional metrics, while offering a new perspective on the seasonal cycle of convective organization in the Atlantic ITCZ.
Submitted 5 August, 2020; v1 submitted 25 February, 2020;
originally announced February 2020.
-
Towards Physically-consistent, Data-driven Models of Convection
Authors:
Tom Beucler,
Michael Pritchard,
Pierre Gentine,
Stephan Rasp
Abstract:
Data-driven algorithms, in particular neural networks, can emulate the effect of sub-grid scale processes in coarse-resolution climate models if trained on high-resolution climate simulations. However, they may violate key physical constraints and lack the ability to generalize outside of their training set. Here, we show that physical constraints can be enforced in neural networks, either approximately by adapting the loss function or to within machine precision by adapting the architecture. As these physical constraints are insufficient to guarantee generalizability, we additionally propose to physically rescale the training and validation data to improve the ability of neural networks to generalize to unseen climates.
Submitted 17 April, 2020; v1 submitted 19 February, 2020;
originally announced February 2020.
-
Convective dynamics and the response of precipitation extremes to warming in radiative-convective equilibrium
Authors:
Tristan H. Abbott,
Timothy W. Cronin,
Tom Beucler
Abstract:
Tropical precipitation extremes are expected to strengthen with warming, but quantitative estimates remain uncertain because of a poor understanding of changes in convective dynamics. This uncertainty is addressed here by analyzing idealized convection-permitting simulations of radiative-convective equilibrium in long-channel geometry. Across a wide range of climates, the thermodynamic contribution to changes in instantaneous precipitation extremes follows near-surface moisture, and the dynamic contribution is positive and small, but sensitive to domain size. The shapes of mass flux profiles associated with precipitation extremes are determined by conditional sampling that favors strong vertical motion at levels where the vertical saturation specific humidity gradient is large, and mass flux profiles collapse to a common shape across climates when plotted in a moisture-based vertical coordinate. The collapse, robust to changes in microphysics and turbulence schemes, implies a thermodynamic contribution that scales with near-surface moisture despite substantial convergence aloft and allows the dynamic contribution to be defined by the pressure velocity at a single level. Linking the simplified dynamic mode to vertical velocities from entraining plume models reveals that the small dynamic mode in channel simulations (<~2 %/K) is caused by opposing height-dependences of vertical velocity and density, together with the buffering influence of cloud-base buoyancies that vary little with surface temperature. These results reinforce an emerging picture of the response of extreme tropical precipitation rates to warming: a thermodynamic mode of about 7 %/K dominates, with a minor contribution from changes in dynamics.
Submitted 21 April, 2020; v1 submitted 4 September, 2019;
originally announced September 2019.
-
Enforcing Analytic Constraints in Neural-Networks Emulating Physical Systems
Authors:
Tom Beucler,
Michael Pritchard,
Stephan Rasp,
Jordan Ott,
Pierre Baldi,
Pierre Gentine
Abstract:
Neural networks can emulate nonlinear physical systems with high accuracy, yet they may produce physically-inconsistent results when violating fundamental constraints. Here, we introduce a systematic way of enforcing nonlinear analytic constraints in neural networks via constraints in the architecture or the loss function. Applied to convective processes for climate modeling, architectural constraints enforce conservation laws to within machine precision without degrading performance. Enforcing constraints also reduces errors in the subsets of the outputs most impacted by the constraints.
Submitted 27 January, 2021; v1 submitted 2 September, 2019;
originally announced September 2019.
-
Comparing Convective Self-Aggregation in Idealized Models to Observed Moist Static Energy Variability near the Equator
Authors:
Tom Beucler,
Tristan Abbott,
Timothy Cronin,
Michael Pritchard
Abstract:
Idealized convection-permitting simulations of radiative-convective equilibrium (RCE) have become a popular tool for understanding the physical processes leading to horizontal variability of tropical water vapor and rainfall. However, the applicability of idealized simulations to nature is still unclear given that important processes are typically neglected, such as lateral vapor advection by extratropical intrusions, or interactive ocean coupling. Here, we exploit spectral analysis to compactly summarize the multi-scale processes supporting convective aggregation. By applying this framework to high-resolution reanalysis data and satellite observations in addition to idealized simulations, we compare convective-aggregation processes across horizontal scales and data sets. The results affirm the validity of the RCE simulations as an analogy to the real world. Column moist static energy tendencies share similar signs and scale-selectivity in convection-permitting models and observations: Radiation increases variance at wavelengths above 1,000km, while advection damps variance across wavelengths, and surface fluxes mostly reduce variance between 1,000km and 10,000km.
Submitted 10 August, 2019;
originally announced August 2019.
-
Achieving Conservation of Energy in Neural Network Emulators for Climate Modeling
Authors:
Tom Beucler,
Stephan Rasp,
Michael Pritchard,
Pierre Gentine
Abstract:
Artificial neural-networks have the potential to emulate cloud processes with higher accuracy than the semi-empirical emulators currently used in climate models. However, neural-network models do not intrinsically conserve energy and mass, which is an obstacle to using them for long-term climate predictions. Here, we propose two methods to enforce linear conservation laws in neural-network emulators of physical models: Constraining (1) the loss function or (2) the architecture of the network itself. Applied to the emulation of explicitly-resolved cloud processes in a prototype multi-scale climate model, we show that architecture constraints can enforce conservation laws to satisfactory numerical precision, while all constraints help the neural-network better generalize to conditions outside of its training set, such as global warming.
Submitted 15 June, 2019;
originally announced June 2019.