-
Fourier analysis of the physics of transfer learning for data-driven subgrid-scale models of ocean turbulence
Authors:
Moein Darman,
Pedram Hassanzadeh,
Laure Zanna,
Ashesh Chattopadhyay
Abstract:
Transfer learning (TL) is a powerful tool for enhancing the performance of neural networks (NNs) in applications such as weather and climate prediction and turbulence modeling. TL enables models to generalize to out-of-distribution data with minimal training data from the new system. In this study, we employ a 9-layer convolutional NN to predict the subgrid forcing in a two-layer ocean quasi-geost…
▽ More
Transfer learning (TL) is a powerful tool for enhancing the performance of neural networks (NNs) in applications such as weather and climate prediction and turbulence modeling. TL enables models to generalize to out-of-distribution data with minimal training data from the new system. In this study, we employ a 9-layer convolutional NN to predict the subgrid forcing in a two-layer ocean quasi-geostrophic system and examine which metrics best describe its performance and generalizability to unseen dynamical regimes. Fourier analysis of the NN kernels reveals that they learn low-pass, Gabor, and high-pass filters, regardless of whether the training data are isotropic or anisotropic. By analyzing the activation spectra, we identify why NNs fail to generalize without TL and how TL can overcome these limitations: the learned weights and biases from one dataset underestimate the out-of-distribution sample spectra as they pass through the network, leading to an underestimation of output spectra. By re-training only one layer with data from the target system, this underestimation is corrected, enabling the NN to produce predictions that match the target spectra. These findings are broadly applicable to data-driven parameterization of dynamical systems.
△ Less
Submitted 21 April, 2025;
originally announced April 2025.
-
A fluctuation-dissipation theorem perspective on radiative responses to temperature perturbations
Authors:
Fabrizio Falasca,
Aurora Basinski-Ferris,
Laure Zanna,
Ming Zhao
Abstract:
Radiative forcing drives warming in the Earth system, leading to changes in sea surface temperatures (SSTs) and associated radiative feedbacks. The link between changes in the top-of-the-atmosphere (TOA) net radiative flux and SST patterns, known as the "pattern effect", is typically diagnosed by studying the response of atmosphere-only models to SST perturbations. In this work, we diagnose the pa…
▽ More
Radiative forcing drives warming in the Earth system, leading to changes in sea surface temperatures (SSTs) and associated radiative feedbacks. The link between changes in the top-of-the-atmosphere (TOA) net radiative flux and SST patterns, known as the "pattern effect", is typically diagnosed by studying the response of atmosphere-only models to SST perturbations. In this work, we diagnose the pattern effect through response theory, by performing idealized warming perturbation experiments from unperturbed data alone. First, by studying the response at short time scales, where the response is dominated by atmospheric variability, we recover results that agree with the literature. Second, by extending the framework to longer time scales, we capture coupled interactions between the slow ocean component and the atmosphere, yielding a novel "sensitivity map" quantifying the response of the net radiative flux to SST perturbations in the coupled system. Here, feedbacks are captured by a spatiotemporal response operator, rather than time-independent maps as in traditional studies. Both formulations skillfully reconstruct changes in externally forced simulations and provide practical strategies for climate studies. The key distinction lies in their perspectives on climate feedbacks. The first formulation, closely aligned with prediction tasks, follows the traditional view in which slow variables, such as SSTs, exert a one-way influence on fast variables. The second formulation broadens this perspective by incorporating spatiotemporal interactions across state variables. This alternative approach explores how localized SST perturbations can alter the coupled dynamics, leading to temperature changes in remote areas and further impacting the radiative fluxes at later times.
△ Less
Submitted 13 February, 2025; v1 submitted 22 August, 2024;
originally announced August 2024.
-
A data-driven framework for dimensionality reduction and causal inference in climate fields
Authors:
Fabrizio Falasca,
Pavel Perezhogin,
Laure Zanna
Abstract:
We propose a data-driven framework to simplify the description of spatiotemporal climate variability into few entities and their causal linkages. Given a high-dimensional climate field, the methodology first reduces its dimensionality into a set of regionally constrained patterns. Time-dependent causal links are then inferred in the interventional sense through the fluctuation-response formalism,…
▽ More
We propose a data-driven framework to simplify the description of spatiotemporal climate variability into few entities and their causal linkages. Given a high-dimensional climate field, the methodology first reduces its dimensionality into a set of regionally constrained patterns. Time-dependent causal links are then inferred in the interventional sense through the fluctuation-response formalism, as shown in Baldovin et al. (2020). These two steps allow to explore how regional climate variability can influence remote locations. To distinguish between true and spurious responses, we propose a novel analytical null model for the fluctuation-dissipation relation, therefore allowing for uncertainty estimation at a given confidence level. Finally, we select a set of metrics to summarize the results, offering a useful and simplified approach to explore climate dynamics. We showcase the methodology on the monthly sea surface temperature field at global scale. We demonstrate the usefulness of the proposed framework by studying few individual links as well as "link maps", visualizing the cumulative degree of causation between a given region and the whole system. Finally, each pattern is ranked in terms of its "causal strength", quantifying its relative ability to influence the system's dynamics. We argue that the methodology allows to explore and characterize causal relationships in high-dimensional spatiotemporal fields in a rigorous and interpretable way.
△ Less
Submitted 5 April, 2024; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Optimisation of an idealised ocean model, stochastic parameterisation of sub-grid eddies
Authors:
Fenwick C. Cooper,
Laure Zanna
Abstract:
An optimisation scheme is developed to accurately represent the sub-grid scale forcing of a high dimensional chaotic ocean system. Using a simple parameterisation scheme, the velocity components of a 30km resolution shallow water ocean model are optimised to have the same climatological mean and variance as that of a less viscous 7.5km resolution model. The 5 day lag-covariance is also optimised,…
▽ More
An optimisation scheme is developed to accurately represent the sub-grid scale forcing of a high dimensional chaotic ocean system. Using a simple parameterisation scheme, the velocity components of a 30km resolution shallow water ocean model are optimised to have the same climatological mean and variance as that of a less viscous 7.5km resolution model. The 5 day lag-covariance is also optimised, leading to a more accurate estimate of the high resolution response to forcing using the low resolution model.
The system considered is an idealised barotropic double gyre that is chaotic at both resolutions. Using the optimisation scheme, we find and apply the constant in time, but spatially varying, forcing term that is equal to the time integrated forcing of the sub-mesoscale eddies. A linear stochastic term, independent of the large-scale flow, with no spatial correlation but a spatially varying amplitude and time scale is used to represent the transient eddies. The climatological mean, variance and 5 day lag-covariance of the velocity from a single high resolution integration is used to provide an optimisation target. No other high resolution statistics are required. Additional programming effort, for example to build a tangent linear or adjoint model, is not required either.
The focus of this paper is on the optimisation scheme and the accuracy of the optimised flow. The method can be applied in future investigations into the physical processes that govern barotropic turbulence and it can perhaps be applied to help understand and correct biases in the mean and variance of a more realistic coarse or eddy-permitting ocean model. The method is complementary to current parameterisations and can be applied at the same time without modification.
△ Less
Submitted 21 October, 2014;
originally announced October 2014.