-
Online stochastic generators using Slepian bases for regional bivariate wind speed ensembles from ERA5
Authors:
Yan Song,
Zubair Khalid,
Marc G. Genton
Abstract:
Reanalysis data, such as ERA5, provide a comprehensive and detailed representation of the Earth's system by assimilating observations into climate models. While crucial for climate research, they pose significant challenges in terms of generation, storage, and management. For 3-hourly bivariate wind speed ensembles from ERA5, which face these challenges, this paper proposes an online stochastic generator (OSG) applicable to any region of the globe, offering fast stochastic approximations while storing only model parameters. A key innovation is the incorporation of online updating, which allows data to enter the model sequentially in blocks of time and contribute to parameter updates. This approach reduces storage demands during modeling by eliminating the need to store and analyze the entire dataset, and enables near real-time emulations that complement the generation of reanalysis data. The Slepian concentration technique supports the efficiency of the proposed OSG by representing the data in a lower-dimensional space spanned by data-independent Slepian bases optimally concentrated within the specified region. We demonstrate the flexibility and efficiency of the OSG through two case studies, one requiring long blocks and one requiring short blocks, both specified for the Arabian Peninsula region (ARP). In both cases, the OSG performs well across several statistical metrics and is comparable to a stochastic generator (SG) trained on the full dataset.
Submitted 11 October, 2024;
originally announced October 2024.
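The block-wise updating idea in the abstract above can be illustrated with a much simpler stand-in: keeping a running mean and variance that are refreshed as each block of time arrives, so that only the summary statistics are stored and never the full record. This is a minimal sketch, not the authors' OSG; the class name `RunningMoments` and its fields are hypothetical, and the merge step is the standard pairwise formula for combining moments of two samples.

```python
from dataclasses import dataclass


@dataclass
class RunningMoments:
    """Statistics kept between blocks (hypothetical stand-in for model parameters)."""
    n: int = 0         # number of values seen so far
    mean: float = 0.0  # running mean
    m2: float = 0.0    # running sum of squared deviations from the mean

    def update(self, block):
        """Merge one block of values; the block can be discarded afterwards."""
        k = len(block)
        if k == 0:
            return
        block_mean = sum(block) / k
        block_m2 = sum((x - block_mean) ** 2 for x in block)
        total = self.n + k
        delta = block_mean - self.mean
        # Pairwise merge of (n, mean, m2) with the block's (k, block_mean, block_m2).
        self.mean += delta * k / total
        self.m2 += block_m2 + delta * delta * self.n * k / total
        self.n = total

    @property
    def variance(self):
        """Unbiased sample variance of everything seen so far."""
        return self.m2 / (self.n - 1) if self.n > 1 else 0.0


# Usage: feed the data in blocks of different lengths.
moments = RunningMoments()
for block in ([1.0, 2.0, 3.0], [4.0, 5.0], [6.0, 7.0, 8.0]):
    moments.update(block)
```

Block length is free to vary between updates, which loosely mirrors the long-block and short-block case studies mentioned in the abstract; the statistics after the last block match those computed from the full series at once.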
-
Boosting Earth System Model Outputs And Saving PetaBytes in their Storage Using Exascale Climate Emulators
Authors:
Sameh Abdulah,
Allison H. Baker,
George Bosilca,
Qinglei Cao,
Stefano Castruccio,
Marc G. Genton,
David E. Keyes,
Zubair Khalid,
Hatem Ltaief,
Yan Song,
Georgiy L. Stenchikov,
Ying Sun
Abstract:
We present the design and scalable implementation of an exascale climate emulator for addressing the escalating computational and storage requirements of high-resolution Earth System Model simulations. We utilize the spherical harmonic transform to stochastically model spatio-temporal variations in climate data. This provides tunable spatio-temporal resolution and significantly improves the fidelity and granularity of climate emulation, achieving an ultra-high spatial resolution of 0.034° (approximately 3.5 km). Our emulator, trained on 318 billion hourly temperature data points from a 35-year global simulation ensemble and 31 billion daily data points from an 83-year one, generates statistically consistent climate emulations. We extend linear solver software to mixed-precision arithmetic on GPUs, applying different precisions within a single solver to adapt to different correlation strengths. The PaRSEC runtime system supports efficient parallel matrix operations by optimizing the dynamic balance between computation, communication, and memory requirements. Our BLAS3-rich code is optimized for systems equipped with four different families and generations of GPUs, scaling well to achieve 0.976 EFlop/s on 9,025 nodes (36,100 AMD MI250X multichip module (MCM) GPUs) of Frontier (nearly full system), 0.739 EFlop/s on 1,936 nodes (7,744 Grace-Hopper Superchips (GH200)) of Alps, 0.243 EFlop/s on 1,024 nodes (4,096 A100 GPUs) of Leonardo, and 0.375 EFlop/s on 3,072 nodes (18,432 V100 GPUs) of Summit.
Submitted 11 August, 2024; v1 submitted 8 August, 2024;
originally announced August 2024.
-
Efficient stochastic generators with spherical harmonic transformation for high-resolution global climate simulations from CESM2-LENS2
Authors:
Yan Song,
Zubair Khalid,
Marc G. Genton
Abstract:
Earth system models (ESMs) are fundamental for understanding Earth's complex climate system. However, the computational demands and storage requirements of ESM simulations limit their utility. For the newly published CESM2-LENS2 data, which suffer from this issue, we propose a novel stochastic generator (SG) as a practical complement to CESM2, capable of rapidly producing emulations that closely mirror the training simulations. Our SG leverages the spherical harmonic transformation (SHT) to shift from the spatial to the spectral domain, enabling efficient low-rank approximations that significantly reduce computational and storage costs. By accounting for axial symmetry and retaining distinct ranks for land and ocean regions, our SG captures intricate non-stationary spatial dependencies. Additionally, a modified Tukey g-and-h (TGH) transformation accommodates non-Gaussianity in high-temporal-resolution data. We apply the proposed SG to generate emulations for surface temperature simulations from the CESM2-LENS2 data across various scales, marking the first attempt at reproducing daily data. These emulations are then meticulously validated against the training simulations. This work offers a promising complementary pathway for efficient climate modeling and analysis while overcoming computational and storage limitations.
Submitted 24 May, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
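For readers unfamiliar with the Tukey g-and-h transformation named in the abstract above, the sketch below implements its standard textbook form, which maps a standard Gaussian value z to (exp(g z) - 1) / g * exp(h z^2 / 2), with g controlling skewness and h >= 0 controlling tail heaviness. The abstract refers to a modified version whose details are not given here, so this is only the unmodified form; the function name `tukey_gh` is an assumption made for illustration.

```python
import math


def tukey_gh(z, g=0.0, h=0.0):
    """Standard Tukey g-and-h transform of a Gaussian value z (not the paper's modified form).

    g sets the skewness, h >= 0 sets the tail heaviness; g = h = 0 returns z unchanged.
    """
    if h < 0:
        raise ValueError("h must be non-negative for a monotone transform")
    tail = math.exp(h * z * z / 2.0)  # heavy-tail factor
    if g == 0.0:
        return z * tail               # limit of the skewing term as g -> 0
    return math.expm1(g * z) / g * tail


# Usage: push a few Gaussian quantiles through a skewed, heavy-tailed transform.
transformed = [tukey_gh(z, g=0.5, h=0.1) for z in (-2.0, -1.0, 0.0, 1.0, 2.0)]
```

Because the transform is monotone in z for h >= 0, it can be inverted numerically to map non-Gaussian data back toward Gaussianity before fitting a Gaussian model.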