ACE2-SOM: Coupling an ML atmospheric emulator to a slab ocean and learning the sensitivity of climate to changed CO$_2$

Authors: Spencer K. Clark, Oliver Watt-Meyer, Anna Kwa, Jeremy McGibbon, Brian Henn, W. Andre Perkins, Elynn Wu, Lucas M. Harris, Christopher S. Bretherton

Abstract: While autoregressive machine-learning-based emulators have been trained to produce stable and accurate rollouts in the climate of the present-day and recent past, none so far have been trained to emulate the sensitivity of climate to substantial changes in CO$_2$ or other greenhouse gases. As an initial step we couple the Ai2 Climate Emulator version 2 to a slab ocean model (hereafter ACE2-SOM) and train it on output from a collection of equilibrium-climate physics-based reference simulations with varying levels of CO$_2$. We test it in equilibrium and non-equilibrium climate scenarios with CO$_2$ concentrations seen and unseen in training. ACE2-SOM performs well in equilibrium-climate inference with both in-sample and out-of-sample CO$_2$ concentrations, accurately reproducing the emergent time-mean spatial patterns of surface temperature and precipitation change with CO$_2$ doubling, tripling, or quadrupling. In addition, the vertical profile of atmospheric warming and change in extreme precipitation rates up to the 99.9999th percentile closely agree with the reference model. Non-equilibrium-climate inference is more challenging. With CO$_2$ increasing gradually at a rate of 2% year$^{-1}$, ACE2-SOM can accurately emulate the global annual mean trends of surface and lower-to-middle atmosphere fields but produces unphysical jumps in stratospheric fields. With an abrupt quadrupling of CO$_2$, ML-controlled fields transition unrealistically quickly to the 4xCO$_2$ regime. In doing so they violate global energy conservation and exhibit unphysical sensitivities of surface and top-of-atmosphere radiative fluxes to instantaneous changes in CO$_2$. Future emulator development needed to address these issues should improve its generalizability to diverse climate change scenarios.

Submitted 30 December, 2024; v1 submitted 5 December, 2024; originally announced December 2024.

Comments: 31 pages, 13 figures

arXiv:2411.11268 [pdf, other]

ACE2: Accurately learning subseasonal to decadal atmospheric variability and forced responses

Authors: Oliver Watt-Meyer, Brian Henn, Jeremy McGibbon, Spencer K. Clark, Anna Kwa, W. Andre Perkins, Elynn Wu, Lucas Harris, Christopher S. Bretherton

Abstract: Existing machine learning models of weather variability are not formulated to enable assessment of their response to varying external boundary conditions such as sea surface temperature and greenhouse gases. Here we present ACE2 (Ai2 Climate Emulator version 2) and its application to reproducing atmospheric variability over the past 80 years on timescales from days to decades. ACE2 is a 450M-parameter autoregressive machine learning emulator, operating with 6-hour temporal resolution, 1° horizontal resolution and eight vertical layers. It exactly conserves global dry air mass and moisture and can be stepped forward stably for arbitrarily many steps with a throughput of about 1500 simulated years per wall clock day. ACE2 generates emergent phenomena such as tropical cyclones, the Madden Julian Oscillation, and sudden stratospheric warmings. Furthermore, it accurately reproduces the atmospheric response to El Niño variability and global trends of temperature over the past 80 years. However, its sensitivities to separately changing sea surface temperature and carbon dioxide are not entirely realistic.

Submitted 17 November, 2024; originally announced November 2024.

Comments: 31 pages, 23 figures

arXiv:2211.13354 [pdf, other]

Improving the predictions of ML-corrected climate models with novelty detection

Authors: Clayton Sanford, Anna Kwa, Oliver Watt-Meyer, Spencer Clark, Noah Brenowitz, Jeremy McGibbon, Christopher Bretherton

Abstract: While previous works have shown that machine learning (ML) can improve the prediction accuracy of coarse-grid climate models, these ML-augmented methods are more vulnerable to irregular inputs than the traditional physics-based models they rely on. Because ML-predicted corrections feed back into the climate model's base physics, the ML-corrected model regularly produces out-of-sample data, which can cause model instability and frequent crashes. This work shows that adding semi-supervised novelty detection to identify out-of-sample data and disable the ML-correction accordingly stabilizes simulations and sharply improves the quality of predictions. We design an augmented climate model with a one-class support vector machine (OCSVM) novelty detector that provides better temperature and precipitation forecasts in a year-long simulation than either a baseline (no-ML) or a standard ML-corrected run. By improving the accuracy of coarse-grid climate models, this work helps make accurate climate models accessible to researchers without massive computational resources.

Submitted 23 November, 2022; originally announced November 2022.

Comments: Appearing at Tackling Climate Change with Machine Learning Workshop at NeurIPS 2022

arXiv:2211.11820 [pdf, other]

Machine-learned climate model corrections from a global storm-resolving model

Authors: Anna Kwa, Spencer K. Clark, Brian Henn, Noah D. Brenowitz, Jeremy McGibbon, W. Andre Perkins, Oliver Watt-Meyer, Lucas Harris, Christopher S. Bretherton

Abstract: Due to computational constraints, running global climate models (GCMs) for many years requires a lower spatial grid resolution (${\gtrsim}50$ km) than is optimal for accurately resolving important physical processes. Such processes are approximated in GCMs via subgrid parameterizations, which contribute significantly to the uncertainty in GCM predictions. One approach to improving the accuracy of a coarse-grid global climate model is to add machine-learned state-dependent corrections at each simulation timestep, such that the climate model evolves more like a high-resolution global storm-resolving model (GSRM). We train neural networks to learn the state-dependent temperature, humidity, and radiative flux corrections needed to nudge a 200 km coarse-grid climate model to the evolution of a 3 km fine-grid GSRM. When these corrective ML models are coupled to a year-long coarse-grid climate simulation, the time-mean spatial pattern errors are reduced by 6-25% for land surface temperature and 9-25% for land surface precipitation with respect to a no-ML baseline simulation. The ML-corrected simulations develop other biases in climate and circulation that differ from, but have comparable amplitude to, the baseline simulation.

Submitted 21 November, 2022; originally announced November 2022.

arXiv:2211.10774 [pdf, other]

Emulating Fast Processes in Climate Models

Authors: Noah D. Brenowitz, W. Andre Perkins, Jacqueline M. Nugent, Oliver Watt-Meyer, Spencer K. Clark, Anna Kwa, Brian Henn, Jeremy McGibbon, Christopher S. Bretherton

Abstract: Cloud microphysical parameterizations in atmospheric models describe the formation and evolution of clouds and precipitation, a central weather and climate process. Cloud-associated latent heating is a primary driver of large and small-scale circulations throughout the global atmosphere, and clouds have important interactions with atmospheric radiation. Clouds are ubiquitous, diverse, and can change rapidly. In this work, we build the first emulator of an entire cloud microphysical parameterization, including fast phase changes. The emulator performs well in offline and online (i.e. when coupled to the rest of the atmospheric model) tests, but shows some developing biases in Antarctica. Sensitivity tests demonstrate that these successes require careful modeling of the mixed discrete-continuous output as well as the input-output structure of the underlying code and physical process.

Submitted 19 November, 2022; originally announced November 2022.

Comments: Accepted at the Machine Learning and the Physical Sciences Workshop at the 36th conference on Neural Information Processing Systems (NeurIPS) December 3, 2022

arXiv:2011.03081 [pdf, other]

Machine Learning Climate Model Dynamics: Offline versus Online Performance

Authors: Noah D. Brenowitz, Brian Henn, Jeremy McGibbon, Spencer K. Clark, Anna Kwa, W. Andre Perkins, Oliver Watt-Meyer, Christopher S. Bretherton

Abstract: Climate models are complicated software systems that approximate atmospheric and oceanic fluid mechanics at a coarse spatial resolution. Typical climate forecasts only explicitly resolve processes larger than 100 km and approximate any process occurring below this scale (e.g. thunderstorms) using so-called parametrizations. Machine learning could improve upon the accuracy of some traditional physical parametrizations by learning from so-called global cloud-resolving models. We compare the performance of two machine learning models, random forests (RF) and neural networks (NNs), at parametrizing the aggregate effect of moist physics in a 3 km resolution global simulation with an atmospheric model. The NN outperforms the RF when evaluated offline on a testing dataset. However, when the ML models are coupled to an atmospheric model run at 200 km resolution, the NN-assisted simulation crashes within 7 days, while the RF-assisted simulations remain stable. Both runs produce more accurate weather forecasts than a baseline configuration, but globally averaged climate variables drift over longer timescales.

Submitted 5 November, 2020; originally announced November 2020.

arXiv:1808.05695 [pdf, other]

doi 10.1103/PhysRevX.9.031020

Reconciling the Diversity and Uniformity of Galactic Rotation Curves with Self-Interacting Dark Matter

Authors: Tao Ren, Anna Kwa, Manoj Kaplinghat, Hai-Bo Yu

Abstract: Galactic rotation curves exhibit diverse behavior in the inner regions, while obeying an organizing principle, i.e., they can be approximately described by a radial acceleration relation or the Modified Newtonian Dynamics phenomenology. We analyze the rotation curve data from the SPARC sample, and explicitly demonstrate that both the diversity and uniformity are naturally reproduced in a hierarchical structure formation model with the addition of dark matter self-interactions. The required concentrations of the dark matter halos are fully consistent with the concentration-mass relation predicted by the Planck cosmological model. The inferred stellar mass-to-light (3.6 $\mu$m) ratios scatter around $0.5 M_\odot/L_\odot$, as expected from population synthesis models, leading to a tight radial acceleration relation and baryonic Tully-Fisher relation. The inferred stellar-halo mass relation is consistent with the expectations from abundance matching. These results indicate that the inner dark matter halos of galaxies are thermalized due to the self-interactions of dark matter particles.

Submitted 16 August, 2018; originally announced August 2018.

Comments: Main text: 10 pages and 3 figures. Supplementary Materials: 47 pages, 2 tables and 5 figures including detailed fits to 135 galaxies

Journal ref: Phys. Rev. X 9, 031020 (2019)

arXiv:1710.03215 [pdf, other]

doi 10.1103/PhysRevD.97.103007

What the Milky Way's Dwarfs tell us about the Galactic Center extended excess

Authors: Ryan E. Keeley, Kevork N. Abazajian, Anna Kwa, Nicholas L. Rodd, Benjamin R. Safdi

Abstract: The Milky Way's Galactic Center harbors a gamma-ray excess that is a candidate signal of annihilating dark matter. Dwarf galaxies remain predominantly dark in their expected commensurate emission. In this work we quantify the degree of consistency between these two observations through a joint likelihood analysis. In doing so we incorporate Milky Way dark matter halo profile uncertainties, as well as an accounting of diffuse gamma-ray emission uncertainties in dark matter annihilation models for the Galactic Center Extended gamma-ray excess (GCE) detected by the Fermi Gamma-Ray Space Telescope. The preferred range of annihilation rates and masses expands when including these unknowns. Even so, using two recent determinations of the Milky Way halo's local density leaves the GCE preferred region of single-channel dark matter annihilation models in strong tension with annihilation searches in combined dwarf galaxy analyses. A third, higher Milky Way density determination alleviates this tension. Our joint likelihood analysis allows us to quantify this inconsistency. We provide a set of tools for testing dark matter annihilation models' consistency within this combined dataset. As an example, we test a representative inverse Compton sourced self-interacting dark matter model, which is consistent with both the GCE and dwarfs.

Submitted 13 October, 2017; v1 submitted 9 October, 2017; originally announced October 2017.

Comments: v2, 12 pages, 4 figures, tools online at: https://github.com/rekeeley/GCE_errors

Journal ref: Phys. Rev. D 97, 103007 (2018)

arXiv:1709.04014 [pdf, other]

doi 10.1093/mnras/sty1483

The Galactic Isotropic $γ$-ray Background and Implications for Dark Matter

Authors: Sheldon S. Campbell, Anna Kwa, Manoj Kaplinghat

Abstract: We present an analysis of the radial angular profile of the galacto-isotropic (GI) $γ$-ray flux--the statistically uniform flux in circular annuli about the Galactic center. Two different approaches are used to measure the GI flux profile in 85 months of Fermi-LAT data: the BDS statistic method which identifies spatial correlations, and a new Poisson ordered-pixel method which identifies non-Poisson contributions. Both methods produce similar GI flux profiles. The GI flux profile is well-described by an existing model of bremsstrahlung, $π^0$ production, inverse Compton scattering, and the isotropic background. Discrepancies with data in our full-sky model are not present in the GI component, and are therefore due to mis-modeling of the non-GI emission. Dark matter annihilation constraints based solely on the observed GI profile are close to the thermal WIMP cross section below 100 GeV, for fixed models of the dark matter density profile and astrophysical $γ$-ray foregrounds. Refined measurements of the GI profile are expected to improve these constraints by a factor of a few.

Submitted 25 September, 2017; v1 submitted 12 September, 2017; originally announced September 2017.

Comments: 20 pages, 15 figures, references added

arXiv:1610.08060 [pdf, other]

doi 10.1007/JHEP03(2017)064

Lepton-Flavor Violating Mediators

Authors: Iftah Galon, Anna Kwa, Philip Tanedo

Abstract: We present a framework where dark matter interacts with the Standard Model through a light, spin-0 mediator that couples chirally to pairs of different-flavor leptons. This flavor violating final state weakens bounds on new physics coupled to leptons from terrestrial experiments and cosmic-ray measurements. As an example, we apply this framework to construct a model for the Fermi-LAT excess of GeV $γ$-rays from the galactic center. We comment on the viability of this portal for self-interacting dark matter explanations of small scale structure anomalies and embeddings in flavor models. Models of this type are shown to be compatible with the muon anomalous magnetic moment anomaly. We review current experimental constraints and identify possible future theoretical and experimental directions.

Submitted 25 October, 2016; originally announced October 2016.

Comments: 27 pages, 7 figures, 1 table

Report number: UCI-TR-2016-21

arXiv:1609.03592 [pdf, other]

doi 10.1103/PhysRevD.94.123017

Hidden Sector Hydrogen as Dark Matter: Small-scale Structure Formation Predictions and the Importance of Hyperfine Interactions

Authors: Kimberly K. Boddy, Manoj Kaplinghat, Anna Kwa, Annika H. G. Peter

Abstract: We study the atomic physics and the astrophysical implications of a model in which the dark matter is the analog of hydrogen in a secluded sector. The self interactions between dark matter particles include both elastic scatterings as well as inelastic processes due to a hyperfine transition. The self-interaction cross sections are computed by numerically solving the coupled Schrödinger equations for this system. We show that these self interactions exhibit the right velocity dependence to explain the low dark matter density cores seen in small galaxies while being consistent with all constraints from observations of clusters of galaxies. For a viable solution, the dark hydrogen mass has to be in the 10--100 GeV range and the dark fine-structure constant has to be larger than 0.02. Precisely for this range of parameters, we show that significant cooling losses may occur due to inelastic excitations to the hyperfine state and subsequent decays, with implications for the evolution of low-mass halos and the early growth of supermassive black holes. Cooling from excitations to higher $n$ levels of dark hydrogen and subsequent decays is possible at the cluster scale, with a strong dependence on halo mass. Finally, we show that the minimum halo mass is in the range of $10^{3.5}$ to $10^7 M_\odot$ for the viable regions of parameter space, significantly larger than the typical predictions for weakly-interacting dark matter models. This pattern of observables in cosmological structure formation is unique to this model, making it possible to rule in or rule out hidden sector hydrogen as a viable dark matter model.

Submitted 3 January, 2017; v1 submitted 12 September, 2016; originally announced September 2016.

Comments: 22 pages, 6 figures, 2 tables; v2: published version

Journal ref: Phys. Rev. D 94, 123017 (2016)

arXiv:1604.01402 [pdf, other]

doi 10.1088/1475-7516/2016/11/053

Investigating the Uniformity of the Excess Gamma rays towards the Galactic Center Region

Authors: Shunsaku Horiuchi, Manoj Kaplinghat, Anna Kwa

Abstract: We perform a composite likelihood analysis of subdivided regions within the central $26^\circ\times20^\circ$ of the Milky Way, with the aim of characterizing the spectrum of the gamma-ray galactic center excess in regions of varying galactocentric distance. Outside of the innermost few degrees, we find that the radial profile of the excess is background-model dependent and poorly constrained. The spectrum of the excess emission is observed to extend upwards of 10 GeV outside $\sim5^\circ$ in radius, but cuts off steeply between 10--20 GeV only in the innermost few degrees. If interpreted as a real feature of the excess, this radial variation in the spectrum has important implications for both astrophysical and dark matter interpretations of the galactic center excess. Single-component dark matter annihilation models face challenges in reproducing this variation; on the other hand, a population of unresolved millisecond pulsars contributing both prompt and secondary inverse Compton emission may be able to explain the spectrum as well as its spatial dependency. We show that the expected differences in the photon-count distributions of a smooth dark matter annihilation signal and an unresolved point source population are an order of magnitude smaller than the fluctuations in residuals after fitting the data, which implies that mismodeling is an important systematic effect in point source analyses aimed at resolving the gamma-ray excess.

Submitted 16 November, 2016; v1 submitted 5 April, 2016; originally announced April 2016.

Comments: 27 pages, 9 figures. Matches accepted version: references added, typo corrected in Sec. 4.2, some additional discussion added (results unchanged)

Journal ref: JCAP 1611 (2016) no.11, 053

arXiv:1410.6168 [pdf, other]

doi 10.1088/1475-7516/2015/07/013

Discovery of a New Galactic Center Excess Consistent with Upscattered Starlight

Authors: Kevork N. Abazajian, Nicolas Canac, Shunsaku Horiuchi, Manoj Kaplinghat, Anna Kwa

Abstract: We present a new extended gamma ray excess detected with the Fermi Satellite Large Area Telescope toward the Galactic Center that traces the morphology of infrared starlight emission. Combined with its measured spectrum, this new extended source is approximately consistent with inverse Compton emission from a high-energy electron-positron population with energies up to about 10 GeV. Previously detected emissions tracing the 20 cm radio, interpreted as bremsstrahlung radiation, and the Galactic Center Extended emission tracing a spherical distribution and peaking at 2 GeV, are also detected. We show that the inverse Compton and bremsstrahlung emissions are likely due to the same source of electrons and positrons. All three extended emissions may be explained within the framework of a model where the dark matter annihilates to leptons or a model with unresolved millisecond pulsars in the Galactic Center.

Submitted 10 July, 2015; v1 submitted 22 October, 2014; originally announced October 2014.

Comments: 13 pages and 5 figures. Version 2 was expanded to include tests to demonstrate the robustness of results against background systematics. Conclusions unchanged. Version 3 includes more checks and matches the published version

Journal ref: JCAP 07 (2015) 013

Showing 1–13 of 13 results for author: Kwa, A