Showing 1–17 of 17 results for author: Hoyer, S

Searching in archive cs.
  1. arXiv:2412.11973  [pdf, other]

    physics.ao-ph cs.LG

    Neural general circulation models optimized to predict satellite-based precipitation observations

    Authors: Janni Yuval, Ian Langmore, Dmitrii Kochkov, Stephan Hoyer

    Abstract: Climate models struggle to accurately simulate precipitation, particularly extremes and the diurnal cycle. Here, we present a hybrid model that is trained directly on satellite-based precipitation observations. Our model runs at 2.8$^\circ$ resolution and is built on the differentiable NeuralGCM framework. The model demonstrates significant improvements over existing general circulation models, th…

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: 20 pages, 6 figures in Main. 29 pages, 30 figures in SI

  2. arXiv:2408.02767  [pdf, other]

    cs.LG math.DS physics.geo-ph

    4D-Var using Hessian approximation and backpropagation applied to automatically-differentiable numerical and machine learning models

    Authors: Kylen Solvik, Stephen G. Penny, Stephan Hoyer

    Abstract: Constraining a numerical weather prediction (NWP) model with observations via 4D variational (4D-Var) data assimilation is often difficult to implement in practice due to the need to develop and maintain a software-based tangent linear model and adjoint model. One of the most common 4D-Var algorithms uses an incremental update procedure, which has been shown to be an approximation of the Gauss-New…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 24 pages, 7 figures

    ACM Class: J.2; I.6.5; G.1.6

  3. arXiv:2402.04467  [pdf, other]

    cs.LG math.DS

    DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems

    Authors: Yair Schiff, Zhong Yi Wan, Jeffrey B. Parker, Stephan Hoyer, Volodymyr Kuleshov, Fei Sha, Leonardo Zepeda-Núñez

    Abstract: Learning dynamics from dissipative chaotic systems is notoriously difficult due to their inherent instability, as formalized by their positive Lyapunov exponents, which exponentially amplify errors in the learned dynamics. However, many of these systems exhibit ergodicity and an attractor: a compact and highly complex manifold, to which trajectories converge in finite-time, that supports an invari…

    Submitted 5 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML 2024; Code to reproduce our experiments is available at https://github.com/google-research/swirl-dynamics/tree/main/swirl_dynamics/projects/ergodic

  4. arXiv:2311.07222  [pdf, other]

    physics.ao-ph cs.LG physics.comp-ph

    Neural General Circulation Models for Weather and Climate

    Authors: Dmitrii Kochkov, Janni Yuval, Ian Langmore, Peter Norgaard, Jamie Smith, Griffin Mooers, Milan Klöwer, James Lottes, Stephan Rasp, Peter Düben, Sam Hatfield, Peter Battaglia, Alvaro Sanchez-Gonzalez, Matthew Willson, Michael P. Brenner, Stephan Hoyer

    Abstract: General circulation models (GCMs) are the foundation of weather and climate prediction. GCMs are physics-based simulators which combine a numerical solver for large-scale dynamics with tuned representations for small-scale processes such as cloud formation. Recently, machine learning (ML) models trained on reanalysis data achieved comparable or better skill than GCMs for deterministic weather fore…

    Submitted 7 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 92 pages, 54 figures. Nature (2024)

  5. arXiv:2308.15560  [pdf, other]

    physics.ao-ph cs.AI

    WeatherBench 2: A benchmark for the next generation of data-driven global weather models

    Authors: Stephan Rasp, Stephan Hoyer, Alexander Merose, Ian Langmore, Peter Battaglia, Tyler Russel, Alvaro Sanchez-Gonzalez, Vivian Yang, Rob Carver, Shreya Agrawal, Matthew Chantry, Zied Ben Bouallegue, Peter Dueben, Carla Bromberg, Jared Sisk, Luke Barrington, Aaron Bell, Fei Sha

    Abstract: WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and…

    Submitted 26 January, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

  6. arXiv:2212.12794  [pdf, other]

    cs.LG physics.ao-ph

    GraphCast: Learning skillful medium-range global weather forecasting

    Authors: Remi Lam, Alvaro Sanchez-Gonzalez, Matthew Willson, Peter Wirnsberger, Meire Fortunato, Ferran Alet, Suman Ravuri, Timo Ewalds, Zach Eaton-Rosen, Weihua Hu, Alexander Merose, Stephan Hoyer, George Holland, Oriol Vinyals, Jacklynn Stott, Alexander Pritzel, Shakir Mohamed, Peter Battaglia

    Abstract: Global medium-range weather forecasting is critical to decision-making across many social and economic domains. Traditional numerical weather prediction uses increased compute resources to improve forecast accuracy, but cannot directly use historical weather data to improve the underlying model. We introduce a machine learning-based method called "GraphCast", which can be trained directly from rea…

    Submitted 4 August, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

    Comments: GraphCast code and trained weights are available at: https://github.com/deepmind/graphcast

  7. arXiv:2207.00556  [pdf, other]

    cs.LG physics.flu-dyn

    Learning to correct spectral methods for simulating turbulent flows

    Authors: Gideon Dresdner, Dmitrii Kochkov, Peter Norgaard, Leonardo Zepeda-Núñez, Jamie A. Smith, Michael P. Brenner, Stephan Hoyer

    Abstract: Despite their ubiquity throughout science and engineering, only a handful of partial differential equations (PDEs) have analytical, or closed-form solutions. This motivates a vast amount of classical work on numerical simulation of PDEs and more recently, a whirlwind of research into data-driven techniques leveraging machine learning (ML). A recent line of work indicates that a hybrid of classical…

    Submitted 25 June, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

  8. arXiv:2105.15183  [pdf, other]

    cs.LG math.NA stat.ML

    Efficient and Modular Implicit Differentiation

    Authors: Mathieu Blondel, Quentin Berthet, Marco Cuturi, Roy Frostig, Stephan Hoyer, Felipe Llinares-López, Fabian Pedregosa, Jean-Philippe Vert

    Abstract: Automatic differentiation (autodiff) has revolutionized machine learning. It allows to express complex computations by composing elementary ones in creative ways and removes the burden of computing their derivatives by hand. More recently, differentiation of optimization problem solutions has attracted widespread attention with applications such as optimization layers, and in bi-level problems suc…

    Submitted 12 October, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: V3: added more related work and Jacobian precision figure

  9. arXiv:2102.11192  [pdf, other]

    cs.LG physics.ao-ph

    Variational Data Assimilation with a Learned Inverse Observation Operator

    Authors: Thomas Frerix, Dmitrii Kochkov, Jamie A. Smith, Daniel Cremers, Michael P. Brenner, Stephan Hoyer

    Abstract: Variational data assimilation optimizes for an initial state of a dynamical system such that its evolution fits observational data. The physical model can subsequently be evolved into the future to make predictions. This principle is a cornerstone of large scale forecasting applications such as numerical weather prediction. As such, it is implemented in current operational systems of weather forec…

    Submitted 20 May, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: Published at the International Conference on Machine Learning (ICML) 2021

  10. arXiv:2102.01010  [pdf, other]

    physics.flu-dyn cs.LG

    Machine learning accelerated computational fluid dynamics

    Authors: Dmitrii Kochkov, Jamie A. Smith, Ayya Alieva, Qing Wang, Michael P. Brenner, Stephan Hoyer

    Abstract: Numerical simulation of fluids plays an essential role in modeling many physical phenomena, such as weather, climate, aerodynamics and plasma physics. Fluids are well described by the Navier-Stokes equations, but solving these equations at scale remains daunting, limited by the computational cost of resolving the smallest spatiotemporal features. This leads to unfavorable trade-offs between accura…

    Submitted 28 January, 2021; originally announced February 2021.

    Comments: 13 pages, 9 figures

  11. arXiv:2009.08551  [pdf, other]

    physics.comp-ph cs.LG

    Kohn-Sham equations as regularizer: building prior knowledge into machine-learned physics

    Authors: Li Li, Stephan Hoyer, Ryan Pederson, Ruoxi Sun, Ekin D. Cubuk, Patrick Riley, Kieron Burke

    Abstract: Including prior knowledge is important for effective machine learning models in physics, and is usually achieved by explicitly adding loss terms or constraints on model architectures. Prior knowledge embedded in the physics computation itself rarely draws attention. We show that solving the Kohn-Sham equations when training neural networks for the exchange-correlation functional provides an implic…

    Submitted 17 November, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Journal ref: Phys. Rev. Lett. 126, 036401 (2021)

  12. Array Programming with NumPy

    Authors: Charles R. Harris, K. Jarrod Millman, Stéfan J. van der Walt, Ralf Gommers, Pauli Virtanen, David Cournapeau, Eric Wieser, Julian Taylor, Sebastian Berg, Nathaniel J. Smith, Robert Kern, Matti Picus, Stephan Hoyer, Marten H. van Kerkwijk, Matthew Brett, Allan Haldane, Jaime Fernández del Río, Mark Wiebe, Pearu Peterson, Pierre Gérard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke, et al. (1 additional author not shown)

    Abstract: Array programming provides a powerful, compact, expressive syntax for accessing, manipulating, and operating on data in vectors, matrices, and higher-dimensional arrays. NumPy is the primary array programming library for the Python language. It plays an essential role in research analysis pipelines in fields as diverse as physics, chemistry, astronomy, geoscience, biology, psychology, material sci…

    Submitted 17 June, 2020; originally announced June 2020.

    Journal ref: Nature 585, 357 (2020)

  13. arXiv:2003.04630  [pdf, other]

    cs.LG math.DS physics.comp-ph physics.data-an stat.ML

    Lagrangian Neural Networks

    Authors: Miles Cranmer, Sam Greydanus, Stephan Hoyer, Peter Battaglia, David Spergel, Shirley Ho

    Abstract: Accurate models of the world are built upon notions of its underlying symmetries. In physics, these symmetries correspond to conservation laws, such as for energy and momentum. Yet even though neural network models see increasing use in the physical sciences, they struggle to learn these symmetries. In this paper, we propose Lagrangian Neural Networks (LNNs), which can parameterize arbitrary Lagra…

    Submitted 30 July, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: 7 pages (+2 appendix). Published in ICLR 2020 Deep Differential Equations Workshop. Code at github.com/MilesCranmer/lagrangian_nns

  14. arXiv:1910.05006  [pdf, other]

    cs.LG stat.ML

    Inundation Modeling in Data Scarce Regions

    Authors: Zvika Ben-Haim, Vladimir Anisimov, Aaron Yonas, Varun Gulshan, Yusef Shafi, Stephan Hoyer, Sella Nevo

    Abstract: Flood forecasts are crucial for effective individual and governmental protective action. The vast majority of flood-related casualties occur in developing countries, where providing spatially accurate forecasts is a challenge due to scarcity of data and lack of funding. This paper describes an operational system providing flood extent forecast maps covering several flood-prone regions in India, wi…

    Submitted 30 October, 2019; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: To appear in the Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop (AI+HADR) @ NeurIPS 2019

  15. arXiv:1909.04240  [pdf, other]

    cs.LG cs.NE stat.ML

    Neural reparameterization improves structural optimization

    Authors: Stephan Hoyer, Jascha Sohl-Dickstein, Sam Greydanus

    Abstract: Structural optimization is a popular method for designing objects such as bridge trusses, airplane wings, and optical devices. Unfortunately, the quality of solutions depends heavily on how the problem is parameterized. In this paper, we propose using the implicit bias over functions induced by neural networks to improve the parameterization of structural optimization. Rather than directly optimiz…

    Submitted 13 September, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

  16. arXiv:1811.12436  [pdf]

    physics.optics cs.LG

    Freeform Diffractive Metagrating Design Based on Generative Adversarial Networks

    Authors: Jiaqi Jiang, David Sell, Stephan Hoyer, Jason Hickey, Jianji Yang, Jonathan A. Fan

    Abstract: A key challenge in metasurface design is the development of algorithms that can effectively and efficiently produce high performance devices. Design methods based on iterative optimization can push the performance limits of metasurfaces, but they require extensive computational resources that limit their implementation to small numbers of microscale devices. We show that generative neural networks…

    Submitted 18 June, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: 15 pages, 5 figures

  17. arXiv:1705.10743  [pdf, other]

    cs.LG stat.ML

    The Cramer Distance as a Solution to Biased Wasserstein Gradients

    Authors: Marc G. Bellemare, Ivo Danihelka, Will Dabney, Shakir Mohamed, Balaji Lakshminarayanan, Stephan Hoyer, Rémi Munos

    Abstract: The Wasserstein probability metric has received much attention from the machine learning community. Unlike the Kullback-Leibler divergence, which strictly measures change in probability, the Wasserstein metric reflects the underlying geometry between outcomes. The value of being sensitive to this geometry has been demonstrated, among others, in ordinal regression and generative modelling. In this…

    Submitted 30 May, 2017; originally announced May 2017.