-
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning
Authors:
Ruben Ohana,
Michael McCabe,
Lucas Meyer,
Rudy Morel,
Fruzsina J. Agocs,
Miguel Beneitez,
Marsha Berger,
Blakesley Burkhart,
Keaton Burns,
Stuart B. Dalziel,
Drummond B. Fielding,
Daniel Fortunato,
Jared A. Goldberg,
Keiya Hirashima,
Yan-Fei Jiang,
Rich R. Kerswell,
Suryanarayana Maddu,
Jonah Miller,
Payel Mukhopadhyay,
Stefan S. Nixon,
Jeff Shen,
Romain Watteaux,
Bruno Régaldo-Saint Blancard,
François Rozet,
Liam H. Parker,
et al. (2 additional authors not shown)
Abstract:
Machine learning based surrogate models offer researchers powerful tools for accelerating simulation-based workflows. However, as standard datasets in this space often cover small classes of physical behavior, it can be difficult to evaluate the efficacy of new approaches. To address this gap, we introduce the Well: a large-scale collection of datasets containing numerical simulations of a wide variety of spatiotemporal physical systems. The Well draws from domain experts and numerical software developers to provide 15TB of data across 16 datasets covering diverse domains such as biological systems, fluid dynamics, acoustic scattering, as well as magneto-hydrodynamic simulations of extra-galactic fluids or supernova explosions. These datasets can be used individually or as part of a broader benchmark suite. To facilitate usage of the Well, we provide a unified PyTorch interface for training and evaluating models. We demonstrate the function of this library by introducing example baselines that highlight the new challenges posed by the complex dynamics of the Well. The code and data are available at https://github.com/PolymathicAI/the_well.
Submitted 21 February, 2025; v1 submitted 30 November, 2024;
originally announced December 2024.
-
Extending a Physics-Informed Machine Learning Network for Superresolution Studies of Rayleigh-Bénard Convection
Authors:
Diane M. Salim,
Blakesley Burkhart,
David Sondak
Abstract:
Advancing our understanding of astrophysical turbulence is bottlenecked by the limited resolution of numerical simulations that may not fully sample scales in the inertial range. Machine learning (ML) techniques have demonstrated promise in up-scaling resolution in both image analysis and numerical simulations (i.e., superresolution). Here we employ and further develop a physics-constrained convolutional neural network (CNN) ML model called "MeshFreeFlowNet" (MFFN) for superresolution studies of turbulent systems. The model is trained both on the simulation images and on the evaluated PDEs, making it sensitive to the underlying physics of a particular fluid system. We develop a framework for 2D turbulent Rayleigh-Bénard convection (RBC) generated with the Dedalus code by modifying the MFFN architecture to include the full set of simulation PDEs and the boundary conditions. Our training set includes fully developed turbulence sampling Rayleigh numbers ($Ra$) in the range $Ra = 10^6$ to $10^{10}$. We evaluate the success of the learned simulations by comparing the power spectra of the direct Dedalus simulation to the predicted model output, and compare both ground truth and predicted power spectral inertial range scalings to theoretical predictions. We find that the updated network performs well at all $Ra$ studied here in recovering large-scale information, including the inertial range slopes. The superresolution prediction is overly dissipative at scales smaller than the inertial range in all cases, but the smaller scales are better recovered in more turbulent regimes than in laminar ones. This is likely because more turbulent systems have a rich variety of structures at many length scales compared to laminar flows.
Submitted 31 January, 2024; v1 submitted 5 July, 2023;
originally announced July 2023.
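The evaluation described in the Rayleigh-Bénard entry above (comparing power spectra and inertial-range slopes between a reference simulation and a model prediction) could be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: it assumes a square, doubly periodic 2D field so that a plain FFT applies, and the function names `isotropic_power_spectrum` and `inertial_range_slope` are hypothetical.

```python
import numpy as np


def isotropic_power_spectrum(field):
    """Shell-summed power spectrum of a square 2D field.

    Assumption: the field is periodic in both directions. Returns integer
    wavenumbers k = 1 .. n//2 and the power summed over each shell.
    """
    n = field.shape[0]
    assert field.shape == (n, n), "expects a square 2D array"
    # Normalised so that the sum over all modes equals the mean of field**2.
    power = np.abs(np.fft.fft2(field)) ** 2 / n**4
    k1d = np.fft.fftfreq(n, d=1.0 / n)  # integer wavenumbers
    kx, ky = np.meshgrid(k1d, k1d, indexing="ij")
    shell = np.rint(np.sqrt(kx**2 + ky**2)).astype(int)
    spectrum = np.bincount(shell.ravel(), weights=power.ravel())
    k = np.arange(spectrum.size)
    return k[1 : n // 2 + 1], spectrum[1 : n // 2 + 1]


def inertial_range_slope(k, spectrum, k_min, k_max):
    """Least-squares slope of log(spectrum) against log(k) on [k_min, k_max]."""
    mask = (k >= k_min) & (k <= k_max) & (spectrum > 0)
    slope, _ = np.polyfit(np.log(k[mask]), np.log(spectrum[mask]), 1)
    return slope
```

In use, one would compute the spectrum of the reference field and of the predicted field with the same function and compare the fitted slopes over a chosen wavenumber range; the choice of `k_min` and `k_max` is left to the user here and is not taken from the paper.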
-
Diagnosing Turbulence in the Neutral and Molecular Interstellar Medium of Galaxies
Authors:
Blakesley Burkhart
Abstract:
Magnetohydrodynamic (MHD) turbulence is a crucial component of the current paradigms of star formation, dynamo theory, particle transport, magnetic reconnection and evolution of structure in the interstellar medium (ISM) of galaxies. Despite the importance of turbulence to astrophysical fluids, a full theoretical framework based on solutions to the Navier-Stokes equations remains intractable. Observations provide only limited line-of-sight information on densities, temperatures, velocities and magnetic field strengths and therefore directly measuring turbulence in the ISM is challenging. A statistical approach has been of great utility in allowing comparisons of observations, simulations and analytic predictions. In this review article we address the growing importance of MHD turbulence in many fields of astrophysics and review statistical diagnostics for studying interstellar and interplanetary turbulence. In particular, we will review statistical diagnostics and machine learning algorithms that have been developed for observational data sets in order to obtain information about the turbulence cascade, fluid compressibility (sonic Mach number), and magnetization of fluid (Alfvénic Mach number). These techniques have often been tested on numerical simulations of MHD turbulence, which may include the creation of synthetic observations, and are often formulated on theoretical expectations for compressible magnetized turbulence. We stress the use of multiple techniques, as this can provide a more accurate indication of the turbulence parameters of interest. We conclude by describing several open-source tools for the astrophysical community to use when dealing with turbulence.
Submitted 3 June, 2021;
originally announced June 2021.
-
Principal Component Analysis studies of turbulence in optically thick gas
Authors:
Caio Correia,
Alex Lazarian,
Blakesley Burkhart,
Dmitri Pogosyan,
José Renan De Medeiros
Abstract:
In this work we investigate the Principal Component Analysis (PCA) sensitivity to the velocity power spectrum in high opacity regimes of the interstellar medium (ISM). For our analysis we use synthetic Position-Position-Velocity (PPV) cubes of fractional Brownian motion (fBm) and magnetohydrodynamics (MHD) simulations, post-processed to include radiative transfer effects from CO. We find that PCA is very different from the tools based on the traditional power spectrum of PPV data cubes. Our major finding is that PCA is also sensitive to the phase information of PPV cubes and this allows PCA to detect the changes of the underlying velocity and density spectra at high opacities, where the spectral analysis of the maps provides the universal -3 spectrum in accordance with the predictions of the Lazarian & Pogosyan (2004) theory. This makes PCA potentially a valuable tool for studies of turbulence at high opacities provided that the proper gauging of the PCA index is made. The latter, however, we found to be not easy, as the PCA results change in an irregular way for data with high sonic Mach numbers. This is in contrast to synthetic Brownian noise data used for velocity and density fields that show monotonic PCA behavior. We attribute this difference to the PCA's sensitivity to Fourier phase information.
Submitted 11 November, 2015;
originally announced November 2015.
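For readers unfamiliar with PCA on PPV cubes, as used in the entry above, a rough sketch of the decomposition step is given below. It assumes the common convention of treating each line-of-sight spectrum as one sample and building the covariance matrix across velocity channels without mean subtraction; the function name `ppv_pca` and the `(velocity, y, x)` axis order are assumptions for illustration, not details taken from the paper.

```python
import numpy as np


def ppv_pca(cube):
    """PCA of a position-position-velocity cube with shape (n_v, n_y, n_x).

    Returns eigenvalues in descending order, the matching eigenvectors
    (one per column, each of length n_v), and the eigenimages obtained by
    projecting every spectrum onto each eigenvector.
    """
    n_v = cube.shape[0]
    spectra = cube.reshape(n_v, -1).T            # (n_pixels, n_v)
    # Channel-by-channel covariance matrix (no mean subtraction: an assumption).
    cov = spectra.T @ spectra / spectra.shape[0]
    eigvals, eigvecs = np.linalg.eigh(cov)       # ascending order
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    eigenimages = (spectra @ eigvecs).T.reshape(cube.shape)
    return eigvals, eigvecs, eigenimages
```

Turbulence studies of this kind typically go on to measure characteristic velocity and spatial scales from each eigenvector and eigenimage pair; that calibration step is not shown here.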
-
Low-Mach-number turbulence in interstellar gas revealed by radio polarization gradients
Authors:
Bryan M. Gaensler,
Marijke Haverkorn,
Blakesley Burkhart,
Katherine J. Newton-McGee,
Ronald D. Ekers,
Alex Lazarian,
Naomi M. McClure-Griffiths,
Timothy Robishaw,
John M. Dickey,
Anne J. Green
Abstract:
The interstellar medium of the Milky Way is multi-phase, magnetized and turbulent. Turbulence in the interstellar medium produces a global cascade of random gas motions, spanning scales ranging from 100 parsecs to 1000 kilometres. Fundamental parameters of interstellar turbulence such as the sonic Mach number (the ratio of the turbulent flow speed to the speed of sound) have been difficult to determine because observations have lacked the sensitivity and resolution to directly image the small-scale structure associated with turbulent motion. Observations of linear polarization and Faraday rotation in radio emission from the Milky Way have identified unusual polarized structures that often have no counterparts in the total radiation intensity or at other wavelengths, and whose physical significance has been unclear. Here we report that the gradient of the Stokes vector (Q,U), where Q and U are parameters describing the polarization state of radiation, provides an image of magnetized turbulence in diffuse ionized gas, manifested as a complex filamentary web of discontinuities in gas density and magnetic field. Through comparison with simulations, we demonstrate that turbulence in the warm ionized medium has a relatively low sonic Mach number, $M_s \lesssim 2$. The development of statistical tools for the analysis of polarization gradients will allow accurate determinations of the Mach number, Reynolds number and magnetic field strength in interstellar turbulence over a wide range of conditions.
Submitted 13 October, 2011;
originally announced October 2011.
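The gradient of the Stokes vector (Q,U) mentioned in the entry above has a simple magnitude, the square root of the summed squares of the four partial derivatives of Q and U. A minimal sketch of that quantity on a regular pixel grid follows; the function name `polarization_gradient` and the `pixel_size` parameter are illustrative assumptions, and finite differences via `np.gradient` stand in for whatever derivative scheme the authors used.

```python
import numpy as np


def polarization_gradient(Q, U, pixel_size=1.0):
    """Magnitude of the spatial gradient of the Stokes vector (Q, U).

    Q and U are 2D maps on the same regular grid, indexed as [y, x].
    """
    # np.gradient returns one derivative array per axis, in axis order.
    dQ_dy, dQ_dx = np.gradient(Q, pixel_size)
    dU_dy, dU_dx = np.gradient(U, pixel_size)
    return np.sqrt(dQ_dx**2 + dQ_dy**2 + dU_dx**2 + dU_dy**2)
```

Because only derivatives of Q and U enter, the result is unchanged by adding a constant offset to either map and by a uniform rotation of the polarization angle.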