-
Cosmic ray and plasma coupling for isothermal supersonic turbulence in the magnetized interstellar medium
Authors:
Matt L. Sampson,
James R. Beattie,
Romain Teyssier,
Philipp Kempski,
Eric R. Moseley,
Benoît Commerçon,
Yohan Dubois,
Joakim Rosdahl
Abstract:
Cosmic rays (CRs) are an integral part of the non-thermal pressure budget in the interstellar medium (ISM) and are the leading-order ionization mechanism in cold molecular clouds. We study the impacts that different microphysical CR diffusion coefficients and streaming speeds have on the evolution of isothermal, magnetized, turbulent plasmas, relevant to the cold ISM. We utilized a two-moment CR magnetohydrodynamic (CRMHD) model, allowing us to dynamically evolve both CR energy and flux densities with contributions from Alfvénic streaming and anisotropic diffusion. We identify $\textit{coupled}$ and $\textit{decoupled}$ regimes, and define dimensionless Prandtl numbers $\rm{Pm_c}$ and $\rm{Pm_s}$, which quantify whether the plasma falls within these two regimes. In the coupled regime -- characteristic of slow streaming ($\rm{Pm_s} < 1$) and low diffusion ($\rm{Pm_c} < 1$) -- the CR fluid imprints upon the plasma a mixed equation of state between $P_{\rm{c}} \propto \rho^{4/3}$ (relativistic fluid) and $P_{\rm{c}} \propto \rho^{2/3}$ (streaming), where $P_{\rm{c}}$ is the CR pressure, and $\rho$ is the plasma density. By modifying the sound speed, the coupling reduces the turbulent Mach number, and hence the amplitude of the density fluctuations, whilst supporting secular heating of the CR fluid. In contrast, in the decoupled regime ($\rm{Pm_s} > 1$ or $\rm{Pm_c} > 1$) the CR fluid and the plasma have negligible interactions. We further show that CR heating is enabled by coherent structures within the compressible velocity field, with no impact on the turbulence spectrum of incompressible modes.
Submitted 4 June, 2025;
originally announced June 2025.
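The abstract above states that coupling modifies the sound speed and thereby lowers the turbulent Mach number. A minimal sketch of that arithmetic follows. It assumes the textbook tightly coupled expression $c_{\rm eff}^2 = c_s^2 + \gamma_{\rm eff} P_{\rm c}/\rho$, which is not taken from the paper, and the function name and arguments are hypothetical.

```python
import math


def effective_mach(v_rms, c_s, p_c, rho, gamma_eff=4.0 / 3.0):
    """Turbulent Mach number when CR pressure adds to the gas sound speed.

    Assumption (not from the paper): c_eff^2 = c_s^2 + gamma_eff * p_c / rho,
    with gamma_eff between 2/3 (streaming) and 4/3 (relativistic fluid).
    """
    c_eff = math.sqrt(c_s**2 + gamma_eff * p_c / rho)
    return v_rms / c_eff


# Without CR pressure the usual Mach number v_rms / c_s is recovered.
mach_no_cr = effective_mach(v_rms=4.0, c_s=1.0, p_c=0.0, rho=1.0)
# With CR pressure the effective sound speed rises and the Mach number drops.
mach_with_cr = effective_mach(v_rms=4.0, c_s=1.0, p_c=2.25, rho=1.0)
```

In this picture the decoupled regime corresponds to setting `p_c` to zero in the sound speed, since the CR fluid then exerts negligible back-reaction on the plasma.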
-
Path-minimizing Latent ODEs for improved extrapolation and inference
Authors:
Matt L. Sampson,
Peter Melchior
Abstract:
Latent ODE models provide flexible descriptions of dynamic systems, but they can struggle with extrapolation and predicting complicated non-linear dynamics. The latent ODE approach implicitly relies on encoders to identify unknown system parameters and initial conditions, whereas the evaluation times are known and directly provided to the ODE solver. This dichotomy can be exploited by encouraging time-independent latent representations. By replacing the common variational penalty in latent space with an $\ell_2$ penalty on the path length of each system, the models learn data representations that can easily be distinguished from those of systems with different configurations. This results in faster training, smaller models, more accurate interpolation and long-time extrapolation compared to the baseline ODE models with GRU, RNN, and LSTM encoder/decoders on tests with damped harmonic oscillator, self-gravitating fluid, and predator-prey systems. We also demonstrate superior results for simulation-based inference of the Lotka-Volterra parameters and initial conditions by using the latents as data summaries for a conditional normalizing flow. Our change to the training loss is agnostic to the specific recognition network used by the decoder and can therefore easily be adopted by other latent ODE models.
Submitted 11 October, 2024;
originally announced October 2024.
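The path-length penalty described in the abstract above can be illustrated with a short sketch. It assumes the path length is the sum of Euclidean distances between consecutive latent states and that the penalty is simply added to a mean-squared reconstruction error. Both choices, and all names below, are illustrative assumptions rather than the authors' implementation.

```python
import numpy as np


def path_length(latents):
    """Sum of Euclidean step lengths along a latent trajectory of shape (T, D)."""
    steps = np.diff(np.asarray(latents, dtype=float), axis=0)
    return np.linalg.norm(steps, axis=1).sum()


def penalized_loss(reconstruction, target, latents, weight=1.0):
    """Reconstruction error plus a path-length penalty (hypothetical weighting)."""
    mse = np.mean((np.asarray(reconstruction) - np.asarray(target)) ** 2)
    return mse + weight * path_length(latents)


# A latent state that does not move in time has zero path length,
# which is the time-independent representation the penalty encourages.
static = np.tile([1.0, -2.0], (5, 1))
moving = np.array([[0.0, 0.0], [3.0, 4.0], [3.0, 4.0]])
```

A trajectory that stays at one point contributes nothing to the penalty, so minimizing it pushes each system toward a single, time-independent latent location.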
-
Disentangling transients and their host galaxies with Scarlet2: A framework to forward model multi-epoch imaging
Authors:
Charlotte Ward,
Peter Melchior,
Matt L. Sampson,
Colin J. Burke,
Jared Siegel,
Benjamin Remy,
Sufia Birmingham,
Emily Ramey,
Sjoert van Velzen
Abstract:
Many science cases for wide-field time-domain surveys rely on accurate identification and characterization of the galaxies hosting transient and variable objects. In the era of the Legacy Survey of Space and Time (LSST) at the Vera C. Rubin Observatory the number of known transient and variable sources will grow by orders of magnitude, and many of these sources will be blended with their host galaxies and neighboring galaxies. A diverse range of applications - including the classification of nuclear and non-nuclear sources, identification of potential host galaxies, extraction of host galaxy SEDs without requiring a transient-free reference image, and combined analysis of photometry from multiple surveys - will benefit from a flexible framework to model time-domain imaging of transients. We describe a time-domain extension of the Scarlet2 scene modeling code for multi-epoch, multi-band, and multi-resolution imaging data to extract simultaneous transient and host galaxy models. Scarlet2 leverages the benefits of data-driven priors on galaxy morphology, is fully GPU compatible, and can jointly model multi-resolution data from ground and space-based surveys. We demonstrate the method on simulated LSST-like supernova imaging, low-resolution Zwicky Transient Facility imaging of tidal disruption events, and Hyper Suprime Cam imaging of variable AGN out to z = 4 in the COSMOS fields. We show that Scarlet2 models provide accurate transient and host galaxy models as well as accurate measurement of host-transient spatial offsets, and demonstrate future applications to the search for 'wandering' massive black holes.
Submitted 26 January, 2025; v1 submitted 23 September, 2024;
originally announced September 2024.
-
Score-matching neural networks for improved multi-band source separation
Authors:
Matt L. Sampson,
Peter Melchior,
Charlotte Ward,
Sufia Birmingham
Abstract:
We present the implementation of a score-matching neural network that represents a data-driven prior for non-parametric galaxy morphologies. The gradients of this prior can be incorporated in the optimization of galaxy models to aid with tasks like deconvolution, inpainting or source separation. We demonstrate this approach with modification of the multi-band modeling framework Scarlet that is currently employed as the deblending method in the pipelines of the HyperSuprimeCam survey and the Rubin Observatory. The addition of the prior avoids the requirement of non-differentiable constraints, which can lead to convergence failures we discovered in Scarlet. We present the architecture and training details of our score-matching neural network and show with simulated Rubin-like observations that using a data-driven prior outperforms the baseline Scarlet method in accuracy of total flux and morphology estimates, while maintaining excellent performance for colors. We also demonstrate significant improvements in the robustness to inaccurate initializations. The trained score models used for this analysis are publicly available at https://github.com/SampsonML/galaxygrad.
Submitted 5 September, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
Spotting Hallucinations in Inverse Problems with Data-Driven Priors
Authors:
Matt L. Sampson,
Peter Melchior
Abstract:
Hallucinations are an inescapable consequence of solving inverse problems with deep neural networks. The expressiveness of recent generative models is the reason why they can yield results far superior to conventional regularizers; it can also lead to realistic-looking but incorrect features, potentially undermining the trust in important aspects of the reconstruction. We present a practical and computationally efficient method to determine which regions in the solutions of inverse problems with data-driven priors are prone to hallucinations. By computing the diagonal elements of the Fisher information matrix of the likelihood and the data-driven prior separately, we can flag regions where the information is prior-dominated. Our diagnostic can directly be compared to the reconstructed solutions and enables users to decide if measurements in such regions are robust for their application. Our method scales linearly with the number of parameters and is thus applicable in high-dimensional settings, allowing it to be rolled out broadly for the large-volume data products of future wide-field surveys.
Submitted 22 June, 2023;
originally announced June 2023.
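The diagnostic in the abstract above compares the diagonal Fisher information of the likelihood with that of the prior. The sketch below shows the comparison for a toy case only: a linear forward model $y = Ax + n$ with independent Gaussian noise, for which the likelihood Fisher diagonal is $\sum_i A_{ij}^2/\sigma_i^2$. The prior Fisher diagonal is passed in as a plain array, standing in for whatever a data-driven prior would supply. All names are hypothetical.

```python
import numpy as np


def likelihood_fisher_diag(A, noise_var):
    """Diagonal of A^T diag(1/noise_var) A for a linear Gaussian likelihood."""
    A = np.asarray(A, dtype=float)
    noise_var = np.broadcast_to(np.asarray(noise_var, dtype=float), (A.shape[0],))
    return (A**2 / noise_var[:, None]).sum(axis=0)


def prior_dominated(A, noise_var, prior_fisher_diag):
    """Flag parameters where the prior carries more information than the data."""
    return np.asarray(prior_fisher_diag, dtype=float) > likelihood_fisher_diag(A, noise_var)


# Toy case: the second parameter is never observed, so only the prior informs it.
A = np.array([[1.0, 0.0], [2.0, 0.0]])
flags = prior_dominated(A, noise_var=1.0, prior_fisher_diag=[0.5, 0.5])
```

Because only diagonal elements are formed, the cost grows linearly with the number of parameters, consistent with the scaling the abstract describes.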
-
Cosmic Ray Interstellar Propagation Tool using Itô Calculus (criptic): software for simultaneous calculation of cosmic ray transport and observational signatures
Authors:
Mark R. Krumholz,
Roland M. Crocker,
Matt L. Sampson
Abstract:
We present criptic, the Cosmic Ray Interstellar Propagation Tool using Itô Calculus, a new open-source software package to simulate the propagation of cosmic rays through the interstellar medium and to calculate the resulting observable non-thermal emission. Criptic solves the Fokker-Planck equation describing transport of cosmic rays on scales larger than that on which their pitch angles become approximately isotropic, and couples this to a rich and accurate treatment of the microphysical processes by which cosmic rays in the energy range $\sim$MeV to $\sim$PeV lose energy and produce emission. Criptic is deliberately agnostic as to both the cosmic ray transport model and the state of the background plasma through which cosmic rays travel. It can solve problems where cosmic rays stream, diffuse, or perform arbitrary combinations of both, and the coefficients describing these transport processes can be arbitrary functions of the background plasma state, the properties of the cosmic rays themselves, and local integrals of the cosmic ray field itself (e.g., the local cosmic ray pressure or pressure gradient). The code is parallelised using a hybrid OpenMP-MPI paradigm, allowing rapid calculations exploiting multiple cores and nodes on modern supercomputers. Here we describe the numerical methods used in the code, our treatment of the microphysical processes, and the set of code tests and validations we have performed.
Submitted 19 September, 2022; v1 submitted 27 July, 2022;
originally announced July 2022.
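The abstract above describes cosmic rays that stream, diffuse, or do both, treated with Itô calculus. A generic way to picture this is an Euler-Maruyama update of the one-dimensional stochastic equation $dx = v\,dt + \sqrt{2K}\,dW$, sketched below with a constant streaming speed `stream_speed` and diffusion coefficient `diff_coeff`. This is a textbook scheme used for illustration; it is not criptic's actual algorithm or interface, and every name is hypothetical.

```python
import numpy as np


def euler_maruyama(x0, stream_speed, diff_coeff, dt, n_steps, rng=None):
    """Advance particle positions under constant streaming plus diffusion."""
    rng = np.random.default_rng() if rng is None else rng
    x = np.array(x0, dtype=float)
    for _ in range(n_steps):
        noise = rng.standard_normal(x.shape)
        # Drift from streaming, plus a Wiener increment scaled by sqrt(2 K dt).
        x = x + stream_speed * dt + np.sqrt(2.0 * diff_coeff * dt) * noise
    return x


# Pure streaming: every particle moves deterministically by v * t.
streamed = euler_maruyama(np.zeros(4), stream_speed=2.0, diff_coeff=0.0, dt=0.5, n_steps=10)
# Pure diffusion: the position variance should approach 2 * K * t.
diffused = euler_maruyama(np.zeros(20000), 0.0, 1.0, 0.01, 100, np.random.default_rng(0))
```

Making `stream_speed` and `diff_coeff` functions of position or of the local particle field would be the natural extension toward the state-dependent coefficients the abstract mentions.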
-
Turbulent diffusion of streaming cosmic rays in compressible, partially ionised plasma
Authors:
Matt L. Sampson,
James R. Beattie,
Mark R. Krumholz,
Roland M. Crocker,
Christoph Federrath,
Amit Seta
Abstract:
Cosmic rays (CRs) are a dynamically important component of the interstellar medium (ISM) of galaxies. The $\sim$GeV CRs that carry most CR energy and pressure are likely confined by self-generated turbulence, leading them to stream along magnetic field lines at the ion Alfvén speed. However, the consequences of self-confinement for CR propagation on galaxy scales remain highly uncertain. In this paper, we use a large ensemble of magnetohydrodynamical turbulence simulations to quantify how the basic parameters describing ISM turbulence -- the sonic Mach number, $\mathcal{M}$ (plasma compressibility), Alfvén Mach number, $\mathcal{M}_{A0}$ (strength of the large-scale field with respect to the turbulence), and ionisation fraction by mass, $\chi$ -- affect the transport of streaming CRs. We show that the large-scale transport of CRs whose small-scale motion consists of streaming along field lines is well described as a combination of streaming along the mean field and superdiffusion both along (parallel to) and across (perpendicular to) it; $\mathcal{M}_{A0}$ drives the level of anisotropy between parallel and perpendicular diffusion and $\chi$ modulates the magnitude of the diffusion coefficients, while in our choice of units, $\mathcal{M}$ is unimportant except in the sub-Alfvénic ($\mathcal{M}_{A0} \lesssim 0.5$) regime. Our finding that superdiffusion is ubiquitous potentially explains the apparent discrepancy between CR diffusion coefficients inferred from measurements close to individual sources compared to those measured on larger, Galactic scales. Finally, we present empirical fits for the diffusion coefficients as a function of plasma parameters that may be used as sub-grid recipes for global interstellar medium, galaxy or cosmological simulations.
Submitted 3 November, 2022; v1 submitted 17 May, 2022;
originally announced May 2022.
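Superdiffusion, as used in the abstract above, means the variance of particle displacements grows faster than linearly in time, $\sigma^2 \propto t^{\alpha}$ with $\alpha > 1$. One simple way to estimate $\alpha$ from measured variances is a straight-line fit in log-log space, sketched below. The function names and the fitting choice are assumptions for illustration and are not the paper's fitting procedure.

```python
import numpy as np


def diffusion_exponent(times, displacement_variance):
    """Slope alpha of log(variance) against log(time)."""
    slope, _intercept = np.polyfit(np.log(times), np.log(displacement_variance), 1)
    return slope


def is_superdiffusive(times, displacement_variance, tol=1e-6):
    """True when the fitted exponent exceeds the classical value alpha = 1."""
    return diffusion_exponent(times, displacement_variance) > 1.0 + tol


# Synthetic check: variance growing as t^1.5 is superdiffusive, t^1 is not.
t = np.array([1.0, 2.0, 4.0, 8.0])
alpha = diffusion_exponent(t, 3.0 * t**1.5)
```

Classical diffusion corresponds to $\alpha = 1$, so the fitted slope gives a direct measure of how far the transport departs from it.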