-
Scalable stellar evolution forecasting: Deep learning emulation vs. hierarchical nearest neighbor interpolation
Authors:
K. Maltsev,
F. R. N. Schneider,
F. K. Roepke,
A. I. Jordan,
G. A. Qadir,
W. E. Kerzendorf,
K. Riedmiller,
P. van der Smagt
Abstract:
Many astrophysical applications require efficient yet reliable forecasts of stellar evolution tracks. One example is population synthesis, which generates forward predictions of models for comparison with observations. The majority of state-of-the-art rapid population synthesis methods are based on analytic fitting formulae to stellar evolution tracks that are computationally cheap to sample stati…
▽ More
Many astrophysical applications require efficient yet reliable forecasts of stellar evolution tracks. One example is population synthesis, which generates forward predictions of models for comparison with observations. The majority of state-of-the-art rapid population synthesis methods are based on analytic fitting formulae to stellar evolution tracks that are computationally cheap to sample statistically over a continuous parameter range. The computational costs of running detailed stellar evolution codes, such as MESA, over wide and densely sampled parameter grids are prohibitive, while stellar-age based interpolation in-between sparsely sampled grid points leads to intolerably large systematic prediction errors. In this work, we provide two solutions for automated interpolation methods that offer satisfactory trade-off points between cost-efficiency and accuracy. We construct a timescale-adapted evolutionary coordinate and use it in a two-step interpolation scheme that traces the evolution of stars from ZAMS all the way to the end of core helium burning while covering a mass range from ${0.65}$ to $300 \, \mathrm{M_\odot}$. The feedforward neural network regression model (first solution) that we train to predict stellar surface variables can make millions of predictions, sufficiently accurate over the entire parameter space, within tens of seconds on a 4-core CPU. The hierarchical nearest-neighbor interpolation algorithm (second solution) that we hard-code to the same end achieves even higher predictive accuracy, the same algorithm remains applicable to all stellar variables evolved over time, but it is two orders of magnitude slower. Our methodological framework is demonstrated to work on the MIST (Choi et al. 2016) data set. Finally, we discuss the prospective applications of these methods and provide guidelines for generalizing them to higher dimensional parameter spaces.
△ Less
Submitted 27 October, 2023; v1 submitted 22 September, 2023;
originally announced September 2023.
-
1991T-Like Type Ia Supernovae as an Extension of the Normal Population
Authors:
John T. O'Brien,
Wolfgang E. Kerzendorf,
Andrew Fullard,
Reudiger Pakmor,
Johannes Buchner,
Christian Vogl,
Nutan Chen,
Patrick van der Smagt,
Marc Williamson,
Jaladh Singhal
Abstract:
Type Ia supernovae remain poorly understood despite decades of investigation. Massive computationally intensive hydrodynamic simulations have been developed and run to model an ever-growing number of proposed progenitor channels. Further complicating the matter, a large number of sub-types of Type Ia supernovae have been identified in recent decades. Due to the massive computational load required,…
▽ More
Type Ia supernovae remain poorly understood despite decades of investigation. Massive computationally intensive hydrodynamic simulations have been developed and run to model an ever-growing number of proposed progenitor channels. Further complicating the matter, a large number of sub-types of Type Ia supernovae have been identified in recent decades. Due to the massive computational load required, inference of the internal structure of Type Ia supernovae ejecta directly from observations using simulations has previously been computationally intractable. However, deep-learning emulators for radiation transport simulations have alleviated such barriers. We perform abundance tomography on 40 Type Ia supernovae from optical spectra using the radiative transfer code TARDIS accelerated by the probabilistic DALEK deep-learning emulator. We apply a parametric model of potential ejecta structures to comparatively investigate abundance distributions and internal ionization fractions of intermediate-mass elements between normal and 1991T-like Type Ia supernovae. Our inference shows that 1991T-like Type Ia supernovae are under-abundant in the typical intermediate mass elements that heavily contribute to the spectral line formation seen in normal Type Ia supernovae at early times. Additionally, we find that the intermediate-mass elements present in 1991T-like Type Ia supernovae are highly ionized compared to those in the normal Type Ia population. Finally, we conclude that the transition between normal and 1991T-like Type Ia supernovae appears to be continuous observationally and that the observed differences come out of a combination of both abundance and ionization fractions in these supernovae populations.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
Probabilistic Dalek -- Emulator framework with probabilistic prediction for supernova tomography
Authors:
Wolfgang Kerzendorf,
Nutan Chen,
Jack O'Brien,
Johannes Buchner,
Patrick van der Smagt
Abstract:
Supernova spectral time series can be used to reconstruct a spatially resolved explosion model known as supernova tomography. In addition to an observed spectral time series, a supernova tomography requires a radiative transfer model to perform the inverse problem with uncertainty quantification for a reconstruction. The smallest parametrizations of supernova tomography models are roughly a dozen…
▽ More
Supernova spectral time series can be used to reconstruct a spatially resolved explosion model known as supernova tomography. In addition to an observed spectral time series, a supernova tomography requires a radiative transfer model to perform the inverse problem with uncertainty quantification for a reconstruction. The smallest parametrizations of supernova tomography models are roughly a dozen parameters with a realistic one requiring more than 100. Realistic radiative transfer models require tens of CPU minutes for a single evaluation making the problem computationally intractable with traditional means requiring millions of MCMC samples for such a problem. A new method for accelerating simulations known as surrogate models or emulators using machine learning techniques offers a solution for such problems and a way to understand progenitors/explosions from spectral time series. There exist emulators for the TARDIS supernova radiative transfer code but they only perform well on simplistic low-dimensional models (roughly a dozen parameters) with a small number of applications for knowledge gain in the supernova field. In this work, we present a new emulator for the radiative transfer code TARDIS that not only outperforms existing emulators but also provides uncertainties in its prediction. It offers the foundation for a future active-learning-based machinery that will be able to emulate very high dimensional spaces of hundreds of parameters crucial for unraveling urgent questions in supernovae and related fields.
△ Less
Submitted 20 September, 2022;
originally announced September 2022.
-
New mass estimates for massive binary systems: a probabilistic approach using polarimetric radiative transfer
Authors:
Andrew G. Fullard,
John T. O'Brien,
Wolfgang E. Kerzendorf,
Manisha Shrestha,
Jennifer L. Hoffman,
Richard Ignace,
Patrick van der Smagt
Abstract:
Understanding the evolution of massive binary stars requires accurate estimates of their masses. This understanding is critically important because massive star evolution can potentially lead to gravitational wave sources such as binary black holes or neutron stars. For Wolf-Rayet stars with optically thick stellar winds, their masses can only be determined with accurate inclination angle estimate…
▽ More
Understanding the evolution of massive binary stars requires accurate estimates of their masses. This understanding is critically important because massive star evolution can potentially lead to gravitational wave sources such as binary black holes or neutron stars. For Wolf-Rayet stars with optically thick stellar winds, their masses can only be determined with accurate inclination angle estimates from binary systems which have spectroscopic $M \sin i$ measurements. Orbitally-phased polarization signals can encode the inclination angle of binary systems, where the Wolf-Rayet winds act as scattering regions.
We investigated four Wolf-Rayet + O star binary systems, WR 42, WR 79, WR 127, and WR 153, with publicly available phased polarization data to estimate their masses. To avoid the biases present in analytic models of polarization while retaining computational expediency, we used a Monte Carlo radiative transfer model accurately emulated by a neural network. We used the emulated model to investigate the posterior distribution of parameters of our four systems. Our mass estimates calculated from the estimated inclination angles put strong constraints on existing mass estimates for three of the systems, and disagrees with the existing mass estimates for WR 153. We recommend a concerted effort to obtain polarization observations that can be used to estimate the masses of Wolf-Rayet binary systems and increase our understanding of their evolutionary paths.
△ Less
Submitted 21 February, 2022;
originally announced February 2022.
-
Probabilistic Reconstruction of Type Ia Supernova SN 2002bo
Authors:
John T. O'Brien,
Wolfgang E. Kerzendorf,
Andrew Fullard,
Marc Williamson,
Ruediger Pakmor,
Johannes Buchner,
Stephan Hachinger,
Christian Vogl,
James H. Gillanders,
Andreas Floers,
Patrick van der Smagt
Abstract:
Manual fits to spectral times series of Type Ia supernovae have provided a method of reconstructing the explosion from a parametric model but due to lack of information about model uncertainties or parameter degeneracies direct comparison between theory and observation is difficult. In order to mitigate this important problem we present a new way to probabilistically reconstruct the outer ejecta o…
▽ More
Manual fits to spectral times series of Type Ia supernovae have provided a method of reconstructing the explosion from a parametric model but due to lack of information about model uncertainties or parameter degeneracies direct comparison between theory and observation is difficult. In order to mitigate this important problem we present a new way to probabilistically reconstruct the outer ejecta of the normal Type Ia supernova SN 2002bo. A single epoch spectrum, taken 10 days before maximum light, is fit by a 13-parameter model describing the elemental composition of the ejecta and the explosion physics (density, temperature, velocity, and explosion epoch). Model evaluation is performed through the application of a novel rapid spectral synthesis technique in which the radiative transfer code, TARDIS, is accelerated by a machine-learning framework. Analysis of the posterior distribution reveals a complex and degenerate parameter space and allows direct comparison to various hydrodynamic models. Our analysis favors detonation over deflagration scenarios and we find that our technique offers a novel way to compare simulation to observation.
△ Less
Submitted 12 October, 2021; v1 submitted 17 May, 2021;
originally announced May 2021.
-
Dalek -- a deep-learning emulator for TARDIS
Authors:
Wolfgang E. Kerzendorf,
Christian Vogl,
Johannes Buchner,
Gabriella Contardo,
Marc Williamson,
Patrick van der Smagt
Abstract:
Supernova spectral time series contain a wealth of information about the progenitor and explosion process of these energetic events. The modeling of these data requires the exploration of very high dimensional posterior probabilities with expensive radiative transfer codes. Even modest parametrizations of supernovae contain more than ten parameters and a detailed exploration demands at least sever…
▽ More
Supernova spectral time series contain a wealth of information about the progenitor and explosion process of these energetic events. The modeling of these data requires the exploration of very high dimensional posterior probabilities with expensive radiative transfer codes. Even modest parametrizations of supernovae contain more than ten parameters and a detailed exploration demands at least several million function evaluations. Physically realistic models require at least tens of CPU minutes per evaluation putting a detailed reconstruction of the explosion out of reach of traditional methodology. The advent of widely available libraries for the training of neural networks combined with their ability to approximate almost arbitrary functions with high precision allows for a new approach to this problem. Instead of evaluating the radiative transfer model itself, one can build a neural network proxy trained on the simulations but evaluating orders of magnitude faster. Such a framework is called an emulator or surrogate model. In this work, we present an emulator for the TARDIS supernova radiative transfer code applied to Type Ia supernova spectra. We show that we can train an emulator for this problem given a modest training set of a hundred thousand spectra (easily calculable on modern supercomputers). The results show an accuracy on the percent level (that are dominated by the Monte Carlo nature of TARDIS and not the emulator) with a speedup of several orders of magnitude. This method has a much broader set of applications and is not limited to the presented problem.
△ Less
Submitted 3 July, 2020;
originally announced July 2020.