Search | arXiv e-print repository

doi 10.1051/0004-6361/202347118

Scalable stellar evolution forecasting: Deep learning emulation vs. hierarchical nearest neighbor interpolation

Authors: K. Maltsev, F. R. N. Schneider, F. K. Roepke, A. I. Jordan, G. A. Qadir, W. E. Kerzendorf, K. Riedmiller, P. van der Smagt

Abstract: Many astrophysical applications require efficient yet reliable forecasts of stellar evolution tracks. One example is population synthesis, which generates forward predictions of models for comparison with observations. The majority of state-of-the-art rapid population synthesis methods are based on analytic fitting formulae to stellar evolution tracks that are computationally cheap to sample stati… ▽ More Many astrophysical applications require efficient yet reliable forecasts of stellar evolution tracks. One example is population synthesis, which generates forward predictions of models for comparison with observations. The majority of state-of-the-art rapid population synthesis methods are based on analytic fitting formulae to stellar evolution tracks that are computationally cheap to sample statistically over a continuous parameter range. The computational costs of running detailed stellar evolution codes, such as MESA, over wide and densely sampled parameter grids are prohibitive, while stellar-age based interpolation in-between sparsely sampled grid points leads to intolerably large systematic prediction errors. In this work, we provide two solutions for automated interpolation methods that offer satisfactory trade-off points between cost-efficiency and accuracy. We construct a timescale-adapted evolutionary coordinate and use it in a two-step interpolation scheme that traces the evolution of stars from ZAMS all the way to the end of core helium burning while covering a mass range from ${0.65}$ to $300 \, \mathrm{M_\odot}$. The feedforward neural network regression model (first solution) that we train to predict stellar surface variables can make millions of predictions, sufficiently accurate over the entire parameter space, within tens of seconds on a 4-core CPU. The hierarchical nearest-neighbor interpolation algorithm (second solution) that we hard-code to the same end achieves even higher predictive accuracy, the same algorithm remains applicable to all stellar variables evolved over time, but it is two orders of magnitude slower. Our methodological framework is demonstrated to work on the MIST (Choi et al. 2016) data set. Finally, we discuss the prospective applications of these methods and provide guidelines for generalizing them to higher dimensional parameter spaces. △ Less

Submitted 27 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

Comments: Accepted at A&A

Journal ref: A&A 681, A86 (2024)

arXiv:2306.08137 [pdf, other]

1991T-Like Type Ia Supernovae as an Extension of the Normal Population

Authors: John T. O'Brien, Wolfgang E. Kerzendorf, Andrew Fullard, Reudiger Pakmor, Johannes Buchner, Christian Vogl, Nutan Chen, Patrick van der Smagt, Marc Williamson, Jaladh Singhal

Abstract: Type Ia supernovae remain poorly understood despite decades of investigation. Massive computationally intensive hydrodynamic simulations have been developed and run to model an ever-growing number of proposed progenitor channels. Further complicating the matter, a large number of sub-types of Type Ia supernovae have been identified in recent decades. Due to the massive computational load required,… ▽ More Type Ia supernovae remain poorly understood despite decades of investigation. Massive computationally intensive hydrodynamic simulations have been developed and run to model an ever-growing number of proposed progenitor channels. Further complicating the matter, a large number of sub-types of Type Ia supernovae have been identified in recent decades. Due to the massive computational load required, inference of the internal structure of Type Ia supernovae ejecta directly from observations using simulations has previously been computationally intractable. However, deep-learning emulators for radiation transport simulations have alleviated such barriers. We perform abundance tomography on 40 Type Ia supernovae from optical spectra using the radiative transfer code TARDIS accelerated by the probabilistic DALEK deep-learning emulator. We apply a parametric model of potential ejecta structures to comparatively investigate abundance distributions and internal ionization fractions of intermediate-mass elements between normal and 1991T-like Type Ia supernovae. Our inference shows that 1991T-like Type Ia supernovae are under-abundant in the typical intermediate mass elements that heavily contribute to the spectral line formation seen in normal Type Ia supernovae at early times. Additionally, we find that the intermediate-mass elements present in 1991T-like Type Ia supernovae are highly ionized compared to those in the normal Type Ia population. Finally, we conclude that the transition between normal and 1991T-like Type Ia supernovae appears to be continuous observationally and that the observed differences come out of a combination of both abundance and ionization fractions in these supernovae populations. △ Less

Submitted 13 June, 2023; originally announced June 2023.

Comments: 15 Pages, 6 Figures, Article, Submitted to ApJ

arXiv:2209.09453 [pdf, other]

Probabilistic Dalek -- Emulator framework with probabilistic prediction for supernova tomography

Authors: Wolfgang Kerzendorf, Nutan Chen, Jack O'Brien, Johannes Buchner, Patrick van der Smagt

Abstract: Supernova spectral time series can be used to reconstruct a spatially resolved explosion model known as supernova tomography. In addition to an observed spectral time series, a supernova tomography requires a radiative transfer model to perform the inverse problem with uncertainty quantification for a reconstruction. The smallest parametrizations of supernova tomography models are roughly a dozen… ▽ More Supernova spectral time series can be used to reconstruct a spatially resolved explosion model known as supernova tomography. In addition to an observed spectral time series, a supernova tomography requires a radiative transfer model to perform the inverse problem with uncertainty quantification for a reconstruction. The smallest parametrizations of supernova tomography models are roughly a dozen parameters with a realistic one requiring more than 100. Realistic radiative transfer models require tens of CPU minutes for a single evaluation making the problem computationally intractable with traditional means requiring millions of MCMC samples for such a problem. A new method for accelerating simulations known as surrogate models or emulators using machine learning techniques offers a solution for such problems and a way to understand progenitors/explosions from spectral time series. There exist emulators for the TARDIS supernova radiative transfer code but they only perform well on simplistic low-dimensional models (roughly a dozen parameters) with a small number of applications for knowledge gain in the supernova field. In this work, we present a new emulator for the radiative transfer code TARDIS that not only outperforms existing emulators but also provides uncertainties in its prediction. It offers the foundation for a future active-learning-based machinery that will be able to emulate very high dimensional spaces of hundreds of parameters crucial for unraveling urgent questions in supernovae and related fields. △ Less

Submitted 20 September, 2022; originally announced September 2022.

Comments: 7 pages, accepted at ICML 2022 Workshop on Machine Learning for Astrophysics

arXiv:2202.10262 [pdf, other]

doi 10.3847/1538-4357/ac589e

New mass estimates for massive binary systems: a probabilistic approach using polarimetric radiative transfer

Authors: Andrew G. Fullard, John T. O'Brien, Wolfgang E. Kerzendorf, Manisha Shrestha, Jennifer L. Hoffman, Richard Ignace, Patrick van der Smagt

Abstract: Understanding the evolution of massive binary stars requires accurate estimates of their masses. This understanding is critically important because massive star evolution can potentially lead to gravitational wave sources such as binary black holes or neutron stars. For Wolf-Rayet stars with optically thick stellar winds, their masses can only be determined with accurate inclination angle estimate… ▽ More Understanding the evolution of massive binary stars requires accurate estimates of their masses. This understanding is critically important because massive star evolution can potentially lead to gravitational wave sources such as binary black holes or neutron stars. For Wolf-Rayet stars with optically thick stellar winds, their masses can only be determined with accurate inclination angle estimates from binary systems which have spectroscopic $M \sin i$ measurements. Orbitally-phased polarization signals can encode the inclination angle of binary systems, where the Wolf-Rayet winds act as scattering regions. We investigated four Wolf-Rayet + O star binary systems, WR 42, WR 79, WR 127, and WR 153, with publicly available phased polarization data to estimate their masses. To avoid the biases present in analytic models of polarization while retaining computational expediency, we used a Monte Carlo radiative transfer model accurately emulated by a neural network. We used the emulated model to investigate the posterior distribution of parameters of our four systems. Our mass estimates calculated from the estimated inclination angles put strong constraints on existing mass estimates for three of the systems, and disagrees with the existing mass estimates for WR 153. We recommend a concerted effort to obtain polarization observations that can be used to estimate the masses of Wolf-Rayet binary systems and increase our understanding of their evolutionary paths. △ Less

Submitted 21 February, 2022; originally announced February 2022.

Comments: 24 pages, 13 figures, accepted to ApJ

arXiv:2105.07910 [pdf, other]

doi 10.3847/2041-8213/ac1173

Probabilistic Reconstruction of Type Ia Supernova SN 2002bo

Authors: John T. O'Brien, Wolfgang E. Kerzendorf, Andrew Fullard, Marc Williamson, Ruediger Pakmor, Johannes Buchner, Stephan Hachinger, Christian Vogl, James H. Gillanders, Andreas Floers, Patrick van der Smagt

Abstract: Manual fits to spectral times series of Type Ia supernovae have provided a method of reconstructing the explosion from a parametric model but due to lack of information about model uncertainties or parameter degeneracies direct comparison between theory and observation is difficult. In order to mitigate this important problem we present a new way to probabilistically reconstruct the outer ejecta o… ▽ More Manual fits to spectral times series of Type Ia supernovae have provided a method of reconstructing the explosion from a parametric model but due to lack of information about model uncertainties or parameter degeneracies direct comparison between theory and observation is difficult. In order to mitigate this important problem we present a new way to probabilistically reconstruct the outer ejecta of the normal Type Ia supernova SN 2002bo. A single epoch spectrum, taken 10 days before maximum light, is fit by a 13-parameter model describing the elemental composition of the ejecta and the explosion physics (density, temperature, velocity, and explosion epoch). Model evaluation is performed through the application of a novel rapid spectral synthesis technique in which the radiative transfer code, TARDIS, is accelerated by a machine-learning framework. Analysis of the posterior distribution reveals a complex and degenerate parameter space and allows direct comparison to various hydrodynamic models. Our analysis favors detonation over deflagration scenarios and we find that our technique offers a novel way to compare simulation to observation. △ Less

Submitted 12 October, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

Comments: 12 Pages, 4 Figures, submitted to AAS Journals, comments and constructive criticism welcome

arXiv:2007.01868 [pdf, other]

doi 10.3847/2041-8213/abeb1b

Dalek -- a deep-learning emulator for TARDIS

Authors: Wolfgang E. Kerzendorf, Christian Vogl, Johannes Buchner, Gabriella Contardo, Marc Williamson, Patrick van der Smagt

Abstract: Supernova spectral time series contain a wealth of information about the progenitor and explosion process of these energetic events. The modeling of these data requires the exploration of very high dimensional posterior probabilities with expensive radiative transfer codes. Even modest parametrizations of supernovae contain more than ten parameters and a detailed exploration demands at least sever… ▽ More Supernova spectral time series contain a wealth of information about the progenitor and explosion process of these energetic events. The modeling of these data requires the exploration of very high dimensional posterior probabilities with expensive radiative transfer codes. Even modest parametrizations of supernovae contain more than ten parameters and a detailed exploration demands at least several million function evaluations. Physically realistic models require at least tens of CPU minutes per evaluation putting a detailed reconstruction of the explosion out of reach of traditional methodology. The advent of widely available libraries for the training of neural networks combined with their ability to approximate almost arbitrary functions with high precision allows for a new approach to this problem. Instead of evaluating the radiative transfer model itself, one can build a neural network proxy trained on the simulations but evaluating orders of magnitude faster. Such a framework is called an emulator or surrogate model. In this work, we present an emulator for the TARDIS supernova radiative transfer code applied to Type Ia supernova spectra. We show that we can train an emulator for this problem given a modest training set of a hundred thousand spectra (easily calculable on modern supercomputers). The results show an accuracy on the percent level (that are dominated by the Monte Carlo nature of TARDIS and not the emulator) with a speedup of several orders of magnitude. This method has a much broader set of applications and is not limited to the presented problem. △ Less

Submitted 3 July, 2020; originally announced July 2020.

Comments: 6 pages;5 figures submitted to AAS Journals. Constructive Criticism invited

Showing 1–6 of 6 results for author: van der Smagt, P