-
Accelerating LISA inference with Gaussian processes
Authors:
Jonas El Gammal,
Riccardo Buscicchio,
Germano Nardini,
Jesús Torrado
Abstract:
Source inference for deterministic gravitational waves is a computationally demanding task in LISA. In a novel approach, we investigate the capability of Gaussian Processes to learn the posterior surface in order to reconstruct individual signal posteriors. We use GPry, which automates this reconstruction through active learning, using a very small number of likelihood evaluations, without the nee…
▽ More
Source inference for deterministic gravitational waves is a computationally demanding task in LISA. In a novel approach, we investigate the capability of Gaussian Processes to learn the posterior surface in order to reconstruct individual signal posteriors. We use GPry, which automates this reconstruction through active learning, using a very small number of likelihood evaluations, without the need for pretraining. We benchmark GPry against the cutting-edge nested sampler nessai, by injecting individually three signals on LISA noisy data simulated with Balrog: a white dwarf binary (DWD), a stellar-mass black hole binary (stBHB), and a super-massive black hole binary (SMBHB). We find that GPry needs $\mathcal O(10^{-2})$ fewer likelihood evaluations to achieve an inference accuracy comparable to nessai, with Jensen-Shannon divergence $D_{\scriptscriptstyle \mathrm{JS}} \lesssim 0.01$ for the DWD, and $D_{\scriptscriptstyle \mathrm{JS}} \lesssim 0.05$ for the SMBHB. Lower accuracy is found for the less Gaussian posterior of the stBHB: $D_{\scriptscriptstyle \mathrm{JS}} \lesssim 0.2$. Despite the overhead costs of GPry, we obtain a speed-up of $\mathcal O(10^2)$ for the slowest cases of stBHB and SMBHB. In conclusion, active-learning Gaussian process frameworks show great potential for rapid LISA parameter inference, especially for costly likelihoods, enabling suppression of computational costs without the trade-off of approximations in the calculations.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Reconstructing Primordial Curvature Perturbations via Scalar-Induced Gravitational Waves with LISA
Authors:
Jonas El Gammal,
Aya Ghaleb,
Gabriele Franciolini,
Theodoros Papanikolaou,
Marco Peloso,
Gabriele Perna,
Mauro Pieroni,
Angelo Ricciardone,
Robert Rosati,
Gianmassimo Tasinato,
Matteo Braglia,
Jacopo Fumagalli,
Jun'ya Kume,
Enrico Morgante,
Germano Nardini,
Davide Racco,
Sébastien Renaux-Petel,
Hardi Veermäe,
Denis Werth,
Ivonne Zavala
Abstract:
Many early universe scenarios predict an enhancement of scalar perturbations at scales currently unconstrained by cosmological probes. These perturbations source gravitational waves (GWs) at second order in perturbation theory, leading to a scalar-induced gravitational wave (SIGW) background. The LISA detector, sensitive to mHz GWs, will be able to constrain curvature perturbations in a new window…
▽ More
Many early universe scenarios predict an enhancement of scalar perturbations at scales currently unconstrained by cosmological probes. These perturbations source gravitational waves (GWs) at second order in perturbation theory, leading to a scalar-induced gravitational wave (SIGW) background. The LISA detector, sensitive to mHz GWs, will be able to constrain curvature perturbations in a new window corresponding to scales $k \in [10^{10}, 10^{14}] \,{\rm Mpc}^{-1}$, difficult to probe otherwise. In this work, we forecast the capabilities of LISA to constrain the source of SIGWs using different approaches: i) agnostic, where the spectrum of curvature perturbations is binned in frequency space; ii) template-based, modeling the curvature power spectrum based on motivated classes of models; iii) ab initio, starting from first-principles model of inflation featuring an ultra-slow roll phase. We compare the strengths and weaknesses of each approach. We also discuss the impact on the SIGW spectrum of non-standard thermal histories affecting the kernels of SIGW emission and non-Gaussianity in the statistics of the curvature perturbations. Finally, we propose simple tests to assess whether the signal is compatible with the SIGW hypothesis. The pipeline used is built into the SIGWAY code.
△ Less
Submitted 21 May, 2025; v1 submitted 20 January, 2025;
originally announced January 2025.
-
Circular reasoning: Solving the Hubble tension with a non-$π$ value of $π$
Authors:
Jonas El Gammal,
Sven Günther,
Emil Brinch Holm,
Andreas Nygaard
Abstract:
Recently, cosmology has seen a surge in alternative models that purport to solve the discrepancy between the values of the Hubble constant $H_0$ as measured by cosmological microwave background anisotropies and local supernovae, respectively. In particular, many of the most successful approaches have involved varying fundamental constants, such as an alternative value of the fine structure constan…
▽ More
Recently, cosmology has seen a surge in alternative models that purport to solve the discrepancy between the values of the Hubble constant $H_0$ as measured by cosmological microwave background anisotropies and local supernovae, respectively. In particular, many of the most successful approaches have involved varying fundamental constants, such as an alternative value of the fine structure constant and time-varying values of the electron mass, the latter of which showed particular promise as the strongest candidate in several earlier studies. Inspired by these approaches, in this paper, we investigate a cosmological model where the value of the geometric constant $π$ is taken to be a free model parameter. Using the latest CMB data from Planck as well as baryon-acoustic oscillation data, we constrain the parameters of the model and find a strong correlation between $π$ and $H_0$, with the final constraint $H_0 = 71.3 \pm 1.1 \ \mathrm{ km/s/Mpc}$, equivalent to a mere $1.5σ$ discrepancy with the value measured by the SH0ES collaboration. Furthermore, our results show that $π= 3.206 \pm 0.038$ at $95 \%$ C.L., which is in good agreement with several external measurements discussed in the paper. Hence, we conclude that the $πΛ$CDM model presented in this paper, which has only a single extra parameter, currently stands as the perhaps strongest solution to the Hubble tension.
△ Less
Submitted 29 March, 2024;
originally announced March 2024.
-
Parallelized Acquisition for Active Learning using Monte Carlo Sampling
Authors:
Jesús Torrado,
Nils Schöneberg,
Jonas El Gammal
Abstract:
Bayesian inference remains one of the most important tool-kits for any scientist, but increasingly expensive likelihood functions are required for ever-more complex experiments, raising the cost of generating a Monte Carlo sample of the posterior. Recent attention has been directed towards the use of emulators of the posterior based on Gaussian Process (GP) regression combined with active sampling…
▽ More
Bayesian inference remains one of the most important tool-kits for any scientist, but increasingly expensive likelihood functions are required for ever-more complex experiments, raising the cost of generating a Monte Carlo sample of the posterior. Recent attention has been directed towards the use of emulators of the posterior based on Gaussian Process (GP) regression combined with active sampling to achieve comparable precision with far fewer costly likelihood evaluations. Key to this approach is the batched acquisition of proposals, so that the true posterior can be evaluated in parallel. This is usually achieved via sequential maximization of the highly multimodal acquisition function. Unfortunately, this approach parallelizes poorly and is prone to getting stuck in local maxima. Our approach addresses this issue by generating nearly-optimal batches of candidates using an almost-embarrassingly parallel Nested Sampler on the mean prediction of the GP. The resulting nearly-sorted Monte Carlo sample is used to generate a batch of candidates ranked according to their sequentially conditioned acquisition function values at little cost. The final sample can also be used for inferring marginal quantities. Our proposed implementation (NORA) demonstrates comparable accuracy to sequential conditioned acquisition optimization and efficient parallelization in various synthetic and cosmological inference problems.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Fast and robust Bayesian Inference using Gaussian Processes with GPry
Authors:
Jonas El Gammal,
Nils Schöneberg,
Jesús Torrado,
Christian Fidler
Abstract:
We present the GPry algorithm for fast Bayesian inference of general (non-Gaussian) posteriors with a moderate number of parameters. GPry does not need any pre-training, special hardware such as GPUs, and is intended as a drop-in replacement for traditional Monte Carlo methods for Bayesian inference. Our algorithm is based on generating a Gaussian Process surrogate model of the log-posterior, aide…
▽ More
We present the GPry algorithm for fast Bayesian inference of general (non-Gaussian) posteriors with a moderate number of parameters. GPry does not need any pre-training, special hardware such as GPUs, and is intended as a drop-in replacement for traditional Monte Carlo methods for Bayesian inference. Our algorithm is based on generating a Gaussian Process surrogate model of the log-posterior, aided by a Support Vector Machine classifier that excludes extreme or non-finite values. An active learning scheme allows us to reduce the number of required posterior evaluations by two orders of magnitude compared to traditional Monte Carlo inference. Our algorithm allows for parallel evaluations of the posterior at optimal locations, further reducing wall-clock times. We significantly improve performance using properties of the posterior in our active learning scheme and for the definition of the GP prior. In particular we account for the expected dynamical range of the posterior in different dimensionalities. We test our model against a number of synthetic and cosmological examples. GPry outperforms traditional Monte Carlo methods when the evaluation time of the likelihood (or the calculation of theoretical observables) is of the order of seconds; for evaluation times of over a minute it can perform inference in days that would take months using traditional methods. GPry is distributed as an open source Python package (pip install gpry) and can also be found at https://github.com/jonaselgammal/GPry.
△ Less
Submitted 17 December, 2022; v1 submitted 3 November, 2022;
originally announced November 2022.