Astrophysics > Cosmology and Nongalactic Astrophysics
[Submitted on 6 Apr 2022 (v1), last revised 12 Apr 2022 (this version, v2)]
Title:Bayesian Control Variates for optimal covariance estimation with pairs of simulations and surrogates
View PDFAbstract:Predictions of the mean and covariance matrix of summary statistics are critical for confronting cosmological theories with observations, not least for likelihood approximations and parameter inference. The price to pay for accurate estimates is the extreme cost of running $N$-body and hydrodynamics simulations. Approximate solvers, or surrogates, greatly reduce the computational cost but can introduce significant biases, for example in the non-linear regime of cosmic structure growth. We propose "CARPool Bayes", an approach to solve the inference problem for both the means and covariances using a combination of simulations and surrogates. Our framework allows incorporating prior information for the mean and covariance. We derive closed-form solutions for Maximum A Posteriori covariance estimates that are efficient Bayesian shrinkage estimators, guarantee positive semi-definiteness, and can optionally leverage analytical covariance approximations. We discuss choices of the prior and propose a simple procedure for obtaining optimal prior hyperparameter values with a small set of test simulations. We test our method by estimating the covariances of clustering statistics of GADGET-III $N$-body simulations at redshift $z=0.5$ using surrogates from a 100-1000$\times$ faster particle-mesh code. Taking the sample covariance from 15,000 simulations as the truth, and using an empirical Bayes prior with diagonal blocks, our estimator produces nearly identical Fisher matrix contours for $\Lambda$CDM parameters using only $15$ simulations of the non-linear dark matter power spectrum. In this case the number of simulations is so small that the sample covariance would be degenerate. We show cases where even with a naïve prior our method still improves the estimate. Our framework is applicable to a wide range of cosmological and astrophysical problems where fast surrogates are available.
Submission history
From: Nicolas Chartier [view email][v1] Wed, 6 Apr 2022 20:13:58 UTC (4,169 KB)
[v2] Tue, 12 Apr 2022 15:51:38 UTC (4,169 KB)
Current browse context:
astro-ph.CO
Change to browse by:
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender
(What is IArxiv?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.