Sampling Bayesian probabilities given only sampled priors
Authors:
Gary Bernstein,
William Assignies Doumerg,
Michael A. Troxel,
Alex Alarcon,
Alexandra Amon,
Giulia Giannini,
Boyan Yin,
Sahar Allam,
Felipe Andrade-Oliveira,
David Brooks,
Aurelio Carnero Rosell,
Jorge Carretero,
Luiz da Costa,
Maria Elidaiana da Silva Pereira,
Juan De Vicente,
Spencer Everett,
Josh Frieman,
Juan Garcia-Bellido,
Daniel Gruen,
Samuel Hinton,
Devon L. Hollowood,
Klaus Honscheid,
David James,
Sujeong Lee,
Jennifer Marshall
, et al. (13 additional authors not shown)
Abstract:
A typical Bayesian inference on the values of some parameters of interest $q$ from some data $D$ involves running a Markov Chain (MC) to sample from the posterior $p(q,n | D) \propto \mathcal{L}(D | q,n) p(q) p(n),$ where $n$ are some nuisance parameters. In many cases, the nuisance parameters are high-dimensional, and their prior $p(n)$ is itself defined only by a set of samples that have been dr…
▽ More
A typical Bayesian inference on the values of some parameters of interest $q$ from some data $D$ involves running a Markov Chain (MC) to sample from the posterior $p(q,n | D) \propto \mathcal{L}(D | q,n) p(q) p(n),$ where $n$ are some nuisance parameters. In many cases, the nuisance parameters are high-dimensional, and their prior $p(n)$ is itself defined only by a set of samples that have been drawn from some other MC. Two problems arise: first, the MC for the posterior will typically require evaluation of $p(n)$ at arbitrary values of $n,$ \ie\ one needs to provide a density estimator over the full $n$ space from the provided samples. Second, the high dimensionality of $n$ hinders both the density estimation and the efficiency of the MC for the posterior. We describe a solution to this problem: a linear compression of the $n$ space into a much lower-dimensional space $u$ which projects away directions in $n$ space that cannot appreciably alter $\mathcal{L}.$ The algorithm for doing so is a slight modification to principal components analysis, and is less restrictive on $p(n)$ than other proposed solutions to this issue. We demonstrate this ``mode projection'' technique using the analysis of 2-point correlation functions of weak lensing fields and galaxy density in the \textit{Dark Energy Survey}, where $n$ is a binned representation of the redshift distribution $n(z)$ of the galaxies.
△ Less
Submitted 31 May, 2025;
originally announced June 2025.