Normalizing flow regularization for photoacoustic tomography

Abstract: Proper regularization is crucial in inverse problems to achieve high-quality reconstruction, even with an ill-conditioned measurement system. This is particularly true for three-dimensional photoacoustic tomography, which is computationally demanding and requires rapid scanning, often leading to incomplete measurements. Deep neural networks, known for their efficiency in handling big data, are anticipated to be adept at extracting underlying information from images sharing certain characteristics, such as specific types of natural or medical images. We introduce a Normalizing Flow Regularization (NFR) method designed to reconstruct images from incomplete and noisy measurements. The method involves training a normalizing flow network to understand the statistical distribution of sample images by mapping them to Gaussian distributions. This well-trained network then acts as a regularization tool within a Bayesian inversion framework. Additionally, we explore the concept of adaptive regularization selection, providing theoretical proof of its admissibility. A significant challenge in three-dimensional image training is the extensive memory and computation requirements. We address this by training the normalizing flow model using only small-size images and applying a patch-based model for reconstructing larger images. Our approach is model-independent, allowing the reuse of a well-trained network as regularization for various imaging systems. Moreover, as a data-driven prior, NFR effectively leverages the available dataset information, outperforming artificial priors. This advantage is demonstrated through numerical simulations of three-dimensional photoacoustic tomography under various conditions of sparsity, noise levels, and limited-view scenarios.

Submitted 24 September, 2024; originally announced September 2024.

Journal ref: Inverse Problems, 2024

arXiv:1610.09788 [pdf, other]

Pseudo-marginal Metropolis--Hastings using averages of unbiased estimators

Authors: Chris Sherlock, Alexandre Thiery, Anthony Lee

Abstract: We consider a pseudo-marginal Metropolis--Hastings kernel $P_m$ that is constructed using an average of $m$ exchangeable random variables, as well as an analogous kernel $P_s$ that averages $s<m$ of these same random variables. Using an embedding technique to facilitate comparisons, we show that the asymptotic variances of ergodic averages associated with $P_m$ are lower bounded in terms of those associated with $P_s$. We show that the bound provided is tight and disprove a conjecture that when the random variables to be averaged are independent, the asymptotic variance under $P_m$ is never less than $s/m$ times the variance under $P_s$. The conjecture does, however, hold when considering continuous-time Markov chains. These results imply that if the computational cost of the algorithm is proportional to $m$, it is often better to set $m=1$. We provide intuition as to why these findings differ so markedly from recent results for pseudo-marginal kernels employing particle filter approximations. Our results are exemplified through two simulation studies; in the first the computational cost is effectively proportional to $m$ and in the second there is a considerable start-up cost at each iteration.

Submitted 31 October, 2016; originally announced October 2016.
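
The kernel $P_m$ described in the abstract above can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the target is a one-dimensional standard normal, the noise is a mean-one log-normal weight with parameter `sigma`, the proposal is a Gaussian random walk, and the names `noisy_density` and `pseudo_marginal_rwm` are hypothetical.

```python
import math
import random


def log_target(x):
    # Unnormalised log-density of the assumed target (standard normal).
    return -0.5 * x * x


def noisy_density(x, m, sigma, rng):
    # Average of m i.i.d. nonnegative unbiased estimates of exp(log_target(x)).
    # Each weight exp(sigma * Z - sigma^2 / 2) has expectation one (assumed noise model).
    weights = [math.exp(sigma * rng.gauss(0.0, 1.0) - 0.5 * sigma ** 2) for _ in range(m)]
    return math.exp(log_target(x)) * sum(weights) / m


def pseudo_marginal_rwm(n_iter, m, sigma=0.5, scale=2.0, seed=0):
    rng = random.Random(seed)
    x = 0.0
    est_x = noisy_density(x, m, sigma, rng)  # kept and reused until the next acceptance
    chain = []
    for _ in range(n_iter):
        y = x + scale * rng.gauss(0.0, 1.0)  # symmetric random walk proposal
        est_y = noisy_density(y, m, sigma, rng)
        # Accept with probability min(1, est_y / est_x), written without a division.
        if rng.random() * est_x < est_y:
            x, est_x = y, est_y
        chain.append(x)
    return chain
```

Calling `pseudo_marginal_rwm(30000, m=1)` and `pseudo_marginal_rwm(30000, m=4)` targets the same distribution; only the noise in the acceptance ratio changes, while the cost per iteration grows with `m`. That trade-off is the one the abstract analyses.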

arXiv:1606.01016 [pdf, other]

On Coupling Particle Filter Trajectories

Authors: Deborshee Sen, Alexandre Thiery, Ajay Jasra

Abstract: Particle filters are a powerful and flexible tool for performing inference on state-space models. They involve a collection of samples evolving over time through a combination of sampling and re-sampling steps. The re-sampling step is necessary to ensure that weight degeneracy is avoided. In several situations of statistical interest, it is important to be able to compare the estimates produced by two different particle filters; consequently, being able to efficiently couple two particle filter trajectories is often of paramount importance. In this text, we propose several ways to do so. In particular, we leverage ideas from the optimal transportation literature. In general, though, computing the optimal transport map is extremely computationally expensive; to deal with this, we introduce computationally tractable approximations to optimal transport couplings. We demonstrate that our resulting algorithms for coupling two particle filter trajectories often perform orders of magnitude more efficiently than more standard approaches.

Submitted 16 March, 2017; v1 submitted 3 June, 2016; originally announced June 2016.

arXiv:1510.02577 [pdf, other]

Asymptotic Analysis of the Random-Walk Metropolis Algorithm on Ridged Densities

Authors: Alexandros Beskos, Gareth Roberts, Alexandre Thiery, Natesh Pillai

Abstract: In this paper we study the asymptotic behavior of the Random-Walk Metropolis algorithm on probability densities with two different `scales', where most of the probability mass is distributed along certain key directions with the `orthogonal' directions containing relatively less mass. Such classes of probability measures arise in various applied contexts including Bayesian inverse problems where the posterior measure concentrates on a sub-manifold when the noise variance goes to zero. When the target measure concentrates on a linear sub-manifold, we derive analytically a diffusion limit for the Random-Walk Metropolis Markov chain as the scale parameter goes to zero. In contrast to the existing works on scaling limits, our limiting Stochastic Differential Equation does not in general have a constant diffusion coefficient. Our results show that in some cases, the usual practice of adapting the step-size to control the acceptance probability might be sub-optimal as the optimal acceptance probability is zero (in the limit).

Submitted 9 October, 2015; originally announced October 2015.

arXiv:1509.08775 [pdf, other]

Error Bounds for Sequential Monte Carlo Samplers for Multimodal Distributions

Authors: Daniel Paulin, Ajay Jasra, Alexandre Thiery

Abstract: In this paper, we provide bounds on the asymptotic variance for a class of sequential Monte Carlo (SMC) samplers designed for approximating multimodal distributions. Such methods combine standard SMC methods and Markov chain Monte Carlo (MCMC) kernels. Our bounds improve upon previous results, and unlike some earlier work, they also apply in the case when the MCMC kernels can move between the modes. We apply our results to the Potts model from statistical physics. In this case, the problem of sharp peaks is encountered. Earlier methods, such as parallel tempering, are only able to sample from it at an exponential (in an important parameter of the model) cost. We propose a sequence of interpolating distributions called interpolation to independence, and show that the SMC sampler based on it is able to sample from this target distribution at a polynomial cost. We believe that our method is generally applicable to many other distributions as well.

Submitted 24 January, 2018; v1 submitted 29 September, 2015; originally announced September 2015.

Comments: 42 pages, 6 figures. Some minor corrections in this version

MSC Class: 60J22; 65C05; 65Z05

arXiv:1506.08155 [pdf, other]

Efficiency of delayed-acceptance random walk Metropolis algorithms

Authors: Chris Sherlock, Alexandre Thiery, Andrew Golightly

Abstract: Delayed-acceptance Metropolis-Hastings and delayed-acceptance pseudo-marginal Metropolis-Hastings algorithms can be applied when it is computationally expensive to calculate the true posterior or an unbiased stochastic approximation thereof, but a computationally cheap deterministic approximation is available. An initial accept-reject stage uses the cheap approximation for computing the Metropolis-Hastings ratio; proposals which are accepted at this stage are then subjected to a further accept-reject step which corrects for the error in the approximation. Since the expensive posterior, or the approximation thereof, is only evaluated for proposals which are accepted at the first stage, the cost of the algorithm is reduced and larger scalings may be used. We focus on the random walk Metropolis (RWM) and consider the delayed-acceptance RWM and the delayed-acceptance pseudo-marginal RWM. We provide a framework for incorporating relatively general deterministic approximations into the theoretical analysis of high-dimensional targets. Justified by diffusion approximation arguments, we derive expressions for the limiting efficiency and acceptance rates in high-dimensional settings. These theoretical insights are finally leveraged to formulate practical guidelines for the efficient tuning of the algorithms. The robustness of these guidelines and predicted properties are verified against simulation studies, all of which are strictly outside of the domain of validity of our limit results.

Submitted 23 February, 2021; v1 submitted 26 June, 2015; originally announced June 2015.

MSC Class: 65C05; 65C40; 60F05
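
The two-stage accept-reject mechanism described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the "expensive" target is a stand-in standard normal, the cheap approximation is a normal with variance 1.5, and the names `log_cheap` and `delayed_acceptance_rwm` are hypothetical.

```python
import math
import random


def log_target(x):
    # Stand-in for the expensive log-posterior (assumed: standard normal).
    return -0.5 * x * x


def log_cheap(x):
    # Cheap deterministic approximation (assumed: normal with variance 1.5).
    return -0.5 * x * x / 1.5


def delayed_acceptance_rwm(n_iter, scale, x0=0.0, seed=0):
    rng = random.Random(seed)
    x = x0
    lt_x, lc_x = log_target(x), log_cheap(x)
    chain = []
    expensive_evals = 0  # expensive evaluations made inside the loop
    for _ in range(n_iter):
        y = x + scale * rng.gauss(0.0, 1.0)  # symmetric random walk proposal
        lc_y = log_cheap(y)
        # Stage 1: screen the proposal using only the cheap approximation.
        if rng.random() < math.exp(min(0.0, lc_y - lc_x)):
            lt_y = log_target(y)
            expensive_evals += 1
            # Stage 2: correct for the error of the approximation.
            log_a2 = (lt_y - lt_x) - (lc_y - lc_x)
            if rng.random() < math.exp(min(0.0, log_a2)):
                x, lt_x, lc_x = y, lt_y, lc_y
        chain.append(x)
    return chain, expensive_evals
```

Because the second-stage ratio divides out the cheap approximation, the chain still targets the expensive density, while `expensive_evals` stays below the number of iterations whenever stage 1 rejects some proposals.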

arXiv:1409.5281 [pdf, ps, other]

$q$-Varieties and Drinfeld Modules

Authors: Alain Thiéry

Abstract: Let $\mathbb{F}_q$ be the finite field with $q$ elements, $K$ be an algebraically closed field containing $\mathbb{F}_q$, $K\{τ\}$ be the Ore ring of $\mathbb{F}_q$-linear polynomials and $Λ_n$ be a free $K\{τ\}$-module of rank $n$. In a first part, we prove that there is a bijection between the set of Zariski closed subsets of $K^n$ which are also $\mathbb{F}_q$-vector spaces, the so-called $q$-varieties, and the set of radical $K\{τ\}$-submodules of $Λ_n$. We also study the dimension of $q$-varieties and their tangent spaces. Let $F$ be a $q$-variety, $K\{F\} := Mor(F,K)$ be the set of $\mathbb{F}_q$-linear polynomial maps from $F$ to $K$. Let $A=\mathbb{F}_q[T]$ and choose $δ: A \longrightarrow K$ a ring morphism. By definition, an $A$-module structure on $F$ is a ring morphism $Φ: A \longrightarrow End(F)$ such that, for all $a\in A$, $$d(Φ_a) = δ(a) Id_{T(F)}$$ where $T(F)$ is the tangent space of $F$ and $d(Φ_a)$ the differential map. We prove that $K(F) := K(T)\otimes_{K[T]}K\{F\}$ has finite dimension over $K(T)$. This dimension is called the rank of the $A$-module and is denoted by $r(F)$. We then prove that there exists $c \in A\setminus \{0\}$ such that for all $a\in A$, prime to $c$, $$Tor(a,F) := \{x\in F \mid Φ_a(x) = 0\} = (A/aA)^{r(F)}.$$

Submitted 18 September, 2014; originally announced September 2014.

arXiv:1401.6140 [pdf, other]

The density of sets avoiding distance 1 in Euclidean space

Authors: Christine Bachoc, Alberto Passuello, Alain Thiery

Abstract: We improve by an exponential factor the best known asymptotic upper bound for the density of sets avoiding 1 in Euclidean space. This result is obtained by a combination of an analytic bound that is an analogue of Lovasz theta number and of a combinatorial argument involving finite subgraphs of the unit distance graph. In turn, we straightforwardly obtain an asymptotic improvement for the measurable chromatic number of Euclidean space. We also tighten previous results for the dimensions between 4 and 24.

Submitted 29 January, 2015; v1 submitted 23 January, 2014; originally announced January 2014.

Comments: Revised version, to appear in Discrete and Computational Geometry

arXiv:1309.7209 [pdf, ps, other]

doi 10.1214/14-AOS1278

On the efficiency of pseudo-marginal random walk Metropolis algorithms

Authors: Chris Sherlock, Alexandre H. Thiery, Gareth O. Roberts, Jeffrey S. Rosenthal

Abstract: We examine the behaviour of the pseudo-marginal random walk Metropolis algorithm, where evaluations of the target density for the accept/reject probability are estimated rather than computed precisely. Under relatively general conditions on the target distribution, we obtain limiting formulae for the acceptance rate and for the expected squared jump distance, as the dimension of the target approaches infinity, under the assumption that the noise in the estimate of the log-target is additive and is independent of the position. For targets with independent and identically distributed components, we also obtain a limiting diffusion for the first component. We then consider the overall efficiency of the algorithm, in terms of both speed of mixing and computational time. Assuming the additive noise is Gaussian and is inversely proportional to the number of unbiased estimates that are used, we prove that the algorithm is optimally efficient when the variance of the noise is approximately 3.283 and the acceptance rate is approximately 7.001%. We also find that the optimal scaling is insensitive to the noise and that the optimal variance of the noise is insensitive to the scaling. The theory is illustrated with a simulation study using the particle marginal random walk Metropolis.

Submitted 30 December, 2014; v1 submitted 27 September, 2013; originally announced September 2013.

Comments: Published at http://dx.doi.org/10.1214/14-AOS1278 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOS-AOS1278

Journal ref: Annals of Statistics 2015, Vol. 43, No. 1, 238-275

arXiv:1309.6473 [pdf, ps, other]

doi 10.1214/15-AOS1311

On nonnegative unbiased estimators

Authors: Pierre E. Jacob, Alexandre H. Thiery

Abstract: We study the existence of algorithms generating almost surely nonnegative unbiased estimators. We show that given a nonconstant real-valued function $f$ and a sequence of unbiased estimators of $λ\in\mathbb{R}$, there is no algorithm yielding almost surely nonnegative unbiased estimators of $f(λ)\in\mathbb{R}^+$. The study is motivated by pseudo-marginal Monte Carlo algorithms that rely on such nonnegative unbiased estimators. These methods allow "exact inference" in intractable models, in the sense that integrals with respect to a target distribution can be estimated without any systematic error, even though the associated probability density function cannot be evaluated pointwise. We discuss the consequences of our results on the applicability of pseudo-marginal algorithms and thus on the possibility of exact inference in intractable models. We illustrate our study with particular choices of functions $f$ corresponding to known challenges in statistics, such as exact simulation of diffusions, inference in large datasets and doubly intractable distributions.

Submitted 1 April, 2015; v1 submitted 25 September, 2013; originally announced September 2013.

Comments: Published at http://dx.doi.org/10.1214/15-AOS1311 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOS-AOS1311

Journal ref: Annals of Statistics 2015, Vol. 43, No. 2, 769-784

arXiv:1108.1494 [pdf, other]

Gradient Flow from a Random Walk in Hilbert Space

Authors: Natesh S. Pillai, Andrew M. Stuart, Alexandre H. Thiery

Abstract: Consider a probability measure on a Hilbert space defined via its density with respect to a Gaussian. The purpose of this paper is to demonstrate that an appropriately defined Markov chain, which is reversible with respect to the measure in question, exhibits a diffusion limit to a noisy gradient flow, also reversible with respect to the same measure. The Markov chain is defined by applying a Metropolis-Hastings accept-reject mechanism to an Ornstein-Uhlenbeck proposal which is itself reversible with respect to the underlying Gaussian measure. The resulting noisy gradient flow is a stochastic partial differential equation driven by a Wiener process with spatial correlation given by the underlying Gaussian structure.

Submitted 18 April, 2014; v1 submitted 6 August, 2011; originally announced August 2011.

Comments: Major revision of the original version
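
A finite-dimensional sketch of a Metropolis-Hastings chain with an Ornstein-Uhlenbeck-type proposal, as mentioned in the abstract above, is given below. The parametrisation is an assumption and may differ from the paper's: the reference Gaussian is $N(0, I)$ on a `dim`-dimensional space, the target has density proportional to $\exp(-\Phi)$ with respect to it, and the proposal is $y = \sqrt{1-\beta^2}\,x + \beta\,\xi$ with $\xi \sim N(0, I)$. The names `potential` and `ou_proposal_mh` and the choice of $\Phi$ are hypothetical.

```python
import math
import random


def potential(x):
    # Assumed Phi: the target has density proportional to exp(-Phi) w.r.t. N(0, I).
    # With this choice the target is N(0, I / 2).
    return 0.5 * sum(v * v for v in x)


def ou_proposal_mh(n_iter, dim, beta=0.5, seed=0):
    rng = random.Random(seed)
    x = [0.0] * dim
    phi_x = potential(x)
    contraction = math.sqrt(1.0 - beta ** 2)
    chain, accepted = [], 0
    for _ in range(n_iter):
        # The proposal leaves the reference Gaussian N(0, I) invariant and is reversible for it.
        y = [contraction * xi + beta * rng.gauss(0.0, 1.0) for xi in x]
        phi_y = potential(y)
        # The Gaussian parts cancel, so the acceptance ratio involves Phi only.
        if rng.random() < math.exp(min(0.0, phi_x - phi_y)):
            x, phi_x = y, phi_y
            accepted += 1
        chain.append(x[0])  # record the first coordinate
    return chain, accepted / n_iter
```

Since the proposal is reversible with respect to the reference Gaussian, the acceptance probability depends only on the change in `potential`, which is what allows this construction to be posed directly on a Hilbert space.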

arXiv:1103.0542 [pdf, ps, other]

doi 10.1214/11-AAP828

Optimal scaling and diffusion limits for the Langevin algorithm in high dimensions

Authors: Natesh S. Pillai, Andrew M. Stuart, Alexandre H. Thiéry

Abstract: The Metropolis-adjusted Langevin (MALA) algorithm is a sampling algorithm which makes local moves by incorporating information about the gradient of the logarithm of the target density. In this paper we study the efficiency of MALA on a natural class of target measures supported on an infinite dimensional Hilbert space. These natural measures have density with respect to a Gaussian random field measure and arise in many applications such as Bayesian nonparametric statistics and the theory of conditioned diffusions. We prove that, started in stationarity, a suitably interpolated and scaled version of the Markov chain corresponding to MALA converges to an infinite dimensional diffusion process. Our results imply that, in stationarity, the MALA algorithm applied to an N-dimensional approximation of the target will take $\mathcal{O}(N^{1/3})$ steps to explore the invariant measure, comparing favorably with the Random Walk Metropolis which was recently shown to require $\mathcal{O}(N)$ steps when applied to the same class of problems.

Submitted 28 November, 2012; v1 submitted 2 March, 2011; originally announced March 2011.

Comments: Published at http://dx.doi.org/10.1214/11-AAP828 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AAP-AAP828

Journal ref: Annals of Applied Probability 2012, Vol. 22, No. 6, 2320-2356
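
A minimal finite-dimensional sketch of MALA, the algorithm studied in the entry above, is shown here. It assumes a standard normal target in `dim` dimensions rather than the Hilbert-space measures of the paper, and it sets the step size to `ell ** 2 * dim ** (-1 / 3)` only to echo the $\mathcal{O}(N^{1/3})$ scaling quoted in the abstract. The function names and the tuning constant `ell` are hypothetical.

```python
import math
import random


def log_density(x):
    # Unnormalised log-density of the assumed target (standard normal).
    return -0.5 * sum(v * v for v in x)


def grad_log_density(x):
    return [-v for v in x]


def log_proposal(to, frm, h):
    # Log-density (up to a constant) of the Langevin proposal from `frm` to `to`.
    g = grad_log_density(frm)
    return -sum((t - f - 0.5 * h * gi) ** 2 for t, f, gi in zip(to, frm, g)) / (2.0 * h)


def mala(n_iter, dim, ell=1.0, seed=0):
    rng = random.Random(seed)
    h = ell ** 2 * dim ** (-1.0 / 3.0)  # assumed step-size scaling
    x = [0.0] * dim
    chain, accepted = [], 0
    for _ in range(n_iter):
        g = grad_log_density(x)
        # Local move: gradient drift plus Gaussian noise.
        y = [xi + 0.5 * h * gi + math.sqrt(h) * rng.gauss(0.0, 1.0) for xi, gi in zip(x, g)]
        # Metropolis-Hastings correction for the asymmetric proposal.
        log_alpha = (log_density(y) - log_density(x)
                     + log_proposal(x, y, h) - log_proposal(y, x, h))
        if rng.random() < math.exp(min(0.0, log_alpha)):
            x = y
            accepted += 1
        chain.append(x[0])  # record the first coordinate
    return chain, accepted / n_iter
```

The chain here starts at the origin rather than in stationarity, so it illustrates the mechanics of the algorithm and not the setting of the limit theorem.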

arXiv:1103.0444 [pdf, ps, other]

On the theta number of powers of cycle graphs

Authors: Christine Bachoc, Arnaud Pêcher, Alain Thiéry

Abstract: We give a closed formula for Lovasz theta number of the powers of cycle graphs and of their complements, the circular complete graphs. As a consequence, we establish that the circular chromatic number of a circular perfect graph is computable in polynomial time. We also derive an asymptotic estimate for this theta number.

Submitted 2 March, 2011; originally announced March 2011.

Comments: 17 pages

Showing 1–13 of 13 results for author: Thiéry, A