-
Deep Gaussian Process Priors for Bayesian Inference in Nonlinear Inverse Problems
Authors:
Kweku Abraham,
Neil Deo
Abstract:
We study the use of a deep Gaussian process (DGP) prior in a general nonlinear inverse problem satisfying certain regularity conditions. We prove that when the data arises from a true parameter $θ^*$ with a compositional structure, the posterior induced by the DGP prior concentrates around $θ^*$ as the number of observations increases. The DGP prior accounts for the unknown compositional structure through the use of a hierarchical structure prior. As examples, we show that our results apply to Darcy's problem of recovering the scalar diffusivity from a steady-state heat equation and the problem of determining the attenuation potential in a steady-state Schrödinger equation. We further provide a lower bound, proving in Darcy's problem that typical Gaussian priors based on Whittle-Matérn processes (which ignore compositional structure) contract at a polynomially slower rate than the DGP prior for certain diffusivities arising from a generalised additive model.
Submitted 21 December, 2023;
originally announced December 2023.
-
Frontiers to the learning of nonparametric hidden Markov models
Authors:
Kweku Abraham,
Elisabeth Gassiat,
Zacharie Naulet
Abstract:
Hidden Markov models (HMMs) are flexible tools for clustering dependent data coming from unknown populations, allowing nonparametric modelling of the population densities. Identifiability fails when the data is in fact independent, and we study the frontier between learnable and unlearnable two-state nonparametric HMMs. Interesting new phenomena emerge when the cluster distributions are modelled via density functions (the 'emission' densities) belonging to standard smoothness classes compared to the multinomial setting. Notably, in contrast to the multinomial setting previously considered, the identification of a direction separating the two emission densities becomes a critical, and challenging, issue. Surprisingly, it is possible to "borrow strength" from estimators of the smoother density to improve estimation of the other. We conduct precise analysis of minimax rates, showing a transition depending on the relative smoothnesses of the emission densities.
Submitted 28 June, 2023;
originally announced June 2023.
-
Cylindrical Grid Graphs $P_m \Box C_n$ are Non-Distance Magic
Authors:
Sajidha P,
V. Vilfred Kamalappan,
Julia K. Abraham
Abstract:
A bijective mapping $f: V(G) \rightarrow \{1,2,\ldots,n\}$ is called a \emph{Distance Magic Labeling (DML)} of $G$ if $\sum_{v \in N(u)} f(v)$ is constant for all $u \in V(G)$, where $G$ is a simple graph of order $n$ and $N(u) = \{v \in V(G) : uv \in E(G)\}$. A graph $G$ is called a \emph{Distance Magic Graph (DMG)} if it has a DML; otherwise it is called a \emph{Non-Distance Magic (NDM) graph}. In 1996, Vilfred conjectured that cylindrical grid graphs $P_m \Box C_n$ are NDM for all $m \geq 2$, $n \geq 3$ with $m,n\in\mathbb{N}$. Recently, the authors proved the conjecture for even $m$ by introducing neighbourhood chains of Type-1 (NC-T1) and Type-2 (NC-T2). In this paper, they introduce neighbourhood chains of Type-3 (NC-T3), use them to settle the conjecture completely, and also identify families of NDM graphs.
Submitted 18 March, 2023;
originally announced March 2023.
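The definition of a distance magic labeling in the abstract above can be checked by brute force on very small graphs. The following is a minimal sketch, not the authors' method (the paper uses neighbourhood chains); the function names `cylinder_grid`, `is_distance_magic` and `has_dml` are hypothetical and chosen only for illustration.

```python
from itertools import permutations


def cylinder_grid(m, n):
    """Adjacency sets of P_m □ C_n; vertex (i, j) is path position i, cycle position j."""
    adj = {(i, j): set() for i in range(m) for j in range(n)}
    for i in range(m):
        for j in range(n):
            # Cycle edges within row i.
            adj[(i, j)].add((i, (j + 1) % n))
            adj[(i, j)].add((i, (j - 1) % n))
            # Path edge to the next row, if any.
            if i + 1 < m:
                adj[(i, j)].add((i + 1, j))
                adj[(i + 1, j)].add((i, j))
    return adj


def is_distance_magic(adj, label):
    """True if every vertex has the same sum of labels over its neighbourhood."""
    sums = {sum(label[v] for v in adj[u]) for u in adj}
    return len(sums) == 1


def has_dml(adj):
    """Exhaustive search over all bijections V(G) -> {1, ..., n}; only feasible for tiny graphs."""
    vertices = list(adj)
    for perm in permutations(range(1, len(vertices) + 1)):
        if is_distance_magic(adj, dict(zip(vertices, perm))):
            return True
    return False


if __name__ == "__main__":
    # The 4-cycle admits a DML: opposite vertices carry labels summing to 5.
    c4 = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {0, 2}}
    print(is_distance_magic(c4, {0: 1, 1: 2, 2: 4, 3: 3}))
    # Smallest cylindrical grid graph covered by the conjecture.
    print(has_dml(cylinder_grid(2, 3)))
```

For $P_2 \Box C_3$ the search finds no labeling, which agrees with a counting argument: the graph is 3-regular on 6 vertices, so a magic constant $k$ would have to satisfy $6k = 3 \cdot 21$, and $k = 10.5$ is not an integer. The search space grows factorially, so this sketch is only a sanity check for the smallest cases.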
-
A study on edge coloring and edge sum coloring of integral sum graphs
Authors:
V. Vilfred Kamalappan,
Lowell W. Beineke,
L. Mary Florida,
Julia K. Abraham
Abstract:
Frank Harary introduced the concept of an integral sum graph. A graph $G$ is an \emph{integral sum graph} if its vertices can be labeled with distinct integers so that $e = uv$ is an edge of $G$ if and only if the sum of the labels on vertices $u$ and $v$ is also a label in $G$. For any non-empty set of integers $S$, let $G^+(S)$ denote the integral sum graph on the set $S$. In $G^+(S)$, we define an \emph{edge-sum class} as the set of all edges with the same edge sum number, and call $G^+(S)$ an \emph{edge sum color graph} if each edge-sum class is considered as an edge color class of $G^+(S)$. The number of distinct edge-sum classes of $G^+(S)$ is called its \emph{edge sum chromatic number}. The main results of this paper are: (i) the set of all edge-sum classes of an integral sum graph partitions its edge set; (ii) the edge chromatic number and the edge sum chromatic number are equal for the integral sum graphs $G_{0,s}$ and $S_n$, the star graph of order $n$, whereas they are not equal in the case of $G_{r,s} = G^+([r,s])$, $r < 0 < s$, $n,s \geq 2$, $n,r,s\in\mathbb{N}$. We also obtain an interesting integral sum labeling of star graphs.
Submitted 28 February, 2022;
originally announced March 2022.
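The edge-sum classes defined in the abstract above can be computed directly from a label set. The following is a minimal sketch under two assumptions: each vertex of $G^+(S)$ is identified with its label in $S$, and edges join distinct labels only. The function name `edge_sum_classes` is hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def edge_sum_classes(S):
    """Group the edges of G^+(S) by edge sum.

    Vertices are the distinct integers in S; {u, v} with u != v is an edge
    exactly when u + v is also in S. Returns {edge sum: list of edges}.
    """
    labels = set(S)
    classes = defaultdict(list)
    for u, v in combinations(sorted(labels), 2):
        if u + v in labels:
            classes[u + v].append((u, v))
    return dict(classes)


if __name__ == "__main__":
    # G_{0,4} = G^+([0, 4])
    classes = edge_sum_classes(range(0, 5))
    for edge_sum, edges in sorted(classes.items()):
        print(edge_sum, edges)
    print("edge sum chromatic number:", len(classes))
```

Every edge has exactly one sum, so the classes partition the edge set, as in result (i). Two edges with the same sum that share an endpoint must coincide, so each class is a matching and the classes form a proper edge coloring. For $G_{0,4}$ there are 4 classes, equal to the degree of vertex 0, which is consistent with the equality stated for $G_{0,s}$.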
-
Sharp multiple testing boundary for sparse sequences
Authors:
Kweku Abraham,
Ismael Castillo,
Etienne Roquain
Abstract:
This work investigates multiple testing by considering minimax separation rates in the sparse sequence model, when the testing risk is measured as the sum FDR+FNR (False Discovery Rate plus False Negative Rate). First using the popular beta-min separation condition, with all nonzero signals separated from $0$ by at least some amount, we determine the sharp minimax testing risk asymptotically and thereby explicitly describe the transition from "achievable multiple testing with vanishing risk" to "impossible multiple testing". Adaptive multiple testing procedures achieving the corresponding optimal boundary are provided: the Benjamini-Hochberg procedure with a properly tuned level, and an empirical Bayes $\ell$-value ('local FDR') procedure. We prove that the FDR and FNR make non-symmetric contributions to the testing risk for most optimal procedures, the FNR part being dominant at the boundary. The multiple testing hardness is then investigated for classes of arbitrary sparse signals. A number of extensions, including results for classification losses and convergence rates in the case of large signals, are also investigated.
Submitted 30 August, 2023; v1 submitted 28 September, 2021;
originally announced September 2021.
-
Fundamental limits for learning hidden Markov model parameters
Authors:
Kweku Abraham,
Zacharie Naulet,
Elisabeth Gassiat
Abstract:
We study the frontier between learnable and unlearnable hidden Markov models (HMMs). HMMs are flexible tools for clustering dependent data coming from unknown populations. The model parameters are known to be fully identifiable (up to label-switching) without any modeling assumption on the distributions of the populations as soon as the clusters are distinct and the hidden chain is ergodic with a full rank transition matrix. In the limit as any one of these conditions fails, it becomes impossible in general to identify parameters. For a chain with two hidden states we prove nonasymptotic minimax upper and lower bounds, matching up to constants, which exhibit thresholds at which the parameters become learnable. We also provide an upper bound on the relative entropy rate for parameters in a neighbourhood of the unlearnable region which may have interest in itself.
Submitted 24 October, 2022; v1 submitted 24 June, 2021;
originally announced June 2021.
-
Empirical Bayes cumulative $\ell$-value multiple testing procedure for sparse sequences
Authors:
Kweku Abraham,
Ismael Castillo,
Etienne Roquain
Abstract:
In the sparse sequence model, we consider a popular Bayesian multiple testing procedure and investigate for the first time its behaviour from the frequentist point of view. Given a spike-and-slab prior on the high-dimensional sparse unknown parameter, one can easily compute posterior probabilities of coming from the spike, which correspond to the well known local-fdr values, also called $\ell$-values. The spike-and-slab weight parameter is calibrated in an empirical Bayes fashion, using marginal maximum likelihood. The multiple testing procedure under study, called here the cumulative $\ell$-value procedure, ranks coordinates according to their empirical $\ell$-values and thresholds so that the cumulative ranked sum does not exceed a user-specified level $t$.
We validate the use of this method from the multiple testing perspective: for alternatives of appropriately large signal strength, the false discovery rate (FDR) of the procedure is shown to converge to the target level $t$, while its false negative rate (FNR) goes to $0$. We complement this study by providing convergence rates for the method. Additionally, we prove that the $q$-value multiple testing procedure shares similar convergence rates in this model.
Submitted 28 March, 2022; v1 submitted 1 February, 2021;
originally announced February 2021.
-
Multiple Testing in Nonparametric Hidden Markov Models: An Empirical Bayes Approach
Authors:
Kweku Abraham,
Ismael Castillo,
Elisabeth Gassiat
Abstract:
Given a nonparametric Hidden Markov Model (HMM) with two states, the question of constructing efficient multiple testing procedures is considered, treating one of the states as an unknown null hypothesis. A procedure is introduced, based on nonparametric empirical Bayes ideas, that controls the False Discovery Rate (FDR) at a user-specified level. Guarantees on power are also provided, in the form of a control of the true positive rate. One of the key steps in the construction requires supremum-norm convergence of preliminary estimators of the emission densities of the HMM. We establish the existence of such estimators, with convergence at the optimal minimax rate, for the case of an HMM with $J\ge 2$ states, which is of independent interest.
Submitted 11 January, 2021;
originally announced January 2021.
-
On statistical Calderón problems
Authors:
Kweku Abraham,
Richard Nickl
Abstract:
For $D$ a bounded domain in $\mathbb R^d, d \ge 2,$ with smooth boundary $\partial D$, the non-linear inverse problem of recovering the unknown conductivity $γ$ determining solutions $u=u_{γ, f}$ of the partial differential equation \begin{equation*} \begin{split} \nabla \cdot(γ\nabla u)&=0 \quad \text{ in }D, \\ u&=f \quad \text { on } \partial D, \end{split} \end{equation*} from noisy observations $Y$ of the Dirichlet-to-Neumann map \[f \mapsto Λ_γ(f) = {γ\frac{\partial u_{γ,f}}{\partial ν}}\Big|_{\partial D},\] with $\partial/\partial ν$ denoting the outward normal derivative, is considered. The data $Y$ consists of $Λ_γ$ corrupted by additive Gaussian noise at noise level $\varepsilon>0$, and a statistical algorithm $\hat γ(Y)$ is constructed which is shown to recover $γ$ in supremum-norm loss at a statistical convergence rate of the order $\log(1/\varepsilon)^{-δ}$ as $\varepsilon \to 0$. It is further shown that this convergence rate is optimal, up to the precise value of the exponent $δ>0$, in an information theoretic sense. The estimator $\hat γ(Y)$ has a Bayesian interpretation in terms of the posterior mean of a suitable Gaussian process prior and can be computed by MCMC methods.
Submitted 20 April, 2020; v1 submitted 8 June, 2019;
originally announced June 2019.
-
Nonparametric Bayesian posterior contraction rates for scalar diffusions with high-frequency data
Authors:
Kweku Abraham
Abstract:
We consider inference in the scalar diffusion model $dX_t=b(X_t)dt+σ(X_t)dW_t$ with discrete data $(X_{jΔ_n})_{0\leq j \leq n}$, $n\to \infty,~Δ_n\to 0$ and periodic coefficients. For $σ$ given, we prove a general theorem detailing conditions under which Bayesian posteriors will contract in $L^2$-distance around the true drift function $b_0$ at the frequentist minimax rate (up to logarithmic factors) over Besov smoothness classes. We exhibit natural nonparametric priors which satisfy our conditions. Our results show that the Bayesian method adapts both to an unknown sampling regime and to unknown smoothness.
Submitted 23 August, 2018; v1 submitted 15 February, 2018;
originally announced February 2018.