-
Optimal testing using combined test statistics across independent studies
Abstract: Combining test statistics from independent trials or experiments is a popular method of meta-analysis. However, there is very limited theoretical understanding of the power of the combined test, especially in high-dimensional models considering composite hypotheses tests. We derive a mathematical framework to study standard meta-analysis testing approaches in the context of the many normal means…
Submitted 30 October, 2023; originally announced October 2023.
Comments: NeurIPS 2023. 27 pages, 3 figures
MSC Class: 62B10; 62C20; 62F30; 62F03
-
Semi-parametric Bernstein-von Mises in Linear Inverse Problems
Abstract: We consider a Bayesian approach for the recovery of scalar parameters arising in inverse problems. We consider a general signal-in-white-noise model where we have access to two independent noisy observations of a function, and of a linear transformation of the function. The linear operator is unknown up to a scalar parameter. We present a Bernstein-von Mises theorem for the marginal posterior of t…
Submitted 13 March, 2025; v1 submitted 4 October, 2023; originally announced October 2023.
MSC Class: 62F15; 62E20
-
Uncertainty quantification for sparse spectral variational approximations in Gaussian process regression
Abstract: We investigate the frequentist guarantees of the variational sparse Gaussian process regression model. In the theoretical analysis, we focus on the variational approach with spectral features as inducing variables. We derive guarantees and limitations for the frequentist coverage of the resulting variational credible sets. We also derive sufficient and necessary lower bounds for the number of indu…
Submitted 28 September, 2023; v1 submitted 21 December, 2022; originally announced December 2022.
-
arXiv:2202.00968 [pdf, ps, other]
Optimal high-dimensional and nonparametric distributed testing under communication constraints
Abstract: We derive minimax testing errors in a distributed framework where the data is split over multiple machines and their communication to a central machine is limited to $b$ bits. We investigate both the $d$- and infinite-dimensional signal detection problem under Gaussian white noise. We also derive distributed testing algorithms reaching the theoretical lower bounds. Our results show that distribu…
Submitted 11 December, 2022; v1 submitted 2 February, 2022; originally announced February 2022.
Comments: 53 pages
MSC Class: 62G10; 62F30; 62F03
-
Contraction rates for sparse variational approximations in Gaussian process regression
Abstract: We study the theoretical properties of a variational Bayes method in the Gaussian Process regression model. We consider the inducing variables method introduced by Titsias (2009a) and derive sufficient conditions for obtaining contraction rates for the corresponding variational Bayes (VB) posterior. As examples we show that for three particular covariance kernels (Matérn, squared exponential, rand…
Submitted 30 March, 2022; v1 submitted 22 September, 2021; originally announced September 2021.
Comments: 26 pages, 6 figures, 1 table
-
Optimal distributed composite testing in high-dimensional Gaussian models with 1-bit communication
Abstract: In this paper we study the problem of signal detection in Gaussian noise in a distributed setting where the local machines in the star topology can communicate a single bit of information. We derive a lower bound on the Euclidean norm that the signal needs to have in order to be detectable. Moreover, we exhibit optimal distributed testing strategies that attain the lower bound.
Submitted 24 February, 2022; v1 submitted 9 December, 2020; originally announced December 2020.
Comments: 33 pages, 2 figures. To appear in IEEE Transactions on Information Theory
MSC Class: Primary: 62F03; 62F30. Secondary: 94A13; 68W15
-
arXiv:2003.12838 [pdf, ps, other]
Distributed function estimation: adaptation using minimal communication
Abstract: We investigate whether in a distributed setting, adaptive estimation of a smooth function at the optimal rate is possible under minimal communication. It turns out that the answer depends on the risk considered and on the number of servers over which the procedure is distributed. We show that for the $L_\infty$-risk, adaptively obtaining optimal rates under minimal communication is not possible. F…
Submitted 28 March, 2020; originally announced March 2020.
Comments: 40 pages
MSC Class: 62G20; 62G10; 62G05; 94A15
-
arXiv:1804.00864 [pdf, ps, other]
Adaptive distributed methods under communication constraints
Abstract: We study distributed estimation methods under communication constraints in a distributed version of the nonparametric random design regression model. We derive minimax lower bounds and exhibit methods that attain those bounds. Moreover, we show that adaptive estimation is possible in this setting.
Submitted 4 February, 2019; v1 submitted 3 April, 2018; originally announced April 2018.
Comments: 46 pages
MSC Class: 62G05; 62G20
-
An asymptotic analysis of distributed nonparametric methods
Abstract: We investigate and compare the fundamental performance of several distributed learning methods that have been proposed recently. We do this in the context of a distributed version of the classical signal-in-Gaussian-white-noise model, which serves as a benchmark model for studying performance in this setting. The results show how the design and tuning of a distributed method can have great impact…
Submitted 8 November, 2017; originally announced November 2017.
Comments: 29 pages, 4 figures
MSC Class: 62G20; 62G15; 62G05
-
arXiv:1709.06360 [pdf, ps, other]
Minimax lower bounds for function estimation on graphs
Abstract: We study minimax lower bounds for function estimation problems on large graphs when the target function is smoothly varying over the graph. We derive minimax rates in the context of regression and classification problems on graphs that satisfy an asymptotic shape assumption and with a smoothness condition on the target function, both formulated in terms of the graph Laplacian.
Submitted 15 February, 2018; v1 submitted 19 September, 2017; originally announced September 2017.
-
arXiv:1609.01577 [pdf, ps, other]
Full adaptation to smoothness using randomly truncated series priors with Gaussian coefficients and inverse gamma scaling
Abstract: We study random series priors for estimating a functional parameter $f\in L^2[0,1]$. We show that with a series prior with random truncation, Gaussian coefficients, and inverse gamma multiplicative scaling, it is possible to achieve posterior contraction at optimal rates and adaptation to arbitrary degrees of smoothness. We present general results that can be combined with existing rate of contrac…
Submitted 5 December, 2016; v1 submitted 6 September, 2016; originally announced September 2016.
-
Estimating a smooth function on a large graph by Bayesian Laplacian regularisation
Abstract: We study a Bayesian approach to estimating a smooth function in the context of regression or classification problems on large graphs. We derive theoretical results that show how asymptotically optimal Bayesian regularization can be achieved under an asymptotic shape assumption on the underlying graph and a smoothness condition on the target function, both formulated in terms of the graph Laplacian…
Submitted 5 March, 2017; v1 submitted 8 November, 2015; originally announced November 2015.
-
arXiv:1509.01906 [pdf, ps, other]
Rejoinder to discussions of "Frequentist coverage of adaptive nonparametric Bayesian credible sets"
Abstract: Rejoinder of "Frequentist coverage of adaptive nonparametric Bayesian credible sets" by Szabó, van der Vaart and van Zanten [arXiv:1310.4489v5].
Submitted 7 September, 2015; originally announced September 2015.
Comments: Published at http://dx.doi.org/10.1214/15-AOS1270REJ in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS1270REJ
Journal ref: Annals of Statistics 2015, Vol. 43, No. 4, 1463-1470
-
arXiv:1506.00515 [pdf, ps, other]
Gaussian process methods for one-dimensional diffusions: optimal rates and adaptation
Abstract: We study the performance of nonparametric Bayes procedures for one-dimensional diffusions with periodic drift. We improve existing convergence rate results for Gaussian process (GP) priors with fixed hyperparameters. Moreover, we exhibit several possibilities to achieve adaptation to smoothness. We achieve this by considering hierarchical procedures that involve either a prior on a multiplicative…
Submitted 8 February, 2016; v1 submitted 1 June, 2015; originally announced June 2015.
-
arXiv:1409.5103 [pdf, ps, other]
Optimality of Poisson processes intensity learning with Gaussian processes
Abstract: In this paper we provide theoretical support for the so-called "Sigmoidal Gaussian Cox Process" approach to learning the intensity of an inhomogeneous Poisson process on a $d$-dimensional domain. This method was proposed by Adams, Murray and MacKay (ICML, 2009), who developed a tractable computational approach and showed in simulation and real data experiments that it can work quite satisfactorily…
Submitted 2 March, 2015; v1 submitted 17 September, 2014; originally announced September 2014.
-
Honest Bayesian confidence sets for the L2-norm
Abstract: We investigate the problem of constructing Bayesian credible sets that are honest and adaptive for the L2-loss over a scale of Sobolev classes with regularity ranging between [D; 2D], for some given D in the context of the signal-in-white-noise model. We consider a scale of prior distributions indexed by a regularity hyper-parameter and choose the hyper-parameter both by marginal likelihood empiri…
Submitted 23 April, 2014; v1 submitted 29 November, 2013; originally announced November 2013.
Comments: 24 pages, 3 figures
MSC Class: 62G15; 62G05 (Primary); 62G20 (Secondary)
-
Guided proposals for simulating multi-dimensional diffusion bridges
Abstract: A Monte Carlo method for simulating a multi-dimensional diffusion process conditioned on hitting a fixed point at a fixed future time is developed. Proposals for such diffusion bridges are obtained by superimposing an additional guiding term to the drift of the process under consideration. The guiding term is derived via approximation of the target process by a simpler diffusion process with kno…
Submitted 13 October, 2015; v1 submitted 14 November, 2013; originally announced November 2013.
MSC Class: 60J60 (Primary); 65C30 (Secondary); 65C05
Journal ref: Bernoulli Volume 23, Number 4A (November 2017), 2917-2950
-
arXiv:1310.4489 [pdf, ps, other]
Frequentist coverage of adaptive nonparametric Bayesian credible sets
Abstract: We investigate the frequentist coverage of Bayesian credible sets in a nonparametric setting. We consider a scale of priors of varying regularity and choose the regularity by an empirical Bayes method. Next we consider a central set of prescribed posterior probability in the posterior distribution of the chosen regularity. We show that such an adaptive Bayes credible set gives correct uncertainty…
Submitted 4 September, 2015; v1 submitted 16 October, 2013; originally announced October 2013.
Comments: Published at http://dx.doi.org/10.1214/14-AOS1270 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS1270
Journal ref: Annals of Statistics 2015, Vol. 43, No. 4, 1391-1428
-
Rate-optimal Bayesian intensity smoothing for inhomogeneous Poisson processes
Abstract: We apply nonparametric Bayesian methods to study the problem of estimating the intensity function of an inhomogeneous Poisson process. We exhibit a prior on intensities which both leads to a computationally feasible method and enjoys desirable theoretical optimality properties. The prior we use is based on B-spline expansions with free knots, adapted from well-established methods used in regressio…
Submitted 27 November, 2013; v1 submitted 22 April, 2013; originally announced April 2013.
-
arXiv:1302.4561 [pdf, ps, other]
Optimal two-stage procedures for estimating location and size of the maximum of a multivariate regression function
Abstract: We propose a two-stage procedure for estimating the location $\boldsymbol{\mu}$ and size $M$ of the maximum of a smooth $d$-variate regression function $f$. In the first stage, a preliminary estimator of $\boldsymbol{\mu}$ obtained from a standard nonparametric smoothing method is used. At the second stage, we "zoom-in" near the vicinity of the preliminary estimator and make further observations at some design points in…
Submitted 19 February, 2013; originally announced February 2013.
Comments: Published at http://dx.doi.org/10.1214/12-AOS1053 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS1053
Journal ref: Annals of Statistics 2012, Vol. 40, No. 6, 2850-2876
-
arXiv:1301.7567 [pdf, ps, other]
Consistent nonparametric Bayesian inference for discretely observed scalar diffusions
Abstract: We study Bayes procedures for the problem of nonparametric drift estimation for one-dimensional, ergodic diffusion models from discrete-time, low-frequency data. We give conditions for posterior consistency and verify these conditions for concrete priors, including priors based on wavelet expansions.
Submitted 31 January, 2013; originally announced January 2013.
Comments: Published at http://dx.doi.org/10.3150/11-BEJ385 in Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)
Report number: IMS-BEJ-BEJ385
Journal ref: Bernoulli 2013, Vol. 19, No. 1, 44-63
-
arXiv:1211.2121 [pdf, ps, other]
Adaptive nonparametric Bayesian inference using location-scale mixture priors
Abstract: We study location-scale mixture priors for nonparametric statistical problems, including multivariate regression, density estimation and classification. We show that a rate-adaptive procedure can be obtained if the prior is properly constructed. In particular, we show that adaptation is achieved if a kernel mixture prior on a regression function is constructed using a Gaussian kernel, an inverse g…
Submitted 9 November, 2012; originally announced November 2012.
Comments: Published at http://dx.doi.org/10.1214/10-AOS811 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS811
Journal ref: Annals of Statistics 2010, Vol. 38, No. 6, 3300-3320
-
Bayes procedures for adaptive inference in inverse problems for the white noise model
Abstract: We study empirical and hierarchical Bayes approaches to the problem of estimating an infinite-dimensional parameter in mildly ill-posed inverse problems. We consider a class of prior distributions indexed by a hyperparameter that quantifies regularity. We prove that both methods we consider succeed in automatically selecting this parameter optimally, resulting in optimal convergence rates for trut…
Submitted 29 May, 2013; v1 submitted 17 September, 2012; originally announced September 2012.
Comments: 41 pages, 2 figures
MSC Class: 62G05; 62C15 (Primary) 62G20 (Secondary)
-
arXiv:1111.5876 [pdf, ps, other]
Bayesian recovery of the initial condition for the heat equation
Abstract: We study a Bayesian approach to recovering the initial condition for the heat equation from noisy observations of the solution at a later time. We consider a class of prior distributions indexed by a parameter quantifying "smoothness" and show that the corresponding posterior distributions contract around the true parameter at a rate that depends on the smoothness of the true initial condition and…
Submitted 1 March, 2013; v1 submitted 24 November, 2011; originally announced November 2011.
Comments: 17 pages, 4 figures. Published in Comm. Statist. Theory Methods. This version differs from the original in pagination and typographic detail. arXiv admin note: text overlap with arXiv:1103.2692
MSC Class: 62G05; 62G15; 62G20
Journal ref: Communications in Statistics - Theory and Methods 2013 Volume 42, Issue 7, 1294-1313
-
arXiv:1103.2692 [pdf, ps, other]
Bayesian inverse problems with Gaussian priors
Abstract: The posterior distribution in a nonparametric inverse problem is shown to contract to the true parameter at a rate that depends on the smoothness of the parameter, and the smoothness and scale of the prior. Correct combinations of these characteristics lead to the minimax rate. The frequentist coverage of credible sets is shown to depend on the combination of prior and true parameter, with smoothe…
Submitted 23 February, 2012; v1 submitted 14 March, 2011; originally announced March 2011.
Comments: Published at http://dx.doi.org/10.1214/11-AOS920 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS920
Journal ref: Annals of Statistics 2011, Vol. 39, No. 5, 2626-2657
-
Nonparametric methods for volatility density estimation
Abstract: Stochastic volatility modelling of financial processes has become increasingly popular. The proposed models usually contain a stationary volatility process. We will motivate and review several nonparametric methods for estimation of the density of the volatility process. Both models based on discretely sampled continuous time processes and discrete time models will be discussed. The key insigh…
Submitted 27 October, 2009; originally announced October 2009.
MSC Class: 62G07; 62G08; 62M07; 62P20; 91G70
Journal ref: Advanced Mathematical Methods for Finance, Chapter 11, 293-312, Giulia di Nunno, Bernt Øksendal Eds., Springer (2011)
-
arXiv:0908.3556 [pdf, ps, other]
Adaptive Bayesian estimation using a Gaussian random field with inverse Gamma bandwidth
Abstract: We consider nonparametric Bayesian estimation using a rescaled smooth Gaussian field as a prior for a multidimensional function. The rescaling is achieved using a Gamma variable and the procedure can be viewed as choosing an inverse Gamma bandwidth. The procedure is studied from a frequentist perspective in three statistical settings involving replicated observations (density estimatio…
Submitted 25 August, 2009; originally announced August 2009.
Comments: Published at http://dx.doi.org/10.1214/08-AOS678 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS678
MSC Class: 62H30; 62-07 (Primary); 65U05; 68T05 (Secondary)
Journal ref: Annals of Statistics 2009, Vol. 37, No. 5B, 2655-2675
-
arXiv:0806.3024 [pdf, ps, other]
Rates of contraction of posterior distributions based on Gaussian process priors
Abstract: We derive rates of contraction of posterior distributions on nonparametric or semiparametric models based on Gaussian processes. The rate of contraction is shown to depend on the position of the true parameter relative to the reproducing kernel Hilbert space of the Gaussian process and the small ball probabilities of the Gaussian process. We determine these quantities for a range of examples of…
Submitted 18 June, 2008; originally announced June 2008.
Comments: Published at http://dx.doi.org/10.1214/009053607000000613 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS0301
MSC Class: 60G15; 62G05 (Primary)
Journal ref: Annals of Statistics 2008, Vol. 36, No. 3, 1435-1463
-
arXiv:0805.3252 [pdf, ps, other]
Reproducing kernel Hilbert spaces of Gaussian priors
Abstract: We review definitions and properties of reproducing kernel Hilbert spaces attached to Gaussian variables and processes, with a view to applications in nonparametric Bayesian statistics using Gaussian priors. The rate of contraction of posterior distributions based on Gaussian priors can be described through a concentration function that is expressed in the reproducing Hilbert space. Absolute con…
Submitted 21 May, 2008; originally announced May 2008.
Comments: Published at http://dx.doi.org/10.1214/074921708000000156 in the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-COLL3-IMSCOLL315
MSC Class: 60G15; 62G05 (Primary)
Journal ref: IMS Collections 2008, Vol. 3, 200-222
-
arXiv:0803.4238 [pdf, ps, other]
Small Deviations of Smooth Stationary Gaussian Processes
Abstract: We investigate the small deviation probabilities of a class of very smooth stationary Gaussian processes playing an important role in Bayesian statistical inference. Our calculations are based on the appropriate modification of the entropy method due to Kuelbs, Li, and Linde as well as on classical results about the entropy of classes of analytic functions. They also involve Tsirelson's upper bo…
Submitted 29 March, 2008; originally announced March 2008.
MSC Class: 60G10; 60G15; 46E15; 62A15
Journal ref: Theor. Probab. Appl., 2008, 53, 4, 788--798 (in Russian), 697--707 (in English)
-
arXiv:0710.3679 [pdf, ps, other]
Bayesian inference with rescaled Gaussian process priors
Abstract: We use rescaled Gaussian processes as prior models for functional parameters in nonparametric statistical models. We show how the rate of contraction of the posterior distributions depends on the scaling factor. In particular, we exhibit rescaled Gaussian process priors yielding posteriors that contract around the true parameter at optimal convergence rates. To derive our results we establish bo…
Submitted 19 October, 2007; originally announced October 2007.
Comments: Published at http://dx.doi.org/10.1214/07-EJS098 in the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-EJS-EJS_2007_98
MSC Class: 62G05; 62C10 (Primary) 60G15 (Secondary)
Journal ref: Electronic Journal of Statistics 2007, Vol. 1, 433-448
-
arXiv:math/0507412 [pdf, ps, other]
Donsker theorems for diffusions: Necessary and sufficient conditions
Abstract: We consider the empirical process $G_t$ of a one-dimensional diffusion with finite speed measure, indexed by a collection of functions $F$. By the central limit theorem for diffusions, the finite-dimensional distributions of $G_t$ converge weakly to those of a zero-mean Gaussian random process $G$. We prove that the weak convergence $G_t\Rightarrow G$ takes place in $\ell^{\infty}(F)$ if and only if the lim…
Submitted 21 July, 2005; originally announced July 2005.
Comments: Published at http://dx.doi.org/10.1214/009117905000000152 in the Annals of Probability (http://www.imstat.org/aop/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOP-AOP0059
MSC Class: 60J60; 60J55; 60F17; 62M05 (Primary)
Journal ref: Annals of Probability 2005, Vol. 33, No. 4, 1422-1451
-
arXiv:math/0503656 [pdf, ps, other]
Krein's spectral theory and the Paley-Wiener expansion for fractional Brownian motion
Abstract: In this paper we develop the spectral theory of the fractional Brownian motion (fBm) using the ideas of Krein's work on continuous analogues of orthogonal polynomials on the unit circle. We exhibit the functions which are orthogonal with respect to the spectral measure of the fBm and obtain an explicit reproducing kernel in the frequency domain. We use these results to derive an extension of the…
Submitted 29 March, 2005; originally announced March 2005.
Comments: Published at http://dx.doi.org/10.1214/009117904000000955 in the Annals of Probability (http://www.imstat.org/aop/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOP-AOP039
MSC Class: 60G15; 60G51; 62M15 (Primary)
Journal ref: Annals of Probability 2005, Vol. 33, No. 2, 620-644
-
arXiv:math/0206142 [pdf, ps, other]
Nonparametric volatility density estimation for discrete time models
Abstract: We consider discrete time models for asset prices with a stationary volatility process. We aim at estimating the multivariate density of this process at a set of consecutive time instants. A Fourier type deconvolution kernel density estimator based on the logarithm of the squared process is proposed to estimate the volatility density. Expansions of the bias and bounds on the variance are derived…
Submitted 14 June, 2002; originally announced June 2002.
MSC Class: 62G07; 62M07; 62P20
Journal ref: Journal of Nonparametric Statistics 17 (2), 237-249 (2005)
-
arXiv:math/0107135 [pdf, ps, other]
Nonparametric Volatility Density Estimation
Abstract: We consider two kinds of stochastic volatility models. Both kinds of models contain a stationary volatility process, the density of which, at a fixed instant in time, we aim to estimate. We discuss discrete time models where for instance a log price process is modeled as the product of a volatility process and i.i.d. noise. We also consider samples of certain continuous time diffusion processe…
Submitted 16 June, 2002; v1 submitted 19 July, 2001; originally announced July 2001.
MSC Class: 62G07; 62M07; 62P20
Journal ref: Bernoulli 9 (3), 451-465 (2003)