-
Bayesian spatio-temporal model for high-resolution short-term forecasting of precipitation fields
Authors:
Stephen Richard Johnson,
Sarah Elizabeth Heaps,
Kevin James Wilson,
Darren James Wilkinson
Abstract:
With extreme weather events becoming more common, the risk posed by surface water flooding is ever increasing. In this work we propose a model, and associated Bayesian inference scheme, for generating probabilistic (high-resolution short-term) forecasts of localised precipitation. The parametrisation of our underlying hierarchical dynamic spatio-temporal model is motivated by a forward-time, centr…
▽ More
With extreme weather events becoming more common, the risk posed by surface water flooding is ever increasing. In this work we propose a model, and associated Bayesian inference scheme, for generating probabilistic (high-resolution short-term) forecasts of localised precipitation. The parametrisation of our underlying hierarchical dynamic spatio-temporal model is motivated by a forward-time, centred-space finite difference solution to a collection of stochastic partial differential equations, where the main driving forces are advection and diffusion. Observations from both weather radar and ground based rain gauges provide information from which we can learn about the likely values of the (latent) precipitation field in addition to other unknown model parameters. Working in the Bayesian paradigm provides a coherent framework for capturing uncertainty both in the underlying model parameters and also in our forecasts. Further, appealing to simulation based (MCMC) sampling yields a straightforward solution to handling zeros, treated as censored observations, via data augmentation. Both the underlying state and the observations are of moderately large dimension ($\mathcal{O}(10^4)$ and $\mathcal{O}(10^3)$ respectively) and this renders standard inference approaches computationally infeasible. Our solution is to embed the ensemble Kalman smoother within a Gibbs sampling scheme to facilitate approximate Bayesian inference in reasonable time. Both the methodology and the effectiveness of our posterior sampling scheme are demonstrated via simulation studies and also by a case study of real data from the Urban Observatory project based in Newcastle upon Tyne, UK.
△ Less
Submitted 7 May, 2021;
originally announced May 2021.
-
On Bayesian inference for the Extended Plackett-Luce model
Authors:
Stephen R. Johnson,
Daniel A. Henderson,
Richard J. Boys
Abstract:
The analysis of rank ordered data has a long history in the statistical literature across a diverse range of applications. In this paper we consider the Extended Plackett-Luce model that induces a flexible (discrete) distribution over permutations. The parameter space of this distribution is a combination of potentially high-dimensional discrete and continuous components and this presents challeng…
▽ More
The analysis of rank ordered data has a long history in the statistical literature across a diverse range of applications. In this paper we consider the Extended Plackett-Luce model that induces a flexible (discrete) distribution over permutations. The parameter space of this distribution is a combination of potentially high-dimensional discrete and continuous components and this presents challenges for parameter interpretability and also posterior computation. Particular emphasis is placed on the interpretation of the parameters in terms of observable quantities and we propose a general framework for preserving the mode of the prior predictive distribution. Posterior sampling is achieved using an effective simulation based approach that does not require imposing restrictions on the parameter space. Working in the Bayesian framework permits a natural representation of the posterior predictive distribution and we draw on this distribution to address the rank aggregation problem and also to identify potential lack of model fit. The flexibility of the Extended Plackett-Luce model along with the effectiveness of the proposed sampling scheme are demonstrated using several simulation studies and real data examples.
△ Less
Submitted 14 February, 2020;
originally announced February 2020.
-
Revealing subgroup structure in ranked data using a Bayesian WAND
Authors:
Stephen R. Johnson,
Daniel A. Henderson,
Richard J. Boys
Abstract:
Ranked data arise in many areas of application ranging from the ranking of up-regulated genes for cancer to the ranking of academic statistics journals. Complications can arise when rankers do not report a full ranking of all entities; for example, they might only report their top--$M$ ranked entities after seeing some or all entities. It can also be useful to know whether rankers are equally info…
▽ More
Ranked data arise in many areas of application ranging from the ranking of up-regulated genes for cancer to the ranking of academic statistics journals. Complications can arise when rankers do not report a full ranking of all entities; for example, they might only report their top--$M$ ranked entities after seeing some or all entities. It can also be useful to know whether rankers are equally informative, and whether some entities are effectively judged to be exchangeable. When there is important subgroup structure in the data, summaries such as aggregate (overall) rankings can be misleading. In this paper we propose a flexible Bayesian nonparametric model for identifying heterogeneous structure and ranker reliability in ranked data. The model is a Weighted Adapted Nested Dirichlet (WAND) process mixture of Plackett-Luce models and inference proceeds through a simple and efficient Gibbs sampling scheme for posterior sampling. The richness of information in the posterior distribution allows us to infer many details of the structure both between ranker groups and between entity groups (within ranker groups), in contrast to many other (Bayesian) analyses. We also examine how posterior predictive checks can be used to identify lack of model fit. The methodology is illustrated using several simulation studies and real data examples.
△ Less
Submitted 25 October, 2018; v1 submitted 29 June, 2018;
originally announced June 2018.