-
Eliciting prior information from clinical trials via calibrated Bayes factor
Authors:
Roberto Macrì Demartino,
Leonardo Egidi,
Nicola Torelli,
Ioannis Ntzoufras
Abstract:
In the Bayesian framework power prior distributions are increasingly adopted in clinical trials and similar studies to incorporate external and past information, typically to inform the parameter associated to a treatment effect. Their use is particularly effective in scenarios with small sample sizes and where robust prior information is actually available. A crucial component of this methodology…
▽ More
In the Bayesian framework power prior distributions are increasingly adopted in clinical trials and similar studies to incorporate external and past information, typically to inform the parameter associated to a treatment effect. Their use is particularly effective in scenarios with small sample sizes and where robust prior information is actually available. A crucial component of this methodology is represented by its weight parameter, which controls the volume of historical information incorporated into the current analysis. This parameter can be considered as either fixed or random. Although various strategies exist for its determination, eliciting the prior distribution of the weight parameter according to a full Bayesian approach remains a challenge. In general, this parameter should be carefully selected to accurately reflect the available prior information without dominating the posterior inferential conclusions. To this aim, we propose a novel method for eliciting the prior distribution of the weight parameter through a simulation-based calibrated Bayes factor procedure. This approach allows for the prior distribution to be updated based on the strength of evidence provided by the data: The goal is to facilitate the integration of historical data when it aligns with current information and to limit it when discrepancies arise in terms, for instance, of prior-data conflicts. The performance of the proposed method is tested through simulation studies and applied to real data from clinical trials.
△ Less
Submitted 24 January, 2025; v1 submitted 27 June, 2024;
originally announced June 2024.
-
Lasso Multinomial Performance Indicators for in-play Basketball Data
Authors:
Argyro Damoulaki,
Ioannis Ntzoufras,
Konstantinos Pelechrinis
Abstract:
A typical approach to quantify the contribution of each player in basketball uses the plus-minus method. The ratings obtained by such a method are estimated using simple regression models and their regularized variants, with response variable being either the points scored or the point differences. To capture more precisely the effect of each player, detailed possession-based play-by-play data may…
▽ More
A typical approach to quantify the contribution of each player in basketball uses the plus-minus method. The ratings obtained by such a method are estimated using simple regression models and their regularized variants, with response variable being either the points scored or the point differences. To capture more precisely the effect of each player, detailed possession-based play-by-play data may be used. This is the direction we take in this article, in which we investigate the performance of regularized adjusted plus-minus (RAPM) indicators estimated by different regularized models having as a response the number of points scored in each possession. Therefore, we use possession play-by-play data from all NBA games for the season 2021-22 (322,852 possessions). We initially present simple regression model-based indices starting from the implementation of ridge regression which is the standard technique in the relevant literature. We proceed with the lasso approach which has specific advantages and better performance than ridge regression when compared with selected objective validation criteria. Then, we implement regularized binary and multinomial logistic regression models to obtain more accurate performance indicators since the response is a discrete variable taking values mainly from zero to three. Our final proposal is an improved RAPM measure which is based on the expected points of a multinomial logistic regression model where each player's contribution is weighted by his participation in the team's possessions. The proposed indicator, called weighted expected points (wEPTS), outperforms all other RAPM measures we investigate in this study.
△ Less
Submitted 31 October, 2024; v1 submitted 14 June, 2024;
originally announced June 2024.
-
Bayesian analysis of diffusion-driven multi-type epidemic models with application to COVID-19
Authors:
Lampros Bouranis,
Nikolaos Demiris,
Konstantinos Kalogeropoulos,
Ioannis Ntzoufras
Abstract:
We consider a flexible Bayesian evidence synthesis approach to model the age-specific transmission dynamics of COVID-19 based on daily mortality counts. The temporal evolution of transmission rates in populations containing multiple types of individual is reconstructed via an appropriate dimension-reduction formulation driven by independent diffusion processes. A suitably tailored compartmental mo…
▽ More
We consider a flexible Bayesian evidence synthesis approach to model the age-specific transmission dynamics of COVID-19 based on daily mortality counts. The temporal evolution of transmission rates in populations containing multiple types of individual is reconstructed via an appropriate dimension-reduction formulation driven by independent diffusion processes. A suitably tailored compartmental model is used to learn the latent counts of infection, accounting for fluctuations in transmission influenced by public health interventions and changes in human behaviour. The model is fitted to freely available COVID-19 data sources from the UK, Greece and Austria and validated using a large-scale seroprevalence survey in England. In particular, we demonstrate how model expansion can facilitate evidence reconciliation at a latent level. The code implementing this work is made freely available via the Bernadette R package.
△ Less
Submitted 7 June, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Inconsistency identification in network meta-analysis via stochastic search variable selection
Authors:
Georgios Seitidis,
Stavros Nikolakopoulos,
Ioannis Ntzoufras,
Dimitris Mavridis
Abstract:
The reliability of the results of network meta-analysis (NMA) lies in the plausibility of key assumption of transitivity. This assumption implies that the effect modifiers' distribution is similar across treatment comparisons. Transitivity is statistically manifested through the consistency assumption which suggests that direct and indirect evidence are in agreement. Several methods have been sugg…
▽ More
The reliability of the results of network meta-analysis (NMA) lies in the plausibility of key assumption of transitivity. This assumption implies that the effect modifiers' distribution is similar across treatment comparisons. Transitivity is statistically manifested through the consistency assumption which suggests that direct and indirect evidence are in agreement. Several methods have been suggested to evaluate consistency. A popular approach suggests adding inconsistency factors to the NMA model. We follow a different direction by describing each inconsistency factor with a candidate covariate whose choice relies on variable selection techniques. Our proposed method, Stochastic Search Inconsistency Factor Selection (SSIFS), evaluates the consistency assumption both locally and globally, by applying the stochastic search variable selection method to determine whether the inconsistency factors should be included in the model. The posterior inclusion probability of each inconsistency factor quantifies how likely is a specific comparison to be inconsistent. We use posterior model odds or the median probability model to decide on the importance of inconsistency factors. Differences between direct and indirect evidence can be incorporated into the inconsistency detection process. A key point of our proposed approach is the construction of a reasonable "informative" prior concerning network consistency. The prior is based on the elicitation of information derived historical data from 201 published network meta-analyses. The performance of our proposed method is evaluated in two published network meta-analyses. The proposed methodology is publicly available in an R package called ssifs, developed and maintained by the authors of this work.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.
-
Assessing competitive balance in the English Premier League for over forty seasons using a stochastic block model
Authors:
Francesca Basini,
Vasiliki Tsouli,
Ioannis Ntzoufras,
Nial Friel
Abstract:
Competitive balance is the subject of much interest in the sports analytics literature and beyond. In this paper, we develop a statistical network model based on an extension of the stochastic block model to assess the balance between teams in a league. Here we represent the outcome of all matches in a football season as a dense network with nodes identified by teams and categorical edges represen…
▽ More
Competitive balance is the subject of much interest in the sports analytics literature and beyond. In this paper, we develop a statistical network model based on an extension of the stochastic block model to assess the balance between teams in a league. Here we represent the outcome of all matches in a football season as a dense network with nodes identified by teams and categorical edges representing the outcome of each game as a win, draw or a loss. The main focus and motivation for this paper is to provide a statistical framework to assess the issue of competitive balance in the context of the English First Division / Premier League over more than 40 seasons. The Premier League is arguably one of the most popular leagues in the world, in terms of its global reach and the revenue which it generates. Therefore it is of wide interest to assess its competitiveness. Our analysis provides evidence suggesting a structural change around the early 2000's from a reasonably balanced league to a two-tier league.
△ Less
Submitted 10 January, 2023; v1 submitted 19 July, 2021;
originally announced July 2021.
-
A Metropolized adaptive subspace algorithm for high-dimensional Bayesian variable selection
Authors:
Christian Staerk,
Maria Kateri,
Ioannis Ntzoufras
Abstract:
A simple and efficient adaptive Markov Chain Monte Carlo (MCMC) method, called the Metropolized Adaptive Subspace (MAdaSub) algorithm, is proposed for sampling from high-dimensional posterior model distributions in Bayesian variable selection. The MAdaSub algorithm is based on an independent Metropolis-Hastings sampler, where the individual proposal probabilities of the explanatory variables are u…
▽ More
A simple and efficient adaptive Markov Chain Monte Carlo (MCMC) method, called the Metropolized Adaptive Subspace (MAdaSub) algorithm, is proposed for sampling from high-dimensional posterior model distributions in Bayesian variable selection. The MAdaSub algorithm is based on an independent Metropolis-Hastings sampler, where the individual proposal probabilities of the explanatory variables are updated after each iteration using a form of Bayesian adaptive learning, in a way that they finally converge to the respective covariates' posterior inclusion probabilities. We prove the ergodicity of the algorithm and present a parallel version of MAdaSub with an adaptation scheme for the proposal probabilities based on the combination of information from multiple chains. The effectiveness of the algorithm is demonstrated via various simulated and real data examples, including a high-dimensional problem with more than 20,000 covariates.
△ Less
Submitted 7 July, 2022; v1 submitted 3 May, 2021;
originally announced May 2021.
-
Inference and model determination for Temperature-Driven non-linear Ecological Models
Authors:
Marios Kondakis,
Nikolaos Demiris,
Ioannis Ntzoufras,
Nikos E. Papanikolaou
Abstract:
This paper is concerned with a contemporary Bayesian approach to the effect of temperature on developmental rates. We develop statistical methods using recent computational tools to model four commonly used ecological non-linear mathematical curves that describe arthropods' developmental rates. Such models address the effect of temperature fluctuations on the developmental rate of arthropods. In a…
▽ More
This paper is concerned with a contemporary Bayesian approach to the effect of temperature on developmental rates. We develop statistical methods using recent computational tools to model four commonly used ecological non-linear mathematical curves that describe arthropods' developmental rates. Such models address the effect of temperature fluctuations on the developmental rate of arthropods. In addition to the widely used Gaussian distributional assumption, we also explore Inverse Gamma--based alternatives, which naturally accommodate adaptive variance fluctuation with temperature. Moreover, to overcome the associated parameter indeterminacy in the case of no development, we suggest the Zero Inflated Inverse Gamma model. The ecological models are compared graphically via posterior predictive plots and quantitatively via Marginal likelihood estimates and Information criteria values. Inference is performed using the Stan software and we investigate the statistical and computational efficiency of its Hamiltonian Monte Carlo and Variational Inference methods. We explore model uncertainty and use Bayesian Model Averaging framework for robust estimation of the key ecological parameters
△ Less
Submitted 30 April, 2021;
originally announced April 2021.
-
Meta Analysis of Bayes Factors
Authors:
Stavros Nikolakopoulos,
Ioannis Ntzoufras
Abstract:
Bayes Factors, the Bayesian tool for hypothesis testing, are receiving increasing attention in the literature. Compared to their frequentist rivals ($p$-values or test statistics), Bayes Factors have the conceptual advantage of providing evidence both for and against a null hypothesis and they can be calibrated so that they do not depend so heavily on the sample size. However, research on the synt…
▽ More
Bayes Factors, the Bayesian tool for hypothesis testing, are receiving increasing attention in the literature. Compared to their frequentist rivals ($p$-values or test statistics), Bayes Factors have the conceptual advantage of providing evidence both for and against a null hypothesis and they can be calibrated so that they do not depend so heavily on the sample size. However, research on the synthesis of Bayes Factors arising from individual studies has received very limited attention. In this work we review and propose methods for combining Bayes Factors from multiple studies, depending on the level of information available. In the process, we provide insights with respect to the interplay between frequentist and Bayesian evidence. We also clarify why some intuitive suggestions in the literature can be misleading. We assess the performance of the methods discussed via a simulation study and apply the methods in an example from the field of psychology.
△ Less
Submitted 24 March, 2021;
originally announced March 2021.
-
On the identifiability of Bayesian factor analytic models
Authors:
Panagiotis Papastamoulis,
Ioannis Ntzoufras
Abstract:
A well known identifiability issue in factor analytic models is the invariance with respect to orthogonal transformations. This problem burdens the inference under a Bayesian setup, where Markov chain Monte Carlo (MCMC) methods are used to generate samples from the posterior distribution. We introduce a post-processing scheme in order to deal with rotation, sign and permutation invariance of the M…
▽ More
A well known identifiability issue in factor analytic models is the invariance with respect to orthogonal transformations. This problem burdens the inference under a Bayesian setup, where Markov chain Monte Carlo (MCMC) methods are used to generate samples from the posterior distribution. We introduce a post-processing scheme in order to deal with rotation, sign and permutation invariance of the MCMC sample. The exact version of the contributed algorithm requires to solve $2^q$ assignment problems per (retained) MCMC iteration, where $q$ denotes the number of factors of the fitted model. For large numbers of factors two approximate schemes based on simulated annealing are also discussed. We demonstrate that the proposed method leads to interpretable posterior distributions using synthetic and publicly available data from typical factor analytic models as well as mixtures of factor analyzers. An R package is available online at CRAN web-page.
△ Less
Submitted 24 January, 2022; v1 submitted 10 April, 2020;
originally announced April 2020.
-
Power-Expected-Posterior Priors as Mixtures of g-Priors
Authors:
Dimitris Fouskakis,
Ioannis Ntzoufras
Abstract:
One of the main approaches used to construct prior distributions for objective Bayes methods is the concept of random imaginary observations. Under this setup, the expected-posterior prior (EPP) offers several advantages, among which it has a nice and simple interpretation and provides an effective way to establish compatibility of priors among models. In this paper, we study the power-expected po…
▽ More
One of the main approaches used to construct prior distributions for objective Bayes methods is the concept of random imaginary observations. Under this setup, the expected-posterior prior (EPP) offers several advantages, among which it has a nice and simple interpretation and provides an effective way to establish compatibility of priors among models. In this paper, we study the power-expected posterior prior as a generalization to the EPP in objective Bayesian model selection under normal linear models. We prove that it can be represented as a mixture of $g$-prior, like a wide range of prior distributions under normal linear models, and thus posterior distributions and Bayes factors are derived in closed form, keeping therefore computational tractability. Comparisons with other mixtures of $g$-prior are made and emphasis is given in the posterior distribution of g and its effect on Bayesian model selection and model averaging.
△ Less
Submitted 8 October, 2020; v1 submitted 13 February, 2020;
originally announced February 2020.
-
Bayesian models for prediction of the set-difference in volleyball
Authors:
Ioannis Ntzoufras,
Vasilis Palaskas,
Sotiris Drikos
Abstract:
The aim of this paper is to study and develop Bayesian models for the analysis of volleyball match outcomes as recorded by the set-difference. Due to the peculiarity of the outcome variable (set-difference) which takes discrete values from $-3$ to $3$, we cannot consider standard models based on the usual Poisson or binomial assumptions used for other sports such as football/soccer. Hence, the fir…
▽ More
The aim of this paper is to study and develop Bayesian models for the analysis of volleyball match outcomes as recorded by the set-difference. Due to the peculiarity of the outcome variable (set-difference) which takes discrete values from $-3$ to $3$, we cannot consider standard models based on the usual Poisson or binomial assumptions used for other sports such as football/soccer. Hence, the first and foremost challenge was to build models appropriate for the set-differences of each volleyball match. Here we consider two major approaches:
a) an ordered multinomial logistic regression model and
b) a model based on a truncated version of the Skellam distribution.
For the first model, we consider the set-difference as an ordinal response variable within the framework of multinomial logistic regression models. Concerning the second model, we adjust the Skellam distribution in order to account for the volleyball rules. We fit and compare both models with the same covariate structure as in Karlis & Ntzoufras (2003). Both models are fitted, illustrated and compared within Bayesian framework using data from both the regular season and the play-offs of the season 2016/17 of the Greek national men's volleyball league A1.
△ Less
Submitted 22 June, 2021; v1 submitted 11 November, 2019;
originally announced November 2019.
-
A Bayesian Quest for Finding a Unified Model for Predicting Volleyball Games
Authors:
Leonardo Egidi,
Ioannis Ntzoufras
Abstract:
Volleyball is a team sport with unique and specific characteristics. We introduce a new two level-hierarchical Bayesian model which accounts for theses volleyball specific characteristics. In the first level, we model the set outcome with a simple logistic regression model. Conditionally on the winner of the set, in the second level, we use a truncated negative binomial distribution for the points…
▽ More
Volleyball is a team sport with unique and specific characteristics. We introduce a new two level-hierarchical Bayesian model which accounts for theses volleyball specific characteristics. In the first level, we model the set outcome with a simple logistic regression model. Conditionally on the winner of the set, in the second level, we use a truncated negative binomial distribution for the points earned by the loosing team. An additional Poisson distributed inflation component is introduced to model the extra points played in the case that the two teams have point difference less than two points. The number of points of the winner within each set is deterministically specified by the winner of the set and the points of the inflation component. The team specific abilities and the home effect are used as covariates on all layers of the model (set, point, and extra inflated points). The implementation of the proposed model on the Italian Superlega 2017/2018 data shows an exceptional reproducibility of the final league table and a satisfactory predictive ability.
△ Less
Submitted 15 April, 2020; v1 submitted 5 November, 2019;
originally announced November 2019.
-
High-dimensional variable selection via low-dimensional adaptive learning
Authors:
Christian Staerk,
Maria Kateri,
Ioannis Ntzoufras
Abstract:
A stochastic search method, the so-called Adaptive Subspace (AdaSub) method, is proposed for variable selection in high-dimensional linear regression models. The method aims at finding the best model with respect to a certain model selection criterion and is based on the idea of adaptively solving low-dimensional sub-problems in order to provide a solution to the original high-dimensional problem.…
▽ More
A stochastic search method, the so-called Adaptive Subspace (AdaSub) method, is proposed for variable selection in high-dimensional linear regression models. The method aims at finding the best model with respect to a certain model selection criterion and is based on the idea of adaptively solving low-dimensional sub-problems in order to provide a solution to the original high-dimensional problem. Any of the usual $\ell_0$-type model selection criteria can be used, such as Akaike's Information Criterion (AIC), the Bayesian Information Criterion (BIC) or the Extended BIC (EBIC), with the last being particularly suitable for high-dimensional cases. The limiting properties of the new algorithm are analysed and it is shown that, under certain conditions, AdaSub converges to the best model according to the considered criterion. In a simulation study, the performance of AdaSub is investigated in comparison to alternative methods. The effectiveness of the proposed method is illustrated via various simulated datasets and a high-dimensional real data example.
△ Less
Submitted 19 April, 2021; v1 submitted 17 April, 2019;
originally announced May 2019.
-
Probability Based Independence Sampler for Bayesian Quantitative Learning in Graphical Log-Linear Marginal Models
Authors:
Ioannis Ntzoufras,
Claudia Tarantola,
Monia Lupparelli
Abstract:
Bayesian methods for graphical log-linear marginal models have not been developed in the same extent as traditional frequentist approaches. In this work, we introduce a novel Bayesian approach for quantitative learning for such models. These models belong to curved exponential families that are difficult to handle from a Bayesian perspective. Furthermore, the likelihood cannot be analytically expr…
▽ More
Bayesian methods for graphical log-linear marginal models have not been developed in the same extent as traditional frequentist approaches. In this work, we introduce a novel Bayesian approach for quantitative learning for such models. These models belong to curved exponential families that are difficult to handle from a Bayesian perspective. Furthermore, the likelihood cannot be analytically expressed as a function of the marginal log-linear interactions, but only in terms of cell counts or probabilities.
Posterior distributions cannot be directly obtained, and MCMC methods are needed. Finally, a well-defined model requires parameter values that lead to compatible marginal probabilities. Hence, any MCMC should account for this important restriction. We construct a fully automatic and efficient MCMC strategy for quantitative learning for graphical log-linear marginal models that handles these problems. While the prior is expressed in terms of the marginal log-linear interactions, we build an MCMC algorithm that employs a proposal on the probability parameter space. The corresponding proposal on the marginal log-linear interactions is obtained via parameter transformation.
By this strategy, we achieve to move within the desired target space. At each step, we directly work with well-defined probability distributions.
Moreover, we can exploit a conditional conjugate setup to build an efficient proposal on probability parameters. The proposed methodology is illustrated by a simulation study and a real dataset.
△ Less
Submitted 3 July, 2018;
originally announced July 2018.
-
Variations of Power-Expected-Posterior Priors in Normal Regression Models
Authors:
Dimitris Fouskakis,
Ioannis Ntzoufras,
Konstantinos Perrakis
Abstract:
The power-expected-posterior (PEP) prior is an objective prior for Gaussian linear models, which leads to consistent model selection inference, under the M-closed scenario, and tends to favor parsimonious models. Recently, two new forms of the PEP prior were proposed which generalize its applicability to a wider range of models. The properties of these two PEP variants within the context of the no…
▽ More
The power-expected-posterior (PEP) prior is an objective prior for Gaussian linear models, which leads to consistent model selection inference, under the M-closed scenario, and tends to favor parsimonious models. Recently, two new forms of the PEP prior were proposed which generalize its applicability to a wider range of models. The properties of these two PEP variants within the context of the normal linear model are examined thoroughly, focusing on the prior dispersion and on the consistency of the induced model selection procedure. Results show that both PEP variants have larger variances than the unit-information g-prior and that they are M-closed consistent as the limiting behavior of the corresponding marginal likelihoods matches that of the BIC. The consistency under the M-open case, using three different model misspecification scenarios is further investigated.
△ Less
Submitted 21 November, 2019; v1 submitted 22 September, 2016;
originally announced September 2016.
-
Power-Expected-Posterior Priors for Generalized Linear Models
Authors:
Dimitris Fouskakis,
Ioannis Ntzoufras,
Konstantinos Perrakis
Abstract:
The power-expected-posterior (PEP) prior provides an objective, automatic, consistent and parsimonious model selection procedure. At the same time it resolves the conceptual and computational problems due to the use of imaginary data. Namely, (i) it dispenses with the need to select and average across all possible minimal imaginary samples, and (ii) it diminishes the effect that the imaginary data…
▽ More
The power-expected-posterior (PEP) prior provides an objective, automatic, consistent and parsimonious model selection procedure. At the same time it resolves the conceptual and computational problems due to the use of imaginary data. Namely, (i) it dispenses with the need to select and average across all possible minimal imaginary samples, and (ii) it diminishes the effect that the imaginary data have upon the posterior distribution. These attributes allow for large sample approximations, when needed, in order to reduce the computational burden under more complex models. In this work we generalize the applicability of the PEP methodology, focusing on the framework of generalized linear models (GLMs), by introducing two new PEP definitions which are in effect applicable to any general model setting. Hyper-prior extensions for the power parameter that regulates the contribution of the imaginary data are introduced. We further study the validity of the predictive matching and of the model selection consistency, providing analytical proofs for the former and empirical evidence supporting the latter. For estimation of posterior model and inclusion probabilities we introduce a tuning-free Gibbs-based variable selection sampler. Several simulation scenarios and one real life example are considered in order to evaluate the performance of the proposed methods compared to other commonly used approaches based on mixtures of g-priors. Results indicate that the GLM-PEP priors are more effective in the identification of sparse and parsimonious model formulations.
△ Less
Submitted 29 September, 2017; v1 submitted 4 August, 2015;
originally announced August 2015.
-
Competitive balance measures and the Uncertainty of Outcome Hypothesis in European football
Authors:
Vasileios Manasis,
Ioannis Ntzoufras,
James Reade
Abstract:
Competitive balance is an important concept for professional sports and one of the key issues that European football has to address in order to ensure its long-term prosperity. However, the quantification of competitive balance is not an easy task. The difficulties are mainly associated with its multi-dimensionality character as well as the structure of each particular sport. This article uses dat…
▽ More
Competitive balance is an important concept for professional sports and one of the key issues that European football has to address in order to ensure its long-term prosperity. However, the quantification of competitive balance is not an easy task. The difficulties are mainly associated with its multi-dimensionality character as well as the structure of each particular sport. This article uses data from eight domestic leagues over 60 years to identify the best index for a holistic view of competitive balance in European football. The findings support the longstanding Uncertainty of Outcome Hypothesis using indices designed for the important three identified levels of competition and offering a weighting pattern for ranking places. Important conclusions may be derived concerning the relative importance of different aspects of competitive balance depending on the specific features of the best index.
△ Less
Submitted 19 May, 2021; v1 submitted 2 July, 2015;
originally announced July 2015.
-
Bayesian spatio-temporal epidemic models with applications to sheep pox
Authors:
C. Malesios,
N. Demiris,
K. Kalogeropoulos,
I. Ntzoufras
Abstract:
Epidemic data often possess certain characteristics, such as the presence of many zeros, the spatial nature of the disease spread mechanism or environmental noise. This paper addresses these issues via suitable Bayesian modelling. In doing so we utilise stochastic regression models appropriate for spatio-temporal count data with an excess number of zeros. The developed regression framework can inc…
▽ More
Epidemic data often possess certain characteristics, such as the presence of many zeros, the spatial nature of the disease spread mechanism or environmental noise. This paper addresses these issues via suitable Bayesian modelling. In doing so we utilise stochastic regression models appropriate for spatio-temporal count data with an excess number of zeros. The developed regression framework can incorporate serial correlation and time varying covariates through an Ornstein Uhlenbeck process formulation. In addition, we explore the effect of different priors, including default options and techniques based upon variations of mixtures of $g$-priors. The effect of different distance kernels for the epidemic model component is investigated. We proceed by developing branching process-based methods for testing scenarios for disease control, thus linking traditional spatio-temporal models with epidemic processes, useful in policy-focused decision making. The approach is illustrated with an application to a sheep pox dataset from the Evros region, Greece.
△ Less
Submitted 7 March, 2014;
originally announced March 2014.
-
Bayesian transformation family selection: moving towards a transformed Gaussian universe
Authors:
Efstratia Charitidou,
Dimitris Fouskakis,
Ioannis Ntzoufras
Abstract:
The problem of transformation selection is thoroughly treated from a Bayesian perspective. Several families of transformations are considered with a view to achieving normality: the Box-Cox, the Modulus, the Yeo & Johnson and the Dual transformation. Markov chain Monte Carlo algorithms have been constructed in order to sample from the posterior distribution of the transformation parameter $λ_T$ as…
▽ More
The problem of transformation selection is thoroughly treated from a Bayesian perspective. Several families of transformations are considered with a view to achieving normality: the Box-Cox, the Modulus, the Yeo & Johnson and the Dual transformation. Markov chain Monte Carlo algorithms have been constructed in order to sample from the posterior distribution of the transformation parameter $λ_T$ associated with each competing family $T$. We investigate different approaches to constructing compatible prior distributions for $λ_T$ over alternative transformation families, using a unit-information power-prior approach and an alternative normal prior with approximate unit-information interpretation. Selection and discrimination between different transformation families is attained via posterior model probabilities. We demonstrate the efficiency of our approach using a variety of simulated datasets. Although there is no choice of transformation family that can be universally applied to all problems, empirical evidence suggests that some particular data structures are best treated by specific transformation families. For example, skewness is associated with the Box-Cox family while fat-tailed distributions are efficiently treated using the Modulus transformation.
△ Less
Submitted 12 December, 2013;
originally announced December 2013.
-
On the use of marginal posteriors in marginal likelihood estimation via importance-sampling
Authors:
K. Perrakis,
I. Ntzoufras,
E. G. Tsionas
Abstract:
We investigate the efficiency of a marginal likelihood estimator where the product of the marginal posterior distributions is used as an importance-sampling function. The approach is generally applicable to multi-block parameter vector settings, does not require additional Markov Chain Monte Carlo (MCMC) sampling and is not dependent on the type of MCMC scheme used to sample from the posterior. Th…
▽ More
We investigate the efficiency of a marginal likelihood estimator where the product of the marginal posterior distributions is used as an importance-sampling function. The approach is generally applicable to multi-block parameter vector settings, does not require additional Markov Chain Monte Carlo (MCMC) sampling and is not dependent on the type of MCMC scheme used to sample from the posterior. The proposed approach is applied to normal regression models, finite normal mixtures and longitudinal Poisson models, and leads to accurate marginal likelihood estimates.
△ Less
Submitted 11 January, 2014; v1 submitted 4 November, 2013;
originally announced November 2013.
-
Explaining the behavior of joint and marginal Monte Carlo estimators in latent variable models with independence assumptions
Authors:
Silia Vitoratou,
Ioannis Ntzoufras,
Irini Moustaki
Abstract:
In latent variable models the parameter estimation can be implemented by using the joint or the marginal likelihood, based on independence or conditional independence assumptions. The same dilemma occurs within the Bayesian framework with respect to the estimation of the Bayesian marginal (or integrated) likelihood, which is the main tool for model comparison and averaging. In most cases, the Baye…
▽ More
In latent variable models the parameter estimation can be implemented by using the joint or the marginal likelihood, based on independence or conditional independence assumptions. The same dilemma occurs within the Bayesian framework with respect to the estimation of the Bayesian marginal (or integrated) likelihood, which is the main tool for model comparison and averaging. In most cases, the Bayesian marginal likelihood is a high dimensional integral that cannot be computed analytically and a plethora of methods based on Monte Carlo integration (MCI) are used for its estimation. In this work, it is shown that the joint MCI approach makes subtle use of the properties of the adopted model, leading to increased error and bias in finite settings. The sources and the components of the error associated with estimators under the two approaches are identified here and provided in exact forms. Additionally, the effect of the sample covariation on the Monte Carlo estimators is examined. In particular, even under independence assumptions the sample covariance will be close to (but not exactly) zero which surprisingly has a severe effect on the estimated values and their variability. To address this problem, an index of the sample's divergence from independence is introduced as a multivariate extension of covariance. The implications addressed here are important in the majority of practical problems appearing in Bayesian inference of multi-parameter models with analogous structures.
△ Less
Submitted 4 November, 2013;
originally announced November 2013.
-
Thermodynamic assessment of probability distribution divergencies and Bayesian model comparison
Authors:
Silia Vitoratou,
Ioannis Ntzoufras
Abstract:
Within path sampling framework, we show that probability distribution divergences, such as the Chernoff information, can be estimated via thermodynamic integration. The Boltzmann-Gibbs distribution pertaining to different Hamiltonians is implemented to derive tempered transitions along the path, linking the distributions of interest at the endpoints. Under this perspective, a geometric approach is…
▽ More
Within path sampling framework, we show that probability distribution divergences, such as the Chernoff information, can be estimated via thermodynamic integration. The Boltzmann-Gibbs distribution pertaining to different Hamiltonians is implemented to derive tempered transitions along the path, linking the distributions of interest at the endpoints. Under this perspective, a geometric approach is feasible, which prompts intuition and facilitates tuning the error sources. Additionally, there are direct applications in Bayesian model evaluation. Existing marginal likelihood and Bayes factor estimators are reviewed here along with their stepping-stone sampling analogues. New estimators are presented and the use of compound paths is introduced.
△ Less
Submitted 17 October, 2013; v1 submitted 30 August, 2013;
originally announced August 2013.
-
Power-Conditional-Expected Priors: Using g-priors with Random Imaginary Data for Variable Selection
Authors:
Dimitris Fouskakis,
Ioannis Ntzoufras
Abstract:
The Zellner's g-prior and its recent hierarchical extensions are the most popular default prior choices in the Bayesian variable selection context. These prior set-ups can be expressed power-priors with fixed set of imaginary data. In this paper, we borrow ideas from the power-expected-posterior (PEP) priors in order to introduce, under the g-prior approach, an extra hierarchical level that accoun…
▽ More
The Zellner's g-prior and its recent hierarchical extensions are the most popular default prior choices in the Bayesian variable selection context. These prior set-ups can be expressed power-priors with fixed set of imaginary data. In this paper, we borrow ideas from the power-expected-posterior (PEP) priors in order to introduce, under the g-prior approach, an extra hierarchical level that accounts for the imaginary data uncertainty. For normal regression variable selection problems, the resulting power-conditional-expected-posterior (PCEP) prior is a conjugate normal-inverse gamma prior which provides a consistent variable selection procedure and gives support to more parsimonious models than the ones supported using the g-prior and the hyper-g prior for finite samples. Detailed illustrations and comparisons of the variable selection procedures using the proposed method, the g-prior and the hyper-g prior are provided using both simulated and real data examples.
△ Less
Submitted 9 July, 2013;
originally announced July 2013.
-
Power-Expected-Posterior Priors for Variable Selection in Gaussian Linear Models
Authors:
Dimitris Fouskakis,
Ioannis Ntzoufras,
David Draper
Abstract:
In the context of the expected-posterior prior (EPP) approach to Bayesian variable selection in linear models, we combine ideas from power-prior and unit-information-prior methodologies to simultaneously produce a minimally-informative prior and diminish the effect of training samples. The result is that in practice our power-expected-posterior (PEP) methodology is sufficiently insensitive to the…
▽ More
In the context of the expected-posterior prior (EPP) approach to Bayesian variable selection in linear models, we combine ideas from power-prior and unit-information-prior methodologies to simultaneously produce a minimally-informative prior and diminish the effect of training samples. The result is that in practice our power-expected-posterior (PEP) methodology is sufficiently insensitive to the size n* of the training sample, due to PEP's unit-information construction, that one may take n* equal to the full-data sample size n and dispense with training samples altogether. In this paper we focus on Gaussian linear models and develop our method under two different baseline prior choices: the independence Jeffreys (or reference) prior, yielding the J-PEP posterior, and the Zellner g-prior, leading to Z-PEP. We find that, under the reference baseline prior, the asymptotics of PEP Bayes factors are equivalent to those of Schwartz's BIC criterion, ensuring consistency of the PEP approach to model selection. We compare the performance of our method, in simulation studies and a real example involving prediction of air-pollutant concentrations from meteorological covariates, with that of a variety of previously-defined variants on Bayes factors for objective variable selection. Our prior, due to its unit-information structure, leads to a variable-selection procedure that (1) is systematically more parsimonious than the basic EPP with minimal training sample, while sacrificing no desirable performance characteristics to achieve this parsimony; (2) is robust to the size of the training sample, thus enjoying the advantages described above arising from the avoidance of training samples altogether; and (3) identifies maximum-a-posteriori models that achieve good out-of-sample predictive performance.
△ Less
Submitted 20 May, 2014; v1 submitted 9 July, 2013;
originally announced July 2013.
-
Limiting behavior of the Jeffreys Power-Expected-Posterior Bayes Factor in Gaussian Linear Models
Authors:
Dimitris Fouskakis,
Ioannis Ntzoufras
Abstract:
Expected-posterior priors (EPP) have been proved to be extremely useful for testing hypothesis on the regression coefficients of normal linear models. One of the advantages of using EPPs is that impropriety of baseline priors causes no indeterminacy. However, in regression problems, they based on one or more \textit{training samples}, that could influence the resulting posterior distribution. The…
▽ More
Expected-posterior priors (EPP) have been proved to be extremely useful for testing hypothesis on the regression coefficients of normal linear models. One of the advantages of using EPPs is that impropriety of baseline priors causes no indeterminacy. However, in regression problems, they based on one or more \textit{training samples}, that could influence the resulting posterior distribution. The power-expected-posterior priors are minimally-informative priors that diminishing the effect of training samples on the EPP approach, by combining ideas from the power-prior and unit-information-prior methodologies. In this paper we show the consistency of the Bayes factors when using the power-expected-posterior priors, with the independence Jeffreys (or reference) prior as a baseline, for normal linear models under very mild conditions on the design matrix.
△ Less
Submitted 30 November, 2014; v1 submitted 9 July, 2013;
originally announced July 2013.
-
Joint Specification of Model Space and Parameter Space Prior Distributions
Authors:
Petros Dellaportas,
Jonathan J. Forster,
Ioannis Ntzoufras
Abstract:
We consider the specification of prior distributions for Bayesian model comparison, focusing on regression-type models. We propose a particular joint specification of the prior distribution across models so that sensitivity of posterior model probabilities to the dispersion of prior distributions for the parameters of individual models (Lindley's paradox) is diminished. We illustrate the behavior…
▽ More
We consider the specification of prior distributions for Bayesian model comparison, focusing on regression-type models. We propose a particular joint specification of the prior distribution across models so that sensitivity of posterior model probabilities to the dispersion of prior distributions for the parameters of individual models (Lindley's paradox) is diminished. We illustrate the behavior of inferential and predictive posterior quantities in linear and log-linear regressions under our proposed prior densities with a series of simulated and real data examples.
△ Less
Submitted 24 July, 2012;
originally announced July 2012.
-
Bayesian variable selection using cost-adjusted BIC, with application to cost-effective measurement of quality of health care
Authors:
D. Fouskakis,
I. Ntzoufras,
D. Draper
Abstract:
In the field of quality of health care measurement, one approach to assessing patient sickness at admission involves a logistic regression of mortality within 30 days of admission on a fairly large number of sickness indicators (on the order of 100) to construct a sickness scale, employing classical variable selection methods to find an ``optimal'' subset of 10--20 indicators. Such ``benefit-onl…
▽ More
In the field of quality of health care measurement, one approach to assessing patient sickness at admission involves a logistic regression of mortality within 30 days of admission on a fairly large number of sickness indicators (on the order of 100) to construct a sickness scale, employing classical variable selection methods to find an ``optimal'' subset of 10--20 indicators. Such ``benefit-only'' methods ignore the considerable differences among the sickness indicators in cost of data collection, an issue that is crucial when admission sickness is used to drive programs (now implemented or under consideration in several countries, including the U.S. and U.K.) that attempt to identify substandard hospitals by comparing observed and expected mortality rates (given admission sickness). When both data-collection cost and accuracy of prediction of 30-day mortality are considered, a large variable-selection problem arises in which costly variables that do not predict well enough should be omitted from the final scale. In this paper (a) we develop a method for solving this problem based on posterior model odds, arising from a prior distribution that (1) accounts for the cost of each variable and (2) results in a set of posterior model probabilities that corresponds to a generalized cost-adjusted version of the Bayesian information criterion (BIC), and (b) we compare this method with a decision-theoretic cost-benefit approach based on maximizing expected utility. We use reversible-jump Markov chain Monte Carlo (RJMCMC) methods to search the model space, and we check the stability of our findings with two variants of the MCMC model composition ($\mathit{MC}^3$) algorithm.
△ Less
Submitted 17 August, 2009;
originally announced August 2009.
-
Bayesian Analysis of Marginal Log-Linear Graphical Models for Three Way Contingency Tables
Authors:
Ioannis Ntzoufras,
Claudia Tarantola
Abstract:
This paper deals with the Bayesian analysis of graphical models of marginal independence for three way contingency tables. We use a marginal log-linear parametrization, under which the model is defined through suitable zero-constraints on the interaction parameters calculated within marginal distributions. We undertake a comprehensive Bayesian analysis of these models, involving suitable choices…
▽ More
This paper deals with the Bayesian analysis of graphical models of marginal independence for three way contingency tables. We use a marginal log-linear parametrization, under which the model is defined through suitable zero-constraints on the interaction parameters calculated within marginal distributions. We undertake a comprehensive Bayesian analysis of these models, involving suitable choices of prior distributions, estimation, model determination, as well as the allied computational issues. The methodology is illustrated with reference to two real data sets.
△ Less
Submitted 7 July, 2008;
originally announced July 2008.