-
Finite mixture representations of zero-&-$N$-inflated distributions for count-compositional data
Authors:
André F. B. Menezes,
Andrew C. Parnell,
Keefe Murphy
Abstract:
We provide novel probabilistic portrayals of two multivariate models designed to handle zero-inflation in count-compositional data. We develop a new unifying framework that represents both as finite mixture distributions. One of these distributions, based on Dirichlet-multinomial components, has been studied before, but has not yet been properly characterised as a sampling distribution of the counts. The other, based on multinomial components, is a new contribution. Using our finite mixture representations enables us to derive key statistical properties, including moments, marginal distributions, and special cases for both distributions. We develop enhanced Bayesian inference schemes with efficient Gibbs sampling updates, wherever possible, for parameters and auxiliary variables, demonstrating improvements over existing methods in the literature. We conduct simulation studies to evaluate the efficiency of the Bayesian inference procedures and to illustrate the practical utility of the proposed distributions.
Submitted 23 January, 2025;
originally announced January 2025.
-
A Model for Bimodal Rates and Proportions
Authors:
Roberto Vila,
Lucas Alfaia,
André F. B. Menezes,
Mehmet N. Çankaya,
Marcelo Bourguignon
Abstract:
The beta model is the most important distribution for fitting data on the unit interval. However, the beta distribution is not suitable for modelling bimodal unit-interval data. In this paper, we propose a bimodal beta distribution constructed using an approach based on the alpha-skew-normal model. We discuss several properties of this distribution, such as bimodality, real moments, entropy measures and identifiability. Furthermore, we propose a new regression model based on the proposed distribution and discuss residuals. Estimation is performed by maximum likelihood. A Monte Carlo experiment is conducted to evaluate the performance of these estimators in finite samples, with a discussion of the results. An application is provided to show the modelling competence of the proposed distribution when the data sets show bimodality.
Submitted 17 August, 2021;
originally announced August 2021.
-
On the one parameter unit-Lindley distribution and its associated regression model for proportion data
Authors:
J. Mazucheli,
A. F. B. Menezes,
S. Chakraborty
Abstract:
In this paper, considering the transformation $X=\frac{Y}{1+Y}$, where $Y \sim\text{Lindley}(θ)$, we propose the unit-Lindley distribution and investigate some of its mathematical properties. An important fact associated with this new distribution is that it is possible to obtain an analytical expression for the bias correction of the maximum likelihood estimator. Moreover, it belongs to the exponential family. This distribution allows us to incorporate covariates directly in the mean and consequently to quantify their influence on the average of the response variable. Finally, a practical application is presented, and it is shown that our model fits much better than the Beta regression.
Submitted 8 January, 2018;
originally announced January 2018.
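The transformation $X=\frac{Y}{1+Y}$ in the unit-Lindley abstract above can be illustrated with a minimal simulation sketch. It is not the authors' code. It assumes the standard Lindley density $f(y)=\frac{θ^2}{1+θ}(1+y)e^{-θy}$, which can be written as a mixture of an Exponential(θ) component with weight $θ/(1+θ)$ and a Gamma(2, θ) component with weight $1/(1+θ)$. Under that density a short calculation gives $E[X]=1/(1+θ)$, which the sketch uses as a sanity check. The function names and the value of `theta` are hypothetical, chosen only for illustration.

```python
import random


def sample_lindley(theta, rng):
    """Draw Y ~ Lindley(theta) via its two-component mixture form (assumed density)."""
    # With probability theta / (1 + theta), draw from Exponential(rate=theta);
    # otherwise draw from Gamma(shape=2, rate=theta).
    if rng.random() < theta / (1.0 + theta):
        return rng.expovariate(theta)
    # random.gammavariate takes (shape, scale), so scale = 1 / rate.
    return rng.gammavariate(2.0, 1.0 / theta)


def sample_unit_lindley(theta, rng):
    """Draw X = Y / (1 + Y), which maps Y in [0, inf) into [0, 1)."""
    y = sample_lindley(theta, rng)
    return y / (1.0 + y)


rng = random.Random(0)   # fixed seed so the sketch is reproducible
theta = 2.0              # illustrative value, not taken from the paper
draws = [sample_unit_lindley(theta, rng) for _ in range(100_000)]

sample_mean = sum(draws) / len(draws)
theoretical_mean = 1.0 / (1.0 + theta)
print(f"sample mean = {sample_mean:.4f}, 1/(1+theta) = {theoretical_mean:.4f}")
```

Because the mean has the closed form $1/(1+θ)$ under this density, θ can be rewritten in terms of the mean, which is consistent with the abstract's remark that covariates can be incorporated directly in the mean.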