-
A Standardization Procedure to Incorporate Variance Partitioning Based Priors in Latent Gaussian Models
Authors:
Luisa Ferrari,
Massimo Ventrucci
Abstract:
Latent Gaussian Models (LGMs) are a subset of Bayesian Hierarchical models where Gaussian priors, conditional on variance parameters, are assigned to all effects in the model. LGMs are employed in many fields for their flexibility and computational efficiency. However, practitioners find prior elicitation on the variance parameters challenging because of a lack of intuitive interpretation for them. Recently, several papers have tackled this issue by rethinking the model in terms of variance partitioning (VP) and assigning priors to parameters reflecting the relative contribution of each effect to the total variance. So far, the class of priors based on VP has been mainly deployed for random effects and fixed effects separately. This work presents a novel standardization procedure that expands the applicability of VP priors to a broader class of LGMs, including both fixed and random effects. We describe the steps required for standardization through various examples, with a particular focus on the popular class of intrinsic Gaussian Markov random fields (IGMRFs). The practical advantages of standardization are demonstrated with simulated data and a real dataset on survival analysis.
Submitted 27 January, 2025;
originally announced January 2025.
-
Condensation phenomena of ions in an electrostatic logarithmic trap
Authors:
Loris Ferrari
Abstract:
The effects of an electrostatic logarithmic trap (ELT) on an ionic gas confined in a cylindrical chamber are studied in detail, with special reference to the effects of the ion-ion Coulombic interactions and the resulting low-temperature thermodynamics. The collapse of the ions in radially localized states, about the axial cathode, is shown to cause an abrupt (but not critical) transition from non-degeneration to strong degeneration, at a special temperature $T_c$. This transition could actually involve both Bosons and Fermions and is not to be confused with a Bose-Einstein condensation (BEC), which is excluded in principle. However, while for Bosons the resulting effects on the pressure are observable in the ultra high vacuum (UHV) regime, the Fermions' density should fall well below UHV, for the pressure change to be observable. This is because the ion-ion \emph{exchange} interactions increase the kinetic energy along the axial cathode, which makes the Fermi level and the non-degeneration threshold temperature increase accordingly.
Submitted 3 October, 2024; v1 submitted 2 October, 2024;
originally announced October 2024.
-
A topological model for partial equivariance in deep learning and data analysis
Authors:
Lucia Ferrari,
Patrizio Frosini,
Nicola Quercioli,
Francesca Tombari
Abstract:
In this article, we propose a topological model to encode partial equivariance in neural networks. To this end, we introduce a class of operators, called P-GENEOs, that change data expressed by measurements, respecting the action of certain sets of transformations, in a non-expansive way. If the set of transformations acting is a group, then we obtain the so-called GENEOs. We then study the spaces of measurements, whose domains are subject to the action of certain self-maps, and the space of P-GENEOs between these spaces. We define pseudo-metrics on them and show some properties of the resulting spaces. In particular, we show how such spaces have convenient approximation and convexity properties.
Submitted 25 August, 2023;
originally announced August 2023.
-
Pandemic Data Quality Modelling: A Bayesian Approach
Authors:
Luisa Ferrari,
Giancarlo Manzi,
Alessandra Micheletti,
Federica Nicolussi,
Silvia Salini
Abstract:
When pandemics like COVID-19 spread around the world, the rapidly evolving situation compels officials and executives to make prompt decisions and adapt policies depending on the current state of the disease. In this context, it is crucial for policymakers to always have a firm grasp of the current state of the pandemic, and to envision how the number of infections and possible deaths is going to evolve over the next weeks. However, as in many other situations involving compulsory registration of sensitive data from multiple collectors, cases might be reported with errors, often with delays deferring an up-to-date view of the state of things. Errors in collecting new cases affect the overall mortality, resulting in excess deaths reported by official statistics only months later. In this paper, we provide tools for evaluating the quality of pandemic mortality data. We accomplish this through a Bayesian approach accounting for the excess mortality pandemics might bring with respect to the normal level of mortality in the population.
Submitted 23 April, 2023;
originally announced April 2023.
-
Robust beta regression through the logit transformation
Authors:
Yuri S. Maluf,
Silvia L. P. Ferrari,
Francisco F. Queiroz
Abstract:
Beta regression models are employed to model continuous response variables in the unit interval, like rates, percentages, or proportions. Their applications arise in several areas, such as medicine, environmental research, finance, and natural sciences. Maximum likelihood estimation is widely used to make inferences for the parameters. Nonetheless, it is well-known that maximum likelihood-based inference suffers from a lack of robustness in the presence of outliers. Such a case can bring severe bias and misleading conclusions. Recently, robust estimators for beta regression models were presented in the literature. However, these estimators require non-trivial restrictions in the parameter space, which limit their application. This paper develops new robust estimators that overcome this drawback. Their asymptotic and robustness properties are studied, and robust Wald-type tests are introduced. Simulation results provide evidence of the merits of the new robust estimators. Inference and diagnostics using the new estimators are illustrated in an application to health insurance coverage data.
Submitted 22 September, 2022;
originally announced September 2022.
-
Power logit regression for modeling bounded data
Authors:
Francisco Felipe Queiroz,
Silvia Lopes Paula Ferrari
Abstract:
The main purpose of this paper is to introduce a new class of regression models for bounded continuous data, commonly encountered in applied research. The models, named the power logit regression models, assume that the response variable follows a distribution in a wide, flexible class of distributions with three parameters, namely the median, a dispersion parameter and a skewness parameter. The paper offers a comprehensive set of tools for likelihood inference and diagnostic analysis, and introduces the new R package PLreg. Applications with real and simulated data show the merits of the proposed models, the statistical tools, and the computational package.
Submitted 3 February, 2022;
originally announced February 2022.
-
One-class Autoencoder Approach for Optimal Electrode Set-up Identification in Wearable EEG Event Monitoring
Authors:
Laura M. Ferrari,
Guy Abi Hanna,
Paolo Volpe,
Esma Ismailova,
François Bremond,
Maria A. Zuluaga
Abstract:
A limiting factor towards the wide routine use of wearable devices for continuous healthcare monitoring is their cumbersome and obtrusive nature. This is particularly true for electroencephalography (EEG) recordings, which require the placement of multiple electrodes in contact with the scalp. In this work, we propose to identify the optimal wearable EEG electrode set-up, in terms of minimal number of electrodes, comfortable location and performance, for EEG-based event detection and monitoring. By relying on the demonstrated power of autoencoder (AE) networks to learn latent representations from high-dimensional data, our proposed strategy trains an AE architecture in a one-class classification setup with different electrode set-ups as input data. The resulting models are assessed using the F-score and the best set-up is chosen according to the established optimal criteria. Using alpha wave detection as a use case, we demonstrate that the proposed method can detect an alpha state from an optimal set-up consisting of electrodes in the forehead and behind the ear, with an average F-score of 0.78. Our results suggest that a learning-based approach can be used to enable the design and implementation of optimized wearable devices for real-life healthcare monitoring.
Submitted 19 May, 2021; v1 submitted 9 April, 2021;
originally announced April 2021.
-
Robust estimation in beta regression via maximum Lq-likelihood
Authors:
Terezinha K. A. Ribeiro,
Silvia L. P. Ferrari
Abstract:
Beta regression models are widely used for modeling continuous data limited to the unit interval, such as proportions, fractions, and rates. The inference for the parameters of beta regression models is commonly based on maximum likelihood estimation. However, it is known to be sensitive to discrepant observations. In some cases, one atypical data point can lead to severe bias and erroneous conclusions about the features of interest. In this work, we develop a robust estimation procedure for beta regression models based on the maximization of a reparameterized Lq-likelihood. The new estimator offers a trade-off between robustness and efficiency through a tuning constant. To select the optimal value of the tuning constant, we propose a data-driven method which ensures full efficiency in the absence of outliers. We also improve on an alternative robust estimator by applying our data-driven method to select its optimum tuning constant. Monte Carlo simulations suggest marked robustness of the two robust estimators with little loss of efficiency. Applications to three datasets are presented and discussed. As a by-product of the proposed methodology, residual diagnostic plots based on robust fits highlight outliers that would be masked under maximum likelihood estimation.
Submitted 23 May, 2022; v1 submitted 21 October, 2020;
originally announced October 2020.
-
Modelling provincial Covid-19 epidemic data in Italy using an adjusted time-dependent SIRD model
Authors:
Luisa Ferrari,
Giuseppe Gerardi,
Giancarlo Manzi,
Alessandra Micheletti,
Federica Nicolussi,
Elia Biganzoli,
Silvia Salini
Abstract:
In this paper we develop a predictive model for the spread of COVID-19 infection at a provincial (i.e. EU NUTS-3) level in Italy by using official data from the Italian Ministry of Health integrated with data extracted from daily official press conferences of regional authorities and from local newspaper websites. This integration is mainly concerned with COVID-19 cause-specific death data, which are not available at NUTS-3 level from open official data channels. An adjusted time-dependent SIRD model is used to predict the behavior of the epidemic, specifically the number of susceptible, infected, deceased and recovered people. Predictive model performance is evaluated using comparison with real data.
Submitted 2 June, 2020; v1 submitted 25 May, 2020;
originally announced May 2020.
-
Higher-order approximate confidence intervals
Authors:
Eliane C. Pinheiro,
Silvia L. P. Ferrari,
Francisco M. C. Medeiros
Abstract:
Standard confidence intervals employed in applied statistical analysis are usually based on asymptotic approximations. Such approximations can be considerably inaccurate in small and moderate sized samples. We derive accurate confidence intervals based on higher-order approximate quantiles of the score function. The coverage approximation error is $O(n^{-3/2})$ while the approximation error of confidence intervals based on the asymptotic normality of MLEs is $O(n^{-1/2})$. Monte Carlo simulations confirm the theoretical findings. An implementation for regression models and real data applications are provided.
Submitted 10 December, 2020; v1 submitted 27 November, 2018;
originally announced November 2018.
-
Box-Cox elliptical distributions with application
Authors:
Raúl Alejandro Morán-Vásquez,
Silvia L. P. Ferrari
Abstract:
We propose and study the class of Box-Cox elliptical distributions. It provides alternative distributions for modeling multivariate positive, marginally skewed and possibly heavy-tailed data. This new class of distributions has as a special case the class of log-elliptical distributions, and reduces to the Box-Cox symmetric class of distributions in the univariate setting. The parameters are interpretable in terms of quantiles and relative dispersions of the marginal distributions and of associations between pairs of variables. The relation between the scale parameters and quantiles makes the Box-Cox elliptical distributions attractive for regression modeling purposes. Applications to data on vitamin intake are presented and discussed.
Submitted 17 October, 2017;
originally announced October 2017.
-
Box-Cox symmetric distributions and applications to nutritional data
Authors:
Silvia L. P. Ferrari,
Giovana Fumes
Abstract:
We introduce the Box-Cox symmetric class of distributions, which is useful for modeling positively skewed, possibly heavy-tailed, data. The new class of distributions includes the Box-Cox t, Box-Cox Cole-Green, Box-Cox power exponential distributions, and the class of the log-symmetric distributions as special cases. It provides easy parameter interpretation, which makes it convenient for regression modeling purposes. Additionally, it provides enough flexibility to handle outliers. The usefulness of the Box-Cox symmetric models is illustrated in applications to nutritional data.
Submitted 7 March, 2017; v1 submitted 8 April, 2016;
originally announced April 2016.
-
Small-sample testing inference in symmetric and log-symmetric linear regression models
Authors:
Francisco M. C. Medeiros,
Silvia L. P. Ferrari
Abstract:
This paper deals with the issue of testing hypotheses in symmetric and log-symmetric linear regression models in small and moderate-sized samples. We focus on four tests, namely the Wald, likelihood ratio, score, and gradient tests. These tests rely on asymptotic results and are unreliable when the sample size is not large enough to guarantee a good agreement between the exact distribution of the test statistic and the corresponding chi-squared asymptotic distribution. Bartlett and Bartlett-type corrections typically attenuate the size distortion of the tests. These corrections are available in the literature for the likelihood ratio and score tests in symmetric linear regression models. Here, we derive a Bartlett-type correction for the gradient test. We show that the corrections are also valid for the log-symmetric linear regression models. We numerically compare the various tests, and bootstrapped tests, through simulations. Our results suggest that the corrected and bootstrapped tests exhibit type I error probability closer to the chosen nominal level with virtually no power loss. The analytically corrected tests, including the Bartlett-corrected gradient test derived in this paper, perform as well as the bootstrapped tests with the advantage of not requiring computationally intensive calculations. We present two real data applications to illustrate the usefulness of the modified tests.
Keywords: Symmetric regression models; Bartlett correction; Bartlett-type correction; Bootstrap; Log-symmetric regression models; gradient statistic; score statistic; likelihood ratio statistic; Wald statistic.
Submitted 1 February, 2016;
originally announced February 2016.
-
A comparative review of generalizations of the Gumbel extreme value distribution with an application to wind speed data
Authors:
E. C. Pinheiro,
S. L. P. Ferrari
Abstract:
The generalized extreme value distribution and its particular case, the Gumbel extreme value distribution, are widely applied for extreme value analysis. The Gumbel distribution has certain drawbacks because it is a non-heavy-tailed distribution and is characterized by constant skewness and kurtosis. The generalized extreme value distribution is frequently used in this context because it encompass…
▽ More
The generalized extreme value distribution and its particular case, the Gumbel extreme value distribution, are widely applied for extreme value analysis. The Gumbel distribution has certain drawbacks because it is a non-heavy-tailed distribution and is characterized by constant skewness and kurtosis. The generalized extreme value distribution is frequently used in this context because it encompasses the three possible limiting distributions for a normalized maximum of infinite samples of independent and identically distributed observations. However, the generalized extreme value distribution might not be a suitable model when each observed maximum does not come from a large number of observations. Hence, other forms of generalizations of the Gumbel distribution might be preferable. Our goal is to collect in the present literature the distributions that contain the Gumbel distribution embedded in them and to identify those that have flexible skewness and kurtosis, are heavy-tailed and could be competitive with the generalized extreme value distribution. The generalizations of the Gumbel distribution are described and compared using an application to a wind speed data set and Monte Carlo simulations. We show that some distributions suffer from overparameterization and coincide with other generalized Gumbel distributions with a smaller number of parameters, i.e., are non-identifiable. Our study suggests that the generalized extreme value distribution and a mixture of two extreme value distributions should be considered in practical applications.
△ Less
Submitted 11 August, 2015; v1 submitted 9 February, 2015;
originally announced February 2015.
-
A class of regression models for parallel and series systems with a random number of components
Authors:
Alice L. Morais,
Silvia L. P. Ferrari
Abstract:
In this paper we extend the Weibull power series (WPS) class of distributions and name the new class the extended Weibull power series (EWPS) class of distributions. The EWPS distributions are related to series and parallel systems with a random number of components, whereas the WPS distributions (Morais and Barreto-Souza, 2011) are related to series systems only. Unlike the WPS distributions, for which the Weibull is a limiting special case, the Weibull law is a particular case of the EWPS distributions. We prove that the distributions in this class are identifiable under a simple assumption. We also prove stochastic and hazard rate order results and highlight that the shapes of the EWPS distributions are markedly more flexible than the shapes of the WPS distributions. We define a regression model for the EWPS response random variable to model a scale parameter and its quantiles. We present the maximum likelihood estimator and prove its consistency and normal asymptotic distribution. Although the construction of this class was motivated by series and parallel systems, the EWPS distributions are suitable for modeling a wide range of positive data sets. To illustrate potential uses of this model, we apply it to a real data set on the tensile strength of coconut fibers and present a simple device for diagnostic purposes.
Submitted 30 July, 2014; v1 submitted 29 May, 2014;
originally announced May 2014.
-
Small-sample one-sided testing in extreme value regression models
Authors:
Silvia L. P. Ferrari,
Eliane C. Pinheiro
Abstract:
We derive adjusted signed likelihood ratio statistics for a general class of extreme value regression models. The adjustments reduce the error in the standard normal approximation to the distribution of the signed likelihood ratio statistic. We use Monte Carlo simulations to compare the finite-sample performance of the different tests. Our simulations suggest that the signed likelihood ratio test tends to be liberal when the sample size is not large, and that the adjustments are effective in shrinking the size distortion. Two real data applications are presented and discussed.
Submitted 23 May, 2014;
originally announced May 2014.
-
Improved likelihood inference in generalized linear models
Authors:
Tiago M. Vargas,
Silvia L. P. Ferrari,
Artur J. Lemonte
Abstract:
We address the issue of performing testing inference in generalized linear models when the sample size is small. This class of models provides a straightforward way of modeling normal and non-normal data and has been widely used in several practical situations. The likelihood ratio, Wald and score statistics, and the recently proposed gradient statistic provide the basis for testing inference on the parameters in these models. We focus on the small-sample case, where the reference chi-squared distribution gives a poor approximation to the true null distribution of these test statistics. We derive a general Bartlett-type correction factor in matrix notation for the gradient test which reduces the size distortion of the test, and numerically compare the proposed test with the usual likelihood ratio, Wald, score and gradient tests, and with the Bartlett-corrected likelihood ratio and score tests. Our simulation results suggest that the corrected test we propose can be an interesting alternative to the other tests since it leads to very accurate inference even for very small samples. We also present an empirical application for illustrative purposes.
Submitted 15 August, 2013;
originally announced August 2013.
-
Errors-in-variables beta regression models
Authors:
Jalmar M. F. Carrasco,
Silvia L. P. Ferrari,
Reinaldo B. Arellano-Valle
Abstract:
Beta regression models provide an adequate approach for modeling continuous outcomes limited to the interval (0,1). This paper deals with an extension of beta regression models that allow for explanatory variables to be measured with error. The structural approach, in which the covariates measured with error are assumed to be random variables, is employed. Three estimation methods are presented, namely maximum likelihood, maximum pseudo-likelihood and regression calibration. Monte Carlo simulations are used to evaluate the performance of the proposed estimators and the naïve estimator. Also, a residual analysis for beta regression models with measurement errors is proposed. The results are illustrated in a real data set.
Submitted 10 April, 2013; v1 submitted 4 December, 2012;
originally announced December 2012.
-
Mixed Beta Regression: A Bayesian Perspective
Authors:
Jorge I. Figueroa-Zuñiga,
Reinaldo B. Arellano-Valle,
Silvia L. P. Ferrari
Abstract:
This paper builds on recent research that focuses on regression modeling of continuous bounded data, such as proportions measured on a continuous scale. Specifically, it deals with beta regression models with mixed effects from a Bayesian approach. We use a suitable parameterization of the beta law in terms of its mean and a precision parameter, and allow both parameters to be modeled through regression structures that may involve fixed and random effects. Specification of prior distributions is discussed, computational implementation via Gibbs sampling is provided, and illustrative examples are presented.
Submitted 14 November, 2012; v1 submitted 11 January, 2012;
originally announced January 2012.
-
A general class of zero-or-one inflated beta regression models
Authors:
Raydonal Ospina,
Silvia L. P. Ferrari
Abstract:
This paper proposes a general class of regression models for continuous proportions when the data contain zeros or ones. The proposed class of models assumes that the response variable has a mixed continuous-discrete distribution with probability mass at zero or one. The beta distribution is used to describe the continuous component of the model, since its density has a wide range of different shapes depending on the values of the two parameters that index the distribution. We use a suitable parameterization of the beta law in terms of its mean and a precision parameter. The parameters of the mixture distribution are modeled as functions of regression parameters. We provide inference, diagnostic, and model selection tools for this class of models. A practical application that employs real data is presented.
Submitted 2 November, 2011; v1 submitted 11 March, 2011;
originally announced March 2011.
-
Testing hypotheses in the Birnbaum-Saunders distribution under type-II censored samples
Authors:
Artur J. Lemonte,
Silvia L. P. Ferrari
Abstract:
The two-parameter Birnbaum-Saunders distribution has been used successfully to model fatigue failure times. Although censoring is typical in reliability and survival studies, little work has been published on the analysis of censored data for this distribution. In this paper, we address the issue of performing testing inference on the two parameters of the Birnbaum-Saunders distribution under type-II right censored samples. The likelihood ratio statistic and a recently proposed statistic, the gradient statistic, provide a convenient framework for statistical inference in such a case, since they do not require obtaining, estimating or inverting an information matrix, which is an advantage in problems involving censored data. An extensive Monte Carlo simulation study is carried out in order to investigate and compare the finite sample performance of the likelihood ratio and the gradient tests. Our numerical results show evidence that the gradient test should be preferred. Three empirical applications are presented.
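To make the type-II right censoring setup concrete, here is a minimal sketch of the censored log-likelihood. It assumes the standard Birnbaum-Saunders form with shape alpha and scale beta, F(t) = Phi((sqrt(t/beta) - sqrt(beta/t)) / alpha), and a sample of size n in which only the r smallest failure times are observed. The additive constant that does not depend on the parameters is dropped, and all function names are hypothetical.

```python
import math


def bs_z(t, alpha, beta):
    """Standardized argument z(t) = (sqrt(t/beta) - sqrt(beta/t)) / alpha."""
    return (math.sqrt(t / beta) - math.sqrt(beta / t)) / alpha


def bs_cdf(t, alpha, beta):
    """Birnbaum-Saunders CDF: standard normal CDF evaluated at z(t)."""
    return 0.5 * math.erfc(-bs_z(t, alpha, beta) / math.sqrt(2.0))


def bs_logpdf(t, alpha, beta):
    """Log-density: standard normal log-density at z(t) plus log of dz/dt."""
    z = bs_z(t, alpha, beta)
    log_normal = -0.5 * z * z - 0.5 * math.log(2.0 * math.pi)
    dz_dt = (math.sqrt(t / beta) + math.sqrt(beta / t)) / (2.0 * alpha * t)
    return log_normal + math.log(dz_dt)


def bs_logsf(t, alpha, beta):
    """Log of the survival function 1 - F(t)."""
    return math.log(0.5 * math.erfc(bs_z(t, alpha, beta) / math.sqrt(2.0)))


def type2_censored_loglik(observed, n, alpha, beta):
    """Log-likelihood from the r smallest of n failure times (constant dropped).

    The n - r unobserved units are only known to exceed the largest observed time.
    """
    r = len(observed)
    if r == 0 or r > n:
        raise ValueError("need 1 <= len(observed) <= n")
    loglik = sum(bs_logpdf(t, alpha, beta) for t in observed)
    if n > r:
        loglik += (n - r) * bs_logsf(max(observed), alpha, beta)
    return loglik
```

A likelihood ratio test compares this function at its unrestricted and restricted maximizers, which is consistent with the abstract's remark that no information matrix is needed.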
Submitted 10 September, 2010;
originally announced September 2010.
-
A New Generalized Kumaraswamy Distribution
Authors:
Jalmar M. F. Carrasco,
Silvia L. P. Ferrari,
Gauss M. Cordeiro
Abstract:
A new five-parameter continuous distribution which generalizes the Kumaraswamy and the beta distributions as well as some other well-known distributions is proposed and studied. The model has as special cases new four- and three-parameter distributions on the standard unit interval. Moments, mean deviations, Rényi's entropy and the moments of order statistics are obtained for the new generalized Kumaraswamy distribution. The score function is given and estimation is performed by maximum likelihood. Hypothesis testing is also discussed. A data set is used to illustrate an application of the proposed distribution.
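For orientation, the baseline two-parameter Kumaraswamy distribution on the unit interval, which the proposed model generalizes, can be sketched as below. The abstract does not state the five-parameter form, so it is not reproduced here; the parameter names a and b and the function names are assumptions made for this example.

```python
def kw_pdf(x, a, b):
    """Kumaraswamy density on (0, 1): a * b * x^(a-1) * (1 - x^a)^(b-1)."""
    return a * b * x ** (a - 1.0) * (1.0 - x ** a) ** (b - 1.0)


def kw_cdf(x, a, b):
    """Kumaraswamy CDF: 1 - (1 - x^a)^b."""
    return 1.0 - (1.0 - x ** a) ** b


def kw_quantile(u, a, b):
    """Closed-form inverse of the CDF, for u in (0, 1)."""
    return (1.0 - (1.0 - u) ** (1.0 / b)) ** (1.0 / a)
```

Because the quantile function is available in closed form, a draw can be obtained by applying kw_quantile to a uniform random number, for example kw_quantile(random.random(), a, b).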
Submitted 6 April, 2010;
originally announced April 2010.
-
Small-sample corrections for score tests in Birnbaum-Saunders regressions
Authors:
Artur J. Lemonte,
Silvia L. P. Ferrari
Abstract:
In this paper we deal with the issue of performing accurate small-sample inference in the Birnbaum-Saunders regression model, which can be useful for modeling lifetime or reliability data. We derive a Bartlett-type correction for the score test and numerically compare the corrected test with the usual score test, the likelihood ratio test and its Bartlett-corrected version. Our simulation results suggest that the corrected test we propose is more reliable than the other tests.
Submitted 15 July, 2010; v1 submitted 5 May, 2009;
originally announced May 2009.
-
Improved testing inference in mixed linear models
Authors:
Tatiane F. N. Melo,
Silvia L. P. Ferrari,
Francisco Cribari-Neto
Abstract:
Mixed linear models are commonly used in repeated measures studies. They account for the dependence amongst observations obtained from the same experimental unit. Oftentimes, the number of observations is small, and it is thus important to use inference strategies that incorporate small sample corrections. In this paper, we develop modified versions of the likelihood ratio test for fixed effects inference in mixed linear models. In particular, we derive a Bartlett correction to such a test and also to a test obtained from a modified profile likelihood function. Our results generalize those in Zucker et al. (Journal of the Royal Statistical Society B, 2000, 62, 827-838) by allowing the parameter of interest to be vector-valued. Additionally, our Bartlett corrections allow for random effects nonlinear covariance matrix structure. We report numerical evidence which shows that the proposed tests display superior finite sample behavior relative to the standard likelihood ratio test. An application is also presented and discussed.
Submitted 4 August, 2011; v1 submitted 23 June, 2008;
originally announced June 2008.
-
Improved Likelihood Inference in Birnbaum-Saunders Regressions
Authors:
Artur J. Lemonte,
Silvia L. P. Ferrari,
Francisco Cribari-Neto
Abstract:
The Birnbaum-Saunders regression model is commonly used in reliability studies. We address the issue of performing inference in this class of models when the number of observations is small. We show that the likelihood ratio test tends to be liberal when the sample size is small, and we obtain a correction factor which reduces the size distortion of the test. The correction makes the error rate of the test vanish faster as the sample size increases. The numerical results show that the modified test is more reliable in finite samples than the usual likelihood ratio test. We also present an empirical application.
Submitted 22 April, 2009; v1 submitted 13 June, 2008;
originally announced June 2008.
-
Inflated Beta Distributions
Authors:
Raydonal Ospina,
Silvia L. P. Ferrari
Abstract:
This paper considers the issue of modeling fractional data observed in the interval [0,1), (0,1] or [0,1]. Mixed continuous-discrete distributions are proposed. The beta distribution is used to describe the continuous component of the model since its density can have quite different shapes depending on the values of the two parameters that index the distribution. Properties of the proposed distributions are examined. Also, maximum likelihood and method of moments estimation are discussed. Finally, practical applications that employ real data are presented.
Submitted 11 November, 2007; v1 submitted 4 May, 2007;
originally announced May 2007.