-
Impact of existence and nonexistence of pivot on the coverage of empirical best linear prediction intervals for small areas
Authors:
Yuting Chen,
Masayo Y. Hirose,
Partha Lahiri
Abstract:
We advance the theory of parametric bootstrap in constructing highly efficient empirical best (EB) prediction intervals of small area means. The coverage error of such a prediction interval is of the order $O(m^{-3/2})$, where $m$ is the number of small areas to be pooled using a linear mixed normal model. In the context of an area level model where the random effects follow a non-normal known distribution except possibly for unknown hyperparameters, we analytically show that the order of the coverage error of the empirical best linear (EBL) prediction interval remains the same even if we relax the normality of the random effects, provided that a pivot exists for suitably standardized random effects when the hyperparameters are known. Recognizing the challenge of showing the existence of a pivot, we develop a simple moment-based method to claim non-existence of a pivot. We show that the existing parametric bootstrap EBL prediction interval fails to achieve the desired order of the coverage error, i.e. $O(m^{-3/2})$, in the absence of a pivot. We obtain a surprising result that the order $O(m^{-1})$ term is always positive under certain conditions, indicating possible overcoverage of the existing parametric bootstrap EBL prediction interval. In general, we analytically show for the first time that the coverage problem can be corrected by adopting a suitably devised double parametric bootstrap. Our Monte Carlo simulations show that our proposed single bootstrap method performs reasonably well when compared to rival methods.
Submitted 17 October, 2024; v1 submitted 14 October, 2024;
originally announced October 2024.
-
Asymptotic Moments Matching to Uniformly Minimum Variance Unbiased Estimation under Ewens Sampling Formula
Authors:
Masayo Y. Hirose,
Shuhei Mano
Abstract:
The Ewens sampling formula is a distribution related to the random partition of a positive integer. In this study, we investigate the issue of non-existence of solutions in parameter estimation under this distribution. As a result, estimators matching the first and second moments of the uniformly minimum variance unbiased estimator are derived under the Ewens sampling formula in an asymptotic sense. A Monte Carlo simulation study is performed to evaluate the efficiency of the resulting estimators.
Submitted 23 May, 2021;
originally announced May 2021.
-
Asymptotic bias reduction of maximum likelihood estimates via penalized likelihoods with differential geometry
Authors:
Masayo Y. Hirose,
Shuhei Mano
Abstract:
A procedure for asymptotic bias reduction of maximum likelihood estimates of generic estimands is developed. The estimator is realized as a plug-in estimator, where the parameter maximizes the penalized likelihood with a penalty function that satisfies a quasi-linear partial differential equation of the first order. The integration of the partial differential equation with the aid of differential geometry is discussed. Applications to generalized linear models, linear mixed-effects models, and a location-scale family are presented.
Submitted 25 March, 2024; v1 submitted 30 November, 2020;
originally announced November 2020.
-
Statistical generalized derivative applied to the profile likelihood estimation in a mixture of semiparametric models
Authors:
Yuichi Hirose,
Ivy Liu
Abstract:
There is a difficulty in finding an estimate of variance of the profile likelihood estimator in the joint model of longitudinal and survival data. We solve the difficulty by introducing the ``statistical generalized derivative''. The derivative is used to show the asymptotic normality of the estimator without assuming the second derivative of the density function in the model exists.
Submitted 19 July, 2018;
originally announced July 2018.
-
A New Model Variance Estimator for an Area Level Small Area Model to Solve Multiple Problems Simultaneously
Authors:
Masayo Yoshimori Hirose,
Partha Lahiri
Abstract:
The two-level normal hierarchical model (NHM) has played a critical role in the theory of small area estimation (SAE), one of the growing areas in statistics with numerous applications in different disciplines. In this paper, we address major well-known shortcomings associated with the empirical best linear unbiased prediction (EBLUP) of a small area mean and its mean squared error (MSE) estimation by considering an appropriate model variance estimator that satisfies multiple properties. The proposed model variance estimator simultaneously (i) improves on the estimation of the related shrinkage factors, (ii) protects EBLUP from the common overshrinkage problem, and (iii) avoids complex bias correction in generating a strictly positive second-order unbiased MSE estimator by either the Taylor series or the single parametric bootstrap method. The idea of achieving multiple desirable properties in an EBLUP method through a suitably devised model variance estimator is the first of its kind and holds promise in providing good inferences for small area means under the classical linear mixed model prediction framework. The proposed methodology is also evaluated using a Monte Carlo simulation study and real data analysis.
Submitted 16 January, 2017;
originally announced January 2017.
-
Second-order unbiased naive estimator of mean squared error for EBLUP in small-area estimation
Authors:
Masayo Yoshimori Hirose
Abstract:
An empirical best linear unbiased prediction (EBLUP) estimator is utilized for efficient inference in small-area estimation. To measure its uncertainty, we need to estimate its mean squared error (MSE) since the true MSE cannot generally be derived in a closed form. The "naive MSE estimator", one of the estimators available for small-area inference, is unlikely to be chosen, since it does not achieve the desired asymptotic property, namely second-order unbiasedness, although it maintains strict positivity and tractability. Therefore, users tend to choose the second-order unbiased MSE estimator. In this paper, we seek a new adjusted maximum-likelihood method to obtain a naive MSE estimator that achieves the required asymptotic property. To obtain the result, we also reveal the relationship between the general adjusted maximum-likelihood method for the model variance parameter and the general functional form of the second-order unbiased, and strictly positive, MSE estimator. We also compare the performance of the new method with that of the existing naive estimator through a Monte Carlo simulation study. The results show that the new method remedies the underestimation associated with the existing naive estimator.
Submitted 12 December, 2016;
originally announced December 2016.
-
Non-area-specific adjustment factor for second-order efficient empirical Bayes confidence interval
Authors:
Masayo Y. Hirose
Abstract:
An empirical Bayes confidence interval has high user demand in many applications. In particular, the second-order empirical Bayes confidence interval, the coverage error of which is of the third order for a large number of areas, is widely used in small area estimation when the sample size within each area is not large enough to make reliable direct estimates based on a design-based approach. Yoshimori and Lahiri (2014a) proposed a new type of confidence interval, called the second-order efficient empirical Bayes confidence interval, whose length is less than that of the direct confidence interval based on the design-based approach. However, this interval still has some disadvantages: (i) it is hard to use when at least one leverage value is high; (ii) many iterations tend to be required to obtain the estimators of one global model variance parameter as the number of areas gets larger, due to the area-specific adjustment factor. To prevent such issues, this paper proposes, for the first time, a more efficient confidence interval that allows for high leverage and reduces the number of iterations for a large number of areas, by adopting a non-area-specific adjustment factor while maintaining the existing desired properties. We also reveal the relationship between the general adjustment factor and the measure of uncertainty of the empirical Bayes estimator to create a second-order confidence interval. Moreover, we present two simulation studies and a real data analysis to show the efficiency of this confidence interval.
Submitted 24 December, 2016; v1 submitted 15 July, 2016;
originally announced July 2016.
-
On differentiability of implicitly defined function in semi-parametric profile likelihood estimation
Authors:
Yuichi Hirose
Abstract:
In this paper, we study the differentiability of implicitly defined functions which we encounter in the profile likelihood estimation of parameters in semi-parametric models. Scott and Wild (Biometrika 84 (1997) 57-71; J. Statist. Plann. Inference 96 (2001) 3-27) and Murphy and van der Vaart (J. Amer. Statist. Assoc. 95 (2000) 449-485) developed methodologies that can avoid dealing with such implicitly defined functions by parametrizing parameters in the profile likelihood and using an approximate least favorable submodel in semi-parametric models. Our result shows applicability of an alternative approach presented in Hirose (Ann. Inst. Statist. Math. 63 (2011) 1247-1275) which uses the direct expansion of the profile likelihood.
Submitted 7 January, 2016;
originally announced January 2016.
-
Reparametrization of the least favorable submodel in semi-parametric multisample models
Authors:
Yuichi Hirose,
Alan Lee
Abstract:
The method of estimation in Scott and Wild (Biometrika 84 (1997) 57--71 and J. Statist. Plann. Inference 96 (2001) 3--27) uses a reparametrization of the profile likelihood that often reduces the computation times dramatically. Showing the efficiency of estimators for this method has been a challenging problem. In this paper, we try to solve the problem by investigating conditions under which the efficient score function and the efficient information matrix can be expressed in terms of the parameters in the reparametrized model.
Submitted 9 May, 2012;
originally announced May 2012.