Search | arXiv e-print repository

Robust Bayesian Inference for Censored Survival Models

Authors: Yasuyuki Hamura, Takahiro Onizuka, Shintaro Hashimoto, Shonosuke Sugasawa

Abstract: This paper proposes a robust Bayesian accelerated failure time model for censored survival data. We develop a new family of life-time distributions using a scale mixture of the generalized gamma distributions, where we propose a novel super heavy-tailed distribution as a mixing density. We theoretically show that, under some conditions, the proposed method satisfies the full posterior robustness, which guarantees robustness of point estimation as well as uncertainty quantification. For posterior computation, we employ an integral expression of the proposed heavy-tailed distribution to develop an efficient posterior computation algorithm based on the Markov chain Monte Carlo. The performance of the proposed method is illustrated through numerical experiments and real data example.

Submitted 15 April, 2025; originally announced April 2025.

Comments: 51 pages, 3 figures

arXiv:2503.00538 [pdf, ps, other]

Geometric Ergodicity of Gibbs Algorithms for a Normal Model With a Global-Local Shrinkage Prior

Authors: Yasuyuki Hamura

Abstract: In this paper, we consider Gibbs samplers for a normal linear regression model with a global-local shrinkage prior. We show that they produce geometrically ergodic Markov chains under some assumptions. In the first half of the paper, we prove geometric ergodicity under the horseshoe local prior and a three-parameter beta global prior which does not have a finite $(p / 5)$-th negative moment, where $p$ is the number of regression coefficients. This is in contrast to the case of a known general result which is applicable if the global parameter has a finite approximately $(p / 2)$-th negative moment. In the second half of the paper, we consider a more general class of global-local shrinkage priors. Geometric ergodicity is proved for two-stage and three-stage Gibbs samplers based on rejection sampling without assuming the negative moment condition.

Submitted 9 May, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

Comments: Sections 4, 5, and 6 have been added; 28 pages

arXiv:2410.17070 [pdf, ps, other]

A Short Note on the Efficiency of Markov Chains for Bayesian Linear Regression Models with Heavy-Tailed Errors

Authors: Yasuyuki Hamura

Abstract: In this short note, we consider posterior simulation for a linear regression model when the error distribution is given by a scale mixture of multivariate normals. We first show that the sampler of Backlund and Hobert (2020) for the case of the conditionally conjugate normal-inverse Wishart prior continues to be geometrically ergodic even when the error density is heavier-tailed. Moreover, we prove that the ergodicity is uniform by verifying the minorization condition. In the second half of this note, we treat an improper case and show that the sampler of Section 4 of Roy and Hobert (2010) is geometrically ergodic under significantly milder conditions.

Submitted 28 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

Comments: 10 pages; the last section has been added; this version is not going to be replaced

arXiv:2410.10618 [pdf, other]

An Approximate Identity Link Function for Bayesian Generalized Linear Models

Authors: Yasuyuki Hamura

Abstract: In this note, we consider using a link function that has heavier tails than the usual exponential link function. We construct efficient Gibbs algorithms for Poisson and Multinomial models based on this link function by introducing gamma and inverse Gaussian latent variables and show that the algorithms generate geometrically ergodic Markov chains in simple settings. Our algorithms can be used for more complicated models with many parameters. We fit our simple Poisson model to a real dataset and confirm that the posterior distribution has similar implications to those under the usual Poisson regression model based on the exponential link function. Although less interpretable, our models are potentially more tractable or flexible from a computational point of view in some cases.

Submitted 14 October, 2024; originally announced October 2024.

Comments: 19 pages, 2 figures

arXiv:2404.07586 [pdf, other]

State-Space Modeling of Shape-constrained Functional Time Series

Authors: Daichi Hiraki, Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: Functional time series data frequently appears in econometric analyses, where the functions of interest are subject to some shape constraints, including monotonicity and convexity, as typical of the estimation of the Lorenz curve. This paper proposes a state-space model for time-varying functions to extract trends and serial dependence from functional time series while imposing the shape constraints on the estimated functions. The function of interest is modeled by a convex combination of selected basis functions to satisfy the shape constraints, where the time-varying convex weights on simplex follow the dynamic multi-logit models. To enable posterior computation by an efficient Markov chain Monte Carlo method, a novel data augmentation technique is devised for the complicated likelihood of this model. The proposed method is applied to the estimation of time-varying Lorenz curves, and its utility is illustrated through numerical experiments and analysis of panel data of household incomes in Japan.

Submitted 1 December, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

Comments: 34 pages, 7 figures, 6 tables
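
Illustrative note (not taken from the paper): the abstract above describes modeling a shape-constrained function as a convex combination of basis functions whose weights lie on the simplex and are driven by multi-logit models. The sketch below shows only that static idea for a single time point. The power basis `u**k`, the function names, and the logit values are all hypothetical choices made for illustration; the paper's actual basis and dynamic model may differ.

```python
import math

def softmax(logits):
    """Map unconstrained logits to weights on the probability simplex."""
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def lorenz_curve(u, logits):
    """Convex combination of the hypothetical basis functions u**k, k = 1..K.

    Each basis function is increasing and convex on [0, 1], maps 0 to 0 and
    1 to 1, so any convex combination inherits the same shape constraints.
    """
    weights = softmax(logits)
    return sum(w * u ** (k + 1) for k, w in enumerate(weights))

# Evaluate one curve on a grid for an arbitrary (assumed) logit vector.
grid = [i / 10 for i in range(11)]
values = [lorenz_curve(u, [0.5, -1.0, 2.0]) for u in grid]
```

Because the constraints hold for every logit vector, a time series of logits (for example, one evolving under a state-space model) would yield a valid curve at every time point without any explicit constraint handling.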

arXiv:2303.00281 [pdf, other]

Posterior Robustness with Milder Conditions: Contamination Models Revisited

Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: Robust Bayesian linear regression is a classical but essential statistical tool. Although novel robustness properties of posterior distributions have been proved recently under a certain class of error distributions, their sufficient conditions are restrictive and exclude several important situations. In this work, we revisit a classical two-component mixture model for response variables, also known as contamination model, where one component is a light-tailed regression model and the other component is heavy-tailed. The latter component is independent of the regression parameters, which is crucial in proving the posterior robustness. We obtain new sufficient conditions for posterior (non-)robustness and reveal non-trivial robustness results by using those conditions. In particular, we find that even the Student-$t$ error distribution can achieve the posterior robustness in our framework. A numerical study is performed to check the Kullback-Leibler divergence between the posterior distribution based on full data and that based on data obtained by removing outliers.

Submitted 3 April, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

Comments: 19 pages, 1 figure

arXiv:2302.09707 [pdf, other]

doi 10.1080/10618600.2023.2258186

Gibbs Sampler for Matrix Generalized Inverse Gaussian Distributions

Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: Sampling from matrix generalized inverse Gaussian (MGIG) distributions is required in Markov Chain Monte Carlo (MCMC) algorithms for a variety of statistical models. However, an efficient sampling scheme for the MGIG distributions has not been fully developed. We here propose a novel blocked Gibbs sampler for the MGIG distributions, based on the Choleski decomposition. We show that the full conditionals of the diagonal and unit lower-triangular entries are univariate generalized inverse Gaussian and multivariate normal distributions, respectively. Several variants of the Metropolis-Hastings algorithm can also be considered for this problem, but we mathematically prove that the average acceptance rates become extremely low in particular scenarios. We demonstrate the computational efficiency of the proposed Gibbs sampler through simulation studies and data analysis.

Submitted 19 February, 2023; originally announced February 2023.

Comments: 34 pages, 5 figures

arXiv:2208.05121 [pdf, other]

Locally Adaptive Bayesian Isotonic Regression using Half Shrinkage Priors

Authors: Ryo Okano, Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: Isotonic regression or monotone function estimation is a problem of estimating function values under monotonicity constraints, which appears naturally in many scientific fields. This paper proposes a new Bayesian method with global-local shrinkage priors for estimating monotone function values. Specifically, we introduce half shrinkage priors for positive valued random variables and assign them for the first-order differences of function values. We also develop fast and simple Gibbs sampling algorithms for full posterior analysis. By incorporating advanced shrinkage priors, the proposed method is adaptive to local abrupt changes or jumps in target functions. We show this adaptive property theoretically by proving that the posterior mean estimators are robust to large differences and that asymptotic risk for unchanged points can be improved. Finally, we demonstrate the proposed methods through simulations and applications to a real data set.

Submitted 6 February, 2024; v1 submitted 9 August, 2022; originally announced August 2022.

Comments: 47 pages

Journal ref: Scandinavian Journal of Statistics, 2024

arXiv:2203.08440 [pdf, other]

doi 10.1214/22-ba1348

Sparse Bayesian inference on gamma-distributed observations using shape-scale inverse-gamma mixtures

Authors: Yasuyuki Hamura, Takahiro Onizuka, Shintaro Hashimoto, Shonosuke Sugasawa

Abstract: In various applications, we deal with high-dimensional positive-valued data that often exhibits sparsity. This paper develops a new class of continuous global-local shrinkage priors tailored to analyzing gamma-distributed observations where most of the underlying means are concentrated around a certain value. Unlike existing shrinkage priors, our new prior is a shape-scale mixture of inverse-gamma distributions, which has a desirable interpretation of the form of posterior mean and admits flexible shrinkage. We show that the proposed prior has two desirable theoretical properties; Kullback-Leibler super-efficiency under sparsity and robust shrinkage rules for large observations. We propose an efficient sampling algorithm for posterior inference. The performance of the proposed method is illustrated through simulation and two real data examples, the average length of hospital stay for COVID-19 in South Korea and adaptive variance estimation of gene expression data.

Submitted 30 November, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: 57 pages, 8 figures

arXiv:2203.01704 [pdf, other]

doi 10.1080/10618600.2022.2119988

On Data Augmentation for Models Involving Reciprocal Gamma Functions

Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: In this paper, we introduce a new and efficient data augmentation approach to the posterior inference of the models with shape parameters when the reciprocal gamma function appears in full conditional densities. Our approach is to approximate full conditional densities of shape parameters by using Gauss's multiplication formula and Stirling's formula for the gamma function, where the approximation error can be made arbitrarily small. We use the techniques to construct efficient Gibbs and Metropolis-Hastings algorithms for a variety of models that involve the gamma distribution, Student's $t$-distribution, the Dirichlet distribution, the negative binomial distribution, and the Wishart distribution. The proposed sampling method is numerically demonstrated through simulation studies.

Submitted 26 August, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

Comments: 41 pages, 6 figures
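
Illustrative note (not taken from the paper): the abstract above names Gauss's multiplication formula for the gamma function, $\Gamma(nz) = (2\pi)^{(1-n)/2} n^{nz - 1/2} \prod_{k=0}^{n-1} \Gamma(z + k/n)$. The sketch below only checks this standard identity numerically on the log scale using the Python standard library. It is not the paper's data augmentation scheme, and the function names and test points are assumptions chosen for illustration.

```python
import math

def log_gamma_via_multiplication(z, n):
    """log Gamma(n*z) computed from Gauss's multiplication formula (z > 0)."""
    return (
        0.5 * (1 - n) * math.log(2 * math.pi)   # log of (2*pi)^((1-n)/2)
        + (n * z - 0.5) * math.log(n)            # log of n^(n*z - 1/2)
        + sum(math.lgamma(z + k / n) for k in range(n))  # log of the product
    )

def max_abs_error(points, n):
    """Largest absolute gap between the formula and a direct lgamma call."""
    return max(
        abs(log_gamma_via_multiplication(z, n) - math.lgamma(n * z))
        for z in points
    )

# Compare both sides of the identity at a few arbitrary positive arguments.
errors = {n: max_abs_error([0.3, 1.0, 2.5, 7.0], n) for n in (2, 3, 5)}
```

The case `n = 2` reduces to Legendre's duplication formula, which gives a quick sanity check on the constants.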

arXiv:2106.10503 [pdf, other]

Robust Bayesian Modeling of Counts with Zero inflation and Outliers: Theoretical Robustness and Efficient Computation

Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: Count data with zero inflation and large outliers are ubiquitous in many scientific applications. However, posterior analysis under a standard statistical model, such as Poisson or negative binomial distribution, is sensitive to such contamination. This study introduces a novel framework for Bayesian modeling of counts that is robust to both zero inflation and large outliers. In doing so, we introduce rescaled beta distribution and adopt it to absorb undesirable effects from zero and outlying counts. The proposed approach has two appealing features: the efficiency of the posterior computation via a custom Gibbs sampling algorithm and a theoretically guaranteed posterior robustness, where extreme outliers are automatically removed from the posterior distribution. We demonstrate the usefulness of the proposed method by applying it to trend filtering and spatial modeling using predictive Gaussian processes.

Submitted 8 May, 2024; v1 submitted 19 June, 2021; originally announced June 2021.

Comments: 32 pages (main text) and 23 pages (supplementary material)

arXiv:2103.00518 [pdf, other]

doi 10.1080/03610926.2021.1980046

Bayesian Point Estimation and Predictive Density Estimation for the Binomial Distribution with a Restricted Probability Parameter

Authors: Yasuyuki Hamura

Abstract: In this paper, we consider Bayesian point estimation and predictive density estimation in the binomial case. After presenting preliminary results on these problems, we compare the risk functions of the Bayes estimators based on the truncated and untruncated beta priors and obtain dominance conditions when the probability parameter is less than or equal to a known constant. The case where there are both a lower bound restriction and an upper bound restriction is also treated. Then our problems are shown to be related to similar problems in the Poisson case. Finally, numerical studies are presented.

Submitted 20 March, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

Comments: 29 pages, 4 figures; Theorem 3.3 and Sections 4 and 6 have been added

arXiv:2010.03141 [pdf, other]

doi 10.1007/s42081-021-00141-z

Bayesian Shrinkage Approaches to Unbalanced Problems of Estimation and Prediction on the Basis of Negative Multinomial Samples

Authors: Yasuyuki Hamura

Abstract: In this paper, we treat estimation and prediction problems where negative multinomial variables are observed and in particular consider unbalanced settings. First, the problem of estimating multiple negative multinomial parameter vectors under the standardized squared error loss is treated and a new empirical Bayes estimator which dominates the UMVU estimator under suitable conditions is derived. Second, we consider estimation of the joint predictive density of several multinomial tables under the Kullback-Leibler divergence and obtain a sufficient condition under which the Bayesian predictive density with respect to a hierarchical shrinkage prior dominates the Bayesian predictive density with respect to the Jeffreys prior. Third, our proposed Bayesian estimator and predictive density give risk improvements in simulations. Finally, the problem of estimating the joint predictive density of negative multinomial variables is discussed.

Submitted 19 November, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

Comments: 34 pages, 1 figure

arXiv:2006.07052 [pdf, other]

doi 10.1016/j.jspi.2021.07.004

Bayesian Predictive Density Estimation for a Chi-squared Model Using Information from a Normal Observation with Unknown Mean and Variance

Authors: Yasuyuki Hamura, Tatsuya Kubokawa

Abstract: In this paper, we consider the problem of estimating the density function of a Chi-squared variable on the basis of observations of another Chi-squared variable and a normal variable under the Kullback-Leibler divergence. We assume that these variables have a common unknown scale parameter and that the mean of the normal variable is also unknown. We compare the risk functions of two Bayesian predictive densities: one with respect to a hierarchical shrinkage prior and the other based on a noninformative prior. The hierarchical Bayesian predictive density depends on the normal variable while the Bayesian predictive density based on the noninformative prior does not. Sufficient conditions for the former to dominate the latter are obtained. These predictive densities are compared by simulation.

Submitted 19 July, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

Comments: 18 pages, 1 figure, extensively rewritten

arXiv:2005.02800 [pdf, other]

Log-Regularly Varying Scale Mixture of Normals for Robust Regression

Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: Linear regression with the classical normality assumption for the error distribution may lead to an undesirable posterior inference of regression coefficients due to the potential outliers. This paper considers the finite mixture of two components with thin and heavy tails as the error distribution, which has been routinely employed in applied statistics. For the heavily-tailed component, we introduce the novel class of distributions; their densities are log-regularly varying and have heavier tails than those of Cauchy distribution, yet they are expressed as a scale mixture of normal distributions and enable the efficient posterior inference by Gibbs sampler. We prove the robustness to outliers of the posterior distributions under the proposed models with a minimal set of assumptions, which justifies the use of shrinkage priors with unbounded densities for the coefficient vector in the presence of outliers. The extensive comparison with the existing methods via simulation study shows the improved performance of our model in point and interval estimation, as well as its computational efficiency. Further, we confirm the posterior robustness of our method in the empirical study with the shrinkage priors for regression coefficients.

Submitted 9 January, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

Comments: 62 pages

arXiv:2001.09602 [pdf, ps, other]

doi 10.1016/j.jmva.2020.104653

Bayesian Shrinkage Estimation of Negative Multinomial Parameter Vectors

Authors: Yasuyuki Hamura, Tatsuya Kubokawa

Abstract: The negative multinomial distribution is a multivariate generalization of the negative binomial distribution. In this paper, we consider the problem of estimating an unknown matrix of probabilities on the basis of observations of negative multinomial variables under the standardized squared error loss. First, a general sufficient condition for a shrinkage estimator to dominate the UMVU estimator is derived and an empirical Bayes estimator satisfying the condition is constructed. Next, a hierarchical shrinkage prior is introduced, an associated Bayes estimator is shown to dominate the UMVU estimator under some conditions, and some remarks about posterior computation are presented. Finally, shrinkage estimators and the UMVU estimator are compared by simulation.

Submitted 29 October, 2020; v1 submitted 27 January, 2020; originally announced January 2020.

Comments: 31 pages; the code for numerical computation of the hierarchical Bayes estimator in Section 4 has been corrected; Tables 2, 3, and 4 and the second-to-the-last paragraph of Section 4 have been changed

arXiv:2001.08465 [pdf, other]

Shrinkage with Robustness: Log-Adjusted Priors for Sparse Signals

Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: We introduce a new class of distributions named log-adjusted shrinkage priors for the analysis of sparse signals, which extends the three parameter beta priors by multiplying an additional log-term to their densities. The proposed prior has density tails that are heavier than even those of the Cauchy distribution and realizes the tail-robustness of the Bayes estimator, while keeping the strong shrinkage effect on noises. We verify this property via the improved posterior mean squared errors in the tail. An integral representation with latent variables for the new density is available and enables fast and simple Gibbs samplers for the full posterior analysis. Our log-adjusted prior is significantly different from existing shrinkage priors with logarithms for allowing its further generalization by multiple log-terms in the density. The performance of the proposed priors is investigated through simulation studies and data analysis.

Submitted 26 January, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

Comments: 40 pages

arXiv:1907.01333 [pdf, other]

On Global-local Shrinkage Priors for Count Data

Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

Abstract: Global-local shrinkage prior has been recognized as useful class of priors which can strongly shrink small signals towards prior means while keeping large signals unshrunk. Although such priors have been extensively discussed under Gaussian responses, we intensively encounter count responses in practice in which the previous knowledge of global-local shrinkage priors cannot be directly imported. In this paper, we discuss global-local shrinkage priors for analyzing sequence of counts. We provide sufficient conditions under which the posterior mean keeps the observation as it is for very large signals, known as tail robustness property. Then, we propose tractable priors to meet the derived conditions approximately or exactly and develop an efficient posterior computation algorithm for Bayesian inference. The proposed methods are free from tuning parameters, that is, all the hyperparameters are automatically estimated based on the data. We demonstrate the proposed methods through simulation and an application to a real dataset.

Submitted 16 August, 2020; v1 submitted 2 July, 2019; originally announced July 2019.

Comments: 28 pages (main text) + 14 pages (supplementary material)

Showing 1–18 of 18 results for author: Hamura, Y