-
Multivariate Species Sampling Models
Authors:
Beatrice Franzolini,
Antonio Lijoi,
Igor Prünster,
Giovanni Rebaudo
Abstract:
Species sampling processes have long served as the framework for studying random discrete distributions. However, their statistical applicability is limited when partial exchangeability is assumed as probabilistic invariance for the observables. Despite numerous discrete models for partially exchangeable observations, a unifying framework is currently missing, leaving many questions about the indu…
▽ More
Species sampling processes have long served as the framework for studying random discrete distributions. However, their statistical applicability is limited when partial exchangeability is assumed as probabilistic invariance for the observables. Despite numerous discrete models for partially exchangeable observations, a unifying framework is currently missing, leaving many questions about the induced learning mechanisms unanswered in this setting. To fill this gap, we consider the natural extension of species sampling models to a multivariate framework, obtaining a general class of models characterized by their partially exchangeable partition probability function. A notable subclass, named regular multivariate species sampling models, exists among these models. In the subclass, dependence across processes is accurately captured by the correlation among them: a correlation of one equals full exchangeability and a null correlation corresponds to independence. Regular multivariate species sampling models encompass discrete processes for partial exchangeable data used in Bayesian models, thereby highlighting their core distributional properties and providing a means for developing new models.
△ Less
Submitted 31 March, 2025;
originally announced March 2025.
-
Nonparametric priors with full-range borrowing of information
Authors:
Filippo Ascolani,
Beatrice Franzolini,
Antonio Lijoi,
Igor Prünster
Abstract:
Modeling of the dependence structure across heterogeneous data is crucial for Bayesian inference since it directly impacts the borrowing of information. Despite the extensive advances over the last two decades, most available proposals allow only for non-negative correlations. We derive a new class of dependent nonparametric priors that can induce correlations of any sign, thus introducing a new a…
▽ More
Modeling of the dependence structure across heterogeneous data is crucial for Bayesian inference since it directly impacts the borrowing of information. Despite the extensive advances over the last two decades, most available proposals allow only for non-negative correlations. We derive a new class of dependent nonparametric priors that can induce correlations of any sign, thus introducing a new and more flexible idea of borrowing of information. This is achieved thanks to a novel concept, which we term hyper-tie, and represents a direct and simple measure of dependence. We investigate prior and posterior distributional properties of the model and develop algorithms to perform posterior inference. Illustrative examples on simulated and real data show that our proposal outperforms alternatives in terms of prediction and clustering.
△ Less
Submitted 1 October, 2023;
originally announced October 2023.
-
Flexible clustering via hidden hierarchical Dirichlet priors
Authors:
Antonio Lijoi,
Igor Prünster,
Giovanni Rebaudo
Abstract:
The Bayesian approach to inference stands out for naturally allowing borrowing information across heterogeneous populations, with different samples possibly sharing the same distribution. A popular Bayesian nonparametric model for clustering probability distributions is the nested Dirichlet process, which however has the drawback of grouping distributions in a single cluster when ties are observed…
▽ More
The Bayesian approach to inference stands out for naturally allowing borrowing information across heterogeneous populations, with different samples possibly sharing the same distribution. A popular Bayesian nonparametric model for clustering probability distributions is the nested Dirichlet process, which however has the drawback of grouping distributions in a single cluster when ties are observed across samples. With the goal of achieving a flexible and effective clustering method for both samples and observations, we investigate a nonparametric prior that arises as the composition of two different discrete random structures and derive a closed-form expression for the induced distribution of the random partition, the fundamental tool regulating the clustering behavior of the model. On the one hand, this allows to gain a deeper insight into the theoretical properties of the model and, on the other hand, it yields an MCMC algorithm for evaluating Bayesian inferences of interest. Moreover, we single out limitations of this algorithm when working with more than two populations and, consequently, devise an alternative more efficient sampling scheme, which as a by-product, allows testing homogeneity between different populations. Finally, we perform a comparison with the nested Dirichlet process and provide illustrative examples of both synthetic and real data.
△ Less
Submitted 18 January, 2022;
originally announced January 2022.
-
A Wasserstein index of dependence for random measures
Authors:
Marta Catalano,
Hugo Lavenant,
Antonio Lijoi,
Igor Prünster
Abstract:
Optimal transport and Wasserstein distances are flourishing in many scientific fields as a means for comparing and connecting random structures. Here we pioneer the use of an optimal transport distance between Lévy measures to solve a statistical problem. Dependent Bayesian nonparametric models provide flexible inference on distinct, yet related, groups of observations. Each component of a vector…
▽ More
Optimal transport and Wasserstein distances are flourishing in many scientific fields as a means for comparing and connecting random structures. Here we pioneer the use of an optimal transport distance between Lévy measures to solve a statistical problem. Dependent Bayesian nonparametric models provide flexible inference on distinct, yet related, groups of observations. Each component of a vector of random measures models a group of exchangeable observations, while their dependence regulates the borrowing of information across groups. We derive the first statistical index of dependence in $[0,1]$ for (completely) random measures that accounts for their whole infinite-dimensional distribution, which is assumed to be equal across different groups. This is accomplished by using the geometric properties of the Wasserstein distance to solve a max-min problem at the level of the underlying Lévy measures. The Wasserstein index of dependence sheds light on the models' deep structure and has desirable properties: (i) it is $0$ if and only if the random measures are independent; (ii) it is $1$ if and only if the random measures are completely dependent; (iii) it simultaneously quantifies the dependence of $d \ge 2$ random measures, avoiding the need for pairwise comparisons; (iv) it can be evaluated numerically. Moreover, the index allows for informed prior specifications and fair model comparisons for Bayesian nonparametric models.
△ Less
Submitted 15 September, 2023; v1 submitted 14 September, 2021;
originally announced September 2021.
-
Inner spike and slab Bayesian nonparametric models
Authors:
Antonio Canale,
Antonio Lijoi,
Bernardo Nipoti,
Igor Prünster
Abstract:
Discrete Bayesian nonparametric models whose expectation is a convex linear combination of a point mass at some point of the support and a diffuse probability distribution allow to incorporate strong prior information, while still being extremely flexible. Recent contributions in the statistical literature have successfully implemented such a modelling strategy in a variety of applications, includ…
▽ More
Discrete Bayesian nonparametric models whose expectation is a convex linear combination of a point mass at some point of the support and a diffuse probability distribution allow to incorporate strong prior information, while still being extremely flexible. Recent contributions in the statistical literature have successfully implemented such a modelling strategy in a variety of applications, including density estimation, nonparametric regression and model-based clustering. We provide a thorough study of a large class of nonparametric models we call inner spike and slab hNRMI models, which are obtained by considering homogeneous normalized random measures with independent increments (hNRMI) with base measure given by a convex linear combination of a point mass and a diffuse probability distribution. In this paper we investigate the distributional properties of these models and our results include: i) the exchangeable partition probability function they induce, ii) the distribution of the number of distinct values in an exchangeable sample, iii) the posterior predictive distribution, and iv) the distribution of the number of elements that coincide with the only point of the support with positive probability. Our findings are the main building block for an actual implementation of Bayesian inner spike and slab hNRMI models by means of a generalized Pólya urn scheme.
△ Less
Submitted 21 July, 2021;
originally announced July 2021.
-
Asymptotic behavior of the number of distinct values in a sample from the geometric stick-breaking process
Authors:
Pierpaolo De Blasi,
Ramsés H. Mena,
Igor Prünster
Abstract:
Discrete random probability measures are a key ingredient of Bayesian nonparametric inferential procedures. A sample generates ties with positive probability and a fundamental object of both theoretical and applied interest is the corresponding random number of distinct values. The growth rate can be determined from the rate of decay of the small frequencies implying that, when the decreasingly or…
▽ More
Discrete random probability measures are a key ingredient of Bayesian nonparametric inferential procedures. A sample generates ties with positive probability and a fundamental object of both theoretical and applied interest is the corresponding random number of distinct values. The growth rate can be determined from the rate of decay of the small frequencies implying that, when the decreasingly ordered frequencies admit a tractable form, the asymptotics of the number of distinct values can be conveniently assessed. We focus on the geometric stick-breaking process and we investigate the effect of the choice of the distribution for the success probability on the asymptotic behavior of the number of distinct values. We show that a whole range of logarithmic behaviors are obtained by appropriately tuning the prior. We also derive a two-term expansion and illustrate its use in a comparison with a larger family of discrete random probability measures having an additional parameter given by the scale of the negative binomial distribution.
△ Less
Submitted 19 January, 2021;
originally announced January 2021.
-
Stochastic approximations to the Pitman-Yor process
Authors:
Julyan Arbel,
Pierpaolo De Blasi,
Igor Pruenster
Abstract:
In this paper we consider approximations to the popular Pitman-Yor process obtained by truncating the stick-breaking representation. The truncation is determined by a random stopping rule that achieves an almost sure control on the approximation error in total variation distance. We derive the asymptotic distribution of the random truncation point as the approximation error epsilon goes to zero in…
▽ More
In this paper we consider approximations to the popular Pitman-Yor process obtained by truncating the stick-breaking representation. The truncation is determined by a random stopping rule that achieves an almost sure control on the approximation error in total variation distance. We derive the asymptotic distribution of the random truncation point as the approximation error epsilon goes to zero in terms of a polynomially tilted positive stable distribution. The practical usefulness and effectiveness of this theoretical result is demonstrated by devising a sampling algorithm to approximate functionals of the epsilon-version of the Pitman-Yor process.
△ Less
Submitted 13 July, 2019; v1 submitted 28 June, 2018;
originally announced June 2018.
-
Latent nested nonparametric priors
Authors:
Federico Camerlenghi,
David B. Dunson,
Antonio Lijoi,
Igor Prünster,
Abel Rodríguez
Abstract:
Discrete random structures are important tools in Bayesian nonparametrics and the resulting models have proven effective in density estimation, clustering, topic modeling and prediction, among others. In this paper, we consider nested processes and study the dependence structures they induce. Dependence ranges between homogeneity, corresponding to full exchangeability, and maximum heterogeneity, c…
▽ More
Discrete random structures are important tools in Bayesian nonparametrics and the resulting models have proven effective in density estimation, clustering, topic modeling and prediction, among others. In this paper, we consider nested processes and study the dependence structures they induce. Dependence ranges between homogeneity, corresponding to full exchangeability, and maximum heterogeneity, corresponding to (unconditional) independence across samples. The popular nested Dirichlet process is shown to degenerate to the fully exchangeable case when there are ties across samples at the observed or latent level. To overcome this drawback, inherent to nesting general discrete random measures, we introduce a novel class of latent nested processes. These are obtained by adding common and group-specific completely random measures and, then, normalising to yield dependent random probability measures. We provide results on the partition distributions induced by latent nested processes, and develop an Markov Chain Monte Carlo sampler for Bayesian inferences. A test for distributional homogeneity across groups is obtained as a by product. The results and their inferential implications are showcased on synthetic and real data.
△ Less
Submitted 15 January, 2018;
originally announced January 2018.
-
Are Gibbs-type priors the most natural generalization of the Dirichlet process?
Authors:
P. De Blasi,
S. Favaro,
A. Lijoi,
R. H. Mena,
I. Pruenster,
M. Ruggiero
Abstract:
Discrete random probability measures and the exchangeable random partitions they induce are key tools for addressing a variety of estimation and prediction problems in Bayesian inference. Indeed, many popular nonparametric priors, such as the Dirichlet and the Pitman-Yor process priors, select discrete probability distributions almost surely and, therefore, automatically induce exchangeable random…
▽ More
Discrete random probability measures and the exchangeable random partitions they induce are key tools for addressing a variety of estimation and prediction problems in Bayesian inference. Indeed, many popular nonparametric priors, such as the Dirichlet and the Pitman-Yor process priors, select discrete probability distributions almost surely and, therefore, automatically induce exchangeable random partitions. Here we focus on the family of Gibbs-type priors, a recent and elegant generalization of the Dirichlet and the Pitman-Yor process priors. These random probability measures share properties that are appealing both from a theoretical and an applied point of view: (i) they admit an intuitive characterization in terms of their predictive structure justifying their use in terms of a precise assumption on the learning mechanism; (ii) they stand out in terms of mathematical tractability; (iii) they include several interesting special cases besides the Dirichlet and the Pitman-Yor processes. The goal of our paper is to provide a systematic and unified treatment of Gibbs-type priors and highlight their implications for Bayesian nonparametric inference. We will deal with their distributional properties, the resulting estimators, frequentist asymptotic validation and the construction of time-dependent versions. Applications, mainly concerning hierarchical mixture models and species sampling, will serve to convey the main ideas. The intuition inherent to this class of priors and the neat results that can be deduced for it lead one to wonder whether it actually represents the most natural generalization of the Dirichlet process.
△ Less
Submitted 28 February, 2015;
originally announced March 2015.
-
Bayesian inference with dependent normalized completely random measures
Authors:
Antonio Lijoi,
Bernardo Nipoti,
Igor Prünster
Abstract:
The proposal and study of dependent prior processes has been a major research focus in the recent Bayesian nonparametric literature. In this paper, we introduce a flexible class of dependent nonparametric priors, investigate their properties and derive a suitable sampling scheme which allows their concrete implementation. The proposed class is obtained by normalizing dependent completely random me…
▽ More
The proposal and study of dependent prior processes has been a major research focus in the recent Bayesian nonparametric literature. In this paper, we introduce a flexible class of dependent nonparametric priors, investigate their properties and derive a suitable sampling scheme which allows their concrete implementation. The proposed class is obtained by normalizing dependent completely random measures, where the dependence arises by virtue of a suitable construction of the Poisson random measures underlying the completely random measures. We first provide general distributional results for the whole class of dependent completely random measures and then we specialize them to two specific priors, which represent the natural candidates for concrete implementation due to their analytic tractability: the bivariate Dirichlet and normalized $σ$-stable processes. Our analytical results, and in particular the partially exchangeable partition probability function, form also the basis for the determination of a Markov Chain Monte Carlo algorithm for drawing posterior inferences, which reduces to the well-known Blackwell--MacQueen Pólya urn scheme in the univariate case. Such an algorithm can be used for density estimation and for analyzing the clustering structure of the data and is illustrated through a real two-sample dataset example.
△ Less
Submitted 2 July, 2014;
originally announced July 2014.
-
A note on "Bayesian nonparametric estimators derived from conditional Gibbs structures"
Authors:
Antonio Lijoi,
Igor Prünster,
Stephen G. Walker
Abstract:
A note on "Bayesian nonparametric estimators derived from conditional Gibbs structures" by Antonio Lijoi, Igor Prünster, Stephen G. Walker [arXiv:0808.2863].
A note on "Bayesian nonparametric estimators derived from conditional Gibbs structures" by Antonio Lijoi, Igor Prünster, Stephen G. Walker [arXiv:0808.2863].
△ Less
Submitted 16 January, 2014;
originally announced January 2014.
-
Conditional formulae for Gibbs-type exchangeable random partitions
Authors:
Stefano Favaro,
Antonio Lijoi,
Igor Prünster
Abstract:
Gibbs-type random probability measures and the exchangeable random partitions they induce represent an important framework both from a theoretical and applied point of view. In the present paper, motivated by species sampling problems, we investigate some properties concerning the conditional distribution of the number of blocks with a certain frequency generated by Gibbs-type random partitions. T…
▽ More
Gibbs-type random probability measures and the exchangeable random partitions they induce represent an important framework both from a theoretical and applied point of view. In the present paper, motivated by species sampling problems, we investigate some properties concerning the conditional distribution of the number of blocks with a certain frequency generated by Gibbs-type random partitions. The general results are then specialized to three noteworthy examples yielding completely explicit expressions of their distributions, moments and asymptotic behaviors. Such expressions can be interpreted as Bayesian nonparametric estimators of the rare species variety and their performance is tested on some real genomic data.
△ Less
Submitted 5 September, 2013;
originally announced September 2013.
-
A Bayesian nonparametric approach to modeling market share dynamics
Authors:
Igor Prünster,
Matteo Ruggiero
Abstract:
We propose a flexible stochastic framework for modeling the market share dynamics over time in a multiple markets setting, where firms interact within and between markets. Firms undergo stochastic idiosyncratic shocks, which contract their shares, and compete to consolidate their position by acquiring new ones in both the market where they operate and in new markets. The model parameters can meani…
▽ More
We propose a flexible stochastic framework for modeling the market share dynamics over time in a multiple markets setting, where firms interact within and between markets. Firms undergo stochastic idiosyncratic shocks, which contract their shares, and compete to consolidate their position by acquiring new ones in both the market where they operate and in new markets. The model parameters can meaningfully account for phenomena such as barriers to entry and exit, fixed and sunk costs, costs of expanding to new sectors with different technologies and competitive advantage among firms. The construction is obtained in a Bayesian framework by means of a collection of nonparametric hierarchical mixtures, which induce the dependence between markets and provide a generalization of the Blackwell-MacQueen Pólya urn scheme, which in turn is used to generate a partially exchangeable dynamical particle system. A Markov Chain Monte Carlo algorithm is provided for simulating trajectories of the system, by means of which we perform a simulation study for transitions to different economic regimes. Moreover, it is shown that the infinite-dimensional properties of the system, when appropriately transformed and rescaled, are those of a collection of interacting Fleming-Viot diffusions.
△ Less
Submitted 1 February, 2013;
originally announced February 2013.
-
Asymptotics for a Bayesian nonparametric estimator of species variety
Authors:
Stefano Favaro,
Antonio Lijoi,
Igor Prünster
Abstract:
In Bayesian nonparametric inference, random discrete probability measures are commonly used as priors within hierarchical mixture models for density estimation and for inference on the clustering of the data. Recently, it has been shown that they can also be exploited in species sampling problems: indeed they are natural tools for modeling the random proportions of species within a population thus…
▽ More
In Bayesian nonparametric inference, random discrete probability measures are commonly used as priors within hierarchical mixture models for density estimation and for inference on the clustering of the data. Recently, it has been shown that they can also be exploited in species sampling problems: indeed they are natural tools for modeling the random proportions of species within a population thus allowing for inference on various quantities of statistical interest. For applications that involve large samples, the exact evaluation of the corresponding estimators becomes impracticable and, therefore, asymptotic approximations are sought. In the present paper, we study the limiting behaviour of the number of new species to be observed from further sampling, conditional on observed data, assuming the observations are exchangeable and directed by a normalized generalized gamma process prior. Such an asymptotic study highlights a connection between the normalized generalized gamma process and the two-parameter Poisson-Dirichlet process that was previously known only in the unconditional case.
△ Less
Submitted 23 November, 2012;
originally announced November 2012.
-
Exchangeable Hoeffding decompositions over finite sets: a characterization and counterexamples
Authors:
Omar El-Dakkak,
Giovanni Peccati,
Igor Prünster
Abstract:
We study Hoeffding decomposable exchangeable sequences with values in a finite set D. We provide a new combinatorial characterization of Hoeffding decomposability and use this result to show that, if the cardinality of D is strictly greater than 2, then there exists a class of neither Pólya nor i.i.d. D-valued exchangeable sequences that are Hoeffding decomposable. The construction of such sequenc…
▽ More
We study Hoeffding decomposable exchangeable sequences with values in a finite set D. We provide a new combinatorial characterization of Hoeffding decomposability and use this result to show that, if the cardinality of D is strictly greater than 2, then there exists a class of neither Pólya nor i.i.d. D-valued exchangeable sequences that are Hoeffding decomposable. The construction of such sequences is based on some ideas appearing in Hill, Lane and Sudderth [1987] and answers a question left open in El-Dakkak and Peccati [2008].
△ Less
Submitted 23 May, 2012;
originally announced May 2012.
-
On the posterior distribution of classes of random means
Authors:
Lancelot F. James,
Antonio Lijoi,
Igor Prünster
Abstract:
The study of properties of mean functionals of random probability measures is an important area of research in the theory of Bayesian nonparametric statistics. Many results are now known for random Dirichlet means, but little is known, especially in terms of posterior distributions, for classes of priors beyond the Dirichlet process. In this paper, we consider normalized random measures with ind…
▽ More
The study of properties of mean functionals of random probability measures is an important area of research in the theory of Bayesian nonparametric statistics. Many results are now known for random Dirichlet means, but little is known, especially in terms of posterior distributions, for classes of priors beyond the Dirichlet process. In this paper, we consider normalized random measures with independent increments (NRMI's) and mixtures of NRMI. In both cases, we are able to provide exact expressions for the posterior distribution of their means. These general results are then specialized, leading to distributional results for means of two important particular cases of NRMI's and also of the two-parameter Poisson--Dirichlet process.
△ Less
Submitted 23 February, 2010;
originally announced February 2010.
-
Asymptotics for posterior hazards
Authors:
Pierpaolo De Blasi,
Giovanni Peccati,
Igor Prünster
Abstract:
An important issue in survival analysis is the investigation and the modeling of hazard rates. Within a Bayesian nonparametric framework, a natural and popular approach is to model hazard rates as kernel mixtures with respect to a completely random measure. In this paper we provide a comprehensive analysis of the asymptotic behavior of such models. We investigate consistency of the posterior dis…
▽ More
An important issue in survival analysis is the investigation and the modeling of hazard rates. Within a Bayesian nonparametric framework, a natural and popular approach is to model hazard rates as kernel mixtures with respect to a completely random measure. In this paper we provide a comprehensive analysis of the asymptotic behavior of such models. We investigate consistency of the posterior distribution and derive fixed sample size central limit theorems for both linear and quadratic functionals of the posterior hazard rate. The general results are then specialized to various specific kernels and mixing measures yielding consistency under minimal conditions and neat central limit theorems for the distribution of functionals.
△ Less
Submitted 13 August, 2009;
originally announced August 2009.
-
Bayesian nonparametric estimators derived from conditional Gibbs structures
Authors:
Antonio Lijoi,
Igor Prünster,
Stephen G. Walker
Abstract:
We consider discrete nonparametric priors which induce Gibbs-type exchangeable random partitions and investigate their posterior behavior in detail. In particular, we deduce conditional distributions and the corresponding Bayesian nonparametric estimators, which can be readily exploited for predicting various features of additional samples. The results provide useful tools for genomic applicatio…
▽ More
We consider discrete nonparametric priors which induce Gibbs-type exchangeable random partitions and investigate their posterior behavior in detail. In particular, we deduce conditional distributions and the corresponding Bayesian nonparametric estimators, which can be readily exploited for predicting various features of additional samples. The results provide useful tools for genomic applications where prediction of future outcomes is required.
△ Less
Submitted 21 August, 2008;
originally announced August 2008.
-
On rates of convergence for posterior distributions in infinite-dimensional models
Authors:
Stephen G. Walker,
Antonio Lijoi,
Igor Prünster
Abstract:
This paper introduces a new approach to the study of rates of convergence for posterior distributions. It is a natural extension of a recent approach to the study of Bayesian consistency. In particular, we improve on current rates of convergence for models including the mixture of Dirichlet process model and the random Bernstein polynomial model.
This paper introduces a new approach to the study of rates of convergence for posterior distributions. It is a natural extension of a recent approach to the study of Bayesian consistency. In particular, we improve on current rates of convergence for models including the mixture of Dirichlet process model and the random Bernstein polynomial model.
△ Less
Submitted 14 August, 2007;
originally announced August 2007.
-
Linear and quadratic functionals of random hazard rates: an asymptotic analysis
Authors:
Giovanni Peccati,
Igor Prünster
Abstract:
A popular Bayesian nonparametric approach to survival analysis consists in modeling hazard rates as kernel mixtures driven by a completely random measure. In this paper we derive asymptotic results for linear and quadratic functionals of such random hazard rates. In particular, we prove central limit theorems for the cumulative hazard function and for the path-second moment and path-variance of…
▽ More
A popular Bayesian nonparametric approach to survival analysis consists in modeling hazard rates as kernel mixtures driven by a completely random measure. In this paper we derive asymptotic results for linear and quadratic functionals of such random hazard rates. In particular, we prove central limit theorems for the cumulative hazard function and for the path-second moment and path-variance of the hazard rate. Our techniques are based on recently established criteria for the weak convergence of single and double stochastic integrals with respect to Poisson random measures. We illustrate our results by considering specific models involving kernels and random measures commonly exploited in practice.
△ Less
Submitted 21 November, 2006;
originally announced November 2006.
-
Distributions of linear functionals of two parameter Poisson--Dirichlet random measures
Authors:
Lancelot F. James,
Antonio Lijoi,
Igor Prünster
Abstract:
The present paper provides exact expressions for the probability distributions of linear functionals of the two-parameter Poisson--Dirichlet process $\operatorname {PD}(α,θ)$. We obtain distributional results yielding exact forms for density functions of these functionals. Moreover, several interesting integral identities are obtained by exploiting a correspondence between the mean of a Poisson-…
▽ More
The present paper provides exact expressions for the probability distributions of linear functionals of the two-parameter Poisson--Dirichlet process $\operatorname {PD}(α,θ)$. We obtain distributional results yielding exact forms for density functions of these functionals. Moreover, several interesting integral identities are obtained by exploiting a correspondence between the mean of a Poisson--Dirichlet process and the mean of a suitable Dirichlet process. Finally, some distributional characterizations in terms of mixture representations are proved. The usefulness of the results contained in the paper is demonstrated by means of some illustrative examples. Indeed, our formulae are relevant to occupation time phenomena connected with Brownian motion and more general Bessel processes, as well as to models arising in Bayesian nonparametric statistics.
△ Less
Submitted 31 March, 2008; v1 submitted 18 September, 2006;
originally announced September 2006.
-
Normalized random measures driven by increasing additive processes
Authors:
Luis E. Nieto-Barajas,
Igor Prunster,
Stephen G. Walker
Abstract:
This paper introduces and studies a new class of nonparametric prior distributions. Random probability distribution functions are constructed via normalization of random measures driven by increasing additive processes. In particular, we present results for the distribution of means under both prior and posterior conditions and, via the use of strategic latent variables, undertake a full Bayesia…
▽ More
This paper introduces and studies a new class of nonparametric prior distributions. Random probability distribution functions are constructed via normalization of random measures driven by increasing additive processes. In particular, we present results for the distribution of means under both prior and posterior conditions and, via the use of strategic latent variables, undertake a full Bayesian analysis. Our class of priors includes the well-known and widely used mixture of a Dirichlet process.
△ Less
Submitted 30 August, 2005;
originally announced August 2005.
-
Baysian inference via classes of normalized random measures
Authors:
Lancelot F. James,
Antonio Lijoi,
Igor Pruenster
Abstract:
One of the main research areas in Bayesian Nonparametrics is the proposal and study of priors which generalize the Dirichlet process. Here we exploit theoretical properties of Poisson random measures in order to provide a comprehensive Bayesian analysis of random probabilities which are obtained by an appropriate normalization. Specifically we achieve explicit and tractable forms of the posterio…
▽ More
One of the main research areas in Bayesian Nonparametrics is the proposal and study of priors which generalize the Dirichlet process. Here we exploit theoretical properties of Poisson random measures in order to provide a comprehensive Bayesian analysis of random probabilities which are obtained by an appropriate normalization. Specifically we achieve explicit and tractable forms of the posterior and the marginal distributions, including an explicit and easily used description of generalizations of the important Blackwell-MacQueen Pólya urn distribution. Such simplifications are achieved by the use of a latent variable which admits quite interesting interpretations which allow to gain a better understanding of the behaviour of these random probability measures. It is noteworthy that these models are generalizations of models considered by Kingman (1975) in a non-Bayesian context. Such models are known to play a significant role in a variety of applications including genetics, physics, and work involving random mappings and assemblies. Hence our analysis is of utility in those contexts as well. We also show how our results may be applied to Bayesian mixture models and describe computational schemes which are generalizations of known efficient methods for the case of the Dirichlet process. We illustrate new examples of processes which can play the role of priors for Bayesian nonparametric inference and finally point out some interesting connections with the theory of generalized gamma convolutions initiated by Thorin and further developed by Bondesson.
△ Less
Submitted 18 March, 2005;
originally announced March 2005.