-
Inverse clustering of Gibbs Partitions via independent fragmentation and dual dependent coagulation operators
Authors:
Man Wai Ho,
Lancelot F. James,
John W. Lau
Abstract:
Gibbs partitions of the integers generated by stable subordinators of index $α\in(0,1)$ form remarkable classes of random partitions where in principle much is known about their properties, including practically effortless obtainment of otherwise complex asymptotic results potentially relevant to applications in general combinatorial stochastic processes, random tree/graph growth models and Bayesi…
▽ More
Gibbs partitions of the integers generated by stable subordinators of index $α\in(0,1)$ form remarkable classes of random partitions where in principle much is known about their properties, including practically effortless obtainment of otherwise complex asymptotic results potentially relevant to applications in general combinatorial stochastic processes, random tree/graph growth models and Bayesian statistics. This class includes the well-known models based on the two-parameter Poisson-Dirichlet distribution which forms the bulk of explicit applications. This work continues efforts to provide interpretations for a larger classes of Gibbs partitions by embedding important operations within this framework. Here we address the formidable problem of extending the dual, infinite-block, coagulation/fragmentation results of Jim Pitman (1999, Annals of Probability), where in terms of coagulation they are based on independent two-parameter Poisson-Dirichlet distributions, to all such Gibbs (stable Poisson-Kingman) models. Our results create nested families of Gibbs partitions, and corresponding mass partitions, over any $0<β<α<1.$ We primarily focus on the fragmentation operations, which remain independent in this setting, and corresponding remarkable calculations for Gibbs partitions derived from that operation. We also present definitive results for the dual coagulation operations, now based on our construction of dependent processes, and demonstrate its relatively simple application in terms of Mittag-Leffler and generalized gamma models. The latter demonstrates another approach to recover the duality results in Pitman (1999).
△ Less
Submitted 21 November, 2022;
originally announced November 2022.
-
Gibbs Partitions, Riemann-Liouville Fractional Operators, Mittag-Leffler Functions, and Fragmentations Derived From Stable Subordinators
Authors:
Man-Wai Ho,
Lancelot F. James,
John W. Lau
Abstract:
Pitman(2003)(and subsequently Gnedin and Pitman (2006) showed that a large class of random partitions of the integers derived from a stable subordinator of index $α\in(0,1)$ have infinite Gibbs (product) structure as a characterizing feature. The most notable case are random partitions derived from the two-parameter Poisson-Dirichlet distribution, $\mathrm{PD}(α,θ)$, which are induced by mixing ov…
▽ More
Pitman(2003)(and subsequently Gnedin and Pitman (2006) showed that a large class of random partitions of the integers derived from a stable subordinator of index $α\in(0,1)$ have infinite Gibbs (product) structure as a characterizing feature. The most notable case are random partitions derived from the two-parameter Poisson-Dirichlet distribution, $\mathrm{PD}(α,θ)$, which are induced by mixing over variables with generalized Mittag-Leffler distributions, denoted by $\mathrm{ML}(α,θ).$ Our aim in this work is to provide indications on the utility of the wider class of Gibbs partitions as it relates to a study of Riemann-Liouville fractional integrals and size-biased sampling, decompositions of special functions, and its potential use in the understanding of various constructions of more exotic processes. We provide novel characterizations of general laws associated with two nested families of $\mathrm{PD}(α,θ)$ mass partitions that are constructed from notable fragmentation operations described in Dong, Goldschmidt and Martin(2006) and Pitman(1999), respectively. These operations are known to be related in distribution to various constructions of discrete random trees/graphs in $[n],$ and their scaling limits, such as stable trees. A centerpiece of our work are results related to Mittag-Leffler functions, which play a key role in fractional calculus and are otherwise Laplace transforms of the $\mathrm{ML}(α,θ)$ variables. Notably, this leads to an interpretation of $\mathrm{PD}(α,θ)$ laws within a mixed Poisson waiting time framework based on $\mathrm{ML}(α,θ)$ variables, which suggests connections to recent construction of Pólya urn models with random immigration by Peköz, Röllin and Ross(2018). Simplifications in the Brownian case are highlighted.
△ Less
Submitted 28 July, 2018; v1 submitted 14 February, 2018;
originally announced February 2018.
-
A conjugate class of random probability measures based on tilting and with its posterior analysis
Authors:
John W. Lau
Abstract:
This article constructs a class of random probability measures based on exponentially and polynomially tilting operated on the laws of completely random measures. The class is proved to be conjugate in that it covers both prior and posterior random probability measures in the Bayesian sense. Moreover, the class includes some common and widely used random probability measures, the normalized comple…
▽ More
This article constructs a class of random probability measures based on exponentially and polynomially tilting operated on the laws of completely random measures. The class is proved to be conjugate in that it covers both prior and posterior random probability measures in the Bayesian sense. Moreover, the class includes some common and widely used random probability measures, the normalized completely random measures (James (Poisson process partition calculus with applications to exchangeable models and Bayesian nonparametrics (2002) Preprint), Regazzini, Lijoi and Prünster (Ann. Statist. 31 (2003) 560-585), Lijoi, Mena and Prünster (J. Amer. Statist. Assoc. 100 (2005) 1278-1291)) and the Poisson-Dirichlet process (Pitman and Yor (Ann. Probab. 25 (1997) 855-900), Ishwaran and James (J. Amer. Statist. Assoc. 96 (2001) 161-173), Pitman (In Science and Statistics: A Festschrift for Terry Speed (2003) 1-34 IMS)), in a single construction. We describe an augmented version of the Blackwell-MacQueen Pólya urn sampling scheme (Blackwell and MacQueen (Ann. Statist. 1 (1973) 353-355)) that simplifies implementation and provide a simulation study for approximating the probabilities of partition sizes.
△ Less
Submitted 18 December, 2013;
originally announced December 2013.
-
Bayesian nonparametric estimation and consistency of mixed multinomial logit choice models
Authors:
Pierpaolo De Blasi,
Lancelot F. James,
John W. Lau
Abstract:
This paper develops nonparametric estimation for discrete choice models based on the mixed multinomial logit (MMNL) model. It has been shown that MMNL models encompass all discrete choice models derived under the assumption of random utility maximization, subject to the identification of an unknown distribution $G$. Noting the mixture model description of the MMNL, we employ a Bayesian nonparametr…
▽ More
This paper develops nonparametric estimation for discrete choice models based on the mixed multinomial logit (MMNL) model. It has been shown that MMNL models encompass all discrete choice models derived under the assumption of random utility maximization, subject to the identification of an unknown distribution $G$. Noting the mixture model description of the MMNL, we employ a Bayesian nonparametric approach, using nonparametric priors on the unknown mixing distribution $G$, to estimate choice probabilities. We provide an important theoretical support for the use of the proposed methodology by investigating consistency of the posterior distribution for a general nonparametric prior on the mixing distribution. Consistency is defined according to an $L_1$-type distance on the space of choice probabilities and is achieved by extending to a regression model framework a recent approach to strong consistency based on the summability of square roots of prior probabilities. Moving to estimation, slightly different techniques for non-panel and panel data models are discussed. For practical implementation, we describe efficient and relatively easy-to-use blocked Gibbs sampling procedures. These procedures are based on approximations of the random probability measure by classes of finite stick-breaking processes. A simulation study is also performed to investigate the performance of the proposed methods.
△ Less
Submitted 24 February, 2011;
originally announced February 2011.
-
Gibbs Partitions (EPPF's) Derived From a Stable Subordinator are Fox H and Meijer G Transforms
Authors:
Man-Wai Ho,
Lancelot F. James,
John W. Lau
Abstract:
This paper derives explicit results for the infinite Gibbs partitions generated by the jumps of an $α-$stable subordinator, derived in Pitman \cite{Pit02, Pit06}. We first show that for general $α$ the conditional EPPF can be represented as ratios of Fox-$H$ functions, and in the case of rational $α,$ Meijer-G functions. Furthermore the results show that the resulting unconditional EPPF's, can b…
▽ More
This paper derives explicit results for the infinite Gibbs partitions generated by the jumps of an $α-$stable subordinator, derived in Pitman \cite{Pit02, Pit06}. We first show that for general $α$ the conditional EPPF can be represented as ratios of Fox-$H$ functions, and in the case of rational $α,$ Meijer-G functions. Furthermore the results show that the resulting unconditional EPPF's, can be expressed in terms of H and G transforms indexed by a function h. Hence when h is itself a H or G function the EPPF is also an H or G function. An implication, in the case of rational $α,$ is that one can compute explicitly thousands of EPPF's derived from possibly exotic special functions. This would also apply to all $α$ except that computations for general Fox functions are not yet available. However, moving away from special functions, we demonstrate how results from probability theory may be used to obtain calculations. We show that a forward recursion can be applied that only requires calculation of the simplest components. Additionally we identify general classes of EPPF's where explicit calculations can be carried out using distribution theory.
△ Less
Submitted 29 August, 2007; v1 submitted 4 August, 2007;
originally announced August 2007.
-
Coagulation Fragmentation Laws Induced By General Coagulations of Two-Parameter Poisson-Dirichlet Processes
Authors:
Man-Wai Ho,
Lancelot F. James,
John W. Lau
Abstract:
Pitman~(1999) describes a duality relationship between fragmentation and coagulation operators. An explicit relationship is described for the two-parameter Poisson-Dirichlet laws, with parameters {\footnotesize $(α,θ)$} and $(β,θ/α)$, wherein $PD(α, θ)$ is coagulated by $PD(β,θ/α)$ for $0<α<1$, $0 \leqβ<1$ and $-β<θ/α$. This remarkable explicit agreement was obtained by combinatorial methods via…
▽ More
Pitman~(1999) describes a duality relationship between fragmentation and coagulation operators. An explicit relationship is described for the two-parameter Poisson-Dirichlet laws, with parameters {\footnotesize $(α,θ)$} and $(β,θ/α)$, wherein $PD(α, θ)$ is coagulated by $PD(β,θ/α)$ for $0<α<1$, $0 \leqβ<1$ and $-β<θ/α$. This remarkable explicit agreement was obtained by combinatorial methods via exchangeable partition probability functions~(EPPF). This work discusses an alternative analysis which can feasibly extend the characterizations above to more general models of $PD(α,θ)$ coagulated with some law $Q$. The analysis exploits distributional relationships between compositions of species sampling random probability measures and coagulation operators and recent work on Cauchy-Stieltjes transforms of random probability measures by Vershik, Yor and Tsilevich (2004) and James (2002). We use this to obtain explicit descriptions in the case where {\footnotesize $Q$} corresponds to a large class of power tempered Poisson Kingman models analyzed in James~(2002). That is, explicit results are obtained for models outside of the $PD(β,θ/α)$ family.
△ Less
Submitted 25 January, 2006;
originally announced January 2006.
-
A Class of Generalized Hyperbolic Continuous Time Integrated Stochastic Volatility Likelihood Models
Authors:
Lancelot F. James,
John W. Lau
Abstract:
This paper discusses and analyzes a class of likelihood models which are based on two distributional innovations in financial models for stock returns. That is, the notion that the marginal distribution of aggregate returns of log-stock prices are well approximated by generalized hyperbolic distributions, and that volatility clustering can be handled by specifying the integrated volatility as a…
▽ More
This paper discusses and analyzes a class of likelihood models which are based on two distributional innovations in financial models for stock returns. That is, the notion that the marginal distribution of aggregate returns of log-stock prices are well approximated by generalized hyperbolic distributions, and that volatility clustering can be handled by specifying the integrated volatility as a random process such as that proposed in a recent series of papers by Barndorff-Nielsen and Shephard (BNS). The BNS models produce likelihoods for aggregate returns which can be viewed as a subclass of latent regression models where one has n conditionally independent Normal random variables whose mean and variance are representable as linear functionals of a common unobserved Poisson random measure. James (2005b) recently obtains an exact analysis for such models yielding expressions of the likelihood in terms of quite tractable Fourier-Cosine integrals. Here, our idea is to analyze a class of likelihoods, which can be used for similar purposes, but where the latent regression models are based on n conditionally independent models with distributions belonging to a subclass of the generalized hyperbolic distributions and whose corresponding parameters are representable as linear functionals of a common unobserved Poisson random measure. Our models are perhaps most closely related to the Normal inverse Gaussian/GARCH/A-PARCH models of Brandorff-Nielsen (1997) and Jensen and Lunde (2001), where in our case the GARCH component is replaced by quantities such as INT-OU processes. It is seen that, importantly, such likelihood models exhibit quite different features structurally. One nice feature of the model is that it allows for more flexibility in terms of modelling of external regression parameters.
△ Less
Submitted 3 March, 2005;
originally announced March 2005.