Multiplicative Decomposition of Heterogeneity in Mixtures of Continuous Distributions
Authors:
Abraham Nunes,
Martin Alda,
Thomas Trappenberg
Abstract:
A system's heterogeneity (\textit{diversity}) is the effective size of its event space, and can be quantified using the Rényi family of indices (also known as Hill numbers in ecology or Hannah-Kay indices in economics), which are indexed by an elasticity parameter $q \geq 0$. Under these indices, the heterogeneity of a composite system (the $γ$-heterogeneity) is decomposable into heterogeneity ari…
▽ More
A system's heterogeneity (\textit{diversity}) is the effective size of its event space, and can be quantified using the Rényi family of indices (also known as Hill numbers in ecology or Hannah-Kay indices in economics), which are indexed by an elasticity parameter $q \geq 0$. Under these indices, the heterogeneity of a composite system (the $γ$-heterogeneity) is decomposable into heterogeneity arising from variation \textit{within} and \textit{between} component subsystems (the $α$- and $β$-heterogeneity, respectively). Since the average heterogeneity of a component subsystem should not be greater than that of the pooled system, we require that $γ\geq α$. There exists a multiplicative decomposition for Rényi heterogeneity of composite systems with discrete event spaces, but less attention has been paid to decomposition in the continuous setting. We therefore describe multiplicative decomposition of the Rényi heterogeneity for continuous mixture distributions under parametric and non-parametric pooling assumptions. Under non-parametric pooling, the $γ$-heterogeneity must often be estimated numerically, but the multiplicative decomposition holds such that $γ\geq α$ for $q > 0$. Conversely, under parametric pooling, $γ$-heterogeneity can be computed efficiently in closed-form, but the $γ\geq α$ condition holds reliably only at $q=1$. Our findings will further contribute to heterogeneity measurement in continuous systems.
△ Less
Submitted 17 June, 2020; v1 submitted 22 February, 2020;
originally announced February 2020.
Representational Rényi heterogeneity
Authors:
Abraham Nunes,
Martin Alda,
Timothy Bardouille,
Thomas Trappenberg
Abstract:
A discrete system's heterogeneity is measured by the Rényi heterogeneity family of indices (also known as Hill numbers or Hannah--Kay indices), whose units are {the numbers equivalent}. Unfortunately, numbers equivalent heterogeneity measures for non-categorical data require {a priori} (A) categorical partitioning and (B) pairwise distance measurement on the observable data space, thereby precludi…
▽ More
A discrete system's heterogeneity is measured by the Rényi heterogeneity family of indices (also known as Hill numbers or Hannah--Kay indices), whose units are {the numbers equivalent}. Unfortunately, numbers equivalent heterogeneity measures for non-categorical data require {a priori} (A) categorical partitioning and (B) pairwise distance measurement on the observable data space, thereby precluding application to problems with ill-defined categories or where semantically relevant features must be learned as abstractions from some data. We thus introduce representational Rényi heterogeneity (RRH), which transforms an observable domain onto a latent space upon which the Rényi heterogeneity is both tractable and semantically relevant. This method requires neither {a priori} binning nor definition of a distance function on the observable space. We show that RRH can generalize existing biodiversity and economic equality indices. Compared with existing indices on a beta-mixture distribution, we show that RRH responds more appropriately to changes in mixture component separation and weighting. Finally, we demonstrate the measurement of RRH in a set of natural images, with respect to abstract representations learned by a deep neural network. The RRH approach will further enable heterogeneity measurement in disciplines whose data do not easily conform to the assumptions of existing indices.
△ Less
Submitted 6 April, 2020; v1 submitted 10 December, 2019;
originally announced December 2019.