-
The Zeta Tail Distribution: A Novel Event-Count Model
Authors:
Michael R. Powers
Abstract:
We introduce the Zeta Tail(a) probability distribution as a new model for random damage-event counts in risk analysis. Although readily motivated as an analogue of the Geometric(p) distribution, Zeta Tail(a) has received little attention in the scholarly literature. In the present work, we begin by deriving various fundamental properties of this novel distribution. We then assess its usefulness as an alternative to Geometric(p), both theoretically and through application to a set of meteorological data. Lastly, we discuss conceptual differences between employing the Zeta Tail(a) model conditionally (i.e., given observed data with certain known characteristics) and unconditionally (i.e., for arbitrary, as yet unobserved data).
Submitted 8 July, 2025; v1 submitted 20 June, 2025;
originally announced June 2025.
-
Assessing Risk Heterogeneity through Heavy-Tailed Frequency and Severity Mixtures
Authors:
Michael R. Powers,
Jiaxin Xu
Abstract:
In operational risk management and actuarial finance, the analysis of risk often begins by dividing a random damage-generation process into its separate frequency and severity components. In the present article, we construct canonical families of mixture distributions for each of these components, based on a Negative Binomial kernel for frequency and a Gamma kernel for severity. The mixtures are employed to assess the heterogeneity of risk factors underlying an empirical distribution through the shape of the implied mixing distribution. From the duality of the Negative Binomial and Gamma distributions, we first derive necessary and sufficient conditions for heavy-tailed (i.e., inverse power-law) canonical mixtures. We then formulate flexible 4-parameter families of mixing distributions for Geometric and Exponential kernels to generate heavy-tailed 4-parameter mixture models, and extend these mixtures to arbitrary Negative Binomial and Gamma kernels, respectively, yielding 5-parameter mixtures for detecting and measuring risk heterogeneity. To check the robustness of such heterogeneity inferences, we show how a fitted 5-parameter model may be re-expressed in terms of alternative Negative Binomial or Gamma kernels whose associated mixing distributions form a "calibrated" family.
Submitted 16 June, 2025; v1 submitted 7 May, 2025;
originally announced May 2025.
-
A Criterion for Extending Continuous-Mixture Identifiability Results
Authors:
Michael R. Powers,
Jiaxin Xu
Abstract:
Mixture distributions provide a versatile and widely used framework for modeling random phenomena, and are particularly well-suited to the analysis of geoscientific processes and their attendant risks to society. For continuous mixtures of random variables, we specify a simple criterion - generating-function accessibility - to extend previously known kernel-based identifiability (or unidentifiability) results to new kernel distributions. This criterion, based on functional relationships between the relevant kernels' moment-generating functions or Laplace transforms, may be applied to continuous mixtures of both discrete and continuous random variables. To illustrate the proposed approach, we present results for several specific kernels, in each case briefly noting its relevance to research in the geosciences and/or related risk analysis.
Submitted 16 June, 2025; v1 submitted 5 March, 2025;
originally announced March 2025.
-
Using Linked Micromaps for Evidence-Based Policy
Authors:
Randall Powers,
John Eltinge,
Wendy Martinez,
Darcy Steeg Morris
Abstract:
Linked micromaps were originally developed to display geographically indexed statistics in an intuitive way by connecting them to a sequence of small maps. The approach integrates several visualization design principles, such as small multiples, discrete color indexing, and ordering. Linked micromaps allow for other types of data displays that are connected to and conditional on geographic areas. Initial applications of micromaps used data from the National Cancer Institute and the Environmental Protection Agency. In this paper, we will show how linked micromaps can be used to better understand and explore relationships and distributions of statistics linked to US states and Washington, DC. We will compare linked micromaps with other popular data displays of geographic data, such as bubble maps, choropleth maps, and bar charts. We will illustrate how linked micromaps can be used for evidence-based decision-making using data from the Bureau of Labor Statistics, the Census Bureau, and the Economic Research Service. The presentations, R scripts, and the data sets used in this article are available here: https://github.com/wlmcensus/Joint-Statistical-Meetings-Presentation-2024. The work discussed in this article was presented at the Joint Statistical Meetings (JSM) 2024 and the American Association for Public Opinion Research (AAPOR) 2024 Annual Conference.
Submitted 13 November, 2024; v1 submitted 6 November, 2024;
originally announced November 2024.
-
A Versatility Measure for Parametric Risk Models
Authors:
Michael R. Powers,
Jiaxin Xu
Abstract:
Parametric statistical methods play a central role in analyzing risk through its underlying frequency and severity components. Given the wide availability of numerical algorithms and high-speed computers, researchers and practitioners often model these separate (although possibly statistically dependent) random variables by fitting a large number of parametric probability distributions to historical data and then comparing goodness-of-fit statistics. However, this approach is highly susceptible to problems of overfitting because it gives insufficient weight to fundamental considerations of functional simplicity and adaptability. To address this shortcoming, we propose a formal mathematical measure for assessing the versatility of frequency and severity distributions prior to their application. We then illustrate this approach by computing and comparing values of the versatility measure for a variety of probability distributions commonly used in risk analysis.
Submitted 16 June, 2025; v1 submitted 27 July, 2024;
originally announced July 2024.
-
Heavy-Tailed Loss Frequencies from Mixtures of Negative Binomial and Poisson Counts
Authors:
Jiansheng Dai,
Ziheng Huang,
Michael R. Powers,
Jiaxin Xu
Abstract:
Heavy-tailed random variables have been used in insurance research to model both loss frequencies and loss severities, with substantially more emphasis on the latter. In the present work, we take a step toward addressing this imbalance by exploring the class of heavy-tailed frequency models formed by continuous mixtures of Negative Binomial and Poisson random variables. We begin by defining the concept of a calibrative family of mixing distributions (each member of which is identifiable from its associated Negative Binomial mixture), and show how to construct such families from only a single member. We then introduce a new heavy-tailed frequency model -- the two-parameter ZY distribution -- as a generalization of both the one-parameter Zeta and Yule distributions, and construct calibrative families for both the new distribution and the heavy-tailed two-parameter Waring distribution. Finally, we pursue natural extensions of both the ZY and Waring families to a unifying, four-parameter heavy-tailed model, providing the foundation for a novel loss-frequency modeling approach to complement conventional GLM analyses. This approach is illustrated by application to a classic set of Swedish commercial motor-vehicle insurance loss data.
Submitted 10 November, 2022; v1 submitted 7 November, 2022;
originally announced November 2022.
-
Characterizing the Zeta Distribution via Continuous Mixtures
Authors:
Jiansheng Dai,
Ziheng Huang,
Michael R. Powers,
Jiaxin Xu
Abstract:
We offer two novel characterizations of the Zeta distribution: first, as tractable continuous mixtures of Negative Binomial distributions (with fixed shape parameter, r > 0), and second, as a tractable continuous mixture of Poisson distributions. In both the Negative Binomial case for r >= 1 and the Poisson case, the resulting Zeta distributions are identifiable because each mixture can be associated with a unique mixing distribution. In the Negative Binomial case for 0 < r < 1, the mixing distributions are quasi-distributions (for which the quasi-probability density function assumes some negative values).
Submitted 4 June, 2021; v1 submitted 14 August, 2020;
originally announced August 2020.