-
Locally tail-scale invariant scoring rules for evaluation of extreme value forecasts
Authors:
Helga Kristin Olafsdottir,
Holger Rootzén,
David Bolin
Abstract:
Statistical analysis of extremes can be used to predict the probability of future extreme events, such as large rainfalls or devastating windstorms. The quality of these forecasts can be measured through scoring rules. Locally scale invariant scoring rules give equal importance to the forecasts at different locations regardless of differences in the prediction uncertainty. This is a useful feature…
▽ More
Statistical analysis of extremes can be used to predict the probability of future extreme events, such as large rainfalls or devastating windstorms. The quality of these forecasts can be measured through scoring rules. Locally scale invariant scoring rules give equal importance to the forecasts at different locations regardless of differences in the prediction uncertainty. This is a useful feature when computing average scores but can be an unnecessarily strict requirement when mostly concerned with extremes. We propose the concept of local weight-scale invariance, describing scoring rules fulfilling local scale invariance in a certain region of interest, and as a special case local tail-scale invariance, for large events. Moreover, a new version of the weighted Continuous Ranked Probability score (wCRPS) called the scaled wCRPS (swCRPS) that possesses this property is developed and studied. The score is a suitable alternative for scoring extreme value models over areas with varying scale of extreme events, and we derive explicit formulas of the score for the Generalised Extreme Value distribution. The scoring rules are compared through simulation, and their usage is illustrated in modelling of extreme water levels, annual maximum rainfalls, and in an application to non-extreme forecast for the prediction of air pollution.
△ Less
Submitted 21 February, 2024; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Multivariate generalized Pareto distributions: parametrizations, representations, and properties
Authors:
Holger Rootzén,
Johan Segers,
Jennifer L. Wadsworth
Abstract:
Multivariate generalized Pareto distributions arise as the limit distributions of exceedances over multivariate thresholds of random vectors in the domain of attraction of a max-stable distribution. These distributions can be parametrized and represented in a number of different ways. Moreover, generalized Pareto distributions enjoy a number of interesting stability properties. An overview of the…
▽ More
Multivariate generalized Pareto distributions arise as the limit distributions of exceedances over multivariate thresholds of random vectors in the domain of attraction of a max-stable distribution. These distributions can be parametrized and represented in a number of different ways. Moreover, generalized Pareto distributions enjoy a number of interesting stability properties. An overview of the main features of such distributions are given, expressed compactly in several parametrizations, giving the potential user of these distributions a convenient catalogue of ways to handle and work with generalized Pareto distributions.
△ Less
Submitted 22 May, 2017;
originally announced May 2017.
-
Multivariate peaks over thresholds models
Authors:
Holger Rootzén,
Johan Segers,
Jennifer L. Wadsworth
Abstract:
Multivariate peaks over thresholds modeling based on generalized Pareto distributions has up to now only been used in few and mostly 2-dimensional situations. This paper contributes theoretical understanding, physically based models, inference tools, and simulation methods to support routine use, with an aim at higher dimensions. We derive a general point process model for extreme episodes in data…
▽ More
Multivariate peaks over thresholds modeling based on generalized Pareto distributions has up to now only been used in few and mostly 2-dimensional situations. This paper contributes theoretical understanding, physically based models, inference tools, and simulation methods to support routine use, with an aim at higher dimensions. We derive a general point process model for extreme episodes in data, and show how conditioning the distribution of extreme episodes on threshold exceedance gives four basic representations of the family of generalized Pareto distributions. The first representation is constructed on the real scale of the observations. The second one starts with a model on a standard exponential scale which then is transformed to the real scale. The third and fourth are reformulations of a spectral representation proposed in A. Ferreira and L. de Haan [Bernoulli 20 (2014) 1717--1737]. Numerically tractable forms of densities and censored densities are found and give tools for flexible parametric likelihood inference. New simulation algorithms, explicit formulas for probabilities and conditional probabilities, and conditions which make the conditional distribution of weighted component sums generalized Pareto are derived.
△ Less
Submitted 3 May, 2017; v1 submitted 21 March, 2016;
originally announced March 2016.
-
Correction note to "Limit Theorems for Empirical Processes of Cluster Functionals" [arXiv:0910.0343]
Authors:
Holger Drees,
Holger Rootzén
Abstract:
We correct an error in a technical lemma of Drees and Rootzén (2010) [arXiv:0910.0343] and discuss consequences for applications.
We correct an error in a technical lemma of Drees and Rootzén (2010) [arXiv:0910.0343] and discuss consequences for applications.
△ Less
Submitted 29 October, 2015;
originally announced October 2015.
-
Error distributions for random grid approximations of multidimensional stochastic integrals
Authors:
Carl Lindberg,
Holger Rootzén
Abstract:
This paper proves joint convergence of the approximation error for several stochastic integrals with respect to local Brownian semimartingales, for nonequidistant and random grids. The conditions needed for convergence are that the Lebesgue integrals of the integrands tend uniformly to zero and that the squared variation and covariation processes converge. The paper also provides tools which simpl…
▽ More
This paper proves joint convergence of the approximation error for several stochastic integrals with respect to local Brownian semimartingales, for nonequidistant and random grids. The conditions needed for convergence are that the Lebesgue integrals of the integrands tend uniformly to zero and that the squared variation and covariation processes converge. The paper also provides tools which simplify checking these conditions and which extend the range for the results. These results are used to prove an explicit limit theorem for random grid approximations of integrals based on solutions of multidimensional SDEs, and to find ways to "design" and optimize the distribution of the approximation error. As examples we briefly discuss strategies for discrete option hedging.
△ Less
Submitted 23 September, 2013; v1 submitted 25 August, 2011;
originally announced August 2011.
-
Limit theorems for empirical processes of cluster functionals
Authors:
Holger Drees,
Holger Rootzén
Abstract:
Let $(X_{n,i})_{1\le i\le n,n\in\mathbb{N}}$ be a triangular array of row-wise stationary $\mathbb{R}^d$-valued random variables. We use a "blocks method" to define clusters of extreme values: the rows of $(X_{n,i})$ are divided into $m_n$ blocks $(Y_{n,j})$, and if a block contains at least one extreme value, the block is considered to contain a cluster. The cluster starts at the first extreme va…
▽ More
Let $(X_{n,i})_{1\le i\le n,n\in\mathbb{N}}$ be a triangular array of row-wise stationary $\mathbb{R}^d$-valued random variables. We use a "blocks method" to define clusters of extreme values: the rows of $(X_{n,i})$ are divided into $m_n$ blocks $(Y_{n,j})$, and if a block contains at least one extreme value, the block is considered to contain a cluster. The cluster starts at the first extreme value in the block and ends at the last one. The main results are uniform central limit theorems for empirical processes $Z_n(f):=\frac{1}{\sqrt {nv_n}}\sum_{j=1}^{m_n}(f(Y_{n,j})-Ef(Y_{n,j})),$ for $v_n=P\{X_{n,i}\neq0\}$ and $f$ belonging to classes of cluster functionals, that is, functions of the blocks $Y_{n,j}$ which only depend on the cluster values and which are equal to 0 if $Y_{n,j}$ does not contain a cluster. Conditions for finite-dimensional convergence include $β$-mixing, suitable Lindeberg conditions and convergence of covariances. To obtain full uniform convergence, we use either "bracketing entropy" or bounds on covering numbers with respect to a random semi-metric. The latter makes it possible to bring the powerful Vapnik--Červonenkis theory to bear. Applications include multivariate tail empirical processes and empirical processes of cluster values and of order statistics in clusters. Although our main field of applications is the analysis of extreme values, the theory can be applied more generally to rare events occurring, for example, in nonparametric curve estimation.
△ Less
Submitted 18 May, 2020; v1 submitted 2 October, 2009;
originally announced October 2009.
-
Models for dependent extremes using stable mixtures
Authors:
Anne-Laure Fougères,
John P. Nolan,
Holger Rootzén
Abstract:
This paper unifies and extends results on a class of multivariate Extreme Value (EV) models studied by Hougaard, Crowder, and Tawn. In these models both unconditional and conditional distributions are EV, and all lower-dimensional marginals and maxima belong to the class. This leads to substantial economies of understanding, analysis and prediction. One interpretation of the models is as size mi…
▽ More
This paper unifies and extends results on a class of multivariate Extreme Value (EV) models studied by Hougaard, Crowder, and Tawn. In these models both unconditional and conditional distributions are EV, and all lower-dimensional marginals and maxima belong to the class. This leads to substantial economies of understanding, analysis and prediction. One interpretation of the models is as size mixtures of EV distributions, where the mixing is by positive stable distributions. A second interpretation is as exponential-stable location mixtures (for Gumbel) or as power-stable scale mixtures (for non-Gumbel EV distributions). A third interpretation is through a Peaks over Thresholds model with a positive stable intensity. The mixing variables are used as a modeling tool and for better understanding and model checking. We study extreme value analogues of components of variance models, and new time series, spatial, and continuous parameter models for extreme values. The results are applied to data from a pitting corrosion investigation.
△ Less
Submitted 15 November, 2007;
originally announced November 2007.
-
Extremes on Trees
Authors:
Tailen Hsing,
Holger Rootzen
Abstract:
This paper considers the asymptotic distribution of the longest edge of the minimal spanning tree and nearest neighbor graph on X_1,...,X_{N_n} where X_1,X_2,... are i.i.d. in \Re^2 with distribution F and N_n is independent of the X_i and satisfies N_n/n\to_p1. A new approach based on spatial blocking and a locally orthogonal coordinate system is developed to treat cases for which F has unbound…
▽ More
This paper considers the asymptotic distribution of the longest edge of the minimal spanning tree and nearest neighbor graph on X_1,...,X_{N_n} where X_1,X_2,... are i.i.d. in \Re^2 with distribution F and N_n is independent of the X_i and satisfies N_n/n\to_p1. A new approach based on spatial blocking and a locally orthogonal coordinate system is developed to treat cases for which F has unbounded support. The general results are applied to a number of special cases, including elliptically contoured distributions, distributions with independent Weibull-like margins and distributions with parallel level curves.
△ Less
Submitted 25 March, 2005;
originally announced March 2005.