-
Records in stochastic processes -- Theory and applications
Authors:
Gregor Wergen
Abstract:
In recent years there has been a surge of interest in the statistics of record-breaking events in stochastic processes. Along with that, many new and interesting applications of the theory of records were discovered and explored. The record statistics of uncorrelated random variables sampled from time-dependent distributions was studied extensively. The findings were applied in various areas to mo…
▽ More
In recent years there has been a surge of interest in the statistics of record-breaking events in stochastic processes. Along with that, many new and interesting applications of the theory of records were discovered and explored. The record statistics of uncorrelated random variables sampled from time-dependent distributions was studied extensively. The findings were applied in various areas to model and explain record-breaking events in observational data. Particularly interesting and fruitful was the study of record-breaking temperatures and their connection with global warming, but also records in sports, biology and some areas in physics were considered in the last years. Similarly, researchers have recently started to understand the record statistics of correlated processes such as random walks, which can be helpful to model record events in financial time series. This review is an attempt to summarize and evaluate the progress that was made in the field of record statistics throughout the last years.
△ Less
Submitted 16 July, 2013; v1 submitted 26 November, 2012;
originally announced November 2012.
-
Record statistics and persistence for a random walk with a drift
Authors:
Satya N. Majumdar,
Gregory Schehr,
Gregor Wergen
Abstract:
We study the statistics of records of a one-dimensional random walk of n steps, starting from the origin, and in presence of a constant bias c. At each time-step the walker makes a random jump of length ηdrawn from a continuous distribution f(η) which is symmetric around a constant drift c. We focus in particular on the case were f(η) is a symmetric stable law with a Lévy index 0 < μ\leq 2. The re…
▽ More
We study the statistics of records of a one-dimensional random walk of n steps, starting from the origin, and in presence of a constant bias c. At each time-step the walker makes a random jump of length ηdrawn from a continuous distribution f(η) which is symmetric around a constant drift c. We focus in particular on the case were f(η) is a symmetric stable law with a Lévy index 0 < μ\leq 2. The record statistics depends crucially on the persistence probability which, as we show here, exhibits different behaviors depending on the sign of c and the value of the parameter μ. Hence, in the limit of a large number of steps n, the record statistics is sensitive to these parameters (c and μ) of the jump distribution. We compute the asymptotic mean record number <R_n> after n steps as well as its full distribution P(R,n). We also compute the statistics of the ages of the longest and the shortest lasting record. Our exact computations show the existence of five distinct regions in the (c, 0 < μ\leq 2) strip where these quantities display qualitatively different behaviors. We also present numerical simulation results that verify our analytical predictions.
△ Less
Submitted 28 August, 2012; v1 submitted 29 June, 2012;
originally announced June 2012.
-
Rounding Effects in Record Statistics
Authors:
G. Wergen,
D. Volovik,
S. Redner,
J. Krug
Abstract:
We analyze record-breaking events in time series of continuous random variables that are subsequently discretized by rounding down to integer multiples of a discretization scale $Δ>0$. Rounding leads to ties of an existing record, thereby reducing the number of new records. For an infinite number of random variables that are drawn from distributions with a finite upper limit, the number of discret…
▽ More
We analyze record-breaking events in time series of continuous random variables that are subsequently discretized by rounding down to integer multiples of a discretization scale $Δ>0$. Rounding leads to ties of an existing record, thereby reducing the number of new records. For an infinite number of random variables that are drawn from distributions with a finite upper limit, the number of discrete records is finite, while for distributions with a thinner than exponential upper tail, fewer discrete records arise compared to continuous variables. In the latter case the record sequence becomes highly regular at long times.
△ Less
Submitted 22 October, 2012; v1 submitted 20 June, 2012;
originally announced June 2012.
-
Record Statistics for Multiple Random Walks
Authors:
Gregor Wergen,
Satya N. Majumdar,
Gregory Schehr
Abstract:
We study the statistics of the number of records R_{n,N} for N identical and independent symmetric discrete-time random walks of n steps in one dimension, all starting at the origin at step 0. At each time step, each walker jumps by a random length drawn independently from a symmetric and continuous distribution. We consider two cases: (I) when the variance σ^2 of the jump distribution is finite a…
▽ More
We study the statistics of the number of records R_{n,N} for N identical and independent symmetric discrete-time random walks of n steps in one dimension, all starting at the origin at step 0. At each time step, each walker jumps by a random length drawn independently from a symmetric and continuous distribution. We consider two cases: (I) when the variance σ^2 of the jump distribution is finite and (II) when σ^2 is divergent as in the case of Lévy flights with index 0 < μ< 2. In both cases we find that the mean record number <R_{n,N}> grows universally as \sim α_N \sqrt{n} for large n, but with a very different behavior of the amplitude α_N for N > 1 in the two cases. We find that for large N, α_N \approx 2 \sqrt{\log N} independently of σ^2 in case I. In contrast, in case II, the amplitude approaches to an N-independent constant for large N, α_N \approx 4/\sqrtπ, independently of 0<μ<2. For finite σ^2 we argue, and this is confirmed by our numerical simulations, that the full distribution of (R_{n,N}/\sqrt{n} - 2 \sqrt{\log N}) \sqrt{\log N} converges to a Gumbel law as n \to \infty and N \to \infty. In case II, our numerical simulations indicate that the distribution of R_{n,N}/\sqrt{n} converges, for n \to \infty and N \to \infty, to a universal nontrivial distribution, independently of μ. We discuss the applications of our results to the study of the record statistics of 366 daily stock prices from the Standard & Poors 500 index.
△ Less
Submitted 23 April, 2012;
originally announced April 2012.
-
Correlations of record events as a test for heavy-tailed distributions
Authors:
J. Franke,
G. Wergen,
J. Krug
Abstract:
A record is an entry in a time series that is larger or smaller than all previous entries. If the time series consists of independent, identically distributed random variables with a superimposed linear trend, record events are positively (negatively) correlated when the tail of the distribution is heavier (lighter) than exponential. Here we use these correlations to detect heavy-tailed behavior i…
▽ More
A record is an entry in a time series that is larger or smaller than all previous entries. If the time series consists of independent, identically distributed random variables with a superimposed linear trend, record events are positively (negatively) correlated when the tail of the distribution is heavier (lighter) than exponential. Here we use these correlations to detect heavy-tailed behavior in small sets of independent random variables. The method consists of converting random subsets of the data into time series with a tunable linear drift and computing the resulting record correlations.
△ Less
Submitted 6 January, 2012; v1 submitted 9 September, 2011;
originally announced September 2011.
-
Correlations between record events in sequences of random variables with a linear trend
Authors:
Gregor Wergen,
Jasper Franke,
Joachim Krug
Abstract:
The statistics of records in sequences of independent, identically distributed random variables is a classic subject of study. One of the earliest results concerns the stochastic independence of record events. Recently, records statistics beyond the case of i.i.d. random variables have received much attention, but the question of independence of record events has not been addressed systematically.…
▽ More
The statistics of records in sequences of independent, identically distributed random variables is a classic subject of study. One of the earliest results concerns the stochastic independence of record events. Recently, records statistics beyond the case of i.i.d. random variables have received much attention, but the question of independence of record events has not been addressed systematically. In this paper, we study this question in detail for the case of independent, non-identically distributed random variables, specifically, for random variables with a linearly moving mean. We find a rich pattern of positive and negative correlations, and show how their asymptotics is determined by the universality classes of extreme value statistics.
△ Less
Submitted 23 September, 2011; v1 submitted 19 May, 2011;
originally announced May 2011.
-
Record statistics for biased random walks, with an application to financial data
Authors:
Gregor Wergen,
Miro Bogner,
Joachim Krug
Abstract:
We consider the occurrence of record-breaking events in random walks with asymmetric jump distributions. The statistics of records in symmetric random walks was previously analyzed by Majumdar and Ziff and is well understood. Unlike the case of symmetric jump distributions, in the asymmetric case the statistics of records depends on the choice of the jump distribution. We compute the record rate…
▽ More
We consider the occurrence of record-breaking events in random walks with asymmetric jump distributions. The statistics of records in symmetric random walks was previously analyzed by Majumdar and Ziff and is well understood. Unlike the case of symmetric jump distributions, in the asymmetric case the statistics of records depends on the choice of the jump distribution. We compute the record rate $P_n(c)$, defined as the probability for the $n$th value to be larger than all previous values, for a Gaussian jump distribution with standard deviation $σ$ that is shifted by a constant drift $c$. For small drift, in the sense of $c/σ\ll n^{-1/2}$, the correction to $P_n(c)$ grows proportional to arctan$(\sqrt{n})$ and saturates at the value $\frac{c}{\sqrt{2} σ}$. For large $n$ the record rate approaches a constant, which is approximately given by $1-(σ/\sqrt{2π}c)\textrm{exp}(-c^2/2σ^2)$ for $c/σ\gg 1$. These asymptotic results carry over to other continuous jump distributions with finite variance. As an application, we compare our analytical results to the record statistics of 366 daily stock prices from the Standard & Poors 500 index. The biased random walk accounts quantitatively for the increase in the number of upper records due to the overall trend in the stock prices, and after detrending the number of upper records is in good agreement with the symmetric random walk. However the number of lower records in the detrended data is significantly reduced by a mechanism that remains to be identified.
△ Less
Submitted 4 March, 2011;
originally announced March 2011.
-
Records and sequences of records from random variables with a linear trend
Authors:
Jasper Franke,
Gregor Wergen,
Joachim Krug
Abstract:
We consider records and sequences of records drawn from discrete time series of the form $X_{n}=Y_{n}+cn$, where the $Y_{n}$ are independent and identically distributed random variables and $c$ is a constant drift. For very small and very large drift velocities, we investigate the asymptotic behavior of the probability $p_n(c)$ of a record occurring in the $n$th step and the probability $P_N(c)$ t…
▽ More
We consider records and sequences of records drawn from discrete time series of the form $X_{n}=Y_{n}+cn$, where the $Y_{n}$ are independent and identically distributed random variables and $c$ is a constant drift. For very small and very large drift velocities, we investigate the asymptotic behavior of the probability $p_n(c)$ of a record occurring in the $n$th step and the probability $P_N(c)$ that all $N$ entries are records, i.e. that $X_1 < X_2 < ... < X_N$. Our work is motivated by the analysis of temperature time series in climatology, and by the study of mutational pathways in evolutionary biology.
△ Less
Submitted 9 August, 2010;
originally announced August 2010.
-
Record-breaking temperatures reveal a warming climate
Authors:
Gregor Wergen,
Joachim Krug
Abstract:
We present a mathematical analysis of records drawn from independent random variables with a drifting mean. To leading order the change in the record rate is proportional to the ratio of the drift velocity to the standard deviation of the underlying distribution. We apply the theory to time series of daily temperatures for given calendar days, obtained from historical climate recordings of Europea…
▽ More
We present a mathematical analysis of records drawn from independent random variables with a drifting mean. To leading order the change in the record rate is proportional to the ratio of the drift velocity to the standard deviation of the underlying distribution. We apply the theory to time series of daily temperatures for given calendar days, obtained from historical climate recordings of European and American weather stations as well as re-analysis data. We conclude that the change in the mean temperature has increased the rate of record breaking events in a moderate but significant way: For the European station data covering the time period 1976-2005, we find that about 5 of the 17 high temperature records observed on average in 2005 can be attributed to the warming climate.
△ Less
Submitted 25 November, 2010; v1 submitted 18 May, 2010;
originally announced May 2010.