-
Statistical tests based on Renyi entropy estimation
Authors:
Mehmet Siddik Cadirci,
Dafydd Evans,
Nikolai Leonenko,
Vitali Makogin,
Oleg Seleznjev
Abstract:
Entropy and its various generalizations are important in many fields, including mathematical statistics, communication theory, physics and computer science, for characterizing the amount of information associated with a probability distribution. In this paper we propose goodness-of-fit statistics for the multivariate Student and multivariate Pearson type II distributions, based on the maximum entropy principle and a class of estimators for Rényi entropy based on nearest neighbour distances. We prove the L^2-consistency of these statistics using results on the subadditivity of Euclidean functionals on nearest neighbour graphs, and investigate their rate of convergence and asymptotic distribution using Monte Carlo methods.
Submitted 23 January, 2025;
originally announced February 2025.
-
Approach to Evaluating Characteristics of Multichannel Loss System with FCFD Preempted Priority Discipline
Authors:
A. G. Tatashev,
O. V. Seleznjev,
M. V. Yashina
Abstract:
In this paper we consider a multichannel preemptive priority loss system with Poisson input and a general service time distribution depending on the priority of the job. Jobs of the same priority are preempted according to the First Come First Displaced (FCFD) protocol. Approximate formulas are obtained for the loss probability of a job of a prescribed priority and for some other characteristics of the system. In particular cases, the obtained formulas are exact.
Submitted 14 July, 2022;
originally announced July 2022.
-
Approximate Formulas for Characteristics of Multichannel LIFO Preemptive-Resume Priority Queueing System
Authors:
A. G. Tatashev,
O. V. Seleznjev,
M. V. Yashina
Abstract:
This paper considers a multichannel preemptive-resume priority queueing system with Poisson input and an arbitrary service time distribution depending on the priority of the job. Jobs of the same priority are served according to the LIFO rule. If, at the moment of a job's arrival, all servers are busy and at least one server is serving a job whose priority is not higher than that of the arriving job, then the service of a job is preempted, the preempted job being one with the lowest priority among the jobs in service. The service of a preempted job is resumed later. The paper proposes approximate formulas for the sojourn time of a job of a prescribed priority and for some other characteristics of the system.
Submitted 18 June, 2022;
originally announced June 2022.
-
Queueing Systems with Some Versions of Limited Processor Sharing Discipline
Authors:
M. S. Alencar,
A. G. Tatashev,
O. V. Seleznjev,
M. V. Yashina
Abstract:
The paper considers a queueing system with limited processor sharing. No more than n jobs may be served simultaneously. This system may be used for modeling bandwidth sharing in wireless communication systems and service processes in computer networks. If there are n jobs in the system and a new job arrives, then either the arriving job is lost or the service of a job is interrupted and this job is lost. We study two rules for choosing the job to be lost. Under one of these rules, the job with the shortest remaining length is lost. Relations are obtained between the state probabilities of the considered system and the state probabilities of the corresponding unlimited processor sharing system. These relations allow the state probabilities of the considered system to be computed if the state probabilities of the unlimited processor sharing system are known. In the case of a Poisson arrival process, the probability that the server capacity is exhausted is equal to the probability that a job is lost. We obtain explicit formulas for the stationary state probabilities and the loss probability in this case. These probabilities are invariant with respect to the job length distribution, provided that the mean job length is fixed.
Submitted 27 January, 2022;
originally announced February 2022.
-
Statistical tests based on Rényi entropy estimation
Authors:
Mehmet Siddik Cadirci,
Dafydd Evans,
Nikolai Leonenko,
Oleg Seleznjev
Abstract:
Entropy and its various generalizations are important in many fields, including mathematical statistics, communication theory, physics and computer science, for characterizing the amount of information associated with a probability distribution. In this paper we propose goodness-of-fit statistics for the multivariate Student and multivariate Pearson type II distributions, based on the maximum entropy principle and a class of estimators for Rényi entropy based on nearest neighbour distances. We prove the L^2-consistency of these statistics using results on the subadditivity of Euclidean functionals on nearest neighbour graphs, and investigate their rate of convergence and asymptotic distribution using Monte Carlo methods. In addition we present a novel iterative method for estimating the shape parameter of the multivariate Student and multivariate Pearson type II distributions.
Submitted 1 June, 2021;
originally announced June 2021.
-
Consistency of the drift parameter estimator for the discretized fractional Ornstein-Uhlenbeck process with Hurst index $H\in(0,\frac12)$
Authors:
Kestutis Kubilius,
Yuliya Mishura,
Kostiantyn Ralchenko,
Oleg Seleznjev
Abstract:
We consider the Langevin equation involving fractional Brownian motion with Hurst index $H\in(0,\frac12)$. Its solution is the fractional Ornstein-Uhlenbeck process with unknown drift parameter $θ$. We construct an estimator that is similar in form to the maximum likelihood estimator for the Langevin equation with standard Brownian motion. Observations are discrete in time. It is assumed that the interval between observations is $n^{-1}$, i.e. tends to zero (high frequency data), and that the number of observations increases to infinity as $n^m$ with $m>1$. It is proved that for positive $θ$ the estimator is strongly consistent for any $m>1$, and for negative $θ$ it is consistent when $m>\frac{1}{2H}$.
Submitted 19 January, 2015;
originally announced January 2015.
-
Boundary non-crossing probabilities for fractional Brownian motion with trend
Authors:
Enkelejd Hashorva,
Yuliya Mishura,
Oleg Seleznjev
Abstract:
In this paper we investigate the boundary non-crossing probabilities of a fractional Brownian motion considering some general deterministic trend function. We derive bounds for non-crossing probabilities and discuss the case of a large trend function. As a by-product we solve a minimization problem related to the norm of the trend function.
Submitted 29 September, 2013;
originally announced September 2013.
-
Estimation of quadratic density functionals under m-dependence
Authors:
David Källberg,
Oleg Seleznjev
Abstract:
In this paper, we study estimation of certain integral functionals of one or two densities with samples from stationary m-dependent sequences. We consider two types of U-statistic estimators for these functionals that are functions of the number of epsilon-close vector observations in the samples. We show that the estimators are consistent and obtain their rates of convergence under weak distributional assumptions. In particular, we propose estimators based on incomplete U-statistics which have favorable consistency properties even when m-dependence is the only dependence condition that can be imposed on the stationary sequences. The results can be used for divergence and entropy estimation, and thus find many applications in statistics and applied sciences.
Submitted 19 September, 2013;
originally announced September 2013.
-
Statistical estimation of quadratic Rényi entropy for a stationary m-dependent sequence
Authors:
David Källberg,
Nikolaj Leonenko,
Oleg Seleznjev
Abstract:
The Rényi entropy is a generalization of the Shannon entropy and is widely used in mathematical statistics and applied sciences for quantifying the uncertainty in a probability distribution. We consider estimation of the quadratic Rényi entropy and related functionals for the marginal distribution of a stationary m-dependent sequence. The U-statistic estimators under study are based on the number of epsilon-close vector observations in the corresponding sample. A variety of asymptotic properties for these estimators are obtained (e.g., consistency, asymptotic normality, Poisson convergence). The results can be used in diverse statistical and computer science problems whenever the conventional independence assumption is too strong (e.g., epsilon-keys in time series databases, distribution identification problems for dependent samples).
Submitted 7 March, 2013;
originally announced March 2013.
-
Estimation of entropy-type integral functionals
Authors:
David Källberg,
Oleg Seleznjev
Abstract:
Entropy-type integral functionals of densities are widely used in mathematical statistics, information theory, and computer science. Examples include measures of closeness between distributions (e.g., density power divergence) and uncertainty characteristics for a random variable (e.g., Rényi entropy). In this paper, we study U-statistic estimators for a class of such functionals. The estimators are based on epsilon-close vector observations in the corresponding independent and identically distributed samples. We prove asymptotic properties of the estimators (consistency and asymptotic normality) under mild integrability and smoothness conditions for the densities. The results can be applied in diverse problems in mathematical statistics and computer science (e.g., distribution identification problems, approximate matching for random databases, two-sample problems).
Submitted 7 March, 2013; v1 submitted 12 September, 2012;
originally announced September 2012.
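As a rough illustration of the kind of estimator this abstract describes, the sketch below counts epsilon-close pairs in an i.i.d. sample and normalizes by the number of pairs and the volume of the epsilon-ball, which gives a U-statistic estimate of the integral of the squared density; minus its logarithm is then an estimate of the quadratic Rényi entropy. The function names, the choice of the Euclidean distance, and the fixed `eps` are assumptions made for this example and are not taken from the paper, which treats a wider class of functionals and lets epsilon depend on the sample size.

```python
import math
import random
from itertools import combinations


def ball_volume(d, eps):
    """Volume of a d-dimensional Euclidean ball of radius eps."""
    return math.pi ** (d / 2) / math.gamma(d / 2 + 1) * eps ** d


def quadratic_renyi_entropy(sample, eps):
    """Estimate the quadratic Renyi entropy -log(integral of f^2).

    sample: list of equal-length tuples (points in R^d), assumed i.i.d.
    eps:    closeness radius (hypothetical tuning parameter).
    """
    n = len(sample)
    d = len(sample[0])
    # Number of unordered pairs of observations within distance eps.
    close_pairs = sum(
        1 for x, y in combinations(sample, 2) if math.dist(x, y) <= eps
    )
    if close_pairs == 0:
        raise ValueError("no eps-close pairs; increase eps or the sample size")
    # U-statistic estimate of the integral of the squared density.
    q2_hat = close_pairs / (math.comb(n, 2) * ball_volume(d, eps))
    return -math.log(q2_hat)


if __name__ == "__main__":
    rng = random.Random(0)
    # Uniform sample on [0, 1]: the true quadratic Renyi entropy is 0.
    data = [(rng.random(),) for _ in range(500)]
    print(quadratic_renyi_entropy(data, eps=0.05))
```

The pairwise loop costs O(n^2) distance evaluations, which is acceptable for a small demonstration; a spatial index would be the natural replacement for larger samples.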
-
Approximation of a random process with variable smoothness
Authors:
Enkelejd Hashorva,
Mikhail Lifshits,
Oleg Seleznjev
Abstract:
We consider the rate of piecewise constant approximation to a locally stationary process $X(t),t\in [0,1]$, having a variable smoothness index $α(t)$. Assuming that $α(\cdot)$ attains its unique minimum at zero and satisfies the regularity condition, we propose a method for construction of observation points (composite dilated design) and find an asymptotics for the integrated mean square error, where a piecewise constant approximation $X_n$ is based on $N(n)\sim n$ observations of $X$. Further, we prove that the suggested approximation rate is optimal, and then show how to find an optimal constant.
Submitted 6 June, 2012;
originally announced June 2012.
-
Stratified Monte Carlo quadrature for continuous random fields
Authors:
Konrad Abramowicz,
Oleg Seleznjev
Abstract:
We consider the problem of numerical approximation of integrals of random fields over a unit hypercube. We use a stratified Monte Carlo quadrature and measure the approximation performance by the mean squared error. The quadrature is defined by a finite number of stratified randomly chosen observations with the partition (or strata) generated by a rectangular grid (or design). We study the class of locally stationary random fields whose local behavior is like a fractional Brownian field in the mean square sense and find the asymptotic approximation accuracy for a sequence of designs as the number of observations grows. For the Hölder class of random functions, we provide an upper bound for the approximation error. Additionally, for a certain class of isotropic random functions with an isolated singularity at the origin, we construct a sequence of designs eliminating the effect of the singularity point.
Submitted 4 May, 2011; v1 submitted 26 April, 2011;
originally announced April 2011.
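A minimal sketch of a stratified Monte Carlo quadrature on the unit hypercube may help fix ideas. It assumes a uniform rectangular grid with `m` cells per axis and one uniformly drawn observation per cell, and it integrates a single deterministic function standing in for one realization of a random field; the paper itself considers more general grids (designs), and the names `stratified_mc`, `m` and `rng` are hypothetical.

```python
import itertools
import random


def stratified_mc(f, d, m, rng):
    """Stratified Monte Carlo estimate of the integral of f over [0, 1]^d.

    The cube is split into m**d equal cells by a uniform grid, and one
    point is drawn uniformly at random inside each cell.
    """
    total = 0.0
    for cell in itertools.product(range(m), repeat=d):
        # Uniform random point inside the cell with lower corner cell / m.
        point = tuple((i + rng.random()) / m for i in cell)
        total += f(point)
    # All cells have equal volume m**(-d), so the estimate is a plain mean.
    return total / m ** d


if __name__ == "__main__":
    rng = random.Random(1)
    # Integral of x + y over the unit square is exactly 1.
    print(stratified_mc(lambda p: p[0] + p[1], d=2, m=20, rng=rng))
```

With equal-volume cells the estimator is unbiased, and its variance depends only on how much the integrand varies within each cell, which is why refining the grid near a singularity can pay off.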
-
Statistical Inference for Rényi Entropy Functionals
Authors:
David Källberg,
Nikolaj Leonenko,
Oleg Seleznjev
Abstract:
Numerous entropy-type characteristics (functionals) generalizing Rényi entropy are widely used in mathematical statistics, physics, information theory, and signal processing for characterizing uncertainty in probability distributions and distribution identification problems. We consider estimators of some entropy (integral) functionals for discrete and continuous distributions based on the number of epsilon-close vector records in the corresponding independent and identically distributed samples from two distributions. The estimators form a triangular scheme of generalized U-statistics. We show the asymptotic properties of these estimators (e.g., consistency and asymptotic normality). The results can be applied in various problems in computer science and mathematical statistics (e.g., approximate matching for random databases, record linkage, image matching).
Submitted 25 March, 2011;
originally announced March 2011.
-
Multivariate piecewise linear interpolation of a random field
Authors:
Konrad Abramowicz,
Oleg Seleznjev
Abstract:
We consider a multivariate piecewise linear interpolation of a continuous random field on a d-dimensional cube. The approximation performance is measured by the integrated mean square error. Multivariate piecewise linear interpolator is defined by N field observations on a locations grid (or design). We investigate the class of locally stationary random fields whose local behavior is like a fractional Brownian field in mean square sense and find the asymptotic approximation accuracy for a sequence of designs for large N. Moreover, for certain classes of continuous and continuously differentiable fields we provide the upper bound for the approximation accuracy in the uniform mean square norm.
Submitted 9 February, 2011;
originally announced February 2011.
-
Spline approximation of a random process with singularity
Authors:
Konrad Abramowicz,
Oleg Seleznjev
Abstract:
Let a continuous random process $X$ defined on $[0,1]$ be $(m+β)$-smooth, $0\le m, 0<β\le 1$, in quadratic mean for all $t>0$ and have an isolated singularity point at $t=0$. In addition, let $X$ be locally like a $m$-fold integrated $β$-fractional Brownian motion for all non-singular points. We consider approximation of $X$ by piecewise Hermite interpolation splines with $n$ free knots (i.e., a sampling design, a mesh). The approximation performance is measured by mean errors (e.g., integrated or maximal quadratic mean errors). We construct a sequence of sampling designs with asymptotic approximation rate $n^{-(m+β)}$ for the whole interval.
Submitted 19 May, 2010; v1 submitted 29 April, 2010;
originally announced April 2010.