Search | arXiv e-print repository

The least favorable noise

Authors: Philip A. Ernst, Abram M. Kagan, L. C. G. Rogers

Abstract: Suppose that a random variable $X$ of interest is observed perturbed by independent additive noise $Y$. This paper concerns the "the least favorable perturbation" $\hat Y_\ep$, which maximizes the prediction error $E(X-E(X|X+Y))^2$ in the class of $Y$ with $ \var (Y)\leq \ep$. We find a characterization of the answer to this question, and show by example that it can be surprisingly complicated. Ho… ▽ More Suppose that a random variable $X$ of interest is observed perturbed by independent additive noise $Y$. This paper concerns the "the least favorable perturbation" $\hat Y_\ep$, which maximizes the prediction error $E(X-E(X|X+Y))^2$ in the class of $Y$ with $ \var (Y)\leq \ep$. We find a characterization of the answer to this question, and show by example that it can be surprisingly complicated. However, in the special case where $X$ is infinitely divisible, the solution is complete and simple. We also explore the conjecture that noisier $Y$ makes prediction worse. △ Less

Submitted 17 March, 2021; originally announced March 2021.

Comments: 15 pages, 9 figures

MSC Class: 60E07; 60E10; 60E05

arXiv:1904.03559 [pdf, ps, other]

Statistical Meaning of Mean Functions

Authors: Abram M. Kagan, Paul J. Smith

Abstract: The basic properties of the Fisher information allow to reveal the statistical meaning of classical inequalities between mean functions. The properties applied to scale mixtures of Gaussian distributions lead to a new mean function of purely statistical origin, unrelated to the classical arithmetic, geometric, and harmonic means. We call it the informational mean and show that when the arguments o… ▽ More The basic properties of the Fisher information allow to reveal the statistical meaning of classical inequalities between mean functions. The properties applied to scale mixtures of Gaussian distributions lead to a new mean function of purely statistical origin, unrelated to the classical arithmetic, geometric, and harmonic means. We call it the informational mean and show that when the arguments of the mean functions are Hermitian positive definite matrices, not necessarily commuting, the informational mean lies between the arithmetic and harmonic means, playing, in a sense, the role of the geometric mean that cannot be correctly defined in case of non-commuting matrices.\\ Surprisingly the monotonicity and additivity properties of the Fisher information lead to a new generalization of the classical inequality between the arithmetic and harmonic means. △ Less

Submitted 6 April, 2019; originally announced April 2019.

arXiv:1903.04663 [pdf, ps, other]

Calibrating dependence between random elements

Authors: Abram M. Kagan, Gabor J. Székely

Abstract: Attempts to quantify dependence between random elements X and Y via maximal correlation go back to Gebelein (1941) and Rényi (1959). After summarizing properties (including some new) of the Rényi measure of dependence, a calibrated scale of dependence is introduced. It is based on the ``complexity`` of approximating functions of X by functions of Y. Attempts to quantify dependence between random elements X and Y via maximal correlation go back to Gebelein (1941) and Rényi (1959). After summarizing properties (including some new) of the Rényi measure of dependence, a calibrated scale of dependence is introduced. It is based on the ``complexity`` of approximating functions of X by functions of Y. △ Less

Submitted 11 March, 2019; originally announced March 2019.

MSC Class: 60H99; 62E10

arXiv:1902.06802 [pdf, ps, other]

Efficiency requires innovation

Authors: Abram M. Kagan

Abstract: In estimation a parameter $θ\in{\mathbb R}$ from a sample $(x_1,\ldots,x_n)$ from a population $P_θ$ a simple way of incorporating a new observation $x_{n+1}$ into an estimator $\tildeθ_{n} = \tildeθ_{n}(x_1,\ldots,x_n)$ is transforming $\tildeθ_n$ to what we call the {\it jackknife extension} $\tildeθ_{n+1}^{(e)} = \tildeθ_{n+1}^{(e)}(x_1,\ldots,x_n,x_{n+1})$, \[\tildeθ_{n+1}^{(e)} = \{\tildeθ_n… ▽ More In estimation a parameter $θ\in{\mathbb R}$ from a sample $(x_1,\ldots,x_n)$ from a population $P_θ$ a simple way of incorporating a new observation $x_{n+1}$ into an estimator $\tildeθ_{n} = \tildeθ_{n}(x_1,\ldots,x_n)$ is transforming $\tildeθ_n$ to what we call the {\it jackknife extension} $\tildeθ_{n+1}^{(e)} = \tildeθ_{n+1}^{(e)}(x_1,\ldots,x_n,x_{n+1})$, \[\tildeθ_{n+1}^{(e)} = \{\tildeθ_n (x_1 ,\ldots,x_n)+ \tildeθ_n (x_{n+1},x_2 ,\ldots,x_n) + \ldots + \tildeθ_n (x_1 ,\ldots,x_{n-1},x_{n+1})\}/(n+1).\] Though $\tildeθ_{n+1}^{(e)}$ lacks an innovation the statistician could expect from a larger data set, it is still better than $\tildeθ_n$, \[{\rm var}(\tildeθ_{n+1}^{(e)})\leq\frac{n}{n+1} {\rm var}(\tildeθ_n).\] However, an estimator obtained by jackknife extension for all $n$ is asymptotically efficient only for samples from exponential families. For a general $P_θ$, asymptotically efficient estimators require innovation when a new observation is added to the data. Some examples illustrate the concept. △ Less

Submitted 18 February, 2019; originally announced February 2019.

arXiv:1902.06800 [pdf, ps, other]

The KLR-theorem revisited

Authors: Abram M. Kagan

Abstract: For independent random variables $X_1,\ldots, X_n;Y_1,\ldots, Y_n$ with all $X_i$ identically distributed and same for $Y_j$, we study the relation \[E\{a\bar X + b\bar Y|X_1 -\bar X +Y_1 -\bar Y,\ldots,X_n -\bar X +Y_n -\bar Y\}={\rm const}\] with $a, b$ some constants. It is proved that for $n\geq 3$ and $ab>0$ the relation holds iff $X_i$ and $Y_j$ are Gaussian.\\ A new characterization arises… ▽ More For independent random variables $X_1,\ldots, X_n;Y_1,\ldots, Y_n$ with all $X_i$ identically distributed and same for $Y_j$, we study the relation \[E\{a\bar X + b\bar Y|X_1 -\bar X +Y_1 -\bar Y,\ldots,X_n -\bar X +Y_n -\bar Y\}={\rm const}\] with $a, b$ some constants. It is proved that for $n\geq 3$ and $ab>0$ the relation holds iff $X_i$ and $Y_j$ are Gaussian.\\ A new characterization arises in case of $a=1, b= -1$. In this case either $X_i$ or $Y_j$ or both have a Gaussian component. It is the first (at least known to the author) case when presence of a Gaussian component is a characteristic property. △ Less

Submitted 18 February, 2019; originally announced February 2019.

arXiv:1507.02769 [pdf, ps, other]

doi 10.1007/s13171-015-0076-5

On the structure of UMVUEs

Authors: Abram M. Kagan, Yaakov Malinovsky

Abstract: In all setups when the structure of UMVUEs is known, there exists a subalgebra $\cal U$ (MVE-algebra) of the basic $σ$-algebra such that all $\cal U$-measurable statistics with finite second moments are UMVUEs. It is shown that MVE-algebras are, in a sense, similar to the subalgebras generated by complete sufficient statistics. Examples are given when these subalgebras differ, in these cases a new… ▽ More In all setups when the structure of UMVUEs is known, there exists a subalgebra $\cal U$ (MVE-algebra) of the basic $σ$-algebra such that all $\cal U$-measurable statistics with finite second moments are UMVUEs. It is shown that MVE-algebras are, in a sense, similar to the subalgebras generated by complete sufficient statistics. Examples are given when these subalgebras differ, in these cases a new statistical structure arises. △ Less

Submitted 9 July, 2015; originally announced July 2015.

Comments: Accepted for publication in Sankhya A

MSC Class: 62B99; 62F10; 62G05

arXiv:1307.3654 [pdf, ps, other]

Partially complete sufficient statistics are jointly complete

Authors: Abram M. Kagan, Yaakov Malinovsky, Lutz Mattner

Abstract: The theory of the basic statistical concept of (Lehmann-Scheffé-)completeness is perfected by providing the theorem indicated in the title and previously overlooked for several decades. Relations to earlier results are discussed and illustrating examples are presented. Of the two proofs offered for the main result, the first is direct and short, following the prototypical example of Landers and… ▽ More The theory of the basic statistical concept of (Lehmann-Scheffé-)completeness is perfected by providing the theorem indicated in the title and previously overlooked for several decades. Relations to earlier results are discussed and illustrating examples are presented. Of the two proofs offered for the main result, the first is direct and short, following the prototypical example of Landers and Rogge (1976), and the second is very short and purely statistical, utilizing the basic theory of optimal unbiased estimation in the little known version completed by Schmetterer and Strasser (1974). △ Less

Submitted 23 May, 2014; v1 submitted 13 July, 2013; originally announced July 2013.

Comments: 16 pages. Very minor changes (one more reference, abstract slightly longer). To appear in Teoriya Veroyatnostei i ee Primeneniya

MSC Class: Primary 62B99; Secondary 62F10; 62G05

arXiv:1302.3238 [pdf, ps, other]

Contribution to the theory of Pitman estimators

Authors: Abram M. Kagan, Tinghui Yu, Andrew Barron, Mokshay Madiman

Abstract: New inequalities are proved for the variance of the Pitman estimators (minimum variance equivariant estimators) of θconstructed from samples of fixed size from populations F(x-θ). The inequalities are closely related to the classical Stam inequality for the Fisher information, its analog in small samples, and a powerful variance drop inequality. The only condition required is finite variance of F;… ▽ More New inequalities are proved for the variance of the Pitman estimators (minimum variance equivariant estimators) of θconstructed from samples of fixed size from populations F(x-θ). The inequalities are closely related to the classical Stam inequality for the Fisher information, its analog in small samples, and a powerful variance drop inequality. The only condition required is finite variance of F; even the absolute continuity of F is not assumed. As corollaries of the main inequalities for small samples, one obtains alternate proofs of known properties of the Fisher information, as well as interesting new observations like the fact that the variance of the Pitman estimator based on a sample of size n scaled by n monotonically decreases in n. Extensions of the results to the polynomial versions of the Pitman estimators and a multivariate location parameter are given. Also, the search for characterization of equality conditions for one of the inequalities leads to a Cauchy-type functional equation for independent random variables, and an interesting new behavior of its solutions is described. △ Less

Submitted 13 February, 2013; originally announced February 2013.

Journal ref: Zapiski Nauchnykh Seminarov POMI, Special issue in honour of I. A. Ibragimov's 80th birthday, Vol. 408, pp. 245-267, 2012

arXiv:1302.0924 [pdf, ps, other]

On the Nile Problem by Sir Ronald Fisher

Authors: Abram M. Kagan, Yaakov Malinovsky

Abstract: The Nile problem by Ronald Fisher may be interpreted as the problem of making statistical inference for a special curved exponential family when the minimal sufficient statistic is incomplete. The problem itself and its versions for general curved exponential families pose a mathematical-statistical challenge: studying the subalgebras of ancillary statistics within the $σ$-algebra of the (incomple… ▽ More The Nile problem by Ronald Fisher may be interpreted as the problem of making statistical inference for a special curved exponential family when the minimal sufficient statistic is incomplete. The problem itself and its versions for general curved exponential families pose a mathematical-statistical challenge: studying the subalgebras of ancillary statistics within the $σ$-algebra of the (incomplete) minimal sufficient statistics and closely related questions of the structure of UMVUEs. In this paper a new method is developed that, in particular, proves that in the classical Nile problem no statistic subject to mild natural conditions is a UMVUE. The method almost solves an old problem of the existence of UMVUEs. The method is purely statistical (vs. analytical) and works for any family possessing an ancillary statistic. It complements an analytical method that uses only the first order ancillarity (and thus works when the existence of ancillary subalgebras is an open problem) and works for curved exponential families with polynomial constraints on the canonical parameters of which the Nile problem is a special case. △ Less

Submitted 5 July, 2013; v1 submitted 4 February, 2013; originally announced February 2013.

arXiv:1202.6427 [pdf, ps, other]

Monotonicity in the Sample Size of the Length of Classical Confidence Intervals

Authors: Abram M. Kagan, Yaakov Malinovsky

Abstract: It is proved that the average length of standard confidence intervals for parameters of gamma and normal distributions monotonically decrease with the sample size. The proofs are based on fine properties of the classical gamma function. It is proved that the average length of standard confidence intervals for parameters of gamma and normal distributions monotonically decrease with the sample size. The proofs are based on fine properties of the classical gamma function. △ Less

Submitted 28 August, 2012; v1 submitted 28 February, 2012; originally announced February 2012.

Showing 1–10 of 10 results for author: Kagan, A M