-
arXiv:1907.00196 [pdf, ps, other]
Statistical estimation of the Kullback-Leibler divergence
Abstract: Wide conditions are provided to guarantee asymptotic unbiasedness and L^2-consistency of the introduced estimates of the Kullback-Leibler divergence for probability measures in R^d having densities w.r.t. the Lebesgue measure. These estimates are constructed by means of two independent collections of i.i.d. observations and involve the specified k-nearest neighbor statistics. In particular, the es… ▽ More
Submitted 29 June, 2019; originally announced July 2019.
MSC Class: 60F25; 62G20; 62H12
-
arXiv:1804.08741 [pdf, ps, other]
Statistical Estimation of Conditional Shannon Entropy
Abstract: The new estimates of the conditional Shannon entropy are introduced in the framework of the model describing a discrete response variable depending on a vector of d factors having a density w.r.t. the Lebesgue measure in R^d. Namely, the mixed-pair model (X,Y) is considered where X and Y take values in R^d and an arbitrary finite set, respectively. Such models include, for instance, the famous log… ▽ More
Submitted 23 April, 2018; originally announced April 2018.
MSC Class: 60F25; 62G20; 62H12
-
arXiv:1801.02050 [pdf, ps, other]
Statistical estimation of the Shannon entropy
Abstract: The behavior of the Kozachenko - Leonenko estimates for the (differential) Shannon entropy is studied when the number of i.i.d. vector-valued observations tends to infinity. The asymptotic unbiasedness and L^2-consistency of the estimates are established. The conditions employed involve the analogues of the Hardy - Littlewood maximal function. It is shown that the results are valid in particular f… ▽ More
Submitted 6 January, 2018; originally announced January 2018.
MSC Class: 60F25; 62G20; 62H12
-
Modification of the MDR-EFE method for stratified samples
Abstract: The MDR-EFE method of performing identification of relevant factors within a given collection X_1,...,X_n is developed for stratified samples in the case of binary response variable Y. We establish a criterion of strong consistency of estimates (involving K-cross-validation procedure and penalty) for a specified prediction error function. The cost approach is proposed to compare experiments with r… ▽ More
Submitted 21 June, 2016; originally announced June 2016.
MSC Class: 62G05; 62G20
-
arXiv:1406.1138 [pdf, ps, other]
Simulation and analytical approach to the identification of significant factors
Abstract: We develop our previous works concerning the identification of the collection of significant factors determining some, in general, non-binary random response variable. Such identification is important, e.g., in biological and medical studies. Our approach is to examine the quality of response variable prediction by functions in (certain part of) the factors. The prediction error estimation require… ▽ More
Submitted 4 June, 2014; originally announced June 2014.
Comments: 25 pages, 6 tables, 3 figures
MSC Class: 62G05; 62E20
-
arXiv:1301.6609 [pdf, ps, other]
Central limit theorem related to MDR-method
Abstract: In many medical and biological investigations, including genetics, it is typical to handle high dimensional data which can be viewed as a set of values of some factors and a binary response variable. For instance, the response variable can describe the state of a patient health and one often assumes that it depends only on some part of factors. An important problem is to determine collections of s… ▽ More
Submitted 28 January, 2013; originally announced January 2013.
MSC Class: 60F05; 60F15; 62P10
-
arXiv:1106.4989 [pdf, ps, other]
Statistical methods of SNP data analysis with applications
Abstract: Various statistical methods important for genetic analysis are considered and developed. Namely, we concentrate on the multifactor dimensionality reduction, logic regression, random forests and stochastic gradient boosting. These methods and their new modifications, e.g., the MDR method with "independent rule", are used to study the risk of complex diseases such as cardiovascular ones. The roles o… ▽ More
Submitted 24 June, 2011; originally announced June 2011.
-
arXiv:1104.4180 [pdf, ps, other]
On the Newman Conjecture
Abstract: We consider a random field, defined on an integer-valued d-dimensional lattice, with covariance function satisfying a condition more general than summability. Such condition appeared in the well-known Newman's conjecture concerning the central limit theorem (CLT) for stationary associated random fields. As was demonstrated by Herrndorf and Shashkin, the conjecture fails already for d=1. In the pre… ▽ More
Submitted 21 April, 2011; originally announced April 2011.
-
arXiv:1005.0483 [pdf, ps, other]
Central limit theorems for the excursion set volumes of weakly dependent random fields
Abstract: The multivariate central limit theorems (CLT) for the volumes of excursion sets of stationary quasi-associated random fields on $\mathbb{R}^d$ are proved. Special attention is paid to Gaussian and shot noise fields. Formulae for the covariance matrix of the limiting distribution are provided. A statistical version of the CLT is considered as well. Some numerical results are also discussed.
Submitted 1 March, 2012; v1 submitted 4 May, 2010; originally announced May 2010.
Comments: Published in at http://dx.doi.org/10.3150/10-BEJ339 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)
Report number: IMS-BEJ-BEJ339
Journal ref: Bernoulli 2012, Vol. 18, No. 1, 100-118
-
arXiv:math/0608237 [pdf, ps, other]
Strong invariance principle for dependent random fields
Abstract: A strong invariance principle is established for random fields which satisfy dependence conditions more general than positive or negative association. We use the approach of Csörgő and Révész applied recently by Balan to associated random fields. The key step in our proof combines new moment and maximal inequalities, established by the authors for partial sums of multiindexed random variables, w… ▽ More
Submitted 10 August, 2006; originally announced August 2006.
Comments: Published at http://dx.doi.org/10.1214/074921706000000167 in the IMS Lecture Notes--Monograph Series (http://www.imstat.org/publications/lecnotes.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-LNMS48-LNMS4813 MSC Class: 60F15; 60F17 (Primary)
Journal ref: IMS Lecture Notes--Monograph Series 2006, Vol. 48, 128-143
-
arXiv:math/0504225 [pdf, ps, other]
Generalization of the Critical Volume NTCP Model in the Radiobiology
Abstract: A generalization of the well known critical volume NTCP model is proposed to take into account dependence of the functional subunits of irradiated organ (or tissue). A new statistical version of the CLT is established to analyze the corresponding random fields.
Submitted 11 April, 2005; originally announced April 2005.
MSC Class: AMS: 60F05; 62E20; 62G15; 62P10