-
An AUK-based index for measuring and testing the joint dependence of a random vector
Abstract: We present an index of dependence that allows one to measure the joint or mutual dependence of a $d$-dimensional random vector with $d>2$. The index is based on a $d$-dimensional Kendall process. We further propose a standardized version of our index of dependence that is easy to interpret, and provide an algorithm for its computation. We discuss tests of total independence based on consistent est… ▽ More
Submitted 23 December, 2020; v1 submitted 24 November, 2020; originally announced November 2020.
Comments: 33 pages (plus 8 pages supplementary material), 7 figures, 9 tables
MSC Class: Primary 62H20; 62H05; 62E10; secondary 62-09; 62G99
-
Multi-Panel Kendall Plot in Light of an ROC Curve Analysis Applied to Measuring Dependence
Abstract: The Kendall plot ($\K$-plot) is a plot measuring dependence between the components of a bivariate random variable. The $\K$-plot graphs the Kendall distribution function against the distribution function of $VU$, where $V$ and $U$ are independent uniform $[0,1]$ random variables. We associate $\K$-plots with the receiver operating characteristic ($\ROC$) curve, a well-accepted graphical tool in bi… ▽ More
Submitted 21 November, 2018; originally announced November 2018.
Comments: Statistics: A Journal of Theoretical and Applied Statistics. Accepted
-
arXiv:1806.02314 [pdf, ps, other]
On the limiting distribution of sample central moments
Abstract: We investigate the limiting behavior of sample central moments, examining the special cases where the limiting (as the sample size tends to infinity) distribution is degenerate. Parent (non-degenerate) distributions with this property are called \emph{singular}, and we show in this article that the singular distributions contain at most three supporting points. Moreover, using the \emph{delta}-met… ▽ More
Submitted 6 June, 2018; originally announced June 2018.
Comments: 26 pages, 1 figure
-
arXiv:1612.07670 [pdf, ps, other]
The out-of-source error in multi-source cross validation-type procedures
Abstract: A scientific phenomenon under study may often be manifested by data arising from processes, i.e. sources, that may describe this phenomenon. In this contex of multi-source data, we define the "out-of-source" error, that is the error committed when a new observation of unknown source origin is allocated to one of the sources using a rule that is trained on the known labeled data. We present an unbi… ▽ More
Submitted 22 December, 2016; originally announced December 2016.
Comments: 16 pages, 4 tables
Journal ref: New Advances in Statistics and Data Science 2017, 27-44
-
arXiv:1612.07408 [pdf, ps, other]
Statistical Distances and Their Role in Robustness
Abstract: Statistical distances, divergences, and similar quantities have a large history and play a fundamental role in statistics, machine learning and associated scientific disciplines. However, within the statistical literature, this extensive role has too often been played out behind the scenes, with other aspects of the statistical problems being viewed as more central, more interesting, or more impor… ▽ More
Submitted 21 December, 2016; originally announced December 2016.
Comments: 23 pages
Journal ref: New Advances in Statistics and Data Science 2017, 3-26
-
arXiv:1511.02980 [pdf, ps, other]
Optimality of Training/Test Size and Resampling Effectiveness of Cross-Validation Estimators of the Generalization Error
Abstract: An important question in constructing Cross Validation (CV) estimators of the generalization error is whether rules can be established that allow "optimal" selection of the size of the training set, for fixed sample size $n$. We define the {\it resampling effectiveness} of random CV estimators of the generalization error as the ratio of the limiting value of the variance of the CV estimator over t… ▽ More
Submitted 9 November, 2015; originally announced November 2015.
Comments: 53 pages, 6 figures, 16 tables
-
arXiv:1511.02962 [pdf, ps, other]
Uniform Integrability of the OLS Estimators, and the Convergence of their Moments
Abstract: The problem of convergence of moments of a sequence of random variables to the moments of its asymptotic distribution is important in many applications. These include the determination of the optimal training sample size in the cross validation estimation of the generalization error of computer algorithms, and in the construction of graphical methods for studying dependence patterns between two bi… ▽ More
Submitted 13 June, 2018; v1 submitted 9 November, 2015; originally announced November 2015.
Comments: 10 pages
MSC Class: 62J05; 62E20; 60E15; 60F05; 05A10
Journal ref: TEST 2016, Vol. 25, No 4, 775-784
-
arXiv:1411.1165 [pdf, ps, other]
A factorial moment distance and an application to the matching problem
Abstract: In this note we introduce the notion of factorial moment distance for non-negative integer-valued random variables and we compare it with the total variation distance. Furthermore, we study the rate of convergence in the classical matching problem and in a generalized matching distribution.
Submitted 5 November, 2014; originally announced November 2014.
Comments: 10 pages
MSC Class: Primary 60E05; 60E15; Secondary 44A10; 41A25
Journal ref: Theory of Probability and Its Applications 2017, Vol. 62, No 3, 617-628
-
arXiv:1408.1849 [pdf, ps, other]
Orthogonal polynomials in the Cumulative Ord family and its application to variance bounds
Abstract: This article presents and reviews several basic properties of the Cumulative Ord family of distributions; this family contains all the commonly used discrete distributions. A complete classification of the Ord family of probability mass functions is related to the orthogonality of the corresponding Rodrigues polynomials. Also, for any random variable $X$ of this family and for any suitable functio… ▽ More
Submitted 13 June, 2018; v1 submitted 8 August, 2014; originally announced August 2014.
Comments: 31 pages, 1 Table
MSC Class: Primary 60E05; 62E99; 05E35; 42A61; Secondary 60E15
Journal ref: Statistics 2018, Vol. 52, No 2, 364-392
-
arXiv:1110.3265 [pdf, ps, other]
A note on a variance bound for the multinomial and the negative multinomial distribution
Abstract: We prove a Chernoff-type upper variance bound for the multinomial and the negative multinomial distribution. An application is also given.
Submitted 12 June, 2018; v1 submitted 14 October, 2011; originally announced October 2011.
Comments: 8 pages
MSC Class: Primary 60E15
Journal ref: Naval Research Logistics 2014, Vol. 61, No 3, 179-183
-
arXiv:1110.0090 [pdf, ps, other]
Unified extension of variance bounds for integrated Pearson family
Abstract: We use some properties of orthogonal polynomials to provide a class of upper/lower variance bounds for a function $g(X)$ of an absolutely continuous random variable $X$, in terms of the derivatives of $g$ up to some order. The new bounds are better than the existing ones.
Submitted 8 June, 2018; v1 submitted 1 October, 2011; originally announced October 2011.
Comments: 14 pages
MSC Class: 60E15
Journal ref: Annals of the Institute of Statistical Mathematics 2013, Vol. 65, No 4, 687-702
-
arXiv:1104.0040 [pdf, ps, other]
Moment-based inference for Pearson's quadratic q subfamily of distributions
Abstract: The author uses a Stein-type covariance identity to obtain moment estimators for the parameters of the quadratic polynomial subfamily of Pearson distributions. The asymptotic distribution of the estimators is obtained, and normality and symmetry tests based on it are provided. Simulation is used to compare the performance of the proposed tests with that of other existing tests for symmetry and nor… ▽ More
Submitted 5 April, 2011; v1 submitted 31 March, 2011; originally announced April 2011.
Comments: 21 pages, 14 figures, 10 tables, submitted for publication
MSC Class: 62E01
Journal ref: Communications in Statistics Theory and Methods 2013, Vol. 42, No 12, 1-10
-
arXiv:1007.3662 [pdf, ps, other]
An extended Stein-type covariance identity for the Pearson family with applications to lower variance bounds
Abstract: For an absolutely continuous (integer-valued) r.v. $X$ of the Pearson (Ord) family, we show that, under natural moment conditions, a Stein-type covariance identity of order $k$ holds (cf. [Goldstein and Reinert, J. Theoret. Probab. 18 (2005) 237--260]). This identity is closely related to the corresponding sequence of orthogonal polynomials, obtained by a Rodrigues-type formula, and provides conve… ▽ More
Submitted 3 May, 2011; v1 submitted 21 July, 2010; originally announced July 2010.
Comments: Published in at http://dx.doi.org/10.3150/10-BEJ282 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)
Report number: IMS-BEJ-BEJ282
Journal ref: Bernoulli 2011, Vol. 17, No. 2, 507-529