-
A Characterization of Most(More) Powerful Test Statistics with Simple Nonparametric Applications
Authors:
Albert Vexler,
Alan D. Hutson
Abstract:
Data-driven most powerful tests are statistical hypothesis decision-making tools that deliver the greatest power against a fixed null hypothesis among all corresponding data-based tests of a given size. When the underlying data distributions are known, the likelihood ratio principle can be applied to conduct most powerful tests. Reversing this notion, we consider the following questions. (a) Assum…
▽ More
Data-driven most powerful tests are statistical hypothesis decision-making tools that deliver the greatest power against a fixed null hypothesis among all corresponding data-based tests of a given size. When the underlying data distributions are known, the likelihood ratio principle can be applied to conduct most powerful tests. Reversing this notion, we consider the following questions. (a) Assuming a test statistic, say T, is given, how can we transform T to improve the power of the test? (b) Can T be used to generate the most powerful test? (c) How does one compare test statistics with respect to an attribute of the desired most powerful decision-making procedure? To examine these questions, we propose one-to-one mapping of the term 'Most Powerful' to the distribution properties of a given test statistic via matching characterization. This form of characterization has practical applicability and aligns well with the general principle of sufficiency. Findings indicate that to improve a given test, we can employ relevant ancillary statistics that do not have changes in their distributions with respect to tested hypotheses. As an example, the present method is illustrated by modifying the usual t-test under nonparametric settings. Numerical studies based on generated data and a real-data set confirm that the proposed approach can be useful in practice.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Exact Multivariate Two-Sample Density-Based Empirical Likelihood Ratio Tests Applicable to Retrospective and Group Sequential Studies
Authors:
Ablert Vexler,
Gregory Gurevich,
Li Zou
Abstract:
Nonparametric tests for equality of multivariate distributions are frequently desired in research. It is commonly required that test-procedures based on relatively small samples of vectors accurately control the corresponding Type I Error (TIE) rates. Often, in the multivariate testing, extensions of null-distribution-free univariate methods, e.g., Kolmogorov-Smirnov and Cramer-von Mises type sche…
▽ More
Nonparametric tests for equality of multivariate distributions are frequently desired in research. It is commonly required that test-procedures based on relatively small samples of vectors accurately control the corresponding Type I Error (TIE) rates. Often, in the multivariate testing, extensions of null-distribution-free univariate methods, e.g., Kolmogorov-Smirnov and Cramer-von Mises type schemes, are not exact, since their null distributions depend on underlying data distributions. The present paper extends the density-based empirical likelihood technique in order to nonparametrically approximate the most powerful test for the multivariate two-sample (MTS) problem, yielding an exact finite-sample test statistic. We rigorously establish and apply one-to-one-mapping between the equality of vectors distributions and the equality of distributions of relevant univariate linear projections. In this framework, we prove an algorithm that simplifies the use of projection pursuit, employing only a few of the infinitely many linear combinations of observed vectors components. The displayed distribution-free strategy is employed in retrospective and group sequential manners. The asymptotic consistency of the proposed technique is shown. Monte Carlo studies demonstrate that the proposed procedures exhibit extremely high and stable power characteristics across a variety of settings. Supplementary materials for this article are available online.
△ Less
Submitted 12 January, 2021;
originally announced January 2021.
-
An AUK-based index for measuring and testing the joint dependence of a random vector
Authors:
Georgios Afendras,
Marianthi Markatou,
Albert Vexler
Abstract:
We present an index of dependence that allows one to measure the joint or mutual dependence of a $d$-dimensional random vector with $d>2$. The index is based on a $d$-dimensional Kendall process. We further propose a standardized version of our index of dependence that is easy to interpret, and provide an algorithm for its computation. We discuss tests of total independence based on consistent est…
▽ More
We present an index of dependence that allows one to measure the joint or mutual dependence of a $d$-dimensional random vector with $d>2$. The index is based on a $d$-dimensional Kendall process. We further propose a standardized version of our index of dependence that is easy to interpret, and provide an algorithm for its computation. We discuss tests of total independence based on consistent estimates of the area under the Kendall curve. We evaluate the performance of our procedures via simulation, and apply our methods to a real data set.
△ Less
Submitted 23 December, 2020; v1 submitted 24 November, 2020;
originally announced November 2020.
-
Univariate Likelihood Projections and Characterizations of the Multivariate Normal Distribution
Authors:
Albert Vexler
Abstract:
The problem of characterizing a multivariate distribution of a random vector using examination of univariate combinations of vector components is an essential issue of multivariate analysis. The likelihood principle plays a prominent role in developing powerful statistical inference tools. In this context, we raise the question: can the univariate likelihood function based on a random vector be us…
▽ More
The problem of characterizing a multivariate distribution of a random vector using examination of univariate combinations of vector components is an essential issue of multivariate analysis. The likelihood principle plays a prominent role in developing powerful statistical inference tools. In this context, we raise the question: can the univariate likelihood function based on a random vector be used to provide the uniqueness in reconstructing the vector distribution? In multivariate normal (MN) frameworks, this question links to a reverse of Cochran's theorem that concerns the distribution of quadratic forms in normal variables. We characterize the MN distribution through the univariate likelihood type projections. The proposed principle is employed to illustrate simple techniques for assessing multivariate normality via well-known tests that use univariate observations. The presented testing strategy can exhibit high and stable power characteristics in comparison to the well-known procedures in various scenarios when observed vectors are non-MN distributed, whereas their components are normally distributed random variables. In such cases, the classical multivariate normality tests may break down completely.
KEY WORDS: Characterization, Goodness of fit, Infinity divisible, Likelihood, Multivariate normal distribution, Projection, Quadratic form, Test for multivariate normality.
△ Less
Submitted 27 October, 2019;
originally announced October 2019.
-
Multi-Panel Kendall Plot in Light of an ROC Curve Analysis Applied to Measuring Dependence
Authors:
Albert Vexler,
Georgios Afendras,
Marianthi Markatou
Abstract:
The Kendall plot ($\K$-plot) is a plot measuring dependence between the components of a bivariate random variable. The $\K$-plot graphs the Kendall distribution function against the distribution function of $VU$, where $V$ and $U$ are independent uniform $[0,1]$ random variables. We associate $\K$-plots with the receiver operating characteristic ($\ROC$) curve, a well-accepted graphical tool in bi…
▽ More
The Kendall plot ($\K$-plot) is a plot measuring dependence between the components of a bivariate random variable. The $\K$-plot graphs the Kendall distribution function against the distribution function of $VU$, where $V$ and $U$ are independent uniform $[0,1]$ random variables. We associate $\K$-plots with the receiver operating characteristic ($\ROC$) curve, a well-accepted graphical tool in biostatistics for evaluating the ability of a biomarker to discriminate between two populations. The most commonly used global index of diagnostic accuracy of biomarkers is the area under the $\ROC$ curve ($\AUC$). In parallel with the $\AUC$, we propose a novel strategy to measure the association between random variables from a continuous bivariate distribution. First, we discuss why the area under the conventional Kendall curve ($\AUK$) cannot be used as an index of dependence. We then suggest a simple and meaningful extension of the definition of the $\K$-plots and define an index of dependence that is based on $\AUK$. This measure characterizes a wide range of two-variable relationships, thereby completely detecting the underlying dependence structure. Properties of the proposed index satisfy the mathematical definition of a measure. Finally, simulations and real data examples illustrate the applicability of the proposed method.
△ Less
Submitted 21 November, 2018;
originally announced November 2018.