-
Some Results on Generalized Familywise Error Rate Controlling Procedures under Dependence
Authors:
Monitirtha Dey,
Subir Kumar Bhandari
Abstract:
The topic of multiple hypotheses testing now has a potpourri of novel theories and ubiquitous applications in diverse scientific fields. However, the universal utility of this field often hinders the possibility of having a generalized theory that accommodates every scenario. This tradeoff is better reflected through the lens of dependence, a central piece behind the theoretical and applied develo…
▽ More
The topic of multiple hypotheses testing now has a potpourri of novel theories and ubiquitous applications in diverse scientific fields. However, the universal utility of this field often hinders the possibility of having a generalized theory that accommodates every scenario. This tradeoff is better reflected through the lens of dependence, a central piece behind the theoretical and applied developments of multiple testing. Although omnipresent in many scientific avenues, the nature and extent of dependence vary substantially with the context and complexity of the particular scenario. Positive dependence is the norm in testing many treatments versus a single control or in spatial statistics. On the contrary, negative dependence arises naturally in tests based on split samples and in cyclical, ordered comparisons. In GWAS, the SNP markers are generally considered to be weakly dependent. Generalized familywise error rate (k-FWER) control has been one of the prominent frequentist approaches in simultaneous inference. However, the performances of k-FWER controlling procedures are yet unexplored under different dependencies. This paper revisits the classical testing problem of normal means in different correlated frameworks. We establish upper bounds on the generalized familywise error rates under each dependence, consequently giving rise to improved testing procedures. Towards this, we present improved probability inequalities, which are of independent theoretical interest
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Asymptotics in Multiple Hypotheses Testing under Dependence: beyond Normality
Authors:
Monitirtha Dey
Abstract:
Correlated observations are ubiquitous phenomena in a plethora of scientific avenues. Tackling this dependence among test statistics has been one of the pertinent problems in simultaneous inference. However, very little literature exists that elucidates the effect of correlation on different testing procedures under general distributional assumptions. In this work, we address this gap in a unified…
▽ More
Correlated observations are ubiquitous phenomena in a plethora of scientific avenues. Tackling this dependence among test statistics has been one of the pertinent problems in simultaneous inference. However, very little literature exists that elucidates the effect of correlation on different testing procedures under general distributional assumptions. In this work, we address this gap in a unified way by considering the multiple testing problem under a general correlated framework. We establish an upper bound on the family-wise error rate(FWER) of Bonferroni's procedure for equicorrelated test statistics. Consequently, we find that for a quite general class of distributions, Bonferroni FWER asymptotically tends to zero when the number of hypotheses approaches infinity. We extend this result to general positively correlated elliptically contoured setups. We also present examples of distributions for which Bonferroni FWER has a strictly positive limit under equicorrelation. We extend the limiting zero results to the class of step-down procedures under quite general correlated setups. Specifically, the probability of rejecting at least one hypothesis approaches zero asymptotically for any step-down procedure. The results obtained in this work generalize existing results for correlated Normal test statistics and facilitate new insights into the performances of multiple testing procedures under dependence.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
Asymptotically Optimal Sequential Multiple Testing Procedures for Correlated Normal
Authors:
Monitirtha Dey,
Subir Kumar Bhandari
Abstract:
Simultaneous statistical inference has been a cornerstone in the statistics methodology literature because of its fundamental theory and paramount applications. The mainstream multiple testing literature has traditionally considered two frameworks: the sample size is deterministic, and the test statistics corresponding to different tests are independent. However, in many modern scientific avenues,…
▽ More
Simultaneous statistical inference has been a cornerstone in the statistics methodology literature because of its fundamental theory and paramount applications. The mainstream multiple testing literature has traditionally considered two frameworks: the sample size is deterministic, and the test statistics corresponding to different tests are independent. However, in many modern scientific avenues, these assumptions are often violated. There is little study that explores the multiple testing problem in a sequential framework where the test statistics corresponding to the various streams are dependent. This work fills this gap in a unified way by considering the classical means-testing problem in an equicorrelated Gaussian and sequential framework. We focus on sequential test procedures that control the type I and type II familywise error probabilities at pre-specified levels. We establish that our proposed test procedures achieve the optimal expected sample sizes under every possible signal configuration asymptotically, as the two error probabilities vanish at arbitrary rates. Towards this, we elucidate that the ratio of the expected sample size of our proposed rule and that of the classical SPRT goes to one asymptotically, thus illustrating their connection. Generalizing this, we show that our proposed procedures, with appropriately adjusted critical values, are asymptotically optimal for controlling any multiple testing error metric lying between multiples of FWER in a certain sense. This class of metrics includes FDR/FNR, pFDR/pFNR, the per-comparison and per-family error rates, and the false positive rate.
△ Less
Submitted 20 March, 2025; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Demand Analysis with a Thin Price Sample
Authors:
Monitirtha Dey,
Arpan Kumar,
Diganta Mukherjee
Abstract:
For about 125 items of food, the Consumer Expenditure Survey (CES) schedule of the Indian National Sample Survey asks the interviewer to obtain both quantity and value of household consumption during the reference period from the respondent. This would appear to put a great burden on the respondent. But it is likely that the price usually paid is almost the same within each first stage unit (fsu).…
▽ More
For about 125 items of food, the Consumer Expenditure Survey (CES) schedule of the Indian National Sample Survey asks the interviewer to obtain both quantity and value of household consumption during the reference period from the respondent. This would appear to put a great burden on the respondent. But it is likely that the price usually paid is almost the same within each first stage unit (fsu). The present work proposes a new sampling scheme to estimate demand elasticities of essential food items. While the conventional sampling method used in practice (e.g. in NSS consumer expenditure survey) involves seeking price information from many households sampled from a fsu, the proposed procedure involves only one household chosen randomly from every fsu for price data collection and thus requires much less interview burden. Using unit records for vegetable items in the NSS's 2011-12 CES, our results show that in spite of requiring much less data, the new scheme captures the household food consumption behavior as precisely as before.
△ Less
Submitted 19 June, 2022;
originally announced June 2022.