Skip to main content

Showing 1–15 of 15 results for author: Hemerik, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2501.16985  [pdf, other

    stat.ME

    Nonparametric methods controlling the median of the false discovery proportion

    Authors: Jesse Hemerik

    Abstract: When testing many hypotheses, often we do not have strong expectations about the directions of the effects. In some situations however, the alternative hypotheses are that the parameters lie in a certain direction or interval, and it is in fact expected that most hypotheses are false. This is often the case when researchers perform multiple noninferiority or equivalence tests, e.g. when testing fo… ▽ More

    Submitted 29 January, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

    MSC Class: 62G10

  2. arXiv:2410.02306  [pdf, ps, other

    stat.AP stat.ME

    Choosing alpha post hoc: the danger of multiple standard significance thresholds

    Authors: Jesse Hemerik, Nick W Koning

    Abstract: A fundamental assumption of classical hypothesis testing is that the significance threshold $α$ is chosen independently from the data. The validity of confidence intervals likewise relies on choosing $α$ beforehand. We point out that the independence of $α$ is guaranteed in practice because, in most fields, there exists one standard $α$ that everyone uses -- so that $α$ is automatically independen… ▽ More

    Submitted 10 March, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: Accepted for publication in Statistical Science

    MSC Class: 62A01

  3. arXiv:2401.17993   

    stat.ME

    Robust Inference for Generalized Linear Mixed Models: An Approach Based on Score Sign Flipping

    Authors: Angela Andreella, Jelle Goeman, Jesse Hemerik, Livio Finos

    Abstract: Despite the versatility of generalized linear mixed models in handling complex experimental designs, they often suffer from misspecification and convergence problems. This makes inference on the values of coefficients problematic. To address these challenges, we propose a robust extension of the score-based statistical test using sign-flipping transformations. Our approach efficiently handles with… ▽ More

    Submitted 27 March, 2025; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: The paper contains errors that we are thoroughly analyzing for a revised version, though this process requires time

  4. On the term "randomization test"

    Authors: Jesse Hemerik

    Abstract: There exists no consensus on the meaning of the term "randomization test". Contradicting uses of the term are leading to confusion, misunderstandings and indeed invalid data analyses. As we point out, a main source of the confusion is that the term was not explicitly defined when it was first used in the 1930's. Later authors made clear proposals to reach a consensus regarding the term. This resul… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    MSC Class: 62G10

    Journal ref: The American Statistician, 2024

  5. arXiv:2209.13918  [pdf, other

    stat.ME math.ST

    Inference in generalized linear models with robustness to misspecified variances

    Authors: Riccardo De Santis, Jelle J. Goeman, Jesse Hemerik, Samuel Davenport, Livio Finos

    Abstract: Generalized linear models usually assume a common dispersion parameter, an assumption that is seldom true in practice. Consequently, standard parametric methods may suffer appreciable loss of type I error control. As an alternative, we present a semi-parametric group-invariance method based on sign flipping of score contributions. Our method requires only the correct specification of the mean mode… ▽ More

    Submitted 13 September, 2024; v1 submitted 28 September, 2022; originally announced September 2022.

  6. arXiv:2208.11570  [pdf, other

    stat.ME

    Flexible control of the median of the false discovery proportion

    Authors: Jesse Hemerik, Aldo Solari, Jelle J Goeman

    Abstract: We introduce a multiple testing procedure that controls the median of the proportion of false discoveries (FDP) in a flexible way. The procedure only requires a vector of p-values as input and is comparable to the Benjamini-Hochberg method, which controls the mean of the FDP. Our method allows freely choosing one or several values of alpha after seeing the data -- unlike Benjamini-Hochberg, which… ▽ More

    Submitted 13 March, 2024; v1 submitted 24 August, 2022; originally announced August 2022.

    MSC Class: 62F03

  7. arXiv:2202.00967  [pdf, other

    stat.ME math.ST

    More Efficient Exact Group-Invariance Testing: using a Representative Subgroup

    Authors: Nick W. Koning, Jesse Hemerik

    Abstract: Non-parametric tests based on permutation, rotation or sign-flipping are examples of group-invariance tests. These tests test invariance of the null distribution under a set of transformations that has a group structure, in the algebraic sense. Such groups are often huge, which makes it computationally infeasible to test using the entire group. Hence, it is standard practice to test using a random… ▽ More

    Submitted 22 November, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    MSC Class: 62G10; 62G09

  8. arXiv:2012.00368  [pdf, other

    stat.AP

    Permutation-based true discovery proportions for functional Magnetic Resonance Imaging cluster analysis

    Authors: Angela Andreella, Jesse Hemerik, Wouter Weeda, Livio Finos, Jelle Goeman

    Abstract: We propose a permutation-based method for testing a large collection of hypotheses simultaneously. Our method provides lower bounds for the number of true discoveries in any selected subset of hypotheses. These bounds are simultaneously valid with high confidence. The methodology is particularly useful in functional Magnetic Resonance Imaging cluster analysis, where it provides a confidence statem… ▽ More

    Submitted 26 January, 2023; v1 submitted 1 December, 2020; originally announced December 2020.

  9. arXiv:2007.02844  [pdf, other

    stat.ME

    On optimal two-stage testing of multiple mediators

    Authors: Vera Djordjilović, Jesse Hemerik, Magne Thoresen

    Abstract: Mediation analysis in high-dimensional settings often involves identifying potential mediators among a large number of measured variables. For this purpose, a two-step familywise error rate procedure called ScreenMin has been recently proposed (Djordjilović et al. 2019). In ScreenMin, variables are first screened and only those that pass the screening are tested. The proposed threshold for selecti… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: 20 pages, 5 gifures

  10. arXiv:2001.01466  [pdf, ps, other

    stat.ME

    Permutation testing in high-dimensional linear models: an empirical investigation

    Authors: Jesse Hemerik, Magne Thoresen, Livio Finos

    Abstract: Permutation testing in linear models, where the number of nuisance coefficients is smaller than the sample size, is a well-studied topic. The common approach of such tests is to permute residuals after regressing on the nuisance covariates. Permutation-based tests are valuable in particular because they can be highly robust to violations of the standard linear model, such as non-normality and hete… ▽ More

    Submitted 8 October, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

    Comments: Accepted for publication in Journal of Statistical Computation and Simulation

    MSC Class: 62G09

  11. Another look at the Lady Tasting Tea and differences between permutation tests and randomization tests

    Authors: Jesse Hemerik, Jelle J. Goeman

    Abstract: The statistical literature is known to be inconsistent in the use of the terms "permutation test" and "randomization test". Several authors succesfully argue that these terms should be used to refer to two distinct classes of tests and that there are major conceptual differences between these classes. The present paper explains an important difference in mathematical reasoning between these classe… ▽ More

    Submitted 6 October, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: International Statistical Review. Early view version (2020)

    MSC Class: 62G10

  12. arXiv:1911.00862  [pdf, other

    stat.ME

    Optimal two-stage testing of multiple mediators

    Authors: Vera Djordjilović, Jesse Hemerik, Magne Thoresen

    Abstract: Mediation analysis in high-dimensional settings often involves identifying potential mediators among a large number of measured variables. For this purpose, a two step familywise error rate (FWER) procedure called ScreenMin has been recently proposed (Djordjilović et al. 2019). In ScreenMin, variables are first screened and only those that pass the screening are tested. The proposed threshold for… ▽ More

    Submitted 3 November, 2019; originally announced November 2019.

  13. Robust testing in generalized linear models by sign-flipping score contributions

    Authors: Jesse Hemerik, Jelle J Goeman, Livio Finos

    Abstract: Generalized linear models are often misspecified due to overdispersion, heteroscedasticity and ignored nuisance variables. Existing quasi-likelihood methods for testing in misspecified models often do not provide satisfactory type-I error rate control. We provide a novel semi-parametric test, based on sign-flipping individual score contributions. The tested parameter is allowed to be multi-dimensi… ▽ More

    Submitted 8 May, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: To appear in Journal of the Royal Statistical Society: Series B (Methodology). Early view version (2020)

    MSC Class: 62G10

  14. arXiv:1901.04885  [pdf, other

    math.ST stat.ME

    Only Closed Testing Procedures are Admissible for Controlling False Discovery Proportions

    Authors: Jelle Goeman, Jesse Hemerik, Aldo Solari

    Abstract: We consider the class of all multiple testing methods controlling tail probabilities of the false discovery proportion, either for one random set or simultaneously for many such sets. This class encompasses methods controlling familywise error rate, generalized familywise error rate, false discovery exceedance, joint error rate, simultaneous control of all false discovery proportions, and others,… ▽ More

    Submitted 29 April, 2022; v1 submitted 15 January, 2019; originally announced January 2019.

    MSC Class: 62F03

  15. Permutation-based simultaneous confidence bounds for the false discovery proportion

    Authors: Jesse Hemerik, Aldo Solari, Jelle J. Goeman

    Abstract: When multiple hypotheses are tested, interest is often in ensuring that the proportion of false discoveries (FDP) is small with high confidence. In this paper, confidence upper bounds for the FDP are constructed, which are simultaneous over all rejection cut-offs. In particular this allows the user to select a set of hypotheses post hoc such that the FDP lies below some constant with high confiden… ▽ More

    Submitted 16 August, 2018; originally announced August 2018.

    MSC Class: 62G09; 62H15

    Journal ref: Biometrika, 106(3):635-649, 2019