A cautionary tale on using imputation methods for inference in matched pairs design

Ramosaj, Burim; Amro, Lubna; Pauly, Markus

Statistics > Applications

arXiv:1806.06551 (stat)

[Submitted on 18 Jun 2018 (v1), last revised 10 Aug 2018 (this version, v2)]

Title:A cautionary tale on using imputation methods for inference in matched pairs design

Authors:Burim Ramosaj, Lubna Amro, Markus Pauly

View PDF

Abstract:Imputation procedures in biomedical fields have turned into statistical practice, since further analyses can be conducted ignoring the former presence of missing values. In particular, non-parametric imputation schemes like the random forest or a combination with the stochastic gradient boosting have shown favorable imputation performance compared to the more traditionally used MICE procedure. However, their effect on valid statistical inference has not been analyzed so far. This paper closes this gap by investigating their validity for inferring mean differences in incompletely observed pairs while opposing them to a recent approach that only works with the given observations at hand. Our findings indicate that machine learning schemes for (multiply) imputing missing values may inflate type-I-error or result in comparably low power in small to moderate matched pairs, even after modifying the test statistics using Rubin's multiple imputation rule. In addition to an extensive simulation study, an illustrative data example from a breast cancer gene study has been considered.

Subjects:	Applications (stat.AP)
Cite as:	arXiv:1806.06551 [stat.AP]
	(or arXiv:1806.06551v2 [stat.AP] for this version)
	https://doi.org/10.48550/arXiv.1806.06551

Submission history

From: Lubna Amro [view email]
[v1] Mon, 18 Jun 2018 08:48:32 UTC (661 KB)
[v2] Fri, 10 Aug 2018 13:50:20 UTC (121 KB)

Statistics > Applications

Title:A cautionary tale on using imputation methods for inference in matched pairs design

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Applications

Title:A cautionary tale on using imputation methods for inference in matched pairs design

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators