-
A generalization of a U-statistics-based MCAR Test: Utilizing Partially Observed Variables
Authors:
Danijel Aleksić
Abstract:
In this paper, a generalized version of the U-statistics-based test for MCAR developed by Aleksić (2024) is presented. Like the original, the novel test tests for MCAR by calculating and combining the covariances between the response indicators and the data variables. Unlike the original, however, it is able to utilize partially observed variables, resulting in a significantly larger class of detectable alternatives. The novel test appears to be well calibrated, much better than Little's MCAR test, which was used as a benchmark. For the alternatives that were detectable by the old test, the novel test has power comparable to, although slightly lower than, that of the old test, but it still outperforms Little's test in all of the studied scenarios. For alternatives that were previously undetectable or barely detectable, the novel test performs best of the three. The novel test assumes finite fourth moments of the data, the same assumption required by Little's test. The results indicate that the novel test is more robust to violations of this assumption, although both tests have similar limitations.
Submitted 9 January, 2025;
originally announced January 2025.
-
To impute or not to? Testing multivariate normality on incomplete dataset: Revisiting the BHEP test
Authors:
Danijel Aleksić,
Bojana Milošević
Abstract:
In this paper, we focus on testing multivariate normality using the BHEP test with data that are missing completely at random. Our objective is twofold: first, to gain insight into the asymptotic behavior of BHEP test statistics under two widely used approaches for handling missing data, namely complete-case analysis and imputation, and second, to compare the power of the test statistics under these approaches. It is observed that under the imputation approach, the affine invariance of the test statistics is not preserved. To address this issue, we propose an appropriate bootstrap algorithm for approximating p-values. Extensive simulation studies demonstrate that both the mean and median imputation approaches yield greater power than testing with complete-case analysis, and they open some questions for further research.
Submitted 10 April, 2024;
originally announced April 2024.
-
A Novel Test of Missing Completely at Random: U-statistics-based Approach
Authors:
Danijel Aleksić
Abstract:
In this paper, a novel test of whether data are Missing Completely at Random is proposed. Asymptotic properties of the test are derived utilizing the theory of non-degenerate U-statistics. It is shown that the novel test statistic coincides with the well-known Little's statistic in the case of univariate nonresponse. An extensive simulation study is then conducted to examine the performance of the test in terms of the preservation of the type I error and in terms of power, under various underlying distributions, dimensions of the data and sample sizes. The performance of Little's MCAR test is used as a benchmark for the comparison. The novel test shows better performance in all of the studied scenarios, better preserving the type I error and having higher empirical power.
Submitted 29 October, 2023;
originally announced October 2023.