-
Wilcoxon-Mann-Whitney Effects for Clustered Data: Informative Cluster Size
Authors:
Changrui Liu,
Solomon W. Harrar
Abstract:
In the clustered data setting, informative cluster size has been a focus of recent research. In the nonparametric context, the problem has been considered mainly for testing equality of distribution functions. The aim of this paper is to develop an inferential procedure for the Wilcoxon-Mann-Whitney effect (also known as the nonparametric relative effect). An unbiased estimator is provided and its asymptotic properties are investigated. The asymptotic theory is employed to develop inferential methods. While the proposed method takes information in the cluster sizes into consideration when constructing the estimator, it is equally applicable when cluster size is ignorable. Simulation results show that our method appropriately accounts for informative cluster size and generally outperforms existing methods, especially those designed under ignorable cluster sizes. The application of the method is illustrated using data from a longitudinal study of alcohol use and a periodontal study.
Submitted 13 December, 2022;
originally announced December 2022.
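For orientation, the Wilcoxon-Mann-Whitney effect named in the abstract above is commonly defined as p = P(X < Y) + 0.5 P(X = Y). The sketch below computes the simple plug-in estimate for two independent samples. It is only an illustration of the quantity: it ignores clustering and cluster size altogether, so it is not the estimator proposed in the paper, and the function name `wmw_effect` is a hypothetical one chosen for this example.

```python
from itertools import product


def wmw_effect(x, y):
    """Plug-in estimate of p = P(X < Y) + 0.5 * P(X = Y).

    Assumes two independent, unclustered samples. This is NOT the
    cluster-size-aware estimator from the paper; it only shows what
    the effect measures.
    """
    if not x or not y:
        raise ValueError("both samples must be non-empty")
    total = 0.0
    # Score every (x_i, y_j) pair: 1 if x_i < y_j, 0.5 for a tie, 0 otherwise.
    for xi, yj in product(x, y):
        if xi < yj:
            total += 1.0
        elif xi == yj:
            total += 0.5
    return total / (len(x) * len(y))


if __name__ == "__main__":
    # Values above 0.5 indicate that Y tends to be larger than X.
    print(wmw_effect([1, 2, 3], [2, 3, 4]))
```

A value of 0.5 corresponds to no tendency for either sample to produce larger observations.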
-
Nonparametric Methods for Complex Multivariate Data: Asymptotics and Small Sample Approximations
Authors:
Yue Cui,
Solomon W. Harrar
Abstract:
Quality of Life (QOL) outcomes are important in the management of chronic illnesses. In studies of the efficacy of treatments or intervention modalities, QOL scales, which are multi-dimensional constructs, are routinely used as primary endpoints. The standard data analysis strategy computes composite (average) overall and domain scores, and conducts a mixed-model analysis for evaluating efficacy or monitoring medical conditions as if these scores were on a continuous metric scale. However, assumptions of parametric models such as continuity and homoscedasticity can be violated in many cases. Furthermore, the analysis is even more challenging when there are missing values on some of the variables. In this paper, we propose a purely nonparametric approach in the sense that meaningful, yet nonparametric, effect size measures are developed. We propose an estimator for the effect size and derive its asymptotic properties. Our methods are shown to be particularly effective in the presence of some form of clustering and/or missing values. Inferential procedures are derived from the asymptotic theory. The Asthma Randomized Trial of Indoor Wood Smoke data will be used to illustrate the applications of the proposed methods. The data were collected from a three-arm randomized trial which evaluated interventions targeting biomass smoke particulate matter from older-model residential wood stoves in homes that have children with asthma.
Submitted 30 November, 2021;
originally announced December 2021.
-
Nonparametric Method for Clustered Data in Pre-Post Factorial Design
Authors:
Solomon W. Harrar,
Yue Cui
Abstract:
In repeated measures factorial designs involving clustered units, parametric methods such as linear mixed effects models are used to handle within-subject correlations. However, assumptions of these parametric models such as continuity and normality are often hard to justify. The homoscedasticity assumption is rather hard to verify in practice. Furthermore, these assumptions may not even be realistic when data are measured on a non-metric scale, as commonly happens, for example, in Quality of Life outcomes. In this article, nonparametric effect-size measures for clustered data in factorial designs with pre-post measurements will be introduced. The effect-size measures provide intuitively interpretable and informative probabilistic comparisons of treatment and time effects. The dependence among observations within a cluster can be arbitrary across treatment groups. The effect-size estimators along with their asymptotic properties for computing confidence intervals and performing hypothesis tests will be discussed. ANOVA-type statistics with $\chi^2$ approximation that retain some of the optimal asymptotic behaviors in small samples are investigated. Within each treatment group, we allow some clusters to involve observations measured in both the pre- and post-intervention periods (referred to as complete clusters), while others contain observations from either the pre- or the post-intervention period only (referred to as incomplete clusters). Our methods are shown to be particularly effective in the presence of multiple forms of clustering. The developed nonparametric methods are illustrated with data from a three-arm Randomized Trial of Indoor Wood Smoke reduction. The study considered two active treatments to improve asthma symptoms of kids living in homes that use wood stoves for heating.
Submitted 11 April, 2021;
originally announced April 2021.