-
A Note on Implementing a Special Case of the LEAR Covariance Model in Standard Software
Authors:
Sean L. Simpson,
Min Zhu,
Keith E. Muller
Abstract:
Repeated measures analyses require proper choice of the correlation model to ensure accurate inference and optimal efficiency. The linear exponent autoregressive (LEAR) correlation model provides a flexible two-parameter correlation structure that accommodates a variety of data types in which the correlation within-sampling unit decreases exponentially in time or space. The LEAR model subsumes three classic temporal correlation structures, namely compound symmetry, continuous-time AR(1), and MA(1), while maintaining parsimony and providing appealing statistical and computational properties. It also supplies a plausible correlation structure for power analyses across many experimental designs. However, no commonly used statistical packages provide a straightforward way to implement the model, limiting its use to those with the appropriate programming skills. Here we present a reparameterization of the LEAR model that allows easily implementing it in standard software for the special case of data with equally spaced temporal or spatial intervals.
Submitted 26 July, 2017;
originally announced July 2017.
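The abstract above describes the LEAR structure only in words. As a rough illustration, the sketch below builds a LEAR correlation matrix using the form commonly given in the LEAR literature, rho ** (d_min + delta * (d_jk - d_min) / (d_max - d_min)). That formula is not stated in this listing and is an assumption here; the function name `lear_correlation` and its arguments are hypothetical, and this is not the reparameterization the paper presents.

```python
def lear_correlation(times, rho, delta):
    """Sketch of a LEAR correlation matrix as a list of lists.

    Assumed form (not taken from this listing):
        corr[j][k] = rho ** (d_min + delta * (d_jk - d_min) / (d_max - d_min))
    where d_jk is the distance between time points j and k.
    Requires at least two distinct time points.
    """
    n = len(times)
    # All pairwise distances between distinct observations.
    dists = [abs(times[j] - times[k]) for j in range(n) for k in range(j + 1, n)]
    d_min, d_max = min(dists), max(dists)
    span = d_max - d_min
    corr = [[1.0] * n for _ in range(n)]
    for j in range(n):
        for k in range(n):
            if j != k:
                d = abs(times[j] - times[k])
                # With a single distinct distance the scaled term is zero.
                scaled = (d - d_min) / span if span > 0 else 0.0
                corr[j][k] = rho ** (d_min + delta * scaled)
    return corr


equally_spaced = [0, 1, 2, 3]
ar1_like = lear_correlation(equally_spaced, rho=0.5, delta=2.0)
cs_like = lear_correlation(equally_spaced, rho=0.5, delta=0.0)
```

Under this assumed form, delta = d_max - d_min makes the exponent equal to the distance itself (continuous-time AR(1)), delta = 0 gives a constant off-diagonal correlation (compound symmetry), and large delta drives all but the nearest-neighbor correlations toward zero (MA(1)-like).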
-
Reducing decision errors in the paired comparison of the diagnostic accuracy of screening tests with Gaussian outcomes
Authors:
Brandy M. Ringham,
Todd A. Alonzo,
John T. Brinton,
Sarah M. Kreidler,
Aarti Munjal,
Keith E. Muller,
Deborah H. Glueck
Abstract:
Scientists often use a paired comparison of the areas under the receiver operating characteristic curves to decide which continuous cancer screening test has the best diagnostic accuracy. In the paired design, all participants are screened with both tests. Participants with unremarkable screening results enter a follow-up period. Participants with suspicious screening results and those who show evidence of disease during follow-up receive the gold standard test. The remaining participants are classified as non-cases, even though some may have occult disease. The standard analysis includes all study participants, which can create bias in the estimates of diagnostic accuracy. If the bias affects the area under the curve for one screening test more than the other screening test, scientists may make the wrong decision as to which screening test has better diagnostic accuracy. We describe a weighted maximum likelihood bias correction method to reduce decision errors. We assessed the ability of the bias correction method to reduce decision errors via simulation studies. The simulations compared the Type I error rate and power of the standard analysis with that of the bias-corrected analysis. The performance of the bias correction method depends on characteristics of the screening tests and the disease, and on the percentage of study participants who receive the gold standard test. In studies with a large amount of bias in the difference in the full area under the curve, the bias correction method reduces the Type I error rate and improves power for the correct decision. To determine whether bias correction is needed for a specific screening trial, we recommend the investigator conduct a simulation study using our free software.
Submitted 21 May, 2013;
originally announced May 2013.
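To make the quantity being compared concrete, the sketch below computes an empirical (Mann-Whitney) area under the ROC curve for each of two tests on the same participants and takes the difference. This is a nonparametric stand-in chosen for brevity, not the Gaussian-outcome analysis or the weighted maximum likelihood bias correction the abstract describes; the function name and the scores are hypothetical.

```python
def empirical_auc(case_scores, noncase_scores):
    """Empirical AUC: the fraction of (case, non-case) pairs in which the
    case scores higher, counting ties as one half."""
    wins = 0.0
    for c in case_scores:
        for n in noncase_scores:
            if c > n:
                wins += 1.0
            elif c == n:
                wins += 0.5
    return wins / (len(case_scores) * len(noncase_scores))


# Hypothetical paired design: every participant has a score from both tests.
test1_cases, test1_noncases = [3.0, 4.0], [1.0, 2.0, 3.5]
test2_cases, test2_noncases = [2.0, 5.0], [1.0, 3.0, 4.0]

auc1 = empirical_auc(test1_cases, test1_noncases)
auc2 = empirical_auc(test2_cases, test2_noncases)
auc_difference = auc1 - auc2
```

If participants with occult disease are placed in the non-case lists, both AUC estimates can be biased, and a bias that differs between the two tests distorts `auc_difference`, which is the decision error the abstract is concerned with.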
-
Kronecker product linear exponent AR(1) correlation structures and separability tests for multivariate repeated measures
Authors:
Sean L. Simpson,
Lloyd J. Edwards,
Martin A. Styner,
Keith E. Muller
Abstract:
Longitudinal imaging studies have moved to the forefront of medical research due to their ability to characterize spatio-temporal features of biological structures across the lifespan. Credible models of the correlations in longitudinal imaging require two or more pattern components. Valid inference requires enough flexibility of the correlation model to allow reasonable fidelity to the true pattern. On the other hand, the existence of computable estimates demands a parsimonious parameterization of the correlation structure. For many one-dimensional spatial or temporal arrays, the linear exponent autoregressive (LEAR) correlation structure meets these two opposing goals in one model. The LEAR structure is a flexible two-parameter correlation model that applies in situations in which the within-subject correlation decreases exponentially in time or space. It allows for an attenuation or acceleration of the exponential decay rate imposed by the commonly used continuous-time AR(1) structure. Here we propose the Kronecker product LEAR correlation structure for multivariate repeated measures data in which the correlation between measurements for a given subject is induced by two factors. We also provide a scientifically informed approach to assessing the adequacy of a Kronecker product LEAR model and a general unstructured Kronecker product model. The approach provides useful guidance for high dimension, low sample size data that preclude using standard likelihood based tests. Longitudinal medical imaging data of caudate morphology in schizophrenia illustrates the appeal of the Kronecker product LEAR correlation structure.
Submitted 8 November, 2012; v1 submitted 21 October, 2010;
originally announced October 2010.
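A Kronecker product correlation structure combines one correlation matrix per factor into a single matrix for all measurements on a subject. The sketch below shows only that combining step, with two small hand-written factor matrices standing in for fitted LEAR matrices; the helper name `kronecker`, the numeric values, and the choice of which factor varies slowest are illustrative assumptions.

```python
def kronecker(a, b):
    """Kronecker product of two matrices given as lists of lists."""
    rows_b, cols_b = len(b), len(b[0])
    return [
        [
            a[i // rows_b][j // cols_b] * b[i % rows_b][j % cols_b]
            for j in range(len(a[0]) * cols_b)
        ]
        for i in range(len(a) * rows_b)
    ]


# Hypothetical factor correlation matrices (two time points, three locations).
temporal = [[1.0, 0.5],
            [0.5, 1.0]]
spatial = [[1.0, 0.2, 0.04],
           [0.2, 1.0, 0.2],
           [0.04, 0.2, 1.0]]

# 6 x 6 correlation matrix for all (time, location) measurements on a subject,
# ordered with time as the slowest-varying index.
combined = kronecker(temporal, spatial)
```

Under this separable structure, the correlation between any two measurements is the product of their temporal and spatial correlations, for example `combined[0][4] == temporal[0][1] * spatial[0][1]`.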
-
On probabilities for separating sets of order statistics
Authors:
Deborah H. Glueck,
Anis Karimpour-Fard,
Jan Mandel,
Keith E. Muller
Abstract:
Consider a set of order statistics that arise from sorting samples from two different populations, each with its own, possibly different, distribution function. The probability that these order statistics fall in disjoint, ordered intervals and that, of the smallest statistics, a certain number come from the first population is given in terms of the two distribution functions. The result is applied to computing the joint probability of the number of rejections and the number of false rejections for the Benjamini-Hochberg false discovery rate procedure.
Submitted 24 June, 2007;
originally announced June 2007.
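For readers unfamiliar with the procedure named in the abstract, the sketch below implements the standard Benjamini-Hochberg step-up rule: sort the p-values, find the largest rank k with p_(k) <= alpha * k / m, and reject the k smallest. It shows only how the number of rejections arises from sorted p-values (order statistics); it does not compute the joint probabilities derived in the paper. The function name and the p-values are hypothetical.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans marking which hypotheses are rejected
    by the Benjamini-Hochberg step-up procedure at level alpha."""
    m = len(p_values)
    # Indices of the hypotheses, ordered by increasing p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff = 0
    for rank, idx in enumerate(order, start=1):
        # Keep the largest rank whose p-value is under its threshold.
        if p_values[idx] <= alpha * rank / m:
            cutoff = rank
    rejected = [False] * m
    for idx in order[:cutoff]:
        rejected[idx] = True
    return rejected


p_values = [0.039, 0.001, 0.205, 0.008, 0.041, 0.042, 0.06, 0.074, 0.212, 0.216]
rejections = benjamini_hochberg(p_values, alpha=0.05)
number_rejected = sum(rejections)
```

Because the rule depends on the p-values only through their sorted order, the number of rejections is a function of order statistics, which is the link to the abstract's result.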
-
Fast computation by block permanents of cumulative distribution functions of order statistics from several populations
Authors:
Deborah H. Glueck,
Anis Karimpour-Fard,
Jan Mandel,
Larry Hunter,
Keith E. Muller
Abstract:
The joint cumulative distribution function for order statistics arising from several different populations is given in terms of the distribution functions of the populations. The computational cost of the formula in the case of two populations is still exponential in the worst case, but it is a dramatic improvement compared to the general formula by Bapat and Beg. When only the joint distribution function of a fixed-size subset of the order statistics is needed, the complexity is polynomial in the case of two populations.
Submitted 25 May, 2007;
originally announced May 2007.