-
Generalized Estimating Equations for Hearing Loss Data with Specified Correlation Structures
Authors:
Zhuoran Wei,
Hanbing Zhu,
Sharon Curhan,
Gary Curhan,
Molin Wang
Abstract:
Due to the nature of pure-tone audiometry test, hearing loss data often has a complicated correlation structure. Generalized estimating equation (GEE) is commonly used to investigate the association between exposures and hearing loss, because it is robust to misspecification of the correlation matrix. However, this robustness typically entails a moderate loss of estimation efficiency in finite sam…
▽ More
Due to the nature of pure-tone audiometry test, hearing loss data often has a complicated correlation structure. Generalized estimating equation (GEE) is commonly used to investigate the association between exposures and hearing loss, because it is robust to misspecification of the correlation matrix. However, this robustness typically entails a moderate loss of estimation efficiency in finite samples. This paper proposes to model the correlation coefficients and use second-order generalized estimating equations to estimate the correlation parameters. In simulation studies, we assessed the finite sample performance of our proposed method and compared it with other methods, such as GEE with independent, exchangeable and unstructured correlation structures. Our method achieves an efficiency gain which is larger for the coefficients of the covariates corresponding to the within-cluster variation (e.g., ear-level covariates) than the coefficients of cluster-level covariates. The efficiency gain is also more pronounced when the within-cluster correlations are moderate to strong, or when comparing to GEE with an unstructured correlation structure. As a real-world example, we applied the proposed method to data from the Audiology Assessment Arm of the Conservation of Hearing Study, and studied the association between a dietary adherence score and hearing loss.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Estimating intracluster correlation for ordinal data
Authors:
Benjamin W. Langworthy,
Zhaoxun Hou,
Gary C. Curhan,
Sharon G. Curhan,
Molin Wang
Abstract:
Purpose: In this paper we consider the estimation of intracluster correlation for ordinal data. We focus on pure-tone audiometry hearing threshold data, where thresholds are measured in 5 decibel increments. We estimate the intracluster correlation for tests from iPhone-based hearing assessment application as a measure of test/retest reliability. Methods: We present a method to estimate the intrac…
▽ More
Purpose: In this paper we consider the estimation of intracluster correlation for ordinal data. We focus on pure-tone audiometry hearing threshold data, where thresholds are measured in 5 decibel increments. We estimate the intracluster correlation for tests from iPhone-based hearing assessment application as a measure of test/retest reliability. Methods: We present a method to estimate the intracluster correlation using mixed effects cumulative logistic and probit models, which assume the outcome data are ordinal. This contrasts with using a mixed effects linear model which assumes that the outcome data are continuous. Results: In simulation studies we show that using a mixed effects linear model to estimate the intracluster correlation for ordinal data results in a negative finite sample bias, while using mixed effects cumulative logistic or probit models reduces this bias. The estimated intracluster correlation for the iPhone-based hearing assessment application is higher when using the mixed effects cumulative logistic and probit models compared to using a mixed effects linear model. Conclusion: When data are ordinal, using mixed effects cumulative logistic or probit models reduces the bias of intracluster correlation estimates relative to using a mixed effects linear model.
△ Less
Submitted 2 November, 2022; v1 submitted 2 November, 2022;
originally announced November 2022.
-
Analytical method for detecting outlier evaluators
Authors:
Yujie Wu,
Sharon Curhan,
Bernard Rosner,
Gary Curhan,
Molin Wang
Abstract:
Epidemiologic and medical studies often rely on evaluators to obtain measurements of exposures or outcomes for study participants, and valid estimates of associations depends on the quality of data. Even though statistical methods have been proposed to adjust for measurement errors, they often rely on unverifiable assumptions and could lead to biased estimates if those assumptions are violated. Th…
▽ More
Epidemiologic and medical studies often rely on evaluators to obtain measurements of exposures or outcomes for study participants, and valid estimates of associations depends on the quality of data. Even though statistical methods have been proposed to adjust for measurement errors, they often rely on unverifiable assumptions and could lead to biased estimates if those assumptions are violated. Therefore, methods for detecting potential `outlier' evaluators are needed to improve data quality during data collection stage. In this paper, we propose a two-stage algorithm to detect `outlier' evaluators whose evaluation results tend to be higher or lower than their counterparts. In the first stage, evaluators' effects are obtained by fitting a regression model. In the second stage, hypothesis tests are performed to detect `outlier' evaluators, where we consider both the power of each hypothesis test and the false discovery rate (FDR) among all tests. We conduct an extensive simulation study to evaluate the proposed method, and illustrate the method by detecting potential `outlier' audiologists in the data collection stage for the Audiology Assessment Arm of the Conservation of Hearing Study, an epidemiologic study for examining risk factors of hearing loss in the Nurses' Health Study II. Our simulation study shows that our method not only can detect true `outlier' evaluators, but also is less likely to falsely reject true `normal' evaluators. Our two-stage `outlier' detection algorithm is a flexible approach that can effectively detect `outlier' evaluators, and thus data quality can be improved during data collection stage.
△ Less
Submitted 2 November, 2022;
originally announced November 2022.