-
Increasing power and robustness in screening trials by testing stored specimens in the control arm
Authors:
Hormuzd A. Katki,
Li C. Cheung
Abstract:
Background: Screening trials require large sample sizes and long time-horizons to demonstrate mortality reductions. We recently proposed increasing statistical power by testing stored control-arm specimens, called the Intended Effect (IE) design. To evaluate feasibility of the IE design, the US National Cancer Institute (NCI) is collecting blood specimens in the control-arm of the NCI Vanguard Mul…
▽ More
Background: Screening trials require large sample sizes and long time-horizons to demonstrate mortality reductions. We recently proposed increasing statistical power by testing stored control-arm specimens, called the Intended Effect (IE) design. To evaluate feasibility of the IE design, the US National Cancer Institute (NCI) is collecting blood specimens in the control-arm of the NCI Vanguard Multicancer Detection pilot feasibility trial. However, key assumptions of the IE design require more investigation and relaxation. Methods: We relax the IE design to (1) reduce costs by testing only a stratified sample of control-arm specimens by incorporating inverse-probability sampling weights, (2) correct for potential loss-of-signal in stored control-arm specimens, and (3) correct for non-compliance with control-arm specimen collections. We also examine sensitivity to unintended effects of screening. Results: In simulations, testing all primary-outcome control-arm specimens and a 50% sample of the rest maintains nearly all the power of the IE while only testing half the control-arm specimens. Power remains increased from the IE analysis (versus the standard analysis) even if unintended effects exist. The IE design is robust to some loss-of-signal scenarios, but otherwise requires retest-positive fractions that correct bias at a small loss of power. The IE can be biased and lose power under control-arm non-compliance scenarios, but corrections correct bias and can increase power. Conclusions: The IE design can be made more cost-efficient and robust to loss-of-signal. Unintended effects will not typically reduce the power gain over the standard trial design. Non-compliance with control-arm specimen collections can cause bias and loss of power that can be mitigated by corrections. Although promising, practical experience with the IE design in screening trials is necessary.
△ Less
Submitted 8 November, 2024;
originally announced November 2024.
-
Representative Pure Risk Estimation by Using Data from Epidemiologic Studies, Surveys, and Registries: Estimating Risks for Minority Subgroups
Authors:
Lingxiao Wang,
Yan Li,
Barry I. Graubard,
Hormuzd A. Katki
Abstract:
Representative risk estimation is fundamental to clinical decision-making. However, risks are often estimated from non-representative epidemiologic studies, which usually underrepresent minorities. "Model-based" methods use population registries to improve externally validity of risk estimation but assume hazard ratios (HR) are generalizable from samples to the target finite population. "Pseudowei…
▽ More
Representative risk estimation is fundamental to clinical decision-making. However, risks are often estimated from non-representative epidemiologic studies, which usually underrepresent minorities. "Model-based" methods use population registries to improve externally validity of risk estimation but assume hazard ratios (HR) are generalizable from samples to the target finite population. "Pseudoweighting" methods improve representativeness of studies by using an external probability-based survey as the reference, but the resulting estimators can be biased due to propensity model misspecification or inefficient due to variable pseudoweights or small sample sizes of minorities in the cohort and/or survey. We propose a two-step pseudoweighting procedure that poststratifies the event rates among age/race/sex strata in the pseudoweighted cohort to the population rates to produce efficient and robust pure risk estimation (i.e., a cause-specific absolute risk in the absence of competing events). For developing an all-cause mortality risk model representative for the US, our findings suggest that HRs for minorities are not generalizable, and that surveys can have inadequate numbers of events for minorities. Poststratification on event rates is crucial for obtaining reliable risk estimation for minority subgroups.
△ Less
Submitted 10 April, 2023; v1 submitted 10 March, 2022;
originally announced March 2022.
-
Efficient and Robust Propensity-Score-Based Methods for Population Inference using Epidemiologic Cohorts
Authors:
Lingxiao Wang,
Barry I. Graubard,
Hormuzd A. Katki,
Yan Li
Abstract:
Most epidemiologic cohorts are composed of volunteers who do not represent the general population. To enable population inference from cohorts, we and others have proposed utilizing probability survey samples as external references to develop a propensity score (PS) for membership in the cohort versus survey. Herein we develop a unified framework for PS-based weighting (such as inverse PS weightin…
▽ More
Most epidemiologic cohorts are composed of volunteers who do not represent the general population. To enable population inference from cohorts, we and others have proposed utilizing probability survey samples as external references to develop a propensity score (PS) for membership in the cohort versus survey. Herein we develop a unified framework for PS-based weighting (such as inverse PS weighting (IPSW)) and matching methods (such as kernel-weighting (KW) method). We identify a fundamental Strong Exchangeability Assumption (SEA) underlying existing PS-based matching methods whose failure invalidates inference even if the PS-model is correctly specified. We relax the SEA to a Weak Exchangeability Assumption (WEA) for the matching method. Also, we propose IPSW.S and KW.S methods that reduce the variance of PS-based estimators by scaling the survey weights used in the PS estimation. We prove consistency of the IPSW.S and KW.S estimators of population means and prevalences under WEA, and provide asymptotic variances and consistent variance estimators. In simulations, the KW.S and IPSW.S estimators had smallest MSE. In our data example, the original KW estimates had large bias, whereas the KW.S estimates had the smallest MSE.
△ Less
Submitted 30 November, 2020;
originally announced November 2020.
-
Statistical approaches using longitudinal biomarkers for disease early detection: A comparison of methodologies
Authors:
Yongli Han,
Paul S. Albert,
Christine D. Berg,
Nicolas Wentzensen,
Hormuzd A. Katki,
Danping Liu
Abstract:
Early detection of clinical outcomes such as cancer may be predicted based on longitudinal biomarker measurements. Tracking longitudinal biomarkers as a way to identify early disease onset may help to reduce mortality from diseases like ovarian cancer that are more treatable if detected early. Two general frameworks for disease risk prediction, the shared random effects model (SREM) and the patter…
▽ More
Early detection of clinical outcomes such as cancer may be predicted based on longitudinal biomarker measurements. Tracking longitudinal biomarkers as a way to identify early disease onset may help to reduce mortality from diseases like ovarian cancer that are more treatable if detected early. Two general frameworks for disease risk prediction, the shared random effects model (SREM) and the pattern mixture model (PMM) could be used to assess longitudinal biomarkers on disease early detection. In this paper, we studied the predictive performances of SREM and PMM on disease early detection through an application to ovarian cancer, where early detection using the risk of ovarian cancer algorithm (ROCA) has been evaluated. Comparisons of the above three methods were performed via the analyses of the ovarian cancer data from the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial and extensive simulation studies. The time-dependent receiving operating characteristic (ROC) curve and its area (AUC) were used to evaluate the prediction accuracy. The out-of-sample predictive performance was calculated using leave-one-out cross-validation (LOOCV), aiming to minimize the problem of model over-fitting. A careful analysis of the use of the biomarker cancer antigen 125 for ovarian cancer early detection showed improved performance of PMM as compared with SREM and ROCA. More generally, simulation studies showed that PMM outperforms ROCA unless biomarkers are taken at very frequent screening settings.
△ Less
Submitted 21 August, 2019;
originally announced August 2019.
-
Novel decision-theoretic and risk-stratification metrics of predictive performance: Application to deciding who should undergo genetic testing
Authors:
Hormuzd A. Katki
Abstract:
Currently, women are referred for BRCA1/2 mutation-testing only if their family-history of breast/ovarian cancer implies that their risk of carrying a mutation exceeds 10\%. However, as mutation-testing costs fall, prominent voices have called for testing all women, which would strain clinical resources by testing millions of women, almost all of whom will test negative. To better evaluate risk-th…
▽ More
Currently, women are referred for BRCA1/2 mutation-testing only if their family-history of breast/ovarian cancer implies that their risk of carrying a mutation exceeds 10\%. However, as mutation-testing costs fall, prominent voices have called for testing all women, which would strain clinical resources by testing millions of women, almost all of whom will test negative. To better evaluate risk-thresholds for BRCA1/2 testing, we introduce two broadly applicable, linked metrics: Mean Risk Stratification (MRS) and a decision-theoretic metric, Net Benefit of Information (NBI). MRS and NBI provide a range of risk thresholds at which a marker/model is "optimally informative", in the sense of maximizing both MRS and NBI. NBI is a function of only MRS and the risk-threshold for action, connecting decision-theory to risk-stratification and providing a decision-theoretic rationale for MRS. AUC and Youden's index reflect on both the fraction of maximum MRS, and of maximum NBI, attained by the marker/model, providing AUC and Youden's index with long-sought decision-theoretic and risk-stratification rationale. To evaluate risk-thresholds for BRCA1/2 testing, we propose an eclectic approach considering AUC, Net Benefit, and MRS/NBI. MRS/NBI interpret AUC in the context of mutation-prevalence and provide a range of risk thresholds for which the risk model is optimally informative.
△ Less
Submitted 15 November, 2017;
originally announced November 2017.