-
Prediction of Model Generalizability for Unseen Data: Methodology and Case Study in Brain Metastases Detection in T1-Weighted Contrast-Enhanced 3D MRI
Authors:
Engin Dikici,
Xuan Nguyen,
Noah Takacs,
Luciano M. Prevedello
Abstract:
A medical AI system's generalizability describes the continuity of its performance acquired from varying geographic, historical, and methodologic settings. Previous literature on this topic has mostly focused on "how" to achieve high generalizability with limited success. Instead, we aim to understand "when" the generalizability is achieved: Our study presents a medical AI system that could estimate its generalizability status for unseen data on-the-fly. We introduce a latent space mapping (LSM) approach utilizing Frechet distance loss to force the underlying training data distribution into a multivariate normal distribution. During the deployment, a given test data's LSM distribution is processed to detect its deviation from the forced distribution; hence, the AI system could predict its generalizability status for any previously unseen data set. If low model generalizability is detected, then the user is informed by a warning message. While the approach is applicable for most classification deep neural networks, we demonstrate its application to a brain metastases (BM) detector for T1-weighted contrast-enhanced (T1c) 3D MRI. The BM detection model was trained using 175 T1c studies acquired internally, and tested using (1) 42 internally and (2) 72 externally acquired exams from the publicly distributed Brain Mets dataset provided by the Stanford University School of Medicine. Generalizability scores, false positive (FP) rates, and sensitivities of the BM detector were computed for the test datasets. The model predicted its generalizability to be low for 31% of the testing data, where it produced (1) ~13.5 FPs at 76.1% BM detection sensitivity for the low and (2) ~10.5 FPs at 89.2% BM detection sensitivity for the high generalizability groups respectively. The results suggest that the proposed formulation enables a model to predict its generalizability for unseen data.
Submitted 15 December, 2022;
originally announced December 2022.
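The deviation check described in the abstract above can be illustrated with a minimal sketch. It assumes the forced target is a standard normal and that covariances are diagonal, which reduces the squared Fréchet distance between two Gaussians to a per-dimension sum. The abstract specifies neither assumption, and the function names are hypothetical, so this is not the authors' implementation.

```python
import math

def frechet_distance_diag(mu1, var1, mu2, var2):
    """Squared Frechet distance between two Gaussians with DIAGONAL covariances.

    General form: ||mu1 - mu2||^2 + Tr(S1 + S2 - 2 (S1 S2)^(1/2)).
    With diagonal S1, S2 the trace term becomes sum((sqrt(v1) - sqrt(v2))^2).
    """
    if not (len(mu1) == len(var1) == len(mu2) == len(var2)):
        raise ValueError("all inputs must have the same dimensionality")
    mean_term = sum((a - b) ** 2 for a, b in zip(mu1, mu2))
    cov_term = sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(var1, var2))
    return mean_term + cov_term

def sample_stats(vectors):
    """Per-dimension mean and (population) variance of a list of latent vectors."""
    n, d = len(vectors), len(vectors[0])
    mu = [sum(v[i] for v in vectors) / n for i in range(d)]
    var = [sum((v[i] - mu[i]) ** 2 for v in vectors) / n for i in range(d)]
    return mu, var

def deviation_from_standard_normal(latent_vectors):
    """Distance of a test set's latent statistics from N(0, I) (assumed target)."""
    mu, var = sample_stats(latent_vectors)
    d = len(mu)
    return frechet_distance_diag(mu, var, [0.0] * d, [1.0] * d)
```

A deployment would compare the returned value against a threshold to decide whether to show the warning; how that threshold is chosen is not stated in the abstract.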
-
Advancing Brain Metastases Detection in T1-Weighted Contrast-Enhanced 3D MRI using Noisy Student-based Training
Authors:
Engin Dikici,
Xuan V. Nguyen,
Matthew Bigelow,
John L. Ryu,
Luciano M. Prevedello
Abstract:
The detection of brain metastases (BM) in their early stages could have a positive impact on the outcome of cancer patients. We previously developed a framework for detecting small BM (with diameters of less than 15mm) in T1-weighted Contrast-Enhanced 3D Magnetic Resonance images (T1c) to assist medical experts in this time-sensitive and high-stakes task. The framework utilizes a dedicated convolutional neural network (CNN) trained using labeled T1c data, where the ground truth BM segmentations were provided by a radiologist. This study aims to advance the framework with a noisy student-based self-training strategy to make use of a large corpus of unlabeled T1c data (i.e., data without BM segmentations or detections). Accordingly, the work (1) describes the student and teacher CNN architectures, (2) presents data and model noising mechanisms, and (3) introduces a novel pseudo-labeling strategy factoring in the learned BM detection sensitivity of the framework. Finally, it describes a semi-supervised learning strategy utilizing these components. We performed the validation using 217 labeled and 1247 unlabeled T1c exams via 2-fold cross-validation. The framework utilizing only the labeled exams produced 9.23 false positives for 90% BM detection sensitivity; whereas, the framework using the introduced learning strategy led to ~9% reduction in false detections (i.e., 8.44) for the same sensitivity level. Furthermore, while experiments utilizing 75% and 50% of the labeled datasets resulted in algorithm performance degradation (12.19 and 13.89 false positives respectively), the impact was less pronounced with the noisy student-based training strategy (10.79 and 12.37 false positives respectively).
Submitted 19 November, 2021; v1 submitted 10 November, 2021;
originally announced November 2021.
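The false-positive figures reported in the abstract above can be turned into relative reductions with a few lines of arithmetic. The sketch below uses only the numbers quoted in the abstract; the function and variable names are illustrative.

```python
def relative_reduction(baseline, improved):
    """Percent reduction of `improved` relative to `baseline`."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (baseline - improved) / baseline * 100.0

# False positives at 90% BM detection sensitivity, as quoted in the abstract:
# fraction of labeled data used -> (labeled only, noisy student-based training)
REPORTED_FPS = {
    1.00: (9.23, 8.44),
    0.75: (12.19, 10.79),
    0.50: (13.89, 12.37),
}

def summarize(reported):
    """Return (labeled fraction, percent FP reduction) pairs, largest fraction first."""
    rows = []
    for fraction, (supervised, noisy_student) in sorted(reported.items(), reverse=True):
        rows.append((fraction, round(relative_reduction(supervised, noisy_student), 1)))
    return rows

if __name__ == "__main__":
    for fraction, pct in summarize(REPORTED_FPS):
        print(f"{fraction:.0%} of labeled data: {pct}% fewer false positives")
```

With all labeled exams the reduction is about 8.6%, which matches the "~9%" stated in the abstract; at 75% and 50% of the labeled data it is about 11.5% and 10.9% respectively.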
-
Augmented Networks for Faster Brain Metastases Detection in T1-Weighted Contrast-Enhanced 3D MRI
Authors:
Engin Dikici,
Xuan V. Nguyen,
Matthew Bigelow,
Luciano M. Prevedello
Abstract:
Early detection of brain metastases (BM) is one of the determining factors for the successful treatment of patients with cancer; however, the accurate detection of small BM lesions (< 15mm) remains a challenging task. We previously described a framework for the detection of small BM in single-sequence gadolinium-enhanced T1-weighted 3D MRI datasets. It combined classical image processing (IP) with a dedicated convolutional neural network, taking approximately 30 seconds to process each dataset due to computation-intensive IP stages. To overcome the speed limitation, this study aims to reformulate the framework via an augmented pair of CNNs (eliminating the IP) to reduce the processing times while preserving the BM detection performance. Our previous implementation of the BM detection algorithm utilized Laplacian of Gaussians (LoG) for the candidate selection portion of the solution. In this study, we introduce a novel BM candidate detection CNN (cdCNN) to replace this classical IP stage. The network is formulated to have (1) a similar receptive field as the LoG method, and (2) a bias for the detection of BM lesion loci. The proposed CNN is later augmented with a classification CNN to perform the BM detection task. The cdCNN achieved 97.4% BM detection sensitivity when producing 60K candidates per 3D MRI dataset, while the LoG achieved 96.5% detection sensitivity with 73K candidates. The augmented BM detection framework generated on average 9.20 false-positive BM detections per patient for 90% sensitivity, which is comparable with our previous results. However, it processes each 3D data in 1.9 seconds, presenting a 93.5% reduction in the computation time.
Submitted 27 May, 2021;
originally announced May 2021.
-
Artificial Intelligence to Assist in Exclusion of Coronary Atherosclerosis during CCTA Evaluation of Chest-Pain in the Emergency Department: Preparing an Application for Real-World Use
Authors:
Richard D. White,
Barbaros S. Erdal,
Mutlu Demirer,
Vikash Gupta,
Matthew T. Bigelow,
Engin Dikici,
Sema Candemir,
Mauricio S. Galizia,
Jessica L. Carpenter,
Thomas P. O'Donnell,
Abdul H. Halabi,
Luciano M. Prevedello
Abstract:
Coronary Computed Tomography Angiography (CCTA) evaluation of chest-pain patients in an Emergency Department (ED) is considered appropriate. While a negative CCTA interpretation supports direct patient discharge from an ED, labor-intensive analyses are required, with accuracy in jeopardy from distractions. We describe the development of an Artificial Intelligence (AI) algorithm and workflow for assisting interpreting physicians in CCTA screening for the absence of coronary atherosclerosis. The two-phase approach consisted of (1) Phase 1 - focused on the development and preliminary testing of an algorithm for vessel-centerline extraction classification in a balanced study population (n = 500 with 50% disease prevalence) derived by retrospective random case selection; and (2) Phase 2 - concerned with simulated-clinical trialing of the developed algorithm on a per-case basis in a more real-world study population (n = 100 with 28% disease prevalence) from an ED chest-pain series. This allowed pre-deployment evaluation of the AI-based CCTA screening application, which provides a vessel-by-vessel graphic display of algorithm inference results integrated into a clinically capable viewer. Algorithm performance evaluation used Area Under the Receiver-Operating-Characteristic Curve (AUC-ROC); confusion matrices reflected ground-truth vs AI determinations. The vessel-based algorithm demonstrated strong performance with AUC-ROC = 0.96. In both Phase 1 and Phase 2, independent of disease prevalence differences, negative predictive values at the case level were very high at 95%. The rate of completion of the algorithm workflow process (96% with inference results in 55-80 seconds) in Phase 2 depended on adequate image quality. There is potential for this AI application to assist in CCTA interpretation to help extricate atherosclerosis from chest-pain presentations.
Submitted 10 August, 2020;
originally announced August 2020.
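The case-level negative predictive value (NPV) cited in the abstract above is the share of AI-negative cases that are truly disease-free, TN / (TN + FN). The sketch below shows the calculation; the counts in the usage example are hypothetical, chosen only to be consistent with n = 100 and 28% prevalence, and are not the study's confusion matrix.

```python
def negative_predictive_value(true_negatives, false_negatives):
    """NPV = TN / (TN + FN): fraction of negative calls that are correct."""
    total_negative_calls = true_negatives + false_negatives
    if total_negative_calls == 0:
        raise ValueError("no negative calls, NPV is undefined")
    return true_negatives / total_negative_calls

if __name__ == "__main__":
    # HYPOTHETICAL counts for a 100-case series with 28 diseased cases:
    # 72 disease-free cases -> 57 called negative (TN), 15 called positive (FP)
    # 28 diseased cases     ->  3 called negative (FN), 25 called positive (TP)
    npv = negative_predictive_value(true_negatives=57, false_negatives=3)
    print(f"NPV = {npv:.0%}")
```

Because NPV depends on prevalence as well as on sensitivity and specificity, obtaining the same 95% in both phases despite different prevalence (50% vs 28%) is a notable result rather than an automatic one.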
-
Constrained Generative Adversarial Network Ensembles for Sharable Synthetic Data Generation
Authors:
Engin Dikici,
Luciano M. Prevedello,
Matthew Bigelow,
Richard D. White,
Barbaros Selnur Erdal
Abstract:
The sharing of medical imaging datasets between institutions, and even inside the same institution, is limited by various regulations/legal barriers. Although these limitations are necessities for protecting patient privacy and setting strict boundaries for data ownership, medical research projects that require large datasets suffer considerably as a result. Machine learning has been revolutionized by the emergence of deep neural network approaches in recent years, making the data-related limitations an even larger problem, as these techniques commonly require immense imaging datasets. This paper introduces constrained Generative Adversarial Network ensembles (cGANe) to address this problem by altering the representation of the imaging data while retaining the significant information, enabling the reproduction of similar research results elsewhere with the sharable data. Accordingly, a framework representing the generation of a cGANe is described, and the approach is validated for the generation of synthetic 3D brain metastatic region data from T1-weighted contrast-enhanced MRI studies. For 90% brain metastases (BM) detection sensitivity, our previously reported detection algorithm produced on average 9.12 false-positive BM detections per patient after training with the original data, and 9.53 false positives after training with the cGANe-generated synthetic data. Although the applicability of the introduced approach needs further validation studies with a range of medical imaging data types, the results suggest that the BM-detection algorithm can achieve comparable performance by using cGANe-generated synthetic data. Hence, the generalization of the proposed approach for various modalities may occur in the near future.
Submitted 28 February, 2020;
originally announced March 2020.
-
Integrating AI into Radiology workflow: Levels of research, production, and feedback maturity
Authors:
Engin Dikici,
Matthew Bigelow,
Luciano M. Prevedello,
Richard D. White,
Barbaros Selnur Erdal
Abstract:
This report presents a roadmap for integrating Artificial Intelligence (AI)-based image analysis algorithms into existing Radiology workflows such that: (1) radiologists can significantly benefit from enhanced automation in various imaging tasks due to AI; and (2) radiologists' feedback is utilized to further improve the AI application. This is achieved by establishing three maturity levels where: (1) research enables the visualization of AI-based results/annotations by radiologists without generating new patient records; (2) production allows the AI-based system to generate results stored in an institution's Picture Archiving and Communication System; and (3) feedback equips radiologists with tools for editing the AI inference results for periodic retraining of the deployed AI systems, thereby allowing the continuous organic improvement of AI-based radiology-workflow solutions. A case study (i.e., detection of brain metastases with T1-weighted contrast-enhanced 3D MRI) illustrates the deployment details of a particular AI-based application according to the aforementioned maturity levels. It is shown that the given AI application significantly improves with the feedback coming from radiologists; the number of incorrectly detected brain metastases (false positives) decreases from 14.2 to 9.12 per patient as the number of annotated datasets increases from 93 to 217 as a result of radiologist adjudication.
Submitted 14 October, 2019;
originally announced October 2019.
-
Automated Brain Metastases Detection Framework for T1-Weighted Contrast-Enhanced 3D MRI
Authors:
Engin Dikici,
John L. Ryu,
Mutlu Demirer,
Matthew Bigelow,
Richard D. White,
Wayne Slone,
Barbaros Selnur Erdal,
Luciano M. Prevedello
Abstract:
Brain Metastases (BM) complicate 20-40% of cancer cases. BM lesions can present as punctate (1 mm) foci, requiring high-precision Magnetic Resonance Imaging (MRI) in order to prevent inadequate or delayed BM treatment. However, BM lesion detection remains challenging partly due to their structural similarities to normal structures (e.g., vasculature). We propose a BM-detection framework using a single-sequence gadolinium-enhanced T1-weighted 3D MRI dataset. The framework focuses on detection of smaller (< 15 mm) BM lesions and consists of: (1) a candidate-selection stage, using a Laplacian of Gaussian approach for highlighting parts of an MRI volume holding higher BM occurrence probabilities, and (2) a detection stage that iteratively processes cropped region-of-interest volumes centered on candidates using a custom-built 3D convolutional neural network ("CropNet"). Data is augmented extensively during training via a pipeline consisting of random gamma correction and elastic deformation stages; the framework thereby maintains its invariance for a plausible range of BM shape and intensity representations. This approach is tested using five-fold cross-validation on 217 datasets from 158 patients, with training and testing groups randomized per patient to eliminate learning bias. The BM database included lesions with a mean diameter of ~5.4 mm and a mean volume of ~160 mm³. For 90% BM-detection sensitivity, the framework produced on average 9.12 false-positive BM detections per patient (standard deviation of 3.49); for 85% sensitivity, the average number of false positives declined to 5.85. Comparative analysis showed that the framework produces BM-detection accuracy comparable with state-of-the-art approaches validated for significantly larger lesions.
Submitted 13 August, 2019;
originally announced August 2019.