Showing 1–7 of 7 results for author: Rubin, G D

  1. arXiv:2506.04450

    cs.CR cs.AI cs.CL cs.LG

    Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification

    Authors: Payel Bhattacharjee, Fengwei Tian, Geoffrey D. Rubin, Joseph Y. Lo, Nirav Merchant, Heidi Hanson, John Gounley, Ravi Tandon

    Abstract: Purpose: This study proposes a framework for fine-tuning large language models (LLMs) with differential privacy (DP) to perform multi-abnormality classification on radiology report text. By injecting calibrated noise during fine-tuning, the framework seeks to mitigate the privacy risks associated with sensitive patient data and protect against data leakage while maintaining classification performa…

    Submitted 9 August, 2025; v1 submitted 4 June, 2025; originally announced June 2025.

    Comments: 18 pages, 5 figures, 2 tables

  2. arXiv:2506.03259

    cs.CL

    Evaluating Large Language Models for Zero-Shot Disease Labeling in CT Radiology Reports Across Organ Systems

    Authors: Michael E. Garcia-Alcoser, Mobina GhojoghNejad, Fakrul Islam Tushar, David Kim, Kyle J. Lafata, Geoffrey D. Rubin, Joseph Y. Lo

    Abstract: Purpose: This study aims to evaluate the effectiveness of large language models (LLMs) in automating disease annotation of CT radiology reports. We compare a rule-based algorithm (RBA), RadBERT, and three lightweight open-weight LLMs for multi-disease labeling of chest, abdomen, and pelvis (CAP) CT reports. Materials and Methods: This retrospective study analyzed 40,833 CT reports from 29,540 pa…

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 23 pages, 10 figures, to be submitted to Radiology: Artificial Intelligence

    ACM Class: I.2.7

  3. arXiv:2402.04419

    eess.IV cs.LG

    What limits performance of weakly supervised deep learning for chest CT classification?

    Authors: Fakrul Islam Tushar, Vincent M. D'Anniballe, Geoffrey D. Rubin, Joseph Y. Lo

    Abstract: Weakly supervised learning with noisy data has drawn attention in the medical imaging community due to the sparsity of high-quality disease labels. However, little is known about the limitations of such weakly supervised learning and the effect of these constraints on disease classification performance. In this paper, we test the effects of such weak supervision by examining model tolerance for th…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 16 pages, 8 figures. arXiv admin note: text overlap with arXiv:2202.11709

  4. arXiv:2102.02959

    cs.AI cs.CL cs.LG

    Multi-Label Annotation of Chest Abdomen Pelvis Computed Tomography Text Reports Using Deep Learning

    Authors: Vincent M. D'Anniballe, Fakrul Islam Tushar, Khrystyna Faryna, Songyue Han, Maciej A. Mazurowski, Geoffrey D. Rubin, Joseph Y. Lo

    Abstract: Purpose: To develop high throughput multi-label annotators for body (chest, abdomen, and pelvis) Computed Tomography (CT) reports that can be applied across a variety of abnormalities, organs, and disease states. Approach: We used a dictionary approach to develop rule-based algorithms (RBA) for extraction of disease labels from radiology text reports. We targeted three organ systems (lungs/pleur…

    Submitted 7 March, 2022; v1 submitted 4 February, 2021; originally announced February 2021.

  5. Weakly Supervised 3D Classification of Chest CT using Aggregated Multi-Resolution Deep Segmentation Features

    Authors: Anindo Saha, Fakrul I. Tushar, Khrystyna Faryna, Vincent M. D'Anniballe, Rui Hou, Maciej A. Mazurowski, Geoffrey D. Rubin, Joseph Y. Lo

    Abstract: Weakly supervised disease classification of CT imaging suffers from poor localization owing to case-level annotations, where even a positive scan can hold hundreds to thousands of negative slices along multiple planes. Furthermore, although deep learning segmentation and classification models extract distinctly unique combinations of anatomical features from the same target class(es), they are typ…

    Submitted 30 October, 2020; originally announced November 2020.

    Comments: Accepted to 2020 SPIE Medical Imaging: Computer-Aided Diagnosis [oral presentation]

  6. arXiv:2008.01158

    cs.CV cs.LG eess.IV

    Classification of Multiple Diseases on Body CT Scans using Weakly Supervised Deep Learning

    Authors: Fakrul Islam Tushar, Vincent M. D'Anniballe, Rui Hou, Maciej A. Mazurowski, Wanyi Fu, Ehsan Samei, Geoffrey D. Rubin, Joseph Y. Lo

    Abstract: Purpose: To design multi-disease classifiers for body CT scans for three different organ systems using automatically extracted labels from radiology text reports. Materials & Methods: This retrospective study included a total of 12,092 patients (mean age 57 ± 18; 6,172 women) for model development and testing (from 2012-2017). Rule-based algorithms were used to extract 19,225 disease labels from 1…

    Submitted 16 November, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: 22 pages, 6 figures, 2 tables; Accepted for publication at Radiology: Artificial Intelligence

  7. arXiv:2002.04752

    eess.IV cs.CV cs.LG

    Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale Chest Computed Tomography Volumes

    Authors: Rachel Lea Draelos, David Dov, Maciej A. Mazurowski, Joseph Y. Lo, Ricardo Henao, Geoffrey D. Rubin, Lawrence Carin

    Abstract: Machine learning models for radiology benefit from large-scale data sets with high quality labels for abnormalities. We curated and analyzed a chest computed tomography (CT) data set of 36,316 volumes from 19,993 unique patients. This is the largest multiply-annotated volumetric medical imaging data set reported. To annotate this data set, we developed a rule-based method for automatically extract…

    Submitted 12 October, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: 20 pages, 3 figures, 5 tables (appendices additional). Published in Medical Image Analysis (October 2020)