Skip to main content

Showing 1–38 of 38 results for author: Gichoya, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.16047  [pdf

    cs.CV cs.AI

    Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis

    Authors: Frank Li, Hari Trivedi, Bardia Khosravi, Theo Dapamede, Mohammadreza Chavoshi, Abdulhameed Dere, Rohan Satya Isaac, Aawez Mansuri, Janice Newsome, Saptarshi Purkayastha, Judy Gichoya

    Abstract: Foundation models, trained on vast amounts of data using self-supervised techniques, have emerged as a promising frontier for advancing artificial intelligence (AI) applications in medicine. This study evaluates three different vision-language foundation models (RAD-DINO, CheXagent, and BiomedCLIP) on their ability to capture fine-grained imaging features for radiology tasks. The models were asses… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  2. arXiv:2504.05636  [pdf, other

    eess.IV cs.CV cs.LG

    A Multi-Modal AI System for Screening Mammography: Integrating 2D and 3D Imaging to Improve Breast Cancer Detection in a Prospective Clinical Study

    Authors: Jungkyu Park, Jan Witowski, Yanqi Xu, Hari Trivedi, Judy Gichoya, Beatrice Brown-Mulry, Malte Westerhoff, Linda Moy, Laura Heacock, Alana Lewin, Krzysztof J. Geras

    Abstract: Although digital breast tomosynthesis (DBT) improves diagnostic performance over full-field digital mammography (FFDM), false-positive recalls remain a concern in breast cancer screening. We developed a multi-modal artificial intelligence system integrating FFDM, synthetic mammography, and DBT to provide breast-level predictions and bounding-box localizations of suspicious findings. Our AI system,… ▽ More

    Submitted 11 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  3. arXiv:2503.14550  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Novel AI-Based Quantification of Breast Arterial Calcification to Predict Cardiovascular Risk

    Authors: Theodorus Dapamede, Aisha Urooj, Vedant Joshi, Gabrielle Gershon, Frank Li, Mohammadreza Chavoshi, Beatrice Brown-Mulry, Rohan Satya Isaac, Aawez Mansuri, Chad Robichaux, Chadi Ayoub, Reza Arsanjani, Laurence Sperling, Judy Gichoya, Marly van Assen, Charles W. ONeill, Imon Banerjee, Hari Trivedi

    Abstract: Women are underdiagnosed and undertreated for cardiovascular disease. Automatic quantification of breast arterial calcification on screening mammography can identify women at risk for cardiovascular disease and enable earlier treatment and management of disease. In this retrospective study of 116,135 women from two healthcare systems, a transformer-based neural network quantified BAC severity (no… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  4. arXiv:2503.13581  [pdf, other

    eess.IV cs.CV

    Subgroup Performance of a Commercial Digital Breast Tomosynthesis Model for Breast Cancer Detection

    Authors: Beatrice Brown-Mulry, Rohan Satya Isaac, Sang Hyup Lee, Ambika Seth, KyungJee Min, Theo Dapamede, Frank Li, Aawez Mansuri, MinJae Woo, Christian Allison Fauria-Robinson, Bhavna Paryani, Judy Wawira Gichoya, Hari Trivedi

    Abstract: While research has established the potential of AI models for mammography to improve breast cancer screening outcomes, there have not been any detailed subgroup evaluations performed to assess the strengths and weaknesses of commercial models for digital breast tomosynthesis (DBT) imaging. This study presents a granular evaluation of the Lunit INSIGHT DBT model on a large retrospective cohort of 1… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: 14 pages, 7 figures (plus 7 figures in supplement), 3 tables (plus 1 table in supplement)

  5. arXiv:2501.10727  [pdf, other

    cs.CV cs.AI eess.IV

    In the Picture: Medical Imaging Datasets, Artifacts, and their Living Review

    Authors: Amelia Jiménez-Sánchez, Natalia-Rozalia Avlona, Sarah de Boer, Víctor M. Campello, Aasa Feragen, Enzo Ferrante, Melanie Ganz, Judy Wawira Gichoya, Camila González, Steff Groefsema, Alessa Hering, Adam Hulman, Leo Joskowicz, Dovile Juodelyte, Melih Kandemir, Thijs Kooi, Jorge del Pozo Lérida, Livie Yumeng Li, Andre Pacheco, Tim Rädsch, Mauricio Reyes, Théo Sourget, Bram van Ginneken, David Wen, Nina Weng , et al. (4 additional authors not shown)

    Abstract: Datasets play a critical role in medical imaging research, yet issues such as label quality, shortcuts, and metadata are often overlooked. This lack of attention may harm the generalizability of algorithms and, consequently, negatively impact patient outcomes. While existing medical imaging literature reviews mostly focus on machine learning (ML) methods, with only a few focusing on datasets for s… ▽ More

    Submitted 2 June, 2025; v1 submitted 18 January, 2025; originally announced January 2025.

    Comments: ACM Conference on Fairness, Accountability, and Transparency - FAccT 2025

  6. arXiv:2411.10091  [pdf

    cs.HC cs.AI

    AI and the Future of Work in Africa White Paper

    Authors: Jacki O'Neill, Vukosi Marivate, Barbara Glover, Winnie Karanu, Girmaw Abebe Tadesse, Akua Gyekye, Anne Makena, Wesley Rosslyn-Smith, Matthew Grollnek, Charity Wayua, Rehema Baguma, Angel Maduke, Sarah Spencer, Daniel Kandie, Dennis Ndege Maari, Natasha Mutangana, Maxamed Axmed, Nyambura Kamau, Muhammad Adamu, Frank Swaniker, Brian Gatuguti, Jonathan Donner, Mark Graham, Janet Mumo, Caroline Mbindyo , et al. (50 additional authors not shown)

    Abstract: This white paper is the output of a multidisciplinary workshop in Nairobi (Nov 2023). Led by a cross-organisational team including Microsoft Research, NEPAD, Lelapa AI, and University of Oxford. The workshop brought together diverse thought-leaders from various sectors and backgrounds to discuss the implications of Generative AI for the future of work in Africa. Discussions centred around four key… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

  7. arXiv:2411.00866  [pdf

    cs.CV cs.AI

    Emory Knee Radiograph (MRKR) Dataset

    Authors: Brandon Price, Jason Adleberg, Kaesha Thomas, Zach Zaiman, Aawez Mansuri, Beatrice Brown-Mulry, Chima Okecheukwu, Judy Gichoya, Hari Trivedi

    Abstract: The Emory Knee Radiograph (MRKR) dataset is a large, demographically diverse collection of 503,261 knee radiographs from 83,011 patients, 40% of which are African American. This dataset provides imaging data in DICOM format along with detailed clinical information, including patient-reported pain scores, diagnostic codes, and procedural codes, which are not commonly available in similar datasets.… ▽ More

    Submitted 30 October, 2024; originally announced November 2024.

    Comments: 16 pages

  8. arXiv:2405.05049  [pdf

    cs.CL

    Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources

    Authors: Lasse Hyldig Hansen, Nikolaj Andersen, Jack Gallifant, Liam G. McCoy, James K Stone, Nura Izath, Marcela Aguirre-Jerez, Danielle S Bitterman, Judy Gichoya, Leo Anthony Celi

    Abstract: Background Advancements in Large Language Models (LLMs) hold transformative potential in healthcare, however, recent work has raised concern about the tendency of these models to produce outputs that display racial or gender biases. Although training data is a likely source of such biases, exploration of disease and demographic associations in text data at scale has been limited. Methods We cond… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  9. arXiv:2312.14804  [pdf

    cs.CY

    Using large language models to promote health equity

    Authors: Emma Pierson, Divya Shanmugam, Rajiv Movva, Jon Kleinberg, Monica Agrawal, Mark Dredze, Kadija Ferryman, Judy Wawira Gichoya, Dan Jurafsky, Pang Wei Koh, Karen Levy, Sendhil Mullainathan, Ziad Obermeyer, Harini Suresh, Keyon Vafa

    Abstract: Advances in large language models (LLMs) have driven an explosion of interest about their societal impacts. Much of the discourse around how they will impact social equity has been cautionary or negative, focusing on questions like "how might LLMs be biased and how would we mitigate those biases?" This is a vital discussion: the ways in which AI generally, and LLMs specifically, can entrench biase… ▽ More

    Submitted 6 January, 2025; v1 submitted 22 December, 2023; originally announced December 2023.

  10. arXiv:2312.12442  [pdf

    cs.CV cs.AI

    Hierarchical Classification System for Breast Cancer Specimen Report (HCSBC) -- an end-to-end model for characterizing severity and diagnosis

    Authors: Thiago Santos, Harish Kamath, Christopher R. McAdams, Mary S. Newell, Marina Mosunjac, Gabriela Oprea-Ilies, Geoffrey Smith, Constance Lehman, Judy Gichoya, Imon Banerjee, Hari Trivedi

    Abstract: Automated classification of cancer pathology reports can extract information from unstructured reports and categorize each report into structured diagnosis and severity categories. Thus, such system can reduce the burden for populating tumor registries, help registration for clinical trial as well as developing large dataset for deep learning model development using true pathologic ground truth. H… ▽ More

    Submitted 2 November, 2023; originally announced December 2023.

  11. arXiv:2312.10083  [pdf

    cs.CY cs.AI cs.CV cs.LG

    The Limits of Fair Medical Imaging AI In The Wild

    Authors: Yuzhe Yang, Haoran Zhang, Judy W Gichoya, Dina Katabi, Marzyeh Ghassemi

    Abstract: As artificial intelligence (AI) rapidly approaches human-level performance in medical imaging, it is crucial that it does not exacerbate or propagate healthcare disparities. Prior research has established AI's capacity to infer demographic data from chest X-rays, leading to a key concern: do models using demographic shortcuts have unfair predictions across subpopulations? In this study, we conduct… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Code and data are available at https://github.com/YyzHarry/shortcut-ood-fairness

  12. arXiv:2311.12560  [pdf

    cs.CV

    Benchmarking bias: Expanding clinical AI model card to incorporate bias reporting of social and non-social factors

    Authors: Carolina A. M. Heming, Mohamed Abdalla, Shahram Mohanna, Monish Ahluwalia, Linglin Zhang, Hari Trivedi, MinJae Woo, Benjamin Fine, Judy Wawira Gichoya, Leo Anthony Celi, Laleh Seyyed-Kalantari

    Abstract: Clinical AI model reporting cards should be expanded to incorporate a broad bias reporting of both social and non-social factors. Non-social factors consider the role of other factors, such as disease dependent, anatomic, or instrument factors on AI model bias, which are essential to ensure safe deployment.

    Submitted 2 July, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  13. Synthetically Enhanced: Unveiling Synthetic Data's Potential in Medical Imaging Research

    Authors: Bardia Khosravi, Frank Li, Theo Dapamede, Pouria Rouzrokh, Cooper U. Gamble, Hari M. Trivedi, Cody C. Wyles, Andrew B. Sellergren, Saptarshi Purkayastha, Bradley J. Erickson, Judy W. Gichoya

    Abstract: Chest X-rays (CXR) are essential for diagnosing a variety of conditions, but when used on new populations, model generalizability issues limit their efficacy. Generative AI, particularly denoising diffusion probabilistic models (DDPMs), offers a promising approach to generating synthetic images, enhancing dataset diversity. This study investigates the impact of synthetic data supplementation on th… ▽ More

    Submitted 7 July, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  14. arXiv:2309.12325  [pdf

    cs.CY cs.AI cs.CV cs.LG

    FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare

    Authors: Karim Lekadir, Aasa Feragen, Abdul Joseph Fofanah, Alejandro F Frangi, Alena Buyx, Anais Emelie, Andrea Lara, Antonio R Porras, An-Wen Chan, Arcadi Navarro, Ben Glocker, Benard O Botwe, Bishesh Khanal, Brigit Beger, Carol C Wu, Celia Cintas, Curtis P Langlotz, Daniel Rueckert, Deogratias Mzurikwao, Dimitrios I Fotiadis, Doszhan Zhussupov, Enzo Ferrante, Erik Meijering, Eva Weicken, Fabio A González , et al. (95 additional authors not shown)

    Abstract: Despite major advances in artificial intelligence (AI) for medicine and healthcare, the deployment and adoption of AI technologies remain limited in real-world clinical practice. In recent years, concerns have been raised about the technical, clinical, ethical and legal risks associated with medical AI. To increase real world adoption, it is essential that medical AI tools are trusted and accepted… ▽ More

    Submitted 8 July, 2024; v1 submitted 11 August, 2023; originally announced September 2023.

    ACM Class: I.2.0; I.4.0; I.5.0

  15. arXiv:2305.04422  [pdf

    eess.IV cs.CV cs.CY cs.LG

    Multivariate Analysis on Performance Gaps of Artificial Intelligence Models in Screening Mammography

    Authors: Linglin Zhang, Beatrice Brown-Mulry, Vineela Nalla, InChan Hwang, Judy Wawira Gichoya, Aimilia Gastounioti, Imon Banerjee, Laleh Seyyed-Kalantari, MinJae Woo, Hari Trivedi

    Abstract: Although deep learning models for abnormality classification can perform well in screening mammography, the demographic, imaging, and clinical characteristics associated with increased risk of model failure remain unclear. This retrospective study uses the Emory BrEast Imaging Dataset(EMBED) containing mammograms from 115931 patients imaged at Emory Healthcare between 2013-2020, with BI-RADS asses… ▽ More

    Submitted 19 October, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 29 pages, 6 tables, 7 figures, 2 supplemental tables

  16. arXiv:2303.10473  [pdf

    cs.CR cs.CV eess.IV

    Report of the Medical Image De-Identification (MIDI) Task Group -- Best Practices and Recommendations

    Authors: David A. Clunie, Adam Flanders, Adam Taylor, Brad Erickson, Brian Bialecki, David Brundage, David Gutman, Fred Prior, J Anthony Seibert, John Perry, Judy Wawira Gichoya, Justin Kirby, Katherine Andriole, Luke Geneslaw, Steve Moore, TJ Fitzgerald, Wyatt Tellis, Ying Xiao, Keyvan Farahani

    Abstract: This report addresses the technical aspects of de-identification of medical images of human subjects and biospecimens, such that re-identification risk of ethical, moral, and legal concern is sufficiently reduced to allow unrestricted public sharing for any purpose, regardless of the jurisdiction of the source and distribution sites. All medical images, regardless of the mode of acquisition, are c… ▽ More

    Submitted 16 March, 2025; v1 submitted 18 March, 2023; originally announced March 2023.

    Comments: 138 pages

  17. arXiv:2303.10338  [pdf

    cs.AI cs.HC

    A general-purpose AI assistant embedded in an open-source radiology information system

    Authors: Saptarshi Purkayastha, Rohan Isaac, Sharon Anthony, Shikhar Shukla, Elizabeth A. Krupinski, Joshua A. Danish, Judy W. Gichoya

    Abstract: Radiology AI models have made significant progress in near-human performance or surpassing it. However, AI model's partnership with human radiologist remains an unexplored challenge due to the lack of health information standards, contextual and workflow differences, and data labeling variations. To overcome these challenges, we integrated an AI model service that uses DICOM standard SR annotation… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Full research paper version of the demo paper accepted at the AIME 2023 - 21st International Conference of Artificial Intelligence in Medicine

  18. arXiv:2211.06925  [pdf, other

    cs.CV cs.LG

    Early Diagnosis of Chronic Obstructive Pulmonary Disease from Chest X-Rays using Transfer Learning and Fusion Strategies

    Authors: Ryan Wang, Li-Ching Chen, Lama Moukheiber, Mira Moukheiber, Dana Moukheiber, Zach Zaiman, Sulaiman Moukheiber, Tess Litchman, Kenneth Seastedt, Hari Trivedi, Rebecca Steinberg, Po-Chih Kuo, Judy Gichoya, Leo Anthony Celi

    Abstract: Chronic obstructive pulmonary disease (COPD) is one of the most common chronic illnesses in the world and the third leading cause of mortality worldwide. It is often underdiagnosed or not diagnosed until later in the disease course. Spirometry tests are the gold standard for diagnosing COPD but can be difficult to obtain, especially in resource-poor countries. Chest X-rays (CXRs), however, are rea… ▽ More

    Submitted 13 November, 2022; originally announced November 2022.

    Comments: 15 pages, 12 figures

  19. arXiv:2208.00475  [pdf, other

    cs.CV

    Augmenting Vision Language Pretraining by Learning Codebook with Visual Semantics

    Authors: Xiaoyuan Guo, Jiali Duan, C. -C. Jay Kuo, Judy Wawira Gichoya, Imon Banerjee

    Abstract: Language modality within the vision language pretraining framework is innately discretized, endowing each word in the language vocabulary a semantic meaning. In contrast, visual modality is inherently continuous and high-dimensional, which potentially prohibits the alignment as well as fusion between vision and language modalities. We therefore propose to "discretize" the visual representation by… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: 7 pages, 4 figures, ICPR2022. arXiv admin note: text overlap with arXiv:2203.00048

  20. arXiv:2207.00066  [pdf

    cs.LG cs.AI math.NA

    Advances in Prediction of Readmission Rates Using Long Term Short Term Memory Networks on Healthcare Insurance Data

    Authors: Shuja Khalid, Francisco Matos, Ayman Abunimer, Joel Bartlett, Richard Duszak, Michal Horny, Judy Gichoya, Imon Banerjee, Hari Trivedi

    Abstract: 30-day hospital readmission is a long standing medical problem that affects patients' morbidity and mortality and costs billions of dollars annually. Recently, machine learning models have been created to predict risk of inpatient readmission for patients with specific diseases, however no model exists to predict this risk across all patients. We developed a bi-directional Long Short Term Memory (… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: 7 pages, 3 figures, 3 tables

  21. arXiv:2204.07824  [pdf, other

    eess.IV cs.CV

    Few-Shot Transfer Learning to improve Chest X-Ray pathology detection using limited triplets

    Authors: Ananth Reddy Bhimireddy, John Lee Burns, Saptarshi Purkayastha, Judy Wawira Gichoya

    Abstract: Deep learning approaches applied to medical imaging have reached near-human or better-than-human performance on many diagnostic tasks. For instance, the CheXpert competition on detecting pathologies in chest x-rays has shown excellent multi-class classification performance. However, training and validating deep learning models require extensive collections of images and still produce false inferen… ▽ More

    Submitted 16 April, 2022; originally announced April 2022.

  22. arXiv:2204.03074  [pdf, other

    cs.CV

    OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System

    Authors: Xiaoyuan Guo, Jiali Duan, Saptarshi Purkayastha, Hari Trivedi, Judy Wawira Gichoya, Imon Banerjee

    Abstract: Improving the retrieval relevance on noisy datasets is an emerging need for the curation of a large-scale clean dataset in the medical domain. While existing methods can be applied for class-wise retrieval (aka. inter-class), they cannot distinguish the granularity of likeness within the same class (aka. intra-class). The problem is exacerbated on medical external datasets, where noisy samples of… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 12 pages, 6 figures, 2 tables

  23. arXiv:2202.04073  [pdf

    eess.IV cs.CV cs.LG

    The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular Dataset of 3.5M Screening and Diagnostic Mammograms

    Authors: Jiwoong J. Jeong, Brianna L. Vey, Ananth Reddy, Thomas Kim, Thiago Santos, Ramon Correa, Raman Dutt, Marina Mosunjac, Gabriela Oprea-Ilies, Geoffrey Smith, Minjae Woo, Christopher R. McAdams, Mary S. Newell, Imon Banerjee, Judy Gichoya, Hari Trivedi

    Abstract: Developing and validating artificial intelligence models in medical imaging requires datasets that are large, granular, and diverse. To date, the majority of publicly available breast imaging datasets lack in one or more of these areas. Models trained on these data may therefore underperform on patient populations or pathologies that have not previously been encountered. The EMory BrEast imaging D… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  24. arXiv:2202.01863  [pdf

    eess.IV cs.CV cs.LG

    Best Practices and Scoring System on Reviewing A.I. based Medical Imaging Papers: Part 1 Classification

    Authors: Timothy L. Kline, Felipe Kitamura, Ian Pan, Amine M. Korchi, Neil Tenenholtz, Linda Moy, Judy Wawira Gichoya, Igor Santos, Steven Blumer, Misha Ysabel Hwang, Kim-Ann Git, Abishek Shroff, Elad Walach, George Shih, Steve Langer

    Abstract: With the recent advances in A.I. methodologies and their application to medical imaging, there has been an explosion of related research programs utilizing these techniques to produce state-of-the-art classification performance. Ultimately, these research programs culminate in submission of their work for consideration in peer reviewed journals. To date, the criteria for acceptance vs. rejection i… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  25. arXiv:2112.13885  [pdf, other

    eess.IV cs.CV

    MedShift: identifying shift data for medical dataset curation

    Authors: Xiaoyuan Guo, Judy Wawira Gichoya, Hari Trivedi, Saptarshi Purkayastha, Imon Banerjee

    Abstract: To curate a high-quality dataset, identifying data variance between the internal and external sources is a fundamental and crucial step. However, methods to detect shift or variance in data have not been significantly researched. Challenges to this are the lack of effective approaches to learn dense representation of a dataset and difficulties of sharing private data across medical institutions. T… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: 35 pages, 28 figures, 2 tables

  26. arXiv:2111.08711  [pdf, other

    eess.IV cs.CV cs.LG

    Two-step adversarial debiasing with partial learning -- medical image case-studies

    Authors: Ramon Correa, Jiwoong Jason Jeong, Bhavik Patel, Hari Trivedi, Judy W. Gichoya, Imon Banerjee

    Abstract: The use of artificial intelligence (AI) in healthcare has become a very active research area in the last few years. While significant progress has been made in image classification tasks, only a few AI methods are actually being deployed in hospitals. A major hurdle in actively using clinical AI models currently is the trustworthiness of these models. More often than not, these complex models are… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

  27. arXiv:2110.15811  [pdf, other

    eess.IV cs.CV

    CVAD: A generic medical anomaly detector based on Cascade VAE

    Authors: Xiaoyuan Guo, Judy Wawira Gichoya, Saptarshi Purkayastha, Imon Banerjee

    Abstract: Detecting out-of-distribution (OOD) samples in medical imaging plays an important role for downstream medical diagnosis. However, existing OOD detectors are demonstrated on natural images composed of inter-classes and have difficulty generalizing to medical images. The key issue is the granularity of OOD data in the medical domain, where intra-class OOD samples are predominant. We focus on the gen… ▽ More

    Submitted 26 January, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

    Comments: 6 pages, 4 figures, 4 tables

  28. arXiv:2108.00117  [pdf, other

    cs.CV

    Margin-Aware Intra-Class Novelty Identification for Medical Images

    Authors: Xiaoyuan Guo, Judy Wawira Gichoya, Saptarshi Purkayastha, Imon Banerjee

    Abstract: Traditional anomaly detection methods focus on detecting inter-class variations while medical image novelty identification is inherently an intra-class detection problem. For example, a machine learning model trained with normal chest X-ray and common lung abnormalities, is expected to discover and flag idiopathic pulmonary fibrosis which a rare lung disease and unseen by the model during training… ▽ More

    Submitted 22 January, 2022; v1 submitted 30 July, 2021; originally announced August 2021.

    Comments: 35 pages, 8 figures

    Journal ref: Journal of Medical Imaging 2022

  29. arXiv:2107.10356  [pdf

    cs.CV cs.CY eess.IV

    Reading Race: AI Recognises Patient's Racial Identity In Medical Images

    Authors: Imon Banerjee, Ananth Reddy Bhimireddy, John L. Burns, Leo Anthony Celi, Li-Ching Chen, Ramon Correa, Natalie Dullerud, Marzyeh Ghassemi, Shih-Cheng Huang, Po-Chih Kuo, Matthew P Lungren, Lyle Palmer, Brandon J Price, Saptarshi Purkayastha, Ayis Pyrros, Luke Oakden-Rayner, Chima Okechukwu, Laleh Seyyed-Kalantari, Hari Trivedi, Ryan Wang, Zachary Zaiman, Haoran Zhang, Judy W Gichoya

    Abstract: Background: In medical imaging, prior studies have demonstrated disparate AI performance by race, yet there is no known correlation for race on medical imaging that would be obvious to the human expert interpreting the images. Methods: Using private and public datasets we evaluate: A) performance quantification of deep learning models to detect race from medical images, including the ability of… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    MSC Class: 68-XX ACM Class: I.2

  30. arXiv:2106.02118  [pdf

    eess.IV cs.CV cs.LG

    A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals

    Authors: Ju Sun, Le Peng, Taihui Li, Dyah Adila, Zach Zaiman, Genevieve B. Melton, Nicholas Ingraham, Eric Murray, Daniel Boley, Sean Switzer, John L. Burns, Kun Huang, Tadashi Allen, Scott D. Steenburg, Judy Wawira Gichoya, Erich Kummerfeld, Christopher Tignanelli

    Abstract: Importance: An artificial intelligence (AI)-based model to predict COVID-19 likelihood from chest x-ray (CXR) findings can serve as an important adjunct to accelerate immediate clinical decision making and improve clinical decision making. Despite significant efforts, many limitations and biases exist in previously developed AI diagnostic models for COVID-19. Utilizing a large set of local and int… ▽ More

    Submitted 6 June, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: Check out the medRxiv version at https://doi.org/10.1101/2021.06.04.21258316 for updates

  31. arXiv:2007.05786  [pdf, other

    cs.CV cs.LG

    Generalization of Deep Convolutional Neural Networks -- A Case-study on Open-source Chest Radiographs

    Authors: Nazanin Mashhaditafreshi, Amara Tariq, Judy Wawira Gichoya, Imon Banerjee

    Abstract: Deep Convolutional Neural Networks (DCNNs) have attracted extensive attention and been applied in many areas, including medical image analysis and clinical diagnosis. One major challenge is to conceive a DCNN model with remarkable performance on both internal and external data. We demonstrate that DCNNs may not generalize to new data, but increasing the quality and heterogeneity of the training da… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

  32. arXiv:2007.02124  [pdf

    cs.IR cs.CY

    A Modern Non-SQL Approach to Radiology-Centric Search Engine Design with Clinical Validation

    Authors: Ningcheng Li, Guy Maresh, Maxwell Cretcher, Khashayar Farsad, Ramsey Al-Hakim, John Kaufman, Judy Gichoya

    Abstract: Healthcare data is increasing in size at an unprecedented speed with much attention on big data analysis and Artificial Intelligence application for quality assurance, clinical training, severity triaging, and decision support. Radiology is well-suited for innovation given its intrinsically paired linguistic and visual data. Previous attempts to unlock this information goldmine were encumbered by… ▽ More

    Submitted 4 July, 2020; originally announced July 2020.

  33. arXiv:2006.13262  [pdf

    eess.IV cs.CV cs.LG

    Was there COVID-19 back in 2012? Challenge for AI in Diagnosis with Similar Indications

    Authors: Imon Banerjee, Priyanshu Sinha, Saptarshi Purkayastha, Nazanin Mashhaditafreshi, Amara Tariq, Jiwoong Jeong, Hari Trivedi, Judy W. Gichoya

    Abstract: Purpose: Since the recent COVID-19 outbreak, there has been an avalanche of research papers applying deep learning based image processing to chest radiographs for detection of the disease. To test the performance of the two top models for CXR COVID-19 diagnosis on external datasets to assess model generalizability. Methods: In this paper, we present our argument regarding the efficiency and applic… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

  34. arXiv:2005.12378  [pdf, ps, other

    cs.CY

    Artificial Intelligence for Global Health: Learning From a Decade of Digital Transformation in Health Care

    Authors: Varoon Mathur, Saptarshi Purkayastha, Judy Wawira Gichoya

    Abstract: The health needs of those living in resource-limited settings are a vastly overlooked and understudied area in the intersection of machine learning (ML) and health care. While the use of ML in health care is more recently popularized over the last few years from the advancement of deep learning, low-and-middle income countries (LMICs) have already been undergoing a digital transformation of their… ▽ More

    Submitted 27 May, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted Paper at ICLR 2020 Workshop on Practical ML for Developing Countries

  35. arXiv:2004.07965  [pdf, other

    eess.IV cs.CV cs.LG

    A DICOM Framework for Machine Learning Pipelines against Real-Time Radiology Images

    Authors: Pradeeban Kathiravelu, Puneet Sharma, Ashish Sharma, Imon Banerjee, Hari Trivedi, Saptarshi Purkayastha, Priyanshu Sinha, Alexandre Cadrin-Chenevert, Nabile Safdar, Judy Wawira Gichoya

    Abstract: Executing machine learning (ML) pipelines in real-time on radiology images is hard due to the limited computing resources in clinical environments and the lack of efficient data transfer capabilities to run them on research clusters. We propose Niffler, an integrated framework that enables the execution of ML pipelines at research clusters by efficiently querying and retrieving radiology images fr… ▽ More

    Submitted 5 August, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Preprint

    Journal ref: Journal of Digital Imaging (JDI), 2021

  36. arXiv:2003.07507  [pdf

    cs.CL

    Multi-label natural language processing to identify diagnosis and procedure codes from MIMIC-III inpatient notes

    Authors: A. K. Bhavani Singh, Mounika Guntu, Ananth Reddy Bhimireddy, Judy W. Gichoya, Saptarshi Purkayastha

    Abstract: In the United States, 25% or greater than 200 billion dollars of hospital spending accounts for administrative costs that involve services for medical coding and billing. With the increasing number of patient records, manual assignment of the codes performed is overwhelming, time-consuming and error-prone, causing billing errors. Natural language processing can automate the extraction of codes/lab… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: This is a shortened version of the Capstone Project that was accepted by the Faculty of Indiana University, in partial fulfillment of the requirements for the degree of Master of Science in Health Informatics

  37. arXiv:1912.12397  [pdf

    cs.CL cs.LG

    Natural language processing of MIMIC-III clinical notes for identifying diagnosis and procedures with neural networks

    Authors: Siddhartha Nuthakki, Sunil Neela, Judy W. Gichoya, Saptarshi Purkayastha

    Abstract: Coding diagnosis and procedures in medical records is a crucial process in the healthcare industry, which includes the creation of accurate billings, receiving reimbursements from payers, and creating standardized patient care records. In the United States, Billing and Insurance related activities cost around $471 billion in 2012 which constitutes about 25% of all the U.S hospital spending. In thi… ▽ More

    Submitted 27 December, 2019; originally announced December 2019.

    Comments: This is a shortened version of the Capstone Project that was accepted by the Faculty of Indiana University, in partial fulfillment of the requirements for the degree of Master of Science in Health Informatics in Dec 2018

  38. arXiv:1803.11244  [pdf

    cs.CY

    Phronesis of AI in radiology: Superhuman meets natural stupidity

    Authors: Judy W. Gichoya, Siddhartha Nuthakki, Pallavi G. Maity, Saptarshi Purkayastha

    Abstract: Advances in AI in the last decade have clearly made economists, politicians, journalists, and citizenry in general believe that the machines are coming to take human jobs. We review 'superhuman' AI performance claims in radiology and then provide a self-reflection on our own work in the area in the form of a critical review, a tribute of sorts to McDermotts 1976 paper, asking the field for some se… ▽ More

    Submitted 27 March, 2018; originally announced March 2018.