Skip to main content

Showing 1–17 of 17 results for author: Santamaria-Pang, A

.
  1. arXiv:2505.10823  [pdf, ps, other

    cs.CV eess.IV

    From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification

    Authors: Xue Li, Jameson Merkow, Noel C. F. Codella, Alberto Santamaria-Pang, Naiteek Sangani, Alexander Ersoy, Christopher Burt, John W. Garrett, Richard J. Bruce, Joshua D. Warner, Tyler Bradshaw, Ivan Tarapov, Matthew P. Lungren, Alan B. McMillan

    Abstract: Foundation models, pretrained on extensive datasets, have significantly advanced machine learning by providing robust and transferable embeddings applicable to various domains, including medical imaging diagnostics. This study evaluates the utility of embeddings derived from both general-purpose and medical domain-specific foundation models for training lightweight adapter models in multi-class ra… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 11 pages, 5 figures, 4 tables

  2. arXiv:2503.23764  [pdf, other

    cs.CV cs.AI

    WaveFormer: A 3D Transformer with Wavelet-Driven Feature Representation for Efficient Medical Image Segmentation

    Authors: Md Mahfuz Al Hasan, Mahdi Zaman, Abdul Jawad, Alberto Santamaria-Pang, Ho Hin Lee, Ivan Tarapov, Kyle See, Md Shah Imran, Antika Roy, Yaser Pourmohammadi Fallah, Navid Asadizanjani, Reza Forghani

    Abstract: Transformer-based architectures have advanced medical image analysis by effectively modeling long-range dependencies, yet they often struggle in 3D settings due to substantial memory overhead and insufficient capture of fine-grained local features. We address these limitations with WaveFormer, a novel 3D-transformer that: i) leverages the fundamental frequency-domain properties of features for con… ▽ More

    Submitted 31 March, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

  3. arXiv:2503.10057  [pdf, other

    cs.CV

    Multi-Modal Mamba Modeling for Survival Prediction (M4Survive): Adapting Joint Foundation Model Representations

    Authors: Ho Hin Lee, Alberto Santamaria-Pang, Jameson Merkov, Matthew Lungren, Ivan Tarapov

    Abstract: Accurate survival prediction in oncology requires integrating diverse imaging modalities to capture the complex interplay of tumor biology. Traditional single-modality approaches often fail to leverage the complementary insights provided by radiological and pathological assessments. In this work, we introduce M4Survive (Multi-Modal Mamba Modeling for Survival Prediction), a novel framework that le… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 10 pages

  4. arXiv:2503.05701  [pdf

    cs.LG cs.CL

    OPTIC: Optimizing Patient-Provider Triaging & Improving Communications in Clinical Operations using GPT-4 Data Labeling and Model Distillation

    Authors: Alberto Santamaria-Pang, Frank Tuan, Ross Campbell, Cindy Zhang, Ankush Jindal, Roopa Surapur, Brad Holloman, Deanna Hanisch, Rae Buckley, Carisa Cooney, Ivan Tarapov, Kimberly S. Peairs, Brian Hasselfeld, Peter Greene

    Abstract: The COVID-19 pandemic has accelerated the adoption of telemedicine and patient messaging through electronic medical portals (patient medical advice requests, or PMARs). While these platforms enhance patient access to healthcare, they have also increased the burden on healthcare providers due to the surge in PMARs. This study seeks to develop an efficient tool for message triaging to reduce physici… ▽ More

    Submitted 5 February, 2025; originally announced March 2025.

    Comments: 15 pages, 8 figures. submitted to Journal of the American Medical Informatics Association

  5. arXiv:2410.06542  [pdf, other

    eess.IV cs.CV

    MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging

    Authors: Noel C. F. Codella, Ying Jin, Shrey Jain, Yu Gu, Ho Hin Lee, Asma Ben Abacha, Alberto Santamaria-Pang, Will Guyman, Naiteek Sangani, Sheng Zhang, Hoifung Poon, Stephanie Hyland, Shruthi Bannur, Javier Alvarez-Valle, Xue Li, John Garrett, Alan McMillan, Gaurav Rajguru, Madhu Maddi, Nilesh Vijayrania, Rehaan Bhimai, Nick Mecklenburg, Rupal Jain, Daniel Holstein, Naveen Gaur , et al. (6 additional authors not shown)

    Abstract: In this work, we present MedImageInsight, an open-source medical imaging embedding model. MedImageInsight is trained on medical images with associated text and labels across a diverse collection of domains, including X-Ray, CT, MRI, dermoscopy, OCT, fundus photography, ultrasound, histopathology, and mammography. Rigorous evaluations demonstrate MedImageInsight's ability to achieve state-of-the-ar… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  6. arXiv:2404.10031  [pdf

    q-bio.NC cs.AI cs.LG

    Emergent Language Symbolic Autoencoder (ELSA) with Weak Supervision to Model Hierarchical Brain Networks

    Authors: Ammar Ahmed Pallikonda Latheef, Alberto Santamaria-Pang, Craig K Jones, Haris I Sair

    Abstract: Brain networks display a hierarchical organization, a complexity that poses a challenge for existing deep learning models, often structured as flat classifiers, leading to difficulties in interpretability and the 'black box' issue. To bridge this gap, we propose a novel architecture: a symbolic autoencoder informed by weak supervision and an Emergent Language (EL) framework. This model moves beyon… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures

  7. arXiv:2401.07654  [pdf, other

    cs.CV

    Foundation Models for Biomedical Image Segmentation: A Survey

    Authors: Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, Alberto Santamaria-Pang, Hoifung Poon

    Abstract: Recent advancements in biomedical image analysis have been significantly driven by the Segment Anything Model (SAM). This transformative technology, originally developed for general-purpose computer vision, has found rapid application in medical image processing. Within the last year, marked by over 100 publications, SAM has demonstrated its prowess in zero-shot learning adaptations for medical im… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 22 pages, 4 figures, 7 tables

  8. arXiv:2311.13752  [pdf, other

    cs.CV cs.AI

    3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology

    Authors: Asma Ben Abacha, Alberto Santamaria-Pang, Ho Hin Lee, Jameson Merkow, Qin Cai, Surya Teja Devarakonda, Abdullah Islam, Julia Gong, Matthew P. Lungren, Thomas Lin, Noel C Codella, Ivan Tarapov

    Abstract: The increasing use of medical imaging in healthcare settings presents a significant challenge due to the increasing workload for radiologists, yet it also offers opportunity for enhancing healthcare outcomes if effectively leveraged. 3D image retrieval holds potential to reduce radiologist workloads by enabling clinicians to efficiently search through diagnostically similar or otherwise relevant c… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  9. arXiv:2305.05598  [pdf, other

    cs.CV

    Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query

    Authors: Ho Hin Lee, Alberto Santamaria-Pang, Jameson Merkow, Ozan Oktay, Fernando Pérez-García, Javier Alvarez-Valle, Ivan Tarapov

    Abstract: We introduce a novel Region-based contrastive pretraining for Medical Image Retrieval (RegionMIR) that demonstrates the feasibility of medical image retrieval with similar anatomical regions. RegionMIR addresses two major challenges for medical image retrieval i) standardization of clinically relevant searching criteria (e.g., anatomical, pathology-based), and ii) localization of anatomical area o… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  10. arXiv:2305.03814  [pdf

    cs.LG q-bio.NC

    Deep Labeling of fMRI Brain Networks

    Authors: Ammar Ahmed Pallikonda Latheef, Sejal Ghate, Zhipeng Hui, Alberto Santamaria-Pang, Ivan Tarapov, Haris I Sair, Craig K Jones

    Abstract: Resting State Networks (RSNs) of the brain extracted from Resting State functional Magnetic Resonance Imaging (RS-fMRI) are used in the pre-surgical planning to guide the neurosurgeon. This is difficult, though, as expert knowledge is required to label each of the RSNs. There is a lack of efficient and standardized methods to be used in clinical workflows. Additionally, these methods need to be ge… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 24 pages, 10 figures, 1 table

  11. arXiv:2209.08200  [pdf

    cs.LG

    Deep Labeling of fMRI Brain Networks Using Cloud Based Processing

    Authors: Sejal Ghate, Alberto Santamaria-Pang, Ivan Tarapov, Haris I Sair, Craig K Jones

    Abstract: Resting state fMRI is an imaging modality which reveals brain activity localization through signal changes, in what is known as Resting State Networks (RSNs). This technique is gaining popularity in neurosurgical pre-planning to visualize the functional regions and assess regional activity. Labeling of rs-fMRI networks require subject-matter expertise and is time consuming, creating a need for an… ▽ More

    Submitted 20 September, 2022; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: International Symposium on Visual Computing

  12. arXiv:2107.12473  [pdf, other

    cs.CV cs.AI cs.CR

    Adversarial Attacks with Time-Scale Representations

    Authors: Alberto Santamaria-Pang, Jianwei Qiu, Aritra Chowdhury, James Kubricht, Peter Tu, Iyer Naresh, Nurali Virani

    Abstract: We propose a novel framework for real-time black-box universal attacks which disrupts activations of early convolutional layers in deep learning models. Our hypothesis is that perturbations produced in the wavelet space disrupt early convolutional layers more effectively than perturbations performed in the time domain. The main challenge in adversarial attacks is to preserve low frequency image co… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  13. arXiv:2008.09866  [pdf, other

    cs.CV eess.IV

    Symbolic Semantic Segmentation and Interpretation of COVID-19 Lung Infections in Chest CT volumes based on Emergent Languages

    Authors: Aritra Chowdhury, Alberto Santamaria-Pang, James R. Kubricht, Jianwei Qiu, Peter Tu

    Abstract: The coronavirus disease (COVID-19) has resulted in a pandemic crippling the a breadth of services critical to daily life. Segmentation of lung infections in computerized tomography (CT) slices could be be used to improve diagnosis and understanding of COVID-19 in patients. Deep learning systems lack interpretability because of their black box nature. Inspired by human communication of complex idea… ▽ More

    Submitted 22 August, 2020; originally announced August 2020.

  14. arXiv:2008.09860  [pdf

    cs.CV

    Emergent symbolic language based deep medical image classification

    Authors: Aritra Chowdhury, Alberto Santamaria-Pang, James R. Kubricht, Peter Tu

    Abstract: Modern deep learning systems for medical image classification have demonstrated exceptional capabilities for distinguishing between image based medical categories. However, they are severely hindered by their ina-bility to explain the reasoning behind their decision making. This is partly due to the uninterpretable continuous latent representations of neural net-works. Emergent languages (EL) have… ▽ More

    Submitted 22 August, 2020; originally announced August 2020.

  15. arXiv:2007.09471  [pdf

    eess.IV cs.CV q-bio.CB q-bio.QM

    Automated Phenotyping via Cell Auto Training (CAT) on the Cell DIVE Platform

    Authors: Alberto Santamaria-Pang, Anup Sood, Dan Meyer, Aritra Chowdhury, Fiona Ginty

    Abstract: We present a method for automatic cell classification in tissue samples using an automated training set from multiplexed immunofluorescence images. The method utilizes multiple markers stained in situ on a single tissue section on a robust hyperplex immunofluorescence platform (Cell DIVE, GE Healthcare) that provides multi-channel images allowing analysis at single cell/sub-cellular levels. The ce… ▽ More

    Submitted 18 July, 2020; originally announced July 2020.

    Comments: 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

    Journal ref: 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), San Diego, CA, USA, 2019, pp. 2750-2756

  16. arXiv:2007.09469  [pdf

    cs.AI cs.CV cs.LG q-bio.CB

    ESCELL: Emergent Symbolic Cellular Language

    Authors: Aritra Chowdhury, James R. Kubricht, Anup Sood, Peter Tu, Alberto Santamaria-Pang

    Abstract: We present ESCELL, a method for developing an emergent symbolic language of communication between multiple agents reasoning about cells. We show how agents are able to cooperate and communicate successfully in the form of symbols similar to human language to accomplish a task in the form of a referential game (Lewis' signaling game). In one form of the game, a sender and a receiver observe a set o… ▽ More

    Submitted 18 July, 2020; originally announced July 2020.

    Comments: IEEE International Symposium on Biomedical Imaging (IEEE ISBI 2020)

    Journal ref: 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI), Iowa City, IA, USA, 2020, pp. 1604-1607

  17. arXiv:2007.09448  [pdf

    cs.AI

    Towards Emergent Language Symbolic Semantic Segmentation and Model Interpretability

    Authors: Alberto Santamaria-Pang, James Kubricht, Aritra Chowdhury, Chitresh Bhushan, Peter Tu

    Abstract: Recent advances in methods focused on the grounding problem have resulted in techniques that can be used to construct a symbolic language associated with a specific domain. Inspired by how humans communicate complex ideas through language, we developed a generalized Symbolic Semantic ($\text{S}^2$) framework for interpretable segmentation. Unlike adversarial models (e.g., GANs), we explicitly mode… ▽ More

    Submitted 4 August, 2020; v1 submitted 18 July, 2020; originally announced July 2020.

    Comments: Accepted to Medical Image Computing and Computer Assisted Intervention (MICCAI) 2020, 9 pages, 3 figures