Showing 1–12 of 12 results for author: Harmon, S

Searching in archive cs.
  1. arXiv:2505.04522  [pdf, ps, other]

    eess.IV cs.CV

    Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model

    Authors: Pengfei Guo, Can Zhao, Dong Yang, Yufan He, Vishwesh Nath, Ziyue Xu, Pedro R. A. S. Bassi, Zongwei Zhou, Benjamin D. Simon, Stephanie Anne Harmon, Baris Turkbey, Daguang Xu

    Abstract: Generating 3D CT volumes from descriptive free-text inputs presents a transformative opportunity in diagnostics and research. In this paper, we introduce Text2CT, a novel approach for synthesizing 3D CT volumes from textual descriptions using the diffusion model. Unlike previous methods that rely on fixed-format text input, Text2CT employs a novel prompt formulation that enables generation from di…

    Submitted 7 May, 2025; originally announced May 2025.

  2. arXiv:2411.12915  [pdf, other]

    cs.CV

    VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

    Authors: Vishwesh Nath, Wenqi Li, Dong Yang, Andriy Myronenko, Mingxin Zheng, Yao Lu, Zhijian Liu, Hongxu Yin, Yucheng Tang, Pengfei Guo, Can Zhao, Ziyue Xu, Yufan He, Greg Heinrich, Yee Man Law, Benjamin Simon, Stephanie Harmon, Stephen Aylward, Marc Edgar, Michael Zephyr, Song Han, Pavlo Molchanov, Baris Turkbey, Holger Roth, Daguang Xu

    Abstract: Generalist vision language models (VLMs) have made significant strides in computer vision, but they fall short in specialized fields like healthcare, where expert knowledge is essential. In traditional computer vision tasks, creative or approximate answers may be acceptable, but in healthcare, precision is paramount. Current large multimodal models like Gemini and GPT-4o are insufficient for medica…

    Submitted 4 March, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

  3. arXiv:2409.11169  [pdf, other]

    eess.IV cs.AI cs.CV

    MAISI: Medical AI for Synthetic Imaging

    Authors: Pengfei Guo, Can Zhao, Dong Yang, Ziyue Xu, Vishwesh Nath, Yucheng Tang, Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, Daguang Xu

    Abstract: Medical imaging analysis faces challenges such as data scarcity, high annotation costs, and privacy concerns. This paper introduces the Medical AI for Synthetic Imaging (MAISI), an innovative approach using the diffusion model to generate synthetic 3D computed tomography (CT) images to address those challenges. MAISI leverages the foundation volume compression network and the latent diffusion mode…

    Submitted 29 October, 2024; v1 submitted 13 September, 2024; originally announced September 2024.

    Comments: WACV25 accepted. https://monai.io/research/maisi

  4. arXiv:2406.12177  [pdf, other]

    cs.CV cs.LG

    Location-based Radiology Report-Guided Semi-supervised Learning for Prostate Cancer Detection

    Authors: Alex Chen, Nathan Lay, Stephanie Harmon, Kutsev Ozyoruk, Enis Yilmaz, Brad J. Wood, Peter A. Pinto, Peter L. Choyke, Baris Turkbey

    Abstract: Prostate cancer is one of the most prevalent malignancies in the world. While deep learning has potential to further improve computer-aided prostate cancer detection on MRI, its efficacy hinges on the exhaustive curation of manually annotated images. We propose a novel methodology of semi-supervised learning (SSL) guided by automatically extracted clinical information, specifically the lesion locat…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 4-page paper accepted to IEEE International Symposium on Biomedical Imaging (ISBI 2024)

  5. arXiv:2406.05285  [pdf, other]

    cs.CV

    VISTA3D: A Unified Segmentation Foundation Model For 3D Medical Imaging

    Authors: Yufan He, Pengfei Guo, Yucheng Tang, Andriy Myronenko, Vishwesh Nath, Ziyue Xu, Dong Yang, Can Zhao, Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, Daguang Xu, Wenqi Li

    Abstract: Foundation models for interactive segmentation in 2D natural images and videos have sparked significant interest in building 3D foundation models for medical imaging. However, the domain gaps and clinical use cases for 3D medical imaging require a dedicated model that diverges from existing 2D solutions. Specifically, such foundation models should support a full workflow that can actually reduce h…

    Submitted 21 November, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  6. arXiv:2402.05817  [pdf]

    eess.IV cs.CV cs.LG

    Using YOLO v7 to Detect Kidney in Magnetic Resonance Imaging

    Authors: Pouria Yazdian Anari, Fiona Obiezu, Nathan Lay, Fatemeh Dehghani Firouzabadi, Aditi Chaurasia, Mahshid Golagha, Shiva Singh, Fatemeh Homayounieh, Aryan Zahergivar, Stephanie Harmon, Evrim Turkbey, Rabindra Gautam, Kevin Ma, Maria Merino, Elizabeth C. Jones, Mark W. Ball, W. Marston Linehan, Baris Turkbey, Ashkan A. Malayeri

    Abstract: Introduction: This study explores the use of the latest You Only Look Once (YOLO V7) object detection method to enhance kidney detection in medical imaging by training and testing a modified YOLO V7 on medical image formats. Methods: Study includes 878 patients with various subtypes of renal cell carcinoma (RCC) and 206 patients with normal kidneys. A total of 5657 MRI scans for 1084 patients were r…

    Submitted 12 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  7. arXiv:2203.06338  [pdf, other]

    eess.IV cs.CV

    Auto-FedRL: Federated Hyperparameter Optimization for Multi-institutional Medical Image Segmentation

    Authors: Pengfei Guo, Dong Yang, Ali Hatamizadeh, An Xu, Ziyue Xu, Wenqi Li, Can Zhao, Daguang Xu, Stephanie Harmon, Evrim Turkbey, Baris Turkbey, Bradford Wood, Francesca Patella, Elvira Stellato, Gianpaolo Carrafiello, Vishal M. Patel, Holger R. Roth

    Abstract: Federated learning (FL) is a distributed machine learning technique that enables collaborative model training while avoiding explicit data sharing. The inherent privacy-preserving property of FL algorithms makes them especially attractive to the medical field. However, in case of heterogeneous client data distributions, standard FL methods are unstable and require intensive hyperparameter tuning t…

    Submitted 31 August, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

  8. arXiv:2104.10195  [pdf, other]

    eess.IV cs.CV

    Auto-FedAvg: Learnable Federated Averaging for Multi-Institutional Medical Image Segmentation

    Authors: Yingda Xia, Dong Yang, Wenqi Li, Andriy Myronenko, Daguang Xu, Hirofumi Obinata, Hitoshi Mori, Peng An, Stephanie Harmon, Evrim Turkbey, Baris Turkbey, Bradford Wood, Francesca Patella, Elvira Stellato, Gianpaolo Carrafiello, Anna Ierardi, Alan Yuille, Holger Roth

    Abstract: Federated learning (FL) enables collaborative model training while preserving each participant's privacy, which is particularly beneficial to the medical field. FedAvg is a standard algorithm that uses fixed weights, often originating from the dataset sizes at each client, to aggregate the distributed learned models on a server during the FL process. However, non-identical data distribution across…

    Submitted 20 April, 2021; originally announced April 2021.

  9. arXiv:2011.11750  [pdf, other]

    eess.IV cs.CV

    Federated Semi-Supervised Learning for COVID Region Segmentation in Chest CT using Multi-National Data from China, Italy, Japan

    Authors: Dong Yang, Ziyue Xu, Wenqi Li, Andriy Myronenko, Holger R. Roth, Stephanie Harmon, Sheng Xu, Baris Turkbey, Evrim Turkbey, Xiaosong Wang, Wentao Zhu, Gianpaolo Carrafiello, Francesca Patella, Maurizio Cariati, Hirofumi Obinata, Hitoshi Mori, Kaku Tamura, Peng An, Bradford J. Wood, Daguang Xu

    Abstract: The recent outbreak of COVID-19 has led to urgent needs for reliable diagnosis and management of SARS-CoV-2 infection. As a complementary tool, chest CT has been shown to be able to reveal visual patterns characteristic for COVID-19, which has definite value at several stages during the disease course. To facilitate CT analysis, recent efforts have focused on computer-aided characterization and di…

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: Accepted with minor revision to Medical Image Analysis

  10. arXiv:2007.05534  [pdf, other]

    cs.CV cs.LG eess.IV

    Multi-Domain Image Completion for Random Missing Input Data

    Authors: Liyue Shen, Wentao Zhu, Xiaosong Wang, Lei Xing, John M. Pauly, Baris Turkbey, Stephanie Anne Harmon, Thomas Hogue Sanford, Sherif Mehralivand, Peter Choyke, Bradford Wood, Daguang Xu

    Abstract: Multi-domain data are widely leveraged in vision applications taking advantage of complementary information from different modalities, e.g., brain tumor segmentation from multi-parametric magnetic resonance imaging (MRI). However, due to possible data corruption and different imaging protocols, the availability of images for each domain could vary amongst multiple data sources in practice, which m…

    Submitted 10 July, 2020; originally announced July 2020.

  11. arXiv:2005.05761  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Adipose Tissue Segmentation in Unlabeled Abdomen MRI using Cross Modality Domain Adaptation

    Authors: Samira Masoudi, Syed M. Anwar, Stephanie A. Harmon, Peter L. Choyke, Baris Turkbey, Ulas Bagci

    Abstract: Abdominal fat quantification is critical since multiple vital organs are located within this region. Although computed tomography (CT) is a highly sensitive modality to segment body fat, it involves ionizing radiation, which makes magnetic resonance imaging (MRI) a preferable alternative for this purpose. Additionally, the superior soft tissue contrast in MRI could lead to more accurate results. Y…

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: 5 pages, 7 figures, EMBC 2020 conference

  12. arXiv:1906.03347  [pdf, other]

    cs.CV eess.IV

    When Unseen Domain Generalization is Unnecessary? Rethinking Data Augmentation

    Authors: Ling Zhang, Xiaosong Wang, Dong Yang, Thomas Sanford, Stephanie Harmon, Baris Turkbey, Holger Roth, Andriy Myronenko, Daguang Xu, Ziyue Xu

    Abstract: Recent advances in deep learning for medical image segmentation demonstrate expert-level accuracy. However, in clinically realistic environments, such methods have marginal performance due to differences in image domains, including different imaging protocols, device vendors and patient populations. Here we consider the problem of domain generalization, when a model is trained once, and its perfor…

    Submitted 12 June, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: 9 pages, 3 figures