Skip to main content

Showing 1–50 of 74 results for author: Zhou, S K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.19742  [pdf, ps, other

    eess.IV cs.AI cs.CV

    NeRF-based CBCT Reconstruction needs Normalization and Initialization

    Authors: Zhuowei Xu, Han Li, Dai Sun, Zhicheng Li, Yujia Li, Qingpeng Kong, Zhiwei Cheng, Nassir Navab, S. Kevin Zhou

    Abstract: Cone Beam Computed Tomography (CBCT) is widely used in medical imaging. However, the limited number and intensity of X-ray projections make reconstruction an ill-posed problem with severe artifacts. NeRF-based methods have achieved great success in this task. However, they suffer from a local-global training mismatch between their two key components: the hash encoder and the neural network. Specif… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  2. arXiv:2504.02382  [pdf, other

    eess.IV cs.AI cs.CV

    Benchmark of Segmentation Techniques for Pelvic Fracture in CT and X-ray: Summary of the PENGWIN 2024 Challenge

    Authors: Yudi Sang, Yanzhen Liu, Sutuke Yibulayimu, Yunning Wang, Benjamin D. Killeen, Mingxu Liu, Ping-Cheng Ku, Ole Johannsen, Karol Gotkowski, Maximilian Zenk, Klaus Maier-Hein, Fabian Isensee, Peiyan Yue, Yi Wang, Haidong Yu, Zhaohong Pan, Yutong He, Xiaokun Liang, Daiqi Liu, Fuxin Fan, Artur Jurgas, Andrzej Skalski, Yuxi Ma, Jing Yang, Szymon Płotka , et al. (11 additional authors not shown)

    Abstract: The segmentation of pelvic fracture fragments in CT and X-ray images is crucial for trauma diagnosis, surgical planning, and intraoperative guidance. However, accurately and efficiently delineating the bone fragments remains a significant challenge due to complex anatomy and imaging limitations. The PENGWIN challenge, organized as a MICCAI 2024 satellite event, aimed to advance automated fracture… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: PENGWIN 2024 Challenge Report

  3. arXiv:2501.13514  [pdf, other

    eess.IV cs.CV

    Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement

    Authors: Chenxu Wu, Qingpeng Kong, Zihang Jiang, S. Kevin Zhou

    Abstract: Magnetic Resonance Imaging (MRI), including diffusion MRI (dMRI), serves as a ``microscope'' for anatomical structures and routinely mitigates the influence of low signal-to-noise ratio scans by compromising temporal or spatial resolution. However, these compromises fail to meet clinical demands for both efficiency and precision. Consequently, denoising is a vital preprocessing step, particularly… ▽ More

    Submitted 9 March, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Comments: 40pages, 34figures

    Journal ref: ICLR 2025

  4. arXiv:2410.23577  [pdf, other

    eess.IV cs.AI cs.CV

    MS-Glance: Bio-Insipred Non-semantic Context Vectors and their Applications in Supervising Image Reconstruction

    Authors: Ziqi Gao, Wendi Yang, Yujia Li, Lei Xing, S. Kevin Zhou

    Abstract: Non-semantic context information is crucial for visual recognition, as the human visual perception system first uses global statistics to process scenes rapidly before identifying specific objects. However, while semantic information is increasingly incorporated into computer vision tasks such as image reconstruction, non-semantic information, such as global spatial structures, is often overlooked… ▽ More

    Submitted 23 November, 2024; v1 submitted 30 October, 2024; originally announced October 2024.

    Comments: Accepted by WACV 2025

  5. arXiv:2410.17691  [pdf, other

    eess.IV cs.CV q-bio.NC

    Longitudinal Causal Image Synthesis

    Authors: Yujia Li, Han Li, ans S. Kevin Zhou

    Abstract: Clinical decision-making relies heavily on causal reasoning and longitudinal analysis. For example, for a patient with Alzheimer's disease (AD), how will the brain grey matter atrophy in a year if intervened on the A-beta level in cerebrospinal fluid? The answer is fundamental to diagnosis and follow-up treatment. However, this kind of inquiry involves counterfactual medical images which can not b… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

  6. arXiv:2410.14200  [pdf, other

    eess.IV cs.CL cs.CV

    E3D-GPT: Enhanced 3D Visual Foundation for Medical Vision-Language Model

    Authors: Haoran Lai, Zihang Jiang, Qingsong Yao, Rongsheng Wang, Zhiyang He, Xiaodong Tao, Wei Wei, Weifu Lv, S. Kevin Zhou

    Abstract: The development of 3D medical vision-language models holds significant potential for disease diagnosis and patient treatment. However, compared to 2D medical images, 3D medical images, such as CT scans, face challenges related to limited training data and high dimension, which severely restrict the progress of 3D medical vision-language models. To address these issues, we collect a large amount of… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  7. arXiv:2410.00404  [pdf, other

    eess.IV cs.CV

    3DGR-CAR: Coronary artery reconstruction from ultra-sparse 2D X-ray views with a 3D Gaussians representation

    Authors: Xueming Fu, Yingtai Li, Fenghe Tang, Jun Li, Mingyue Zhao, Gao-Jun Teng, S. Kevin Zhou

    Abstract: Reconstructing 3D coronary arteries is important for coronary artery disease diagnosis, treatment planning and operation navigation. Traditional reconstruction techniques often require many projections, while reconstruction from sparse-view X-ray projections is a potential way of reducing radiation dose. However, the extreme sparsity of coronary arteries in a 3D volume and ultra-limited number of… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: 10 pages, 5 figures, Accepted at MICCAI 2024

  8. arXiv:2405.12223  [pdf, other

    eess.IV cs.CV

    Cascaded Multi-path Shortcut Diffusion Model for Medical Image Translation

    Authors: Yinchi Zhou, Tianqi Chen, Jun Hou, Huidong Xie, Nicha C. Dvornek, S. Kevin Zhou, David L. Wilson, James S. Duncan, Chi Liu, Bo Zhou

    Abstract: Image-to-image translation is a vital component in medical imaging processing, with many uses in a wide range of imaging modalities and clinical scenarios. Previous methods include Generative Adversarial Networks (GANs) and Diffusion Models (DMs), which offer realism but suffer from instability and lack uncertainty estimation. Even though both GAN and DM methods have individually exhibited their c… ▽ More

    Submitted 14 August, 2024; v1 submitted 5 April, 2024; originally announced May 2024.

    Comments: Accepted at Medical Image Analysis Journal

  9. arXiv:2404.17890  [pdf, other

    eess.IV cs.AI cs.CV

    DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction

    Authors: Chenhe Du, Xiyue Lin, Qing Wu, Xuanyu Tian, Ying Su, Zhe Luo, Rui Zheng, Yang Chen, Hongjiang Wei, S. Kevin Zhou, Jingyi Yu, Yuyao Zhang

    Abstract: Limited-angle and sparse-view computed tomography (LACT and SVCT) are crucial for expanding the scope of X-ray CT applications. However, they face challenges due to incomplete data acquisition, resulting in diverse artifacts in the reconstructed CT images. Emerging implicit neural representation (INR) techniques, such as NeRF, NeAT, and NeRP, have shown promise in under-determined CT imaging recon… ▽ More

    Submitted 19 July, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

    Comments: 16 pages, 11 figures

    ACM Class: I.2.10; I.4.5

  10. arXiv:2403.05256  [pdf, other

    eess.IV cs.CV cs.LG

    DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction

    Authors: Ziqi Gao, Yue Zhang, Xinwen Liu, Kaiyan Li, S. Kevin Zhou

    Abstract: Multi-contrast (MC) Magnetic Resonance Imaging (MRI) reconstruction aims to incorporate a reference image of auxiliary modality to guide the reconstruction process of the target modality. Known MC reconstruction methods perform well with a fully sampled reference image, but usually exhibit inferior performance, compared to single-contrast (SC) methods, when the reference image is missing or of low… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 11 pages, 4 figures, 2 tables

  11. arXiv:2402.10609  [pdf, other

    eess.IV cs.CV cs.LG

    MRPD: Undersampled MRI reconstruction by prompting a large latent diffusion model

    Authors: Ziqi Gao, S. Kevin Zhou

    Abstract: Implicit visual knowledge in a large latent diffusion model (LLDM) pre-trained on natural images is rich and hypothetically universal to natural and medical images. To test this hypothesis from a practical perspective, we propose a novel framework for undersampled MRI Reconstruction by Prompting a large latent Diffusion model (MRPD). While the existing methods trained on MRI datasets are typically… ▽ More

    Submitted 5 July, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures, 7 tables, 1 pseudocode

  12. arXiv:2401.03150  [pdf, other

    eess.IV

    O-PRESS: Boosting OCT axial resolution with Prior guidance, Recurrence, and Equivariant Self-Supervision

    Authors: Kaiyan Li, Jingyuan Yang, Wenxuan Liang, Xingde Li, Chenxi Zhang, Lulu Chen, Chan Wu, Xiao Zhang, Zhiyan Xu, Yuelin Wang, Lihui Meng, Yue Zhang, Youxin Chen, S. Kevin Zhou

    Abstract: Optical coherence tomography (OCT) is a noninvasive technology that enables real-time imaging of tissue microanatomies. The axial resolution of OCT is intrinsically constrained by the spectral bandwidth of the employed light source while maintaining a fixed center wavelength for a specific application. Physically extending this bandwidth faces strong limitations and requires a substantial cost. We… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  13. arXiv:2312.15676  [pdf, other

    eess.IV cs.CV

    3DGR-CT: Sparse-View CT Reconstruction with a 3D Gaussian Representation

    Authors: Yingtai Li, Xueming Fu, Han Li, Shang Zhao, Ruiyang Jin, S. Kevin Zhou

    Abstract: Sparse-view computed tomography (CT) reduces radiation exposure by acquiring fewer projections, making it a valuable tool in clinical scenarios where low-dose radiation is essential. However, this often results in increased noise and artifacts due to limited data. In this paper we propose a novel 3D Gaussian representation (3DGR) based method for sparse-view CT reconstruction. Inspired by recent s… ▽ More

    Submitted 22 April, 2025; v1 submitted 25 December, 2023; originally announced December 2023.

  14. arXiv:2312.01740  [pdf, other

    eess.IV cs.CV

    MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation

    Authors: Fenghe Tang, Bingkun Nian, Jianrui Ding, Quan Quan, Jie Yang, Wei Liu, S. Kevin Zhou

    Abstract: Due to the scarcity and specific imaging characteristics in medical images, light-weighting Vision Transformers (ViTs) for efficient medical image segmentation is a significant challenge, and current studies have not yet paid attention to this issue. This work revisits the relationship between CNNs and Transformers in lightweight universal networks for medical image segmentation, aiming to integra… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 13 pages

    ACM Class: I.4.6

  15. arXiv:2312.01679  [pdf, other

    eess.IV cs.CV cs.LG

    Adversarial Medical Image with Hierarchical Feature Hiding

    Authors: Qingsong Yao, Zecheng He, Yuexiang Li, Yi Lin, Kai Ma, Yefeng Zheng, S. Kevin Zhou

    Abstract: Deep learning based methods for medical images can be easily compromised by adversarial examples (AEs), posing a great security flaw in clinical decision-making. It has been discovered that conventional adversarial attacks like PGD which optimize the classification logits, are easy to distinguish in the feature space, resulting in accurate reactive defenses. To better understand this phenomenon an… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Our code is available at \url{https://github.com/qsyao/Hierarchical_Feature_Constraint}. arXiv admin note: text overlap with arXiv:2012.09501

  16. arXiv:2308.01239  [pdf, other

    eess.IV cs.CV

    CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion

    Authors: Fenghe Tang, Jianrui Ding, Lingtao Wang, Chunping Ning, S. Kevin Zhou

    Abstract: The U-shaped architecture has emerged as a crucial paradigm in the design of medical image segmentation networks. However, due to the inherent local limitations of convolution, a fully convolutional segmentation network with U-shaped architecture struggles to effectively extract global context information, which is vital for the precise localization of lesions. While hybrid architectures combining… ▽ More

    Submitted 2 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 8 pages, 3 figures

    ACM Class: I.4.6

  17. arXiv:2306.15203  [pdf, other

    eess.IV cs.AI cs.CV

    Unsupervised Polychromatic Neural Representation for CT Metal Artifact Reduction

    Authors: Qing Wu, Lixuan Chen, Ce Wang, Hongjiang Wei, S. Kevin Zhou, Jingyi Yu, Yuyao Zhang

    Abstract: Emerging neural reconstruction techniques based on tomography (e.g., NeRF, NeAT, and NeRP) have started showing unique capabilities in medical imaging. In this work, we present a novel Polychromatic neural representation (Polyner) to tackle the challenging problem of CT imaging when metallic implants exist within the human body. CT metal artifacts arise from the drastic variation of metal's attenu… ▽ More

    Submitted 1 October, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted by NeurIPS 2023

  18. Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Image Segmentation

    Authors: Ziyuan Zhao, Fangcheng Zhou, Zeng Zeng, Cuntai Guan, S. Kevin Zhou

    Abstract: Domain shift and label scarcity heavily limit deep learning applications to various medical image analysis tasks. Unsupervised domain adaptation (UDA) techniques have recently achieved promising cross-modality medical image segmentation by transferring knowledge from a label-rich source domain to an unlabeled target domain. However, it is also difficult to collect annotations from the source domai… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted by MICCAI 2022 (top 13% paper; early accept)

    Journal ref: Medical Image Computing and Computer Assisted Intervention, MICCAI 2022. Lecture Notes in Computer Science, vol 13435. Springer, Cham

  19. arXiv:2305.04294   

    eess.IV cs.CV

    PELE scores: Pelvic X-ray Landmark Detection by Pelvis Extraction and Enhancement

    Authors: Zhen Huang, Han Li, Shitong Shao, Heqin Zhu, Huijie Hu, Zhiwei Cheng, Jianji Wang, S. Kevin Zhou

    Abstract: The pelvis, the lower part of the trunk, supports and balances the trunk. Landmark detection from a pelvic X-ray (PXR) facilitates downstream analysis and computer-assisted diagnosis and treatment of pelvic diseases. Although PXRs have the advantages of low radiation and reduced cost compared to computed tomography (CT) images, their 2D pelvis-tissue superposition of 3D structures confuses clinica… ▽ More

    Submitted 7 June, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: will revise it and resubmit it again later

  20. Unified Multi-Modal Image Synthesis for Missing Modality Imputation

    Authors: Yue Zhang, Chengtao Peng, Qiuli Wang, Dan Song, Kaiyan Li, S. Kevin Zhou

    Abstract: Multi-modal medical images provide complementary soft-tissue characteristics that aid in the screening and diagnosis of diseases. However, limited scanning time, image corruption and various imaging protocols often result in incomplete multi-modal images, thus limiting the usage of multi-modal data for clinical purposes. To address this issue, in this paper, we propose a novel unified multi-modal… ▽ More

    Submitted 9 July, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: IEEE TMI accepted final version

  21. arXiv:2304.00570  [pdf, other

    eess.IV cs.CV cs.LG

    FedFTN: Personalized Federated Learning with Deep Feature Transformation Network for Multi-institutional Low-count PET Denoising

    Authors: Bo Zhou, Huidong Xie, Qiong Liu, Xiongchao Chen, Xueqi Guo, Zhicheng Feng, Jun Hou, S. Kevin Zhou, Biao Li, Axel Rominger, Kuangyu Shi, James S. Duncan, Chi Liu

    Abstract: Low-count PET is an efficient way to reduce radiation exposure and acquisition time, but the reconstructed images often suffer from low signal-to-noise ratio (SNR), thus affecting diagnosis and other downstream tasks. Recent advances in deep learning have shown great potential in improving low-count PET image quality, but acquiring a large, centralized, and diverse dataset from multiple institutio… ▽ More

    Submitted 6 October, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: 13 pages, 6 figures, Accepted at Medical Image Analysis Journal (MedIA)

  22. arXiv:2303.15728  [pdf, other

    eess.IV

    DiffULD: Diffusive Universal Lesion Detection

    Authors: Peiang Zhao, Han Li, Ruiyang Jin, S. Kevin Zhou

    Abstract: Universal Lesion Detection (ULD) in computed tomography (CT) plays an essential role in computer-aided diagnosis. Promising ULD results have been reported by anchor-based detection designs, but they have inherent drawbacks due to the use of anchors: i) Insufficient training targets and ii) Difficulties in anchor design. Diffusion probability models (DPM) have demonstrated outstanding capabilities… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  23. arXiv:2303.14349  [pdf, other

    eess.IV cs.LG

    Causal Image Synthesis of Brain MR in 3D

    Authors: Yujia Li, Jiong Shi, S. Kevin Zhou

    Abstract: Clinical decision making requires counterfactual reasoning based on a factual medical image and thus necessitates causal image synthesis. To this end, we present a novel method for modeling the causality between demographic variables, clinical indices and brain MR images for Alzheimer's Diseases. Specifically, we leverage a structural causal model to depict the causality and a styled generator to… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: 11 pages

  24. arXiv:2303.13810  [pdf, other

    cs.CV eess.IV

    Evidence-aware multi-modal data fusion and its application to total knee replacement prediction

    Authors: Xinwen Liu, Jing Wang, S. Kevin Zhou, Craig Engstrom, Shekhar S. Chandra

    Abstract: Deep neural networks have been widely studied for predicting a medical condition, such as total knee replacement (TKR). It has shown that data of different modalities, such as imaging data, clinical variables and demographic information, provide complementary information and thus can improve the prediction accuracy together. However, the data sources of various modalities may not always be of high… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  25. arXiv:2303.10611  [pdf, other

    eess.IV cs.CV

    Rethinking Dual-Domain Undersampled MRI reconstruction: domain-specific design from the perspective of the receptive field

    Authors: Ziqi Gao, S. Kevin Zhou

    Abstract: Undersampled MRI reconstruction is crucial for accelerating clinical scanning. Dual-domain reconstruction network is performant among SoTA deep learning methods. In this paper, we rethink dual-domain model design from the perspective of the receptive field, which is needed for image recovery and K-space interpolation problems. Further, we introduce domain-specific modules for dual-domain reconstru… ▽ More

    Submitted 15 February, 2024; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: 2024 IEEE International Symposium on Biomedical Imaging (ISBI)

  26. arXiv:2303.08416  [pdf, other

    eess.IV cs.CV

    Lung Nodule Segmentation and Uncertain Region Prediction with an Uncertainty-Aware Attention Mechanism

    Authors: Han Yang, Qiuli Wang, Yue Zhang, Zhulin An, Chen Liu, Xiaohong Zhang, S. Kevin Zhou

    Abstract: Radiologists possess diverse training and clinical experiences, leading to variations in the segmentation annotations of lung nodules and resulting in segmentation uncertainty.Conventional methods typically select a single annotation as the learning target or attempt to learn a latent space comprising multiple annotations. However, these approaches fail to leverage the valuable information inheren… ▽ More

    Submitted 11 September, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 10 pages, 10 figures. We have reported a preliminary version of this work in MICCAI 2022

  27. arXiv:2303.06358  [pdf, other

    eess.IV cs.CV

    O2CTA: Introducing Annotations from OCT to CCTA in Coronary Plaque Analysis

    Authors: Jun Li, Kexin Li, Yafeng Zhou, S. Kevin Zhou

    Abstract: Targeted diagnosis and treatment plans for patients with coronary artery disease vary according to atherosclerotic plaque component. Coronary CT angiography (CCTA) is widely used for artery imaging and determining the stenosis degree. However, the limited spatial resolution and susceptibility to artifacts fail CCTA in obtaining lumen morphological characteristics and plaque composition. It can be… ▽ More

    Submitted 11 August, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: Accepted for oral presentation in MICCAI-BTSD 2023 workshop

  28. arXiv:2302.01735  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective

    Authors: Chenyu You, Weicheng Dai, Yifei Min, Fenglin Liu, David A. Clifton, S Kevin Zhou, Lawrence Hamilton Staib, James S Duncan

    Abstract: For medical image segmentation, contrastive learning is the dominant practice to improve the quality of visual representations by contrasting semantically similar and dissimilar pairs of samples. This is enabled by the observation that without accessing ground truth labels, negative examples with truly dissimilar anatomical features, if sampled, can significantly improve the performance. In realit… ▽ More

    Submitted 23 October, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted by Advances in Neural Information Processing Systems (NeurIPS 2023)

  29. arXiv:2212.02078  [pdf, other

    eess.IV cs.AI cs.CV

    LE-UDA: Label-efficient unsupervised domain adaptation for medical image segmentation

    Authors: Ziyuan Zhao, Fangcheng Zhou, Kaixin Xu, Zeng Zeng, Cuntai Guan, S. Kevin Zhou

    Abstract: While deep learning methods hitherto have achieved considerable success in medical image segmentation, they are still hampered by two limitations: (i) reliance on large-scale well-labeled datasets, which are difficult to curate due to the expert-driven and time-consuming nature of pixel-level annotations in clinical practices, and (ii) failure to generalize from one domain to another, especially w… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted by IEEE Transactions on Medical Imaging, 2022

  30. arXiv:2211.13229  [pdf, other

    eess.IV cs.CL cs.CV cs.LG

    DeltaNet:Conditional Medical Report Generation for COVID-19 Diagnosis

    Authors: Xian Wu, Shuxin Yang, Zhaopeng Qiu, Shen Ge, Yangtian Yan, Xingwang Wu, Yefeng Zheng, S. Kevin Zhou, Li Xiao

    Abstract: Fast screening and diagnosis are critical in COVID-19 patient treatment. In addition to the gold standard RT-PCR, radiological imaging like X-ray and CT also works as an important means in patient screening and follow-up. However, due to the excessive number of patients, writing reports becomes a heavy burden for radiologists. To reduce the workload of radiologists, we propose DeltaNet to generate… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

  31. arXiv:2211.01670  [pdf, other

    eess.IV cs.CV

    Active CT Reconstruction with a Learned Sampling Policy

    Authors: Ce Wang, Kun Shang, Haimiao Zhang, Shang Zhao, Dong Liang, S. Kevin Zhou

    Abstract: Computed tomography (CT) is a widely-used imaging technology that assists clinical decision-making with high-quality human body representations. To reduce the radiation dose posed by CT, sparse-view and limited-angle CT are developed with preserved image quality. However, these methods are still stuck with a fixed or uniform sampling strategy, which inhibits the possibility of acquiring a better i… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  32. arXiv:2210.05117  [pdf, other

    eess.IV cs.CV

    DA-VSR: Domain Adaptable Volumetric Super-Resolution For Medical Images

    Authors: Cheng Peng, S. Kevin Zhou, Rama Chellappa

    Abstract: Medical image super-resolution (SR) is an active research area that has many potential applications, including reducing scan time, bettering visual understanding, increasing robustness in downstream tasks, etc. However, applying deep-learning-based SR approaches for clinical applications often encounters issues of domain inconsistency, as the test data may be acquired by different machines or on d… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: MICCAI2021

  33. arXiv:2208.14022  [pdf, other

    eess.IV cs.CV

    Stabilize, Decompose, and Denoise: Self-Supervised Fluoroscopy Denoising

    Authors: Ruizhou Liu, Qiang Ma, Zhiwei Cheng, Yuanyuan Lyu, Jianji Wang, S. Kevin Zhou

    Abstract: Fluoroscopy is an imaging technique that uses X-ray to obtain a real-time 2D video of the interior of a 3D object, helping surgeons to observe pathological structures and tissue functions especially during intervention. However, it suffers from heavy noise that mainly arises from the clinical use of a low dose X-ray, thereby necessitating the technology of fluoroscopy denoising. Such denoising is… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 11 pages, 18 figures

  34. arXiv:2208.08048  [pdf, other

    eess.IV cs.CV

    REGAS: REspiratory-GAted Synthesis of Views for Multi-Phase CBCT Reconstruction from a single 3D CBCT Acquisition

    Authors: Cheng Peng, Haofu Liao, S. Kevin Zhou, Rama Chellappa

    Abstract: It is a long-standing challenge to reconstruct Cone Beam Computed Tomography (CBCT) of the lung under respiratory motion. This work takes a step further to address a challenging setting in reconstructing a multi-phase}4D lung image from just a single}3D CBCT acquisition. To this end, we introduce REpiratory-GAted Synthesis of views, or REGAS. REGAS proposes a self-supervised method to synthesize t… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  35. arXiv:2203.07373  [pdf, other

    eess.IV cs.AI cs.CV

    SATr: Slice Attention with Transformer for Universal Lesion Detection

    Authors: Han Li, Long Chen, Hu Han, S. Kevin Zhou

    Abstract: Universal Lesion Detection (ULD) in computed tomography plays an essential role in computer-aided diagnosis. Promising ULD results have been reported by multi-slice-input detection approaches which model 3D context from multiple adjacent CT slices, but such methods still experience difficulty in obtaining a global representation among different slices and within each individual slice since they on… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

    Comments: 11 pages, 3 figures

  36. arXiv:2203.05567  [pdf, other

    eess.IV cs.CV

    Recovering medical images from CT film photos

    Authors: Quan Quan, Qiyuan Wang, Yuanqi Du, Liu Li, S. Kevin Zhou

    Abstract: While medical images such as computed tomography (CT) are stored in DICOM format in hospital PACS, it is still quite routine in many countries to print a film as a transferable medium for the purposes of self-storage and secondary consultation. Also, with the ubiquitousness of mobile phone cameras, it is quite common to take pictures of CT films, which unfortunately suffer from geometric deformati… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

  37. arXiv:2203.04292  [pdf, other

    eess.IV cs.CV cs.LG

    Towards performant and reliable undersampled MR reconstruction via diffusion model sampling

    Authors: Cheng Peng, Pengfei Guo, S. Kevin Zhou, Vishal Patel, Rama Chellappa

    Abstract: Magnetic Resonance (MR) image reconstruction from under-sampled acquisition promises faster scanning time. To this end, current State-of-The-Art (SoTA) approaches leverage deep neural networks and supervised training to learn a recovery model. While these approaches achieve impressive performances, the learned model can be fragile on unseen degradation, e.g. when given a different acceleration fac… ▽ More

    Submitted 10 March, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

  38. arXiv:2203.03196  [pdf, other

    eess.IV cs.CV

    Undersampled MRI Reconstruction with Side Information-Guided Normalisation

    Authors: Xinwen Liu, Jing Wang, Cheng Peng, Shekhar S. Chandra, Feng Liu, S. Kevin Zhou

    Abstract: Magnetic resonance (MR) images exhibit various contrasts and appearances based on factors such as different acquisition protocols, views, manufacturers, scanning parameters, etc. This generally accessible appearance-related side information affects deep learning-based undersampled magnetic resonance imaging (MRI) reconstruction frameworks, but has been overlooked in the majority of current works.… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  39. arXiv:2203.02772  [pdf, other

    eess.IV cs.CV

    Rib Suppression in Digital Chest Tomosynthesis

    Authors: Yihua Sun, Qingsong Yao, Yuanyuan Lyu, Jianji Wang, Yi Xiao, Hongen Liao, S. Kevin Zhou

    Abstract: Digital chest tomosynthesis (DCT) is a technique to produce sectional 3D images of a human chest for pulmonary disease screening, with 2D X-ray projections taken within an extremely limited range of angles. However, under the limited angle scenario, DCT contains strong artifacts caused by the presence of ribs, jamming the imaging quality of the lung area. Recently, great progress has been achieved… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

  40. arXiv:2203.02131  [pdf, other

    eess.IV cs.AI cs.CV cs.RO

    3D endoscopic depth estimation using 3D surface-aware constraints

    Authors: Shang Zhao, Ce Wang, Qiyuan Wang, Yanzhe Liu, S Kevin Zhou

    Abstract: Robotic-assisted surgery allows surgeons to conduct precise surgical operations with stereo vision and flexible motor control. However, the lack of 3D spatial perception limits situational awareness during procedures and hinders mastering surgical skills in the narrow abdominal space. Depth estimation, as a representative perception task, is typically defined as an image reconstruction problem. In… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

  41. arXiv:2203.02114  [pdf, other

    eess.IV cs.CV

    MixCL: Pixel label matters to contrastive learning

    Authors: Jun Li, Quan Quan, S. Kevin Zhou

    Abstract: Contrastive learning and self-supervised techniques have gained prevalence in computer vision for the past few years. It is essential for medical image analysis, which is often notorious for its lack of annotations. Most existing self-supervised methods applied in natural imaging tasks focus on designing proxy tasks for unlabeled data. For example, contrastive learning is often based on the fact t… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

  42. arXiv:2203.02100  [pdf, other

    eess.IV cs.CV cs.LG

    Learning Incrementally to Segment Multiple Organs in a CT Image

    Authors: Pengbo Liu, Xia Wang, Mengsi Fan, Hongli Pan, Minmin Yin, Xiaohong Zhu, Dandan Du, Xiaoying Zhao, Li Xiao, Lian Ding, Xingwang Wu, S. Kevin Zhou

    Abstract: There exists a large number of datasets for organ segmentation, which are partially annotated and sequentially constructed. A typical dataset is constructed at a certain time by curating medical images and annotating the organs of interest. In other words, new datasets with annotations of new organ categories are built over time. To unleash the potential behind these partially labeled, sequentiall… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2103.04526

  43. arXiv:2203.02098  [pdf, other

    eess.IV cs.CV

    Universal Segmentation of 33 Anatomies

    Authors: Pengbo Liu, Yang Deng, Ce Wang, Yuan Hui, Qian Li, Jun Li, Shiwei Luo, Mengke Sun, Quan Quan, Shuxin Yang, You Hao, Honghu Xiao, Chunpeng Zhao, Xinbao Wu, S. Kevin Zhou

    Abstract: In the paper, we present an approach for learning a single model that universally segments 33 anatomical structures, including vertebrae, pelvic bones, and abdominal organs. Our model building has to address the following challenges. Firstly, while it is ideal to learn such a model from a large-scale, fully-annotated dataset, it is practically hard to curate such a dataset. Thus, we resort to lear… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

  44. arXiv:2112.15011  [pdf, other

    eess.IV cs.CL cs.CV

    Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment

    Authors: Shuxin Yang, Xian Wu, Shen Ge, S. Kevin Zhou, Li Xiao

    Abstract: In clinics, a radiology report is crucial for guiding a patient's treatment. However, writing radiology reports is a heavy burden for radiologists. To this end, we present an automatic, multi-modal approach for report generation from a chest x-ray. Our approach, motivated by the observation that the descriptions in radiology reports are highly correlated with specific information of the x-ray imag… ▽ More

    Submitted 1 June, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

  45. arXiv:2112.15009  [pdf, ps, other

    eess.IV cs.CL cs.CV

    Knowledge Matters: Radiology Report Generation with General and Specific Knowledge

    Authors: Shuxin Yang, Xian Wu, Shen Ge, Shaohua Kevin Zhou, Li Xiao

    Abstract: Automatic radiology report generation is critical in clinics which can relieve experienced radiologists from the heavy workload and remind inexperienced radiologists of misdiagnosis or missed diagnose. Existing approaches mainly formulate radiology report generation as an image captioning task and adopt the encoder-decoder framework. However, in the medical domain, such pure data-driven approaches… ▽ More

    Submitted 6 November, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

    Comments: Medical Image Analysis

  46. arXiv:2112.04386  [pdf, other

    eess.IV cs.CV cs.LG

    Which images to label for few-shot medical landmark detection?

    Authors: Quan Quan, Qingsong Yao, Jun Li, S. Kevin Zhou

    Abstract: The success of deep learning methods relies on the availability of well-labeled large-scale datasets. However, for medical images, annotating such abundant training data often requires experienced radiologists and consumes their limited time. Few-shot learning is developed to alleviate this burden, which achieves competitive performances with only several labeled data. However, a crucial yet previ… ▽ More

    Submitted 28 April, 2024; v1 submitted 7 December, 2021; originally announced December 2021.

    Journal ref: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2022

  47. arXiv:2111.10790  [pdf, other

    eess.IV cs.CV

    DuDoTrans: Dual-Domain Transformer Provides More Attention for Sinogram Restoration in Sparse-View CT Reconstruction

    Authors: Ce Wang, Kun Shang, Haimiao Zhang, Qian Li, Yuan Hui, S. Kevin Zhou

    Abstract: While Computed Tomography (CT) reconstruction from X-ray sinograms is necessary for clinical diagnosis, iodine radiation in the imaging process induces irreversible injury, thereby driving researchers to study sparse-view CT reconstruction, that is, recovering a high-quality CT image from a sparse set of sinogram views. Iterative models are proposed to alleviate the appeared artifacts in sparse-vi… ▽ More

    Submitted 25 November, 2021; v1 submitted 21 November, 2021; originally announced November 2021.

  48. arXiv:2110.09134  [pdf, other

    eess.IV cs.CV

    GAN-based disentanglement learning for chest X-ray rib suppression

    Authors: Luyi Han, Yuanyuan Lyu, Cheng Peng, S. Kevin Zhou

    Abstract: Clinical evidence has shown that rib-suppressed chest X-rays (CXRs) can improve the reliability of pulmonary disease diagnosis. However, previous approaches on generating rib-suppressed CXR face challenges in preserving details and eliminating rib residues. We hereby propose a GAN-based disentanglement learning framework called Rib Suppression GAN, or RSGAN, to perform rib suppression by utilizing… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  49. arXiv:2106.15345  [pdf, other

    cs.CV cs.LG eess.IV

    Where is the disease? Semi-supervised pseudo-normality synthesis from an abnormal image

    Authors: Yuanqi Du, Quan Quan, Hu Han, S. Kevin Zhou

    Abstract: Pseudo-normality synthesis, which computationally generates a pseudo-normal image from an abnormal one (e.g., with lesions), is critical in many perspectives, from lesion detection, data augmentation to clinical surgery suggestion. However, it is challenging to generate high-quality pseudo-normal images in the absence of the lesion information. Thus, expensive lesion segmentation data have been in… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  50. arXiv:2105.14711  [pdf, other

    eess.IV cs.CV

    CTSpine1K: A Large-Scale Dataset for Spinal Vertebrae Segmentation in Computed Tomography

    Authors: Yang Deng, Ce Wang, Yuan Hui, Qian Li, Jun Li, Shiwei Luo, Mengke Sun, Quan Quan, Shuxin Yang, You Hao, Pengbo Liu, Honghu Xiao, Chunpeng Zhao, Xinbao Wu, S. Kevin Zhou

    Abstract: Spine-related diseases have high morbidity and cause a huge burden of social cost. Spine imaging is an essential tool for noninvasively visualizing and assessing spinal pathology. Segmenting vertebrae in computed tomography (CT) images is the basis of quantitative medical image analysis for clinical diagnosis and surgery planning of spine diseases. Current publicly available annotated datasets on… ▽ More

    Submitted 3 October, 2024; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Accepted by MICCAI2024 Open Data for oral presentation and will be published as a part of the journal MELBA special issue