Skip to main content

Showing 1–45 of 45 results for author: Wei, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.02418  [pdf, ps, other

    eess.SP

    Passive Multi-Target Visible Light Positioning Based on Multi-Camera Joint Optimization

    Authors: Wenxuan Pan, Yang Yang, Dong Wei, Meng Zhang, Zhiyu Zhu

    Abstract: Camera-based visible light positioning (VLP) has emerged as a promising indoor positioning technique. However, the need for dedicated LED infrastructure and on-target cameras in existing algorithms limits their scalability and increases deployment costs. To address these limitations, this letter proposes a passive VLP algorithm based on Multi-Camera Joint Optimization (MCJO). In the considered sys… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2504.19173  [pdf, other

    eess.SP

    Meta-learning based Selective Fixed-filter Active Noise Control System with ResNet Classifier

    Authors: Y. Xiao, M. Liu, D. Wei, L. Jian

    Abstract: The selective fixed-filter strategy is popular in industrial applications involving active noise control (ANC) technology, which circumvents the time-consuming online learning process by selecting the best-matched pre-trained control filter. However, the existing selective fixed-filter ANC (SFANC) based algorithms classify noises in frequency band, which is not a reasonable approach. Moreover, the… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

  3. arXiv:2504.09905  [pdf, other

    eess.SP

    Fusing Bluetooth with Pedestrian Dead Reckoning: A Floor Plan-Assisted Positioning Approach

    Authors: Wenxuan Pan, Yang Yang, Mingzhe Chen, Dong Wei, Caili Guo, Shiwen Mao

    Abstract: Floor plans can provide valuable prior information that helps enhance the accuracy of indoor positioning systems. However, existing research typically faces challenges in efficiently leveraging floor plan information and applying it to complex indoor layouts. To fully exploit information from floor plans for positioning, we propose a floor plan-assisted fusion positioning algorithm (FP-BP) using B… ▽ More

    Submitted 19 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

  4. arXiv:2502.05228  [pdf

    quant-ph cs.AI eess.SY

    Multi-Objective Mobile Damped Wave Algorithm (MOMDWA): A Novel Approach For Quantum System Control

    Authors: Juntao Yu, Jiaquan Yu, Dedai Wei, Xinye Sha, Shengwei Fu, Miuyu Qiu, Yurun Jin, Kaichen Ouyang

    Abstract: In this paper, we introduce a novel multi-objective optimization algorithm, the Multi-Objective Mobile Damped Wave Algorithm (MOMDWA), specifically designed to address complex quantum control problems. Our approach extends the capabilities of the original Mobile Damped Wave Algorithm (MDWA) by incorporating multiple objectives, enabling a more comprehensive optimization process. We applied MOMDWA… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  5. arXiv:2411.19845  [pdf, other

    cs.CV cs.LG eess.SP

    A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications

    Authors: Liqiang Zhang, Ye Tian, Dongyan Wei

    Abstract: With the development of smart cities, the demand for continuous pedestrian navigation in large-scale urban environments has significantly increased. While global navigation satellite systems (GNSS) provide low-cost and reliable positioning services, they are often hindered in complex urban canyon environments. Thus, exploring opportunistic signals for positioning in urban areas has become a key so… ▽ More

    Submitted 14 December, 2024; v1 submitted 29 November, 2024; originally announced November 2024.

  6. arXiv:2409.08597  [pdf, other

    cs.SD cs.CL eess.AS

    LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation

    Authors: Shaojun Li, Hengchao Shang, Daimeng Wei, Jiaxin Guo, Zongyao Li, Xianghui He, Min Zhang, Hao Yang

    Abstract: Recent advancements in integrating speech information into large language models (LLMs) have significantly improved automatic speech recognition (ASR) accuracy. However, existing methods often constrained by the capabilities of the speech encoders under varied acoustic conditions, such as accents. To address this, we propose LA-RAG, a novel Retrieval-Augmented Generation (RAG) paradigm for LLM-bas… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: submitted to ICASSP 2025

  7. arXiv:2407.02005  [pdf, other

    cs.CL cs.SD eess.AS

    An End-to-End Speech Summarization Using Large Language Model

    Authors: Hengchao Shang, Zongyao Li, Jiaxin Guo, Shaojun Li, Zhiqiang Rao, Yuanchang Luo, Daimeng Wei, Hao Yang

    Abstract: Abstractive Speech Summarization (SSum) aims to generate human-like text summaries from spoken content. It encounters difficulties in handling long speech input and capturing the intricate cross-modal mapping between long speech inputs and short text summaries. Research on large language models (LLMs) and multimodal information fusion has provided new insights for addressing these challenges. In t… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: InterSpeech 2024

  8. arXiv:2406.09696  [pdf, other

    eess.IV cs.CV

    MoME: Mixture of Multimodal Experts for Cancer Survival Prediction

    Authors: Conghao Xiong, Hao Chen, Hao Zheng, Dong Wei, Yefeng Zheng, Joseph J. Y. Sung, Irwin King

    Abstract: Survival analysis, as a challenging task, requires integrating Whole Slide Images (WSIs) and genomic data for comprehensive decision-making. There are two main challenges in this task: significant heterogeneity and complex inter- and intra-modal interactions between the two modalities. Previous approaches utilize co-attention methods, which fuse features from both modalities only once after separa… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 8 + 1/2 pages, early accepted to MICCAI2024

  9. arXiv:2406.04791  [pdf, other

    cs.SD eess.AS

    Speaker-Smoothed kNN Speaker Adaptation for End-to-End ASR

    Authors: Shaojun Li, Daimeng Wei, Hengchao Shang, Jiaxin Guo, ZongYao Li, Zhanglin Wu, Zhiqiang Rao, Yuanchang Luo, Xianghui He, Hao Yang

    Abstract: Despite recent improvements in End-to-End Automatic Speech Recognition (E2E ASR) systems, the performance can degrade due to vocal characteristic mismatches between training and testing data, particularly with limited target speaker adaptation data. We propose a novel speaker adaptation approach Speaker-Smoothed kNN that leverages k-Nearest Neighbors (kNN) retrieval techniques to improve model out… ▽ More

    Submitted 1 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  10. arXiv:2405.16197  [pdf, other

    cs.CV eess.IV

    A 7K Parameter Model for Underwater Image Enhancement based on Transmission Map Prior

    Authors: Fuheng Zhou, Dikai Wei, Ye Fan, Yulong Huang, Yonggang Zhang

    Abstract: Although deep learning based models for underwater image enhancement have achieved good performance, they face limitations in both lightweight and effectiveness, which prevents their deployment and application on resource-constrained platforms. Moreover, most existing deep learning based models use data compression to get high-level semantic information in latent space instead of using the origina… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 10 pages

  11. arXiv:2404.14435  [pdf, other

    cs.CV eess.IV

    Frenet-Serret Frame-based Decomposition for Part Segmentation of 3D Curvilinear Structures

    Authors: Leslie Gu, Jason Ken Adhinarta, Mikhail Bessmeltsev, Jiancheng Yang, Yongjie Jessica Zhang, Wenjie Yin, Daniel Berger, Jeff Lichtman, Hanspeter Pfister, Donglai Wei

    Abstract: Accurately segmenting 3D curvilinear structures in medical imaging remains challenging due to their complex geometry and the scarcity of diverse, large-scale datasets for algorithm development and evaluation. In this paper, we use dendritic spine segmentation as a case study and address these challenges by introducing a novel Frenet--Serret Frame-based Decomposition, which decomposes 3D curvilinea… ▽ More

    Submitted 24 October, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures

  12. arXiv:2404.06080  [pdf

    eess.IV cs.CV

    Using Few-Shot Learning to Classify Primary Lung Cancer and Other Malignancy with Lung Metastasis in Cytological Imaging via Endobronchial Ultrasound Procedures

    Authors: Ching-Kai Lin, Di-Chun Wei, Yun-Chien Cheng

    Abstract: This study presents a computer-aided diagnosis (CAD) system to assist early detection of lung metastases during endobronchial ultrasound (EBUS) procedures, significantly reducing follow-up time and enabling timely treatment. Due to limited cytology images and morphological similarities among cells, classifying lung metastases is challenging, and existing research rarely targets this issue directly… ▽ More

    Submitted 14 May, 2025; v1 submitted 9 April, 2024; originally announced April 2024.

  13. arXiv:2404.04904  [pdf, other

    cs.SD cs.AI eess.AS

    Cross-Domain Audio Deepfake Detection: Dataset and Analysis

    Authors: Yuang Li, Min Zhang, Mengxin Ren, Miaomiao Ma, Daimeng Wei, Hao Yang

    Abstract: Audio deepfake detection (ADD) is essential for preventing the misuse of synthetic voices that may infringe on personal rights and privacy. Recent zero-shot text-to-speech (TTS) models pose higher risks as they can clone voices with a single utterance. However, the existing ADD datasets are outdated, leading to suboptimal generalization of detection models. In this paper, we construct a new cross-… ▽ More

    Submitted 20 September, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

  14. arXiv:2402.09372  [pdf, other

    eess.IV cs.AI cs.CV

    Deep Rib Fracture Instance Segmentation and Classification from CT on the RibFrac Challenge

    Authors: Jiancheng Yang, Rui Shi, Liang Jin, Xiaoyang Huang, Kaiming Kuang, Donglai Wei, Shixuan Gu, Jianying Liu, Pengfei Liu, Zhizhong Chai, Yongjie Xiao, Hao Chen, Liming Xu, Bang Du, Xiangyi Yan, Hao Tang, Adam Alessio, Gregory Holste, Jiapeng Zhang, Xiaoming Wang, Jianye He, Lixuan Che, Hanspeter Pfister, Ming Li, Bingbing Ni

    Abstract: Rib fractures are a common and potentially severe injury that can be challenging and labor-intensive to detect in CT scans. While there have been efforts to address this field, the lack of large-scale annotated datasets and evaluation benchmarks has hindered the development and validation of deep learning algorithms. To address this issue, the RibFrac Challenge was introduced, providing a benchmar… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Challenge paper for MICCAI RibFrac Challenge (https://ribfrac.grand-challenge.org/)

  15. UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction

    Authors: Jiaxin Guo, Minghan Wang, Xiaosong Qiao, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhengzhe Yu, Yinglu Li, Chang Su, Min Zhang, Shimin Tao, Hao Yang

    Abstract: Error correction techniques have been used to refine the output sentences from automatic speech recognition (ASR) models and achieve a lower word error rate (WER). Previous works usually adopt end-to-end models and has strong dependency on Pseudo Paired Data and Original Paired Data. But when only pre-training on Pseudo Paired Data, previous models have negative effect on correction. While fine-tu… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted in ICASSP 2023

  16. arXiv:2312.01726  [pdf, other

    eess.IV cs.CV

    Simultaneous Alignment and Surface Regression Using Hybrid 2D-3D Networks for 3D Coherent Layer Segmentation of Retinal OCT Images with Full and Sparse Annotations

    Authors: Hong Liu, Dong Wei, Donghuan Lu, Xiaoying Tang, Liansheng Wang, Yefeng Zheng

    Abstract: Layer segmentation is important to quantitative analysis of retinal optical coherence tomography (OCT). Recently, deep learning based methods have been developed to automate this task and yield remarkable performance. However, due to the large spatial gap and potential mismatch between the B-scans of an OCT volume, all of them were based on 2D segmentation of individual B-scans, which may lose the… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted by MIA. arXiv admin note: text overlap with arXiv:2203.02390

  17. arXiv:2309.17329  [pdf, other

    cs.CV cs.AI cs.GR cs.LG eess.IV

    Efficient Anatomical Labeling of Pulmonary Tree Structures via Deep Point-Graph Representation-based Implicit Fields

    Authors: Kangxian Xie, Jiancheng Yang, Donglai Wei, Ziqiao Weng, Pascal Fua

    Abstract: Pulmonary diseases rank prominently among the principal causes of death worldwide. Curing them will require, among other things, a better understanding of the complex 3D tree-shaped structures within the pulmonary system, such as airways, arteries, and veins. Traditional approaches using high-resolution image stacks and standard CNNs on dense voxel grids face challenges in computational efficiency… ▽ More

    Submitted 17 October, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted by Medical Image Analysis

    MSC Class: 68T45; 62P10; 68U10; 68U05; 05C90

  18. arXiv:2309.12805  [pdf, other

    eess.IV cs.CV

    Automatic view plane prescription for cardiac magnetic resonance imaging via supervision by spatial relationship between views

    Authors: Dong Wei, Yawen Huang, Donghuan Lu, Yuexiang Li, Yefeng Zheng

    Abstract: Background: View planning for the acquisition of cardiac magnetic resonance (CMR) imaging remains a demanding task in clinical practice. Purpose: Existing approaches to its automation relied either on an additional volumetric image not typically acquired in clinic routine, or on laborious manual annotations of cardiac structural landmarks. This work presents a clinic-compatible, annotation-free sy… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: Medical Physics. arXiv admin note: text overlap with arXiv:2109.11715

  19. arXiv:2306.14274  [pdf, other

    eess.IV cs.CV

    MEPNet: A Model-Driven Equivariant Proximal Network for Joint Sparse-View Reconstruction and Metal Artifact Reduction in CT Images

    Authors: Hong Wang, Minghao Zhou, Dong Wei, Yuexiang Li, Yefeng Zheng

    Abstract: Sparse-view computed tomography (CT) has been adopted as an important technique for speeding up data acquisition and decreasing radiation dose. However, due to the lack of sufficient projection data, the reconstructed CT images often present severe artifacts, which will be further amplified when patients carry metallic implants. For this joint sparse-view reconstruction and metal artifact reductio… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: MICCAI 2023

  20. arXiv:2306.06767  [pdf, other

    eess.IV cs.CL cs.CV cs.LG

    The Impact of ChatGPT and LLMs on Medical Imaging Stakeholders: Perspectives and Use Cases

    Authors: Jiancheng Yang, Hongwei Bran Li, Donglai Wei

    Abstract: This study investigates the transformative potential of Large Language Models (LLMs), such as OpenAI ChatGPT, in medical imaging. With the aid of public data, these models, which possess remarkable language understanding and generation capabilities, are augmenting the interpretive skills of radiologists, enhancing patient-physician communication, and streamlining clinical workflows. The paper intr… ▽ More

    Submitted 6 July, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: Paper invited for the first issue of Meta-Radiology

  21. arXiv:2305.10009  [pdf, other

    eess.SP

    A Modular and High-Resolution Time-Frequency Post-Processing Technique

    Authors: Jinshun Shen, Deyun Wei

    Abstract: In this letter, based on the variational model, we propose a novel time-frequency post-processing technique to approximate the ideal time-frequency representation. Our method has the advantage of modularity, enabling "plug and play", independent of the performance of specific time-frequency analysis tool. Therefore, it can be easily generalized to the fractional Fourier domain and the linear canon… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  22. arXiv:2303.05302  [pdf, other

    eess.IV cs.CV

    M3AE: Multimodal Representation Learning for Brain Tumor Segmentation with Missing Modalities

    Authors: Hong Liu, Dong Wei, Donghuan Lu, Jinghan Sun, Liansheng Wang, Yefeng Zheng

    Abstract: Multimodal magnetic resonance imaging (MRI) provides complementary information for sub-region analysis of brain tumors. Plenty of methods have been proposed for automatic brain tumor segmentation using four common MRI modalities and achieved remarkable performance. In practice, however, it is common to have one or more modalities missing due to image corruption, artifacts, acquisition protocols, a… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Journal ref: AAAI 2023

  23. arXiv:2212.10431  [pdf, other

    cs.CV cs.LG cs.MM eess.IV

    QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity

    Authors: Siyu Huang, Jie An, Donglai Wei, Jiebo Luo, Hanspeter Pfister

    Abstract: The mechanism of existing style transfer algorithms is by minimizing a hybrid loss function to push the generated image toward high similarities in both content and style. However, this type of approach cannot guarantee visual fidelity, i.e., the generated artworks should be indistinguishable from real ones. In this paper, we devise a new style transfer framework called QuantArt for high visual-fi… ▽ More

    Submitted 5 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted to CVPR 2023. Code is available at https://github.com/siyuhuang/QuantArt

  24. arXiv:2210.09309  [pdf, other

    eess.IV cs.CV cs.LG

    RibSeg v2: A Large-scale Benchmark for Rib Labeling and Anatomical Centerline Extraction

    Authors: Liang Jin, Shixuan Gu, Donglai Wei, Jason Ken Adhinarta, Kaiming Kuang, Yongjie Jessica Zhang, Hanspeter Pfister, Bingbing Ni, Jiancheng Yang, Ming Li

    Abstract: Automatic rib labeling and anatomical centerline extraction are common prerequisites for various clinical applications. Prior studies either use in-house datasets that are inaccessible to communities, or focus on rib segmentation that neglects the clinical significance of rib labeling. To address these issues, we extend our prior dataset (RibSeg) on the binary rib segmentation task to a comprehens… ▽ More

    Submitted 1 August, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 10 pages, 6 figures, journal

  25. arXiv:2207.03180  [pdf, other

    eess.IV cs.CV

    Deformer: Towards Displacement Field Learning for Unsupervised Medical Image Registration

    Authors: Jiashun Chen, Donghuan Lu, Yu Zhang, Dong Wei, Munan Ning, Xinyu Shi, Zhe Xu, Yefeng Zheng

    Abstract: Recently, deep-learning-based approaches have been widely studied for deformable image registration task. However, most efforts directly map the composite image representation to spatial transformation through the convolutional neural network, ignoring its limited ability to capture spatial correspondence. On the other hand, Transformer can better characterize the spatial relationship with attenti… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  26. arXiv:2206.02425  [pdf, other

    eess.IV cs.CV

    mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain Tumor Segmentation

    Authors: Yao Zhang, Nanjun He, Jiawei Yang, Yuexiang Li, Dong Wei, Yawen Huang, Yang Zhang, Zhiqiang He, Yefeng Zheng

    Abstract: Accurate brain tumor segmentation from Magnetic Resonance Imaging (MRI) is desirable to joint learning of multimodal images. However, in clinical practice, it is not always possible to acquire a complete set of MRIs, and the problem of missing modalities causes severe performance degradation in existing multimodal segmentation methods. In this work, we present the first attempt to exploit the Tran… ▽ More

    Submitted 4 August, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted to MICCAI 2022

  27. Myocardial Segmentation of Late Gadolinium Enhanced MR Images by Propagation of Contours from Cine MR Images

    Authors: Dong Wei, Ying Sun, Ping Chai, Adrian Low, Sim Heng Ong

    Abstract: Automatic segmentation of myocardium in Late Gadolinium Enhanced (LGE) Cardiac MR (CMR) images is often difficult due to the intensity heterogeneity resulting from accumulation of contrast agent in infarcted areas. In this paper, we propose an automatic segmentation framework that fully utilizes shared information between corresponding cine and LGE images of a same patient. Given myocardial contou… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

    Comments: MICCAI 2011

  28. arXiv:2205.10572  [pdf, other

    eess.IV cs.CV physics.med-ph

    A Comprehensive 3-D Framework for Automatic Quantification of Late Gadolinium Enhanced Cardiac Magnetic Resonance Images

    Authors: Dong Wei, Ying Sun, Sim-Heng Ong, Ping Chai, Lynette L Teo, Adrian F Low

    Abstract: Late gadolinium enhanced (LGE) cardiac magnetic resonance (CMR) can directly visualize nonviable myocardium with hyperenhanced intensities with respect to normal myocardium. For heart attack patients, it is crucial to facilitate the decision of appropriate therapy by analyzing and quantifying their LGE CMR images. To achieve accurate quantification, LGE CMR images need to be processed in two steps… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

    Comments: IEEE Transactions on Biomedical Engineering ( Volume: 60, Issue: 6, June 2013)

  29. arXiv:2205.10548  [pdf, ps, other

    eess.IV cs.CV physics.med-ph

    Three-Dimensional Segmentation of the Left Ventricle in Late Gadolinium Enhanced MR Images of Chronic Infarction Combining Long- and Short-Axis Information

    Authors: Dong Wei, Ying Sun, Sim-Heng Ong, Ping Chai, Lynette L. Teo, Adrian F. Low

    Abstract: Automatic segmentation of the left ventricle (LV) in late gadolinium enhanced (LGE) cardiac MR (CMR) images is difficult due to the intensity heterogeneity arising from accumulation of contrast agent in infarcted myocardium. In this paper, we present a comprehensive framework for automatic 3D segmentation of the LV in LGE CMR images. Given myocardial contours in cine images as a priori knowledge,… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

    Comments: Medical Image Analysis, Volume 17, Issue 6, August 2013, Pages 685-697

  30. arXiv:2204.02844  [pdf, other

    cs.CV eess.IV

    Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training

    Authors: Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Yulun Zhang, Hanspeter Pfister, Donglai Wei

    Abstract: Existing deep learning real denoising methods require a large amount of noisy-clean image pairs for supervision. Nonetheless, capturing a real noisy-clean dataset is an unacceptable expensive and cumbersome procedure. To alleviate this problem, this work investigates how to generate realistic noisy images. Firstly, we formulate a simple yet reasonable noise model that treats each real noisy pixel… ▽ More

    Submitted 14 September, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2021

  31. arXiv:2203.05571  [pdf, other

    eess.IV cs.CV

    Deep Convolutional Neural Networks for Molecular Subtyping of Gliomas Using Magnetic Resonance Imaging

    Authors: Dong Wei, Yiming Li, Yinyan Wang, Tianyi Qian, Yefeng Zheng

    Abstract: Knowledge of molecular subtypes of gliomas can provide valuable information for tailored therapies. This study aimed to investigate the use of deep convolutional neural networks (DCNNs) for noninvasive glioma subtyping with radiological imaging data according to the new taxonomy announced by the World Health Organization in 2016. Methods: A DCNN model was developed for the prediction of the five g… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Proc. SPIE 11314, Medical Imaging 2020: Computer-Aided Diagnosis

  32. Conquering Data Variations in Resolution: A Slice-Aware Multi-Branch Decoder Network

    Authors: Shuxin Wang, Shilei Cao, Zhizhong Chai, Dong Wei, Kai Ma, Liansheng Wang, Yefeng Zheng

    Abstract: Fully convolutional neural networks have made promising progress in joint liver and liver tumor segmentation. Instead of following the debates over 2D versus 3D networks (for example, pursuing the balance between large-scale 2D pretraining and 3D context), in this paper, we novelly identify the wide variation in the ratio between intra- and inter-slice resolutions as a crucial obstacle to the perf… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Published by IEEE TMI

  33. Simultaneous Alignment and Surface Regression Using Hybrid 2D-3D Networks for 3D Coherent Layer Segmentation of Retina OCT Images

    Authors: Hong Liu, Dong Wei, Donghuan Lu, Yuexiang Li, Kai Ma, Liansheng Wang, Yefeng Zheng

    Abstract: Automated surface segmentation of retinal layer is important and challenging in analyzing optical coherence tomography (OCT). Recently, many deep learning based methods have been developed for this task and yield remarkable performance. However, due to large spatial gap and potential mismatch between the B-scans of OCT data, all of them are based on 2D segmentation of individual B-scans, which may… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: Presented at MICCAI 2021

  34. arXiv:2112.05754  [pdf, other

    eess.IV cs.CV q-bio.QM

    PyTorch Connectomics: A Scalable and Flexible Segmentation Framework for EM Connectomics

    Authors: Zudi Lin, Donglai Wei, Jeff Lichtman, Hanspeter Pfister

    Abstract: We present PyTorch Connectomics (PyTC), an open-source deep-learning framework for the semantic and instance segmentation of volumetric microscopy images, built upon PyTorch. We demonstrate the effectiveness of PyTC in the field of connectomics, which aims to segment and reconstruct neurons, synapses, and other organelles like mitochondria at nanometer resolution for understanding neuronal communi… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Technical report

  35. arXiv:2110.14795  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    MedMNIST v2 -- A large-scale lightweight benchmark for 2D and 3D biomedical image classification

    Authors: Jiancheng Yang, Rui Shi, Donglai Wei, Zequan Liu, Lin Zhao, Bilian Ke, Hanspeter Pfister, Bingbing Ni

    Abstract: We introduce MedMNIST v2, a large-scale MNIST-like dataset collection of standardized biomedical images, including 12 datasets for 2D and 6 datasets for 3D. All images are pre-processed into a small size of 28x28 (2D) or 28x28x28 (3D) with the corresponding classification labels so that no background knowledge is required for users. Covering primary data modalities in biomedical images, MedMNIST v… ▽ More

    Submitted 25 September, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: The data and code are publicly available at https://medmnist.com/. arXiv admin note: text overlap with arXiv:2010.14925

    Journal ref: Scientific Data 2023

  36. arXiv:2109.14805  [pdf, other

    eess.IV cs.CV

    Unsupervised Landmark Detection Based Spatiotemporal Motion Estimation for 4D Dynamic Medical Images

    Authors: Yuyu Guo, Lei Bi, Dongming Wei, Liyun Chen, Zhengbin Zhu, Dagan Feng, Ruiyan Zhang, Qian Wang, Jinman Kim

    Abstract: Motion estimation is a fundamental step in dynamic medical image processing for the assessment of target organ anatomy and function. However, existing image-based motion estimation methods, which optimize the motion field by evaluating the local image similarity, are prone to produce implausible estimation, especially in the presence of large motion. In this study, we provide a novel motion estima… ▽ More

    Submitted 7 November, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: accepted by IEEE Transactions on Cybernetics

  37. arXiv:2109.11715  [pdf, other

    eess.IV cs.CV

    Training Automatic View Planner for Cardiac MR Imaging via Self-Supervision by Spatial Relationship between Views

    Authors: Dong Wei, Kai Ma, Yefeng Zheng

    Abstract: View planning for the acquisition of cardiac magnetic resonance imaging (CMR) requires acquaintance with the cardiac anatomy and remains a challenging task in clinical practice. Existing approaches to its automation relied either on an additional volumetric image not typically acquired in clinic routine, or on laborious manual annotations of cardiac structural landmarks. This work presents a clini… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: Accepted by MICCAI 2021

  38. arXiv:2109.09521  [pdf, other

    eess.IV cs.AI cs.CV cs.GR cs.LG

    RibSeg Dataset and Strong Point Cloud Baselines for Rib Segmentation from CT Scans

    Authors: Jiancheng Yang, Shixuan Gu, Donglai Wei, Hanspeter Pfister, Bingbing Ni

    Abstract: Manual rib inspections in computed tomography (CT) scans are clinically critical but labor-intensive, as 24 ribs are typically elongated and oblique in 3D volumes. Automatic rib segmentation methods can speed up the process through rib measurement and visualization. However, prior arts mostly use in-house labeled datasets that are publicly unavailable and work on dense 3D volumes that are computat… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: MICCAI 2021. The dataset, code, and model are available at https://github.com/M3DV/RibSeg

  39. arXiv:2108.07979  [pdf, other

    cs.CV eess.IV

    A New Bidirectional Unsupervised Domain Adaptation Segmentation Framework

    Authors: Munan Ning, Cheng Bian, Dong Wei, Chenglang Yuan, Yaohua Wang, Yang Guo, Kai Ma, Yefeng Zheng

    Abstract: Domain shift happens in cross-domain scenarios commonly because of the wide gaps between different domains: when applying a deep learning model well-trained in one domain to another target domain, the model usually performs poorly. To tackle this problem, unsupervised domain adaptation (UDA) techniques are proposed to bridge the gap between different domains, for the purpose of improving model per… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: IPMI 2021

  40. arXiv:2005.07701  [pdf

    eess.IV physics.optics

    Optical image decomposition and noise filtering based on Laguerre-Gaussian modes

    Authors: Jiantao Ma, Dan Wei, Haocheng Yang, Yong Zhang, Min Xiao

    Abstract: We propose and experimentally demonstrate an efficient image decomposition in the Laguerre-Gaussian (LG) domain. By developing an advanced computing method, the sampling points are much fewer than those in the existing methods, which can significantly improve the calculation efficiency. The beam waist, azimuthal and radial truncation orders of the LG modes are optimized depending on the image info… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

  41. arXiv:2005.07428  [pdf

    physics.optics eess.IV

    Laguerre-Gaussian transform for rotating image processing

    Authors: Dan Wei, Jiantao Ma, Tianxin Wang, Chuan Xu, Yin Cai, Lidan Zhang, Xinyuan Fang, Dunzhao Wei, Shining Zhu, Yong Zhang, Min Xiao

    Abstract: Rotation is a common motional form in nature, existing from atoms and molecules, industrial turbines to astronomical objects. However, it still lacks an efficient and reliable method for real-time image processing of a fast-rotating object. Since the Fourier spectrum of a rotating object changes rapidly, the traditional Fourier transform (FT) techniques become extremely complicated and time consum… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

  42. arXiv:2001.03857  [pdf, other

    eess.IV cs.CV

    Robust Brain Magnetic Resonance Image Segmentation for Hydrocephalus Patients: Hard and Soft Attention

    Authors: Xuhua Ren, Jiayu Huo, Kai Xuan, Dongming Wei, Lichi Zhang, Qian Wang

    Abstract: Brain magnetic resonance (MR) segmentation for hydrocephalus patients is considered as a challenging work. Encoding the variation of the brain anatomical structures from different individuals cannot be easily achieved. The task becomes even more difficult especially when the image data from hydrocephalus patients are considered, which often have large deformations and differ significantly from the… ▽ More

    Submitted 12 January, 2020; originally announced January 2020.

    Comments: ISBI 2020

  43. arXiv:1907.13020  [pdf, other

    eess.IV cs.CV

    Synthesis and Inpainting-Based MR-CT Registration for Image-Guided Thermal Ablation of Liver Tumors

    Authors: Dongming Wei, Sahar Ahmad, Jiayu Huo, Wen Peng, Yunhao Ge, Zhong Xue, Pew-Thian Yap, Wentao Li, Dinggang Shen, Qian Wang

    Abstract: Thermal ablation is a minimally invasive procedure for treat-ing small or unresectable tumors. Although CT is widely used for guiding ablation procedures, the contrast of tumors against surrounding normal tissues in CT images is often poor, aggravating the difficulty in accurate thermal ablation. In this paper, we propose a fast MR-CT image registration method to overlay a pre-procedural MR (pMR)… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: Accepted in MICCAI 2019

  44. arXiv:1906.02031  [pdf, ps, other

    eess.IV cs.CV

    OctopusNet: A Deep Learning Segmentation Network for Multi-modal Medical Images

    Authors: Yu Chen, Jiawei Chen, Dong Wei, Yuexiang Li, Yefeng Zheng

    Abstract: Deep learning models, such as the fully convolutional network (FCN), have been widely used in 3D biomedical segmentation and achieved state-of-the-art performance. Multiple modalities are often used for disease diagnosis and quantification. Two approaches are widely used in the literature to fuse multiple modalities in the segmentation networks: early-fusion (which stacks multiple modalities as di… ▽ More

    Submitted 22 August, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

  45. arXiv:1401.2181  [pdf, ps, other

    eess.SY math.OC

    A biologically inspired model for transshipment problem

    Authors: Cai Gao, Chao Yan, Daijun Wei, Yong Hu, Sankaran Mahadevan, Yong Deng

    Abstract: Transshipment problem is one of the basic operational research problems. In this paper, our first work is to develop a biologically inspired mathematical model for a dynamical system, which is first used to solve minimum cost flow problem. It has lower computational complexity than Physarum Solver. Second, we apply the proposed model to solve the traditional transshipment problem. Compared with th… ▽ More

    Submitted 4 January, 2014; originally announced January 2014.

    Comments: 4 pages, 2 figures