Skip to main content

Showing 1–50 of 64 results for author: Han, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.07126  [pdf, ps, other

    eess.IV cs.AI

    DpDNet: An Dual-Prompt-Driven Network for Universal PET-CT Segmentation

    Authors: Xinglong Liang, Jiaju Huang, Luyi Han, Tianyu Zhang, Xin Wang, Yuan Gao, Chunyao Lu, Lishan Cai, Tao Tan, Ritse Mann

    Abstract: PET-CT lesion segmentation is challenging due to noise sensitivity, small and variable lesion morphology, and interference from physiological high-metabolic signals. Current mainstream approaches follow the practice of one network solving the segmentation of multiple cancer lesions by treating all cancers as a single task. However, this overlooks the unique characteristics of different cancer type… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  2. arXiv:2507.01055  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Prompt Mechanisms in Medical Imaging: A Comprehensive Survey

    Authors: Hao Yang, Xinlong Liang, Zhang Li, Yue Sun, Zheyu Hu, Xinghe Xie, Behdad Dashtbozorg, Jincheng Huang, Shiwei Zhu, Luyi Han, Jiong Zhang, Shanshan Wang, Ritse Mann, Qifeng Yu, Tao Tan

    Abstract: Deep learning offers transformative potential in medical imaging, yet its clinical adoption is frequently hampered by challenges such as data scarcity, distribution shifts, and the need for robust task generalization. Prompt-based methodologies have emerged as a pivotal strategy to guide deep learning models, providing flexible, domain-specific adaptations that significantly enhance model performa… ▽ More

    Submitted 27 June, 2025; originally announced July 2025.

  3. arXiv:2506.22001  [pdf, ps, other

    eess.AS cs.SD

    WTFormer: A Wavelet Conformer Network for MIMO Speech Enhancement with Spatial Cues Peservation

    Authors: Lu Han, Junqi Zhao, Renhua Peng

    Abstract: Current multi-channel speech enhancement systems mainly adopt single-output architecture, which face significant challenges in preserving spatio-temporal signal integrity during multiple-input multiple-output (MIMO) processing. To address this limitation, we propose a novel neural network, termed WTFormer, for MIMO speech enhancement that leverages the multi-resolution characteristics of wavelet t… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: Accepted by Interspeech2025

  4. arXiv:2506.12006  [pdf, ps, other

    eess.IV cs.CV

    crossMoDA Challenge: Evolution of Cross-Modality Domain Adaptation Techniques for Vestibular Schwannoma and Cochlea Segmentation from 2021 to 2023

    Authors: Navodini Wijethilake, Reuben Dorent, Marina Ivory, Aaron Kujawa, Stefan Cornelissen, Patrick Langenhuizen, Mohamed Okasha, Anna Oviedova, Hexin Dong, Bogyeong Kang, Guillaume Sallé, Luyi Han, Ziyuan Zhao, Han Liu, Tao Yang, Shahad Hardan, Hussain Alasmawi, Santosh Sanjeev, Yuzhou Zhuang, Satoshi Kondo, Maria Baldeon Calisto, Shaikh Muhammad Uzair Noman, Cancan Chen, Ipek Oguz, Rongguo Zhang , et al. (14 additional authors not shown)

    Abstract: The cross-Modality Domain Adaptation (crossMoDA) challenge series, initiated in 2021 in conjunction with the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), focuses on unsupervised cross-modality segmentation, learning from contrast-enhanced T1 (ceT1) and transferring to T2 MRI. The task is an extreme example of domain shift chosen to serve as a mea… ▽ More

    Submitted 24 June, 2025; v1 submitted 13 June, 2025; originally announced June 2025.

  5. arXiv:2506.11250  [pdf, ps, other

    cs.LG cs.AI eess.SY

    Can Time-Series Foundation Models Perform Building Energy Management Tasks?

    Authors: Ozan Baris Mulayim, Pengrui Quan, Liying Han, Xiaomin Ouyang, Dezhi Hong, Mario Bergés, Mani Srivastava

    Abstract: Building energy management (BEM) tasks require processing and learning from a variety of time-series data. Existing solutions rely on bespoke task- and data-specific models to perform these tasks, limiting their broader applicability. Inspired by the transformative success of Large Language Models (LLMs), Time-Series Foundation Models (TSFMs), trained on diverse datasets, have the potential to cha… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 30 pages, 5 tables, 8 figures. Under review for Data-Centric Engineering journal

  6. arXiv:2505.22106  [pdf, ps, other

    cs.SD cs.AI eess.AS

    AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion

    Authors: Junqi Zhao, Jinzheng Zhao, Haohe Liu, Yun Chen, Lu Han, Xubo Liu, Mark Plumbley, Wenwu Wang

    Abstract: Diffusion models have significantly improved the quality and diversity of audio generation but are hindered by slow inference speed. Rectified flow enhances inference speed by learning straight-line ordinary differential equation (ODE) paths. However, this approach requires training a flow-matching model from scratch and tends to perform suboptimally, or even poorly, at low step counts. To address… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  7. arXiv:2505.14717  [pdf, ps, other

    eess.IV cs.AI cs.CV cs.LG

    Aneumo: A Large-Scale Multimodal Aneurysm Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks

    Authors: Xigui Li, Yuanye Zhou, Feiyang Xiao, Xin Guo, Chen Jiang, Tan Pan, Xingmeng Zhang, Cenyu Liu, Zeyun Miao, Jianchao Ge, Xiansheng Wang, Qimeng Wang, Yichi Zhang, Wenbo Zhang, Fengping Zhu, Limei Han, Yuan Qi, Chensen Lin, Yuan Cheng

    Abstract: Intracranial aneurysms (IAs) are serious cerebrovascular lesions found in approximately 5\% of the general population. Their rupture may lead to high mortality. Current methods for assessing IA risk focus on morphological and patient-specific factors, but the hemodynamic influences on IA development and rupture remain unclear. While accurate for hemodynamic studies, conventional computational flui… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  8. arXiv:2505.11618  [pdf, ps, other

    cs.AI cs.LG eess.SP

    Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and Challenges

    Authors: Pengrui Quan, Brian Wang, Kang Yang, Liying Han, Mani Srivastava

    Abstract: Spatiotemporal reasoning plays a key role in Cyber-Physical Systems (CPS). Despite advances in Large Language Models (LLMs) and Large Reasoning Models (LRMs), their capacity to reason about complex spatiotemporal signals remains underexplored. This paper proposes a hierarchical SpatioTemporal reAsoning benchmaRK, STARK, to systematically evaluate LLMs across three levels of reasoning complexity: s… ▽ More

    Submitted 27 May, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

  9. arXiv:2503.03199  [pdf, other

    eess.IV q-bio.QM

    PathRWKV: Enabling Whole Slide Prediction with Recurrent-Transformer

    Authors: Sicheng Chen, Tianyi Zhang, Dankai Liao, Dandan Li, Low Chang Han, Yanqin Jiang, Yueming Jin, Shangqing Lyu

    Abstract: Pathological diagnosis plays a critical role in clinical practice, where the whole slide images (WSIs) are widely applied. Through a two-stage paradigm, recent deep learning approaches enhance the WSI analysis with tile-level feature extracting and slide-level feature modeling. Current Transformer models achieved improvement in the efficiency and accuracy to previous multiple instance learning bas… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: 11 pages, 2 figures

  10. arXiv:2501.16368  [pdf, other

    cs.LG cs.AI eess.SY

    Foundation Models for CPS-IoT: Opportunities and Challenges

    Authors: Ozan Baris, Yizhuo Chen, Gaofeng Dong, Liying Han, Tomoyoshi Kimura, Pengrui Quan, Ruijie Wang, Tianchen Wang, Tarek Abdelzaher, Mario Bergés, Paul Pu Liang, Mani Srivastava

    Abstract: Methods from machine learning (ML) have transformed the implementation of Perception-Cognition-Communication-Action loops in Cyber-Physical Systems (CPS) and the Internet of Things (IoT), replacing mechanistic and basic statistical models with those derived from data. However, the first generation of ML approaches, which depend on supervised learning with annotated data to create task-specific mod… ▽ More

    Submitted 4 February, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

  11. arXiv:2501.00399  [pdf, other

    cs.IT eess.SP

    Movable Superdirective Pairs: A Phase Shifter-Free Approach to mmWave Communications

    Authors: Liangcheng Han, Haifan Yin, Mengying Gao, Rui Zhang

    Abstract: In this letter, we propose a novel Movable Superdirective Pairs (MSP) approach that combines movable antennas with superdirective pair arrays to enhance the performance of millimeter-wave (mmWave) communications on the user side. By controlling the rotation angles and positions of superdirective antenna pairs, the proposed MSP approach maximizes the received signal-to-noise ratio (SNR) of multipat… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

  12. arXiv:2412.18996  [pdf, other

    eess.IV cs.CV cs.LG

    WaveDiffUR: A diffusion SDE-based solver for ultra magnification super-resolution in remote sensing images

    Authors: Yue Shi, Liangxiu Han, Darren Dancy, Lianghao Han

    Abstract: Deep neural networks have recently achieved significant advancements in remote sensing superresolu-tion (SR). However, most existing methods are limited to low magnification rates (e.g., 2 or 4) due to the escalating ill-posedness at higher magnification scales. To tackle this challenge, we redefine high-magnification SR as the ultra-resolution (UR) problem, reframing it as solving a conditional d… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

  13. arXiv:2409.07040  [pdf, other

    cs.CV eess.IV

    Retinex-RAWMamba: Bridging Demosaicing and Denoising for Low-Light RAW Image Enhancement

    Authors: Xianmin Chen, Peiliang Huang, Xiaoxu Feng, Dingwen Zhang, Longfei Han, Junwei Han

    Abstract: Low-light image enhancement, particularly in cross-domain tasks such as mapping from the raw domain to the sRGB domain, remains a significant challenge. Many deep learning-based methods have been developed to address this issue and have shown promising results in recent years. However, single-stage methods, which attempt to unify the complex mapping across both domains, leading to limited denoisin… ▽ More

    Submitted 31 December, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

  14. arXiv:2409.06887  [pdf, other

    eess.IV cs.CV

    Ordinal Learning: Longitudinal Attention Alignment Model for Predicting Time to Future Breast Cancer Events from Mammograms

    Authors: Xin Wang, Tao Tan, Yuan Gao, Eric Marcus, Luyi Han, Antonio Portaluri, Tianyu Zhang, Chunyao Lu, Xinglong Liang, Regina Beets-Tan, Jonas Teuwen, Ritse Mann

    Abstract: Precision breast cancer (BC) risk assessment is crucial for developing individualized screening and prevention. Despite the promising potential of recent mammogram (MG) based deep learning models in predicting BC risk, they mostly overlook the 'time-to-future-event' ordering among patients and exhibit limited explorations into how they track history changes in breast tissue, thereby limiting their… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

  15. arXiv:2408.13549  [pdf, other

    eess.SP

    A Superdirective Beamforming Approach based on MultiTransUNet-GAN

    Authors: Yali Zhang, Haifan Yin, Liangcheng Han

    Abstract: In traditional multiple-input multiple-output (MIMO) communication systems, the antenna spacing is often no smaller than half a wavelength. However, by exploiting the coupling between more closely-spaced antennas, a superdirective array may achieve a much higher beamforming gain than traditional MIMO. In this paper, we present a novel utilization of neural networks in the context of superdirective… ▽ More

    Submitted 27 August, 2024; v1 submitted 24 August, 2024; originally announced August 2024.

    Comments: 12 pages, 11 figures, 6 tables, to appear in IEEE Trans. Commun

  16. arXiv:2408.09715  [pdf, other

    cs.AI cs.CV cs.LG eess.IV

    HYDEN: Hyperbolic Density Representations for Medical Images and Reports

    Authors: Zhi Qiao, Linbin Han, Xiantong Zhen, Jia-Hong Gao, Zhen Qian

    Abstract: In light of the inherent entailment relations between images and text, hyperbolic point vector embeddings, leveraging the hierarchical modeling advantages of hyperbolic space, have been utilized for visual semantic representation learning. However, point vector embedding approaches fail to address the issue of semantic uncertainty, where an image may have multiple interpretations, and text may ref… ▽ More

    Submitted 19 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  17. arXiv:2407.10377  [pdf

    eess.IV cs.AI cs.CV

    Enhanced Masked Image Modeling to Avoid Model Collapse on Multi-modal MRI Datasets

    Authors: Linxuan Han, Sa Xiao, Zimeng Li, Haidong Li, Xiuchao Zhao, Yeqing Han, Fumin Guo, Xin Zhou

    Abstract: Multi-modal magnetic resonance imaging (MRI) provides information of lesions for computer-aided diagnosis from different views. Deep learning algorithms are suitable for identifying specific anatomical structures, segmenting lesions, and classifying diseases. Manual labels are limited due to the high expense, which hinders further improvement of accuracy. Self-supervised learning, particularly mas… ▽ More

    Submitted 15 January, 2025; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: This work has been submitted to the lEEE for possible publication. copyright may be transferred without notice, after which this version may no longer be accessible

  18. arXiv:2407.02911  [pdf, other

    eess.IV cs.CV

    Non-Adversarial Learning: Vector-Quantized Common Latent Space for Multi-Sequence MRI

    Authors: Luyi Han, Tao Tan, Tianyu Zhang, Xin Wang, Yuan Gao, Chunyao Lu, Xinglong Liang, Haoran Dou, Yunzhi Huang, Ritse Mann

    Abstract: Adversarial learning helps generative models translate MRI from source to target sequence when lacking paired samples. However, implementing MRI synthesis with adversarial learning in clinical settings is challenging due to training instability and mode collapse. To address this issue, we leverage intermediate sequences to estimate the common latent space among multi-sequence MRI, enabling the rec… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  19. arXiv:2407.00718  [pdf, other

    eess.IV cs.CV

    ASPS: Augmented Segment Anything Model for Polyp Segmentation

    Authors: Huiqian Li, Dingwen Zhang, Jieru Yao, Longfei Han, Zhongyu Li, Junwei Han

    Abstract: Polyp segmentation plays a pivotal role in colorectal cancer diagnosis. Recently, the emergence of the Segment Anything Model (SAM) has introduced unprecedented potential for polyp segmentation, leveraging its powerful pre-training capability on large-scale datasets. However, due to the domain gap between natural and endoscopy images, SAM encounters two limitations in achieving effective performan… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: Accepted by MICCAI2024

  20. arXiv:2404.08224   

    cs.LG cs.AI cs.CR cs.IT eess.SY

    HCL-MTSAD: Hierarchical Contrastive Consistency Learning for Accurate Detection of Industrial Multivariate Time Series Anomalies

    Authors: Haili Sun, Yan Huang, Lansheng Han, Cai Fu, Chunjie Zhou

    Abstract: Multivariate Time Series (MTS) anomaly detection focuses on pinpointing samples that diverge from standard operational patterns, which is crucial for ensuring the safety and security of industrial applications. The primary challenge in this domain is to develop representations capable of discerning anomalies effectively. The prevalent methods for anomaly detection in the literature are predominant… ▽ More

    Submitted 18 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: This paper is a manuscript that is still in the process of revision, including Table 1, Figure 2, problem definition in section III.B and method description proposed in section IV. In addition, the submitter has not been authorized by the first author and other co-authors to post the paper to arXiv

  21. arXiv:2403.02616  [pdf

    cs.LG cs.AI cs.CR cs.NI eess.SY

    Unsupervised Spatio-Temporal State Estimation for Fine-grained Adaptive Anomaly Diagnosis of Industrial Cyber-physical Systems

    Authors: Haili Sun, Yan Huang, Lansheng Han, Cai Fu, Chunjie Zhou

    Abstract: Accurate detection and diagnosis of abnormal behaviors such as network attacks from multivariate time series (MTS) are crucial for ensuring the stable and effective operation of industrial cyber-physical systems (CPS). However, existing researches pay little attention to the logical dependencies among system working states, and have difficulties in explaining the evolution mechanisms of abnormal s… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 23 pages, 7 figures

  22. arXiv:2401.09336  [pdf, other

    eess.IV cs.CV

    To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection

    Authors: Luyi Han, Tao Tan, Tianyu Zhang, Yuan Gao, Xin Wang, Valentina Longo, Sofía Ventura-Díaz, Anna D'Angelo, Jonas Teuwen, Ritse Mann

    Abstract: Clinicians compare breast DCE-MRI after neoadjuvant chemotherapy (NAC) with pre-treatment scans to evaluate the response to NAC. Clinical evidence supports that accurate longitudinal deformable registration without deforming treated tumor regions is key to quantifying tumor changes. We propose a conditional pyramid registration network based on unsupervised keypoint detection and selective volume-… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  23. arXiv:2311.15090  [pdf, other

    eess.IV cs.CV cs.LG

    Fine-Grained Unsupervised Cross-Modality Domain Adaptation for Vestibular Schwannoma Segmentation

    Authors: Luyi Han, Tao Tan, Ritse Mann

    Abstract: The domain adaptation approach has gained significant acceptance in transferring styles across various vendors and centers, along with filling the gaps in modalities. However, multi-center application faces the challenge of the difficulty of domain adaptation due to their intra-domain differences. We focus on introducing a fine-grained unsupervised framework for domain adaptation to facilitate cro… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  24. arXiv:2311.13196  [pdf, other

    cs.IT eess.SP stat.ME

    Optimal Time of Arrival Estimation for MIMO Backscatter Channels

    Authors: Chen He, Luyang Han, Z. Jane Wang

    Abstract: In this paper, we propose a novel time of arrival (TOA) estimator for multiple-input-multiple-output (MIMO) backscatter channels in closed form. The proposed estimator refines the estimation precision from the topological structure of the MIMO backscatter channels, and can considerably enhance the estimation accuracy. Particularly, we show that for the general $M \times N$ bistatic topology, the m… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  25. arXiv:2311.02378  [pdf

    cs.CR cs.AI eess.SY

    MTS-DVGAN: Anomaly Detection in Cyber-Physical Systems using a Dual Variational Generative Adversarial Network

    Authors: Haili Sun, Yan Huang, Lansheng Han, Cai Fu, Hongle Liu, Xiang Long

    Abstract: Deep generative models are promising in detecting novel cyber-physical attacks, mitigating the vulnerability of Cyber-physical systems (CPSs) without relying on labeled information. Nonetheless, these generative models face challenges in identifying attack behaviors that closely resemble normal data, or deviate from the normal data distribution but are in close proximity to the manifold of the nor… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 27 pages, 14 figures, 8 tables. Accepted by Computers & Security

    Journal ref: Computers & Security, 2023, 103570

  26. arXiv:2308.09223  [pdf, other

    eess.IV cs.CV cs.LG

    DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction

    Authors: Xiaoxiao He, Chaowei Tan, Ligong Han, Bo Liu, Leon Axel, Kang Li, Dimitris N. Metaxas

    Abstract: Accurate 3D cardiac reconstruction from cine magnetic resonance imaging (cMRI) is crucial for improved cardiovascular disease diagnosis and understanding of the heart's motion. However, current cardiac MRI-based reconstruction technology used in clinical settings is 2D with limited through-plane resolution, resulting in low-quality reconstructed cardiac volumes. To better reconstruct 3D cardiac vo… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted in MICCAI 2023

  27. arXiv:2308.05864  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    The Multi-modality Cell Segmentation Challenge: Towards Universal Solutions

    Authors: Jun Ma, Ronald Xie, Shamini Ayyadhury, Cheng Ge, Anubha Gupta, Ritu Gupta, Song Gu, Yao Zhang, Gihun Lee, Joonkee Kim, Wei Lou, Haofeng Li, Eric Upschulte, Timo Dickscheid, José Guilherme de Almeida, Yixin Wang, Lin Han, Xin Yang, Marco Labagnara, Vojislav Gligorovski, Maxime Scheder, Sahand Jamal Rahi, Carly Kempster, Alice Pollitt, Leon Espinosa , et al. (15 additional authors not shown)

    Abstract: Cell segmentation is a critical step for quantitative single-cell analysis in microscopy images. Existing cell segmentation methods are often tailored to specific modalities or require manual interventions to specify hyper-parameters in different experimental settings. Here, we present a multi-modality cell segmentation benchmark, comprising over 1500 labeled images derived from more than 50 diver… ▽ More

    Submitted 1 April, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: NeurIPS22 Cell Segmentation Challenge: https://neurips22-cellseg.grand-challenge.org/ . Nature Methods (2024)

  28. arXiv:2308.03448  [pdf, other

    cs.CV eess.IV

    Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model

    Authors: Xin Jin, Jia-Wen Xiao, Ling-Hao Han, Chunle Guo, Xialei Liu, Chongyi Li, Ming-Ming Cheng

    Abstract: Explicit calibration-based methods have dominated RAW image denoising under extremely low-light environments. However, these methods are impeded by several critical limitations: a) the explicit calibration process is both labor- and time-intensive, b) challenge exists in transferring denoisers across different camera models, and c) the disparity between synthetic and real noise is exacerbated by d… ▽ More

    Submitted 25 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  29. arXiv:2307.06958  [pdf, other

    cs.IT eess.SP

    Superdirectivity-enhanced wireless communications: A multi-user perspective

    Authors: Liangcheng Han, Haifan Yin

    Abstract: Superdirective array may achieve an array gain proportional to the square of the number of antennas $M^2$. In the early studies of superdirectivity, little research has been done from wireless communication point of view. To leverage superdirectivity for enhancing the spectral efficiency, this paper investigates multi-user communication systems with superdirective arrays. We first propose a field-… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 11 pages, 8 figures

  30. arXiv:2307.02063  [pdf, other

    eess.SP

    A genetic algorithm based superdirective beamforming method under excitation power range constraints

    Authors: Jingcheng Xie, Haifan Yin, Liangcheng Han

    Abstract: The array gain of a superdirective antenna array can be proportional to the square of the number of antennas. However, the realization of the so-called superdirectivity entails accurate calculation and application of the excitations. Moreover, the excitations require a large dynamic power range, especially when the antenna spacing is smaller. In this paper, we derive the closed-form solution for t… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 5 pages, 6 figures

  31. arXiv:2307.00895  [pdf, other

    eess.IV cs.CV

    Synthesis of Contrast-Enhanced Breast MRI Using Multi-b-Value DWI-based Hierarchical Fusion Network with Attention Mechanism

    Authors: Tianyu Zhang, Luyi Han, Anna D'Angelo, Xin Wang, Yuan Gao, Chunyao Lu, Jonas Teuwen, Regina Beets-Tan, Tao Tan, Ritse Mann

    Abstract: Magnetic resonance imaging (MRI) is the most sensitive technique for breast cancer detection among current clinical imaging modalities. Contrast-enhanced MRI (CE-MRI) provides superior differentiation between tumors and invaded healthy tissue, and has become an indispensable technique in the detection and evaluation of cancer. However, the use of gadolinium-based contrast agents (GBCA) to obtain C… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted by MICCAI 2023

  32. arXiv:2307.00885  [pdf, other

    eess.IV cs.CV

    An Explainable Deep Framework: Towards Task-Specific Fusion for Multi-to-One MRI Synthesis

    Authors: Luyi Han, Tianyu Zhang, Yunzhi Huang, Haoran Dou, Xin Wang, Yuan Gao, Chunyao Lu, Tan Tao, Ritse Mann

    Abstract: Multi-sequence MRI is valuable in clinical settings for reliable diagnosis and treatment prognosis, but some sequences may be unusable or missing for various reasons. To address this issue, MRI synthesis is a potential solution. Recent deep learning-based methods have achieved good performance in combining multiple available sequences for missing sequence synthesis. Despite their success, these me… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  33. arXiv:2306.17207  [pdf, other

    cs.CV eess.IV

    A Fast Fourier Convolutional Deep Neural Network For Accurate and Explainable Discrimination Of Wheat Yellow Rust And Nitrogen Deficiency From Sentinel-2 Time-Series Data

    Authors: Yue Shi, Liangxiu Han, Pablo González-Moreno, Darren Dancey, Wenjiang Huang, Zhiqiang Zhang, Yuanyuan Liu, Mengning Huan, Hong Miao, Min Dai

    Abstract: Accurate and timely detection of plant stress is essential for yield protection, allowing better-targeted intervention strategies. Recent advances in remote sensing and deep learning have shown great potential for rapid non-invasive detection of plant stress in a fully automated and reproducible manner. However, the existing models always face several challenges: 1) computational inefficiency and… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 24 pages

  34. arXiv:2306.14687  [pdf, other

    eess.IV cs.CV

    GSMorph: Gradient Surgery for cine-MRI Cardiac Deformable Registration

    Authors: Haoran Dou, Ning Bi, Luyi Han, Yuhao Huang, Ritse Mann, Xin Yang, Dong Ni, Nishant Ravikumar, Alejandro F. Frangi, Yunzhi Huang

    Abstract: Deep learning-based deformable registration methods have been widely investigated in diverse medical applications. Learning-based deformable registration relies on weighted objective functions trading off registration accuracy and smoothness of the deformation field. Therefore, they inevitably require tuning the hyperparameter for optimal registration performance. Tuning the hyperparameters is hig… ▽ More

    Submitted 20 July, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at MICCAI 2023

  35. arXiv:2305.12994  [pdf, ps, other

    eess.SP cs.IT

    Multistatic Integrated Sensing and Communication System in Cellular Networks

    Authors: Zixiang Han, Lincong Han, Xiaozhou Zhang, Yajuan Wang, Liang Ma, Mengting Lou, Jing Jin, Guangyi Liu

    Abstract: A novel multistatic multiple-input multiple-output (MIMO) integrated sensing and communication (ISAC) system in cellular networks is proposed. It can make use of widespread base stations (BSs) to perform cooperative sensing in wide area. This system is important since the deployment of sensing function can be achieved based on the existing mobile communication networks at a low cost. In this syste… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  36. Segment Anything in Medical Images

    Authors: Jun Ma, Yuting He, Feifei Li, Lin Han, Chenyu You, Bo Wang

    Abstract: Medical image segmentation is a critical component in clinical practice, facilitating accurate diagnosis, treatment planning, and disease monitoring. However, existing methods, often tailored to specific modalities or disease types, lack generalizability across the diverse spectrum of medical image segmentation tasks. Here we present MedSAM, a foundation model designed for bridging this gap by ena… ▽ More

    Submitted 1 April, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Journal ref: Nature Communications 15, 654 (2024)

  37. arXiv:2302.10601  [pdf, other

    cs.CR cs.AI eess.SY

    Few-shot Detection of Anomalies in Industrial Cyber-Physical System via Prototypical Network and Contrastive Learning

    Authors: Haili Sun, Yan Huang, Lansheng Han, Chunjie Zhou

    Abstract: The rapid development of Industry 4.0 has amplified the scope and destructiveness of industrial Cyber-Physical System (CPS) by network attacks. Anomaly detection techniques are employed to identify these attacks and guarantee the normal operation of industrial CPS. However, it is still a challenging problem to cope with scenarios with few labeled samples. In this paper, we propose a few-shot anoma… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 10 pages, 7 figures, under review

  38. arXiv:2302.08967  [pdf, other

    eess.IV cs.CV cs.LG

    sMRI-PatchNet: A novel explainable patch-based deep learning network for Alzheimer's disease diagnosis and discriminative atrophy localisation with Structural MRI

    Authors: Xin Zhang, Liangxiu Han, Lianghao Han, Haoming Chen, Darren Dancey, Daoqiang Zhang

    Abstract: Structural magnetic resonance imaging (sMRI) can identify subtle brain changes due to its high contrast for soft tissues and high spatial resolution. It has been widely used in diagnosing neurological brain diseases, such as Alzheimer disease (AD). However, the size of 3D high-resolution data poses a significant challenge for data analysis and processing. Since only a few areas of the brain show s… ▽ More

    Submitted 19 February, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  39. arXiv:2302.01788  [pdf, other

    eess.IV cs.CV

    IMPORTANT-Net: Integrated MRI Multi-Parameter Reinforcement Fusion Generator with Attention Network for Synthesizing Absent Data

    Authors: Tianyu Zhang, Tao Tan, Luyi Han, Xin Wang, Yuan Gao, Jonas Teuwen, Regina Beets-Tan, Ritse Mann

    Abstract: Magnetic resonance imaging (MRI) is highly sensitive for lesion detection in the breasts. Sequences obtained with different settings can capture the specific characteristics of lesions. Such multi-parameter MRI information has been shown to improve radiologist performance in lesion classification, as well as improving the performance of artificial intelligence models in various tasks. However, obt… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

  40. arXiv:2302.00517  [pdf, other

    cs.CV eess.IV

    Synthesis-based Imaging-Differentiation Representation Learning for Multi-Sequence 3D/4D MRI

    Authors: Luyi Han, Tao Tan, Tianyu Zhang, Yunzhi Huang, Xin Wang, Yuan Gao, Jonas Teuwen, Ritse Mann

    Abstract: Multi-sequence MRIs can be necessary for reliable diagnosis in clinical practice due to the complimentary information within sequences. However, redundant information exists across sequences, which interferes with mining efficient representations by modern machine learning or deep learning models. To handle various clinical scenarios, we propose a sequence-to-sequence generation framework (Seq2Seq… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  41. arXiv:2208.04825  [pdf, other

    eess.IV cs.CV

    Longitudinal Prediction of Postnatal Brain Magnetic Resonance Images via a Metamorphic Generative Adversarial Network

    Authors: Yunzhi Huang, Sahar Ahmad, Luyi Han, Shuai Wang, Zhengwang Wu, Weili Lin, Gang Li, Li Wang, Pew-Thian Yap

    Abstract: Missing scans are inevitable in longitudinal studies due to either subject dropouts or failed scans. In this paper, we propose a deep learning framework to predict missing scans from acquired scans, catering to longitudinal infant studies. Prediction of infant brain MRI is challenging owing to the rapid contrast and structural changes particularly during the first year of life. We introduce a trus… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

  42. arXiv:2206.15254  [pdf, other

    eess.IV cs.CV

    Localizing the Recurrent Laryngeal Nerve via Ultrasound with a Bayesian Shape Framework

    Authors: Haoran Dou, Luyi Han, Yushuang He, Jun Xu, Nishant Ravikumar, Ritse Mann, Alejandro F. Frangi, Pew-Thian Yap, Yunzhi Huang

    Abstract: Tumor infiltration of the recurrent laryngeal nerve (RLN) is a contraindication for robotic thyroidectomy and can be difficult to detect via standard laryngoscopy. Ultrasound (US) is a viable alternative for RLN detection due to its safety and ability to provide real-time feedback. However, the tininess of the RLN, with a diameter typically less than 3mm, poses significant challenges to the accura… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Early Accepted by MICCAI 2022

  43. arXiv:2205.15001  [pdf, ps, other

    eess.SP

    Abnormal Signal Recognition with Time-Frequency Spectrogram: A Deep Learning Approach

    Authors: Tingyan Kuang, Huichao Chen, Lu Han, Rong He, Wei Wang, Guoru Ding

    Abstract: With the increasingly complex and changeable electromagnetic environment, wireless communication systems are facing jamming and abnormal signal injection, which significantly affects the normal operation of a communication system. In particular, the abnormal signals may emulate the normal signals, which makes it very challenging for abnormal signal recognition. In this paper, we propose a new abno… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted by China Communications on August 30, 2021

  44. arXiv:2204.11547  [pdf, other

    cs.IT eess.SP

    Coupling Matrix-based Beamforming for Superdirective Antenna Arrays

    Authors: Liangcheng Han, Haifan Yin, Thomas L. Marzetta

    Abstract: In most multiple-input multiple-output (MIMO) communication systems, e.g., Massive MIMO, the antenna spacing is generally no less than half a wavelength. It helps to reduce the mutual coupling and therefore facilitate the system design. The maximum array gain is the number of antennas in this settings. However, when the antenna spacing is made very small, the array gain of a compact array can be p… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  45. Joint DoA-Range Estimation Using Space-Frequency Virtual Difference Coarray

    Authors: Zihuan Mao, Shengheng Liu, Yimin D. Zhang, Leixin Han, Yongming Huang

    Abstract: In this paper, we address the problem of joint direction-of-arrival (DoA) and range estimation using frequency diverse coprime array (FDCA). By incorporating the coprime array structure and coprime frequency offsets, a two-dimensional space-frequency virtual difference coarray corresponding to uniform array and uniform frequency offset is considered to increase the number of degrees-of-freedom (Do… ▽ More

    Submitted 4 May, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

  46. arXiv:2203.10726  [pdf, other

    eess.IV cs.CV

    TransFusion: Multi-view Divergent Fusion for Medical Image Segmentation with Transformers

    Authors: Di Liu, Yunhe Gao, Qilong Zhangli, Ligong Han, Xiaoxiao He, Zhaoyang Xia, Song Wen, Qi Chang, Zhennan Yan, Mu Zhou, Dimitris Metaxas

    Abstract: Combining information from multi-view images is crucial to improve the performance and robustness of automated methods for disease diagnosis. However, due to the non-alignment characteristics of multi-view images, building correlation and data fusion across views largely remain an open problem. In this study, we present TransFusion, a Transformer-based architecture to merge divergent multi-view im… ▽ More

    Submitted 5 September, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  47. arXiv:2201.10792  [pdf, other

    cs.CL cs.SD eess.AS

    On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR

    Authors: Zhao Yang, Dianwen Ng, Xiao Fu, Liping Han, Wei Xi, Rui Wang, Rui Jiang, Jizhong Zhao

    Abstract: End-to-end automatic speech recognition (ASR) has achieved promising results. However, most existing end-to-end ASR methods neglect the use of specific language characteristics. For Mandarin Chinese ASR tasks, there exist mutual promotion relationship between Pinyin and Character where Chinese characters can be romanized by Pinyin. Based on the above intuition, we first investigate types of end-to… ▽ More

    Submitted 30 March, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: submitted to INTERSPEECH 2022

  48. arXiv:2112.04489  [pdf, other

    eess.IV cs.CV

    Learn2Reg: comprehensive multi-task medical image registration challenge, dataset and evaluation in the era of deep learning

    Authors: Alessa Hering, Lasse Hansen, Tony C. W. Mok, Albert C. S. Chung, Hanna Siebert, Stephanie Häger, Annkristin Lange, Sven Kuckertz, Stefan Heldmann, Wei Shao, Sulaiman Vesal, Mirabela Rusu, Geoffrey Sonn, Théo Estienne, Maria Vakalopoulou, Luyi Han, Yunzhi Huang, Pew-Thian Yap, Mikael Brudfors, Yaël Balbastre, Samuel Joutard, Marc Modat, Gal Lifshitz, Dan Raviv, Jinxin Lv , et al. (28 additional authors not shown)

    Abstract: Image registration is a fundamental medical image analysis task, and a wide variety of approaches have been proposed. However, only a few studies have comprehensively compared medical image registration approaches on a wide range of clinically relevant tasks. This limits the development of registration methods, the adoption of research advances into practice, and a fair benchmark across competing… ▽ More

    Submitted 7 October, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  49. A Latent Encoder Coupled Generative Adversarial Network (LE-GAN) for Efficient Hyperspectral Image Super-resolution

    Authors: Yue Shi, Liangxiu Han, Lianghao Han, Sheng Chang, Tongle Hu, Darren Dancey

    Abstract: Realistic hyperspectral image (HSI) super-resolution (SR) techniques aim to generate a high-resolution (HR) HSI with higher spectral and spatial fidelity from its low-resolution (LR) counterpart. The generative adversarial network (GAN) has proven to be an effective deep learning framework for image super-resolution. However, the optimisation process of existing GAN-based models frequently suffers… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 18 pages, 10 figures

  50. arXiv:2110.15327  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    MEGAN: Memory Enhanced Graph Attention Network for Space-Time Video Super-Resolution

    Authors: Chenyu You, Lianyi Han, Aosong Feng, Ruihan Zhao, Hui Tang, Wei Fan

    Abstract: Space-time video super-resolution (STVSR) aims to construct a high space-time resolution video sequence from the corresponding low-frame-rate, low-resolution video sequence. Inspired by the recent success to consider spatial-temporal information for space-time super-resolution, our main goal in this work is to take full considerations of spatial and temporal correlations within the video sequences… ▽ More

    Submitted 29 November, 2021; v1 submitted 28 October, 2021; originally announced October 2021.