Showing 1–50 of 131 results for author: Guo, X

Searching in archive eess.
  1. arXiv:2506.23353  [pdf, ps, other]

    cs.CV eess.IV

    Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement

    Authors: Siyuan Chai, Xiaodong Guo, Tong Liu

    Abstract: Infrared image helps improve the perception capabilities of autonomous driving in complex weather conditions such as fog, rain, and low light. However, infrared image often suffers from low contrast, especially in non-heat-emitting targets like bicycles, which significantly affects the performance of downstream high-level vision tasks. Furthermore, achieving contrast enhancement without amplifying… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

  2. arXiv:2506.16304  [pdf, ps, other]

    eess.SP

    A Tractable Approach to Massive Communication and Ubiquitous Connectivity in 6G Standardization

    Authors: Junyi Jiang, Wei Chen, Xin Guo, Shenghui Song, Ying Jun Zhang, Zhu Han, Merouane Debbah, Khaled B. Letaief

    Abstract: The full-scale 6G standardization has attracted considerable recent attention, especially since the first 3GPP-wide 6G workshop held in March 2025. To understand the practical and fundamental values of 6G and facilitate its standardization, it is crucial to explore the theoretical limits of spectrum, energy, and coverage efficiency considering practical hardware and signaling constraints. In this… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  3. arXiv:2506.13317  [pdf, ps, other]

    cs.IT eess.SP

    A Contemporary Survey on Fluid Antenna Systems: Fundamentals and Networking Perspectives

    Authors: Hanjiang Hong, Kai-Kit Wong, Hao Xu, Xinghao Guo, Farshad Rostami Ghadi, Yu Chen, Yin Xu, Chan-Byoung Chae, Baiyang Liu, Kin-Fai Tong, Yangyang Zhang

    Abstract: The explosive growth of teletraffic, fueled by the convergence of cyber-physical systems and data-intensive applications, such as the Internet of Things (IoT), autonomous systems, and immersive communications, demands a multidisciplinary suite of innovative solutions across the physical and network layers. Fluid antenna systems (FAS) represent a transformative advancement in antenna design, offeri… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  4. arXiv:2506.07362  [pdf, ps, other]

    cs.IT eess.SP

    Fluid Antenna-Empowered Receive Spatial Modulation

    Authors: Xinghao Guo, Yin Xu, Dazhi He, Cixiao Zhang, Hanjiang Hong, Kai-Kit Wong, Chan-Byoung Chae, Wenjun Zhang, Yiyan Wu

    Abstract: Fluid antenna (FA), as an emerging antenna technology, fully exploits spatial diversity. This paper integrates FA with the receive spatial modulation (RSM) scheme and proposes a novel FA-empowered RSM (FA-RSM) system. In this system, the transmitter is equipped with an FA that simultaneously activates multiple ports to transmit precoded signals. We address three key challenges in the FA-RSM system… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: 12 pages, submitted to IEEE Journal

  5. arXiv:2505.14717  [pdf, ps, other]

    eess.IV cs.AI cs.CV cs.LG

    Aneumo: A Large-Scale Multimodal Aneurysm Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks

    Authors: Xigui Li, Yuanye Zhou, Feiyang Xiao, Xin Guo, Chen Jiang, Tan Pan, Xingmeng Zhang, Cenyu Liu, Zeyun Miao, Jianchao Ge, Xiansheng Wang, Qimeng Wang, Yichi Zhang, Wenbo Zhang, Fengping Zhu, Limei Han, Yuan Qi, Chensen Lin, Yuan Cheng

    Abstract: Intracranial aneurysms (IAs) are serious cerebrovascular lesions found in approximately 5% of the general population. Their rupture may lead to high mortality. Current methods for assessing IA risk focus on morphological and patient-specific factors, but the hemodynamic influences on IA development and rupture remain unclear. While accurate for hemodynamic studies, conventional computational flui… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  6. arXiv:2505.13818  [pdf, ps, other]

    eess.SP

    RainfalLTE: A Zero-effect Rainfall Sensing System Utilizing Existing LTE Infrastructure

    Authors: Xianbin Jiang, Fei Shang, Haohua Du, Panlong Yang, Xing Guo, Lihong Liang, Yuanting Zhang, Xiang-Yang Li

    Abstract: Environmental sensing is an important research topic in the integrated sensing and communication (ISAC) system. Current works often focus on static environments, such as buildings and terrains. However, dynamic factors like rainfall can cause serious interference to wireless signals. In this paper, we propose a system called RainfalLTE that utilizes the downlink signal of LTE base stations for dev… ▽ More

    Submitted 25 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

  7. arXiv:2505.07687  [pdf, ps, other]

    eess.IV cs.CV

    ABS-Mamba: SAM2-Driven Bidirectional Spiral Mamba Network for Medical Image Translation

    Authors: Feng Yuan, Yifan Gao, Wenbin Wu, Keqing Wu, Xiaotong Guo, Jie Jiang, Xin Gao

    Abstract: Accurate multi-modal medical image translation requires harmonizing global anatomical semantics and local structural fidelity, a challenge complicated by intermodality information loss and structural distortion. We propose ABS-Mamba, a novel architecture integrating the Segment Anything Model 2 (SAM2) for organ-aware semantic representation, specialized convolutional neural networks (CNNs) for pr… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: MICCAI 2025 (under review)

  8. arXiv:2505.07449  [pdf, ps, other]

    eess.IV cs.CV

    Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model

    Authors: Wei Li, Ming Hu, Guoan Wang, Lihao Liu, Kaijin Zhou, Junzhi Ning, Xin Guo, Zongyuan Ge, Lixu Gu, Junjun He

    Abstract: In ophthalmic surgery, developing an AI system capable of interpreting surgical videos and predicting subsequent operations requires numerous ophthalmic surgical videos with high-quality annotations, which are difficult to collect due to privacy concerns and labor consumption. Text-guided video generation (T2V) emerges as a promising solution to overcome this issue by generating ophthalmic surgica… ▽ More

    Submitted 26 June, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

    Comments: Early accepted in MICCAI25

  9. arXiv:2504.02628  [pdf, ps, other]

    eess.IV cs.CV

    Towards Computation- and Communication-efficient Computational Pathology

    Authors: Chu Han, Bingchao Zhao, Jiatai Lin, Shanshan Lyu, Longfei Wang, Tianpeng Deng, Cheng Lu, Changhong Liang, Hannah Y. Wen, Xiaojing Guo, Zhenwei Shi, Zaiyi Liu

    Abstract: Despite the impressive performance across a wide range of applications, current computational pathology models face significant diagnostic efficiency challenges due to their reliance on high-magnification whole-slide image analysis. This limitation severely compromises their clinical utility, especially in time-sensitive diagnostic scenarios and situations requiring efficient data transfer. To add… ▽ More

    Submitted 3 June, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

  10. arXiv:2503.22202  [pdf, other]

    eess.SP

    mmHRR: Monitoring Heart Rate Recovery with Millimeter Wave Radar

    Authors: Ziheng Mao, Yuan He, Jia Zhang, Yimiao Sun, Yadong Xie, Xiuzhen Guo

    Abstract: Heart rate recovery (HRR) within the initial minute following exercise is a widely utilized metric for assessing cardiac autonomic function in individuals and predicting mortality risk in patients with cardiovascular disease. However, prevailing solutions for HRR monitoring typically involve the use of specialized medical equipment or contact wearable sensors, resulting in high costs and poor user… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  11. arXiv:2503.21165  [pdf, other]

    eess.SY cs.AR

    Extending Silicon Lifetime: A Review of Design Techniques for Reliable Integrated Circuits

    Authors: Shaik Jani Babu, Fan Hu, Linyu Zhu, Sonal Singhal, Xinfei Guo

    Abstract: Reliability has become an increasing concern in modern computing. Integrated circuits (ICs) are the backbone of modern computing devices across industries, including artificial intelligence (AI), consumer electronics, healthcare, automotive, industrial, and aerospace. Moore's Law has driven the semiconductor IC industry toward smaller dimensions, improved performance, and greater energy efficiency.… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: This work is under review by ACM

  12. arXiv:2503.10287  [pdf, other]

    cs.SD cs.CV cs.GR eess.AS

    MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment

    Authors: Hao Zhou, Xiaobao Guo, Yuzhe Zhu, Adams Wai-Kin Kong

    Abstract: Propelled by the breakthrough in deep generative models, audio-to-image generation has emerged as a pivotal cross-model task that converts complex auditory signals into rich visual representations. However, previous works only focus on single-source audio inputs for image generation, ignoring the multi-source characteristic in natural auditory scenes, thus limiting the performance in generating co… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

  13. arXiv:2503.03971  [pdf, other]

    eess.IV

    Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge

    Authors: Fanwen Wang, Zi Wang, Yan Li, Jun Lyu, Chen Qin, Shuo Wang, Kunyuan Guo, Mengting Sun, Mingkai Huang, Haoyu Zhang, Michael Tänzer, Qirong Li, Xinran Chen, Jiahao Huang, Yinzhe Wu, Kian Anvari Hamedani, Yuntong Lyu, Longyu Sun, Qing Li, Ziqiang Xu, Bingyu Xin, Dimitris N. Metaxas, Narges Razizadeh, Shahabedin Nabavi, George Yiasemis, et al. (34 additional authors not shown)

    Abstract: Cardiovascular magnetic resonance (CMR) imaging offers diverse contrasts for non-invasive assessment of cardiac function and myocardial characterization. However, CMR often requires the acquisition of many contrasts, and each contrast takes a considerable amount of time. The extended acquisition time will further increase the susceptibility to motion artifacts. Existing deep learning-based reconst… ▽ More

    Submitted 13 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

  14. arXiv:2503.02410  [pdf, ps, other]

    eess.IV cs.CV

    Neuroverse3D: Developing In-Context Learning Universal Model for Neuroimaging in 3D

    Authors: Jiesi Hu, Chenfei Ye, Yanwu Yang, Xutao Guo, Yang Shang, Pengcheng Shi, Hanyang Peng, Ting Ma

    Abstract: In-context learning (ICL), a type of universal model, demonstrates exceptional generalization across a wide range of tasks without retraining by leveraging task-specific guidance from context, making it particularly effective for the intricate demands of neuroimaging. However, current ICL models, limited to 2D inputs and thus exhibiting suboptimal performance, struggle to extend to 3D inputs due t… ▽ More

    Submitted 4 July, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

  15. arXiv:2501.17884  [pdf]

    eess.SP cs.RO

    Ranging Performance Analysis in Automotive DToF Lidars

    Authors: Xiao Guo

    Abstract: In recent years, achieving full autonomy in driving has emerged as a paramount objective for both the industry and academia. Among various perception technologies, Lidar (Light detection and ranging) stands out for its high-precision and high-resolution capabilities based on the principle of light propagation and coupling ranging module and imaging module. Lidar is a sophisticated system that inte… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

  16. arXiv:2501.14970  [pdf, other]

    eess.SP cs.AI cs.LG

    AI-driven Wireless Positioning: Fundamentals, Standards, State-of-the-art, and Challenges

    Authors: Guangjin Pan, Yuan Gao, Yilin Gao, Zhiyong Zhong, Xiaoyu Yang, Xinyu Guo, Shugong Xu

    Abstract: Wireless positioning technologies hold significant value for applications in autonomous driving, extended reality (XR), unmanned aerial vehicles (UAVs), and more. With the advancement of artificial intelligence (AI), leveraging AI to enhance positioning accuracy and robustness has emerged as a field full of potential. Driven by the requirements and functionalities defined in the 3rd Generation Par… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 32 pages. This work has been submitted to the IEEE for possible publication

  17. arXiv:2501.11842  [pdf, other]

    cs.IT eess.SP

    Harnessing Rydberg Atomic Receivers: From Quantum Physics to Wireless Communications

    Authors: Yuanbin Chen, Xufeng Guo, Chau Yuen, Yufei Zhao, Yong Liang Guan, Chong Meng Samson See, Merouane Débbah, Lajos Hanzo

    Abstract: The intrinsic integration of Rydberg atomic receivers into wireless communication systems is proposed, by harnessing the principles of quantum physics in wireless communications. More particularly, we conceive a pair of Rydberg atomic receivers, one incorporates a local oscillator (LO), referred to as an LO-dressed receiver, while the other operates without an LO and is termed an LO-free receiver.… ▽ More

    Submitted 20 January, 2025; originally announced January 2025.

    Comments: This manuscript has been submitted to IEEE journal, with 13 pages of body and 2 pages of supplementary material

  18. arXiv:2412.18459  [pdf, other]

    cs.CV eess.IV

    Underwater Image Restoration via Polymorphic Large Kernel CNNs

    Authors: Xiaojiao Guo, Yihang Dong, Xuhang Chen, Weiwen Chen, Zimeng Li, FuChen Zheng, Chi-Man Pun

    Abstract: Underwater Image Restoration (UIR) remains a challenging task in computer vision due to the complex degradation of images in underwater environments. While recent approaches have leveraged various deep learning techniques, including Transformers and complex, parameter-heavy models to achieve significant improvements in restoration effects, we demonstrate that pure CNN architectures with lightweigh… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

    Comments: Accepted by ICASSP2025

  19. arXiv:2412.16573  [pdf, other]

    eess.IV physics.med-ph

    A Generalizable 3D Diffusion Framework for Low-Dose and Few-View Cardiac SPECT

    Authors: Huidong Xie, Weijie Gan, Wei Ji, Xiongchao Chen, Alaa Alashi, Stephanie L. Thorn, Bo Zhou, Qiong Liu, Menghua Xia, Xueqi Guo, Yi-Hwa Liu, Hongyu An, Ulugbek S. Kamilov, Ge Wang, Albert J. Sinusas, Chi Liu

    Abstract: Myocardial perfusion imaging using SPECT is widely utilized to diagnose coronary artery diseases, but image quality can be negatively affected in low-dose and few-view acquisition settings. Although various deep learning methods have been introduced to improve image quality from low-dose or few-view SPECT data, previous approaches often fail to generalize across different acquisition settings, lim… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

    Comments: 13 pages, 6 figures, 2 tables. Paper under review. Oral presentation at IEEE MIC 2024

  20. arXiv:2412.08671  [pdf, other]

    cs.CV cs.LG eess.IV

    A Deep Semantic Segmentation Network with Semantic and Contextual Refinements

    Authors: Zhiyan Wang, Deyin Liu, Lin Yuanbo Wu, Song Wang, Xin Guo, Lin Qi

    Abstract: Semantic segmentation is a fundamental task in multimedia processing, which can be used for analyzing, understanding, editing contents of images and videos, among others. To accelerate the analysis of multimedia data, existing segmentation researches tend to extract semantic information by progressively reducing the spatial resolutions of feature maps. However, this approach introduces a misalignm… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted by TMM

  21. arXiv:2412.08670  [pdf, other]

    cs.CV cs.LG eess.IV

    A feature refinement module for light-weight semantic segmentation network

    Authors: Zhiyan Wang, Xin Guo, Song Wang, Peixiao Zheng, Lin Qi

    Abstract: Low computational complexity and high segmentation accuracy are both essential to the real-world semantic segmentation tasks. However, to speed up the model inference, most existing approaches tend to design light-weight networks with a very limited number of parameters, leading to a considerable degradation in accuracy due to the decrease of the representation ability of the networks. To solve th… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted by ICIP 2023

  22. arXiv:2412.04877  [pdf, other]

    cs.IT eess.SP

    Fluid Antenna Index Modulation for MIMO Systems: Robust Transmission and Low-Complexity Detection

    Authors: Xinghao Guo, Yin Xu, Dazhi He, Cixiao Zhang, Hanjiang Hong, Kai-Kit Wong, Wenjun Zhang, Yiyan Wu

    Abstract: The fluid antenna (FA) index modulation (IM)-enabled multiple-input multiple-output (MIMO) system, referred to as FA-IM, significantly enhances spectral efficiency (SE) compared to the conventional FA-assisted MIMO system. To improve robustness against the high spatial correlation among multiple activated ports of the fluid antenna, this paper proposes an innovative FA grouping-based IM (FAG-IM) s… ▽ More

    Submitted 30 December, 2024; v1 submitted 6 December, 2024; originally announced December 2024.

    Comments: Submitted to an IEEE journal

  23. arXiv:2411.08509  [pdf, other]

    cs.IT eess.SP

    Sum Rate Maximization for Movable Antenna-Aided Downlink RSMA Systems

    Authors: Cixiao Zhang, Size Peng, Yin Xu, Qingqing Wu, Xiaowu Ou, Xinghao Guo, Dazhi He, Wenjun Zhang

    Abstract: Rate splitting multiple access (RSMA) is regarded as a crucial and powerful physical layer (PHY) paradigm for next-generation communication systems. Particularly, users employ successive interference cancellation (SIC) to decode part of the interference while treating the remainder as noise. However, conventional RSMA systems rely on fixed-position antenna arrays, limiting their ability to fully e… ▽ More

    Submitted 14 November, 2024; v1 submitted 13 November, 2024; originally announced November 2024.

  24. arXiv:2410.19811  [pdf, other]

    eess.SY cs.AI cs.CL cs.LG math.OC

    ControlAgent: Automating Control System Design via Novel Integration of LLM Agents and Domain Expertise

    Authors: Xingang Guo, Darioush Keivan, Usman Syed, Lianhui Qin, Huan Zhang, Geir Dullerud, Peter Seiler, Bin Hu

    Abstract: Control system design is a crucial aspect of modern engineering with far-reaching applications across diverse sectors including aerospace, automotive systems, power grids, and robotics. Despite advances made by Large Language Models (LLMs) in various domains, their application in control system design remains limited due to the complexity and specificity of control theory. To bridge this gap, we i… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  25. arXiv:2409.19217  [pdf]

    eess.SP

    Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter

    Authors: Wei Wang, Chenyang Li, Zhaoxi Chen, Wenyu Zhang, Zetao Wang, Xi Guo, Jian Guan, Gang Li

    Abstract: Obstructive Sleep Apnea-Hypopnea Syndrome (OSAHS) is a sleep-related breathing disorder associated with significant morbidity and mortality worldwide. The gold standard for OSAHS diagnosis, polysomnography (PSG), faces challenges in popularization due to its high cost and complexity. Recently, radar has shown potential in detecting sleep apnea-hypopnea events (SAE) with the advantages of low cost… ▽ More

    Submitted 25 April, 2025; v1 submitted 27 September, 2024; originally announced September 2024.

  26. arXiv:2409.15816  [pdf, other]

    eess.SY

    Diffusion Models for Intelligent Transportation Systems: A Survey

    Authors: Mingxing Peng, Kehua Chen, Xusen Guo, Qiming Zhang, Hui Zhong, Meixin Zhu, Hai Yang

    Abstract: Intelligent Transportation Systems (ITS) are vital in modern traffic management and optimization, significantly enhancing traffic efficiency and safety. Recently, diffusion models have emerged as transformative tools for addressing complex challenges within ITS. In this paper, we present a comprehensive survey of diffusion models for ITS, covering both theoretical and practical aspects. First, we… ▽ More

    Submitted 8 May, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 7 figures

  27. arXiv:2409.13440  [pdf, other]

    eess.SP cs.AI cs.CR cs.LG

    Differentially Private Multimodal Laplacian Dropout (DP-MLD) for EEG Representative Learning

    Authors: Xiaowen Fu, Bingxin Wang, Xinzhou Guo, Guoqing Liu, Yang Xiang

    Abstract: Recently, multimodal electroencephalogram (EEG) learning has shown great promise in disease detection. At the same time, ensuring privacy in clinical studies has become increasingly crucial due to legal and ethical concerns. One widely adopted scheme for privacy protection is differential privacy (DP) because of its clear interpretation and ease of implementation. Although numerous methods have be… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  28. arXiv:2409.13283  [pdf, other]

    eess.SP cs.IT

    MIMO Precoding Exploiting Extra Degrees of Freedom (DoF) in the Wavenumber Domain

    Authors: Yuanbin Chen, Xufeng Guo, Tianqi Mao, Qingqing Wu, Zhaocheng Wang, Chau Yuen

    Abstract: In this paper, we propose an emerging wavenumber-domain precoding scheme to break the limitations of rank-1 channels that merely supports single-stream transmission, enabling simultaneous transmission of multiple data streams. The proposed wavenumber-domain precoding scheme also breaks the Rayleigh distance demarcation, regardless of the far-field and near-field contexts. Specifically, by characte… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: This paper has been accepted in 2024 IEEE Globecom Workshop

  29. arXiv:2409.11543  [pdf, other]

    eess.IV

    Noise-aware Dynamic Image Denoising and Positron Range Correction for Rubidium-82 Cardiac PET Imaging via Self-supervision

    Authors: Huidong Xie, Liang Guo, Alexandre Velo, Zhao Liu, Qiong Liu, Xueqi Guo, Bo Zhou, Xiongchao Chen, Yu-Jung Tsai, Tianshun Miao, Menghua Xia, Yi-Hwa Liu, Ian S. Armstrong, Ge Wang, Richard E. Carson, Albert J. Sinusas, Chi Liu

    Abstract: Rb-82 is a radioactive isotope widely used for cardiac PET imaging. Despite numerous benefits of 82-Rb, there are several factors that limit its image quality and quantitative accuracy. First, the short half-life of 82-Rb results in noisy dynamic frames. Low signal-to-noise ratio would result in inaccurate and biased image quantification. Noisy dynamic frames also lead to highly noisy parametric… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: 15 Pages, 10 Figures, 5 tables. Paper Under review. Oral Presentation at IEEE MIC 2023

  30. arXiv:2409.10958  [pdf, other]

    cs.MM cs.CR cs.CV eess.IV

    Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending

    Authors: Yongyang Pan, Xiaohong Liu, Siqi Luo, Yi Xin, Xiao Guo, Xiaoming Liu, Xiongkuo Min, Guangtao Zhai

    Abstract: Rapid advancements in multimodal large language models have enabled the creation of hyper-realistic images from textual descriptions. However, these advancements also raise significant concerns about unauthorized use, which hinders their broader distribution. Traditional watermarking methods often require complex integration or degrade image quality. To address these challenges, we introduce a nov… ▽ More

    Submitted 15 December, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

    Comments: 9 pages, 7 figures

  31. arXiv:2409.10123  [pdf, other]

    eess.SP cs.IT

    Wavenumber-Domain Near-Field Channel Estimation: Beyond the Fresnel Bound

    Authors: Xufeng Guo, Yuanbin Chen, Ying Wang, Zhaocheng Wang, Chau Yuen

    Abstract: In the near-field context, the Fresnel approximation is typically employed to mathematically represent solvable functions of spherical waves. However, these efforts may fail to take into account the significant increase in the lower limit of the Fresnel approximation, known as the Fresnel distance. The lower bound of the Fresnel approximation imposes a constraint that becomes more pronounced as th… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: This paper has been accepted by IEEE Globecom 2024

  32. arXiv:2408.09931  [pdf, other]

    eess.IV cs.CV

    Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation

    Authors: Qianhui Men, Xiaoqing Guo, Aris T. Papageorghiou, J. Alison Noble

    Abstract: 3D pose estimation from a 2D cross-sectional view enables healthcare professionals to navigate through the 3D space, and such techniques initiate automatic guidance in many image-guided radiology applications. In this work, we investigate how estimating 3D fetal pose from freehand 2D ultrasound scanning can guide a sonographer to locate a head standard plane. Fetal head pose is estimated by the pr… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Accepted by MICCAI2024

  33. arXiv:2408.05358  [pdf, other]

    eess.SP cs.CV cs.HC cs.LG

    GesturePrint: Enabling User Identification for mmWave-based Gesture Recognition Systems

    Authors: Lilin Xu, Keyi Wang, Chaojie Gu, Xiuzhen Guo, Shibo He, Jiming Chen

    Abstract: The millimeter-wave (mmWave) radar has been exploited for gesture recognition. However, existing mmWave-based gesture recognition methods cannot identify different users, which is important for ubiquitous gesture interaction in many applications. In this paper, we propose GesturePrint, which is the first to achieve gesture recognition and gesture-based user identification using a commodity mmWave… ▽ More

    Submitted 25 July, 2024; originally announced August 2024.

    Comments: Accepted to the 44th IEEE International Conference on Distributed Computing Systems (ICDCS 2024)

  34. arXiv:2408.05117  [pdf, other]

    eess.IV cs.AI cs.CV

    Beyond the Eye: A Relational Model for Early Dementia Detection Using Retinal OCTA Images

    Authors: Shouyue Liu, Ziyi Zhang, Yuanyuan Gu, Jinkui Hao, Yonghuai Liu, Huazhu Fu, Xinyu Guo, Hong Song, Shuting Zhang, Yitian Zhao

    Abstract: Early detection of dementia, such as Alzheimer's disease (AD) or mild cognitive impairment (MCI), is essential to enable timely intervention and potential treatment. Accurate detection of AD/MCI is challenging due to the high complexity, cost, and often invasive nature of current diagnostic techniques, which limit their suitability for large-scale population screening. Given the shared embryologic… ▽ More

    Submitted 12 March, 2025; v1 submitted 9 August, 2024; originally announced August 2024.

  35. arXiv:2407.14815  [pdf, ps, other]

    cs.IT eess.SP

    Unified Far-Field and Near-Field in Holographic MIMO: A Wavenumber-Domain Perspective

    Authors: Yuanbin Chen, Xufeng Guo, Gui Zhou, Shi Jin, Derrick Wing Kwan Ng, Zhaocheng Wang

    Abstract: This article conceives a unified representation for near-field and far-field holographic multiple-input multiple-output (HMIMO) channels, addressing a practical design dilemma: "Why does the angular-domain representation no longer function effectively?" To answer this question, we pivot from the angular domain to the wavenumber domain and present a succinct overview of its underlying philosophy. I… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: This article has been accepted for publication in IEEE Commag (7 pages, 5 figures)

  36. arXiv:2407.11651  [pdf, other]

    cs.IT eess.SP

    Fluid Antenna Grouping Index Modulation Design for MIMO Systems

    Authors: Xinghao Guo, Yin Xu, Dazhi He, Cixiao Zhang, Wenjun Zhang, Yi-yan Wu

    Abstract: Index modulation (IM) significantly enhances the spectral efficiency of fluid antennas (FAs) enabled multiple-input multiple-output (MIMO) systems, which is named FA-IM. However, due to the dense distribution of ports on the FA, the wireless channel exhibits a high spatial correlation, leading to severe performance degradation in the existing FA-IM-assisted MIMO systems. To tackle this issue, this… ▽ More

    Submitted 16 August, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: A longer version with more details will be submitted to an IEEE journal

  37. arXiv:2406.18985  [pdf, other]

    cs.IT eess.SP

    Exploiting Structured Sparsity in Near Field: From the Perspective of Decomposition

    Authors: Xufeng Guo, Yuanbin Chen, Ying Wang, Chau Yuen

    Abstract: The structured sparsity can be leveraged in traditional far-field channels, greatly facilitating efficient sparse channel recovery by compressing the complexity of overheads to the level of the scatterer number. However, when experiencing a fundamental shift from planar-wave-based far-field modeling to spherical-wave-based near-field modeling, whether these benefits persist in the near-field regim… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: This article has been accepted for publication in IEEE Commag

  38. arXiv:2406.14064  [pdf, ps, other]

    cs.IT eess.SP

    PAPR Reduction with Pre-chirp Selection for Affine Frequency Division Multiplexing

    Authors: Haozhi Yuan, Yin Xu, Xinghao Guo, Yao Ge, Tianyao Ma, Haoyang Li, Dazhi He, Wenjun Zhang

    Abstract: Affine frequency division multiplexing (AFDM) is a promising new multicarrier technique for high-mobility communications based on discrete affine Fourier transform (DAFT). By properly tuning two parameters in the DAFT module, the effective channel in the DAFT domain can completely circumvent path overlap, thereby constituting a full representation of delay-Doppler profile. However, AFDM has a cruc… ▽ More

    Submitted 20 December, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  39. arXiv:2406.08374  [pdf, other]

    cs.CV cs.AI eess.IV

    2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction

    Authors: Tianqi Chen, Jun Hou, Yinchi Zhou, Huidong Xie, Xiongchao Chen, Qiong Liu, Xueqi Guo, Menghua Xia, James S. Duncan, Chi Liu, Bo Zhou

    Abstract: Positron Emission Tomography (PET) is an important clinical imaging tool but inevitably introduces radiation hazards to patients and healthcare providers. Reducing the tracer injection dose and eliminating the CT acquisition for attenuation correction can reduce the overall radiation dose, but often results in PET with high noise and bias. Thus, it is desirable to develop 3D methods to translate t… ▽ More

    Submitted 15 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 15 pages, 7 figures

  40. arXiv:2406.02422  [pdf, other]

    eess.IV cs.CV cs.LG

    IterMask2: Iterative Unsupervised Anomaly Segmentation via Spatial and Frequency Masking for Brain Lesions in MRI

    Authors: Ziyun Liang, Xiaoqing Guo, J. Alison Noble, Konstantinos Kamnitsas

    Abstract: Unsupervised anomaly segmentation approaches to pathology segmentation train a model on images of healthy subjects, that they define as the 'normal' data distribution. At inference, they aim to segment any pathologies in new images as 'anomalies', as they exhibit patterns that deviate from those in 'normal' training data. Prevailing methods follow the 'corrupt-and-reconstruct' paradigm. They inten… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  41. arXiv:2406.02164  [pdf, other]

    cs.IT eess.SP

    Sparse Recovery for Holographic MIMO Channels: Leveraging the Clustered Sparsity

    Authors: Yuqing Guo, Xufeng Guo, Yuanbin Chen, Ying Wang

    Abstract: Envisioned as the next-generation transceiver technology, the holographic multiple-input-multiple-output (HMIMO) garners attention for its superior capabilities of fabricating electromagnetic (EM) waves. However, the densely packed antenna elements significantly increase the dimension of the HMIMO channel matrix, rendering traditional channel estimation methods inefficient. While the dimension cur…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: This manuscript has been submitted to IEEE journal, 5 pages, 3 figures

  42. arXiv:2405.19659  [pdf, other]

    cs.CV eess.IV

    CSANet: Channel Spatial Attention Network for Robust 3D Face Alignment and Reconstruction

    Authors: Yilin Liu, Xuezhou Guo, Xinqi Wang, Fangzhou Du

    Abstract: Our project proposes an end-to-end 3D face alignment and reconstruction network. The backbone of our model is built by Bottle-Neck structure via Depth-wise Separable Convolution. We integrate Coordinate Attention mechanism and Spatial Group-wise Enhancement to extract more representative features. For more stable training process and better convergence, we jointly use Wing loss and the Weighted Pa…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures

  43. arXiv:2405.12996  [pdf, ps, other]

    eess.IV

    Dose-aware Diffusion Model for 3D PET Image Denoising: Multi-institutional Validation with Reader Study and Real Low-dose Data

    Authors: Huidong Xie, Weijie Gan, Reimund Bayerlein, Bo Zhou, Ming-Kai Chen, Michal Kulon, Annemarie Boustani, Kuan-Yin Ko, Der-Shiun Wang, Benjamin A. Spencer, Wei Ji, Xiongchao Chen, Qiong Liu, Xueqi Guo, Menghua Xia, Yinchi Zhou, Hui Liu, Liang Guo, Hongyu An, Ulugbek S. Kamilov, Hanzhong Wang, Biao Li, Axel Rominger, Kuangyu Shi, Ge Wang, et al. (2 additional authors not shown)

    Abstract: Reducing scan times, radiation dose, and enhancing image quality for lower-performance scanners, are critical in low-dose PET imaging. Deep learning techniques have been investigated for PET image denoising. However, existing models have often resulted in compromised image quality when achieving low-count/low-dose PET and have limited generalizability to different image noise-levels, acquisition p…

    Submitted 16 June, 2025; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 18 Pages, 16 Figures, 5 Tables. Paper under review. First-place Freek J. Beekman Young Investigator Award at SNMMI 2024. Code available after paper publication. arXiv admin note: substantial text overlap with arXiv:2311.04248

  44. arXiv:2405.06995  [pdf, other]

    cs.SD cs.CV cs.MM eess.AS

    Benchmarking Cross-Domain Audio-Visual Deception Detection

    Authors: Xiaobao Guo, Zitong Yu, Nithish Muthuchamy Selvaraj, Bingquan Shen, Adams Wai-Kin Kong, Alex C. Kot

    Abstract: Automated deception detection is crucial for assisting humans in accurately assessing truthfulness and identifying deceptive behavior. Conventional contact-based techniques, like polygraph devices, rely on physiological signals to determine the authenticity of an individual's statements. Nevertheless, recent developments in automated deception detection have demonstrated that multimodal features d…

    Submitted 5 October, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: 12 pages

  45. arXiv:2404.09131  [pdf, other]

    eess.SP

    Design of Artificial Interference Signals for Covert Communication Aided by Multiple Friendly Nodes

    Authors: Xuyang Zhao, Wei Guo, Yongchao Wang

    Abstract: In this paper, we consider a scenario of covert communication aided by multiple friendly interference nodes. The objective is to conceal the legitimate communication link under the surveillance of a warden. The main content is as follows: first, we propose a novel strategy for generating artificial noise signals in the considered covert scenario. Then, we leverage the statistical information of ch…

    Submitted 9 May, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

  46. arXiv:2404.03869  [pdf, other]

    cs.LG cs.AI cs.MA cs.RO eess.SY

    Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration

    Authors: Xudong Guo, Daming Shi, Junjie Yu, Wenhui Fan

    Abstract: The emergence of multi-agent reinforcement learning (MARL) is significantly transforming various fields like autonomous vehicle networks. However, real-world multi-agent systems typically contain multiple roles, and the scale of these systems dynamically fluctuates. Consequently, in order to achieve zero-shot scalable collaboration, it is essential that strategies for different roles can be update…

    Submitted 2 October, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  47. arXiv:2403.19002  [pdf, other]

    cs.MM cs.CV cs.SD eess.AS

    Robust Active Speaker Detection in Noisy Environments

    Authors: Siva Sai Nagender Vasireddy, Chenxu Zhang, Xiaohu Guo, Yapeng Tian

    Abstract: This paper addresses the issue of active speaker detection (ASD) in noisy environments and formulates a robust active speaker detection (rASD) problem. Existing ASD approaches leverage both audio and visual modalities, but non-speech sounds in the surrounding environment can negatively impact performance. To overcome this, we propose a novel framework that utilizes audio-visual speech separation a…

    Submitted 30 March, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: 15 pages, 5 figures

  48. arXiv:2403.11071  [pdf, other]

    eess.SP cs.IT

    Wavenumber Domain Sparse Channel Estimation in Holographic MIMO

    Authors: Xufeng Guo, Yuanbin Chen, Ying Wang, Zhaocheng Wang, Zhu Han

    Abstract: In this paper, we investigate the sparse channel estimation in holographic multiple-input multiple-output (HMIMO) systems. The conventional angular-domain representation fails to capture the continuous angular power spectrum characterized by the spatially-stationary electromagnetic random field, thus leading to the ambiguous detection of the significant angular power, which is referred to as the p…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted in 2024 ICC

  49. arXiv:2402.09567  [pdf, other]

    eess.IV cs.CV

    TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction

    Authors: Xueqi Guo, Luyao Shi, Xiongchao Chen, Qiong Liu, Bo Zhou, Huidong Xie, Yi-Hwa Liu, Richard Palyo, Edward J. Miller, Albert J. Sinusas, Lawrence H. Staib, Bruce Spottiswoode, Chi Liu, Nicha C. Dvornek

    Abstract: Inter-frame motion in dynamic cardiac positron emission tomography (PET) using rubidium-82 (82-Rb) myocardial perfusion imaging impacts myocardial blood flow (MBF) quantification and the diagnosis accuracy of coronary artery diseases. However, the high cross-frame distribution variation due to rapid tracer kinetics poses a considerable challenge for inter-frame motion correction, especially for ea…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Under revision at Medical Image Analysis

  50. arXiv:2401.14285  [pdf, other]

    cs.CV cs.AI eess.IV

    POUR-Net: A Population-Prior-Aided Over-Under-Representation Network for Low-Count PET Attenuation Map Generation

    Authors: Bo Zhou, Jun Hou, Tianqi Chen, Yinchi Zhou, Xiongchao Chen, Huidong Xie, Qiong Liu, Xueqi Guo, Yu-Jung Tsai, Vladimir Y. Panin, Takuya Toyonaga, James S. Duncan, Chi Liu

    Abstract: Low-dose PET offers a valuable means of minimizing radiation exposure in PET imaging. However, the prevalent practice of employing additional CT scans for generating attenuation maps (u-map) for PET attenuation correction significantly elevates radiation doses. To address this concern and further mitigate radiation exposure in low-dose PET exams, we propose POUR-Net - an innovative population-prio…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 10 pages, 5 figures