Showing 1–50 of 223 results for author: Xu, S

Searching in archive eess.
  1. arXiv:2506.05569  [pdf, ps, other]

    cs.IT eess.SP

    Fluid Antenna System-Assisted Self-Interference Cancellation for In-Band Full Duplex Communications

    Authors: Hanjiang Hong, Kai-Kit Wong, Hao Xu, Yiyan Wu, Sai Xu, Chan-Byoung Chae, Baiyang Liu, Kin-Fai Tong

    Abstract: In-band full-duplex (IBFD) systems are expected to double the spectral efficiency compared to half-duplex systems, provided that loopback self-interference (SI) can be effectively suppressed. The inherent interference mitigation capabilities of the emerging fluid antenna system (FAS) technology make it a promising candidate for addressing the SI challenge in IBFD systems. This paper thus proposes…

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2506.02197  [pdf, ps, other]

    eess.IV cs.CV

    NTIRE 2025 Challenge on RAW Image Restoration and Super-Resolution

    Authors: Marcos V. Conde, Radu Timofte, Zihao Lu, Xiangyu Kong, Xiaoxia Xing, Fan Wang, Suejin Han, MinKyu Park, Tianyu Zhang, Xin Luo, Yeda Chen, Dong Liu, Li Pang, Yuhang Yang, Hongzhong Wang, Xiangyong Cao, Ruixuan Jiang, Senyan Xu, Siyuan Jiang, Xueyang Fu, Zheng-Jun Zha, Tianyu Hao, Yuhong He, Ruoqi Li, Yueqi Yang, et al. (14 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 RAW Image Restoration and Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Restoration and Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines; however, this problem is not as explored as in the RGB domain. The goal of this challenge is twofold: (i) restore RAW images with blur and…

    Submitted 4 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: CVPR 2025 - New Trends in Image Restoration and Enhancement (NTIRE)

  3. arXiv:2506.00045  [pdf, other]

    cs.SD eess.AS

    ACE-Step: A Step Towards Music Generation Foundation Model

    Authors: Junmin Gong, Sean Zhao, Sen Wang, Shengyuan Xu, Joe Guo

    Abstract: We introduce ACE-Step, a novel open-source foundation model for music generation that overcomes key limitations of existing approaches and achieves state-of-the-art performance through a holistic architectural design. Current methods face inherent trade-offs between generation speed, musical coherence, and controllability. For example, LLM-based models (e.g. Yue, SongGen) excel at lyric alignment…

    Submitted 28 May, 2025; originally announced June 2025.

    Comments: 14 pages, 5 figures, ace-step's tech report

  4. Self-supervised feature learning for cardiac Cine MR image reconstruction

    Authors: Siying Xu, Marcel Früh, Kerstin Hammernik, Andreas Lingg, Jens Kübler, Patrick Krumm, Daniel Rueckert, Sergios Gatidis, Thomas Küstner

    Abstract: We propose a self-supervised feature learning assisted reconstruction (SSFL-Recon) framework for MRI reconstruction to address the limitation of existing supervised learning methods. Although recent deep learning-based methods have shown promising performance in MRI reconstruction, most require fully-sampled images for supervised learning, which is challenging in practice considering long acquisit…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Accepted to IEEE Transactions on Medical Imaging (TMI), 2025

  5. arXiv:2505.21928  [pdf]

    eess.IV cs.AI cs.CV cs.LG

    Subspecialty-Specific Foundation Model for Intelligent Gastrointestinal Pathology

    Authors: Lianghui Zhu, Xitong Ling, Minxi Ouyang, Xiaoping Liu, Tian Guan, Mingxi Fu, Zhiqiang Cheng, Fanglei Fu, Maomao Zeng, Liming Liu, Song Duan, Qiang Huang, Ying Xiao, Jianming Li, Shanming Lu, Zhenghua Piao, Mingxi Zhu, Yibo Jin, Shan Xu, Qiming He, Yizhi Wang, Junru Cheng, Xuanyu Wang, Luxi Xie, Houqiang Li, et al. (2 additional authors not shown)

    Abstract: Gastrointestinal (GI) diseases represent a clinically significant burden, necessitating precise diagnostic approaches to optimize patient outcomes. Conventional histopathological diagnosis suffers from limited reproducibility and diagnostic variability. To overcome these limitations, we develop Digepath, a specialized foundation model for GI pathology. Our framework introduces a dual-phase iterati…

    Submitted 6 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  6. arXiv:2505.21767  [pdf, ps, other]

    eess.IV cs.LG eess.SP

    Beyond 1D: Vision Transformers and Multichannel Signal Images for PPG-to-ECG Reconstruction

    Authors: Xiaoyan Li, Shixin Xu, Faisal Habib, Arvind Gupta, Huaxiong Huang

    Abstract: Reconstructing ECG from PPG is a promising yet challenging task. While recent advancements in generative models have significantly improved ECG reconstruction, accurately capturing fine-grained waveform features remains a key challenge. To address this, we propose a novel PPG-to-ECG reconstruction method that leverages a Vision Transformer (ViT) as the core network. Unlike conventional approaches…

    Submitted 27 May, 2025; originally announced May 2025.

  7. arXiv:2505.20961  [pdf, ps, other]

    cs.SD cs.AI cs.LG eess.AS

    Efficient and Microphone-Fault-Tolerant 3D Sound Source Localization

    Authors: Yiyuan Yang, Shitong Xu, Niki Trigoni, Andrew Markham

    Abstract: Sound source localization (SSL) is a critical technology for determining the position of sound sources in complex environments. However, existing methods face challenges such as high computational costs and precise calibration requirements, limiting their deployment in dynamic or resource-constrained environments. This paper introduces a novel 3D SSL framework, which uses sparse cross-attention, p…

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Accepted by Interspeech 2025 Conference

  8. arXiv:2505.19048  [pdf, other]

    eess.SP

    Movable-Element STARS-Assisted Near-Field Wideband Communications

    Authors: Guangyu Zhu, Xidong Mu, Li Guo, Ao Huang, Shibiao Xu

    Abstract: A novel movable-element simultaneously transmitting and reflecting surface (ME-STARS)-assisted near-field wideband communication framework is proposed. In particular, the position of each STARS element can be adjusted to combat the significant wideband beam squint issue in the near field instead of using costly true-time delay components. Four practical ME-STARS element movement modes are proposed…

    Submitted 25 May, 2025; originally announced May 2025.

  9. arXiv:2505.17568  [pdf, ps, other]

    cs.CR cs.AI cs.SD eess.AS

    JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models

    Authors: Zifan Peng, Yule Liu, Zhen Sun, Mingchen Li, Zeren Luo, Jingyi Zheng, Wenhan Dong, Xinlei He, Xuechao Wang, Yingjie Xue, Shengmin Xu, Xinyi Huang

    Abstract: Audio Language Models (ALMs) have made significant progress recently. These models integrate the audio modality directly into the model, rather than converting speech into text and inputting text to Large Language Models (LLMs). While jailbreak attacks on LLMs have been extensively studied, the security of ALMs with audio modalities remains largely unexplored. Currently, there is a lack of an adve…

    Submitted 23 May, 2025; originally announced May 2025.

  10. arXiv:2505.12089  [pdf, ps, other]

    eess.IV cs.AI cs.CV

    NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

    Authors: Sangmin Lee, Eunpil Park, Angel Canelo, Hyunhee Park, Youngjo Kim, Hyung-Ju Chun, Xin Jin, Chongyi Li, Chun-Le Guo, Radu Timofte, Qi Wu, Tianheng Qiu, Yuchun Dong, Shenglin Ding, Guanghua Pan, Weiyu Zhou, Tao Hu, Yixu Feng, Duwei Dai, Yu Cao, Peng Wu, Wei Dong, Yanning Zhang, Qingsen Yan, Simon J. Larsen, et al. (11 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Efficient Burst HDR and Restoration Challenge, which aims to advance efficient multi-frame high dynamic range (HDR) and restoration techniques. The challenge is based on a novel RAW multi-frame fusion dataset, comprising nine noisy and misaligned RAW frames with various exposure levels per scene. Participants were tasked with developing solutions capable of effect…

    Submitted 17 May, 2025; originally announced May 2025.

  11. arXiv:2504.13670  [pdf, other]

    eess.SP

    Pinching-Antenna Systems (PASS)-enabled Secure Wireless Communications

    Authors: Guangyu Zhu, Xidong Mu, Li Guo, Shibiao Xu, Yuanwei Liu, Naofal Al-Dhahir

    Abstract: A novel pinching-antenna systems (PASS)-enabled secure wireless communication framework is proposed. By dynamically adjusting the positions of dielectric particles, namely pinching antennas (PAs), along the waveguides, PASS introduces a novel concept of pinching beamforming to enhance the performance of physical layer security. A fundamental PASS-enabled secure communication system is considered w…

    Submitted 14 May, 2025; v1 submitted 18 April, 2025; originally announced April 2025.

  12. arXiv:2504.09248  [pdf, ps, other]

    eess.SY cs.CR

    Asymptotic stabilization under homomorphic encryption: A re-encryption free method

    Authors: Shuai Feng, Qian Ma, Junsoo Kim, Shengyuan Xu

    Abstract: In this paper, we propose methods to encrypt a pre-given dynamic controller with homomorphic encryption, without re-encrypting the control inputs. We first present a preliminary result showing that the coefficients in a pre-given dynamic controller can be scaled up into integers by the zooming-in factor in dynamic quantization, without utilizing re-encryption. However, a sufficiently small zoomi…

    Submitted 12 April, 2025; originally announced April 2025.

  13. Optimal Sensor Placement Using Combinations of Hybrid Measurements for Source Localization

    Authors: Kang Tang, Sheng Xu, Yuqi Yang, He Kong, Yongsheng Ma

    Abstract: This paper focuses on static source localization employing different combinations of measurements, including time-difference-of-arrival (TDOA), received-signal-strength (RSS), angle-of-arrival (AOA), and time-of-arrival (TOA) measurements. Since sensor-source geometry significantly impacts localization accuracy, the strategies of optimal sensor placement are proposed systematically using combinati…

    Submitted 9 April, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

    Journal ref: IEEE Radar Conference 2024, Denver, CO, USA, pp. 1-6, 2024

  14. arXiv:2503.13252  [pdf, other]

    cs.RO eess.SP

    Digital Beamforming Enhanced Radar Odometry

    Authors: Jingqi Jiang, Shida Xu, Kaicheng Zhang, Jiyuan Wei, Jingyang Wang, Sen Wang

    Abstract: Radar has become an essential sensor for autonomous navigation, especially in challenging environments where camera and LiDAR sensors fail. 4D single-chip millimeter-wave radar systems, in particular, have drawn increasing attention thanks to their ability to provide spatial and Doppler information with low hardware cost and power consumption. However, most single-chip radar systems using traditio…

    Submitted 17 March, 2025; originally announced March 2025.

  15. arXiv:2503.00941  [pdf, other]

    eess.SP

    C2S-AE: CSI to Sensing enabled by an Auto-Encoder-based Framework

    Authors: Jun Jiang, Shugong Xu, Wenjun Yu, Yuan Gao

    Abstract: Next-generation mobile networks are set to utilize integrated sensing and communication (ISAC) as a critical technology, providing significant support for sectors like the industrial Internet of Things (IIoT), extended reality (XR), and smart home applications. A key challenge in ISAC implementation is the extraction of sensing parameters from radio signals, a task that conventional methods strugg…

    Submitted 2 March, 2025; originally announced March 2025.

  16. arXiv:2502.18766  [pdf, other]

    eess.SP

    MTCA: Multi-Task Channel Analysis for Wireless Communication

    Authors: Jun Jiang, Wenjun Yu, Yuan Gao, Shugong Xu

    Abstract: In modern wireless communication systems, the effective processing of Channel State Information (CSI) is crucial for enhancing communication quality and reliability. However, current methods often handle different tasks in isolation, thereby neglecting the synergies among various tasks and leading to inadequate extraction of CSI features for subsequent analysis. To address these limitations, this pape…

    Submitted 25 February, 2025; originally announced February 2025.

  17. arXiv:2502.17536  [pdf, other]

    eess.SP cs.LG

    CLEP-GAN: An Innovative Approach to Subject-Independent ECG Reconstruction from PPG Signals

    Authors: Xiaoyan Li, Shixin Xu, Faisal Habib, Neda Aminnejad, Arvind Gupta, Huaxiong Huang

    Abstract: This study addresses the challenge of reconstructing unseen ECG signals from PPG signals, a critical task for non-invasive cardiac monitoring. While numerous public ECG-PPG datasets are available, they lack the diversity seen in image datasets, and data collection processes often introduce noise, complicating ECG reconstruction from PPG even with advanced machine learning models. To tackle these c…

    Submitted 24 February, 2025; originally announced February 2025.

  18. arXiv:2502.16611  [pdf, ps, other]

    cs.SD cs.AI eess.AS

    Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments

    Authors: Shitong Xu, Yiyuan Yang, Niki Trigoni, Andrew Markham

    Abstract: Target speaker extraction focuses on isolating a specific speaker's voice from an audio mixture containing multiple speakers. To provide information about the target speaker's identity, prior works have utilized clean audio samples as conditioning inputs. However, such clean audio examples are not always readily available. For instance, obtaining a clean recording of a stranger's voice at a cockta…

    Submitted 17 June, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

    Comments: 11 pages, 6 figures

  19. arXiv:2502.11965  [pdf, other]

    eess.SP cs.AI

    A MIMO Wireless Channel Foundation Model via CIR-CSI Consistency

    Authors: Jun Jiang, Wenjun Yu, Yunfan Li, Yuan Gao, Shugong Xu

    Abstract: In the field of artificial intelligence, self-supervised learning has demonstrated superior generalization capabilities by leveraging large-scale unlabeled datasets for pretraining, which is especially critical for wireless communication models to adapt to a variety of scenarios. This paper innovatively treats Channel State Information (CSI) and Channel Impulse Response (CIR) as naturally aligned…

    Submitted 1 March, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: 6 pages, 2025 ICMLCN accepted

  20. arXiv:2502.07230  [pdf, ps, other]

    eess.SY

    Physics-Informed Recurrent Network for State-Space Modeling of Gas Pipeline Networks

    Authors: Siyuan Wang, Wenchuan Wu, Chenhui Lin, Qi Wang, Shuwei Xu, Binbin Chen

    Abstract: As a part of the integrated energy system (IES), gas pipeline networks can provide additional flexibility to power systems through coordinated optimal dispatch. An accurate pipeline network model is critical for the optimal operation and control of IESs. However, inaccuracies or unavailability of accurate pipeline parameters often introduce errors in the state-space models of such networks. This p…

    Submitted 19 June, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: 9 Pages

  21. arXiv:2502.01003  [pdf, other]

    eess.SP

    Near-Field Integrated Sensing and Communications for Secure UAV Networks

    Authors: Jingjing Zhao, Songtao Xue, Kaiquan Cai, Xidong Mu, Yuanwei Liu, Yanbo Zhu

    Abstract: A novel near-field integrated sensing and communications framework for secure unmanned aerial vehicle (UAV) networks with high time efficiency is proposed. A ground base station (GBS) with large aperture size communicates with one communication UAV (C-UAV) in the presence of one eavesdropping UAV (E-UAV), where artificial noise (AN) is employed for both jamming and sensing purposes. Given t…

    Submitted 2 February, 2025; originally announced February 2025.

  22. arXiv:2501.16023  [pdf, other]

    eess.SP

    TransPathNet: A Novel Two-Stage Framework for Indoor Radio Map Prediction

    Authors: Xin Li, Ran Liu, Saihua Xu, Sirajudeen Gulam Razul, Chau Yuen

    Abstract: Accurate indoor pathloss prediction is crucial for optimizing wireless communication in indoor settings, where diverse materials and complex electromagnetic interactions pose significant modeling challenges. This paper introduces TransPathNet, a novel two-stage deep learning framework that leverages transformer-based feature extraction and multiscale convolutional attention decoding to generate hi…

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: Accepted to ICASSP 2025

  23. arXiv:2501.14970  [pdf, other]

    eess.SP cs.AI cs.LG

    AI-driven Wireless Positioning: Fundamentals, Standards, State-of-the-art, and Challenges

    Authors: Guangjin Pan, Yuan Gao, Yilin Gao, Zhiyong Zhong, Xiaoyu Yang, Xinyu Guo, Shugong Xu

    Abstract: Wireless positioning technologies hold significant value for applications in autonomous driving, extended reality (XR), unmanned aerial vehicles (UAVs), and more. With the advancement of artificial intelligence (AI), leveraging AI to enhance positioning accuracy and robustness has emerged as a field full of potential. Driven by the requirements and functionalities defined in the 3rd Generation Par…

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 32 pages. This work has been submitted to the IEEE for possible publication

  24. arXiv:2501.02572  [pdf, other]

    cs.NI cs.AI eess.SY

    Energy Optimization of Multi-task DNN Inference in MEC-assisted XR Devices: A Lyapunov-Guided Reinforcement Learning Approach

    Authors: Yanzan Sun, Jiacheng Qiu, Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaoyun Wang, Shuangfeng Han

    Abstract: Extended reality (XR), blending virtual and real worlds, is a key application of future networks. While AI advancements enhance XR capabilities, they also impose significant computational and energy challenges on lightweight XR devices. In this paper, we developed a distributed queue model for multi-task DNN inference, addressing issues of resource competition and queue coupling. In response to th…

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: 13 pages, 7 figures. This work has been submitted to the IEEE for possible publication

  25. arXiv:2412.17268  [pdf, ps, other]

    cs.IT eess.SP

    STAR-RIS Assisted SWIPT Systems: Active or Passive?

    Authors: Guangyu Zhu, Xidong Mu, Li Guo, Ao Huang, Shibiao Xu

    Abstract: A simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted simultaneous wireless information and power transfer (SWIPT) system is investigated. Both active and passive STAR-RISs are considered. Passive STAR-RISs can be cost-efficiently fabricated to large aperture sizes with significant near-field regions, but the design flexibility is limited by the couple…

    Submitted 22 December, 2024; originally announced December 2024.

  26. arXiv:2412.12742  [pdf, other]

    eess.IV cs.AI cs.LG

    Subspace Implicit Neural Representations for Real-Time Cardiac Cine MR Imaging

    Authors: Wenqi Huang, Veronika Spieker, Siying Xu, Gastao Cruz, Claudia Prieto, Julia Schnabel, Kerstin Hammernik, Thomas Kuestner, Daniel Rueckert

    Abstract: Conventional cardiac cine MRI methods rely on retrospective gating, which limits temporal resolution and the ability to capture continuous cardiac dynamics, particularly in patients with arrhythmias and beat-to-beat variations. To address these challenges, we propose a reconstruction framework based on subspace implicit neural representations for real-time cardiac cine MRI of continuously sampled…

    Submitted 17 December, 2024; originally announced December 2024.

  27. arXiv:2412.07555  [pdf, other]

    eess.SP

    GSM: A GNN-based Space-MIMO Framework for Direct-to-Cell Communications

    Authors: Sai Xu, Yanan Du, Gaojie Chen, Rahim Tafazolli

    Abstract: This paper proposes a graph neural network (GNN)-based space multiple-input multiple-output (MIMO) framework, named GSM, for direct-to-cell communications, aiming to achieve distributed coordinated beamforming for low Earth orbit (LEO) satellites. Firstly, a system model for LEO multi-satellite communications is established, where multiple LEO satellites collaborate to perform distributed beamform…

    Submitted 10 December, 2024; originally announced December 2024.

  28. arXiv:2412.04201  [pdf, other]

    cs.CV eess.IV

    Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image

    Authors: Shuang Xu, Zixiang Zhao, Haowen Bai, Chang Yu, Jiangjun Peng, Xiangyong Cao, Deyu Meng

    Abstract: Hyperspectral images (HSIs) are frequently noisy and of low resolution due to the constraints of imaging devices. Recently launched satellites can concurrently acquire HSIs and panchromatic (PAN) images, enabling the restoration of HSIs to generate clean and high-resolution imagery through fusing PAN images for denoising and super-resolution. However, previous studies treated these two tasks as in…

    Submitted 5 December, 2024; originally announced December 2024.

  29. arXiv:2412.01035  [pdf, other]

    cs.LG eess.SY

    Adaptive Traffic Element-Based Streetlight Control Using Neighbor Discovery Algorithm Based on IoT Events

    Authors: Yupeng Tan, Sheng Xu, Chengyue Su

    Abstract: Intelligent streetlight systems divide the streetlight network into multiple sectors, activating only the streetlights in the corresponding sectors when traffic elements pass by, rather than all streetlights, effectively reducing energy waste. This strategy requires streetlights to understand their neighbor relationships to illuminate only the streetlights in their respective sectors. However, man…

    Submitted 1 December, 2024; originally announced December 2024.

  30. arXiv:2411.13305  [pdf, ps, other]

    eess.SP

    Mutual Information-oriented ISAC Beamforming Design for Large Dimensional Antenna Array

    Authors: Shanfeng Xu, Yanshuo Cheng, Siqiang Wang, Xinyi Wang, Zhong Zheng, Zesong Fei

    Abstract: Existing integrated sensing and communication (ISAC) beamforming designs were mostly developed under perfect instantaneous channel state information (CSI), limiting their use in practical dynamic environments. In this paper, we study the beamforming design for multiple-input multiple-output (MIMO) ISAC systems based on statistical CSI, with the weighted mutual information (MI) comprising sensing and…

    Submitted 17 June, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: 17 pages, 5 figures

  31. arXiv:2411.07603  [pdf, other]

    quant-ph eess.SY

    $\mathscr{H}_2$ Model Reduction for Linear Quantum Systems

    Authors: G. P. Wu, S. Xue, G. F. Zhang, I. R. Petersen

    Abstract: In this paper, an $\mathscr{H}_2$ norm-based model reduction method for linear quantum systems is presented, which can obtain a physically realizable model with a reduced order for closely approximating the original system. The model reduction problem is described as an optimization problem, whose objective is taken as an $\mathscr{H}_2$ norm of the difference between the transfer function of the…

    Submitted 19 November, 2024; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: 13 pages, 3 figures

  32. arXiv:2410.23154  [pdf, other]

    eess.IV cs.CV

    Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe

    Authors: Songyu Xu, Yicheng Hu, Jionglong Su, Daniel Elson, Baoru Huang

    Abstract: Purpose: Drop-in gamma probes are widely used in robotic-assisted minimally invasive surgery (RAMIS) for lymph node detection. However, these devices only provide audio feedback on signal intensity, lacking the visual feedback necessary for precise localisation. Previous work attempted to predict the sensing area location using laparoscopic images, but the prediction accuracy was unsatisfactory. I…

    Submitted 30 October, 2024; originally announced October 2024.

  33. arXiv:2410.21351  [pdf, other]

    cs.LG cs.AI cs.NI eess.SP

    LinFormer: A Linear-based Lightweight Transformer Architecture For Time-Aware MIMO Channel Prediction

    Authors: Yanliang Jin, Yifan Wu, Yuan Gao, Shunqing Zhang, Shugong Xu, Cheng-Xiang Wang

    Abstract: The emergence of 6th generation (6G) mobile networks brings new challenges in supporting high-mobility communications, particularly in addressing the issue of channel aging. Existing channel prediction methods offer improved accuracy at the expense of increased computational complexity, limiting their practical application in mobile networks. To address these challenges, we present LinFormer…

    Submitted 28 October, 2024; originally announced October 2024.

  34. arXiv:2410.19779  [pdf, other]

    eess.SP cs.LG

    EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training

    Authors: Tongtian Yue, Shuning Xue, Xuange Gao, Yepeng Tang, Longteng Guo, Jie Jiang, Jing Liu

    Abstract: Electroencephalogram (EEG) signals are pivotal in providing insights into spontaneous brain activity, highlighting their significant importance in neuroscience research. However, the exploration of versatile EEG models is constrained by diverse data formats, outdated pre-training paradigms, and limited transfer learning methods, leading only to specialist models on a single dataset. In this paper, w…

    Submitted 14 October, 2024; originally announced October 2024.

  35. arXiv:2410.12218  [pdf, other]

    eess.SP

    Exploring Dual-Sniffer Passive Localization: Algorithm Design and Experimental Results

    Authors: Tuo Wu, Lingyu Hou, Hong Niu, Saihua Xu, Sirajudeen Gulam Razul, Chau Yuen

    Abstract: In this paper, we explore a dual-sniffer passive localization system that detects the timing difference of signals from both commercial base station (eNb) and user equipment (UE) to the sniffers. We design two localization schemes for UE localization: a time of arrival (ToA) based scheme and a time difference of arrival (TDoA) based scheme. In the ToA-based scheme, we derive two ellipse equations…

    Submitted 16 October, 2024; originally announced October 2024.

  36. arXiv:2410.01330  [pdf, ps, other]

    cs.IT eess.SP

    Enhancing User Fairness in Wireless Powered Communication Networks with STAR-RIS

    Authors: Guangyu Zhu, Xidong Mu, Li Guo, Ao Huang, Shibiao Xu

    Abstract: A simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted wireless powered communication network (WPCN) is proposed, where two energy-limited devices first harvest energy from a hybrid access point (HAP) and then use that energy to transmit information back. To fully eliminate the doubly-near-far effect in WPCNs, two STAR-RIS operating protocol-driven tran…

    Submitted 2 October, 2024; originally announced October 2024.

  37. arXiv:2409.19370  [pdf, ps, other]

    eess.IV cs.CV

    MambaEviScrib: Mamba and Evidence-Guided Consistency Enhance CNN Robustness for Scribble-Based Weakly Supervised Ultrasound Image Segmentation

    Authors: Xiaoxiang Han, Xinyu Li, Jiang Shang, Yiman Liu, Keyan Chen, Shugong Xu, Qiaohong Liu, Qi Zhang

    Abstract: Segmenting anatomical structures and lesions from ultrasound images contributes to disease assessment. Weakly supervised learning (WSL) based on sparse annotation has achieved encouraging performance and demonstrated the potential to reduce annotation costs. This study attempts to introduce scribble-based WSL into ultrasound image segmentation tasks. However, ultrasound images often suffer from po…

    Submitted 31 October, 2024; v1 submitted 28 September, 2024; originally announced September 2024.

  38. arXiv:2409.15742  [pdf, other]

    eess.AS cs.SD

    Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample

    Authors: Zhiyong Chen, Zhiqi Ai, Xinnuo Li, Shugong Xu

    Abstract: This paper introduces a novel framework for open-set speaker identification in household environments, playing a crucial role in facilitating seamless human-computer interactions. Addressing the limitations of current speaker models and classification approaches, our work integrates a pretrained WavLM frontend with a few-shot rapid tuning neural network (NN) backend for enrollment, employing task…

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: IEEE Spoken Language Technology Workshop 2024

    Journal ref: IEEE Spoken Language Technology Workshop 2024

  39. arXiv:2409.15741  [pdf, other]

    eess.AS cs.SD

    StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis

    Authors: Zhiyong Chen, Xinnuo Li, Zhiqi Ai, Shugong Xu

    Abstract: We introduce StyleFusion-TTS, a prompt and/or audio referenced, style and speaker-controllable, zero-shot text-to-speech (TTS) synthesis system designed to enhance the editability and naturalness of current research literature. We propose a general front-end encoder as a compact and effective module to utilize multimodal inputs including text prompts, audio references, and speaker timbre reference…

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: The 7th Chinese Conference on Pattern Recognition and Computer Vision PRCV 2024

    Journal ref: The 7th Chinese Conference on Pattern Recognition and Computer Vision PRCV 2024

  40. arXiv:2409.14688  [pdf, other]

    cs.RO eess.SY

    A Generalized Control Revision Method for Autonomous Driving Safety

    Authors: Zehang Zhu, Yuning Wang, Tianqi Ke, Zeyu Han, Shaobing Xu, Qing Xu, John M. Dolan, Jianqiang Wang

    Abstract: Safety is one of the most crucial challenges of autonomous driving vehicles, and one solution to guarantee safety is to employ an additional control revision module after the planning backbone. Control Barrier Function (CBF) has been widely used because of its strong mathematical foundation on safety. However, the incompatibility with heterogeneous perception data and incomplete consideration of t…

    Submitted 17 March, 2025; v1 submitted 22 September, 2024; originally announced September 2024.

  41. arXiv:2409.12470  [pdf, other]

    cs.CV eess.IV

    HSIGene: A Foundation Model For Hyperspectral Image Generation

    Authors: Li Pang, Xiangyong Cao, Datao Tang, Shuang Xu, Xueru Bai, Feng Zhou, Deyu Meng

    Abstract: Hyperspectral image (HSI) plays a vital role in various fields such as agriculture and environmental monitoring. However, due to the expensive acquisition cost, the number of hyperspectral images is limited, degenerating the performance of downstream tasks. Although some recent studies have attempted to employ diffusion models to synthesize HSIs, they still struggle with the scarcity of HSIs, affe…

    Submitted 1 November, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

  42. arXiv:2409.08652  [pdf, other]

    eess.IV cs.CV

    SkinFormer: Learning Statistical Texture Representation with Transformer for Skin Lesion Segmentation

    Authors: Rongtao Xu, Changwei Wang, Jiguang Zhang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

    Abstract: Accurate skin lesion segmentation from dermoscopic images is of great importance for skin cancer diagnosis. However, automatic segmentation of melanoma remains a challenging task because it is difficult to incorporate useful texture representations into the learning process. Texture representations are not only related to the local structural information learned by CNN, but also include the global…

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 12 pages, 8 figures, published to JBHI

  43. arXiv:2409.02497  [pdf, other]

    eess.IV cs.CV

    A Learnable Color Correction Matrix for RAW Reconstruction

    Authors: Anqi Liu, Shiyi Mu, Shugong Xu

    Abstract: Autonomous driving algorithms usually employ sRGB images as model input due to their compatibility with the human visual system. However, visually pleasing sRGB images are possibly sub-optimal for downstream tasks when compared to RAW images. The availability of RAW images is constrained by the difficulties in collecting real-world driving data and the associated challenges of annotation. To addre…

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Accepted by BMVC2024

  44. arXiv:2408.06645  [pdf]

    eess.SY

    Dynamic Pricing of Electric Vehicle Charging Station Alliances Under Information Asymmetry

    Authors: Zeyu Liu, Yun Zhou, Donghan Feng, Shaolun Xu, Yin Yi, Hengjie Li, Haojing Wang

    Abstract: Due to the centralization of charging stations (CSs), CSs are organized as charging station alliances (CSAs) in the commercial competition. Under this situation, this paper studies the profit-oriented dynamic pricing strategy of CSAs. As the practicability basis, a privacy-protected bidirectional real-time information interaction framework is designed, under which the status of EVs is utilized as…

    Submitted 13 August, 2024; originally announced August 2024.

  45. arXiv:2407.20852  [pdf, other]

    cs.NI cs.MM eess.SY

    Optimizing 5G-Advanced Networks for Time-critical Applications: The Role of L4S

    Authors: Guangjin Pan, Shugong Xu, Pin Jiang

    Abstract: As 5G networks strive to support advanced time-critical applications, such as immersive Extended Reality (XR), cloud gaming, and autonomous driving, the demand for Real-time Broadband Communication (RTBC) grows. In this article, we present the main mechanisms of Low Latency, Low Loss, and Scalable Throughput (L4S). Subsequently, we investigate the support and challenges of L4S technology in the la…

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 7 pages, 3 figures. This work has been submitted to the IEEE for possible publication

  46. arXiv:2407.20518  [pdf, other]

    eess.IV cs.AI cs.CV

    High-Resolution Spatial Transcriptomics from Histology Images using HisToSGE

    Authors: Zhiceng Shi, Shuailin Xue, Fangfang Zhu, Wenwen Min

    Abstract: Spatial transcriptomics (ST) is a groundbreaking genomic technology that enables spatial localization analysis of gene expression within tissue sections. However, it is significantly limited by high costs and sparse spatial resolution. An alternative, more cost-effective strategy is to use deep learning methods to predict high-density gene expression profiles from histological images. However, exi…

    Submitted 29 July, 2024; originally announced July 2024.

  47. arXiv:2407.08509  [pdf, other]

    eess.IV cs.CV

    Haar Nuclear Norms with Applications to Remote Sensing Imagery Restoration

    Authors: Shuang Xu, Chang Yu, Jiangjun Peng, Xiangyong Cao, Deyu Meng

    Abstract: Remote sensing image restoration aims to reconstruct missing or corrupted areas within images. To date, low-rank based models have garnered significant interest in this field. This paper proposes a novel low-rank regularization term, named the Haar nuclear norm (HNN), for efficient and effective remote sensing image restoration. It leverages the low-rank properties of wavelet coefficients derived…

    Submitted 16 December, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  48. arXiv:2407.06064  [pdf, other]

    eess.IV cs.CV

    Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation

    Authors: Shuang Xu, Qiao Ke, Jiangjun Peng, Xiangyong Cao, Zixiang Zhao

    Abstract: This paper introduces a novel paradigm for hyperspectral image (HSI) denoising, which is termed pan-denoising. In a given scene, panchromatic (PAN) images capture similar structures and textures to HSIs but with less noise. This enables the utilization of PAN images to guide the HSI denoising process. Consequently, pan-denoising, which incorporates an additional prior, has the potential t…

    Submitted 9 September, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 62, art. no. 5528714, 2024

  49. arXiv:2407.03308  [pdf, other]

    physics.med-ph cs.AI eess.IV

    Accelerated Proton Resonance Frequency-based Magnetic Resonance Thermometry by Optimized Deep Learning Method

    Authors: Sijie Xu, Shenyan Zong, Chang-Sheng Mei, Guofeng Shen, Yueran Zhao, He Wang

    Abstract: Proton resonance frequency (PRF) based MR thermometry is essential for focused ultrasound (FUS) thermal ablation therapies. This work aims to enhance temporal resolution in dynamic MR temperature map reconstruction using an improved deep learning method. The training-optimized methods and five classical neural networks were applied on the 2-fold and 4-fold under-sampling k-space data to reconstruc…

    Submitted 3 July, 2024; originally announced July 2024.

  50. arXiv:2407.03034  [pdf, ps, other]

    eess.IV cs.AI

    Attention Incorporated Network for Sharing Low-rank, Image and K-space Information during MR Image Reconstruction to Achieve Single Breath-hold Cardiac Cine Imaging

    Authors: Siying Xu, Kerstin Hammernik, Andreas Lingg, Jens Kuebler, Patrick Krumm, Daniel Rueckert, Sergios Gatidis, Thomas Kuestner

    Abstract: Cardiac Cine Magnetic Resonance Imaging (MRI) provides an accurate assessment of heart morphology and function in clinical practice. However, MRI requires long acquisition times, with recent deep learning-based methods showing great promise to accelerate imaging and enhance reconstruction quality. Existing networks exhibit some common limitations that constrain further acceleration possibilities,…

    Submitted 3 July, 2024; originally announced July 2024.