Skip to main content

Showing 1–50 of 90 results for author: Xia, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.04662  [pdf, ps, other

    eess.SP

    Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR

    Authors: Tao Du, Jie Yang, Fan Liu, Jiaxiang Guo, Shuqiang Xia, Chao-Kai Wen, Shi Jin

    Abstract: Millimeter-wave (mmWave) 5G New Radio (NR) communication systems, with their high-resolution antenna arrays and extensive bandwidth, offer a transformative opportunity for high-throughput data transmission and advanced environmental sensing. Although passive sensing-based SLAM techniques can estimate user locations and environmental reflections simultaneously, their effectiveness is often constrai… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 7 pages, 7 figures. Accepted for publication at the 2025 IEEE International Conference on Communications (ICC). \c{opyright} 2025 IEEE. Personal use is permitted, but permission from IEEE must be obtained for all other uses

  2. arXiv:2506.22902  [pdf, ps, other

    cs.CV eess.IV

    Point Cloud Compression and Objective Quality Assessment: A Survey

    Authors: Yiling Xu, Yujie Zhang, Shuting Xia, Kaifa Yang, He Huang, Ziyu Shan, Wenjie Huang, Qi Yang, Le Yang

    Abstract: The rapid growth of 3D point cloud data, driven by applications in autonomous driving, robotics, and immersive environments, has led to criticals demand for efficient compression and quality assessment techniques. Unlike traditional 2D media, point clouds present unique challenges due to their irregular structure, high data volume, and complex attributes. This paper provides a comprehensive survey… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  3. arXiv:2505.20038  [pdf, other

    cs.SD cs.CV eess.AS

    Towards Video to Piano Music Generation with Chain-of-Perform Support Benchmarks

    Authors: Chang Liu, Haomin Zhang, Shiyu Xia, Zihao Chen, Chaofan Ding, Xin Yue, Huizhe Chen, Xinhan Di

    Abstract: Generating high-quality piano audio from video requires precise synchronization between visual cues and musical output, ensuring accurate semantic and temporal alignment.However, existing evaluation datasets do not fully capture the intricate synchronization required for piano music generation. A comprehensive benchmark is essential for two primary reasons: (1) existing metrics fail to reflect the… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 4 pages, 1 figure, accepted by CVPR 2025 MMFM Workshop

  4. arXiv:2505.05477  [pdf

    eess.SP cs.CV

    ECGDeDRDNet: A deep learning-based method for Electrocardiogram noise removal using a double recurrent dense network

    Authors: Sainan xiao, Wangdong Yang, Buwen Cao, Jintao Wu

    Abstract: Electrocardiogram (ECG) signals are frequently corrupted by noise, such as baseline wander (BW), muscle artifacts (MA), and electrode motion (EM), which significantly degrade their diagnostic utility. To address this issue, we propose ECGDeDRDNet, a deep learning-based ECG Denoising framework leveraging a Double Recurrent Dense Network architecture. In contrast to traditional approaches, we introd… ▽ More

    Submitted 22 April, 2025; originally announced May 2025.

  5. arXiv:2505.02446  [pdf, other

    cs.IT eess.SP

    Learned Intelligent Recognizer with Adaptively Customized RIS Phases in Communication Systems

    Authors: Yixuan Huang, Jie Yang, Chao-Kai Wen, Shuqiang Xia, Xiao Li, Shi Jin

    Abstract: This study presents an advanced wireless system that embeds target recognition within reconfigurable intelligent surface (RIS)-aided communication systems, powered by cuttingedge deep learning innovations. Such a system faces the challenge of fine-tuning both the RIS phase shifts and neural network (NN) parameters, since they intricately interdepend on each other to accomplish the recognition task… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: accepted by FCN 2024. arXiv admin note: substantial text overlap with arXiv:2503.02244

  6. arXiv:2505.02440  [pdf, other

    cs.IT eess.SP

    Cooperative ISAC Network for Off-Grid Imaging-based Low-Altitude Surveillance

    Authors: Yixuan Huang, Jie Yang, Chao-Kai Wen, Shuqiang Xia, Xiao Li, Shi Jin

    Abstract: The low-altitude economy has emerged as a critical focus for future economic development, emphasizing the urgent need for flight activity surveillance utilizing the existing sensing capabilities of mobile cellular networks. Traditional monostatic or localization-based sensing methods, however, encounter challenges in fusing sensing results and matching channel parameters. To address these challeng… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: accepted by VTC2025-Spring

  7. arXiv:2503.15054  [pdf, other

    eess.SP

    Joint Design of Radar Receive Filter and Unimodular ISAC Waveform with Sidelobe Level Control

    Authors: Kecheng Zhang, Ya-Feng Liu, Zhongbin Wang, Weijie Yuan, Musa Furkan Keskin, Henk Wymeersch, Shuqiang Xia

    Abstract: Integrated sensing and communication (ISAC) has been considered a key feature of next-generation wireless networks. This paper investigates the joint design of the radar receive filter and dual-functional transmit waveform for the multiple-input multiple-output (MIMO) ISAC system. While optimizing the mean square error (MSE) of the radar receive spatial response and maximizing the achievable rate… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: Submitted to IEEE for possible publication

  8. Enhanced Diagnostic Fidelity in Pathology Whole Slide Image Compression via Deep Learning

    Authors: Maximilian Fischer, Peter Neher, Peter Schüffler, Shuhan Xiao, Silvia Dias Almeida, Constantin Ulrich, Alexander Muckenhuber, Rickmer Braren, Michael Götz, Jens Kleesiek, Marco Nolden, Klaus Maier-Hein

    Abstract: Accurate diagnosis of disease often depends on the exhaustive examination of Whole Slide Images (WSI) at microscopic resolution. Efficient handling of these data-intensive images requires lossy compression techniques. This paper investigates the limitations of the widely-used JPEG algorithm, the current clinical standard, and reveals severe image artifacts impacting diagnostic fidelity. To overcom… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  9. arXiv:2503.05794  [pdf, other

    cs.CR cs.AI cs.LG cs.SD eess.AS

    CBW: Towards Dataset Ownership Verification for Speaker Verification via Clustering-based Backdoor Watermarking

    Authors: Yiming Li, Kaiying Yan, Shuo Shao, Tongqing Zhai, Shu-Tao Xia, Zhan Qin, Dacheng Tao

    Abstract: With the increasing adoption of deep learning in speaker verification, large-scale speech datasets have become valuable intellectual property. To audit and prevent the unauthorized usage of these valuable released datasets, especially in commercial or open-source scenarios, we propose a novel dataset ownership verification method. Our approach introduces a clustering-based backdoor watermark (CBW)… ▽ More

    Submitted 5 April, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

    Comments: 14 pages. The journal extension of our ICASSP'21 paper (arXiv:2010.11607)

  10. arXiv:2502.01332  [pdf, other

    quant-ph eess.SY math.OC

    A two-disk approach to the synthesis of coherent passive equalizers for linear quantum systems

    Authors: Valery Ugrinovskii, Shuixin Xiao

    Abstract: The coherent equalization problem consists in designing a quantum system acting as a mean-square near optimal filter for a given quantum communication channel. The paper develops an improved method for the synthesis of transfer functions for such equalizing filters, based on a linear quantum system model of the channel. The method draws on a connection with the two-disk problem of ${H}_{\infty}$ c… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: 18 pages, 8 figures

  11. arXiv:2501.18878  [pdf, ps, other

    eess.SP

    Integrated Sensing and Communication System Based on Radio Frequency Resonance Beam

    Authors: Yixuan Guo, Shuaifan Xia, Mingliang Xiong, Qingwen Liu, Wen Fang, Qingwei Jiang, Gang Yan, Jiangchuan Mu

    Abstract: To address the complex beam control in traditional multiple-input multiple-output (MIMO) systems, researchers have proposed adaptive beam alignment using retro-directive antenna (RDA) arrays. This approach creates echo resonance between the base station (BS) and user equipment (UE), significantly reducing computational load. However, conventional resonant beam systems (RBS) suffer from echo interf… ▽ More

    Submitted 5 June, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

  12. arXiv:2412.17088  [pdf, other

    eess.SP

    6DMA-Aided Hybrid Beamforming with Joint Antenna Position and Orientation Optimization

    Authors: Yichi Zhang, Yuchen Zhang, Lipeng Zhu, Sa Xiao, Wanbin Tang, Yonina C. Eldar, Rui Zhang

    Abstract: This paper studies a sub-connected six-dimensional movable antenna (6DMA)-aided multi-user communication system. In this system, each sub-array is connected to a dedicated radio frequency chain and collectively moves and rotates as a unit within specific local regions. The movement and rotation capabilities of 6DMAs enhance design flexibility, facilitating the capture of spatial variations for imp… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

    Comments: The conference version of this paper has been accepted for Globecom 2024 Workshop

  13. arXiv:2412.13137  [pdf

    eess.IV cs.CV

    Unlocking the Potential of Digital Pathology: Novel Baselines for Compression

    Authors: Maximilian Fischer, Peter Neher, Peter Schüffler, Sebastian Ziegler, Shuhan Xiao, Robin Peretzke, David Clunie, Constantin Ulrich, Michael Baumgartner, Alexander Muckenhuber, Silvia Dias Almeida, Michael Götz, Jens Kleesiek, Marco Nolden, Rickmer Braren, Klaus Maier-Hein

    Abstract: Digital pathology offers a groundbreaking opportunity to transform clinical practice in histopathological image analysis, yet faces a significant hurdle: the substantial file sizes of pathological Whole Slide Images (WSI). While current digital pathology solutions rely on lossy JPEG compression to address this issue, lossy compression can introduce color and texture disparities, potentially impact… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

  14. arXiv:2412.05167  [pdf, other

    cs.AI cs.CL cs.SD eess.AS

    Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models

    Authors: Kuofeng Gao, Shu-Tao Xia, Ke Xu, Philip Torr, Jindong Gu

    Abstract: Large Audio-Language Models (LALMs) have unclocked audio dialogue capabilities, where audio dialogues are a direct exchange of spoken language between LALMs and humans. Recent advances, such as GPT-4o, have enabled LALMs in back-and-forth audio dialogues with humans. This progression not only underscores the potential of LALMs but also broadens their applicability across a wide range of practical… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

  15. arXiv:2411.15269  [pdf, other

    eess.IV cs.CV cs.LG

    MambaIRv2: Attentive State Space Restoration

    Authors: Hang Guo, Yong Guo, Yaohua Zha, Yulun Zhang, Wenbo Li, Tao Dai, Shu-Tao Xia, Yawei Li

    Abstract: The Mamba-based image restoration backbones have recently demonstrated significant potential in balancing global reception and computational efficiency. However, the inherent causal modeling limitation of Mamba, where each token depends solely on its predecessors in the scanned sequence, restricts the full utilization of pixels across the image and thus presents new challenges in image restoration… ▽ More

    Submitted 10 March, 2025; v1 submitted 22 November, 2024; originally announced November 2024.

    Comments: Accepted by CVPR2025

  16. arXiv:2409.10120  [pdf, other

    eess.IV cs.CV

    Data-Centric Strategies for Overcoming PET/CT Heterogeneity: Insights from the AutoPET III Lesion Segmentation Challenge

    Authors: Balint Kovacs, Shuhan Xiao, Maximilian Rokuss, Constantin Ulrich, Fabian Isensee, Klaus H. Maier-Hein

    Abstract: The third autoPET challenge introduced a new data-centric task this year, shifting the focus from model development to improving metastatic lesion segmentation on PET/CT images through data quality and handling strategies. In response, we developed targeted methods to enhance segmentation performance tailored to the characteristics of PET/CT imaging. Our approach encompasses two key elements. Firs… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: Contribution to the data-centric task of the autoPET III Challenge 2024

  17. arXiv:2409.09478  [pdf, other

    eess.IV cs.AI cs.CV

    From FDG to PSMA: A Hitchhiker's Guide to Multitracer, Multicenter Lesion Segmentation in PET/CT Imaging

    Authors: Maximilian Rokuss, Balint Kovacs, Yannick Kirchhoff, Shuhan Xiao, Constantin Ulrich, Klaus H. Maier-Hein, Fabian Isensee

    Abstract: Automated lesion segmentation in PET/CT scans is crucial for improving clinical workflows and advancing cancer diagnostics. However, the task is challenging due to physiological variability, different tracers used in PET imaging, and diverse imaging protocols across medical centers. To address this, the autoPET series was created to challenge researchers to develop algorithms that generalize acros… ▽ More

    Submitted 21 October, 2024; v1 submitted 14 September, 2024; originally announced September 2024.

    Comments: Winning method of the autoPET III challenge (model-centric) - Team LesionTracer

  18. arXiv:2408.08833  [pdf, other

    eess.SP

    Intra-symbol Differential Amplitude Shift Keying-aided Blind Detector for Ambient Backscatter Communication Systems

    Authors: Shuaijun Ma, Peng Wei, Sa Xiao, Jianquan Wang, Wanbin Tang, Wei Xiang

    Abstract: Ambient backscatter communications (AmBC) are a promising technology for addressing the energy consumption challenge in wireless communications through the reflection or absorption of surrounding radio frequency (RF) signals. However, it grapples with the intricacies of ambient RF signal and the round-trip path loss. For traditional detectors, the incorporation of pilot sequences results in a redu… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  19. arXiv:2408.04274  [pdf, other

    physics.optics eess.SP

    Field of View Expansion for Resonant Beam Information and Power Transfer

    Authors: Shun Han, Wen Fang, Mingqing Liu, Mengyuan Xu, Shuaifan Xia, Qingwen Liu

    Abstract: Simultaneous wireless information and power transfer (SWIPT) leverages lightwave as the wireless transmission medium, emerging as a promising technology in the future Internet of Things (IoT) scenarios. The use of retro-reflectors in constructing spatially separated laser resonators (SSLR) enables a self-aligning wireless transmission system with the self-reproducing resonant beam, i.e. resonant b… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  20. arXiv:2407.10377  [pdf

    eess.IV cs.AI cs.CV

    Enhanced Masked Image Modeling to Avoid Model Collapse on Multi-modal MRI Datasets

    Authors: Linxuan Han, Sa Xiao, Zimeng Li, Haidong Li, Xiuchao Zhao, Yeqing Han, Fumin Guo, Xin Zhou

    Abstract: Multi-modal magnetic resonance imaging (MRI) provides information of lesions for computer-aided diagnosis from different views. Deep learning algorithms are suitable for identifying specific anatomical structures, segmenting lesions, and classifying diseases. Manual labels are limited due to the high expense, which hinders further improvement of accuracy. Self-supervised learning, particularly mas… ▽ More

    Submitted 15 January, 2025; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: This work has been submitted to the lEEE for possible publication. copyright may be transferred without notice, after which this version may no longer be accessible

  21. arXiv:2407.05619  [pdf, other

    cs.RO eess.SY

    AIRA: A Low-cost IR-based Approach Towards Autonomous Precision Drone Landing and NLOS Indoor Navigation

    Authors: Yanchen Liu, Minghui Zhao, Kaiyuan Hou, Junxi Xia, Charlie Carver, Stephen Xia, Xia Zhou, Xiaofan Jiang

    Abstract: Automatic drone landing is an important step for achieving fully autonomous drones. Although there are many works that leverage GPS, video, wireless signals, and active acoustic sensing to perform precise landing, autonomous drone landing remains an unsolved challenge for palm-sized microdrones that may not be able to support the high computational requirements of vision, wireless, or active audio… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  22. arXiv:2406.12623  [pdf, other

    eess.IV cs.CV

    Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution

    Authors: Maximilian Fischer, Peter Neher, Tassilo Wald, Silvia Dias Almeida, Shuhan Xiao, Peter Schüffler, Rickmer Braren, Michael Götz, Alexander Muckenhuber, Jens Kleesiek, Marco Nolden, Klaus Maier-Hein

    Abstract: Processing histopathological Whole Slide Images (WSI) leads to massive storage requirements for clinics worldwide. Even after lossy image compression during image acquisition, additional lossy compression is frequently possible without substantially affecting the performance of deep learning-based (DL) downstream tasks. In this paper, we show that the commonly used JPEG algorithm is not best suite… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  23. arXiv:2406.02534  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Enhancing predictive imaging biomarker discovery through treatment effect analysis

    Authors: Shuhan Xiao, Lukas Klein, Jens Petersen, Philipp Vollmuth, Paul F. Jaeger, Klaus H. Maier-Hein

    Abstract: Identifying predictive covariates, which forecast individual treatment effectiveness, is crucial for decision-making across different disciplines such as personalized medicine. These covariates, referred to as biomarkers, are extracted from pre-treatment data, often within randomized controlled trials, and should be distinguished from prognostic biomarkers, which are independent of treatment assig… ▽ More

    Submitted 9 December, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to WACV 2025

  24. arXiv:2405.15413  [pdf, other

    eess.IV cs.CV cs.IT

    MambaVC: Learned Visual Compression with Selective State Spaces

    Authors: Shiyu Qin, Jinpeng Wang, Yimin Zhou, Bin Chen, Tianci Luo, Baoyi An, Tao Dai, Shutao Xia, Yaowei Wang

    Abstract: Learned visual compression is an important and active task in multimedia. Existing approaches have explored various CNN- and Transformer-based designs to model content distribution and eliminate redundancy, where balancing efficacy (i.e., rate-distortion trade-off) and efficiency remains a challenge. Recently, state-space models (SSMs) have shown promise due to their long-range modeling capacity a… ▽ More

    Submitted 28 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 17pages,15 figures

  25. arXiv:2405.01242  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms

    Authors: Yueyuan Sui, Minghui Zhao, Junxi Xia, Xiaofan Jiang, Stephen Xia

    Abstract: We propose TRAMBA, a hybrid transformer and Mamba architecture for acoustic and bone conduction speech enhancement, suitable for mobile and wearable platforms. Bone conduction speech enhancement has been impractical to adopt in mobile and wearable platforms for several reasons: (i) data collection is labor-intensive, resulting in scarcity; (ii) there exists a performance gap between state of-art m… ▽ More

    Submitted 29 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  26. arXiv:2404.00953  [pdf, ps, other

    cs.IT eess.SP

    Movable Antenna-Aided Hybrid Beamforming for Multi-User Communications

    Authors: Yichi Zhang, Yuchen Zhang, Lipeng Zhu, Sa Xiao, Wanbin Tang, Yonina C. Eldar, Rui Zhang

    Abstract: In this correspondence, we propose a movable antenna (MA)-aided multi-user hybrid beamforming scheme with a sub-connected structure, where multiple movable sub-arrays can independently change their positions within different local regions. To maximize the system sum rate, we jointly optimize the digital beamformer, analog beamformer, and positions of subarrays, under the constraints of unit modulu… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  27. arXiv:2403.14250  [pdf, other

    eess.IV cs.CR cs.CV

    Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations

    Authors: Xun Lin, Yi Yu, Song Xia, Jue Jiang, Haoran Wang, Zitong Yu, Yizhong Liu, Ying Fu, Shuai Wang, Wenzhong Tang, Alex Kot

    Abstract: The widespread availability of publicly accessible medical images has significantly propelled advancements in various research and clinical fields. Nonetheless, concerns regarding unauthorized training of AI systems for commercial purposes and the duties of patient privacy protection have led numerous institutions to hesitate to share their images. This is particularly true for medical image segme… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  28. Networked Collaborative Sensing using Multi-domain Measurements: Architectures, Performance Limits and Algorithms

    Authors: Yihua Ma, Shuqiang Xia, Chen bai, Yuxin Wang, Zhongbin Wang, Songqian Li

    Abstract: As a promising 6G technology, integrated sensing and communication (ISAC) gains growing interest. ISAC provides integration gain via sharing spectrum, hardware, and software. However, concerns exist regarding its sensing performance when compared to the dedicated radar. To address this issue, the advantages of widely deployed networks should be utilized. This paper proposes networked collaborative… ▽ More

    Submitted 28 November, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Journal ref: IEEE Transactions on Vehicular Technology, early access, 2024

  29. arXiv:2402.08952  [pdf, other

    quant-ph eess.SY

    A two-stage solution to quantum process tomography: error analysis and optimal design

    Authors: Shuixin Xiao, Yuanlong Wang, Jun Zhang, Daoyi Dong, Gary J. Mooney, Ian R. Petersen, Hidehiro Yonezawa

    Abstract: Quantum process tomography is a critical task for characterizing the dynamics of quantum systems and achieving precise quantum control. In this paper, we propose a two-stage solution for both trace-preserving and non-trace-preserving quantum process tomography. Utilizing a tensor structure, our algorithm exhibits a computational complexity of $O(MLd^2)$ where $d$ is the dimension of the quantum sy… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 41 pages, 7 figures

  30. arXiv:2401.12587  [pdf, other

    eess.IV cs.CV

    An Efficient Implicit Neural Representation Image Codec Based on Mixed Autoregressive Model for Low-Complexity Decoding

    Authors: Xiang Liu, Jiahong Chen, Bin Chen, Zimo Liu, Baoyi An, Shu-Tao Xia, Zhi Wang

    Abstract: Displaying high-quality images on edge devices, such as augmented reality devices, is essential for enhancing the user experience. However, these devices often face power consumption and computing resource limitations, making it challenging to apply many deep learning-based image compression algorithms in this field. Implicit Neural Representation (INR) for image compression is an emerging technol… ▽ More

    Submitted 7 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  31. arXiv:2311.13847  [pdf, other

    cs.CV cs.IT eess.IV

    Perceptual Image Compression with Cooperative Cross-Modal Side Information

    Authors: Shiyu Qin, Bin Chen, Yujun Huang, Baoyi An, Tao Dai, Shu-Tao Xia

    Abstract: The explosion of data has resulted in more and more associated text being transmitted along with images. Inspired by from distributed source coding, many works utilize image side information to enhance image compression. However, existing methods generally do not consider using text as side information to enhance perceptual compression of images, even though the benefits of multimodal synergy have… ▽ More

    Submitted 28 November, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

  32. Near-Field Wideband Secure Communications: An Analog Beamfocusing Approach

    Authors: Yuchen Zhang, Haiyang Zhang, Sa Xiao, Wanbin Tang, Yonina C. Eldar

    Abstract: In the rapidly advancing landscape of 6G, characterized by ultra-high-speed wideband transmission in millimeter-wave and terahertz bands, our paper addresses the pivotal task of enhancing physical layer security (PLS) within near-field wideband communications. We introduce true-time delayer (TTD)-incorporated analog beamfocusing techniques designed to address the interplay between near-field propa… ▽ More

    Submitted 28 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: This work has been submitted to IEEE journal for publication

    Journal ref: IEEE Transactions on Signal Processing, 2024

  33. arXiv:2310.20421  [pdf, other

    quant-ph eess.SY

    Two-stage solution for ancilla-assisted quantum process tomography: error analysis and optimal design

    Authors: Shuixin Xiao, Yuanlong Wang, Daoyi Dong, Jun Zhang

    Abstract: Quantum process tomography (QPT) is a fundamental task to characterize the dynamics of quantum systems. In contrast to standard QPT, ancilla-assisted process tomography (AAPT) framework introduces an extra ancilla system such that a single input state is needed. In this paper, we extend the two-stage solution, a method originally designed for standard QPT, to perform AAPT. Our algorithm has… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 6 pages, 3 figures

  34. arXiv:2310.10997  [pdf

    eess.SY

    Cooperative Dispatch of Microgrids Community Using Risk-Sensitive Reinforcement Learning with Monotonously Improved Performance

    Authors: Ziqing Zhu, Xiang Gao, Siqi Bu, Ka Wing Chan, Bin Zhou, Shiwei Xia

    Abstract: The integration of individual microgrids (MGs) into Microgrid Clusters (MGCs) significantly improves the reliability and flexibility of energy supply, through resource sharing and ensuring backup during outages. The dispatch of MGCs is the key challenge to be tackled to ensure their secure and economic operation. Currently, there is a lack of optimization method that can achieve a trade-off among… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  35. Energy-efficient Integrated Sensing and Communication System and DNLFM Waveform

    Authors: Yihua Ma, Zhifeng Yuan, Shuqiang Xia, Chen Bai, Zhongbin Wang, Yuxin Wang

    Abstract: Integrated sensing and communication (ISAC) is a key enabler of 6G. Unlike communication radio links, the sensing signal requires to experience round trips from many scatters. Therefore, sensing is more power-sensitive and faces a severer multi-target interference. In this paper, the ISAC system employs dedicated sensing signals, which can be reused as the communication reference signal. This pape… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Journal ref: 2024 IEEE 99th Vehicular Technology Conference (VTC2024-Spring), Singapore, Singapore, 2024, pp. 1-6

  36. Joint Beam Management and SLAM for mmWave Communication Systems

    Authors: Hang Que, Jie Yang, Chao-Kai Wen, Shuqiang Xia, Xiao Li, Shi Jin

    Abstract: The millimeter-wave (mmWave) communication technology, which employs large-scale antenna arrays, enables inherent sensing capabilities. Simultaneous localization and mapping (SLAM) can utilize channel multipath angle estimates to realize integrated sensing and communication design in 6G communication systems. However, existing works have ignored the significant overhead required by the mmWave beam… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Journal ref: IEEE Transactions on Communications, early access, July 2023

  37. arXiv:2306.11977  [pdf

    eess.IV cs.CV

    Encoding Enhanced Complex CNN for Accurate and Highly Accelerated MRI

    Authors: Zimeng Li, Sa Xiao, Cheng Wang, Haidong Li, Xiuchao Zhao, Caohui Duan, Qian Zhou, Qiuchen Rao, Yuan Fang, Junshuai Xie, Lei Shi, Fumin Guo, Chaohui Ye, Xin Zhou

    Abstract: Magnetic resonance imaging (MRI) using hyperpolarized noble gases provides a way to visualize the structure and function of human lung, but the long imaging time limits its broad research and clinical applications. Deep learning has demonstrated great potential for accelerating MRI by reconstructing images from undersampled data. However, most existing deep conventional neural networks (CNN) direc… ▽ More

    Submitted 13 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  38. Joint Localization and Environment Sensing by Harnessing NLOS Components in RIS-aided mmWave Communication Systems

    Authors: Yixuan Huang, Jie Yang, Wankai Tang, Chao-Kai Wen, Shuqiang Xia, Shi Jin

    Abstract: This study explores the use of non-line-of-sight (NLOS) components in millimeter-wave (mmWave) communication systems for joint localization and environment sensing. The radar cross section (RCS) of a reconfigurable intelligent surface (RIS) is calculated to develop a general path gain model for RISs and traditional scatterers. The results show that RISs have a greater potential to assist in locali… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: 32 pages, 12 figures, accepted by IEEE Transactions on Wireless Communications

    Journal ref: IEEE Transactions on Wireless Communications, early access, April 2023

  39. arXiv:2305.06279  [pdf, other

    cs.IT cs.LG eess.SP

    Vertical Federated Learning over Cloud-RAN: Convergence Analysis and System Optimization

    Authors: Yuanming Shi, Shuhao Xia, Yong Zhou, Yijie Mao, Chunxiao Jiang, Meixia Tao

    Abstract: Vertical federated learning (FL) is a collaborative machine learning framework that enables devices to learn a global model from the feature-partition datasets without sharing local raw data. However, as the number of the local intermediate outputs is proportional to the training samples, it is critical to develop communication-efficient techniques for wireless vertical FL to support high-dimensio… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 32 pages, 7 figures

  40. arXiv:2305.05356  [pdf, other

    cs.CV cs.MM eess.IV

    Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching

    Authors: Shuting Xia, Tingyu Fan, Yiling Xu, Jenq-Neng Hwang, Zhu Li

    Abstract: 3D dynamic point cloud (DPC) compression relies on mining its temporal context, which faces significant challenges due to DPC's sparsity and non-uniform structure. Existing methods are limited in capturing sufficient temporal dependencies. Therefore, this paper proposes a learning-based DPC compression framework via hierarchical block-matching-based inter-prediction module to compensate and compre… ▽ More

    Submitted 16 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 9 pages for the main body, 3 pages for the supplemental after References

  41. arXiv:2305.02485   

    cs.AI cs.LG eess.SY

    How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 1: A Paradigmatic Theory

    Authors: Ziqing Zhu, Siqi Bu, Ka Wing Chan, Bin Zhou, Shiwei Xia

    Abstract: In face of the pressing need of decarbonization in the power sector, the re-design of electricity market is necessary as a Marco-level approach to accommodate the high penetration of renewable generations, and to achieve power system operation security, economic efficiency, and environmental friendliness. However, existing market design methodologies suffer from the lack of coordination among ener… ▽ More

    Submitted 11 May, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: It is old version with mistakes

  42. arXiv:2305.00561  [pdf, other

    cs.AI cs.FL cs.MA cs.RO eess.SY

    Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments

    Authors: Junchao Li, Mingyu Cai, Zhen Kan, Shaoping Xiao

    Abstract: Motion planning of autonomous agents in partially known environments with incomplete information is a challenging problem, particularly for complex tasks. This paper proposes a model-free reinforcement learning approach to address this problem. We formulate motion planning as a probabilistic-labeled partially observable Markov decision process (PL-POMDP) problem and use linear temporal logic (LTL)… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 32 pages, 22 figures, submitted to Autonomous Agents and Multi-Agent Systems

  43. arXiv:2304.10780  [pdf, other

    cs.CV eess.IV

    Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction

    Authors: Binbin Huang, Xingyue Peng, Siyuan Shen, Suan Xia, Ruiqian Li, Yanhua Yu, Yuehan Wang, Shenghua Gao, Wenzheng Chen, Shiying Li, Jingyi Yu

    Abstract: We introduce Omni-LOS, a neural computational imaging method for conducting holistic shape reconstruction (HSR) of complex objects utilizing a Single-Photon Avalanche Diode (SPAD)-based time-of-flight sensor. As illustrated in Fig. 1, our method enables new capabilities to reconstruct near-$360^\circ$ surrounding geometry of an object from a single scan spot. In such a scenario, traditional line-o… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  44. arXiv:2302.09256  [pdf, other

    eess.AS cs.SD

    Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection

    Authors: Shengchang Xiao, Xueshuai Zhang, Pengyuan Zhang

    Abstract: Recently, convolutional neural networks (CNNs) have been widely used in sound event detection (SED). However, traditional convolution is deficient in learning time-frequency domain representation of different sound events. To address this issue, we propose multi-dimensional frequency dynamic convolution (MFDConv), a new design that endows convolutional kernels with frequency-adaptive dynamic prope… ▽ More

    Submitted 21 February, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: accepted to ICASSP 2023

  45. arXiv:2212.07651  [pdf, other

    eess.IV cs.CV cs.LG

    Two-stage Contextual Transformer-based Convolutional Neural Network for Airway Extraction from CT Images

    Authors: Yanan Wu, Shuiqing Zhao, Shouliang Qi, Jie Feng, Haowen Pang, Runsheng Chang, Long Bai, Mengqi Li, Shuyue Xia, Wei Qian, Hongliang Ren

    Abstract: Accurate airway extraction from computed tomography (CT) images is a critical step for planning navigation bronchoscopy and quantitative assessment of airway-related chronic obstructive pulmonary disease (COPD). The existing methods are challenging to sufficiently segment the airway, especially the high-generation airway, with the constraint of the limited label and cannot meet the clinical use in… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  46. arXiv:2210.16197  [pdf

    eess.SP

    Dimensionality Reduced Antenna Array for Beamforming/steering

    Authors: Shiyi Xia, Mingyang Zhao, Qian Ma, Xunnan Zhang, Ling Yang, Yazhi Pi, Hyunchul Chung, Ad Reniers, A. M. J. Koonen, Zizheng Cao

    Abstract: Beamforming makes possible a focused communication method. It is extensively employed in many disciplines involving electromagnetic waves, including arrayed ultrasonic, optical, and high-speed wireless communication. Conventional beam steering often requires the addition of separate active amplitude phase control units after each radiating element. The high power consumption and complexity of larg… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

  47. Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting

    Authors: Jie Wang, Yuji Liu, Binling Wang, Yiming Zhi, Song Li, Shipeng Xia, Jiayang Zhang, Feng Tong, Lin Li, Qingyang Hong

    Abstract: This paper describes a spatial-aware speaker diarization system for the multi-channel multi-party meeting. The diarization system obtains direction information of speaker by microphone array. Speaker spatial embedding is generated by xvector and s-vector derived from superdirective beamforming (SDB) which makes the embedding more robust. Specifically, we propose a novel multi-channel sequence-to-s… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: Accepted by Interspeech 2022. arXiv admin note: text overlap with arXiv:2202.05744

  48. arXiv:2209.11953  [pdf

    q-bio.NC eess.SP

    TD-BPQBC: A 1.8μW 5.5mm3 ADC-less Neural Implant SoC utilizing 13.2pJ/Sample Time-domain Bi-phasic Quasi-static Brain Communication

    Authors: Baibhab Chatterjee, K Gaurav Kumar, Shulan Xiao, Gourab Barik, Krishna Jayant, Shreyas Sen

    Abstract: Untethered miniaturized wireless neural sensor nodes with data transmission and energy harvesting capabilities call for circuit and system-level innovations to enable ultra-low energy deep implants for brain-machine interfaces. Realizing that the energy and size constraints of a neural implant motivate highly asymmetric system design (a small, low-power sensor and transmitter at the implant, with… ▽ More

    Submitted 19 October, 2022; v1 submitted 24 September, 2022; originally announced September 2022.

    Comments: 4 pages, 6 figures, presented in ESSCIRC 2022 conference

  49. arXiv:2208.04318  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Adaptive Local Implicit Image Function for Arbitrary-scale Super-resolution

    Authors: Hongwei Li, Tao Dai, Yiming Li, Xueyi Zou, Shu-Tao Xia

    Abstract: Image representation is critical for many visual tasks. Instead of representing images discretely with 2D arrays of pixels, a recent study, namely local implicit image function (LIIF), denotes images as a continuous function where pixel values are expansion by using the corresponding coordinates as inputs. Due to its continuous nature, LIIF can be adopted for arbitrary-scale image super-resolution… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: This paper is accepted by ICIP 2022. 5 pages

  50. Highly Efficient Waveform Design and Hybrid Duplex for Joint Communication and Sensing

    Authors: Yihua Ma, Zhifeng Yuan, Shuqiang Xia, Guanghui Yu, Liujun Hu

    Abstract: Joint communication and sensing (JCAS) is a very promising 6G technology, which attracts more and more research attention. Compared with communication, radar has many unique features in terms of waveform design criteria, self-interference cancellation (SIC), aperture-dependent resolution, and virtual aperture. This paper proposes a novel waveform design named max-aperture radar slicing (MaRS) to g… ▽ More

    Submitted 4 July, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: in IEEE Internet of Things Journal

    Journal ref: in IEEE Internet of Things Journal, vol. 10, no. 19, pp. 17369-17381, 1 Oct.1, 2023,