Skip to main content

Showing 1–50 of 228 results for author: Ma, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.01348  [pdf, ps, other

    eess.AS cs.SD

    SpeechAccentLLM: A Unified Framework for Foreign Accent Conversion and Text to Speech

    Authors: Zhuangfei Cheng, Guangyan Zhang, Zehai Tu, Yangyang Song, Shuiyang Mao, Xiaoqi Jiao, Jingyu Li, Yiwen Guo, Jiasong Wu

    Abstract: Foreign accent conversion (FAC) in speech processing remains a challenging task. Building on the remarkable success of large language models (LLMs) in Text-to-Speech (TTS) tasks, this study investigates the adaptation of LLM-based techniques for FAC, which we term SpeechAccentLLM. At the core of this framework, we introduce SpeechCodeVAE, the first model to integrate connectionist temporal classif… ▽ More

    Submitted 8 July, 2025; v1 submitted 2 July, 2025; originally announced July 2025.

    Comments: 10 pages, includes references, 4 figures, 4 tables

    ACM Class: I.2.7

  2. arXiv:2506.21851  [pdf, ps, other

    cs.CV cs.MM eess.IV

    End-to-End RGB-IR Joint Image Compression With Channel-wise Cross-modality Entropy Model

    Authors: Haofeng Wang, Fangtao Zhou, Qi Zhang, Zeyuan Chen, Enci Zhang, Zhao Wang, Xiaofeng Huang, Siwei Ma

    Abstract: RGB-IR(RGB-Infrared) image pairs are frequently applied simultaneously in various applications like intelligent surveillance. However, as the number of modalities increases, the required data storage and transmission costs also double. Therefore, efficient RGB-IR data compression is essential. This work proposes a joint compression framework for RGB-IR image pair. Specifically, to fully utilize cr… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: IEEE International Conference on Systems, Man, and Cybernetics 2025. (SMC), under review

  3. arXiv:2506.15136  [pdf, ps, other

    eess.SP

    Out-of-Band Modality Synergy Based Multi-User Beam Prediction and Proactive BS Selection with Zero Pilot Overhead

    Authors: Kehui Li, Binggui Zhou, Jiajia Guo, Feifei Gao, Guanghua Yang, Shaodan Ma

    Abstract: Multi-user millimeter-wave communication relies on narrow beams and dense cell deployments to ensure reliable connectivity. However, tracking optimal beams for multiple mobile users across multiple base stations (BSs) results in significant signaling overhead. Recent works have explored the capability of out-of-band (OOB) modalities in obtaining spatial characteristics of wireless channels and red… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  4. arXiv:2506.12308  [pdf, ps, other

    eess.SP eess.SY

    From Ground to Sky: Architectures, Applications, and Challenges Shaping Low-Altitude Wireless Networks

    Authors: Weijie Yuan, Yuanhao Cui, Jiacheng Wang, Fan Liu, Geng Sun, Tao Xiang, Jie Xu, Shi Jin, Dusit Niyato, Sinem Coleri, Sumei Sun, Shiwen Mao, Abbas Jamalipour, Dong In Kim, Mohamed-Slim Alouini, Xuemin Shen

    Abstract: In this article, we introduce a novel low-altitude wireless network (LAWN), which is a reconfigurable, three-dimensional (3D) layered architecture. In particular, the LAWN integrates connectivity, sensing, control, and computing across aerial and terrestrial nodes that enable seamless operation in complex, dynamic, and mission-critical environments. Different from the conventional aerial communica… ▽ More

    Submitted 16 June, 2025; v1 submitted 13 June, 2025; originally announced June 2025.

    Comments: 10 pages, 5 figures

  5. arXiv:2505.22286  [pdf, ps, other

    cs.IT eess.SP

    Wireless Communication for Low-Altitude Economy with UAV Swarm Enabled Two-Level Movable Antenna System

    Authors: Haiquan Lu, Yong Zeng, Shaodan Ma, Bin Li, Shi Jin, Rui Zhang

    Abstract: Unmanned aerial vehicle (UAV) is regarded as a key enabling platform for low-altitude economy, due to its advantages such as 3D maneuverability, flexible deployment, and LoS air-to-air/ground communication links. In particular, the intrinsic high mobility renders UAV especially suitable for operating as a movable antenna (MA) from the sky. In this paper, by exploiting the flexible mobility of UAV… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 13 pages, 10 figures

  6. arXiv:2505.21951  [pdf, ps, other

    cs.IT eess.SP

    When Feedback Empowers the Uplink: Integrating Adaptive Coding with Wireless Power Transfer

    Authors: Zijian Yang, Yulin Shao, Shaodan Ma

    Abstract: Energy consumption and device lifetime are critical concerns for battery-constrained IoT devices. This paper introduces the Feedback-Aided Coding and Energy Transfer (FACET) framework, which synergistically combines adaptive feedback channel coding with wireless power transfer. FACET leverages the saturation effect of feedback coding, where increasing downlink power yields diminishing returns, to… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  7. arXiv:2505.11978  [pdf, ps, other

    cs.NI eess.SP

    LLM-guided DRL for Multi-tier LEO Satellite Networks with Hybrid FSO/RF Links

    Authors: Jiahui Li, Geng Sun, Zemin Sun, Jiacheng Wang, Yinqiu Liu, Ruichen Zhang, Dusit Niyato, Shiwen Mao

    Abstract: Despite significant advancements in terrestrial networks, inherent limitations persist in providing reliable coverage to remote areas and maintaining resilience during natural disasters. Multi-tier networks with low Earth orbit (LEO) satellites and high-altitude platforms (HAPs) offer promising solutions, but face challenges from high mobility and dynamic channel conditions that cause unstable con… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: This paper has been submitted to IEEE JSAC

  8. arXiv:2505.03261  [pdf, other

    cs.CV eess.IV

    DiffVQA: Video Quality Assessment Using Diffusion Feature Extractor

    Authors: Wei-Ting Chen, Yu-Jiet Vong, Yi-Tsung Lee, Sy-Yen Kuo, Qiang Gao, Sizhuo Ma, Jian Wang

    Abstract: Video Quality Assessment (VQA) aims to evaluate video quality based on perceptual distortions and human preferences. Despite the promising performance of existing methods using Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), they often struggle to align closely with human perceptions, particularly in diverse real-world scenarios. This challenge is exacerbated by the limited sc… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  9. arXiv:2505.00687  [pdf, ps, other

    eess.IV cs.CV

    GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution

    Authors: Aditya Arora, Zhengzhong Tu, Yufei Wang, Ruizheng Bai, Jian Wang, Sizhuo Ma

    Abstract: In this paper, we propose GuideSR, a novel single-step diffusion-based image super-resolution (SR) model specifically designed to enhance image fidelity. Existing diffusion-based SR approaches typically adapt pre-trained generative models to image restoration tasks by adding extra conditioning on a VAE-downsampled representation of the degraded input, which often compromises structural fidelity. G… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  10. arXiv:2504.21445  [pdf, other

    eess.IV

    Emerging Advances in Learned Video Compression: Models, Systems and Beyond

    Authors: Chuanmin Jia, Feng Ye, Siwei Ma, Wen Gao, Huifang Sun, Leonardo Chiariglione

    Abstract: Video compression is a fundamental topic in the visual intelligence, bridging visual signal sensing/capturing and high-level visual analytics. The broad success of artificial intelligence (AI) technology has enriched the horizon of video compression into novel paradigms by leveraging end-to-end optimized neural models. In this survey, we first provide a comprehensive and systematic overview of rec… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

  11. arXiv:2504.20441  [pdf, ps, other

    eess.SP

    Task-Oriented Semantic Communication with Importance-Aware Rate Control

    Authors: Zhiye Sun, Shuai Ma, Shiyin Li

    Abstract: Semantic communication is recognized for its high compression efficiency and robust resistance to noise. However, utilizing a fixed transmission rate in environments with dynamic signal-to-noise ratios (SNR) often results in inefficient use of communication resources. To address this challenge, this letter proposes an importance-aware rate control semantic communication (IRCSC) scheme, which dynam… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: 5 pages, 4 figures

  12. arXiv:2504.19660  [pdf, other

    cs.NI eess.SP

    Decentralization of Generative AI via Mixture of Experts for Wireless Networks: A Comprehensive Survey

    Authors: Yunting Xu, Jiacheng Wang, Ruichen Zhang, Changyuan Zhao, Dusit Niyato, Jiawen Kang, Zehui Xiong, Bo Qian, Haibo Zhou, Shiwen Mao, Abbas Jamalipour, Xuemin Shen, Dong In Kim

    Abstract: Mixture of Experts (MoE) has emerged as a promising paradigm for scaling model capacity while preserving computational efficiency, particularly in large-scale machine learning architectures such as large language models (LLMs). Recent advances in MoE have facilitated its adoption in wireless networks to address the increasing complexity and heterogeneity of modern communication systems. This paper… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: Survey paper, 30 pages, 13 figures

  13. arXiv:2504.16146  [pdf, other

    eess.SP cs.IT cs.NI

    Aerial Active STAR-RIS-assisted Satellite-Terrestrial Covert Communications

    Authors: Chuang Zhang, Geng Sun, Jiahui Li, Jiacheng Wang, Ruichen Zhang, Dusit Niyato, Shiwen Mao, Tony Q. S. Quek

    Abstract: An integration of satellites and terrestrial networks is crucial for enhancing performance of next generation communication systems. However, the networks are hindered by the long-distance path loss and security risks in dense urban environments. In this work, we propose a satellite-terrestrial covert communication system assisted by the aerial active simultaneous transmitting and reflecting recon… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  14. arXiv:2504.16119  [pdf, other

    eess.SP physics.optics

    Micro-Ring Perceptron Sensor for High-Speed, Low-Power Radio-Frequency Signal

    Authors: Bo-Han Wu, Shi-Yuan Ma, Sri Krishna Vadlamani, Hyeongrak Choi, Dirk Englund

    Abstract: Radio-frequency (RF) sensing enables long-range, high-resolution detection for applications such as radar and wireless communication. RF photonic sensing mitigates the bandwidth limitations and high transmission losses of electronic systems by transducing the detected RF signals into broadband optical carriers. However, these sensing systems remain limited by detector noise and Nyquist rate sampli… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  15. arXiv:2504.09905  [pdf, other

    eess.SP

    Fusing Bluetooth with Pedestrian Dead Reckoning: A Floor Plan-Assisted Positioning Approach

    Authors: Wenxuan Pan, Yang Yang, Mingzhe Chen, Dong Wei, Caili Guo, Shiwen Mao

    Abstract: Floor plans can provide valuable prior information that helps enhance the accuracy of indoor positioning systems. However, existing research typically faces challenges in efficiently leveraging floor plan information and applying it to complex indoor layouts. To fully exploit information from floor plans for positioning, we propose a floor plan-assisted fusion positioning algorithm (FP-BP) using B… ▽ More

    Submitted 19 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

  16. arXiv:2504.08520  [pdf, other

    eess.SP cs.IT

    Joint Transmit Waveform and Receive Filter Design for ISAC System with Jamming

    Authors: Yuan Shu, Chenhao Qi, Shiwen Mao

    Abstract: In this paper, to suppress jamming in the complex electromagnetic environment, we propose a joint transmit waveform and receive filter design framework for integrated sensing and communications (ISAC). By jointly optimizing the transmit waveform and receive filters, we aim at minimizing the multiuser interference (MUI), subject to the constraints of the target mainlobe, jamming mainlobe and peak s… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

  17. arXiv:2504.02061  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Aligned Better, Listen Better for Audio-Visual Large Language Models

    Authors: Yuxin Guo, Shuailei Ma, Shijie Ma, Xiaoyi Bao, Chen-Wei Xie, Kecheng Zheng, Tingyu Weng, Siyang Sun, Yun Zheng, Wei Zou

    Abstract: Audio is essential for multimodal video understanding. On the one hand, video inherently contains audio, which supplies complementary information to vision. Besides, video large language models (Video-LLMs) can encounter many audio-centric settings. However, existing Video-LLMs and Audio-Visual Large Language Models (AV-LLMs) exhibit deficiencies in exploiting audio information, leading to weak un… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Accepted to ICLR 2025

  18. arXiv:2504.01333  [pdf, other

    eess.SP

    Reconfigurable Codebook-Based Beamforming for RDARS-Aided mmWave MU-MIMO Systems

    Authors: Chengwang Ji, Qing Xue, Haiquan Lu, Jintao Wang, Qiaoyan Peng, Shaodan Ma, Wei Zhang

    Abstract: Reconfigurable distributed antenna and reflecting surface (RDARS) is a new architecture for the sixth-generation (6G) millimeter wave (mmWave) communications. In RDARS-aided mmWave systems, the active and passive beamforming design and working mode configuration for reconfigurable elements are crucial for system performance. In this paper, we aim to maximize the weighted sum rate (WSR) in the RDAR… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  19. arXiv:2503.07139  [pdf, other

    cs.IT eess.SP

    Power Allocation for Coordinated Multi-Point Aided ISAC Systems

    Authors: Jianpeng Zou, Zhanfeng Zhong, Jintao Wang, Zheng Shi, Guanghua Yang, Shaodan Ma

    Abstract: In this letter, we investigate a coordinated multiple point (CoMP)-aided integrated sensing and communication (ISAC) system that supports multiple users and targets. Multiple base stations (BSs) employ a coordinated power allocation strategy to serve their associated single-antenna communication users (CUs) while utilizing the echo signals for joint radar target (RT) detection. The probability of… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 4 pages, 4 figures

  20. arXiv:2503.06149  [pdf, other

    cs.IT eess.SP

    Wireless Hallucination in Generative AI-enabled Communications: Concepts, Issues, and Solutions

    Authors: Xudong Wang, Jiacheng Wang, Lei Feng, Dusit Niyato, Ruichen Zhang, Jiawen Kang, Zehui Xiong, Hongyang Du, Shiwen Mao

    Abstract: Generative AI (GenAI) is driving the intelligence of wireless communications. Due to data limitations, random generation, and dynamic environments, GenAI may generate channel information or optimization strategies that violate physical laws or deviate from actual real-world requirements. We refer to this phenomenon as wireless hallucination, which results in invalid channel information, spectrum w… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

    Comments: 7 pages, 4 figures

  21. arXiv:2503.02725  [pdf, other

    cs.CV eess.IV

    A Joint Visual Compression and Perception Framework for Neuralmorphic Spiking Camera

    Authors: Kexiang Feng, Chuanmin Jia, Siwei Ma, Wen Gao

    Abstract: The advent of neuralmorphic spike cameras has garnered significant attention for their ability to capture continuous motion with unparalleled temporal resolution.However, this imaging attribute necessitates considerable resources for binary spike data storage and transmission.In light of compression and spike-driven intelligent applications, we present the notion of Spike Coding for Intelligence (… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  22. arXiv:2502.19315  [pdf, ps, other

    physics.app-ph cond-mat.mes-hall cond-mat.mtrl-sci eess.SY physics.chem-ph

    Epitaxial high-K AlBN barrier GaN HEMTs

    Authors: Chandrashekhar Savant, Thai-Son Nguyen, Kazuki Nomoto, Saurabh Vishwakarma, Siyuan Ma, Akshey Dhar, Yu-Hsin Chen, Joseph Casamento, David J. Smith, Huili Grace Xing, Debdeep Jena

    Abstract: We report a polarization-induced 2D electron gas (2DEG) at an epitaxial AlBN/GaN heterojunction grown on a SiC substrate. Using this 2DEG in a long conducting channel, we realize ultra-thin barrier AlBN/GaN high electron mobility transistors that exhibit current densities of more than 0.25 A/mm, clean current saturation, a low pinch-off voltage of -0.43 V, and a peak transconductance of 0.14 S/mm.… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: Manuscript: 7 pages, 5 figures and Supplementary data: 2 pages, 4 figures

  23. arXiv:2502.16864  [pdf, other

    eess.SP

    Joint Size and Placement Optimization for IRS-Aided Communications with Active and Passive Elements

    Authors: Qiaoyan Peng, Qingqing Wu, Wen Chen, Chaoying Huang, Beixiong Zheng, Shaodan Ma, Mengnan Jian, Yijian Chen, Jun Yang

    Abstract: Different types of intelligent reflecting surfaces (IRS) are exploited for assisting wireless communications. The joint use of passive IRS (PIRS) and active IRS (AIRS) emerges as a promising solution owing to their complementary advantages. They can be integrated into a single hybrid active-passive IRS (HIRS) or deployed in a distributed manner, which poses challenges in determining the IRS elemen… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  24. arXiv:2502.12622  [pdf, other

    eess.SP

    Generative AI Enabled Robust Data Augmentation for Wireless Sensing in ISAC Networks

    Authors: Jiacheng Wang, Changyuan Zhao, Hongyang Du, Geng Sun, Jiawen Kang, Shiwen Mao, Dusit Niyato, Dong In Kim

    Abstract: Integrated sensing and communication (ISAC) uses the same software and hardware resources to achieve both communication and sensing functionalities. Thus, it stands as one of the core technologies of 6G and has garnered significant attention in recent years. In ISAC systems, a variety of machine learning models are trained to analyze and identify signal patterns, thereby ensuring reliable sensing… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 13 pages, 10 figures

  25. Semantic Feature Division Multiple Access for Digital Semantic Broadcast Channels

    Authors: Shuai Ma, Zhiye Sun, Bin Shen, Youlong Wu, Hang Li, Guangming Shi, Shiyin Li, Naofal Al-Dhahir

    Abstract: In this paper, we propose a digital semantic feature division multiple access (SFDMA) paradigm in multi-user broadcast (BC) networks for the inference and the image reconstruction tasks. In this SFDMA scheme, the multi-user semantic information is encoded into discrete approximately orthogonal representations, and the encoded semantic features of multiple users can be simultaneously transmitted in… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: 14 pages, 13 figures

  26. arXiv:2501.10705  [pdf, other

    cs.IT eess.SP

    Secure Communication in Dynamic RDARS-Driven Systems

    Authors: Ziqian Pei, Jintao Wang, Pingping Zhang, Zheng Shi, Guanghua Yang, Shaodan Ma

    Abstract: In this letter, we investigate a dynamic reconfigurable distributed antenna and reflection surface (RDARS)-driven secure communication system, where the working mode of the RDARS can be flexibly configured. We aim to maximize the secrecy rate by jointly designing the active beamforming vectors, reflection coefficients, and the channel-aware mode selection matrix. To address the non-convex binary a… ▽ More

    Submitted 18 January, 2025; originally announced January 2025.

    Comments: 5 pages, 5 figures

  27. arXiv:2501.01773  [pdf, other

    eess.IV cs.CV

    Compressed Domain Prior-Guided Video Super-Resolution for Cloud Gaming Content

    Authors: Qizhe Wang, Qian Yin, Zhimeng Huang, Weijia Jiang, Yi Su, Siwei Ma, Jiaqi Zhang

    Abstract: Cloud gaming is an advanced form of Internet service that necessitates local terminals to decode within limited resources and time latency. Super-Resolution (SR) techniques are often employed on these terminals as an efficient way to reduce the required bit-rate bandwidth for cloud gaming. However, insufficient attention has been paid to SR of compressed game video content. Most SR networks amplif… ▽ More

    Submitted 3 January, 2025; originally announced January 2025.

    Comments: 10 pages, 4 figures, Data Compression Conference2025

  28. arXiv:2412.19494  [pdf, other

    cs.NI cs.IT eess.SP

    Retrieval-augmented Generation for GenAI-enabled Semantic Communications

    Authors: Shunpu Tang, Ruichen Zhang, Yuxuan Yan, Qianqian Yang, Dusit Niyato, Xianbin Wang, Shiwen Mao

    Abstract: Semantic communication (SemCom) is an emerging paradigm aiming at transmitting only task-relevant semantic information to the receiver, which can significantly improve communication efficiency. Recent advancements in generative artificial intelligence (GenAI) have empowered GenAI-enabled SemCom (GenSemCom) to further expand its potential in various applications. However, current GenSemCom systems… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

  29. arXiv:2412.18817  [pdf, ps, other

    cs.IT eess.SP

    Wireless Communication with Flexible Reflector: Joint Placement and Rotation Optimization for Coverage Enhancement

    Authors: Haiquan Lu, Zhi Yu, Yong Zeng, Shaodan Ma, Shi Jin, Rui Zhang

    Abstract: Passive metal reflectors for communication enhancement have appealing advantages such as ultra low cost, zero energy expenditure, maintenance-free operation, long life span, and full compatibility with legacy wireless systems. To unleash the full potential of passive reflectors for wireless communications, this paper proposes a new passive reflector architecture, termed flexible reflector (FR), fo… ▽ More

    Submitted 4 March, 2025; v1 submitted 25 December, 2024; originally announced December 2024.

    Comments: 14 pages, 16 figures

  30. arXiv:2412.11771  [pdf, other

    eess.IV cs.CV

    Point Cloud-Assisted Neural Image Compression

    Authors: Ziqun Li, Qi Zhang, Xiaofeng Huang, Zhao Wang, Siwei Ma, Wei Yan

    Abstract: High-efficient image compression is a critical requirement. In several scenarios where multiple modalities of data are captured by different sensors, the auxiliary information from other modalities are not fully leveraged by existing image-only codecs, leading to suboptimal compression efficiency. In this paper, we increase image compression performance with the assistance of point cloud, which is… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  31. arXiv:2412.05403  [pdf, other

    eess.SP cs.CE cs.HC cs.LG physics.bio-ph

    Knowledge-Based Deep Learning for Time-Efficient Inverse Dynamics

    Authors: Shuhao Ma, Yu Cao, Ian D. Robertson, Chaoyang Shi, Jindong Liu, Zhi-Qiang Zhang

    Abstract: Accurate understanding of muscle activation and muscle forces plays an essential role in neuro-rehabilitation and musculoskeletal disorder treatments. Computational musculoskeletal modeling has been widely used as a powerful non-invasive tool to estimate them through inverse dynamics using static optimization, but the inherent computational complexity results in time-consuming analysis. In this pa… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

    Comments: 10 pages, 8 figures, Journal paper

  32. arXiv:2412.04213  [pdf, other

    cs.LG cs.HC eess.SP physics.bio-ph

    Physics-informed Deep Learning for Muscle Force Prediction with Unlabeled sEMG Signals

    Authors: Shuhao Ma, Jie Zhang, Chaoyang Shi, Pei Di, Ian D. Robertson, Zhi-Qiang Zhang

    Abstract: Computational biomechanical analysis plays a pivotal role in understanding and improving human movements and physical functions. Although physics-based modeling methods can interpret the dynamic interaction between the neural drive to muscle dynamics and joint kinematics, they suffer from high computational latency. In recent years, data-driven methods have emerged as a promising alternative due t… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: 11pages, 8 figures, journal

    Journal ref: IEEE Transactions on Neural Systems and Rehabilitation Engineering, vol. 32, pp. 1246-1256, 2024

  33. arXiv:2411.17056  [pdf, ps, other

    cs.IT eess.SP

    Robust Max-Min Fair Beamforming Design for Rate Splitting Multiple Access-aided Visible Light Communications

    Authors: Zhengqing Qiu, Yijie Mao, Shuai Ma, Bruno Clerckx

    Abstract: This paper addresses the robust beamforming design for rate splitting multiple access (RSMA)-aided visible light communication (VLC) networks with imperfect channel state information at the transmitter (CSIT). In particular, we first derive the theoretical lower bound for the channel capacity of RSMA-aided VLC networks. Then we investigate the beamforming design to solve the max-min fairness (MMF)… ▽ More

    Submitted 26 November, 2024; v1 submitted 25 November, 2024; originally announced November 2024.

  34. arXiv:2411.14135  [pdf, other

    eess.IV cs.MM

    Compact Visual Data Representation for Green Multimedia -- A Human Visual System Perspective

    Authors: Peilin Chen, Xiaohan Fang, Meng Wang, Shiqi Wang, Siwei Ma

    Abstract: The Human Visual System (HVS), with its intricate sophistication, is capable of achieving ultra-compact information compression for visual signals. This remarkable ability is coupled with high generalization capability and energy efficiency. By contrast, the state-of-the-art Versatile Video Coding (VVC) standard achieves a compression ratio of around 1,000 times for raw visual data. This notable d… ▽ More

    Submitted 26 December, 2024; v1 submitted 21 November, 2024; originally announced November 2024.

  35. arXiv:2411.04762  [pdf, other

    cs.NI eess.SP

    JC5A: Service Delay Minimization for Aerial MEC-assisted Industrial Cyber-Physical Systems

    Authors: Geng Sun, Jiaxu Wu, Zemin Sun, Long He, Jiacheng Wang, Dusit Niyato, Abbas Jamalipour, Shiwen Mao

    Abstract: In the era of the sixth generation (6G) and industrial Internet of Things (IIoT), an industrial cyber-physical system (ICPS) drives the proliferation of sensor devices and computing-intensive tasks. To address the limited resources of IIoT sensor devices, unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) has emerged as a promising solution, providing flexible and cost-effective se… ▽ More

    Submitted 2 December, 2024; v1 submitted 7 November, 2024; originally announced November 2024.

  36. arXiv:2410.14697  [pdf, other

    q-bio.NC cs.AI eess.SP

    Learning Cortico-Muscular Dependence through Orthonormal Decomposition of Density Ratios

    Authors: Shihan Ma, Bo Hu, Tianyu Jia, Alexander Kenneth Clarke, Blanka Zicher, Arnault H. Caillet, Dario Farina, Jose C. Principe

    Abstract: The cortico-spinal neural pathway is fundamental for motor control and movement execution, and in humans it is typically studied using concurrent electroencephalography (EEG) and electromyography (EMG) recordings. However, current approaches for capturing high-level and contextual connectivity between these recordings have important limitations. Here, we present a novel application of statistical… ▽ More

    Submitted 19 December, 2024; v1 submitted 4 October, 2024; originally announced October 2024.

  37. arXiv:2409.13398  [pdf

    cs.IT eess.SP

    Unsourced Sparse Multiple Access foUnsourced Sparse Multiple Access for 6G Massive Communicationr 6G Massive Communication

    Authors: Yifei Yuan, Yuhong Huang, Chunlin Yan, Sen Wang, Shuai Ma, Xiaodong Shen

    Abstract: Massive communication is one of key scenarios of 6G where two magnitude higher connection density would be required to serve diverse services. As a promising direction, unsourced multiple access has been proved to outperform significantly over orthogonal multiple access (OMA) or slotted-ALOHA in massive connections. In this paper we describe a design framework of unsourced sparse multiple access (… ▽ More

    Submitted 15 November, 2024; v1 submitted 20 September, 2024; originally announced September 2024.

    Comments: 7 pages, 5 figures and 1 table

  38. arXiv:2409.10127  [pdf, ps, other

    cs.IT eess.SP

    Joint Beamforming and Illumination Pattern Design for Beam-Hopping LEO Satellite Communications

    Authors: Jing Wang, Chenhao Qi, Shui Yu, Shiwen Mao

    Abstract: Since hybrid beamforming (HBF) can approach the performance of fully-digital beamforming (FDBF) with much lower hardware complexity, we investigate the HBF design for beam-hopping (BH) low earth orbit (LEO) satellite communications (SatComs). Aiming at maximizing the sum-rate of totally illuminated beam positions during the whole BH period, we consider joint beamforming and illumination pattern de… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  39. arXiv:2409.06946  [pdf, other

    cs.IT eess.SP

    Refracting Reconfigurable Intelligent Surface Assisted URLLC for Millimeter Wave High-Speed Train Communication Coverage Enhancement

    Authors: Changzhu Liu, Ruisi He, Yong Niu, Shiwen Mao, Bo Ai, Ruifeng Chen

    Abstract: High-speed train (HST) has garnered significant attention from both academia and industry due to the rapid development of railways worldwide. Millimeter wave (mmWave) communication, known for its large bandwidth is an effective way to address performance bottlenecks in cellular network based HST wireless communication systems. However, mmWave signals suffer from significant path loss when traversi… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 11 figures, accepted by IEEE Transactions on Vehicular Technology

  40. arXiv:2409.00956  [pdf

    eess.IV cs.CV

    Physics-Informed Neural Network Based Digital Image Correlation Method

    Authors: Boda Li, Shichao Zhou, Qinwei Ma, Shaopeng Ma

    Abstract: Digital Image Correlation (DIC) is a key technique in experimental mechanics for full-field deformation measurement, traditionally relying on subset matching to determine displacement fields. However, selecting optimal parameters like shape functions and subset size can be challenging in non-uniform deformation scenarios. Recent deep learning-based DIC approaches, both supervised and unsupervised,… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

  41. arXiv:2408.11398  [pdf, other

    eess.SP

    Generative AI based Secure Wireless Sensing for ISAC Networks

    Authors: Jiacheng Wang, Hongyang Du, Yinqiu Liu, Geng Sun, Dusit Niyato, Shiwen Mao, Dong In Kim, Xuemin Shen

    Abstract: Integrated sensing and communications (ISAC) is expected to be a key technology for 6G, and channel state information (CSI) based sensing is a key component of ISAC. However, current research on ISAC focuses mainly on improving sensing performance, overlooking security issues, particularly the unauthorized sensing of users. In this paper, we propose a secure sensing system (DFSS) based on two dist… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  42. arXiv:2408.08833  [pdf, other

    eess.SP

    Intra-symbol Differential Amplitude Shift Keying-aided Blind Detector for Ambient Backscatter Communication Systems

    Authors: Shuaijun Ma, Peng Wei, Sa Xiao, Jianquan Wang, Wanbin Tang, Wei Xiang

    Abstract: Ambient backscatter communications (AmBC) are a promising technology for addressing the energy consumption challenge in wireless communications through the reflection or absorption of surrounding radio frequency (RF) signals. However, it grapples with the intricacies of ambient RF signal and the round-trip path loss. For traditional detectors, the incorporation of pilot sequences results in a redu… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  43. arXiv:2407.15395  [pdf, other

    eess.SP

    FAST-GSC: Fast and Adaptive Semantic Transmission for Generative Semantic Communication

    Authors: Yiru Wang, Wanting Yang, Zehui Xiong, Yuping Zhao, Shiwen Mao, Tony Q. S. Quek, H. Vincent Poor

    Abstract: The rapidly evolving field of generative artificial intelligence technology has introduced innovative approaches for developing semantic communication (SemCom) frameworks, leading to the emergence of a new paradigm-generative SemCom (GSC). However, the complex processes involved in semantic extraction and generative inference may result in considerable latency in resource-constrained scenarios. To… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  44. arXiv:2407.08919  [pdf, other

    cs.NI cs.ET eess.SP

    Redefinition of Digital Twin and its Situation Awareness Framework Designing Towards Fourth Paradigm for Energy Internet of Things

    Authors: Xing He, Yuezhong Tang, Shuyan Ma, Qian Ai, Fei Tao, Robert Qiu

    Abstract: Traditional knowledge-based situation awareness (SA) modes struggle to adapt to the escalating complexity of today's Energy Internet of Things (EIoT), necessitating a pivotal paradigm shift. In response, this work introduces a pioneering data-driven SA framework, termed digital twin-based situation awareness (DT-SA), aiming to bridge existing gaps between data and demands, and further to enhance S… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 16 pages, 15 figures Accepted by IEEE Transactions on Systems, Man and Cybernetics: Systems

  45. arXiv:2407.08424  [pdf, other

    eess.SP

    Semantic Feature Division Multiple Access for Multi-user Digital Interference Networks

    Authors: Shuai Ma, Chuanhui Zhang, Bin Shen, Youlong Wu, Hang Li, Shiyin Li, Guangming Shi, Naofal Al-Dhahir

    Abstract: With the ever-increasing user density and quality of service (QoS) demand,5G networks with limited spectrum resources are facing massive access challenges. To address these challenges, in this paper, we propose a novel discrete semantic feature division multiple access (SFDMA) paradigm for multi-user digital interference networks. Specifically, by utilizing deep learning technology, SFDMA extracts… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  46. arXiv:2407.04675  [pdf, other

    eess.AS cs.SD

    Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

    Authors: Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li , et al. (30 additional authors not shown)

    Abstract: Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data matching scenarios and are gradually approaching a bottleneck. In this wor… ▽ More

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  47. arXiv:2407.01006  [pdf, other

    eess.SP

    Multi-Functional Beamforming Design for Integrated Sensing, Communication, and Computation

    Authors: Yapeng Zhao, Qingqing Wu, Wen Chen, Yong Zeng, Ruiqi Liu, Weidong Mei, Fen Hou, Shaodan Ma

    Abstract: Integrated sensing and communication (ISAC) systems may face a heavy computation burden since the sensory data needs to be further processed. This paper studies a novel system that integrates sensing, communication, and computation, aiming to provide services for different objectives efficiently. This system consists of a multi-antenna multi-functional base station (BS), an edge server, a target,… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  48. arXiv:2406.09627  [pdf, other

    cs.CV cs.AI eess.IV

    RobustSAM: Segment Anything Robustly on Degraded Images

    Authors: Wei-Ting Chen, Yu-Jiet Vong, Sy-Yen Kuo, Sizhuo Ma, Jian Wang

    Abstract: Segment Anything Model (SAM) has emerged as a transformative approach in image segmentation, acclaimed for its robust zero-shot segmentation capabilities and flexible prompting system. Nonetheless, its performance is challenged by images with degraded quality. Addressing this limitation, we propose the Robust Segment Anything Model (RobustSAM), which enhances SAM's performance on low-quality image… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR2024 (Highlight); Project Page: https://robustsam.github.io/

  49. arXiv:2406.09622  [pdf, other

    cs.CV cs.AI eess.IV

    DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer

    Authors: Wei-Ting Chen, Gurunandan Krishnan, Qiang Gao, Sy-Yen Kuo, Sizhuo Ma, Jian Wang

    Abstract: Generic Face Image Quality Assessment (GFIQA) evaluates the perceptual quality of facial images, which is crucial in improving image restoration algorithms and selecting high-quality face images for downstream tasks. We present a novel transformer-based method for GFIQA, which is aided by two unique mechanisms. First, a Dual-Set Degradation Representation Learning (DSL) mechanism uses facial image… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR 2024, Project Page: https://dsl-fiqa.github.io/

  50. arXiv:2406.09389  [pdf, other

    eess.IV cs.CV

    Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior

    Authors: Baiang Li, Sizhuo Ma, Yanhong Zeng, Xiaogang Xu, Youqing Fang, Zhao Zhang, Jian Wang, Kai Chen

    Abstract: Capturing High Dynamic Range (HDR) scenery using 8-bit cameras often suffers from over-/underexposure, loss of fine details due to low bit-depth compression, skewed color distributions, and strong noise in dark areas. Traditional LDR image enhancement methods primarily focus on color mapping, which enhances the visual representation by expanding the image's color range and adjusting the brightness… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: https://sagiri0208.github.io