Showing 1–50 of 153 results for author: Gao, S

Searching in archive eess.
  1. arXiv:2507.04821  [pdf, ps, other]

    eess.SY

    Force-IMU Fusion-Based Sensing Acupuncture Needle and Quantitative Analysis System for Acupuncture Manipulations

    Authors: Peng Tian, Kang Yu, Tianyun Jiang, Yuqi Wang, Haiying Zhang, Hao Yang, Yunfeng Wang, Jun Zhang, Shuo Gao, Junhong Gao

    Abstract: Acupuncture, one of the key therapeutic methods in Traditional Chinese Medicine (TCM), has been widely adopted in various clinical fields. Quantitative research on acupuncture manipulation parameters is critical to achieve standardized techniques. However, quantitative mechanical detection of acupuncture parameters remains limited. This study establishes a kinematic and dynamic model of acupunctur… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  2. arXiv:2506.12831  [pdf, ps, other]

    eess.SP cs.AI

    Synesthesia of Machines (SoM)-Enhanced Sub-THz ISAC Transmission for Air-Ground Network

    Authors: Zonghui Yang, Shijian Gao, Xiang Cheng, Liuqing Yang

    Abstract: Integrated sensing and communication (ISAC) within sub-THz frequencies is crucial for future air-ground networks, but unique propagation characteristics and hardware limitations present challenges in optimizing ISAC performance while increasing operational latency. This paper introduces a multi-modal sensing fusion framework inspired by synesthesia of machine (SoM) to enhance sub-THz ISAC transmis… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  3. arXiv:2506.09344  [pdf, ps, other]

    cs.AI cs.CL cs.CV cs.LG cs.SD eess.AS

    Ming-Omni: A Unified Multimodal Model for Perception and Generation

    Authors: Inclusion AI, Biao Gong, Cheng Zou, Chuanyang Zheng, Chunluan Zhou, Canxiang Yan, Chunxiang Jin, Chunjie Shen, Dandan Zheng, Fudong Wang, Furong Xu, GuangMing Yao, Jun Zhou, Jingdong Chen, Jianxin Sun, Jiajia Liu, Jianjiang Zhu, Jun Peng, Kaixiang Ji, Kaiyou Song, Kaimeng Ren, Libin Wang, Lixiang Ru, Lele Xie, Longhua Tan, et al. (33 additional authors not shown)

    Abstract: We propose Ming-Omni, a unified multimodal model capable of processing images, text, audio, and video, while demonstrating strong proficiency in both speech and image generation. Ming-Omni employs dedicated encoders to extract tokens from different modalities, which are then processed by Ling, an MoE architecture equipped with newly proposed modality-specific routers. This design enables a single… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 18 pages, 8 figures

  4. arXiv:2506.07535  [pdf, other]

    eess.SP

    Synesthesia of Machines (SoM)-Aided Online FDD Precoding via Heterogeneous Multi-Modal Sensing: A Vertical Federated Learning Approach

    Authors: Haotian Zhang, Shijian Gao, Weibo Wen, Xiang Cheng, Liuqing Yang

    Abstract: This paper investigates a heterogeneous multi-vehicle, multi-modal sensing (H-MVMM) aided online precoding problem. The proposed H-MVMM scheme utilizes a vertical federated learning (VFL) framework to minimize pilot sequence length and optimize the sum rate. This offers a promising solution for reducing latency in frequency division duplexing systems. To achieve this, three preprocessing modules a… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2501.10941

  5. arXiv:2505.21522  [pdf, ps, other]

    cs.CV cs.AI cs.LG eess.IV

    CIM-NET: A Video Denoising Deep Neural Network Model Optimized for Computing-in-Memory Architectures

    Authors: Shan Gao, Zhiqiang Wu, Yawen Niu, Xiaotao Li, Qingqing Xu

    Abstract: While deep neural network (DNN)-based video denoising has demonstrated significant performance, deploying state-of-the-art models on edge devices remains challenging due to stringent real-time and energy efficiency requirements. Computing-in-Memory (CIM) chips offer a promising solution by integrating computation within memory cells, enabling rapid matrix-vector multiplication (MVM). However, exis… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  6. arXiv:2505.12657  [pdf, ps, other]

    eess.SY cs.SI math.OC

    Transmission Neural Networks: Approximation and Optimal Control

    Authors: Shuang Gao, Peter E. Caines

    Abstract: Transmission Neural Networks (TransNNs) introduced by Gao and Caines (2022) connect virus spread models over networks and neural networks with tuneable activation functions. This paper presents the approximation technique and the underlying assumptions employed by TransNNs in relation to the corresponding Markovian Susceptible-Infected-Susceptible (SIS) model with 2^n states, where n is the number… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

    Journal ref: IFAC Conference on Networked Systems, 2025

  7. arXiv:2505.08556  [pdf]

    eess.SY

    A High-Efficiency Reconfigurable Bidirectional Array Antenna Based on Transmit-Reflect Switchable Metasurface

    Authors: Fan Qin, Jinyang Bi, Jiao Ma, Chao Gu, Hailin Zhang, Wenchi Cheng, Steven Gao

    Abstract: This paper proposes a reconfigurable bidirectional array antenna with high-efficiency radiations and flexible beam-switching capability by employing a novel transmit-reflect switchable metasurface (TRSM). To realize the electromagnetic (EM) wave transmitted or reflected manipulation, a dedicated transmit-reflect switch layer (TRSL) with periodically soldered PIN diodes is introduced between two tr… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 11 pages, 18 figures, published to TAP

  8. arXiv:2505.04467  [pdf, other]

    eess.SP

    Image Steganography For Securing Intellicise Wireless Networks: "Invisible Encryption" Against Eavesdroppers

    Authors: Bizhu Wang, Song Gao, Rui Meng, Haixiao Gao, Xiaodong Xu, Mengying Sun, Chen Dong, Ping Zhang, Dusit Niyato

    Abstract: As one of the most promising technologies for intellicise (intelligent and concise) wireless networks, Semantic Communication (SemCom) significantly improves communication efficiency by extracting, transmitting, and recovering semantic information, while reducing transmission delay. However, an integration of communication and artificial intelligence (AI) also exposes SemCom to security and privac… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 10 pages, 4 figures

  9. arXiv:2504.21209  [pdf]

    eess.SP cs.LG

    Generalised Label-free Artefact Cleaning for Real-time Medical Pulsatile Time Series

    Authors: Xuhang Chen, Ihsane Olakorede, Stefan Yu Bögli, Wenhao Xu, Erta Beqiri, Xuemeng Li, Chenyu Tang, Zeyu Gao, Shuo Gao, Ari Ercole, Peter Smielewski

    Abstract: Artefacts compromise clinical decision-making in the use of medical time series. Pulsatile waveforms offer probabilities for accurate artefact detection, yet most approaches rely on supervised manners and overlook patient-level distribution shifts. To address these issues, we introduce a generalised label-free framework, GenClean, for real-time artefact cleaning and leverage an in-house dataset of… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  10. arXiv:2504.18175  [pdf, other]

    eess.SP

    Generative AI for Physical-Layer Authentication

    Authors: Rui Meng, Xiqi Cheng, Song Gao, Xiaodong Xu, Chen Dong, Guoshun Nan, Xiaofeng Tao, Ping Zhang, Tony Q. S. Quek

    Abstract: In recent years, Artificial Intelligence (AI)-driven Physical-Layer Authentication (PLA), which focuses on achieving endogenous security and intelligent identity authentication, has attracted considerable interest. When compared with Discriminative AI (DAI), Generative AI (GAI) offers several advantages, such as fingerprint data augmentation, fingerprint denoising and reconstruction, and protectio… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: 10 pages, 3 figures

  11. arXiv:2504.15649  [pdf, other]

    eess.IV cs.CV

    RepNet-VSR: Reparameterizable Architecture for High-Fidelity Video Super-Resolution

    Authors: Biao Wu, Diankai Zhang, Shaoli Liu, Si Gao, Chengjian Zheng, Ning Wang

    Abstract: As a fundamental challenge in visual computing, video super-resolution (VSR) focuses on reconstructing high-definition video sequences from their degraded low-resolution counterparts. While deep convolutional neural networks have demonstrated state-of-the-art performance in spatial-temporal super-resolution tasks, their computationally intensive nature poses significant deployment challenges for res… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: Champion Solution for CVPR 2025 MAI VSR Track

  12. arXiv:2504.14835  [pdf, other]

    eess.SP

    Aligning Beam with Imbalanced Multi-modality: A Generative Federated Learning Approach

    Authors: Jiahui Liang, Miaowen Wen, Shuoyao Wang, Yuxuan Liang, Shijian Gao

    Abstract: As vehicle intelligence advances, multi-modal sensing-aided communication emerges as a key enabler for reliable Vehicle-to-Everything (V2X) connectivity through precise environmental characterization. As centralized learning may suffer from data privacy, model heterogeneity and communication overhead issues, federated learning (FL) has been introduced to support V2X. However, the practical deploym… ▽ More

    Submitted 1 May, 2025; v1 submitted 20 April, 2025; originally announced April 2025.

  13. arXiv:2503.20256  [pdf, other]

    cs.NI eess.SP

    Sequential Task Assignment and Resource Allocation in V2X-Enabled Mobile Edge Computing

    Authors: Yufei Ye, Shijian Gao, Xinhu Zheng, Liuqing Yang

    Abstract: Nowadays, the convergence of Mobile Edge Computing (MEC) and vehicular networks has emerged as a vital facilitator for the ever-increasing intelligent onboard applications. This paper introduces a multi-tier task offloading mechanism for MEC-enabled vehicular networks leveraging vehicle-to-everything (V2X) communications. The study focuses on applications with sequential subtasks and explores two… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  14. arXiv:2503.10641  [pdf, other]

    eess.SY cs.AI cs.RO

    Estimating Control Barriers from Offline Data

    Authors: Hongzhan Yu, Seth Farrell, Ryo Yoshimitsu, Zhizhen Qin, Henrik I. Christensen, Sicun Gao

    Abstract: Learning-based methods for constructing control barrier functions (CBFs) are gaining popularity for ensuring safe robot control. A major limitation of existing methods is their reliance on extensive sampling over the state space or online system interaction in simulation. In this work we propose a novel framework for learning neural CBFs through a fixed, sparsely-labeled dataset collected prior to… ▽ More

    Submitted 20 February, 2025; originally announced March 2025.

    Comments: This paper has been accepted to ICRA 2025

  15. arXiv:2502.11946  [pdf, other]

    cs.CL cs.AI cs.HC cs.SD eess.AS

    Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

    Authors: Ailin Huang, Boyong Wu, Bruce Wang, Chao Yan, Chen Hu, Chengli Feng, Fei Tian, Feiyu Shen, Jingbei Li, Mingrui Chen, Peng Liu, Ruihang Miao, Wang You, Xi Chen, Xuerui Yang, Yechang Huang, Yuxiang Zhang, Zheng Gong, Zixin Zhang, Hongyu Zhou, Jianjian Sun, Brian Li, Chengting Feng, Changyi Wan, Hanpeng Hu, et al. (120 additional authors not shown)

    Abstract: Real-time speech interaction, serving as a fundamental interface for human-machine collaboration, holds immense potential. However, current open-source models face limitations such as high costs in voice data collection, weakness in dynamic control, and limited intelligence. To address these challenges, this paper introduces Step-Audio, the first production-ready open-source solution. Key contribu… ▽ More

    Submitted 18 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  16. arXiv:2501.12983  [pdf, other]

    eess.SP

    LLM4WM: Adapting LLM for Wireless Multi-Tasking

    Authors: Xuanyu Liu, Shijian Gao, Boxun Liu, Xiang Cheng, Liuqing Yang

    Abstract: The wireless channel is fundamental to communication, encompassing numerous tasks collectively referred to as channel-associated tasks. These tasks can leverage joint learning based on channel characteristics to share representations and enhance system design. To capitalize on this advantage, LLM4WM is proposed--a large language model (LLM) multi-task fine-tuning framework specifically tailored fo… ▽ More

    Submitted 7 February, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

  17. arXiv:2501.10941  [pdf, other]

    eess.SP

    Synesthesia of Machines (SoM)-Aided FDD Precoding with Sensing Heterogeneity: A Vertical Federated Learning Approach

    Authors: Haotian Zhang, Shijian Gao, Weibo Wen, Xiang Cheng

    Abstract: High complexity in precoding design for frequency division duplex systems necessitates streamlined solutions. Guided by Synesthesia of Machines (SoM), this paper introduces a heterogeneous multi-vehicle, multi-modal sensing aided precoding scheme within a vertical federated learning (VFL) framework, which significantly minimizes pilot sequence length while optimizing the system's sum rate. We addr… ▽ More

    Submitted 13 March, 2025; v1 submitted 18 January, 2025; originally announced January 2025.

    Comments: 7 pages, 7 figures

  18. arXiv:2501.10676  [pdf, other]

    eess.SP

    Predictive Target-to-User Association in Complex Scenarios via Hybrid-Field ISAC Signaling

    Authors: Yifeng Yuan, Miaowen Wen, Xinhu Zheng, Shuoyao Wang, Shijian Gao

    Abstract: This paper presents a novel and robust target-to-user (T2U) association framework to support reliable vehicle-to-infrastructure (V2I) networks that potentially operate within the hybrid field (near-field and far-field). To address the challenges posed by complex vehicle maneuvers and user association ambiguity, an interacting multiple-model filtering scheme is developed, which combines coordinated… ▽ More

    Submitted 15 April, 2025; v1 submitted 18 January, 2025; originally announced January 2025.

  19. arXiv:2501.00842  [pdf, other]

    cs.CR eess.IV eess.SP

    A Survey of Secure Semantic Communications

    Authors: Rui Meng, Song Gao, Dayu Fan, Haixiao Gao, Yining Wang, Xiaodong Xu, Bizhu Wang, Suyu Lv, Zhidi Zhang, Mengying Sun, Shujun Han, Chen Dong, Xiaofeng Tao, Ping Zhang

    Abstract: Semantic communication (SemCom) is regarded as a promising and revolutionary technology in 6G, aiming to transcend the constraints of "Shannon's trap" by filtering out redundant information and extracting the core of effective data. Compared to traditional communication paradigms, SemCom offers several notable advantages, such as reducing the burden on data transmission, enhancing network managem… ▽ More

    Submitted 26 March, 2025; v1 submitted 1 January, 2025; originally announced January 2025.

    Comments: 160 pages, 27 figures

  20. arXiv:2501.00018  [pdf, other]

    cs.SD eess.AS

    SECodec: Structural Entropy-based Compressive Speech Representation Codec for Speech Language Models

    Authors: Linqin Wang, Yaping Liu, Zhengtao Yu, Shengxiang Gao, Cunli Mao, Yuxin Huang, Wenjun Wang, Ling Dong

    Abstract: With the rapid advancement of large language models (LLMs), discrete speech representations have become crucial for integrating speech into LLMs. Existing methods for speech representation discretization rely on a predefined codebook size and Euclidean distance-based quantization. However, 1) the size of codebook is a critical parameter that affects both codec performance and downstream task train… ▽ More

    Submitted 15 December, 2024; originally announced January 2025.

    Comments: Accepted to the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI-25)

  21. arXiv:2412.13532  [pdf, other]

    eess.SP

    Synesthesia of Machine (SoM)-Driven Analog Precoder Optimization for Enhanced ISAC Performance in Sub-THz Systems

    Authors: Zonghui Yang, Shijian Gao, Xiang Cheng

    Abstract: Integrated sensing and communication (ISAC) is anticipated to be widely used in future sub-terahertz (sub-THz) systems. With the line-of-sight (LoS) propagation characteristics of sub-THz channels, ISAC transmitter design largely parallels analog precoder optimization. However, balancing both sensing and communication functionalities is challenging due to the beam squint effect in sub-THz systems,… ▽ More

    Submitted 2 March, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

  22. arXiv:2412.08908  [pdf, other]

    eess.SP

    WiFo: Wireless Foundation Model for Channel Prediction

    Authors: Boxun Liu, Shijian Gao, Xuanyu Liu, Xiang Cheng, Liuqing Yang

    Abstract: Channel prediction permits to acquire channel state information (CSI) without signaling overhead. However, almost all existing channel prediction methods necessitate the deployment of a dedicated model to accommodate a specific configuration. Leveraging the powerful modeling and multi-task learning capabilities of foundation models, we propose the first space-time-frequency (STF) wireless foundati… ▽ More

    Submitted 19 March, 2025; v1 submitted 11 December, 2024; originally announced December 2024.

  23. arXiv:2412.00562  [pdf, other]

    eess.SP

    Pruned Convolutional Attention Network Based Wideband Spectrum Sensing with Sub-Nyquist Sampling

    Authors: Peihao Dong, Jibin Jia, Shen Gao, Fuhui Zhou, Qihui Wu

    Abstract: Wideband spectrum sensing (WSS) is critical for orchestrating multitudinous wireless transmissions via spectrum sharing, but may incur excessive costs of hardware, power and computation due to the high sampling rate. In this article, a deep learning based WSS framework embedding the multicoset preprocessing is proposed to enable the low-cost sub-Nyquist sampling. A pruned convolutional attention W… ▽ More

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: Accepted by IEEE Transactions on Vehicular Technology

  24. arXiv:2411.19000  [pdf]

    cs.HC cs.AI eess.SY

    An AI-driven multimodal smart home platform for continuous monitoring and intelligent assistance in post-stroke patients

    Authors: Chenyu Tang, Ruizhi Zhang, Shuo Gao, Zihe Zhao, Zibo Zhang, Jiaqi Wang, Cong Li, Junliang Chen, Yanning Dai, Shengbo Wang, Ruoyu Juan, Qiaoying Li, Ruimou Xie, Xuhang Chen, Xinkai Zhou, Yunjia Xia, Jianan Chen, Fanghao Lu, Xin Li, Ninglli Wang, Peter Smielewski, Yu Pan, Hubin Zhao, Luigi G. Occhipinti

    Abstract: At-home rehabilitation for post-stroke patients presents significant challenges, as continuous, personalized care is often limited outside clinical settings. Additionally, the absence of comprehensive solutions addressing diverse monitoring and assistance needs in home environments complicates recovery efforts. Here, we present a multimodal smart home platform designed for continuous, at-home reha… ▽ More

    Submitted 15 April, 2025; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: 5 figures, 41 references

  25. arXiv:2411.18266  [pdf]

    eess.AS cs.AI cs.SD eess.SY

    Wearable intelligent throat enables natural speech in stroke patients with dysarthria

    Authors: Chenyu Tang, Shuo Gao, Cong Li, Wentian Yi, Yuxuan Jin, Xiaoxue Zhai, Sixuan Lei, Hongbei Meng, Zibo Zhang, Muzi Xu, Shengbo Wang, Xuhang Chen, Chenxi Wang, Hongyun Yang, Ningli Wang, Wenyu Wang, Jin Cao, Xiaodong Feng, Peter Smielewski, Yu Pan, Wenhui Song, Martin Birchall, Luigi G. Occhipinti

    Abstract: Wearable silent speech systems hold significant potential for restoring communication in patients with speech impairments. However, seamless, coherent speech remains elusive, and clinical efficacy is still unproven. Here, we present an AI-driven intelligent throat (IT) system that integrates throat muscle vibrations and carotid pulse signal sensors with large language model (LLM) processing to ena… ▽ More

    Submitted 14 March, 2025; v1 submitted 27 November, 2024; originally announced November 2024.

    Comments: 5 figures, 45 references

  26. arXiv:2411.06983  [pdf, other]

    eess.SP

    Sensing Capacity for Integrated Sensing and Communication Systems in Low-Altitude Economy

    Authors: Jiahua Wan, Hong Ren, Cunhua Pan, Zhenkun Zhang, Songtao Gao, Yiming Yu, Chengzhong Wang

    Abstract: The burgeoning significance of the low-altitude economy (LAE) has garnered considerable interest, largely fuelled by the widespread deployment of unmanned aerial vehicles (UAVs). To tackle the challenges associated with the detection of unauthorized UAVs and the efficient scheduling of authorized UAVs, this letter introduces a novel performance metric, termed sensing capacity, for integrated sensi… ▽ More

    Submitted 11 November, 2024; originally announced November 2024.

  27. arXiv:2411.01668  [pdf, ps, other]

    math.OC eess.SY

    Linear Quadratic Mean Field Games with Quantile-Dependent Cost Coefficients

    Authors: Shuang Gao, Roland P. Malhamé

    Abstract: This paper studies a class of linear quadratic mean field games where the coefficients of quadratic cost functions depend on both the mean and the variance of the population's state distribution through its quantile function. Such a formulation allows for modelling agents that are sensitive to not only the population average but also the population variance. The corresponding mean field game equil… ▽ More

    Submitted 3 November, 2024; originally announced November 2024.

    Comments: 15 pages

  28. arXiv:2410.20326  [pdf, other]

    eess.SY cs.RO

    SEEV: Synthesis with Efficient Exact Verification for ReLU Neural Barrier Functions

    Authors: Hongchao Zhang, Zhizhen Qin, Sicun Gao, Andrew Clark

    Abstract: Neural Control Barrier Functions (NCBFs) have shown significant promise in enforcing safety constraints on nonlinear autonomous systems. State-of-the-art exact approaches to verifying safety of NCBF-based controllers exploit the piecewise-linear structure of ReLU neural networks, however, such approaches still rely on enumerating all of the activation regions of the network near the safety boundar… ▽ More

    Submitted 26 October, 2024; originally announced October 2024.

  29. arXiv:2410.19279  [pdf, other]

    eess.SP cs.AI

    UbiHR: Resource-efficient Long-range Heart Rate Sensing on Ubiquitous Devices

    Authors: Haoyu Bian, Bin Guo, Sicong Liu, Yasan Ding, Shanshan Gao, Zhiwen Yu

    Abstract: Ubiquitous on-device heart rate sensing is vital for high-stress individuals and chronic patients. Non-contact sensing, compared to contact-based tools, allows for natural user monitoring, potentially enabling more accurate and holistic data collection. However, in open and uncontrolled mobile environments, user movement and lighting introduce. Existing methods, such as curve-based or short-range… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

  30. arXiv:2410.16734  [pdf]

    cs.NE eess.SP physics.app-ph

    High-Order Associative Learning Based on Memristive Circuits for Efficient Learning

    Authors: Shengbo Wang, Xuemeng Li, Jialin Ding, Weihao Ma, Ying Wang, Luigi Occhipinti, Arokia Nathan, Shuo Gao

    Abstract: Memristive associative learning has gained significant attention for its ability to mimic fundamental biological learning mechanisms while maintaining system simplicity. In this work, we introduce a high-order memristive associative learning framework with a biologically realistic structure. By utilizing memristors as synaptic modules and their state information to bridge different orders of assoc… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: 5 pages, 7 figures

  31. arXiv:2409.09754  [pdf, other]

    cs.CV cs.RO eess.IV physics.optics

    Towards Single-Lens Controllable Depth-of-Field Imaging via Depth-Aware Point Spread Functions

    Authors: Xiaolong Qian, Qi Jiang, Yao Gao, Shaohua Gao, Zhonghua Yi, Lei Sun, Kai Wei, Haifeng Li, Kailun Yang, Kaiwei Wang, Jian Bai

    Abstract: Controllable Depth-of-Field (DoF) imaging commonly produces amazing visual effects based on heavy and expensive high-end lenses. However, confronted with the increasing demand for mobile scenarios, it is desirable to achieve a lightweight solution with Minimalist Optical Systems (MOS). This work centers around two major limitations of MOS, i.e., the severe optical aberrations and uncontrollable Do… ▽ More

    Submitted 11 February, 2025; v1 submitted 15 September, 2024; originally announced September 2024.

    Comments: Accepted to IEEE Transactions on Computational Imaging (TCI). The source code and the established dataset will be publicly available at https://github.com/XiaolongQian/DCDI

  32. arXiv:2409.06307  [pdf, other]

    cs.SD cs.AI eess.AS

    An End-to-End Approach for Chord-Conditioned Song Generation

    Authors: Shuochen Gao, Shun Lei, Fan Zhuo, Hangyu Liu, Feng Liu, Boshi Tang, Qiaochu Huang, Shiyin Kang, Zhiyong Wu

    Abstract: The Song Generation task aims to synthesize music composed of vocals and accompaniment from given lyrics. While the existing method, Jukebox, has explored this task, its constrained control over the generations often leads to deficiency in music performance. To mitigate the issue, we introduce an important concept from music composition, namely chords, to song generation networks. Chords form the… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

  33. arXiv:2409.05809  [pdf, other]

    physics.optics cs.CV eess.IV

    A Flexible Framework for Universal Computational Aberration Correction via Automatic Lens Library Generation and Domain Adaptation

    Authors: Qi Jiang, Yao Gao, Shaohua Gao, Zhonghua Yi, Lei Sun, Hao Shi, Kailun Yang, Kaiwei Wang, Jian Bai

    Abstract: Emerging universal Computational Aberration Correction (CAC) paradigms provide an inspiring solution to light-weight and high-quality imaging without repeated data preparation and model training to accommodate new lens designs. However, the training databases in these approaches, i.e., the lens libraries (LensLibs), suffer from their limited coverage of real-world aberration behaviors. In this wor… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  34. arXiv:2408.15140  [pdf]

    physics.app-ph eess.SP

    GEM: A GEneral Memristive Transistor Model

    Authors: Shengbo Wang, Jingfang Pei, Cong Li, Xuemeng Li, Li Tao, Arokia Nathan, Guohua Hu, Shuo Gao

    Abstract: Neuromorphic devices, with their distinct advantages in energy efficiency and parallel processing, are pivotal in advancing artificial intelligence applications. Among these devices, memristive transistors have attracted significant attention due to their superior stability and operation flexibility compared to two-terminal memristors. However, the lack of a robust model that accurately captures t… ▽ More

    Submitted 7 November, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

    Comments: 5 pages, 5 figures

  35. arXiv:2408.13546  [pdf, other]

    eess.SP cs.AI

    Synesthesia of Machines (SoM)-Enhanced ISAC Precoding for Vehicular Networks with Double Dynamics

    Authors: Zonghui Yang, Shijian Gao, Xiang Cheng, Liuqing Yang

    Abstract: Integrated sensing and communication (ISAC) technology is vital for vehicular networks, yet the time-varying communication channels and rapid movement of targets present significant challenges for real-time precoding design. Traditional optimization-based methods are computationally complex and depend on perfect prior information, which is often unavailable in double-dynamic scenarios. In this pap… ▽ More

    Submitted 3 December, 2024; v1 submitted 24 August, 2024; originally announced August 2024.

    Comments: Submitted to IEEE for possible publication

  36. arXiv:2408.12252  [pdf, other]

    eess.SP

    Synesthesia of Machines (SoM)-Enhanced Wideband Multi-User CSI Learning With LiDAR Sensing

    Authors: Haotian Zhang, Shijian Gao, Xiang Cheng, Liuqing Yang

    Abstract: Light detection and ranging (LiDAR) has been utilized for optimizing wireless communications due to its ability to detect the environment. This paper explores the use of LiDAR in channel estimation for wideband multi-user multiple-input-multiple-output orthogonal frequency division multiplexing systems and introduces a LiDAR-enhanced Channel State Information (CSI) learning network (LE-CLN). By ut… ▽ More

    Submitted 18 March, 2025; v1 submitted 22 August, 2024; originally announced August 2024.

    Comments: 6 pages, 4 figures, 1 table

  37. arXiv:2408.03685  [pdf, ps, other]

    cs.LG eess.SY

    RL-ADN: A High-Performance Deep Reinforcement Learning Environment for Optimal Energy Storage Systems Dispatch in Active Distribution Networks

    Authors: Shengren Hou, Shuyi Gao, Weijie Xia, Edgar Mauricio Salazar Duque, Peter Palensky, Pedro P. Vergara

    Abstract: Deep Reinforcement Learning (DRL) presents a promising avenue for optimizing Energy Storage Systems (ESSs) dispatch in distribution networks. This paper introduces RL-ADN, an innovative open-source library specifically designed for solving the optimal ESSs dispatch in active distribution networks. RL-ADN offers unparalleled flexibility in modeling distribution networks, and ESSs, accommodating a w… ▽ More

    Submitted 8 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

  38. Multibeam Hybrid Transmitarray Based on Polarization Rotating Metasurface With Reconfigurable Bidirectional Radiation

    Authors: Fan Qin, Yifei Liu, Chao Gu, Linfeng Zeng, Wenchi Cheng, Hailin Zhang, Steven Gao

    Abstract: This paper proposes a bidirectional multibeam hybrid transmitarray (HTA) employing a transmission polarization-rotating metasurface (TPRM). A novel configuration is introduced to facilitate bidirectional beam scanning by combining the transmitarray (TA) and folded-transmitarray (FTA). To accomplish the reconfiguration of both unidirectional and bidirectional radiation states in the +z, -z, and +/-… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 12 pages, 26 figures, published to TAP

  39. arXiv:2408.00429  [pdf, other]

    eess.SP cs.AI

    Augmenting Channel Simulator and Semi-Supervised Learning for Efficient Indoor Positioning

    Authors: Yupeng Li, Xinyu Ning, Shijian Gao, Yitong Liu, Zhi Sun, Qixing Wang, Jiangzhou Wang

    Abstract: This work aims to tackle the labor-intensive and resource-consuming task of indoor positioning by proposing an efficient approach. The proposed approach involves the introduction of a semi-supervised learning (SSL) with a biased teacher (SSLB) algorithm, which effectively utilizes both labeled and unlabeled channel data. To reduce measurement expenses, unlabeled data is generated using an updated… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: ACCEPTED for presentation at 2024 IEEE Global Communications Conference

  40. arXiv:2407.20772  [pdf, other]

    eess.SP cs.NI

    Edge Learning Based Collaborative Automatic Modulation Classification for Hierarchical Cognitive Radio Networks

    Authors: Peihao Dong, Chaowei He, Shen Gao, Fuhui Zhou, Qihui Wu

    Abstract: In hierarchical cognitive radio networks, edge or cloud servers utilize the data collected by edge devices for modulation classification, which, however, is faced with problems of the computation load, transmission overhead, and data privacy. In this article, an edge learning (EL) based framework jointly mobilizing the edge device and the edge server for intelligent co-inference is proposed to rea… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Accepted by IEEE Internet of Things Journal

  41. arXiv:2407.15139  [pdf, ps, other]

    eess.SY

    An Interface Method for Co-simulation of EMT Model and Shifted Frequency EMT Model Based on Rotational Invariance Techniques

    Authors: Shilin Gao, Ying Chen, Zhitong Yu, Wensheng Chen, Yankan Song

    Abstract: The shifted frequency-based electromagnetic transient (SFEMT) simulation has greatly improved the computational efficiency of traditional electromagnetic transient (EMT) simulation for the ac grid. This letter proposes a novel interface for the co-simulation of the SFEMT model and the traditional EMT model. The general form of SFEMT modeling and the principle of analytical signal construction are…

    Submitted 27 August, 2024; v1 submitted 21 July, 2024; originally announced July 2024.

  42. arXiv:2407.09645  [pdf, other]

    eess.SY cs.LG cs.RO

    Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey

    Authors: Milan Ganai, Sicun Gao, Sylvia Herbert

    Abstract: Recent literature has proposed approaches that learn control policies with high performance while maintaining safety guarantees. Synthesizing Hamilton-Jacobi (HJ) reachable sets has become an effective tool for verifying safety and supervising the training of reinforcement learning-based control policies for complex, high-dimensional systems. Previously, HJ reachability was restricted to verifying…

    Submitted 21 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted in IEEE Open Journal of Control Systems (OJ-CSYS)

  43. arXiv:2407.07372  [pdf, other]

    eess.IV cs.CV

    Multi-modal MRI Translation via Evidential Regression and Distribution Calibration

    Authors: Jiyao Liu, Shangqi Gao, Yuxin Li, Lihao Liu, Xin Gao, Zhaohu Xing, Junzhi Ning, Yanzhou Su, Xiao-Yong Zhang, Junjun He, Ningsheng Xu, Xiahai Zhuang

    Abstract: Multi-modal Magnetic Resonance Imaging (MRI) translation leverages information from source MRI sequences to generate target modalities, enabling comprehensive diagnosis while overcoming the limitations of acquiring all sequences. While existing deep-learning-based multi-modal MRI translation methods have shown promising potential, they still face two key challenges: 1) lack of reliable uncertainty…

    Submitted 18 May, 2025; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Early accepted by MICCAI 2025

  44. arXiv:2407.00896  [pdf, other]

    eess.SP cs.AI

    Channel Modeling Aided Dataset Generation for AI-Enabled CSI Feedback: Advances, Challenges, and Solutions

    Authors: Yupeng Li, Gang Li, Zirui Wen, Shuangfeng Han, Shijian Gao, Guangyi Liu, Jiangzhou Wang

    Abstract: The AI-enabled autoencoder has demonstrated great potential in channel state information (CSI) feedback in frequency division duplex (FDD) multiple input multiple output (MIMO) systems. However, this method completely changes the existing feedback strategies, making it impractical to deploy in recent years. To address this issue, this paper proposes a channel modeling aided data augmentation metho…

    Submitted 30 June, 2024; originally announced July 2024.

  45. arXiv:2406.14440  [pdf, other]

    eess.SP

    LLM4CP: Adapting Large Language Models for Channel Prediction

    Authors: Boxun Liu, Xuanyu Liu, Shijian Gao, Xiang Cheng, Liuqing Yang

    Abstract: Channel prediction is an effective approach for reducing the feedback or estimation overhead in massive multi-input multi-output (m-MIMO) systems. However, existing channel prediction methods lack precision due to model mismatch errors or network generalization issues. Large language models (LLMs) have demonstrated powerful modeling and generalization abilities, and have been successfully applied…

    Submitted 20 June, 2024; originally announced June 2024.

  46. arXiv:2406.09304  [pdf]

    physics.app-ph eess.SP

    Self-reconfigurable Multifunctional Memristive Nociceptor for Intelligent Robotics

    Authors: Shengbo Wang, Mingchao Fang, Lekai Song, Cong Li, Jian Zhang, Arokia Nathan, Guohua Hu, Shuo Gao

    Abstract: Artificial nociceptors, mimicking human-like stimuli perception, are of significance for intelligent robotics to work in hazardous and dynamic scenarios. One of the most essential characteristics of the human nociceptor is its self-adjustable attribute, which indicates that the threshold of determination of a potentially hazardous stimulus relies on environmental knowledge. This critical attribute…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures

  47. arXiv:2405.18435  [pdf, other]

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  48. arXiv:2405.14347  [pdf, other]

    eess.SP cs.AI

    Doubly-Dynamic ISAC Precoding for Vehicular Networks: A Constrained Deep Reinforcement Learning (CDRL) Approach

    Authors: Zonghui Yang, Shijian Gao, Xiang Cheng

    Abstract: Integrated sensing and communication (ISAC) technology is essential for supporting vehicular networks. However, the communication channel in this scenario exhibits time variations, and the potential targets may move rapidly, resulting in double dynamics. This nature poses a challenge for real-time precoder design. While optimization-based solutions are widely researched, they are complex and heavi…

    Submitted 23 August, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted by 2024 IEEE Global Communications Conference

  49. arXiv:2405.09778  [pdf, other]

    eess.SP

    Beam Pattern Modulation Embedded Hybrid Transceiver Optimization for Integrated Sensing and Communication

    Authors: Boxun Liu, Shijian Gao, Zonghui Yang, Xiang Cheng, Liuqing Yang

    Abstract: Integrated sensing and communication (ISAC) emerges as a promising technology for B5G/6G, particularly in the millimeter-wave (mmWave) band. However, the widely utilized hybrid architecture in mmWave systems compromises multiplexing gain due to the constraints of limited radio frequency chains. Moreover, additional sensing functionalities exacerbate the impairment of spectrum efficiency (SE). In t…

    Submitted 18 February, 2025; v1 submitted 15 May, 2024; originally announced May 2024.

  50. arXiv:2405.09663  [pdf]

    eess.SP

    Design and Implementation of mmWave Surface Wave Enabled Fluid Antennas and Experimental Results for Fluid Antenna Multiple Access

    Authors: Yuanjun Shen, Boyi Tang, Shuai Gao, Kin-Fai Tong, Hang Wong, Kai-Kit Wong, Yangyang Zhang

    Abstract: While multiple-input multiple-output (MIMO) technologies continue to advance, concerns arise as to how MIMO can remain scalable if more users are to be accommodated with an increasing number of antennas at the base station (BS) in the upcoming sixth generation (6G). Recently, the concept of fluid antenna system (FAS) has emerged, which promotes position flexibility to enable transmitter channel st…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE Transactions on Antennas and Propagation