Showing 1–50 of 136 results for author: Wu, T

  1. arXiv:2507.07512  [pdf]

    physics.app-ph eess.SY

    Demonstration of TFTs 3D Monolithically Integrated on GaN HEMTs using Cascode Configuration with High Breakdown Voltage (>1900V)

    Authors: Tian-Li Wu, Hsin-Jou Ho, Chia-Wei Liu, Yi-Chen Chen

    Abstract: This study demonstrates 3D monolithic integration of amorphous indium-gallium-zinc oxide (a-IGZO) thin-film transistors (TFTs) on Gallium Nitride (GaN) high electron mobility transistors (HEMTs) in a cascode configuration, achieving high breakdown voltage capabilities exceeding 1900 V. Two device configurations, differing in a-IGZO channel thickness (30 nm / 10 nm), are fabricated and evaluated. S…

    Submitted 10 July, 2025; originally announced July 2025.

    Comments: 3 pages, 5 figures

  2. arXiv:2507.00209  [pdf, ps, other]

    eess.IV cs.AI cs.CV cs.RO

    SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures

    Authors: Fengyi Jiang, Xiaorui Zhang, Lingbo Jin, Ruixing Liang, Yuxin Chen, Adi Chola Venkatesh, Jason Culman, Tiantian Wu, Lirong Shao, Wenqing Sun, Cong Gao, Hallie McNamara, Jingpei Lu, Omid Mohareri

    Abstract: High-resolution imaging is crucial for enhancing visual clarity and enabling precise computer-assisted guidance in minimally invasive surgery (MIS). Despite the increasing adoption of 4K endoscopic systems, there remains a significant gap in publicly available native 4K datasets tailored specifically for robotic-assisted MIS. We introduce SurgiSR4K, the first publicly accessible surgical imaging a…

    Submitted 7 July, 2025; v1 submitted 30 June, 2025; originally announced July 2025.

  3. arXiv:2506.06679  [pdf, ps, other]

    eess.SY

    Controlled Reach-avoid Set Computation for Discrete-time Polynomial Systems via Convex Optimization

    Authors: Taoran Wu, Yiling Xue, Dejin Ren, Arvind Easwaran, Martin Fränzle, Bai Xue

    Abstract: This paper addresses the computation of controlled reach-avoid sets (CRASs) for discrete-time polynomial systems subject to control inputs. A CRAS is a set encompassing initial states from which there exist control inputs driving the system into a target set while avoiding unsafe sets. However, efficiently computing CRASs remains an open problem, especially for discrete-time systems. In this paper…

    Submitted 7 June, 2025; originally announced June 2025.

  4. arXiv:2506.01496  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Continual Speech Learning with Fused Speech Features

    Authors: Guitao Wang, Jinming Zhao, Hao Yang, Guilin Qi, Tongtong Wu, Gholamreza Haffari

    Abstract: Rapid growth in speech data demands adaptive models, as traditional static methods fail to keep pace with dynamic and diverse speech information. We introduce continuous speech learning, a new set-up aimed at bridging the adaptation gap in current speech models. We use the encoder-decoder Whisper model to standardize speech tasks into a generative format. We integrate a learnable gated-fusion…

    Submitted 3 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted to Interspeech 2025

  5. arXiv:2505.20638  [pdf, ps, other]

    cs.SD cs.CV cs.MM eess.AS

    Music's Multimodal Complexity in AVQA: Why We Need More than General Multimodal LLMs

    Authors: Wenhao You, Xingjian Diao, Chunhui Zhang, Keyi Kong, Weiyi Wu, Zhongyu Ouyang, Chiyu Ma, Tingxuan Wu, Noah Wei, Zong Ke, Ming Cheng, Soroush Vosoughi, Jiang Gui

    Abstract: While recent Multimodal Large Language Models exhibit impressive capabilities for general multimodal tasks, specialized domains like music necessitate tailored approaches. Music Audio-Visual Question Answering (Music AVQA) particularly underscores this, presenting unique challenges with its continuous, densely layered audio-visual content, intricate temporal dynamics, and the critical need for dom…

    Submitted 26 May, 2025; originally announced May 2025.

  6. arXiv:2505.09511  [pdf, ps, other]

    cs.RO cs.MA eess.SY

    Design of a Formation Control System to Assist Human Operators in Flying a Swarm of Robotic Blimps

    Authors: Tianfu Wu, Jiaqi Fu, Wugang Meng, Sungjin Cho, Huanzhe Zhan, Fumin Zhang

    Abstract: Formation control is essential for swarm robotics, enabling coordinated behavior in complex environments. In this paper, we introduce a novel formation control system for an indoor blimp swarm using a specialized leader-follower approach enhanced with a dynamic leader-switching mechanism. This strategy allows any blimp to take on the leader role, distributing maneuvering demands across the swarm a…

    Submitted 14 May, 2025; originally announced May 2025.

  7. arXiv:2505.04453  [pdf, ps, other]

    eess.SP

    Meta-Learning Driven Lightweight Phase Shift Compression for IRS-Assisted Wireless Systems

    Authors: Xianhua Yu, Dong Li, Bowen Gu, Xiaoye Jing, Wen Wu, Tuo Wu, Kan Yu

    Abstract: The phase shift information (PSI) overhead poses a critical challenge to enabling real-time intelligent reflecting surface (IRS)-assisted wireless systems, particularly under dynamic and resource-constrained conditions. In this paper, we propose a lightweight PSI compression framework, termed meta-learning-driven compression and reconstruction network (MCRNet). By leveraging a few-shot adaptation…

    Submitted 7 May, 2025; originally announced May 2025.

  8. arXiv:2504.18271  [pdf, other]

    cs.AI cs.ET cs.HC eess.SY

    LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method

    Authors: Tao Wu, Kexue Fu, Qiang Hua, Xinxin Liu, Muhammad Ali Imran, Bo Liu

    Abstract: Antenna modeling is a time-consuming and complex process, decreasing the speed of antenna analysis and design. In this paper, a large language model (LLM)-enabled antenna modeling method, called LEAM, is presented to address this challenge. LEAM enables automatic antenna model generation based on language descriptions via prompt input, images, descriptions from academic papers, patents, and techn…

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: Code is available: https://github.com/TaoWu974/LEAM

  9. arXiv:2504.13455  [pdf, other]

    eess.SP

    Modular XL-Array-Enabled 3-D Localization based on Hybrid Spherical-Planar Wave Model in Terahertz Systems

    Authors: Yang Zhang, Ruidong Li, Cunhua Pan, Hong Ren, Tuo Wu, Changhong Wang

    Abstract: This work considers the three-dimensional (3-D) positioning problem in a Terahertz (THz) system enabled by a modular extra-large (XL) array with sub-connected architecture. Our purpose is to estimate the Cartesian Coordinates of multiple user equipments (UEs) with the received signal of the RF chains while considering the spatial non-stationarity (SNS). We apply the hybrid spherical-planar wave mo…

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: 13 pages, 11 figures

  10. arXiv:2504.12711  [pdf, other]

    cs.CV cs.AI eess.IV

    NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

    Authors: Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, et al. (112 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images. This challenge received a wide range of impressive solutions, which are developed and evaluated using our collected real-world Raindrop Clarity dataset. Unlike existing deraining datasets, our Raindrop Clarity dataset is more diverse and challenging in degradation types and contents, which includ…

    Submitted 19 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of CVPR NTIRE 2025; 26 pages; Methods from 32 teams

  11. arXiv:2504.01638  [pdf, other]

    eess.SY

    Convex Computations for Controlled Safety Invariant Sets of Black-box Discrete-time Dynamical Systems

    Authors: Taoran Wu, Yiling Xue, Jingduo Pan, Dejin Ren, Arvind Easwaran, Bai Xue

    Abstract: Identifying controlled safety invariant sets (CSISs) is essential in safety-critical applications. This paper tackles the problem of identifying CSISs for black-box discrete-time systems, where the model is unknown and only limited simulation data is accessible. Traditionally, a CSIS is defined as a subset of a safe set, encompassing initial states for which a control input exists that keeps the s…

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 15 pages

  12. arXiv:2503.12698  [pdf, other]

    eess.IV cs.CV

    A Continual Learning-driven Model for Accurate and Generalizable Segmentation of Clinically Comprehensive and Fine-grained Whole-body Anatomies in CT

    Authors: Dazhou Guo, Zhanghexuan Ji, Yanzhou Su, Dandan Zheng, Heng Guo, Puyang Wang, Ke Yan, Yirui Wang, Qinji Yu, Zi Li, Minfeng Xu, Jianfeng Zhang, Haoshen Li, Jia Ge, Tsung-Ying Ho, Bing-Shen Huang, Tashan Ai, Kuaile Zhao, Na Shen, Qifeng Wang, Yun Bian, Tingyu Wu, Peng Du, Hua Zhang, Feng-Ming Kong, et al. (9 additional authors not shown)

    Abstract: Precision medicine in the quantitative management of chronic diseases and oncology would be greatly improved if the Computed Tomography (CT) scan of any patient could be segmented, parsed and analyzed in a precise and detailed way. However, there is no such fully annotated CT dataset with all anatomies delineated for training because of the exceptionally high manual cost, the need for specialized…

    Submitted 16 March, 2025; originally announced March 2025.

  13. arXiv:2502.16669  [pdf, other]

    eess.SP

    Holographic MIMO Multi-Cell Communications

    Authors: Kangda Zhi, Tianyu Yang, Shuangyang Li, Yi Song, Tuo Wu, Giuseppe Caire

    Abstract: Metamaterial antennas are appealing for next-generation wireless networks due to their simplified hardware and much-reduced size, power, and cost. This paper investigates the holographic multiple-input multiple-output (HMIMO)-aided multi-cell systems with practical per-radio frequency (RF) chain power constraints. With multiple antennas at both base stations (BSs) and users, we design the baseband…

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: 13 pages

  14. arXiv:2502.06710  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    Learning Musical Representations for Music Performance Question Answering

    Authors: Xingjian Diao, Chunhui Zhang, Tingxuan Wu, Ming Cheng, Zhongyu Ouyang, Weiyi Wu, Jiang Gui

    Abstract: Music performances are representative scenarios for audio-visual modeling. Unlike common scenarios with sparse audio, music performances continuously involve dense audio signals throughout. While existing multimodal learning methods on the audio-video QA demonstrate impressive capabilities in general scenarios, they are incapable of dealing with fundamental problems within the music performances:…

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: Accepted at EMNLP 2024

  15. arXiv:2502.04307  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    DexterityGen: Foundation Controller for Unprecedented Dexterity

    Authors: Zhao-Heng Yin, Changhao Wang, Luis Pineda, Francois Hogan, Krishna Bodduluri, Akash Sharma, Patrick Lancaster, Ishita Prasad, Mrinal Kalakrishnan, Jitendra Malik, Mike Lambeta, Tingfan Wu, Pieter Abbeel, Mustafa Mukadam

    Abstract: Teaching robots dexterous manipulation skills, such as tool use, presents a significant challenge. Current approaches can be broadly categorized into two strategies: human teleoperation (for imitation learning) and sim-to-real reinforcement learning. The first approach is difficult as it is hard for humans to produce safe and dexterous motions on a different embodiment without touch feedback. The…

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: Project: https://zhaohengyin.github.io/dexteritygen

  16. arXiv:2501.18378  [pdf, other]

    eess.SP

    A Hybrid Dynamic Subarray Architecture for Efficient DOA Estimation in THz Ultra-Massive Hybrid MIMO Systems

    Authors: Ye Tian, Jiaji Ren, Tuo Wu, Wei Liu, Chau Yuen, Merouane Debbah, Naofal Al-Dhahir, Matthew C. Valenti, Hing Cheung So, Yonina C. Eldar

    Abstract: Terahertz (THz) communication combined with ultra-massive multiple-input multiple-output (UM-MIMO) technology is promising for 6G wireless systems, where fast and precise direction-of-arrival (DOA) estimation is crucial for effective beamforming. However, finding DOAs in THz UM-MIMO systems faces significant challenges: while reducing hardware complexity, the hybrid analog-digital (HAD) architectu…

    Submitted 30 January, 2025; originally announced January 2025.

  17. arXiv:2501.16854  [pdf, ps, other]

    eess.SP

    From Partial Calibration to Full Potential: A Two-Stage Sparse DOA Estimation for Incoherently-Distributed Sources with Gain-Phase Uncertainty

    Authors: He Xu, Tuo Wu, Wei Liu, Maged Elkashlan, Naofal Al-Dhahir, Merouane Debbah, Chau Yuen, Hing Cheung So

    Abstract: Direction-of-arrival (DOA) estimation for incoherently distributed (ID) sources is essential in multipath wireless communication scenarios, yet it remains challenging due to the combined effects of angular spread and gain-phase uncertainties in antenna arrays. This paper presents a two-stage sparse DOA estimation framework, transitioning from partial calibration to full potential, under the genera…

    Submitted 28 January, 2025; originally announced January 2025.

  18. arXiv:2501.12473  [pdf, other]

    eess.SY

    RIS-Aided Monitoring With Cooperative Jamming: Design and Performance Analysis

    Authors: Shuying Lin, Yulong Zou, Zhiyang Li, Tong Wu, Eduard E. Bahingayi, Le-Nam Tran

    Abstract: We investigate a reconfigurable intelligent surface (RIS) aided wireless surveillance system. In this system, a monitor not only receives signal from suspicious transmitter via a RIS-enhanced legitimate surveillance (LS) link but also simultaneously takes control of multiple jammers to degrade the quality of received suspicious signal. Under this setup, to enhance monitoring performance requires i…

    Submitted 25 February, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

    Comments: submitted to IEEE Transactions on Communications

  19. arXiv:2501.08680  [pdf, other]

    eess.SY cs.NI

    Digital Twin Online Channel Modeling: Challenges, Principles, and Applications

    Authors: Junling Li, Cheng-Xiang Wang, Chen Huang, Tianrun Qi, Tong Wu

    Abstract: Different from traditional offline channel modeling, digital twin online channel modeling can sense and accurately characterize dynamic wireless channels in real time, and can therefore greatly assist 6G network optimization. This article proposes a novel promising framework and a step-by-step design procedure of digital twin online channel models (DTOCM). By enabling continuous visualization and…

    Submitted 15 January, 2025; originally announced January 2025.

  20. arXiv:2501.01281  [pdf, other]

    eess.SP

    Towards Intelligent Antenna Positioning: Leveraging DRL for FAS-Aided ISAC Systems

    Authors: Shunxing Yang, Junteng Yao, Jie Tang, Tuo Wu, Maged Elkashlan, Chau Yuen, Merouane Debbah, Hyundong Shin, Matthew Valenti

    Abstract: Fluid antenna systems (FAS) enable dynamic antenna positioning, offering new opportunities to enhance integrated sensing and communication (ISAC) performance. However, existing studies primarily focus on communication enhancement or single-target sensing, leaving multi-target scenarios underexplored. Additionally, the joint optimization of beamforming and antenna positions poses a highly non-conve…

    Submitted 2 January, 2025; originally announced January 2025.

  21. arXiv:2412.15843  [pdf, other]

    eess.SP

    Rethinking Hardware Impairments in Multi-User Systems: Can FAS Make a Difference?

    Authors: Junteng Yao, Tuo Wu, Liaoshi Zhou, Ming Jin, Cunhua Pan, Maged Elkashlan, Fumiyuki Adachi, George K. Karagiannidis, Naofal Al-Dhahir, Chau Yuen

    Abstract: In this paper, we analyze the role of fluid antenna systems (FAS) in multi-user systems with hardware impairments (HIs). Specifically, we investigate a scenario where a base station (BS) equipped with multiple fluid antennas communicates with multiple users (CUs), each equipped with a single fluid antenna. Our objective is to maximize the minimum communication rate among all users by jointly optim…

    Submitted 20 December, 2024; originally announced December 2024.

  22. arXiv:2412.03839  [pdf, other]

    eess.SP

    Fluid Antenna Systems Enabling 6G: Principles, Applications, and Research Directions

    Authors: Tuo Wu, Kangda Zhi, Junteng Yao, Xiazhi Lai, Jianchao Zheng, Hong Niu, Maged Elkashlan, Kai-Kit Wong, Chan-Byoung Chae, Zhiguo Ding, George K. Karagiannidis, Merouane Debbah, Chau Yuen

    Abstract: Fluid antenna system (FAS), as a new version of reconfigurable antenna technologies promoting shape and position flexibility, has emerged as an exciting and possibly transformative technology for wireless communications systems. FAS represents any software-controlled fluidic, conductive or dielectric structure that can dynamically alter antenna's shape and position to change the gain, the radiation…

    Submitted 4 December, 2024; originally announced December 2024.

  23. arXiv:2412.02282  [pdf, other]

    cs.NI cs.IT eess.SP

    Exploring Evolutionary Spectral Clustering for Temporal-Smoothed Clustered Cell-Free Networking

    Authors: Junyuan Wang, Tianyao Wu, Ouyang Zhou, Yaping Zhu

    Abstract: Clustered cell-free networking, which dynamically partitions the whole network into nonoverlapping subnetworks, has been recently proposed to mitigate the cell-edge problem in cellular networks. However, prior works only focused on optimizing clustered cell-free networking in static scenarios with fixed users. This could lead to a large number of handovers in the practical dynamic environment with…

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 5 pages, 3 figures

  24. Channel Modeling for Ultraviolet Non-Line-of-Sight Communications Incorporating an Obstacle

    Authors: Tianfeng Wu, Fang Yang, Tian Cao, Ling Cheng, Yupeng Chen, Jian Song, Julian Cheng, Zhu Han

    Abstract: Existing studies on ultraviolet (UV) non-line-of-sight (NLoS) channel modeling primarily focus on scenarios without any obstacle, which makes them unsuitable for small transceiver elevation angles in most cases. To address this issue, a UV NLoS channel model incorporating an obstacle was investigated in this paper, where the impacts of atmospheric scattering and obstacle reflection on UV signals w…

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: Accepted by IEEE Global Communications Conference (GLOBECOM) 2024. arXiv admin note: substantial text overlap with arXiv:2411.15154

  25. arXiv:2411.15154  [pdf, other]

    eess.SP

    Modeling of UV NLoS Communication Channels: From Atmospheric Scattering and Obstacle Reflection Perspectives

    Authors: Tianfeng Wu, Fang Yang, Tian Cao, Ling Cheng, Yupeng Chen, Jian Song, Julian Cheng, Zhu Han

    Abstract: As transceiver elevation angles increase from small to large, existing ultraviolet (UV) non-line-of-sight (NLoS) models encounter two challenges: i) cannot estimate the channel characteristics of UV NLoS communication scenarios when there exists an obstacle in the overlap volume between the transmitter beam and the receiver field-of-view (FoV), and ii) cannot evaluate the channel path loss for the…

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Accepted by IEEE Journal on Selected Areas in Communications

  26. arXiv:2411.11110  [pdf, other]

    eess.IV cs.CV

    Retinal Vessel Segmentation via Neuron Programming

    Authors: Tingting Wu, Ruyi Min, Peixuan Song, Hengtao Guo, Tieyong Zeng, Feng-Lei Fan

    Abstract: The accurate segmentation of retinal blood vessels plays a crucial role in the early diagnosis and treatment of various ophthalmic diseases. Designing a network model for this task requires meticulous tuning and extensive experimentation to handle the tiny and intertwined morphology of retinal blood vessels. To tackle this challenge, Neural Architecture Search (NAS) methods are developed to fully…

    Submitted 17 November, 2024; originally announced November 2024.

  27. arXiv:2411.09235  [pdf, ps, other]

    eess.SP

    FAS for Secure and Covert Communications

    Authors: Junteng Yao, Liangxiao Xin, Tuo Wu, Ming Jin, Kai-Kit Wong, Chau Yuen, Hyundong Shin

    Abstract: This letter considers a fluid antenna system (FAS)-aided secure and covert communication system, where the transmitter adjusts multiple fluid antennas' positions to achieve secure and covert transmission under the threat of an eavesdropper and the detection of a warden. This letter aims to maximize the secrecy rate while satisfying the covertness constraint. Unfortunately, the optimization problem…

    Submitted 14 November, 2024; originally announced November 2024.

  28. arXiv:2411.08618  [pdf, other]

    eess.SY

    Robust Optimal Power Flow Against Adversarial Attacks: A Tri-Level Optimization Approach

    Authors: Saman Mazaheri Khamaneh, Tong Wu

    Abstract: In power systems, unpredictable events like extreme weather, equipment failures, and cyberattacks present significant challenges to ensuring safety and reliability. Ensuring resilience in the face of these uncertainties is crucial for reliable and efficient operations. This paper presents a tri-level optimization approach for robust power system operations that effectively address worst-case attac…

    Submitted 13 November, 2024; originally announced November 2024.

    Comments: This work has been submitted for possible publication

  29. arXiv:2411.08386  [pdf, ps, other]

    eess.SP

    A Secure Beamforming Design: When Fluid Antenna Meets NOMA

    Authors: Lifeng Mai, Junteng Yao, Jie Tang, Tuo Wu, Kai-Kit Wong, Hyundong Shin, Fumiyuki Adachi

    Abstract: This letter proposes a secure beamforming design for downlink non-orthogonal multiple access (NOMA) systems utilizing fluid antenna systems (FAS). We consider a setup where a base station (BS) with $M$ fluid antennas (FAs) communicates to a cell-center user (CU) and a cell-edge user (CEU), each with a FA. The CU is the intended recipient while the CEU is regarded as a potential eavesdropper. Our a…

    Submitted 13 November, 2024; originally announced November 2024.

  30. arXiv:2411.08383  [pdf, other]

    eess.SP

    FAS-Driven Spectrum Sensing for Cognitive Radio Networks

    Authors: Junteng Yao, Ming Jin, Tuo Wu, Maged Elkashlan, Chau Yuen, Kai-Kit Wong, George K. Karagiannidis, Hyundong Shin

    Abstract: Cognitive radio (CR) networks face significant challenges in spectrum sensing, especially under spectrum scarcity. Fluid antenna systems (FAS) can offer an unorthodox solution due to their ability to dynamically adjust antenna positions for improved channel gain. In this letter, we study a FAS-driven CR setup where a secondary user (SU) adjusts the positions of fluid antennas to detect signals fro…

    Submitted 13 November, 2024; originally announced November 2024.

  31. arXiv:2411.05363  [pdf, other]

    eess.SP

    Path Loss Modeling for NLoS Ultraviolet Channels Incorporating Scattering and Reflection Effects

    Authors: Tianfeng Wu, Fang Yang, Fei Li, Renzhi Yuan, Tian Cao, Ling Cheng, Jian Song, Julian Cheng, Zhu Han

    Abstract: This paper tackles limitations in existing non-line-of-sight (NLoS) ultraviolet (UV) channel models, where conventional approaches assume obstacle-free propagation or uniform radiation intensity. In this paper, we develop a path loss model incorporating scattering and reflection, and then propose an obstacle-boundary approximation method to achieve computational tractability. Our framework systema…

    Submitted 18 March, 2025; v1 submitted 8 November, 2024; originally announced November 2024.

    Comments: Submitted to IEEE Global Communications Conference (GLOBECOM) 2025

  32. arXiv:2411.01400  [pdf, ps, other]

    eess.SP

    Unlocking FAS-RIS Security Analysis with Block-Correlation Model

    Authors: Jianchao Zheng, Xiazhi Lai, Tuo Wu, Maged Elkashlan, Daniel Benevides da Costa, Chau Yuen, Fumiyuki Adachi

    Abstract: In this letter, we investigate the security of fluid antenna system (FAS)-reconfigurable intelligent surfaces (RIS) communication systems. The base station (BS) employs a single fixed-position antenna, while both the legitimate receiver and the eavesdropper are equipped with fluid antennas. By utilizing the block-correlation model and the central limit theorem (CLT), we derive approximate expressi…

    Submitted 2 November, 2024; originally announced November 2024.

  33. arXiv:2411.01398  [pdf, ps, other]

    eess.SP

    Paving the Way to 6G: Outage Probability Analysis for FAS-ARIS Systems

    Authors: Jianchao Zheng, Xiazhi Lai, Junteng Yao, Jie Tang, Yijin Pan, Tuo Wu, Chau Yuen

    Abstract: In this paper, we pave the way to sixth-generation (6G) by investigating the outage probability (OP) of fluid antenna system (FAS)-active reconfigurable intelligent surface (ARIS) communication systems. We consider a FAS-ARIS setup consisting of a base station (BS) with a single fixed-position antenna and a receiver equipped with a fluid antenna (FA). Utilizing the block-correlation model, we derive…

    Submitted 2 November, 2024; originally announced November 2024.

  34. arXiv:2410.17609  [pdf, other]

    eess.SP

    Exploring the Impact of RIS on Cooperative NOMA URLLC Systems: A Theoretical Perspective

    Authors: Jianchao Zheng, Tuo Wu, Junteng Yao, Chau Yuen, Zhiguo Ding, Fumiyuki Adachi

    Abstract: In this paper, we conduct a theoretical analysis of how to integrate reconfigurable intelligent surfaces (RIS) with cooperative non-orthogonal multiple access (NOMA), considering URLLC. We consider a downlink two-user cooperative NOMA system employing short-packet communications, where the two users are denoted by the central user (CU) and the cell-edge user (CEU), respectively, and an RIS is depl…

    Submitted 23 October, 2024; originally announced October 2024.

  35. arXiv:2410.12218  [pdf, other]

    eess.SP

    Exploring Dual-Sniffer Passive Localization: Algorithm Design and Experimental Results

    Authors: Tuo Wu, Lingyu Hou, Hong Niu, Saihua Xu, Sirajudeen Gulam Razul, Chau Yuen

    Abstract: In this paper, we explore a dual-sniffer passive localization system that detects the timing difference of signals from both commercial base station (eNb) and user equipment (UE) to the sniffers. We design two localization schemes for UE localization: a time of arrival (ToA) based scheme and a time difference of arrival (TDoA) based scheme. In the ToA-based scheme, we derive two ellipse equations…

    Submitted 16 October, 2024; originally announced October 2024.

  36. arXiv:2410.06115  [pdf, other]

    cs.IT eess.SP

    A physics-based perspective for understanding and utilizing spatial resources of wireless channels

    Authors: Hui Xu, Jun Wei Wu, Zhen Jie Qi, Hao Tian Wu, Rui Wen Shao, Qiang Cheng, Jieao Zhu, Linglong Dai, Tie Jun Cui

    Abstract: To satisfy the increasing demands for transmission rates of wireless communications, it is necessary to use spatial resources of electromagnetic (EM) waves. In this context, EM information theory (EIT) has become a hot topic by integrating the theoretical framework of deterministic mathematics and stochastic statistics to explore the transmission mechanisms of continuous EM waves. However, the pre…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 31 pages, 8 figures

  37. arXiv:2409.12962  [pdf, other]

    cs.CL cs.SD eess.AS

    CLAIR-A: Leveraging Large Language Models to Judge Audio Captions

    Authors: Tsung-Han Wu, Joseph E. Gonzalez, Trevor Darrell, David M. Chan

    Abstract: The Automated Audio Captioning (AAC) task asks models to generate natural language descriptions of an audio input. Evaluating these machine-generated audio captions is a complex task that requires considering diverse factors, among them, auditory scene understanding, sound-object inference, temporal coherence, and the environmental context of the scene. While current methods focus on specific aspe…

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: Code is publicly available at https://github.com/DavidMChan/clair-a

  38. arXiv:2408.13447  [pdf, ps, other]

    eess.SP

    FAS-RIS Communication: Model, Analysis, and Optimization

    Authors: Junteng Yao, Jianchao Zheng, Tuo Wu, Ming Jin, Chau Yuen, Kai-Kit Wong, Fumiyuki Adachi

    Abstract: This correspondence investigates the novel fluid antenna system (FAS) technology, combining with reconfigurable intelligent surface (RIS) for wireless communications, where a base station (BS) communicates with a FAS-enabled user with the assistance of a RIS. To analyze this technology, we derive the outage probability based on the block-diagonal matrix approximation (BDMA) model. With this, we ob…

    Submitted 23 August, 2024; originally announced August 2024.

  39. arXiv:2408.13444  [pdf, ps, other]

    eess.SP

    FAS-RIS: A Block-Correlation Model Analysis

    Authors: Xiazhi Lai, Junteng Yao, Kangda Zhi, Tuo Wu, David Morales-Jimenez, Kai-Kit Wong

    Abstract: In this correspondence, we analyze the performance of a reconfigurable intelligent surface (RIS)-aided communication system that involves a fluid antenna system (FAS)-enabled receiver. By applying the central limit theorem (CLT), we derive approximate expressions for the system outage probability when the RIS has a large number of elements. Also, we adopt the block-correlation channel model to sim…

    Submitted 23 August, 2024; originally announced August 2024.

  40. arXiv:2408.09067  [pdf, ps, other]

    eess.SP

    FAS vs. ARIS: Which Is More Important for FAS-ARIS Communication Systems?

    Authors: Junteng Yao, Liaoshi Zhou, Tuo Wu, Ming Jin, Chongwen Huang, Chau Yuen

    Abstract: In this paper, we investigate the question of which technology, fluid antenna systems (FAS) or active reconfigurable intelligent surfaces (ARIS), plays a more crucial role in FAS-ARIS wireless communication systems. To address this, we develop a comprehensive system model and explore the problem from an optimization perspective. We introduce an alternating optimization (AO) algorithm incorporating…

    Submitted 16 August, 2024; originally announced August 2024.

  41. arXiv:2408.03124  [pdf, other]

    eess.SY cs.LG

    CL-DiffPhyCon: Closed-loop Diffusion Control of Complex Physical Systems

    Authors: Long Wei, Haodong Feng, Yuchen Yang, Ruiqi Feng, Peiyan Hu, Xiang Zheng, Tao Zhang, Dixia Fan, Tailin Wu

    Abstract: The control problems of complex physical systems have broad applications in science and engineering. Previous studies have shown that generative control methods based on diffusion models offer significant advantages for solving these problems. However, existing generative control approaches face challenges in both performance and efficiency when extended to the closed-loop setting, which is essent…

    Submitted 22 February, 2025; v1 submitted 31 July, 2024; originally announced August 2024.

    Comments: Published as a conference paper at ICLR 2025

  42. arXiv:2407.19663  [pdf, other]

    cs.LG eess.SP

    Short-Term Photovoltaic Forecasting Model for Qualifying Uncertainty during Hazy Weather

    Authors: Xuan Yang, Yunxuan Dong, Lina Yang, Thomas Wu

    Abstract: Solar energy is one of the most promising renewable energy resources. Forecasting photovoltaic power generation is an important way to increase photovoltaic penetration. However, the difficulty in qualifying the uncertainty of PV power generation, especially during hazy weather, makes forecasting challenging. This paper proposes a novel model to address the issue. We introduce a modified entropy t…

    Submitted 7 October, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

    Comments: The manuscript was submitted to Applied Energy on August 29, 2024

  43. arXiv:2407.11307  [pdf, ps, other]

    eess.SP

    Fluid Antenna-Assisted Simultaneous Wireless Information and Power Transfer Systems

    Authors: Liaoshi Zhou, Junteng Yao, Tuo Wu, Ming Jin, Chau Yuen, Fumiyuki Adachi

    Abstract: This paper examines a fluid antenna (FA)-assisted simultaneous wireless information and power transfer (SWIPT) system. Unlike traditional SWIPT systems with fixed-position antennas (FPAs), our FA-assisted system enables dynamic reconfiguration of the radio propagation environment by adjusting the positions of FAs. This capability enhances both energy harvesting and communication performance. The s…

    Submitted 23 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  44. arXiv:2407.08141  [pdf, ps, other]

    eess.SP

    A Framework of FAS-RIS Systems: Performance Analysis and Throughput Optimization

    Authors: Junteng Yao, Xiazhi Lai, Kangda Zhi, Tuo Wu, Ming Jin, Cunhua Pan, Maged Elkashlan, Chau Yuen, Kai-Kit Wong

    Abstract: In this paper, we investigate reconfigurable intelligent surface (RIS)-assisted communication systems which involve a fixed-antenna base station (BS) and a mobile user (MU) that is equipped with a fluid antenna system (FAS). Specifically, the RIS is utilized to enable communication for the user whose direct link from the base station is blocked by obstacles. We propose a comprehensive framework that…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE journal for possible publication

  45. arXiv:2407.07720  [pdf, other]

    eess.IV cs.CV

    Exploiting Scale-Variant Attention for Segmenting Small Medical Objects

    Authors: Wei Dai, Rui Liu, Zixuan Wu, Tianyi Wu, Min Wang, Junxian Zhou, Yixuan Yuan, Jun Liu

    Abstract: Early detection and accurate diagnosis can predict the risk of malignant disease transformation, thereby increasing the probability of effective treatment. Identifying mild syndrome with small pathological regions serves as an ominous warning and is fundamental in the early diagnosis of diseases. While deep learning algorithms, particularly convolutional neural networks (CNNs), have shown promise…

    Submitted 5 August, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures, under review

  46. Revisiting XL-MIMO Channel Estimation: When Dual-Wideband Effects Meet Near Field

    Authors: Anzheng Tang, Jun-Bo Wang, Yijin Pan, Tuo Wu, Yijian Chen, Hongkang Yu, Maged Elkashlan

    Abstract: The deployment of extremely large antenna arrays (ELAAs) and operation at higher frequency bands in wideband extremely large-scale multiple-input-multiple-output (XL-MIMO) systems introduce significant near-field effects, such as spherical wavefront propagation and spatially non-stationary (SnS) properties. Combined with dual-wideband impacts, these effects fundamentally reshape the sparsity patte…

    Submitted 16 June, 2025; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: A major revision version has been submitted to IEEE journal for possible publication

  47. arXiv:2407.05289  [pdf, other]

    cs.IT eess.SP

    DM-MIMO: Diffusion Models for Robust Semantic Communications over MIMO Channels

    Authors: Yiheng Duan, Tong Wu, Zhiyong Chen, Meixia Tao

    Abstract: This paper investigates robust semantic communications over multiple-input multiple-output (MIMO) fading channels. Current semantic communications over MIMO channels mainly focus on channel adaptive encoding and decoding, which lacks exploration of signal distribution. To leverage the potential of signal distribution in signal space denoising, we develop a diffusion model over MIMO channels (DM-MI…

    Submitted 7 July, 2024; originally announced July 2024.

  48. A Review of Safe Reinforcement Learning Methods for Modern Power Systems

    Authors: Tong Su, Tong Wu, Junbo Zhao, Anna Scaglione, Le Xie

    Abstract: Given the availability of more comprehensive measurement data in modern power systems, reinforcement learning (RL) has gained significant interest in operation and control. Conventional RL relies on trial-and-error interactions with the environment and reward feedback, which often leads to exploring unsafe operating regions and executing unsafe actions, especially when deployed in real-world power…

    Submitted 25 June, 2025; v1 submitted 28 June, 2024; originally announced July 2024.

    Journal ref: Proceedings of the IEEE, 2025

  49. arXiv:2406.16990  [pdf, other]

    cs.SD cs.AI eess.AS

    AND: Audio Network Dissection for Interpreting Deep Acoustic Models

    Authors: Tung-Yu Wu, Yu-Xiang Lin, Tsui-Wei Weng

    Abstract: Neuron-level interpretations aim to explain network behaviors and properties by investigating neurons responsive to specific perceptual or structural input patterns. Although there is emerging work in the vision and language domains, none is explored for acoustic models. To bridge the gap, we introduce $\textit{AND}$, the first $\textbf{A}$udio $\textbf{N}$etwork $\textbf{D}$issection framework th…

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML'24

    Journal ref: Forty-first International Conference on Machine Learning (2024)

  50. arXiv:2406.16876  [pdf, other]

    eess.SP

    Near-Field Mobile Tracking: A Framework of Using XL-RIS Information

    Authors: Tuo Wu, Cunhua Pan, Kangda Zhi, Junteng Yao, Hong Ren, Maged Elkashlan, Chau Yuen

    Abstract: This paper introduces a novel mobile tracking framework leveraging the high-dimensional signal received from extremely large-scale (XL) reconfigurable intelligent surfaces (RIS). This received signal, named XL-RIS information, has a much larger data dimension and therefore offers a richer feature set compared to the traditional base station (BS) received signal, i.e., BS information, enabling more…

    Submitted 5 August, 2024; v1 submitted 3 April, 2024; originally announced June 2024.