Skip to main content

Showing 1–50 of 77 results for author: Luo, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.04028  [pdf

    eess.SY

    An Improved Finite Element Modeling Method for Triply Periodic Minimal Surface Structures Based on Element Size and Minimum Jacobian

    Authors: Siqi Wang, Chuangyu Jiang, Xiaodong Zhang, Yilong Zhang, Baoqiang Zhang, Huageng Luo

    Abstract: Triply periodic minimal surface (TPMS) structures, a type of lattice structure, have garnered significant attention due to their lightweight nature, controllability, and excellent mechanical properties. Voxel-based modeling is a widely used method for investigating the mechanical behavior of such lattice structures through finite element simulations. This study proposes a two-parameter voxel metho… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  2. arXiv:2505.20805  [pdf, ps, other

    eess.SP

    Dual-Polarization Stacked Intelligent Metasurfaces for Holographic MIMO

    Authors: Yida Zhang, Qiuyan Liu, Hongtao Luo, Yuqi Xia, Qiang Wang

    Abstract: To address the limited wave domain signal processing capabilities of traditional single-polarized stacked intelligent metasurfaces (SIMs) in holographic multiple-input multiple-output (HMIMO) systems, which stems from limited integration space, this paper proposes a dual-polarized SIM (DPSIM) architecture. By stacking dual-polarized reconfigurable intelligent surfaces (DPRIS), DPSIM can independen… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2505.16211  [pdf, ps, other

    cs.SD cs.AI cs.CL eess.AS

    AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

    Authors: Kai Li, Can Shen, Yile Liu, Jirui Han, Kelong Zheng, Xuechao Zou, Zhe Wang, Xingjian Du, Shun Zhang, Hanjun Luo, Yingbin Jin, Xinxin Xing, Ziyang Ma, Yue Liu, Xiaojun Jia, Yifan Zhang, Junfeng Fang, Kun Wang, Yibo Yan, Haoyang Li, Yiming Li, Xiaobin Zhuang, Yang Liu, Haibo Hu, Zhizheng Wu , et al. (6 additional authors not shown)

    Abstract: The rapid advancement and expanding applications of Audio Large Language Models (ALLMs) demand a rigorous understanding of their trustworthiness. However, systematic research on evaluating these models, particularly concerning risks unique to the audio modality, remains largely unexplored. Existing evaluation frameworks primarily focus on the text modality or address only a restricted set of safet… ▽ More

    Submitted 1 July, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: Technical Report

  4. arXiv:2505.02554  [pdf, ps, other

    eess.SP

    Sensing Framework Design and Performance Optimization with Action Detection for ISCC

    Authors: Weiwei Chen, Yinghui He, Guanding Yu, Jianfeng Wang, Haiyan Luo

    Abstract: Integrated sensing, communication, and computation (ISCC) has been regarded as a prospective technology for the next-generation wireless network, supporting humancentric intelligent applications. However, the delay sensitivity of these computation-intensive applications, especially in a multidevice ISCC system with limited resources, highlights the urgent need for efficient sensing task execution… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: Accepted by IEEE Transactions on Wireless Communications

  5. arXiv:2503.06756  [pdf, other

    eess.SP cs.IT

    Sphere Precoding for Robust Near-Field Communications

    Authors: Hao Luo, Yu Zhang, Ahmed Alkhateeb

    Abstract: Near-field communication with large antenna arrays promises significant beamforming and multiplexing gains. These communication links, however, are very sensitive to user mobility as any small change in the user position may suddenly drop the signal power. This leads to critical challenges for the robustness of these near-field communication systems. In this paper, we propose \textit{sphere precod… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

    Comments: The code for sphere precoding will be available on the Wireless Intelligence Lab website: https://www.wi-lab.net/

  6. arXiv:2502.02021  [pdf, other

    cs.CV eess.IV

    Multi-illuminant Color Constancy via Multi-scale Illuminant Estimation and Fusion

    Authors: Hang Luo, Rongwei Li, Jinxing Liang

    Abstract: Multi-illuminant color constancy methods aim to eliminate local color casts within an image through pixel-wise illuminant estimation. Existing methods mainly employ deep learning to establish a direct mapping between an image and its illumination map, which neglects the impact of image scales. To alleviate this problem, we represent an illuminant map as the linear combination of components estimat… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: 10 pages, 4 figures, this manuscript is under the consideration of Optics Express

  7. arXiv:2501.06282  [pdf, other

    cs.CL cs.AI cs.HC cs.SD eess.AS

    MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

    Authors: Qian Chen, Yafeng Chen, Yanni Chen, Mengzhe Chen, Yingda Chen, Chong Deng, Zhihao Du, Ruize Gao, Changfeng Gao, Zhifu Gao, Yabin Li, Xiang Lv, Jiaqing Liu, Haoneng Luo, Bin Ma, Chongjia Ni, Xian Shi, Jialong Tang, Hui Wang, Hao Wang, Wen Wang, Yuxuan Wang, Yunlan Xu, Fan Yu, Zhijie Yan , et al. (11 additional authors not shown)

    Abstract: Recent advancements in large language models (LLMs) and multimodal speech-text models have laid the groundwork for seamless voice interactions, enabling real-time, natural, and human-like conversations. Previous models for voice interactions are categorized as native and aligned. Native models integrate speech and text processing in one framework but struggle with issues like differing sequence le… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: Work in progress. Authors are listed in alphabetical order by family name

  8. arXiv:2412.18588  [pdf, other

    cs.RO cs.AI eess.SY

    A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs

    Authors: OpenMind, Shaohong Zhong, Adam Zhou, Boyuan Chen, Homin Luo, Jan Liphardt

    Abstract: Large Language Models (LLMs) are compact representations of all public knowledge of our physical environment and animal and human behaviors. The application of LLMs to robotics may offer a path to highly capable robots that perform well across most human tasks with limited or even zero tuning. Aside from increasingly sophisticated reasoning and task planning, networks of (suitably designed) LLMs o… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

    Comments: 10 pages, 1 figure

  9. arXiv:2412.09854  [pdf, ps, other

    cs.HC cs.CR eess.SP

    User Identity Protection in EEG-based Brain-Computer Interfaces

    Authors: L. Meng, X. Jiang, J. Huang, W. Li, H. Luo, D. Wu

    Abstract: A brain-computer interface (BCI) establishes a direct communication pathway between the brain and an external device. Electroencephalogram (EEG) is the most popular input signal in BCIs, due to its convenience and low cost. Most research on EEG-based BCIs focuses on the accurate decoding of EEG signals; however, EEG signals also contain rich private information, e.g., user identity, emotion, and s… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Journal ref: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 31:3576-3586, 2023

  10. arXiv:2410.24039  [pdf, other

    cs.NI eess.SY

    Efficient Satellite-Ground Interconnection Design for Low-orbit Mega-Constellation Topology

    Authors: Wenhao Liu, Jiazhi Wu, Quanwei Lin, Handong Luo, Qi Zhang, Kun Qiu, Zhe Chen, Yue Gao

    Abstract: The low-orbit mega-constellation network (LMCN) is an important part of the space-air-ground integrated network system. An effective satellite-ground interconnection design can result in a stable constellation topology for LMCNs. A naive solution is accessing the satellite with the longest remaining service time (LRST), which is widely used in previous designs. The Coordinated Satellite-Ground Int… ▽ More

    Submitted 31 October, 2024; originally announced October 2024.

    Comments: 13 pages, 14 figures

  11. arXiv:2408.12760  [pdf, other

    eess.IV cs.CV

    Hierarchical Attention and Parallel Filter Fusion Network for Multi-Source Data Classification

    Authors: Han Luo, Feng Gao, Junyu Dong, Lin Qi

    Abstract: Hyperspectral image (HSI) and synthetic aperture radar (SAR) data joint classification is a crucial and yet challenging task in the field of remote sensing image interpretation. However, feature modeling in existing methods is deficient to exploit the abundant global, spectral, and local features simultaneously, leading to sub-optimal classification performance. To solve the problem, we propose a… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted by IEEE GRSL

  12. arXiv:2408.05776  [pdf

    cs.NI eess.SP

    Convergence of Symbiotic Communications and Blockchain for Sustainable and Trustworthy 6G Wireless Networks

    Authors: Haoxiang Luo, Gang Sun, Cheng Chi, Hongfang Yu, Mohsen Guizani

    Abstract: Symbiotic communication (SC) is known as a new wireless communication paradigm, similar to the natural ecosystem population, and can enable multiple communication systems to cooperate and mutualize through service exchange and resource sharing. As a result, SC is seen as an important potential technology for future sixth-generation (6G) communications, solving the problem of lack of spectrum resou… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  13. Power-LLaVA: Large Language and Vision Assistant for Power Transmission Line Inspection

    Authors: Jiahao Wang, Mingxuan Li, Haichen Luo, Jinguo Zhu, Aijun Yang, Mingzhe Rong, Xiaohua Wang

    Abstract: The inspection of power transmission line has achieved notable achievements in the past few years, primarily due to the integration of deep learning technology. However, current inspection approaches continue to encounter difficulties in generalization and intelligence, which restricts their further applicability. In this paper, we introduce Power-LLaVA, the first large language and vision assista… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

  14. arXiv:2407.08591  [pdf, other

    eess.SP

    6D Motion Parameters Estimation in Monostatic Integrated Sensing and Communications System

    Authors: Hongliang Luo, Feifei Gao, Fan Liu, Shi Jin

    Abstract: In this paper, we propose a novel scheme to estimate the six dimensional (6D) motion parameters of dynamic target for monostatic integrated sensing and communications (ISAC) system. We first provide a generic ISAC framework for dynamic target sensing based on massive multiple input and multiple output (MIMO) array. Next, we derive the relationship between the sensing channel of ISAC base station (… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.16441

  15. arXiv:2407.04051  [pdf, other

    cs.SD cs.AI eess.AS

    FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

    Authors: Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang , et al. (8 additional authors not shown)

    Abstract: This report introduces FunAudioLLM, a model family designed to enhance natural voice interactions between humans and large language models (LLMs). At its core are two innovative models: SenseVoice, which handles multilingual speech recognition, emotion recognition, and audio event detection; and CosyVoice, which facilitates natural speech generation with control over multiple languages, timbre, sp… ▽ More

    Submitted 10 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Work in progress. Authors are listed in alphabetical order by family name

  16. arXiv:2406.14977  [pdf, other

    cs.AI eess.IV

    Trustworthy Enhanced Multi-view Multi-modal Alzheimer's Disease Prediction with Brain-wide Imaging Transcriptomics Data

    Authors: Shan Cong, Zhoujie Fan, Hongwei Liu, Yinghan Zhang, Xin Wang, Haoran Luo, Xiaohui Yao

    Abstract: Brain transcriptomics provides insights into the molecular mechanisms by which the brain coordinates its functions and processes. However, existing multimodal methods for predicting Alzheimer's disease (AD) primarily rely on imaging and sometimes genetic data, often neglecting the transcriptomic basis of brain. Furthermore, while striving to integrate complementary information between modalities,… ▽ More

    Submitted 2 April, 2025; v1 submitted 21 June, 2024; originally announced June 2024.

  17. arXiv:2405.19925  [pdf, other

    eess.SP

    Integrated Sensing and Communications Framework for 6G Networks

    Authors: Hongliang Luo, Tengyu Zhang, Chuanbin Zhao, Yucong Wang, Bo Lin, Yuhua Jiang, Dongqi Luo, Feifei Gao

    Abstract: In this paper, we propose a novel integrated sensing and communications (ISAC) framework for the sixth generation (6G) mobile networks, in which we decompose the real physical world into static environment, dynamic targets, and various object materials. The ubiquitous static environment occupies the vast majority of the physical world, for which we design static environment reconstruction (SER) sc… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  18. arXiv:2405.17250  [pdf, ps, other

    cs.RO eess.SY

    "Pass the butter": A study on desktop-classic multitasking robotic arm based on advanced YOLOv7 and BERT

    Authors: Haohua Que, Wenbin Pan, Jie Xu, Hao Luo, Pei Wang, Li Zhang

    Abstract: In recent years, various intelligent autonomous robots have begun to appear in daily life and production. Desktop-level robots are characterized by their flexible deployment, rapid response, and suitability for light workload environments. In order to meet the current societal demand for service robot technology, this study proposes using a miniaturized desktop-level robot (by ROS) as a carrier, l… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  19. arXiv:2405.07115  [pdf, other

    eess.SP cs.IT

    Digital Twin Aided Compressive Sensing: Enabling Site-Specific MIMO Hybrid Precoding

    Authors: Hao Luo, Ahmed Alkhateeb

    Abstract: Compressive sensing is a promising solution for the channel estimation in multiple-input multiple-output (MIMO) systems with large antenna arrays and constrained hardware. Utilizing site-specific channel data from real-world systems, deep learning can be employed to learn the compressive sensing measurement vectors with minimum redundancy, thereby focusing sensing power on promising spatial direct… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures

  20. arXiv:2403.10832  [pdf, other

    cs.IT eess.SP

    Joint Power Allocation and Beamforming for In-band Full-duplex Multi-cell Multi-user Networks

    Authors: Haifeng Luo, Navneet Garg, Mark Holm, Tharmalingam Ratnarajah

    Abstract: This paper investigates a robust joint power allocation and beamforming scheme for in-band full-duplex multi-cell multi-user (IBFD-MCMU) networks. A mean-squared error (MSE) minimization problem is formulated with constraints on the power budgets and residual self-interference (RSI) power. The problem is not convex, so we decompose it into two sub-problems: interference management beamforming and… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  21. arXiv:2403.06720  [pdf, other

    cs.IT eess.SP

    On the Secrecy Rate of In-Band Full-duplex Two-way Wiretap Channel

    Authors: Navneet Garg, Haifeng Luo, Tharmalingam Ratnarajah

    Abstract: In this paper, we consider a two-way wiretap Multi-Input Multi-Output Multi-antenna Eve (MIMOME) channel, where both nodes (Alice and Bob) transmit and receive in an in-band full-duplex (IBFD) manner. For this system with keyless security, we provide a novel artificial noise (AN) based signal design, where the AN is injected in both signal and null spaces. We present an ergodic secrecy rate approx… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  22. arXiv:2402.17268  [pdf, other

    eess.SY

    Reinforcement Learning Based Robust Volt/Var Control in Active Distribution Networks With Imprecisely Known Delay

    Authors: Hong Cheng, Huan Luo, Zhi Liu, Wei Sun, Weitao Li, Qiyue Li

    Abstract: Active distribution networks (ADNs) incorporating massive photovoltaic (PV) devices encounter challenges of rapid voltage fluctuations and potential violations. Due to the fluctuation and intermittency of PV generation, the state gap, arising from time-inconsistent states and exacerbated by imprecisely known system delays, significantly impacts the accuracy of voltage control. This paper addresses… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  23. arXiv:2401.15919  [pdf, other

    eess.SP cs.IT

    Integrated Imaging and Communication with Reconfigurable Intelligent Surfaces

    Authors: Hao Luo, Ahmed Alkhateeb

    Abstract: Reconfigurable intelligent surfaces, with their large number of antennas, offer an interesting opportunity for high spatial-resolution imaging. In this paper, we propose a novel RIS-aided integrated imaging and communication system that can reduce the RIS beam training overhead for communication by leveraging the imaging of the surrounding environment. In particular, using the RIS as a wireless im… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 6 pages, 4 figures. To appear in Asilomar 2023

  24. arXiv:2401.09761  [pdf, other

    eess.SP cs.IT

    ISAC with Backscattering RFID Tags: Joint Beamforming Design

    Authors: Hao Luo, Umut Demirhan, Ahmed Alkhateeb

    Abstract: In this paper, we explore an integrated sensing and communication (ISAC) system with backscattering RFID tags. In this setup, an access point employs a communication beam to serve a user while leveraging a sensing beam to detect an RFID tag. Under the total transmit power constraint of the system, our objective is to design sensing and communication beams by considering the tag detection and commu… ▽ More

    Submitted 31 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 5 pages, 5 figures. To appear in IEEE ICC 2024

  25. arXiv:2312.16441  [pdf, other

    eess.SP

    6D Radar Sensing and Tracking in Monostatic Integrated Sensing and Communications System

    Authors: Hongliang Luo, Feifei Gao, Fan Liu, Shi Jin

    Abstract: In this paper, we propose a novel scheme for sixdimensional (6D) radar sensing and tracking of dynamic target based on multiple input and multiple output (MIMO) array for monostatic integrated sensing and communications (ISAC) system. Unlike most existing ISAC studies believing that only the radial velocity of far-field dynamic target can be measured based on one single base station (BS), we find… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  26. arXiv:2311.01700  [pdf, other

    eess.SP

    Moving Target Sensing for ISAC Systems in Clutter Environment

    Authors: Dongqi Luo, Huihui Wu, Hongliang Luo, Bo Lin, Feifei Gao

    Abstract: In this paper, we consider the moving target sensing problem for integrated sensing and communication (ISAC) systems in clutter environment. Scatterers produce strong clutter, deteriorating the performance of ISAC systems in practice. Given that scatterers are typically stationary and the targets of interest are usually moving, we here focus on sensing the moving targets. Specifically, we adopt a… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  27. arXiv:2311.01674  [pdf, other

    eess.SP

    Integrated Sensing and Communications in Clutter Environment

    Authors: Hongliang Luo, Yucong Wang, Dongqi Luo, Jianwei Zhao, Huihui Wu, Shaodan Ma, Feifei Gao

    Abstract: In this paper, we propose a practical integrated sensing and communications (ISAC) framework to sense dynamic targets from clutter environment while ensuring users communications quality. To implement communications function and sensing function simultaneously, we design multiple communications beams that can communicate with the users as well as one sensing beam that can rotate and scan the entir… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  28. arXiv:2310.17997  [pdf

    physics.optics cs.AI eess.IV

    Deep Learning Enables Large Depth-of-Field Images for Sub-Diffraction-Limit Scanning Superlens Microscopy

    Authors: Hui Sun, Hao Luo, Feifei Wang, Qingjiu Chen, Meng Chen, Xiaoduo Wang, Haibo Yu, Guanglie Zhang, Lianqing Liu, Jianping Wang, Dapeng Wu, Wen Jung Li

    Abstract: Scanning electron microscopy (SEM) is indispensable in diverse applications ranging from microelectronics to food processing because it provides large depth-of-field images with a resolution beyond the optical diffraction limit. However, the technology requires coating conductive films on insulator samples and a vacuum environment. We use deep learning to obtain the mapping relationship between op… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 13 pages,7 figures

  29. arXiv:2310.03748  [pdf

    eess.SP cs.HC cs.LG

    Phase Synchrony Component Self-Organization in Brain Computer Interface

    Authors: Xu Niu, Na Lu, Huan Luo, Ruofan Yan

    Abstract: Phase synchrony information plays a crucial role in analyzing functional brain connectivity and identifying brain activities. A widely adopted feature extraction pipeline, composed of preprocessing, selection of EEG acquisition channels, and phase locking value (PLV) calculation, has achieved success in motor imagery classification (MI). However, this pipeline is manual and reliant on expert knowl… ▽ More

    Submitted 11 October, 2023; v1 submitted 21 September, 2023; originally announced October 2023.

  30. arXiv:2309.14405  [pdf, other

    cs.SD cs.AI eess.AS

    Joint Audio and Speech Understanding

    Authors: Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James Glass

    Abstract: Humans are surrounded by audio signals that include both speech and non-speech sounds. The recognition and understanding of speech and non-speech audio events, along with a profound comprehension of the relationship between them, constitute fundamental cognitive capabilities. For the first time, we build a machine learning model, called LTU-AS, that has a conceptually similar universal audio perce… ▽ More

    Submitted 10 December, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at ASRU 2023. Code, dataset, and pretrained models are at https://github.com/yuangongnd/ltu. Interactive demo at https://huggingface.co/spaces/yuangongfdu/ltu-2

  31. arXiv:2309.14012  [pdf, other

    eess.SP

    Beam Squint Assisted User Localization in Near-Field Integrated Sensing and Communications Systems

    Authors: Hongliang Luo, Feifei Gao, Wanmai Yuan, Shun Zhang

    Abstract: Integrated sensing and communication (ISAC) has been regarded as a key technology for 6G wireless communications, in which large-scale multiple input and multiple output (MIMO) array with higher and wider frequency bands will be adopted. However, recent studies show that the beam squint phenomenon can not be ignored in wideband MIMO system, which generally deteriorates the communications performan… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted by IEEE Transactions on Wireless Communications (TWC) on 18 September 2023

  32. arXiv:2308.01558  [pdf, other

    eess.SP

    Millimeter Wave V2V Beam Tracking using Radar: Algorithms and Real-World Demonstration

    Authors: Hao Luo, Umut Demirhan, Ahmed Alkhateeb

    Abstract: Utilizing radar sensing for assisting communication has attracted increasing interest thanks to its potential in dynamic environments. A particularly interesting problem for this approach appears in the vehicle-to-vehicle (V2V) millimeter wave and terahertz communication scenarios, where the narrow beams change with the movement of both vehicles. To address this problem, in this work, we develop a… ▽ More

    Submitted 27 October, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: 5 pages, 5 figures. To appear in EUSIPCO 2023. The dataset is available on the DeepSense 6G website http://deepsense6g.net/

  33. arXiv:2305.12064  [pdf, other

    eess.SP

    YOLO: An Efficient Terahertz Band Integrated Sensing and Communications Scheme with Beam Squint

    Authors: Hongliang Luo, Feifei Gao, Hai Lin, Shaodan Ma, H. Vincent Poor

    Abstract: Using communications signals for dynamic target sensing is an important component of integrated sensing and communications (ISAC). In this paper, we propose to utilize the beam squint effect to realize fast non-cooperative dynamic target sensing in massive multiple input and multiple output (MIMO) Terahertz band communications systems. Specifically, we construct a wideband channel model of the ech… ▽ More

    Submitted 5 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted by IEEE Transactions on Wireless Communications (TWC)

  34. arXiv:2305.11013  [pdf, other

    cs.SD cs.CL eess.AS

    FunASR: A Fundamental End-to-End Speech Recognition Toolkit

    Authors: Zhifu Gao, Zerui Li, Jiaming Wang, Haoneng Luo, Xian Shi, Mengzhe Chen, Yabin Li, Lingyun Zuo, Zhihao Du, Zhangyu Xiao, Shiliang Zhang

    Abstract: This paper introduces FunASR, an open-source speech recognition toolkit designed to bridge the gap between academic research and industrial applications. FunASR offers models trained on large-scale industrial corpora and the ability to deploy them in applications. The toolkit's flagship model, Paraformer, is a non-autoregressive end-to-end speech recognition model that has been trained on a manual… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 5 pages, 3 figures, accepted by INTERSPEECH 2023

  35. arXiv:2305.10790  [pdf, other

    eess.AS cs.SD

    Listen, Think, and Understand

    Authors: Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James Glass

    Abstract: The ability of artificial intelligence (AI) systems to perceive and comprehend audio signals is crucial for many applications. Although significant progress has been made in this area since the development of AudioSet, most existing models are designed to map audio inputs to pre-defined, discrete sound label sets. In contrast, humans possess the ability to not only classify sounds into general cat… ▽ More

    Submitted 19 February, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted at ICLR 2024. Code, dataset, and models are available at https://github.com/YuanGongND/ltu. The interactive demo is at https://huggingface.co/spaces/yuangongfdu/ltu

  36. arXiv:2305.10680  [pdf, other

    cs.SD cs.CL eess.AS

    Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System

    Authors: Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan

    Abstract: Estimating confidence scores for recognition results is a classic task in ASR field and of vital importance for kinds of downstream tasks and training strategies. Previous end-to-end~(E2E) based confidence estimation models (CEM) predict score sequences of equal length with input transcriptions, leading to unreliable estimation when deletion and insertion errors occur. In this paper we proposed CI… ▽ More

    Submitted 24 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 5 pages, 4 figures, Interspeech2023

  37. Inflation Reduction Act impacts on the economics of clean hydrogen and liquid fuels

    Authors: Fangwei Cheng, Hongxi Luo, Jesse D. Jenkins, Eric D. Larson

    Abstract: The Inflation Reduction Act (IRA) in the United States provides unprecedented incentives for deploying low-carbon hydrogen and liquid fuels, among other low greenhouse gas (GHG) emissions technologies. To better understand the prospective competitiveness of low-carbon or negative-carbon hydrogen and liquid fuels under the IRA in the early 2030s, we examine the impacts of IRA provisions on costs of… ▽ More

    Submitted 14 August, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  38. arXiv:2304.13244  [pdf

    cs.NI eess.SP

    ESCM: An Efficient and Secure Communication Mechanism for UAV Networks

    Authors: Haoxiang Luo, Yifan Wu, Gang Sun, Hongfang Yu, Mohsen Guizani

    Abstract: UAV (unmanned aerial vehicle) is rapidly gaining traction in various human activities and has become an integral component of the satellite-air-ground-sea (SAGS) integrated network. As high-speed moving objects, UAVs not only have extremely strict requirements for communication delay, but also cannot be maliciously controlled as a weapon by the attacker. Therefore, an efficient and secure communic… ▽ More

    Submitted 16 June, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  39. arXiv:2304.08697  [pdf

    cs.NI cs.PF eess.SP

    Performance Analysis and Comparison of Non-ideal Wireless PBFT and RAFT Consensus Networks in 6G Communications

    Authors: Haoxiang Luo, Xiangyue Yang, Hongfang Yu, Gang Sun, Bo Lei, Mohsen Guizani

    Abstract: Due to advantages in security and privacy, blockchain is considered a key enabling technology to support 6G communications. Practical Byzantine Fault Tolerance (PBFT) and RAFT are seen as the most applicable consensus mechanisms (CMs) in blockchain-enabled wireless networks. However, previous studies on PBFT and RAFT rarely consider the channel performance of the physical layer, such as path loss… ▽ More

    Submitted 2 August, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2303.15759

  40. arXiv:2303.15759  [pdf

    cs.NI eess.SP

    Performance Analysis of Non-ideal Wireless PBFT Networks with mmWave and Terahertz Signals

    Authors: Haoxiang Luo, Xiangyue Yang, Hongfang Yu, Gang Sun, Shizhong Xu, Long Luo

    Abstract: Due to advantages in security and privacy, blockchain is considered a key enabling technology to support 6G communications. Practical Byzantine Fault Tolerance (PBFT) is seen as the most applicable consensus mechanism in blockchain-enabled wireless networks. However, previous studies on PBFT do not consider the channel performance of the physical layer, such as path loss and channel fading, result… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: IEEE International Conference on Metaverse Computing, Networking and Applications (MetaCom) 2023

  41. arXiv:2302.11249  [pdf, ps, other

    eess.SP

    RIS-Aided Integrated Sensing and Communication: Joint Beamforming and Reflection Design

    Authors: Honghao Luo, Rang Liu, Ming Li, Qian Liu

    Abstract: Integrated sensing and communication (ISAC) has been envisioned as a promising technique to alleviate the spectrum congestion problem. Inspired by the applications of reconfigurable intelligent surface (RIS) in dynamically manipulating wireless propagation environment, in this paper, we investigate to deploy a RIS in an ISAC system to pursue performance improvement. Particularly, we consider a RIS… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE TVT

  42. arXiv:2302.10686  [pdf, other

    cs.SD cs.AI eess.AS

    Interpretable Spectrum Transformation Attacks to Speaker Recognition

    Authors: Jiadi Yao, Hong Luo, Xiao-Lei Zhang

    Abstract: The success of adversarial attacks to speaker recognition is mainly in white-box scenarios. When applying the adversarial voices that are generated by attacking white-box surrogate models to black-box victim models, i.e. \textit{transfer-based} black-box attacks, the transferability of the adversarial voices is not only far from satisfactory, but also lacks interpretable basis. To address these is… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  43. arXiv:2302.09332  [pdf, other

    eess.SP

    Incipient Fault Detection in Power Distribution System: A Time-Frequency Embedded Deep Learning Based Approach

    Authors: Qiyue Li, Huan Luo, Hong Cheng, Yuxing Deng, Wei Sun, Weitao Li, Zhi Liu

    Abstract: Incipient fault detection in power distribution systems is crucial to improve the reliability of the grid. However, the non-stationary nature and the inadequacy of the training dataset due to the self-recovery of the incipient fault signal, make the incipient fault detection in power distribution systems a great challenge. In this paper, we focus on incipient fault detection in power distribution… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: 15 pages

  44. arXiv:2211.12956   

    eess.SY cs.AI cs.LG

    Reinforcement learning for traffic signal control in hybrid action space

    Authors: Haoqing Luo, sheng jin

    Abstract: The prevailing reinforcement-learning-based traffic signal control methods are typically staging-optimizable or duration-optimizable, depending on the action spaces. In this paper, we propose a novel control architecture, TBO, which is based on hybrid proximal policy optimization. To the best of our knowledge, TBO is the first RL-based algorithm to implement synchronous optimization of the staging… ▽ More

    Submitted 25 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: There are serious problems with the innovation of the paper

  45. arXiv:2211.08210  [pdf, other

    eess.SP cs.IT

    Reconfigurable Intelligent Surface Aided Wireless Sensing for Scene Depth Estimation

    Authors: Abdelrahman Taha, Hao Luo, Ahmed Alkhateeb

    Abstract: Current scene depth estimation approaches mainly rely on optical sensing, which carries privacy concerns and suffers from estimation ambiguity for distant, shiny, and transparent surfaces/objects. Reconfigurable intelligent surfaces (RISs) provide a path for employing a massive number of antennas using low-cost and energy-efficient architectures. This has the potential for realizing RIS-aided wire… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Submitted to IEEE

  46. 3D Matting: A Benchmark Study on Soft Segmentation Method for Pulmonary Nodules Applied in Computed Tomography

    Authors: Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Yi Luo, Huan Luo, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, Zongyuan Ge

    Abstract: Usually, lesions are not isolated but are associated with the surrounding tissues. For example, the growth of a tumour can depend on or infiltrate into the surrounding tissues. Due to the pathological nature of the lesions, it is challenging to distinguish their boundaries in medical imaging. However, these uncertain regions may contain diagnostic information. Therefore, the simple binarization of… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted by Computers in Biology and Medicine. arXiv admin note: substantial text overlap with arXiv:2209.07843

  47. arXiv:2208.01854  [pdf, other

    eess.SP

    Joint Beamforming Design for RIS-Assisted Integrated Sensing and Communication Systems

    Authors: Honghao Luo, Rang Liu, Ming Li, Yang Liu, Qian Liu

    Abstract: Integrated sensing and communication (ISAC) has been envisioned as a promising technology to tackle the spectrum congestion problem for future networks. In this correspondence, we investigate to deploy a reconfigurable intelligent surface (RIS) in an ISAC system for achieving better performance. In particular, a multi-antenna base station (BS) simultaneously serves multiple single-antenna users wi… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted by IEEE TVT

  48. arXiv:2207.00474  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling

    Authors: Jiamin Liang, Xin Yang, Yuhao Huang, Kai Liu, Xinrui Zhou, Xindi Hu, Zehui Lin, Huanjia Luo, Yuanji Zhang, Yi Xiong, Dong Ni

    Abstract: Ultrasound (US) is widely used for its advantages of real-time imaging, radiation-free and portability. In clinical practice, analysis and diagnosis often rely on US sequences rather than a single image to obtain dynamic anatomical information. This is challenging for novices to learn because practicing with adequate videos from patients is clinically unpractical. In this paper, we propose a novel… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI 2022

  49. arXiv:2206.08518  [pdf, ps, other

    eess.SP

    Integrated Sensing and Communication with Reconfigurable Intelligent Surfaces: Opportunities, Applications, and Future Directions

    Authors: Rang Liu, Ming Li, Honghao Luo, Qian Liu, A. Lee Swindlehurst

    Abstract: Integrated sensing and communication (ISAC) is emerging as a key enabler to address the growing spectrum congestion problem and satisfy increasing demands for ubiquitous sensing and communication. By sharing various resources and information, ISAC achieves much higher spectral, energy, hardware, and economic efficiencies. Concurrently, reconfigurable intelligent surface (RIS) technology has been d… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: submitted to IEEE journal

  50. arXiv:2205.11392  [pdf, other

    eess.SP

    Beam Squint Assisted User Localization in Near-Field Communications Systems

    Authors: Hongliang Luo, Feifei Gao

    Abstract: The beam squint phenomenon in massive multi-input and multi-output wideband communications has been widely concerned recently, which generally deteriorates the beamforming performance. In this paper, we find that with the aid of the time-delay lines (TDs), the range and trajectory of the beam squint of a near-field communications system can be freely controlled, and hence it is possible to reverse… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.