Showing 1–50 of 141 results for author: Zhang, A

Searching in archive eess. Search in all archives.
  1. arXiv:2505.10174  [pdf, ps, other]

    eess.SP

    Subspace-Based Super-Resolution Sensing for Bi-Static ISAC with Clock Asynchronism

    Authors: Jingbo Zhao, Zhaoming Lu, J. Andrew Zhang, Jiaxi Zhou, Weicai Li, Tao Gu

    Abstract: Bi-static sensing is an attractive configuration for integrated sensing and communications (ISAC) systems; however, clock asynchronism between widely separated transmitters and receivers introduces time-varying time offsets (TO) and phase offsets (PO), posing significant challenges. This paper introduces a signal-subspace-based framework that estimates decoupled angles, delays, and complex gain se… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 13 pages, 9 figures. This work has been submitted to the IEEE for possible publication

  2. arXiv:2505.04105  [pdf]

    eess.IV cs.CV

    MAISY: Motion-Aware Image SYnthesis for Medical Image Motion Correction

    Authors: Andrew Zhang, Hao Wang, Shuchang Ye, Michael Fulham, Jinman Kim

    Abstract: Patient motion during medical image acquisition causes blurring, ghosting, and distorts organs, which makes image interpretation challenging. Current state-of-the-art algorithms using Generative Adversarial Network (GAN)-based methods with their ability to learn the mappings between corrupted images and their ground truth via Structural Similarity Index Measure (SSIM) loss effectively generate mot… ▽ More

    Submitted 8 May, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

  3. arXiv:2505.00742  [pdf, other]

    cs.CV cs.AI eess.IV

    Zoomer: Adaptive Image Focus Optimization for Black-box MLLM

    Authors: Jiaxu Qian, Chendong Wang, Yifan Yang, Chaoyun Zhang, Huiqiang Jiang, Xufang Luo, Yu Kang, Qingwei Lin, Anlan Zhang, Shiqi Jiang, Ting Cao, Tianjun Mao, Suman Banerjee, Guyue Liu, Saravan Rajmohan, Dongmei Zhang, Yuqing Yang, Qi Zhang, Lili Qiu

    Abstract: Recent advancements in multimodal large language models (MLLMs) have broadened the scope of vision-language tasks, excelling in applications like image captioning and interactive question-answering. However, these models struggle with accurately processing visual data, particularly in tasks requiring precise object recognition and fine visual details. Stringent token limits often result in the omi… ▽ More

    Submitted 29 April, 2025; originally announced May 2025.

  4. arXiv:2504.15600  [pdf, other]

    cs.RO eess.SY

    Research on Navigation Methods Based on LLMs

    Authors: Anlong Zhang, Jianmin Ji

    Abstract: In recent years, the field of indoor navigation has witnessed groundbreaking advancements through the integration of Large Language Models (LLMs). Traditional navigation approaches relying on pre-built maps or reinforcement learning exhibit limitations such as poor generalization and limited adaptability to dynamic environments. In contrast, LLMs offer a novel paradigm for complex indoor navigatio… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  5. arXiv:2504.15042  [pdf, ps, other]

    eess.SP

    Bayesian Sensing for Time-Varying Channels in ISAC Systems

    Authors: Xueyang Wang, Kai Wu, J. Andrew Zhang, Shiqi Gong, Chengwen Xing

    Abstract: Future mobile networks are projected to support integrated sensing and communications in high-speed communication scenarios. Nevertheless, large Doppler shifts induced by time-varying channels may cause severe inter-carrier interference (ICI). Frequency domain shows the potential of reducing ISAC complexity as compared with other domains. However, parameter mismatching issue still exists for such… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 14 pages, 8 figures, manuscript submitted to IEEE Transactions on Communications (TCOM)

  6. arXiv:2504.14076  [pdf, other]

    cs.SD cs.LG eess.AS

    Transformation of audio embeddings into interpretable, concept-based representations

    Authors: Alice Zhang, Edison Thomaz, Lie Lu

    Abstract: Advancements in audio neural networks have established state-of-the-art results on downstream audio tasks. However, the black-box structure of these models makes it difficult to interpret the information encoded in their internal audio representations. In this work, we explore the semantic interpretability of audio embeddings extracted from these neural networks by leveraging CLAP, a contrastive l… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: Accepted to International Joint Conference on Neural Networks (IJCNN) 2025

  7. arXiv:2504.13131  [pdf, other]

    eess.IV cs.AI cs.CV

    NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results

    Authors: Xin Li, Kun Yuan, Bingchen Li, Fengbin Guan, Yizhen Shao, Zihao Yu, Xijun Wang, Yiting Lu, Wei Luo, Suhang Yao, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Yabin Zhang, Ao-Xiang Zhang, Tianwu Zhi, Jianzhao Liu, Yang Li, Jingwen Xu, Yiting Liao, Yushen Zuo, Mingyang Wu, Renjie Li, Shengyun Zhong, et al. (88 additional authors not shown)

    Abstract: This paper presents a review for the NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement. The challenge comprises two tracks: (i) Efficient Video Quality Assessment (KVQ), and (ii) Diffusion-based Image Super-Resolution (KwaiSR). Track 1 aims to advance the development of lightweight and efficient video quality assessment (VQA) models, with an emphasis on eliminating re… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of NTIRE 2025; Methods from 18 Teams; Accepted by CVPR Workshop; 21 pages

  8. arXiv:2504.06830  [pdf, other]

    eess.SP

    Integrated Sensing and Communications Over the Years: An Evolution Perspective

    Authors: Di Zhang, Yuanhao Cui, Xiaowen Cao, Nanchi Su, Fan Liu, Xiaojun Jing, J. Andrew Zhang, Jie Xu, Christos Masouros, Dusit Niyato, Marco Di Renzo

    Abstract: Integrated Sensing and Communications (ISAC) enables efficient spectrum utilization and reduces hardware costs for beyond 5G (B5G) and 6G networks, facilitating intelligent applications that require both high-performance communication and precise sensing capabilities. This survey provides a comprehensive review of the evolution of ISAC over the years. We examine the expansion of the spectrum acros… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

  9. arXiv:2503.22140  [pdf, other]

    eess.IV cs.CV eess.SP

    Score-Based Turbo Message Passing for Plug-and-Play Compressive Image Recovery

    Authors: Chang Cai, Xiaojun Yuan, Ying-Jun Angela Zhang

    Abstract: Message passing algorithms have been tailored for compressive imaging applications by plugging in different types of off-the-shelf image denoisers. These off-the-shelf denoisers mostly rely on some generic or hand-crafted priors for denoising. Due to their insufficient accuracy in capturing the true image prior, these methods often fail to produce satisfactory results, especially in largely underd… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  10. arXiv:2503.11102  [pdf, other]

    eess.SP

    Deep Learning-based OTFS Channel Estimation and Symbol Detection with Plug and Play Framework

    Authors: Xiaoqi Zhang, Zhitong Ni, Weijie Yuan, J. Andrew Zhang

    Abstract: Orthogonal Time Frequency Space (OTFS) modulation has recently attracted significant interest due to its potential for enabling reliable communication in high-mobility environments. One of the challenges for OTFS receivers is the fractional Doppler that occurs in practical systems, resulting in decreased channel sparsity, and then inaccurate channel estimation and high-complexity equalization. In… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  11. arXiv:2503.10107  [pdf, other]

    eess.SP

    High-Resolution Uplink Sensing in Millimeter-Wave ISAC Systems

    Authors: Liangbin Zhao, Zhitong Ni, Yimeng Feng, Jianguo Li, Xiangyuan Bu, J. Andrew Zhang

    Abstract: Perceptive mobile networks (PMNs), integrating ubiquitous sensing capabilities into mobile networks, represent an important application of integrated sensing and communication (ISAC) in 6G. In this paper, we propose a practical framework for uplink sensing of angle-of-arrival (AoA), Doppler, and delay in millimeter-wave (mmWave) communication systems, which addresses challenges posed by clock asyn… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 15 pages, 8 figures

  12. arXiv:2502.12151  [pdf, other]

    cs.CV eess.SY

    VoLUT: Efficient Volumetric streaming enhanced by LUT-based super-resolution

    Authors: Chendong Wang, Anlan Zhang, Yifan Yang, Lili Qiu, Yuqing Yang, Xinyang Jiang, Feng Qian, Suman Banerjee

    Abstract: 3D volumetric video provides immersive experience and is gaining traction in digital media. Despite its rising popularity, the streaming of volumetric video content poses significant challenges due to the high data bandwidth requirement. A natural approach to mitigate the bandwidth issue is to reduce the volumetric video's data rate by downsampling the content prior to transmission. The video can… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  13. arXiv:2502.01885  [pdf]

    cs.LG cs.AI eess.IV

    A Privacy-Preserving Domain Adversarial Federated learning for multi-site brain functional connectivity analysis

    Authors: Yipu Zhang, Likai Wang, Kuan-Jui Su, Aiying Zhang, Hao Zhu, Xiaowen Liu, Hui Shen, Vince D. Calhoun, Yuping Wang, Hongwen Deng

    Abstract: Resting-state functional magnetic resonance imaging (rs-fMRI) and its derived functional connectivity networks (FCNs) have become critical for understanding neurological disorders. However, collaborative analyses and the generalizability of models still face significant challenges due to privacy regulations and the non-IID (non-independent and identically distributed) property of multiple data sou… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: 34 pages, 13 figures

  14. arXiv:2501.06750  [pdf, ps, other]

    eess.SP

    Multi-Carrier Faster-Than-Nyquist Signaling for OTFS Systems

    Authors: Xueyang Wang, Shiqi Gong, Wenqian Shen, Chengwen Xing, J. Andrew Zhang

    Abstract: Orthogonal time frequency space (OTFS) modulation technique is promising for high-mobility applications to achieve reliable communications. However, the capacity of OTFS systems is generally limited by the Nyquist criterion, requiring orthogonal pulses in both time and frequency domains. In this paper, we propose a novel multi-carrier faster-than-Nyquist (MC-FTN) signaling scheme for OTFS systems.… ▽ More

    Submitted 12 January, 2025; originally announced January 2025.

    Comments: 14 pages, 9 figures, manuscript submitted to IEEE Transactions on Vehicular Technology (TVT) on May 10, 2024

  15. arXiv:2412.00317  [pdf, other]

    eess.SP

    Joint Coverage and Electromagnetic Field Exposure Analysis in Downlink and Uplink for RIS-assisted Networks

    Authors: Lin Chen, Ahmed Elzanaty, Mustafa A Kishk, Ying-Jun Angela Zhang

    Abstract: Reconfigurable intelligent surfaces (RISs) have shown the potential to improve signal-to-interference-plus-noise ratio (SINR) related coverage, especially at high-frequency communications. However, assessing electromagnetic field exposure (EMFE) and establishing EMFE regulations in RIS-assisted large-scale networks are still open issues. This paper proposes a framework to characterize SINR and EMF… ▽ More

    Submitted 29 November, 2024; originally announced December 2024.

  16. arXiv:2411.19666  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG stat.AP

    Multimodal Whole Slide Foundation Model for Pathology

    Authors: Tong Ding, Sophia J. Wagner, Andrew H. Song, Richard J. Chen, Ming Y. Lu, Andrew Zhang, Anurag J. Vaidya, Guillaume Jaume, Muhammad Shaban, Ahrong Kim, Drew F. K. Williamson, Bowen Chen, Cristina Almagro-Perez, Paul Doucet, Sharifa Sahai, Chengkuan Chen, Daisuke Komura, Akihiro Kawabe, Shumpei Ishikawa, Georg Gerber, Tingying Peng, Long Phi Le, Faisal Mahmood

    Abstract: The field of computational pathology has been transformed with recent advances in foundation models that encode histopathology region-of-interests (ROIs) into versatile and transferable feature representations via self-supervised learning (SSL). However, translating these advancements to address complex clinical challenges at the patient and slide level remains constrained by limited clinical data… ▽ More

    Submitted 29 November, 2024; originally announced November 2024.

    Comments: The code is accessible at https://github.com/mahmoodlab/TITAN

  17. Optimizing Fingerprint-Spectrum-Based Synchronization in Integrated Sensing and Communications

    Authors: Xiao-Yang Wang, Shaoshi Yang, Hou-Yu Zhai, Christos Masouros, J. Andrew Zhang

    Abstract: Asynchronous radio transceivers often lead to significant range and velocity ambiguity, posing challenges for precise positioning and velocity estimation in passive-sensing perceptive mobile networks (PMNs). To address this issue, carrier frequency offset (CFO) and time offset (TO) synchronization algorithms have been studied in the literature. However, their performance can be significantly affec… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: This work has been accepted by Globecom 2024. arXiv admin note: substantial text overlap with arXiv:2409.00950

    Journal ref: in Proc. IEEE Global Communications Conference (GLOBECOM 2024), Cape Town, South Africa, Dec. 8-12, 2024, pp. 4016-4021

  18. arXiv:2410.06866  [pdf, other]

    cs.CV eess.IV

    Secure Video Quality Assessment Resisting Adversarial Attacks

    Authors: Ao-Xiang Zhang, Yu Ran, Weixuan Tang, Yuan-Gen Wang, Qingxiao Guan, Chunsheng Yang

    Abstract: The exponential surge in video traffic has intensified the imperative for Video Quality Assessment (VQA). Leveraging cutting-edge architectures, current VQA models have achieved human-comparable accuracy. However, recent studies have revealed the vulnerability of existing VQA models against adversarial attacks. To establish a reliable and practical assessment system, a secure VQA model capable of… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  19. arXiv:2410.04225  [pdf, other]

    eess.IV cs.CV cs.MM

    AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results

    Authors: Ivan Molodetskikh, Artem Borisov, Dmitriy Vatolin, Radu Timofte, Jianzhao Liu, Tianwu Zhi, Yabin Zhang, Yang Li, Jingwen Xu, Yiting Liao, Qing Luo, Ao-Xiang Zhang, Peng Zhang, Haibo Lei, Linyan Jiang, Yaqing Li, Yuqin Cao, Wei Sun, Weixia Zhang, Yinan Sun, Ziheng Jia, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai, Weihua Luo, et al. (2 additional authors not shown)

    Abstract: This paper presents the Video Super-Resolution (SR) Quality Assessment (QA) Challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2024. The task of this challenge was to develop an objective QA method for videos upscaled 2x and 4x by modern image- and video-SR algorithms. QA methods were evaluated by comparing their output with aggregate subjec… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: 18 pages, 7 figures

  20. Windowing Optimization for Fingerprint-Spectrum-Based Passive Sensing in Perceptive Mobile Networks

    Authors: Xiao-Yang Wang, Shaoshi Yang, Hou-Yu Zhai, Christos Masouros, J. Andrew Zhang

    Abstract: Perceptive mobile networks (PMN) have been widely recognized as a pivotal pillar for the sixth generation (6G) mobile communication systems. However, the asynchronicity between transmitters and receivers results in velocity and range ambiguity, which seriously degrades the sensing performance. To mitigate the ambiguity, carrier frequency offset (CFO) and time offset (TO) synchronizations have been… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 16 pages, 12 figures, accepted for publication in IEEE Transactions on Communications, Aug. 2024

  21. arXiv:2408.17397  [pdf, other]

    cs.IT eess.SP

    End-to-End Learning for Task-Oriented Semantic Communications Over MIMO Channels: An Information-Theoretic Framework

    Authors: Chang Cai, Xiaojun Yuan, Ying-Jun Angela Zhang

    Abstract: This paper addresses the problem of end-to-end (E2E) design of learning and communication in a task-oriented semantic communication system. In particular, we consider a multi-device cooperative edge inference system over a wireless multiple-input multiple-output (MIMO) multiple access channel, where multiple devices transmit extracted features to a server to perform a classification task. We formu… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: major revision in IEEE JSAC

  22. arXiv:2408.15481  [pdf, ps, other]

    eess.SP

    Joint Offloading and Beamforming Design in Integrating Sensing, Communication, and Computing Systems: A Distributed Approach

    Authors: Peng Liu, Zesong Fei, Xinyi Wang, Jingxuan Huang, Jie Hu, J. Andrew Zhang

    Abstract: When applying integrated sensing and communications (ISAC) in future mobile networks, many sensing tasks have low latency requirements, preferably being implemented at terminals. However, terminals often have limited computing capabilities and energy supply. In this paper, we investigate the effectiveness of leveraging the advanced computing capabilities of mobile edge computing (MEC) servers and… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 15 pages, 12 figures, submitted to IEEE journals for possible publication

  23. arXiv:2408.13593  [pdf, ps, other]

    eess.SP eess.IV

    Learning Multi-Rate Task-Oriented Communications Over Symmetric Discrete Memoryless Channels

    Authors: Anbang Zhang, Shuaishuai Guo

    Abstract: This letter introduces a multi-rate task-oriented communication (MR-ToC) framework. This framework dynamically adapts to variations in affordable data rate within the communication pipeline. It conceptualizes communication pipelines as symmetric, discrete, memoryless channels. We employ a progressive learning strategy to train the system, comprising a nested codebook for encoding and task inferenc… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

  24. arXiv:2408.11982  [pdf, other]

    eess.IV cs.CV cs.MM

    AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results

    Authors: Maksim Smirnov, Aleksandr Gushchin, Anastasia Antsiferova, Dmitry Vatolin, Radu Timofte, Ziheng Jia, Zicheng Zhang, Wei Sun, Jiaying Qian, Yuqin Cao, Yinan Sun, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai, Kanjar De, Qing Luo, Ao-Xiang Zhang, Peng Zhang, Haibo Lei, Linyan Jiang, Yaqing Li, Wenhui Meng, Zhenzhong Chen, Zhengxue Cheng, Jiahao Xiao, et al. (7 additional authors not shown)

    Abstract: Video quality assessment (VQA) is a crucial task in the development of video compression standards, as it directly impacts the viewer experience. This paper presents the results of the Compressed Video Quality Assessment challenge, held in conjunction with the Advances in Image Manipulation (AIM) workshop at ECCV 2024. The challenge aimed to evaluate the performance of VQA methods on a diverse dat… ▽ More

    Submitted 22 October, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

  25. Semantic-Enabled 6G Communication: A Task-oriented and Privacy-preserving Perspective

    Authors: Shuaishuai Guo, Anbang Zhang, Yanhu Wang, Chenyuan Feng, Tony Q. S. Quek

    Abstract: Task-oriented semantic communication (ToSC) emerges as an innovative approach in the 6G landscape, characterized by the transmission of only vital information that is directly pertinent to a specific task. While ToSC offers an efficient mode of communication, it concurrently raises concerns regarding privacy, as sophisticated adversaries might possess the capability to reconstruct the original dat… ▽ More

    Submitted 2 April, 2025; v1 submitted 7 August, 2024; originally announced August 2024.

    Journal ref: IEEE Network 2025

  26. arXiv:2408.02859  [pdf, other]

    eess.IV cs.AI cs.CV

    Multistain Pretraining for Slide Representation Learning in Pathology

    Authors: Guillaume Jaume, Anurag Vaidya, Andrew Zhang, Andrew H. Song, Richard J. Chen, Sharifa Sahai, Dandan Mo, Emilio Madrigal, Long Phi Le, Faisal Mahmood

    Abstract: Developing self-supervised learning (SSL) models that can learn universal and transferable representations of H&E gigapixel whole-slide images (WSIs) is becoming increasingly valuable in computational pathology. These models hold the potential to advance critical tasks such as few-shot classification, slide retrieval, and patient stratification. Existing approaches for slide representation learnin… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: ECCV'24

  27. arXiv:2407.21514  [pdf]

    eess.SP

    Wireless Communications in Doubly Selective Channels with Domain Adaptivity

    Authors: J. Andrew Zhang, Hongyang Zhang, Kai Wu, Xiaojing Huang, Jinhong Yuan, Y. Jay Guo

    Abstract: Wireless communications are significantly impacted by the propagation environment, particularly in doubly selective channels with variations in both time and frequency domains. Orthogonal Time Frequency Space (OTFS) modulation has emerged as a promising solution; however, its high equalization complexity, if performed in the delay-Doppler domain, limits its universal application. This article expl… ▽ More

    Submitted 30 October, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

    Comments: Magazine article, 7 pages, 4 figures, 2 tables

  28. arXiv:2407.17057  [pdf, other]

    eess.SP

    Efficient Sensing Parameter Estimation with Direct Clutter Mitigation in Perceptive Mobile Networks

    Authors: Hang Li, Hongming Yang, Qinghua Guo, J. Andrew Zhang, Yang Xiang, Yashan Pang

    Abstract: In this work, we investigate sensing parameter estimation in the presence of clutter in perceptive mobile networks (PMNs) that integrate radar sensing into mobile communications. Performing clutter suppression before sensing parameter estimation is generally desirable as the number of sensing parameters can be significantly reduced. However, existing methods require high-complexity clutter mitigat… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  29. arXiv:2407.13491  [pdf, other]

    eess.SP cs.IT

    Performance Analysis and Low-Complexity Beamforming Design for Near-Field Physical Layer Security

    Authors: Yunpu Zhang, Yuan Fang, Changsheng You, Ying-Jun Angela Zhang, Hing Cheung So

    Abstract: Extremely large-scale arrays (XL-arrays) have emerged as a key enabler in achieving the unprecedented performance requirements of future wireless networks, leading to a significant increase in the range of the near-field region. This transition necessitates the spherical wavefront model for characterizing the wireless propagation rather than the far-field planar counterpart, thereby introducing ex… ▽ More

    Submitted 8 April, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 13 pages, 13 figures

  30. arXiv:2406.12426  [pdf, other]

    cs.IT eess.SP

    Multi-Active-IRS-Assisted Cooperative Sensing: Cramér-Rao Bound and Joint Beamforming Design

    Authors: Yuan Fang, Xianghao Yu, Jie Xu, Ying-Jun Angela Zhang

    Abstract: This paper studies the multi-intelligent reflecting surface (IRS)-assisted cooperative sensing, in which multiple active IRSs are deployed in a distributed manner to facilitate multi-view target sensing at the non-line-of-sight (NLoS) area of the base station (BS). Different from prior works employing passive IRSs, we leverage active IRSs with the capability of amplifying the reflected signals to… ▽ More

    Submitted 18 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2404.13536

  31. arXiv:2406.09190  [pdf, other]

    eess.SP

    Rethinking Waveform for 6G: Harnessing Delay-Doppler Alignment Modulation

    Authors: Zhiqiang Xiao, Xianda Liu, Yong Zeng, J. Andrew Zhang, Shi Jin, Rui Zhang

    Abstract: Waveform design has served as a cornerstone for each generation of mobile communication systems. The future sixth-generation (6G) mobile communication networks are expected to employ larger-scale antenna arrays and exploit higher-frequency bands for further boosting data transmission rate and providing ubiquitous wireless sensing. This brings new opportunities and challenges for 6G waveform design… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  32. arXiv:2406.05700  [pdf, other]

    cs.CV eess.IV

    HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model

    Authors: Hang Fu, Genyun Sun, Yinhe Li, Jinchang Ren, Aizhu Zhang, Cheng Jing, Pedram Ghamisi

    Abstract: Haze contamination in hyperspectral remote sensing images (HSI) can lead to spatial visibility degradation and spectral distortion. Haze in HSI exhibits spatial irregularity and inhomogeneous spectral distribution, with few dehazing networks available. Current CNN and Transformer-based dehazing methods fail to balance global scene recovery, local detail retention, and computational efficiency. Ins… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  33. arXiv:2405.16011  [pdf, ps, other]

    eess.SP

    Semantic Importance-Aware Communications with Semantic Correction Using Large Language Models

    Authors: Shuaishuai Guo, Yanhu Wang, Jia Ye, Anbang Zhang, Kun Xu

    Abstract: Semantic communications, a promising approach for agent-human and agent-agent interactions, typically operate at a feature level, lacking true semantic understanding. This paper explores understanding-level semantic communications (ULSC), transforming visual data into human-intelligible semantic content. We employ an image caption neural network (ICNN) to derive semantic representations from visua… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  34. arXiv:2405.10553  [pdf, other]

    eess.SP

    Revealing the Trade-off in ISAC Systems: The KL Divergence Perspective

    Authors: Zesong Fei, Shuntian Tang, Xinyi Wang, Fanghao Xia, Fan Liu, J. Andrew Zhang

    Abstract: Integrated sensing and communication (ISAC) is regarded as a promising technique for 6G communication network. In this letter, we investigate the Pareto bound of the ISAC system in terms of a unified Kullback-Leibler (KL) divergence performance metric. We firstly present the relationship between KL divergence and explicit ISAC performance metric, i.e., demodulation error and probability of detecti… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 5 pages, 5 figures; submitted to IEEE journals for possible publication

  35. arXiv:2404.09149  [pdf, other]

    eess.SY cs.NE math.NA

    Heuristic Solution to Joint Deployment and Beamforming Design for STAR-RIS Aided Networks

    Authors: Bai Yan, Qi Zhao, Jin Zhang, J. Andrew Zhang

    Abstract: This paper tackles the deployment challenges of Simultaneous Transmitting and Reflecting Reconfigurable Intelligent Surface (STAR-RIS) in communication systems. Unlike existing works that use fixed deployment setups or solely optimize the location, this paper emphasizes the joint optimization of the location and orientation of STAR-RIS. This enables searching across all user grouping possibilities… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 30 pages

  36. Interference Management for Full-Duplex ISAC in B5G/6G Networks: Architectures, Challenges, and Solutions

    Authors: Aimin Tang, Xudong Wang, J. Andrew Zhang

    Abstract: Integrated sensing and communications (ISAC) has been visioned as a key technique for B5G/6G networks. To support monostatic sensing, a full-duplex radio is indispensable to extract echo signals from targets. Such a radio can also greatly improve network capacity via full-duplex communications. However, full-duplex radios in existing ISAC designs are mainly focused on wireless sensing, while the a… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Communications Magazine

    Journal ref: IEEE Communications Magazine (Volume: 62, Issue: 9, September 2024)

  37. arXiv:2403.12630  [pdf, other]

    eess.AS cs.SD

    Reproducing the Acoustic Velocity Vectors in a Circular Listening Area

    Authors: Jiarui Wang, Thushara Abhayapala, Jihui Aimee Zhang, Prasanga Samarasinghe

    Abstract: Acoustic velocity vectors are important for human's localization of sound at low frequencies. This paper proposes a sound field reproduction algorithm, which matches the acoustic velocity vectors in a circular listening area. In previous work, acoustic velocity vectors are matched either at sweet spots or on the boundary of the listening area. Methods based on sweet spots experience performance de… ▽ More

    Submitted 4 September, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Submitted to the 17th International Conference on Signal Processing and Communication System (ICSPCS 2024)

  38. arXiv:2403.11940  [pdf, other]

    cs.LG eess.SY

    Multistep Inverse Is Not All You Need

    Authors: Alexander Levine, Peter Stone, Amy Zhang

    Abstract: In real-world control settings, the observation space is often unnecessarily high-dimensional and subject to time-correlated noise. However, the controllable dynamics of the system are often far simpler than the dynamics of the raw observations. It is therefore desirable to learn an encoder to map the observation space to a simpler space of control-relevant variables. In this work, we consider the… ▽ More

    Submitted 6 September, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: RLC 2024

  39. arXiv:2403.05793  [pdf, ps, other]

    eess.SP

    Performance Bounds for Passive Sensing in Asynchronous ISAC Systems -- Appendices

    Authors: Jingbo Zhao, Zhaoming Lu, J. Andrew Zhang, Weicai Li, Yifeng Xiong, Zijun Han, Xiangming Wen, Tao Gu

    Abstract: This document contains the appendices for our paper titled "Performance Bounds for Passive Sensing in Asynchronous ISAC Systems." The appendices include rigorous derivations of key formulas, detailed proofs of the theorems and propositions introduced in the paper, and details of the algorithm tested in the numerical simulation for validation. These appendices aim to support and elaborate on the f… ▽ More

    Submitted 29 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 5 pages

  40. arXiv:2402.17533  [pdf, other]

    cs.CV eess.IV

    Black-box Adversarial Attacks Against Image Quality Assessment Models

    Authors: Yu Ran, Ao-Xiang Zhang, Mingjie Li, Weixuan Tang, Yuan-Gen Wang

    Abstract: The goal of No-Reference Image Quality Assessment (NR-IQA) is to predict the perceptual quality of an image in line with its subjective evaluation. To put the NR-IQA models into practice, it is essential to study their potential loopholes for model refinement. This paper makes the first attempt to explore the black-box adversarial attacks on NR-IQA models. Specifically, we first formulate the atta… ▽ More

    Submitted 28 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  41. arXiv:2402.09048  [pdf, other]

    eess.SP

    Sensing in Bi-Static ISAC Systems with Clock Asynchronism: A Signal Processing Perspective

    Authors: Kai Wu, Jacopo Pegoraro, Francesca Meneghello, J. Andrew Zhang, Jesus O. Lacruz, Joerg Widmer, Francesco Restuccia, Michele Rossi, Xiaojing Huang, Daqing Zhang, Giuseppe Caire, Y. Jay Guo

    Abstract: Integrated Sensing and Communication (ISAC) has been identified as a pillar usage scenario for the impending 6G era. Bi-static sensing, a major type of sensing in ISAC, is promising to expedite ISAC in the near future, as it requires minimal changes to the existing network infrastructure. However, a critical challenge for bi-static sensing is clock asynchronism due to the use of different clocks a…

    Submitted 24 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 20 pages, 6 figures, 1 table

  42. arXiv:2401.15183  [pdf, other]

    q-bio.BM eess.IV

    Moment-based metrics for molecules computable from cryo-EM images

    Authors: Andy Zhang, Oscar Mickelin, Joe Kileel, Eric J. Verbeke, Nicholas F. Marshall, Marc Aurèle Gilles, Amit Singer

    Abstract: Single particle cryogenic electron microscopy (cryo-EM) is an imaging technique capable of recovering the high-resolution 3-D structure of biological macromolecules from many noisy and randomly oriented projection images. One notable approach to 3-D reconstruction, known as Kam's method, relies on the moments of the 2-D images. Inspired by Kam's method, we introduce a rotationally invariant metric…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 21 Pages, 9 Figures, 2 Algorithms, and 3 Tables

  43. arXiv:2401.09119  [pdf, other]

    eess.SP

    Anchor-points Assisted Uplink Sensing in Perceptive Mobile Networks

    Authors: Yanmo Hu, J. Andrew Zhang, Weibo Deng, Y. Jay Guo

    Abstract: Uplink sensing in integrated sensing and communications (ISAC) systems, such as Perceptive Mobile Networks, is challenging due to the clock asynchronism between transmitter and receiver. Existing solutions typically require the presence of a dominating line-of-sight path and the knowledge of transmitter location at the receiver. In this paper, relaxing these requirements, we propose a novel and ef…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 14 pages, 12 figures, journal paper

  44. arXiv:2401.09064  [pdf, other]

    cs.IT eess.SP

    Performance Bounds and Optimization for CSI-Ratio based Bi-static Doppler Sensing in ISAC Systems

    Authors: Yanmo Hu, Kai Wu, J. Andrew Zhang, Weibo Deng, Y. Jay Guo

    Abstract: Bi-static sensing is crucial for exploring the potential of networked sensing capabilities in integrated sensing and communications (ISAC). However, it suffers from the challenging clock asynchronism issue. CSI ratio-based sensing is an effective means to address the issue. Its performance bounds, particularly for Doppler sensing, have not been fully understood yet. This work endeavors to fill the r…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 14 pages, 15 figures, journal paper

  45. arXiv:2401.03473  [pdf, ps, other]

    cs.SD cs.AI eess.AS

    ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

    Authors: He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li

    Abstract: To promote speech processing and recognition research in driving scenarios, we build on the success of the Intelligent Cockpit Speech Recognition Challenge (ICSRC) held at ISCSLP 2022 and launch the ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) Challenge. This challenge collects over 100 hours of multi-channel speech data recorded inside a new energy vehicle and 40 hours…

    Submitted 20 February, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: Accepted at ICASSP 2024

  46. arXiv:2312.09760  [pdf, other]

    eess.AS cs.SD

    U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

    Authors: Ao Zhang, Pan Zhou, Kaixun Huang, Yong Zou, Ming Liu, Lei Xie

    Abstract: Open-vocabulary keyword spotting (KWS), which allows users to customize keywords, has attracted increasing interest. However, existing methods based on acoustic models and post-processing train the acoustic model with ASR training criteria to model all phonemes, making the acoustic model under-optimized for the KWS task. To solve this problem, we propose a novel unified two-pass open-vocabu…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by ASRU2023

  47. Densifying MIMO: Channel Modeling, Physical Constraints, and Performance Evaluation for Holographic Communications

    Authors: Y. Liu, M. Zhang, T. Wang, A. Zhang, M. Debbah

    Abstract: As the backbone of the fifth-generation (5G) cellular network, massive multiple-input multiple-output (MIMO) encounters a significant challenge in practical applications: how to deploy a large number of antenna elements within limited spaces. Recently, holographic communication has emerged as a potential solution to this issue. It employs dense antenna arrays and provides a tractable model. Nevert…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 14 pages, 20 figures, accepted by JSAC-SI-ESIT

  48. arXiv:2310.07141  [pdf, ps, other]

    cs.IT eess.SP

    Time and Frequency Offset Estimation and Intercarrier Interference Cancellation for AFDM Systems

    Authors: Yuankun Tang, Anjie Zhang, Miaowen Wen, Yu Huang, Fei Ji, Jinming Wen

    Abstract: Affine frequency division multiplexing (AFDM) is an emerging multicarrier waveform that offers a potential solution for achieving reliable communications over time-varying channels. This paper proposes two maximum-likelihood (ML) estimators of symbol time offset and carrier frequency offset for AFDM systems. One is called the joint ML estimator, which evaluates the arrival time and carrier frequency o…

    Submitted 28 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: accepted by IEEE Wireless Communications and Networking Conference (WCNC) 2024

  49. arXiv:2310.05444  [pdf, other]

    cs.IT eess.SP

    Waveform Design for MIMO-OFDM Integrated Sensing and Communication System: An Information Theoretical Approach

    Authors: Zhiqing Wei, Jinghui Piao, Xin Yuan, Huici Wu, J. Andrew Zhang, Zhiyong Feng, Lin Wang, Ping Zhang

    Abstract: Integrated sensing and communication (ISAC) is regarded as the enabling technology in future 5th-Generation-Advanced (5G-A) and 6th-Generation (6G) mobile communication systems. Waveform design is critical in ISAC systems. However, the difference in performance metrics between sensing and communication poses challenges for ISAC waveform design. This paper applies the unified perfor…

    Submitted 9 October, 2023; originally announced October 2023.

  50. arXiv:2310.04657  [pdf, other]

    eess.AS cs.SD

    Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

    Authors: Kaixun Huang, Ao Zhang, Binbin Zhang, Tianyi Xu, Xingchen Song, Lei Xie

    Abstract: The attention-based deep contextual biasing method has been demonstrated to effectively improve the recognition performance of end-to-end automatic speech recognition (ASR) systems on given contextual phrases. However, unlike shallow fusion methods that directly bias the posterior of the ASR model, deep biasing methods implicitly integrate contextual information, making it challenging to control t…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted by ASRU2023