Skip to main content

Showing 1–50 of 174 results for author: Yuan, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.22796  [pdf, ps, other

    eess.SP

    Channel Knowledge Map-assisted Dual-domain Tracking and Predictive Beamforming for High-Mobility Wireless Networks

    Authors: Ruolin Du, Zhiqiang Wei, Zai Yang, Lei Yang, Yong Zeng, Derrick Wing Kwan Ng, Jinhong Yuan

    Abstract: This paper introduces a novel channel knowledge map (CKM)-assisted dual-domain tracking and predictive beamforming scheme for high-mobility wireless networks. The central premise is that the CKM integrates both the coordinate and beam domains, thereby enabling tracking in one domain via treating the other domain's input as priors or measurements. In the coordinate domain (C-Domain), an extended Ka… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  2. arXiv:2506.22710  [pdf, ps, other

    cs.CV eess.IV

    LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning

    Authors: Jiang Yuan, JI Ma, Bo Wang, Guanzhou Ke, Weiming Hu

    Abstract: Implicit degradation estimation-based blind super-resolution (IDE-BSR) hinges on extracting the implicit degradation representation (IDR) of the LR image and adapting it to LR image features to guide HR detail restoration. Although IDE-BSR has shown potential in dealing with noise interference and complex degradations, existing methods ignore the importance of IDR discriminability for BSR and inst… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Journal ref: International Conference on Computer Vision (ICCV) 2025

  3. arXiv:2505.19626  [pdf, ps, other

    cs.SD eess.AS

    Decoding Speaker-Normalized Pitch from EEG for Mandarin Perception

    Authors: Jiaxin Chen, Yiming Wang, Ziyu Zhang, Jiayang Han, Yin-Long Liu, Rui Feng, Xiuyuan Liang, Zhen-Hua Ling, Jiahong Yuan

    Abstract: The same speech content produced by different speakers exhibits significant differences in pitch contour, yet listeners' semantic perception remains unaffected. This phenomenon may stem from the brain's perception of pitch contours being independent of individual speakers' pitch ranges. In this work, we recorded electroencephalogram (EEG) while participants listened to Mandarin monosyllables with… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  4. arXiv:2505.19448  [pdf, other

    eess.AS

    Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection

    Authors: Yin-Long Liu, Rui Feng, Jia-Xin Chen, Yi-Ming Wang, Jia-Hong Yuan, Zhen-Hua Ling

    Abstract: Recent breakthroughs in Automatic Speech Recognition (ASR) have enabled fully automated Alzheimer's Disease (AD) detection using ASR transcripts. Nonetheless, the impact of ASR errors on AD detection remains poorly understood. This paper fills the gap. We conduct a comprehensive study on AD detection using transcripts from various ASR models and their synthesized speech on the ADReSS dataset. Expe… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: Accepted by Interspeech 2025

  5. arXiv:2505.19446  [pdf, other

    eess.AS

    Leveraging Cascaded Binary Classification and Multimodal Fusion for Dementia Detection through Spontaneous Speech

    Authors: Yin-Long Liu, Yuanchao Li, Rui Feng, Liu He, Jia-Xin Chen, Yi-Ming Wang, Yu-Ang Chen, Yan-Han Peng, Jia-Hong Yuan, Zhen-Hua Ling

    Abstract: This paper presents our submission to the PROCESS Challenge 2025, focusing on spontaneous speech analysis for early dementia detection. For the three-class classification task (Healthy Control, Mild Cognitive Impairment, and Dementia), we propose a cascaded binary classification framework that fine-tunes pre-trained language models and incorporates pause encoding to better capture disfluencies. Th… ▽ More

    Submitted 26 May, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

    Comments: Accepted by Interspeech 2025

  6. arXiv:2504.09655  [pdf

    eess.IV cs.CV

    OmniMamba4D: Spatio-temporal Mamba for longitudinal CT lesion segmentation

    Authors: Justin Namuk Kim, Yiqiao Liu, Rajath Soans, Keith Persson, Sarah Halek, Michal Tomaszewski, Jianda Yuan, Gregory Goldmacher, Antong Chen

    Abstract: Accurate segmentation of longitudinal CT scans is important for monitoring tumor progression and evaluating treatment responses. However, existing 3D segmentation models solely focus on spatial information. To address this gap, we propose OmniMamba4D, a novel segmentation model designed for 4D medical images (3D images over time). OmniMamba4D utilizes a spatio-temporal tetra-orientated Mamba block… ▽ More

    Submitted 24 April, 2025; v1 submitted 13 April, 2025; originally announced April 2025.

    Comments: Accepted at IEEE International Symposium on Biomedical Imaging (ISBI) 2025

  7. arXiv:2503.19703  [pdf, other

    cs.CV eess.IV

    High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting

    Authors: Qian Wang, Zhihao Zhan, Jialei He, Zhituo Tu, Xiang Zhu, Jie Yuan

    Abstract: Highly accurate geometric precision and dense image features characterize True Digital Orthophoto Maps (TDOMs), which are in great demand for applications such as urban planning, infrastructure management, and environmental monitoring.Traditional TDOM generation methods need sophisticated processes, such as Digital Surface Models (DSM) and occlusion detection, which are computationally expensive a… ▽ More

    Submitted 13 May, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

  8. A Unified Approach to Enforce Non-Negativity Constraint in Neural Network Approximation for Optimal Voltage Regulation (preprint)

    Authors: Jiaqi Wu, Jingyi Yuan, Yang Weng, Guangwen Wang

    Abstract: Power system voltage regulation is crucial to maintain power quality while integrating intermittent renewable resources in distribution grids. However, the system model on the grid edge is often unknown, making it difficult to model physical equations for optimal control. Therefore, previous work proposes structured data-driven methods like input convex neural networks (ICNN) for "optimal" control… ▽ More

    Submitted 6 May, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

    Comments: Submitted to the 58th Hawaii International Conference on System Sciences (HICSS-58)

    Journal ref: HICSS'58 (2025) 3018-3027

  9. arXiv:2503.10060  [pdf, other

    eess.SP

    Sum-Rate Maximization for Pinching Antenna-assisted NOMA Systems with Multiple Dielectric Waveguides

    Authors: Shaokang Hu, Ruotong Zhao, Yihuan Liao, Derrick Wing Kwan Ng, Jinhong Yuan

    Abstract: This paper investigates the resource allocation design for a pinching antenna (PA)-assisted multiuser multiple-input single-output (MISO) non-orthogonal multiple access (NOMA) system featuring multiple dielectric waveguides. To enhance model accuracy, we propose a novel frequency-dependent power attenuation model for the dielectric waveguides in PA-assisted systems. By jointly optimizing the preco… ▽ More

    Submitted 6 April, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: 7 pages, 3 figures, conference

  10. arXiv:2503.01202  [pdf, other

    cs.CV cs.RO eess.IV

    A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping

    Authors: Jialei He, Zhihao Zhan, Zhituo Tu, Xiang Zhu, Jie Yuan

    Abstract: Rapid generation of large-scale orthoimages from Unmanned Aerial Vehicles (UAVs) has been a long-standing focus of research in the field of aerial mapping. A multi-sensor UAV system, integrating the Global Positioning System (GPS), Inertial Measurement Unit (IMU), 4D millimeter-wave radar and camera, can provide an effective solution to this problem. In this paper, we utilize multi-sensor data to… ▽ More

    Submitted 4 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  11. arXiv:2502.03497  [pdf

    eess.IV

    SLCGC: A lightweight Self-supervised Low-pass Contrastive Graph Clustering Network for Hyperspectral Images

    Authors: Yao Ding, Zhili Zhang, Aitao Yang, Yaoming Cai, Xiongwu Xiao, Danfeng Hong, Junsong Yuan

    Abstract: Self-supervised hyperspectral image (HSI) clustering remains a fundamental yet challenging task due to the absence of labeled data and the inherent complexity of spatial-spectral interactions. While recent advancements have explored innovative approaches, existing methods face critical limitations in clustering accuracy, feature discriminability, computational efficiency, and robustness to noise,… ▽ More

    Submitted 6 February, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: 12 pages, 9 figures

  12. arXiv:2502.01078  [pdf, ps, other

    cs.IT eess.SP

    Parallel Coding for Orthogonal Delay-Doppler Division Multiplexing

    Authors: Qi Li, Jinhong Yuan, Min Qiu

    Abstract: This paper proposes a novel parallel coding transmission strategy and an iterative detection and decoding receiver signal processing technique for orthogonal delay-Doppler division multiplexing (ODDM) modulation. Specifically, the proposed approach employs a parallel channel encoding (PCE) scheme that consists of multiple short-length codewords for each delay-Doppler multicarrier (DDMC) symbol. Bu… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: 12 pages, 12 figures, accepted by IEEE Transactions on Communications

  13. arXiv:2501.08026  [pdf, other

    eess.SP cs.IT

    Orthogonal Delay-Doppler Division Multiplexing Modulation with Hierarchical Mode-Based Index Modulation

    Authors: Kehan Huang, Min Qiu, Jinhong Yuan

    Abstract: The orthogonal time frequency space with index modulation (OTFS-IM) offers flexible tradeoffs between spectral efficiency (SE) and bit error rate (BER) in doubly selective fading channels. While OTFS-IM schemes demonstrated such potential, a persistent challenge lies in the detection complexity. To address this problem, we propose the hierarchical mode-based index modulation (HMIM). HMIM introduce… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

  14. arXiv:2501.03689  [pdf, other

    cs.SD cs.AI eess.AS

    MAJL: A Model-Agnostic Joint Learning Framework for Music Source Separation and Pitch Estimation

    Authors: Haojie Wei, Jun Yuan, Rui Zhang, Quanyu Dai, Yueguo Chen

    Abstract: Music source separation and pitch estimation are two vital tasks in music information retrieval. Typically, the input of pitch estimation is obtained from the output of music source separation. Therefore, existing methods have tried to perform these two tasks simultaneously, so as to leverage the mutually beneficial relationship between both tasks. However, these methods still face two critical ch… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  15. arXiv:2412.13216  [pdf, other

    eess.SP

    On the Time-Frequency Localization Characteristics of the Delay-Doppler Plane Orthogonal Pulse

    Authors: Akram Shafie, Jinhong Yuan, Nan Yang, Hai Lin

    Abstract: In this work, we study the time-frequency (TF) localization characteristics of the prototype pulse of orthogonal delay-Doppler (DD) division multiplexing modulation, namely, the DD plane orthogonal pulse (DDOP). The TF localization characteristics examine how concentrated or spread out the energy of a pulse is in the joint TF domain, the time domain (TD), and the frequency domain (FD). We first de… ▽ More

    Submitted 14 December, 2024; originally announced December 2024.

    Comments: This paper has been accepted for publication in an IEEE Journal

  16. arXiv:2412.11325  [pdf, other

    cs.CV cs.SD eess.AS

    Sonicmesh: Enhancing 3D Human Mesh Reconstruction in Vision-Impaired Environments With Acoustic Signals

    Authors: Xiaoxuan Liang, Wuyang Zhang, Hong Zhou, Zhaolong Wei, Sicheng Zhu, Yansong Li, Rui Yin, Jiantao Yuan, Jeremy Gummeson

    Abstract: 3D Human Mesh Reconstruction (HMR) from 2D RGB images faces challenges in environments with poor lighting, privacy concerns, or occlusions. These weaknesses of RGB imaging can be complemented by acoustic signals, which are widely available, easy to deploy, and capable of penetrating obstacles. However, no existing methods effectively combine acoustic signals with RGB data for robust 3D HMR. The pr… ▽ More

    Submitted 15 December, 2024; originally announced December 2024.

  17. arXiv:2412.07074  [pdf, other

    eess.SP

    Channel Spreading Function-Inspired Channel Transfer Function Estimation for OFDM Systems with High-Mobility

    Authors: Yiyan Ma, Bo Ai, Guoyu Ma, Akram Shafie, Qingqing Cheng, Mi Yang, Jingli Li, Xuebo Pang, Jinhong Yuan, Zhangdui Zhong

    Abstract: In this letter, we propose a novel channel transfer function (CTF) estimation approach for orthogonal frequency division multiplexing (OFDM) systems in high-mobility scenarios, that leverages the stationary properties of the delay-Doppler domain channel spreading function (CSF). First, we develop a CSF estimation model for OFDM systems that relies solely on discrete pilot symbols in the time-frequ… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  18. arXiv:2412.06259  [pdf, other

    eess.AS cs.SD

    Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

    Authors: Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

    Abstract: Compared to other clinical screening techniques, speech-and-language-based automated Alzheimer's disease (AD) detection methods are characterized by their non-invasiveness, cost-effectiveness, and convenience. Previous studies have demonstrated the efficacy of fine-tuning pre-trained language models (PLMs) for AD detection. However, the objective of this traditional fine-tuning method, which invol… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: Accepted by ISCSLP 2024

  19. arXiv:2412.00058  [pdf

    eess.IV cs.CV

    Real-time volumetric free-hand ultrasound imaging for large-sized organs: A study of imaging the whole spine

    Authors: Caozhe Li, Enxiang Shen, Haoyang Wang, Yuxin Wang, Jie Yuan, Li Gong, Di Zhao, Weijing Zhang, Zhibin Jin

    Abstract: Three-dimensional (3D) ultrasound imaging can overcome the limitations of conventional two dimensional (2D) ultrasound imaging in structural observation and measurement. However, conducting volumetric ultrasound imaging for large-sized organs still faces difficulties including long acquisition time, inevitable patient movement, and 3D feature recognition. In this study, we proposed a real-time vol… ▽ More

    Submitted 25 November, 2024; originally announced December 2024.

  20. arXiv:2411.15529  [pdf, other

    cs.IT eess.SP

    Uplink Multiple Access with Heterogeneous Blocklength and Reliability Constraints: Discrete Signaling with Treating Interference as Noise

    Authors: Min Qiu, Yu-Chih Huang, Jinhong Yuan

    Abstract: We consider the uplink multiple access of heterogeneous users, e.g., ultra-reliable low-latency communications (URLLC) and enhanced mobile broadband (eMBB) users. Each user has its own reliability requirement and blocklength constraint, and users transmitting longer blocks suffer from heterogeneous interference. On top of that, the decoding of URLLC messages cannot leverage successive interference… ▽ More

    Submitted 23 November, 2024; originally announced November 2024.

    Comments: 14 pages, 7 figures, accepted by IEEE Transactions on Communications. arXiv admin note: text overlap with arXiv:2308.08883

  21. arXiv:2411.12985  [pdf, other

    eess.SP

    Disco Intelligent Omni-Surfaces: 360-degree Fully-Passive Jamming Attacks

    Authors: Huan Huang, Hongliang Zhang, Jide Yuan, Luyao Sun, Yitian Wang, Weidong Mei, Boya Di, Yi Cai, Zhu Han

    Abstract: Intelligent omni-surfaces (IOSs) with 360-degree electromagnetic radiation significantly improves the performance of wireless systems, while an adversarial IOS also poses a significant potential risk for physical layer security. In this paper, we propose a "DISCO" IOS (DIOS) based fully-passive jammer (FPJ) that can launch omnidirectional fully-passive jamming attacks. In the proposed DIOS-based F… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: This paper has been submitted to IEEE TWC for possible publication

  22. Electromagnetic Modeling and Capacity Analysis of Rydberg Atom-Based MIMO System

    Authors: Shuai S. A. Yuan, Xinyi Y. I. Xu, Jinpeng Yuan, Guoda Xie, Chongwen Huang, Xiaoming Chen, Zhixiang Huang, Wei E. I. Sha

    Abstract: Rydberg atom-based antennas exploit the quantum properties of highly excited Rydberg atoms, providing unique advantages over classical antennas, such as high sensitivity, broad frequency range, and compact size. Despite the increasing interests in their applications in antenna and communication engineering, two key properties, involving the lack of polarization multiplexing and isotropic reception… ▽ More

    Submitted 13 November, 2024; originally announced November 2024.

    Comments: in IEEE Antennas and Wireless Propagation Letters, 2025

  23. arXiv:2410.17556  [pdf, other

    eess.SP

    Performance of orthogonal delay-doppler division multiplexing modulation with imperfect channel estimation

    Authors: Kehan Huang, Min Qiu, Jun Tong, Jinhong Yuan, Hai Lin

    Abstract: The orthogonal delay-Doppler division multiplexing (ODDM) modulation is a recently proposed multi-carrier modulation that features a realizable pulse orthogonal with respect to the delay-Doppler (DD) plane's fine resolutions. In this paper, we investigate the performance of ODDM systems with imperfect channel estimation considering three detectors, namely the message passing algorithm (MPA) detect… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

  24. arXiv:2410.15358  [pdf, ps, other

    eess.SP cs.IT math.OC

    A New Adaptive Balanced Augmented Lagrangian Method with Application to ISAC Beamforming Design

    Authors: Jiageng Wu, Bo Jiang, Xinxin Li, Ya-Feng Liu, Jianhua Yuan

    Abstract: In this paper, we consider a class of convex programming problems with linear equality constraints, which finds broad applications in machine learning and signal processing. We propose a new adaptive balanced augmented Lagrangian (ABAL) method for solving these problems. The proposed ABAL method adaptively selects the stepsize parameter and enjoys a low per-iteration complexity, involving only the… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: 7 pages, 1 table

  25. arXiv:2410.03682  [pdf, other

    eess.SP

    Delay Alignment Modulation with Hybrid Analog/Digital Beamforming for Millimeter Wave and Terahertz Communications

    Authors: Jieni Zhang, Yong Zeng, Xiangbin Yu, Shi Jin, Jinhong Yuan, Ying-Chang Liang, Rui Zhang

    Abstract: For millimeter wave (mmWave) or Terahertz (THz) communications, by leveraging the high spatial resolution offered by large antenna arrays and the multi-path sparsity of mmWave/THz channels, a novel inter-symbol interference (ISI) mitigation technique called delay alignment modulation (DAM) has been recently proposed. The key ideas of DAM are delay pre-compensation and path-based beamforming. Howev… ▽ More

    Submitted 20 September, 2024; originally announced October 2024.

  26. arXiv:2409.16920  [pdf, other

    eess.AS cs.AI cs.CL cs.HC cs.SD

    Cross-Lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models

    Authors: Zhichen Han, Tianqi Geng, Hui Feng, Jiahong Yuan, Korin Richmond, Yuanchao Li

    Abstract: Utilizing Self-Supervised Learning (SSL) models for Speech Emotion Recognition (SER) has proven effective, yet limited research has explored cross-lingual scenarios. This study presents a comparative analysis between human performance and SSL models, beginning with a layer-wise analysis and an exploration of parameter-efficient fine-tuning strategies in monolingual, cross-lingual, and transfer lea… ▽ More

    Submitted 30 April, 2025; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: Accepted to ICASSP 2025

  27. arXiv:2409.01694  [pdf, other

    eess.SP math.NA

    A novel and efficient parameter estimation of the Lognormal-Rician turbulence model based on k-Nearest Neighbor and data generation method

    Authors: Maoke Miao, Xinyu Zhang, Bo Liu, Rui Yin, Jiantao Yuan, Feng Gao, Xiao-Yu Chen

    Abstract: In this paper, we propose a novel and efficient parameter estimator based on $k$-Nearest Neighbor ($k$NN) and data generation method for the Lognormal-Rician turbulence channel. The Kolmogorov-Smirnov (KS) goodness-of-fit statistical tools are employed to investigate the validity of $k$NN approximation under different channel conditions and it is shown that the choice of $k$ plays a significant ro… ▽ More

    Submitted 13 February, 2025; v1 submitted 3 September, 2024; originally announced September 2024.

  28. Content-decoupled Contrastive Learning-based Implicit Degradation Modeling for Blind Image Super-Resolution

    Authors: Jiang Yuan, Ji Ma, Bo Wang, Weiming Hu

    Abstract: Implicit degradation modeling-based blind super-resolution (SR) has attracted more increasing attention in the community due to its excellent generalization to complex degradation scenarios and wide application range. How to extract more discriminative degradation representations and fully adapt them to specific image features is the key to this task. In this paper, we propose a new Content-decoup… ▽ More

    Submitted 1 April, 2025; v1 submitted 10 August, 2024; originally announced August 2024.

    Report number: TIP-33069-2024

    Journal ref: IEEE Transactions on Image Processing (2025)

  29. arXiv:2408.02074  [pdf

    eess.IV cs.AI cs.CV

    Applying Conditional Generative Adversarial Networks for Imaging Diagnosis

    Authors: Haowei Yang, Yuxiang Hu, Shuyao He, Ting Xu, Jiajie Yuan, Xingxin Gu

    Abstract: This study introduces an innovative application of Conditional Generative Adversarial Networks (C-GAN) integrated with Stacked Hourglass Networks (SHGN) aimed at enhancing image segmentation, particularly in the challenging environment of medical imaging. We address the problem of overfitting, common in deep learning models applied to complex imaging datasets, by augmenting data through rotation a… ▽ More

    Submitted 17 July, 2024; originally announced August 2024.

  30. arXiv:2407.21514  [pdf

    eess.SP

    Wireless Communications in Doubly Selective Channels with Domain Adaptivity

    Authors: J. Andrew Zhang, Hongyang Zhang, Kai Wu, Xiaojing Huang, Jinhong Yuan, Y. Jay Guo

    Abstract: Wireless communications are significantly impacted by the propagation environment, particularly in doubly selective channels with variations in both time and frequency domains. Orthogonal Time Frequency Space (OTFS) modulation has emerged as a promising solution; however, its high equalization complexity, if performed in the delay-Doppler domain, limits its universal application. This article expl… ▽ More

    Submitted 30 October, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

    Comments: Magazine article, 7 pages, 4 figures, 2 tables

  31. arXiv:2407.06580  [pdf, other

    eess.SP

    Off-grid Channel Estimation for Orthogonal Delay-Doppler Division Multiplexing Using Grid Refinement and Adjustment

    Authors: Yaru Shan, Akram Shafie, Jinhong Yuan, Fanggang Wang

    Abstract: Orthogonal delay-Doppler (DD) division multiplexing (ODDM) has been recently proposed as a promising multicarrier modulation scheme to tackle Doppler spread in high-mobility environments. Accurate channel estimation is of paramount importance to guarantee reliable communication for the ODDM, especially when the delays and Dopplers of the propagation paths are off-grid. In this paper, we propose a… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  32. arXiv:2407.05391  [pdf, other

    eess.SP

    Interference Management in MIMO-ISAC Systems: A Transceiver Design Approach

    Authors: Yangyang Niu, Zhiqing Wei, Dingyou Ma, Xiaoyu Yang, Huici Wu, Zhiyong Feng, Jianhua Yuan

    Abstract: The integrated sensing and communication (ISAC) system under multi-input multi-output (MIMO) architecture achieves dual functionalities of sensing and communication on the same platform by utilizing spatial gain, which provides a feasible paradigm facing spectrum congestion. However, the dual functionalities of sensing and communication operating simultaneously in the same platform bring severe in… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  33. arXiv:2406.18592  [pdf, ps, other

    eess.SP

    On the Coexistence of OTFS Modulation with OFDM-based Communication Systems

    Authors: Akram Shafie, Jinhong Yuan, Paul Fitzpatrick, Taka Sakurai, Yuting Fang

    Abstract: We investigate the coexistence of orthogonal time-frequency space (OTFS) modulation with current fourth- and fifth-generation (4G/5G) communication systems that primarily use orthogonal frequency-division multiplexing (OFDM) waveforms. We first derive the input-output-relation of OTFS in the considered coexisting system. In this derivation, we consider (i) the inclusion of multiple cyclic prefixes… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: This paper has been submitted for publication in an IEEE Journal. arXiv admin note: text overlap with arXiv:2311.06850

  34. arXiv:2406.18548  [pdf

    eess.IV cs.CV

    Exploration of Multi-Scale Image Fusion Systems in Intelligent Medical Image Analysis

    Authors: Yuxiang Hu, Haowei Yang, Ting Xu, Shuyao He, Jiajie Yuan, Haozhang Deng

    Abstract: The diagnosis of brain cancer relies heavily on medical imaging techniques, with MRI being the most commonly used. It is necessary to perform automatic segmentation of brain tumors on MRI images. This project intends to build an MRI algorithm based on U-Net. The residual network and the module used to enhance the context information are combined, and the void space convolution pooling pyramid is a… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

  35. arXiv:2406.07410  [pdf, other

    eess.AS

    Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech

    Authors: Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

    Abstract: We uncover an underlying bias present in the audio recordings produced from the picture description task of the Pitt corpus, the largest publicly accessible database for Alzheimer's Disease (AD) detection research. Even by solely utilizing the silent segments of these audio recordings, we achieve nearly 100% accuracy in AD detection. However, employing the same methods to other datasets and prepro… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  36. arXiv:2406.04776  [pdf, ps, other

    eess.SP cs.AI

    OFDM-Standard Compatible SC-NOFS Waveforms for Low-Latency and Jitter-Tolerance Industrial IoT Communications

    Authors: Tongyang Xu, Shuangyang Li, Jinhong Yuan

    Abstract: Traditional communications focus on regular and orthogonal signal waveforms for simplified signal processing and improved spectral efficiency. In contrast, the next-generation communications would aim for irregular and non-orthogonal signal waveforms to introduce new capabilities. This work proposes a spectrally efficient irregular Sinc (irSinc) shaping technique, revisiting the traditional Sinc b… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  37. arXiv:2406.02126  [pdf, other

    eess.SY cs.AI cs.LG cs.MA

    CityLight: A Universal Model for Coordinated Traffic Signal Control in City-scale Heterogeneous Intersections

    Authors: Jinwei Zeng, Chao Yu, Xinyi Yang, Wenxuan Ao, Qianyue Hao, Jian Yuan, Yong Li, Yu Wang, Huazhong Yang

    Abstract: The increasingly severe congestion problem in modern cities strengthens the significance of developing city-scale traffic signal control (TSC) methods for traffic efficiency enhancement. While reinforcement learning has been widely explored in TSC, most of them still target small-scale optimization and cannot directly scale to the city level due to unbearable resource demand. Only a few of them ma… ▽ More

    Submitted 28 August, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  38. arXiv:2405.08295  [pdf, other

    cs.CL cs.SD eess.AS

    SpeechVerse: A Large-scale Generalizable Audio Language Model

    Authors: Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, Zhaocheng Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sravan Bodapati, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

    Abstract: Large language models (LLMs) have shown incredible proficiency in performing tasks that require semantic understanding of natural language instructions. Recently, many works have further expanded this capability to perceive multimodal audio and text inputs, but their capabilities are often limited to specific fine-tuned tasks such as automatic speech recognition and translation. We therefore devel… ▽ More

    Submitted 24 March, 2025; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Single Column, 13 page

  39. Orthogonal Delay-Doppler Division Multiplexing Modulation with Tomlinson-Harashima Precoding

    Authors: Yiyan Ma, Akram Shafie, Jinhong Yuan, Guoyu Ma, Zhangdui Zhong, Bo Ai

    Abstract: The orthogonal delay-Doppler (DD) division multiplexing(ODDM) modulation has been recently proposed as a promising modulation scheme for next-generation communication systems with high mobility. Despite its benefits, ODDM modulation and other DD domain modulation schemes face the challenge of excessive equalization complexity. To address this challenge, we propose time domain Tomlinson-Harashima p… ▽ More

    Submitted 13 December, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  40. arXiv:2405.07547  [pdf, other

    cs.IT eess.SP

    Channel Coding Toward 6G: Technical Overview and Outlook

    Authors: Mohammad Rowshan, Min Qiu, Yixuan Xie, Xinyi Gu, Jinhong Yuan

    Abstract: Channel coding plays a pivotal role in ensuring reliable communication over wireless channels. With the growing need for ultra-reliable communication in emerging wireless use cases, the significance of channel coding has amplified. Furthermore, minimizing decoding latency is crucial for critical-mission applications, while optimizing energy efficiency is paramount for mobile and the Internet of Th… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 102 pages, 87 figures, IEEE Open Journal of the Communications Society (invited paper)

  41. arXiv:2404.16253  [pdf, other

    eess.SP

    Mitigating Automotive Radar Interference using Onboard Intelligent Reflective Surface

    Authors: Shree Prasad Maruthi, Karrthik G. K., Vijaya Krishna A., Mahbub Hassan, Jinhong Yuan

    Abstract: The use of automotive radars is gaining popularity as a means to enhance a vehicle's sensing capabilities. However, these radars can suffer from interference caused by transmissions from other radars mounted on nearby vehicles. To address this issue, we investigate the use of an onboard intelligent reflective surface (IRS) to artificially increase a vehicle's effective radar cross section (RCS), o… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 7 pages, 9 Figures

  42. arXiv:2403.14192  [pdf, ps, other

    cs.IT eess.SP

    Fundamentals of Delay-Doppler Communications: Practical Implementation and Extensions to OTFS

    Authors: Shuangyang Li, Peter Jung, Weijie Yuan, Zhiqiang Wei, Jinhong Yuan, Baoming Bai, Giuseppe Caire

    Abstract: The recently proposed orthogonal time frequency space (OTFS) modulation, which is a typical Delay-Doppler (DD) communication scheme, has attracted significant attention thanks to its appealing performance over doubly-selective channels. In this paper, we present the fundamentals of general DD communications from the viewpoint of the Zak transform. We start our study by constructing DD domain basis… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  43. arXiv:2403.10323  [pdf, ps, other

    eess.SP

    Joint Optimization for Achieving Covertness in MIMO Over-the-Air Computation Networks

    Authors: Junteng Yao, Tuo Wu, Ming Jin, Cunhua Pan, Quanzhong Li, Jinhong Yuan

    Abstract: This paper investigates covert data transmission within a multiple-input multiple-output (MIMO) over-the-air computation (AirComp) network, where sensors transmit data to the access point (AP) while guaranteeing covertness to the warden (Willie). Simultaneously, the AP introduces artificial noise (AN) to confuse Willie, meeting the covert requirement. We address the challenge of minimizing mean-sq… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  44. arXiv:2403.02012  [pdf, other

    cs.IT eess.SP

    OTFS vs OFDM: Which is Superior in Multiuser LEO Satellite Communications

    Authors: Yu Liu, Ming Chen, Cunhua Pan, Tantao Gong, Jinhong Yuan, Jiangzhou Wang

    Abstract: Orthogonal time frequency space (OTFS) modulation, a delay-Doppler (DD) domain communication scheme exhibiting strong robustness against the Doppler shifts, has the potentials to be employed in LEO satellite communications. However, the performance comparison with the orthogonal frequency division multiplexing (OFDM) modulation and the resource allocation scheme for multiuser OTFS-based LEO satell… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 13 pages, 9 figures

  45. arXiv:2402.12127  [pdf, other

    cs.IT eess.SP

    Rate-Splitting Multiple Access for Transmissive Reconfigurable Intelligent Surface Transceiver Empowered ISAC System

    Authors: Ziwei Liu, Wen Chen, Qingqing Wu, Jinhong Yuan, Shanshan Zhang, Zhendong Li, Jun Li

    Abstract: In this paper, a novel transmissive reconfigurable intelligent surface (TRIS) transceiver empowered integrated sensing and communications (ISAC) system is proposed for future multi-demand terminals. To address interference management, we implement rate-splitting multiple access (RSMA), where the common stream is independently designed for the sensing service. We introduce the sensing quality of se… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  46. arXiv:2401.15164  [pdf, other

    cs.SD cs.CV cs.LG cs.MM eess.AS

    AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations

    Authors: Naresh Kumar Devulapally, Sidharth Anand, Sreyasee Das Bhattacharjee, Junsong Yuan, Yu-Ping Chang

    Abstract: Analyzing individual emotions during group conversation is crucial in developing intelligent agents capable of natural human-machine interaction. While reliable emotion recognition techniques depend on different modalities (text, audio, video), the inherent heterogeneity between these modalities and the dynamic cross-modal interactions influenced by an individual's unique behavioral patterns make… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  47. arXiv:2401.11058  [pdf, ps, other

    cs.IT eess.SP

    Low Complexity Turbo SIC-MMSE Detection for Orthogonal Time Frequency Space Modulation

    Authors: Qi Li, Jinhong Yuan, Min Qiu, Shuangyang Li, Yixuan Xie

    Abstract: Recently, orthogonal time frequency space (OTFS) modulation has garnered considerable attention due to its robustness against doubly-selective wireless channels. In this paper, we propose a low-complexity iterative successive interference cancellation based minimum mean squared error (SIC-MMSE) detection algorithm for zero-padded OTFS (ZP-OTFS) modulation. In the proposed algorithm, signals are de… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 15 pages, 12 figures, accepted by IEEE Transactions on Communications

  48. arXiv:2401.01433  [pdf, other

    cs.IT eess.SP

    Multiple Access Techniques for Intelligent and Multi-Functional 6G: Tutorial, Survey, and Outlook

    Authors: Bruno Clerckx, Yijie Mao, Zhaohui Yang, Mingzhe Chen, Ahmed Alkhateeb, Liang Liu, Min Qiu, Jinhong Yuan, Vincent W. S. Wong, Juan Montojo

    Abstract: Multiple access (MA) is a crucial part of any wireless system and refers to techniques that make use of the resource dimensions to serve multiple users/devices/machines/services, ideally in the most efficient way. Given the needs of multi-functional wireless networks for integrated communications, sensing, localization, computing, coupled with the surge of machine learning / artificial intelligenc… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: submitted for publication in Proceedings of the IEEE

  49. arXiv:2311.15556  [pdf, other

    cs.CV eess.IV

    PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images

    Authors: Jiquan Yuan, Xinyan Cao, Changjin Li, Fanyi Yang, Jinlong Lin, Xixin Cao

    Abstract: As image generation technology advances, AI-based image generation has been applied in various fields and Artificial Intelligence Generated Content (AIGC) has garnered widespread attention. However, the development of AI-based image generative models also brings new problems and challenges. A significant challenge is that AI-generated images (AIGI) may exhibit unique distortions compared to natura… ▽ More

    Submitted 29 November, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 18 pages

  50. arXiv:2311.13787  [pdf, other

    eess.SP

    A Fast Power Spectrum Sensing Solution for Generalized Coprime Sampling

    Authors: Kaili Jiang, Dechang Wang, Kailun Tian, Hancong Feng, Yuxin Zhao, Junyu Yuan, Bin Tang

    Abstract: The growing scarcity of spectrum resources, wideband spectrum sensing is required to process a prohibitive volume of data at a high sampling rate. For some applications, spectrum estimation only requires second-order statistics. In this case, a fast power spectrum sensing solution is proposed based on the generalized coprime sampling. By exploring the sensing vector inherent structure, the autocor… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.