Showing 1–50 of 87 results for author: Yuan, Z

Searching in archive eess.
  1. arXiv:2506.00480  [pdf, ps, other]

    eess.SP

    The Coupling Effect of Sensing Targets on the Environment for 3GPP ISAC Channels: Observation, Modeling, and Validation

    Authors: Yameng Liu, Jianhua Zhang, Yuxiang Zhang, Hongbo Xing, Yifeng Xiong, Zhiqiang Yuan, Guangyi Liu

    Abstract: Integrated Sensing And Communication (ISAC) has been identified as a key 6G application by ITU and 3GPP, with standardization efforts already underway. Sensing tasks, such as target localization, demand more precise characterization of the sensing target (ST) in ISAC channel modeling. The ST couples complexly with environmental scatterers, potentially blocking some multipaths and generating new on…

    Submitted 31 May, 2025; originally announced June 2025.

  2. arXiv:2505.20673  [pdf, other]

    eess.SP

    A Unified RCS Modeling of Typical Targets for 3GPP ISAC Channel Standardization and Experimental Analysis

    Authors: Yuxiang Zhang, Jianhua Zhang, Xidong Hu, Jiwei Zhang, Hongbo Xing, Huiwen Gong, Shilin Luo, Yifeng Xiong, Li Yu, Zhiqing Yuan, Guangyi Liu, Tao Jiang

    Abstract: Accurate radar cross section (RCS) modeling is crucial for characterizing target scattering and improving the precision of Integrated Sensing and Communication (ISAC) channel modeling. Existing RCS models are typically designed for specific target types, leading to increased complexity and lack of generalization. This makes it difficult to standardize RCS models for 3GPP ISAC channels, which need…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 13 pages, 12 figures, 39 conferences, submitted to IEEE Journal on Selected Areas in Communications

  3. arXiv:2505.07191  [pdf, other]

    eess.SP

    A Unified Deterministic Channel Model for Multi-Type RIS with Reflective, Transmissive, and Polarization Operations

    Authors: Yuxiang Zhang, Jianhua Zhang, Zhengfu Zhou, Huiwen Gong, Hongbo Xing, Zhiqiang Yuan, Lei Tian, Li Yu, Guangyi Liu, Tao Jiang

    Abstract: Reconfigurable Intelligent Surface (RIS) technologies have been considered as a promising enabler for 6G, enabling advantageous control of electromagnetic (EM) propagation. RIS can be categorized into multiple types based on their reflective/transmissive modes and polarization control capabilities, all of which are expected to be widely deployed in practical environments. A reliable RIS channel mo…

    Submitted 11 May, 2025; originally announced May 2025.

    Comments: Submitted to IEEE Transactions on Vehicular Technology

  4. arXiv:2503.12506  [pdf, other]

    cs.SD cs.AI eess.AS

    A General Close-loop Predictive Coding Framework for Auditory Working Memory

    Authors: Zhongju Yuan, Geraint Wiggins, Dick Botteldooren

    Abstract: Auditory working memory is essential for various daily activities, such as language acquisition, conversation. It involves the temporary storage and manipulation of information that is no longer present in the environment. While extensively studied in neuroscience and cognitive science, research on its modeling within neural networks remains limited. To address this gap, we propose a general frame…

    Submitted 16 March, 2025; originally announced March 2025.

  5. arXiv:2502.17213  [pdf, other]

    q-bio.NC cs.AI cs.LG eess.SP

    Deep Learning-Powered Electrical Brain Signals Analysis: Advancing Neurological Diagnostics

    Authors: Jiahe Li, Xin Chen, Fanqi Shen, Junru Chen, Yuxin Liu, Daoze Zhang, Zhizhang Yuan, Fang Zhao, Meng Li, Yang Yang

    Abstract: Neurological disorders represent significant global health challenges, driving the advancement of brain signal analysis methods. Scalp electroencephalography (EEG) and intracranial electroencephalography (iEEG) are widely used to diagnose and monitor neurological conditions. However, dataset heterogeneity and task variations pose challenges in developing robust deep learning solutions. This review…

    Submitted 24 February, 2025; originally announced February 2025.

  6. arXiv:2501.11093  [pdf, other]

    eess.SP

    Channel Sounding Using Multiplicative Arrays Based on Successive Interference Cancellation Principle

    Authors: Zhangzhang Jiang, Zhiqiang Yuan, Chunhui Li, Le Yu, Wei Fan

    Abstract: Ultra-massive multiple-input and multiple-output (MIMO) systems have been seen as the key radio technology for the advancement of wireless communication systems, due to its capability to better utilize the spatial dimension of the propagation channels. Channel sounding is essential for developing accurate and realistic channel models for the massive MIMO systems. However, channel sounding with lar…

    Submitted 19 January, 2025; originally announced January 2025.

  7. arXiv:2412.04917  [pdf, other]

    cs.SD eess.AS eess.SP

    Continuous Speech Tokens Makes LLMs Robust Multi-Modality Learners

    Authors: Ze Yuan, Yanqing Liu, Shujie Liu, Sheng Zhao

    Abstract: Recent advances in GPT-4o like multi-modality models have demonstrated remarkable progress for direct speech-to-speech conversation, with real-time speech interaction experience and strong speech understanding ability. However, current research focuses on discrete speech tokens to align with discrete text tokens for language modelling, which depends on an audio codec with residual connections or i…

    Submitted 6 December, 2024; originally announced December 2024.

  8. arXiv:2411.14837  [pdf, other]

    eess.SP

    Fast High-Quality Enhanced Imaging Algorithm for Layered Dielectric Targets Based on MMW MIMO-SAR System

    Authors: Xu Chen, Guangsheng Yu, Zhian Yuan, Hao Wu, Yilin Jiang, Ying Wang, Bin Deng, Limin Guo

    Abstract: Millimeter-wave (MMW) multiple-input multiple-output synthetic aperture radar (MIMO-SAR) system is a technology that can achieve high resolution, high frame rate, and all-weather imaging and has received extensive attention in the non-destructive testing and internal imaging applications of layered dielectric targets. However, the non-ideal scattering effect caused by dielectric materials can sign…

    Submitted 22 November, 2024; originally announced November 2024.

    Comments: 8 pages

  9. arXiv:2411.11190  [pdf, ps, other]

    eess.IV cs.CV

    DeepSPV: A Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images

    Authors: Zhen Yuan, David Stojanovski, Lei Li, Alberto Gomez, Haran Jogeesvaran, Esther Puyol-Antón, Baba Inusa, Andrew P. King

    Abstract: Splenomegaly, the enlargement of the spleen, is an important clinical indicator for various associated medical conditions, such as sickle cell disease (SCD). Spleen length measured from 2D ultrasound is the most widely used metric for characterising spleen size. However, it is still considered a surrogate measure, and spleen volume remains the gold standard for assessing spleen size. Accurate sple…

    Submitted 3 June, 2025; v1 submitted 17 November, 2024; originally announced November 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2308.08038

  10. arXiv:2410.23628  [pdf]

    eess.IV cs.CV physics.med-ph

    Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data

    Authors: Yucun Hou, Fenglin Zhan, Xin Cheng, Chenxi Li, Ziquan Yuan, Runze Liao, Haihao Wang, Jianlang Hua, Jing Wu, Jianyong Jiang

    Abstract: Positron emission tomography (PET) is a critical tool for diagnosing tumors and neurological disorders but poses radiation risks to patients, particularly to sensitive populations. While reducing injected radiation dose mitigates this risk, it often compromises image quality. To reconstruct full-dose-quality images from low-dose scans, we propose a Cycle-constrained Adversarial Denoising Convoluti…

    Submitted 31 October, 2024; originally announced October 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  11. arXiv:2410.09289  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Multimodal Audio-based Disease Prediction with Transformer-based Hierarchical Fusion Network

    Authors: Jinjin Cai, Ruiqi Wang, Dezhong Zhao, Ziqin Yuan, Victoria McKenna, Aaron Friedman, Rachel Foot, Susan Storey, Ryan Boente, Sudip Vhaduri, Byung-Cheol Min

    Abstract: Audio-based disease prediction is emerging as a promising supplement to traditional medical diagnosis methods, facilitating early, convenient, and non-invasive disease detection and prevention. Multimodal fusion, which integrates features from various domains within or across bio-acoustic modalities, has proven effective in enhancing diagnostic performance. However, most existing methods in the fi…

    Submitted 14 December, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

  12. arXiv:2410.03459  [pdf, other]

    cs.SD cs.IT cs.LG eess.AS

    Generative Semantic Communication for Text-to-Speech Synthesis

    Authors: Jiahao Zheng, Jinke Ren, Peng Xu, Zhihao Yuan, Jie Xu, Fangxin Wang, Gui Gui, Shuguang Cui

    Abstract: Semantic communication is a promising technology to improve communication efficiency by transmitting only the semantic information of the source data. However, traditional semantic communication methods primarily focus on data reconstruction tasks, which may not be efficient for emerging generative tasks such as text-to-speech (TTS) synthesis. To address this limitation, this paper develops a nove…

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: The paper has been accepted by IEEE Globecom Workshop

  13. arXiv:2409.16968  [pdf, other]

    cs.LG cs.NI eess.SP

    Bridge to Real Environment with Hardware-in-the-loop for Wireless Artificial Intelligence Paradigms

    Authors: Jeffrey Redondo, Nauman Aslam, Juan Zhang, Zhenhui Yuan

    Abstract: Nowadays, many machine learning (ML) solutions to improve the wireless standard IEEE802.11p for Vehicular Adhoc Network (VANET) are commonly evaluated in the simulated world. At the same time, this approach could be cost-effective compared to real-world testing due to the high cost of vehicles. There is a risk of unexpected outcomes when these solutions are implemented in the real world, potential…

    Submitted 25 September, 2024; originally announced September 2024.

  14. arXiv:2409.11299  [pdf, other]

    eess.IV cs.AI cs.CV

    TTT-Unet: Enhancing U-Net with Test-Time Training Layers for Biomedical Image Segmentation

    Authors: Rong Zhou, Zhengqing Yuan, Zhiling Yan, Weixiang Sun, Kai Zhang, Yiwei Li, Yanfang Ye, Xiang Li, Lifang He, Lichao Sun

    Abstract: Biomedical image segmentation is crucial for accurately diagnosing and analyzing various diseases. However, Convolutional Neural Networks (CNNs) and Transformers, the most commonly used architectures for this task, struggle to effectively capture long-range dependencies due to the inherent locality of CNNs and the computational complexity of Transformers. To address this limitation, we introduce T…

    Submitted 5 December, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

  15. arXiv:2409.06136   

    cs.SD eess.AS

    DENSE: Dynamic Embedding Causal Target Speech Extraction

    Authors: Yiwen Wang, Zeyu Yuan, Xihong Wu

    Abstract: Target speech extraction (TSE) focuses on extracting the speech of a specific target speaker from a mixture of signals. Existing TSE models typically utilize static embeddings as conditions for extracting the target speaker's voice. However, the static embeddings often fail to capture the contextual information of the extracted speech signal, which may limit the model's performance. We propose a n…

    Submitted 9 December, 2024; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: The experimental design and results contain errors, and I would like to withdraw the paper

  16. arXiv:2409.00122  [pdf, other]

    eess.SP cs.AI cs.LG

    Brant-X: A Unified Physiological Signal Alignment Framework

    Authors: Daoze Zhang, Zhizhang Yuan, Junru Chen, Kerui Chen, Yang Yang

    Abstract: Physiological signals serve as indispensable clues for understanding various physiological states of human bodies. Most existing works have focused on a single type of physiological signals for a range of application scenarios. However, as the body is a holistic biological system, the inherent interconnection among various physiological data should not be neglected. In particular, given the brain'…

    Submitted 28 August, 2024; originally announced September 2024.

    Comments: Accepted by SIGKDD 2024

    Journal ref: SIGKDD 2024

  17. arXiv:2408.16251  [pdf, other]

    cs.IT eess.SP

    Neural Network-Assisted Hybrid Model Based Message Passing for Parametric Holographic MIMO Near Field Channel Estimation

    Authors: Zhengdao Yuan, Yabo Guo, Dawei Gao, Qinghua Guo, Zhongyong Wang, Chongwen Huang, Ming Jin, Kai-Kit Wong

    Abstract: Holographic multiple-input and multiple-output (HMIMO) is a promising technology with the potential to achieve high energy and spectral efficiencies, enhance system capacity and diversity, etc. In this work, we address the challenge of HMIMO near field (NF) channel estimation, which is complicated by the intricate model introduced by the dyadic Green's function. Despite its complexity, the channel…

    Submitted 29 August, 2024; originally announced August 2024.

  18. arXiv:2407.19287  [pdf, other]

    stat.ML cs.LG eess.SY

    Bayesian meta learning for trustworthy uncertainty quantification

    Authors: Zhenyuan Yuan, Thinh T. Doan

    Abstract: We consider the problem of Bayesian regression with trustworthy uncertainty quantification. We define that the uncertainty quantification is trustworthy if the ground truth can be captured by intervals dependent on the predictive distributions with a pre-specified probability. Furthermore, we propose, Trust-Bayes, a novel optimization framework for Bayesian meta learning which is cognizant of trus…

    Submitted 27 July, 2024; originally announced July 2024.

  19. arXiv:2407.08944  [pdf, other]

    cs.CV eess.IV

    Bora: Biomedical Generalist Video Generation Model

    Authors: Weixiang Sun, Xiaocao You, Ruizhe Zheng, Zhengqing Yuan, Xiang Li, Lifang He, Quanzheng Li, Lichao Sun

    Abstract: Generative models hold promise for revolutionizing medical education, robot-assisted surgery, and data augmentation for medical AI development. Diffusion models can now generate realistic images from text prompts, while recent advancements have demonstrated their ability to create diverse, high-quality videos. However, these models often struggle with generating accurate representations of medical…

    Submitted 15 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  20. arXiv:2406.15222  [pdf]

    eess.IV cs.AI cs.CV

    A Deep Learning System for Rapid and Accurate Warning of Acute Aortic Syndrome on Non-contrast CT in China

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Dehai Lang, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, Jingyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, et al. (19 additional authors not shown)

    Abstract: The accurate and timely diagnosis of acute aortic syndromes (AAS) in patients presenting with acute chest pain remains a clinical challenge. Aortic CT angiography (CTA) is the imaging protocol of choice in patients with suspected AAS. However, due to economic and workflow constraints in China, the majority of suspected patients initially undergo non-contrast CT as the initial imaging testing, and…

    Submitted 23 April, 2025; v1 submitted 13 June, 2024; originally announced June 2024.

  21. arXiv:2405.10102  [pdf, other]

    cs.NE cs.AI cs.LG eess.AS

    A novel Reservoir Architecture for Periodic Time Series Prediction

    Authors: Zhongju Yuan, Geraint Wiggins, Dick Botteldooren

    Abstract: This paper introduces a novel approach to predicting periodic time series using reservoir computing. The model is tailored to deliver precise forecasts of rhythms, a crucial aspect for tasks such as generating musical rhythm. Leveraging reservoir computing, our proposed method is ultimately oriented towards predicting human perception of rhythm. Our network accurately predicts rhythmic signals wit…

    Submitted 16 May, 2024; originally announced May 2024.

  22. arXiv:2405.06159  [pdf, other]

    eess.SP

    Near-Field Channel Characterization for Mid-band ELAA Systems: Sounding, Parameter Estimation, and Modeling

    Authors: Wei Fan, Zhiqiang Yuan, Yejian Lyu, Jianhua Zhang, Gert Pedersen, Jonathan Borrill, Fengchun Zhang

    Abstract: 6G communication will greatly benefit from using extremely large-scale antenna arrays (ELAAs) and new mid-band spectrums (7-24 GHz). These techniques require a thorough exploration of the challenges and potentials of the associated near-field (NF) phenomena. It is crucial to develop accurate NF channel models that include spherical wave propagation and spatial non-stationarity (SnS). However, chan…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE Communication Magazine

  23. arXiv:2405.04285  [pdf, other]

    cs.AI eess.SP

    On the Foundations of Earth and Climate Foundation Models

    Authors: Xiao Xiang Zhu, Zhitong Xiong, Yi Wang, Adam J. Stewart, Konrad Heidler, Yuanyuan Wang, Zhenghang Yuan, Thomas Dujardin, Qingsong Xu, Yilei Shi

    Abstract: Foundation models have enormous potential in advancing Earth and climate sciences, however, current approaches may not be optimal as they focus on a few basic features of a desirable Earth and climate foundation model. Crafting the ideal Earth foundation model, we define eleven features which would allow such a foundation model to be beneficial for any geoscientific downstream application in an en…

    Submitted 7 May, 2024; originally announced May 2024.

  24. arXiv:2404.10440  [pdf, other]

    cs.CL eess.AS

    Language Proficiency and F0 Entrainment: A Study of L2 English Imitation in Italian, French, and Slovak Speakers

    Authors: Zheng Yuan, Štefan Beňuš, Alessandro D'Ausilio

    Abstract: This study explores F0 entrainment in second language (L2) English speech imitation during an Alternating Reading Task (ART). Participants with Italian, French, and Slovak native languages imitated English utterances, and their F0 entrainment was quantified using the Dynamic Time Warping (DTW) distance between the parameterized F0 contours of the imitated utterances and those of the model utteranc…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted at Speech Prosody 2024

  25. arXiv:2404.10324  [pdf]

    cs.LG cs.CE eess.SY

    Graph neural network-based surrogate modelling for real-time hydraulic prediction of urban drainage networks

    Authors: Zhiyu Zhang, Chenkaixiang Lu, Wenchong Tian, Zhenliang Liao, Zhiguo Yuan

    Abstract: Physics-based models are computationally time-consuming and infeasible for real-time scenarios of urban drainage networks, and a surrogate model is needed to accelerate the online predictive modelling. Fully-connected neural networks (NNs) are potential surrogate models, but may suffer from low interpretability and efficiency in fitting complex targets. Owing to the state-of-the-art modelling powe…

    Submitted 1 August, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Journal ref: Water Research, 2024, 263, 122142

  26. arXiv:2404.07827  [pdf, other]

    eess.SY

    iPREFER: An Intelligent Parameter Extractor based on Features for BSIM-CMG Models

    Authors: Zhiliang Peng, Yicheng Wang, Zhengwu Yuan, Xingsheng Wang

    Abstract: This paper introduces an innovative parameter extraction method for BSIM-CMG compact models, seamlessly integrating curve feature extraction and machine learning techniques. This method offers a promising solution for bridging the division between TCAD and compact model, significantly contributing to the Design Technology Co-Optimization (DTCO) process. The key innovation lies in the development o…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 6 pages

  27. arXiv:2404.02710  [pdf, other]

    cs.CL eess.AS

    ART: The Alternating Reading Task Corpus for Speech Entrainment and Imitation

    Authors: Zheng Yuan, Dorina de Jong, Štefan Beňuš, Noël Nguyen, Ruitao Feng, Róbert Sabo, Luciano Fadiga, Alessandro D'Ausilio

    Abstract: We introduce the Alternating Reading Task (ART) Corpus, a collection of dyadic sentence reading for studying the entrainment and imitation behaviour in speech communication. The ART corpus features three experimental conditions - solo reading, alternating reading, and deliberate imitation - as well as three sub-corpora encompassing French-, Italian-, and Slovak-accented English. This design allows…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 15 pages, 2 figures, 7 tables, accepted at LREC-COLING 2024 conference

  28. arXiv:2403.13245  [pdf, other]

    eess.SY cs.AI cs.DC cs.LG cs.RO

    Federated reinforcement learning for robot motion planning with zero-shot generalization

    Authors: Zhenyuan Yuan, Siyuan Xu, Minghui Zhu

    Abstract: This paper considers the problem of learning a control policy for robot motion planning with zero-shot generalization, i.e., no data collection and policy adaptation is needed when the learned policy is deployed in new environments. We develop a federated reinforcement learning framework that enables collaborative learning of multiple learners and a central server, i.e., the Cloud, without sharing…

    Submitted 7 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  29. arXiv:2403.12467  [pdf, other]

    eess.SP

    Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications

    Authors: Heng Wang, Jianhua Zhang, Gaofeng Nie, Li Yu, Zhiqiang Yuan, Tongjie Li, Jialin Wang, Guangyi Liu

    Abstract: Digital twin channel (DTC) is the real-time mapping of a wireless channel from the physical world to the digital world, which is expected to provide significant performance enhancements for the sixth-generation (6G) air-interface design. In this work, we first define five evolution levels of channel twins with the progression of wireless communication. The fifth level, autonomous DTC, is elaborate…

    Submitted 12 August, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures, 15 references. It is submitted to IEEE journal

  30. Object Segmentation-Assisted Inter Prediction for Versatile Video Coding

    Authors: Zhuoyuan Li, Zikun Yuan, Li Li, Dong Liu, Xiaohu Tang, Feng Wu

    Abstract: In modern video coding standards, block-based inter prediction is widely adopted, which brings high compression efficiency. However, in natural videos, there are usually multiple moving objects of arbitrary shapes, resulting in complex motion fields that are difficult to represent compactly. This problem has been tackled by more flexible block partitioning methods in the Versatile Video Coding (VV…

    Submitted 12 September, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 20 pages, 13 figures, accepted by IEEE Transactions on Broadcasting (TBC)

  31. Unsupervised Learning for Equitable DER Control

    Authors: Zhenyi Yuan, Guido Cavraro, Ahmed S. Zamzam, Jorge Cortés

    Abstract: In the context of managing distributed energy resources (DERs) within distribution networks (DNs), this work focuses on the task of developing local controllers. We propose an unsupervised learning framework to train functions that can closely approximate optimal power flow (OPF) solutions. The primary aim is to establish specific conditions under which these learned functions can collectively gui…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: To appear at 23rd Power Systems Computation Conference

    Journal ref: Electric Power Systems Research, vol. 234, p. 110634, 2024

  32. arXiv:2402.10251  [pdf, other]

    q-bio.NC cs.AI cs.LG eess.SP

    BrainWave: A Brain Signal Foundation Model for Clinical Applications

    Authors: Zhizhang Yuan, Fanqi Shen, Meng Li, Yuguo Yu, Chenhao Tan, Yang Yang

    Abstract: Neural electrical activity is fundamental to brain function, underlying a range of cognitive and behavioral processes, including movement, perception, decision-making, and consciousness. Abnormal patterns of neural signaling often indicate the presence of underlying brain diseases. The variability among individuals, the diverse array of clinical symptoms from various brain disorders, and the limit…

    Submitted 19 September, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 39 pages, 14 figures

  33. Energy-efficient Integrated Sensing and Communication System and DNLFM Waveform

    Authors: Yihua Ma, Zhifeng Yuan, Shuqiang Xia, Chen Bai, Zhongbin Wang, Yuxin Wang

    Abstract: Integrated sensing and communication (ISAC) is a key enabler of 6G. Unlike communication radio links, the sensing signal requires to experience round trips from many scatters. Therefore, sensing is more power-sensitive and faces a severer multi-target interference. In this paper, the ISAC system employs dedicated sensing signals, which can be reused as the communication reference signal. This pape…

    Submitted 17 September, 2023; originally announced September 2023.

    Journal ref: 2024 IEEE 99th Vehicular Technology Conference (VTC2024-Spring), Singapore, Singapore, 2024, pp. 1-6

  34. arXiv:2308.08038  [pdf, other]

    eess.IV cs.CV

    Deep Learning Framework for Spleen Volume Estimation from 2D Cross-sectional Views

    Authors: Zhen Yuan, Esther Puyol-Anton, Haran Jogeesvaran, Baba Inusa, Andrew P. King

    Abstract: Abnormal spleen enlargement (splenomegaly) is regarded as a clinical indicator for a range of conditions, including liver disease, cancer and blood diseases. While spleen length measured from ultrasound images is a commonly used surrogate for spleen size, spleen volume remains the gold standard metric for assessing splenomegaly and the severity of related clinical conditions. Computed tomography i…

    Submitted 17 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: 22 pages, 7 figures

  35. arXiv:2307.08717  [pdf, other]

    eess.IV cs.CV

    Untrained neural network embedded Fourier phase retrieval from few measurements

    Authors: Liyuan Ma, Hongxia Wang, Ningyi Leng, Ziyang Yuan

    Abstract: Fourier phase retrieval (FPR) is a challenging task widely used in various applications. It involves recovering an unknown signal from its Fourier phaseless measurements. FPR with few measurements is important for reducing time and hardware costs, but it suffers from serious ill-posedness. Recently, untrained neural networks have offered new approaches by introducing learned priors to alleviate th…

    Submitted 16 July, 2023; originally announced July 2023.

  36. arXiv:2306.05088  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN

    Authors: Zheng Yuan, Aldo Pastore, Dorina de Jong, Hao Xu, Luciano Fadiga, Alessandro D'Ausilio

    Abstract: Phonetic convergence describes the automatic and unconscious speech adaptation of two interlocutors in a conversation. This paper proposes a Siamese recurrent neural network (RNN) architecture to measure the convergence of the holistic spectral characteristics of speech sounds in an L2-L2 interaction. We extend an alternating reading task (the ART) dataset by adding 20 native Slovak L2 English spe…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023

  37. Constraints on OPF Surrogates for Learning Stable Local Volt/Var Controllers

    Authors: Zhenyi Yuan, Guido Cavraro, Jorge Cortés

    Abstract: We consider the problem of learning local Volt/Var controllers in distribution grids (DGs). Our approach starts from learning separable surrogates that take both local voltages and reactive powers as arguments and predict the reactive power setpoints that approximate optimal power flow (OPF) solutions. We propose an incremental control algorithm and identify two different sets of slope conditions…

    Submitted 7 June, 2023; originally announced June 2023.

    Journal ref: IEEE Control Systems Letters, vol. 7, pp. 2533 - 2538, 2023

  38. arXiv:2305.16616  [pdf, other]

    eess.SP

    Channel Measurement, Modeling, and Simulation for 6G: A Survey and Tutorial

    Authors: Jianhua Zhang, Jiaxin Lin, Pan Tang, Yuxiang Zhang, Huixin Xu, Tianyang Gao, Haiyang Miao, Zeyong Chai, Zhengfu Zhou, Yi Li, Huiwen Gong, Yameng Liu, Zhiqiang Yuan, Lei Tian, Shaoshi Yang, Liang Xia, Guangyi Liu, Ping Zhang

    Abstract: The sixth generation (6G) mobile communications have attracted substantial attention in the global research community of information and communication technologies (ICT). 6G systems are expected to support not only extended 5G usage scenarios, but also new usage scenarios, such as integrated sensing and communication (ISAC), integrated artificial intelligence (AI) and communication, and communicat…

    Submitted 10 March, 2025; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 41 pages, 52 figures

  39. arXiv:2304.06887  [pdf, other]

    cs.IT eess.SP

    Hierarchically Structured Matrix Recovery-Based Channel Estimation for RIS-Aided Communications

    Authors: Yabo Guo, Peng Sun, Zhengdao Yuan, Qinghua Guo, Zhongyong Wang

    Abstract: Reconfigurable intelligent surface (RIS) has emerged as a promising technology for improving capacity and extending coverage of wireless networks. In this work, we consider RIS-aided millimeter wave (mmWave) multiple-input and multiple-output (MIMO) communications, where acquiring accurate channel state information is challenging due to the high dimensionality of channels. To fully exploit the str…

    Submitted 13 April, 2023; originally announced April 2023.

  40. arXiv:2304.04580  [pdf, other]

    cs.IT eess.SP

    Matrix Factorization Based Blind Bayesian Receiver for Grant-Free Random Access in mmWave MIMO mMTC

    Authors: Zhengdao Yuan, Fei Liu, Qinghua Guo, Xiaojun Yuan, Zhongyong Wang, Yonghui Li

    Abstract: Grant-free random access is promising for massive connectivity with sporadic transmissions in massive machine type communications (mMTC), where the hand-shaking between the access point (AP) and users is skipped, leading to high access efficiency. In grant-free random access, the AP needs to identify the active users and perform channel estimation and signal detection. Conventionally, pilot signal…

    Submitted 10 April, 2023; originally announced April 2023.

  41. OTFDM: A Novel 2D Modulation Waveform Modeling Dot-product Doubly-selective Channel

    Authors: Yihua Ma, Zhifeng Yuan, Yu Xin, Jiang Hua, Guanghui Yu, Jin Xu, Liujun Hu

    Abstract: Recently, a two-dimension (2D) modulation waveform of orthogonal time-frequency-space (OTFS) has been a popular 6G candidate to replace existing orthogonal frequency division multiplexing (OFDM). The extensive OTFS researches help to make both the advantages and limitations of OTFS more and more clear. The limitations are not easy to overcome as they come from OTFS on-grid 2D convolution channel m…

    Submitted 4 July, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted by IEEE PIMRC 2023

    Journal ref: 2023 IEEE PIMRC, Toronto, ON, Canada, 2023, pp. 1-

  42. arXiv:2302.14752  [pdf, other]

    cs.RO eess.SY

    Multi-Robot-Guided Crowd Evacuation: Two-Scale Modeling and Control

    Authors: Tongjia Zheng, Zhenyuan Yuan, Mollik Nayyar, Alan R. Wagner, Minghui Zhu, Hai Lin

    Abstract: Emergency evacuation describes a complex situation involving time-critical decision-making by evacuees. Mobile robots are being actively explored as a potential solution to provide timely guidance. In this work, we study a robot-guided crowd evacuation problem where a small group of robots is used to guide a large human crowd to safe locations. The challenge lies in how to use micro-level human-ro…

    Submitted 11 January, 2024; v1 submitted 28 February, 2023; originally announced February 2023.

  43. arXiv:2302.05079  [pdf, ps, other]

    eess.SY

    Output tracking based on extended observer for nonlinear uncertain systems

    Authors: Xinhua Wang, Zengqiang Chen, Zhuzhi Yuan

    Abstract: A high-gain extended observer is designed for a class of nonlinear uncertain systems. This observer has the ability of estimating system uncertainty, and it can be used to estimate the derivatives of signal up to order n. The controller based on this extended observer can make the tracking error and its derivatives converge to zero rapidly even when uncertainties and disturbances exist. The result…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 6 pages, 1 figure, published in Control and Decision, 19(10), 2004, pp. 1113-1116

  44. arXiv:2212.07048  [pdf, other]

    cs.CV eess.IV

    PD-Quant: Post-Training Quantization based on Prediction Difference Metric

    Authors: Jiawei Liu, Lin Niu, Zhihang Yuan, Dawei Yang, Xinggang Wang, Wenyu Liu

    Abstract: Post-training quantization (PTQ) is a neural network compression technique that converts a full-precision model into a quantized model using lower-precision data types. Although it can help reduce the size and computational cost of deep neural networks, it can also introduce quantization noise and reduce prediction accuracy, especially in extremely low-bit settings. How to determine the appropriat…

    Submitted 27 March, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

  45. arXiv:2211.06615  [pdf, other]

    eess.SP

    A Shared Cluster-based Stochastic Channel Model for Integrated Sensing and Communication Systems

    Authors: Yameng Liu, Jianhua Zhang, Yuxiang Zhang, Zhiqiang Yuan, Guangyi Liu

    Abstract: Integrated Sensing And Communication (ISAC) has been recognized as a promising technology in the 6G communication. A realistic channel model is a prerequisite for designing ISAC systems. Most existing channel models independently generate the communication and sensing channels under the same framework. However, due to the multiplexing of hardware resources and the same environment, signals enabled…

    Submitted 6 September, 2024; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: 13 pages, 8 figures

  46. arXiv:2210.12646  [pdf, other]

    eess.SP

    ADMM based Fourier phase retrieval with untrained generative prior

    Authors: Liyuan Ma, Hongxia Wang, Ningyi Leng, Ziyang Yuan

    Abstract: Fourier phase retrieval (FPR) is an inverse problem that recovers the signal from its Fourier magnitude measurement, it's ill-posed especially when the sampling rates are low. In this paper, an untrained generative prior is introduced to attack the ill-posedness. Based on the alternating direction method of multipliers (ADMM), an algorithm utilizing the untrained generative network called Net-ADM…

    Submitted 23 October, 2022; originally announced October 2022.

  47. Learning Provably Stable Local Volt/Var Controllers for Efficient Network Operation

    Authors: Zhenyi Yuan, Guido Cavraro, Manish K. Singh, Jorge Cortés

    Abstract: This paper develops a data-driven framework to synthesize local Volt/Var control strategies for distributed energy resources (DERs) in power distribution networks (DNs). Aiming to improve DN operational efficiency, as quantified by a generic optimal reactive power flow (ORPF) problem, we propose a two-stage approach. The first stage involves learning the manifold of optimal operating points determ…

    Submitted 23 July, 2024; v1 submitted 26 September, 2022; originally announced September 2022.

    Journal ref: IEEE Transactions on Power Systems, vol. 39, no. 1, pp. 2066-2079, 2024

  48. arXiv:2209.09795  [pdf, other]

    cs.RO eess.SY

    Multi-Robot-Assisted Human Crowd Evacuation using Navigation Velocity Fields

    Authors: Tongjia Zheng, Zhenyuan Yuan, Mollik Nayyar, Alan R. Wagner, Minghui Zhu, Hai Lin

    Abstract: This work studies a robot-assisted crowd evacuation problem where we control a small group of robots to guide a large human crowd to safe locations. The challenge lies in how to model human-robot interactions and design robot controls to indirectly control a human population that significantly outnumbers the robots. To address the challenge, we treat the crowd as a continuum and formulate the evac…

    Submitted 20 September, 2022; originally announced September 2022.

  49. arXiv:2209.02604  [pdf, other]

    cs.MM cs.AI cs.CV cs.SD eess.AS

    Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module

    Authors: Yihe Liu, Ziqi Yuan, Huisheng Mao, Zhiyun Liang, Wanqiuyue Yang, Yuanzhe Qiu, Tie Cheng, Xiaoteng Li, Hua Xu, Kai Gao

    Abstract: Multimodal sentiment analysis (MSA), which supposes to improve text-based sentiment analysis with associated acoustic and visual modalities, is an emerging research area due to its potential applications in Human-Computer Interaction (HCI). However, the existing researches observe that the acoustic and visual modalities contribute much less than the textual modality, termed as text-predominant. Un…

    Submitted 21 August, 2022; originally announced September 2022.

    Comments: 16 pages, 7 figures, accepted by ICMI 2022

  50. arXiv:2208.09117  [pdf, ps, other]

    eess.SY

    Learning Local Volt/Var Controllers Towards Efficient Network Operation with Stability Guarantees

    Authors: Guido Cavraro, Zhenyi Yuan, Manish K. Singh, Jorge Cortés

    Abstract: This paper considers the problem of voltage regulation in distribution networks. The primary motivation is to keep voltages within preassigned operating limits by commanding the reactive power output of distributed energy resources (DERs) deployed in the grid. We develop a framework for developing local Volt/Var control that comprises two main steps. In the first, by exploiting historical data and…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted by IEEE CDC 2022