Skip to main content

Showing 1–50 of 116 results for author: Zhao, Q

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.04383  [pdf, ps, other

    eess.IV cs.CV

    ViTaL: A Multimodality Dataset and Benchmark for Multi-pathological Ovarian Tumor Recognition

    Authors: You Zhou, Lijiang Chen, Guangxia Cui, Wenpei Bai, Yu Guo, Shuchang Lyu, Guangliang Cheng, Qi Zhao

    Abstract: Ovarian tumor, as a common gynecological disease, can rapidly deteriorate into serious health crises when undetected early, thus posing significant threats to the health of women. Deep neural networks have the potential to identify ovarian tumors, thereby reducing mortality rates, but limited public datasets hinder its progress. To address this gap, we introduce a vital ovarian tumor pathological… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  2. arXiv:2506.17108  [pdf, ps, other

    eess.SP cs.IT stat.ML

    Searching for a Hidden Markov Anomaly over Multiple Processes

    Authors: Levli Citron, Kobi Cohen, Qing Zhao

    Abstract: We address the problem of detecting an anomalous process among a large number of processes. At each time t, normal processes are in state zero (normal state), while the abnormal process may be in either state zero (normal state) or state one (abnormal state), with the states being hidden. The transition between states for the abnormal process is governed by a Markov chain over time. At each time s… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 13 pages, 9 figures

  3. arXiv:2506.06710  [pdf, ps, other

    cs.CV eess.IV

    A Systematic Investigation on Deep Learning-Based Omnidirectional Image and Video Super-Resolution

    Authors: Qianqian Zhao, Chunle Guo, Tianyi Zhang, Junpei Zhang, Peiyang Jia, Tan Su, Wenjie Jiang, Chongyi Li

    Abstract: Omnidirectional image and video super-resolution is a crucial research topic in low-level vision, playing an essential role in virtual reality and augmented reality applications. Its goal is to reconstruct high-resolution images or video frames from low-resolution inputs, thereby enhancing detail preservation and enabling more accurate scene analysis and interpretation. In recent years, numerous i… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

  4. arXiv:2505.05041  [pdf, other

    eess.IV cs.CV

    ADNP-15: An Open-Source Histopathological Dataset for Neuritic Plaque Segmentation in Human Brain Whole Slide Images with Frequency Domain Image Enhancement for Stain Normalization

    Authors: Chenxi Zhao, Jianqiang Li, Qing Zhao, Jing Bai, Susana Boluda, Benoit Delatour, Lev Stimmer, Daniel Racoceanu, Gabriel Jimenez, Guanghui Fu

    Abstract: Alzheimer's Disease (AD) is a neurodegenerative disorder characterized by amyloid-beta plaques and tau neurofibrillary tangles, which serve as key histopathological features. The identification and segmentation of these lesions are crucial for understanding AD progression but remain challenging due to the lack of large-scale annotated datasets and the impact of staining variations on automated ima… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  5. arXiv:2504.15667  [pdf, other

    eess.IV cs.CV

    Performance Estimation for Supervised Medical Image Segmentation Models on Unlabeled Data Using UniverSeg

    Authors: Jingchen Zou, Jianqiang Li, Gabriel Jimenez, Qing Zhao, Daniel Racoceanu, Matias Cosarinsky, Enzo Ferrante, Guanghui Fu

    Abstract: The performance of medical image segmentation models is usually evaluated using metrics like the Dice score and Hausdorff distance, which compare predicted masks to ground truth annotations. However, when applying the model to unseen data, such as in clinical settings, it is often impractical to annotate all the data, making the model's performance uncertain. To address this challenge, we propose… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  6. arXiv:2504.01038  [pdf, other

    eess.IV cs.CV cs.HC

    An Integrated AI-Enabled System Using One Class Twin Cross Learning (OCT-X) for Early Gastric Cancer Detection

    Authors: Xian-Xian Liu, Yuanyuan Wei, Mingkun Xu, Yongze Guo, Hongwei Zhang, Huicong Dong, Qun Song, Qi Zhao, Wei Luo, Feng Tien, Juntao Gao, Simon Fong

    Abstract: Early detection of gastric cancer, a leading cause of cancer-related mortality worldwide, remains hampered by the limitations of current diagnostic technologies, leading to high rates of misdiagnosis and missed diagnoses. To address these challenges, we propose an integrated system that synergizes advanced hardware and software technologies to balance speed-accuracy. Our study introduces the One C… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

    Comments: 26 pages, 4 figures, 6 tables

  7. arXiv:2503.20822  [pdf, other

    eess.IV cs.AI cs.GR

    Synthetic Video Enhances Physical Fidelity in Video Synthesis

    Authors: Qi Zhao, Xingyu Ni, Ziyu Wang, Feng Cheng, Ziyan Yang, Lu Jiang, Bohan Wang

    Abstract: We investigate how to enhance the physical fidelity of video generation models by leveraging synthetic videos derived from computer graphics pipelines. These rendered videos respect real-world physics, such as maintaining 3D consistency, and serve as a valuable resource that can potentially improve video generation models. To harness this potential, we propose a solution that curates and integrate… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  8. arXiv:2503.19427  [pdf, other

    eess.IV cs.CV

    ASP-VMUNet: Atrous Shifted Parallel Vision Mamba U-Net for Skin Lesion Segmentation

    Authors: Muyi Bao, Shuchang Lyu, Zhaoyang Xu, Qi Zhao, Changyu Zeng, Wenpei Bai, Guangliang Cheng

    Abstract: Skin lesion segmentation is a critical challenge in computer vision, and it is essential to separate pathological features from healthy skin for diagnostics accurately. Traditional Convolutional Neural Networks (CNNs) are limited by narrow receptive fields, and Transformers face significant computational burdens. This paper presents a novel skin lesion segmentation framework, the Atrous Shifted Pa… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  9. arXiv:2503.13560  [pdf, other

    eess.IV cs.CV

    MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset

    Authors: Zhaodong Wu, Qiaochu Zhao, Ming Hu, Yulong Li, Haochen Xue, Kang Dang, Zhengyong Jiang, Angelos Stefanidis, Qiufeng Wang, Imran Razzak, Zongyuan Ge, Junjun He, Yu Qiao, Zhong Zheng, Feilong Tang, Jionglong Su

    Abstract: With the significantly increasing incidence and prevalence of abdominal diseases, there is a need to embrace greater use of new innovations and technology for the diagnosis and treatment of patients. Although deep-learning methods have notably been developed to assist radiologists in diagnosing abdominal diseases, existing models have the restricted ability to segment common lesions in the abdomen… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  10. arXiv:2502.17499  [pdf

    eess.SP cs.AI cs.LG math.NA

    Detecting Long QT Syndrome and First-Degree Atrioventricular Block using Single-Lead AI-ECG: A Multi-Center Real-World Study

    Authors: Sumei Fan, Deyun Zhang, Yue Wang, Shijia Geng, Kun Lu, Meng Sang, Weilun Xu, Haixue Wang, Qinghao Zhao, Chuandong Cheng, Peng Wang, Shenda Hong

    Abstract: Home-based single-lead AI-ECG devices have enabled continuous, real-world cardiac monitoring. However, the accuracy of parameter calculations from single-lead AI-ECG algorithm remains to be fully validated, which is critical for conditions such as Long QT Syndrome (LQTS) and First-Degree Atrioventricular Block (AVBI). In this multicenter study, we assessed FeatureDB, an ECG measurements computatio… ▽ More

    Submitted 26 April, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: 29pages, 11 figures, 8 tables

  11. arXiv:2502.17380  [pdf, ps, other

    cs.SD cs.AI cs.CL eess.AS

    Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation

    Authors: Qiuming Zhao, Guangzhi Sun, Chao Zhang

    Abstract: Language diversity presents a significant challenge in speech-to-text (S2T) tasks, such as automatic speech recognition and translation. Traditional multi-lingual multi-task training approaches aim to address this by jointly optimising multiple speech recognition and translation tasks across various languages. While models like Whisper, built on these strategies, demonstrate strong performance, th… ▽ More

    Submitted 7 July, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: 13 pages

  12. arXiv:2502.15285  [pdf, other

    cs.SD cs.AI cs.DC cs.NI eess.AS

    Offload Rethinking by Cloud Assistance for Efficient Environmental Sound Recognition on LPWANs

    Authors: Le Zhang, Quanling Zhao, Run Wang, Shirley Bian, Onat Gungor, Flavio Ponzina, Tajana Rosing

    Abstract: Learning-based environmental sound recognition has emerged as a crucial method for ultra-low-power environmental monitoring in biological research and city-scale sensing systems. These systems usually operate under limited resources and are often powered by harvested energy in remote areas. Recent efforts in on-device sound recognition suffer from low accuracy due to resource constraints, whereas… ▽ More

    Submitted 21 March, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: Accepted by The 23rd ACM Conference on Embedded Networked Sensor Systems (SenSys '25)

  13. arXiv:2502.12990  [pdf, other

    eess.SP

    Artificial Intelligence-derived Vascular Age from Photoplethysmography: A Novel Digital Biomarker for Cardiovascular Health

    Authors: Guangkun Nie, Qinghao Zhao, Gongzheng Tang, Yaxin Li, Shenda Hong

    Abstract: With the increasing availability of wearable devices, photoplethysmography (PPG) has emerged as a promising non-invasive tool for monitoring human hemodynamics. We propose a deep learning framework to estimate vascular age (AI-vascular age) from PPG signals, incorporating a distribution-aware loss to address biases caused by imbalanced data. The model was developed using data from the UK Biobank (… ▽ More

    Submitted 20 March, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

  14. arXiv:2502.11729  [pdf, other

    eess.IV

    On Quantizing Neural Representation for Variable-Rate Video Coding

    Authors: Junqi Shi, Zhujia Chen, Hanfei Li, Qi Zhao, Ming Lu, Tong Chen, Zhan Ma

    Abstract: This work introduces NeuroQuant, a novel post-training quantization (PTQ) approach tailored to non-generalized Implicit Neural Representations for variable-rate Video Coding (INR-VC). Unlike existing methods that require extensive weight retraining for each target bitrate, we hypothesize that variable-rate coding can be achieved by adjusting quantization parameters (QPs) of pre-trained weights. Ou… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: to be pulished in ICLR'25

  15. arXiv:2501.15588  [pdf, other

    eess.IV cs.CV

    Tumor Detection, Segmentation and Classification Challenge on Automated 3D Breast Ultrasound: The TDSC-ABUS Challenge

    Authors: Gongning Luo, Mingwang Xu, Hongyu Chen, Xinjie Liang, Xing Tao, Dong Ni, Hyunsu Jeong, Chulhong Kim, Raphael Stock, Michael Baumgartner, Yannick Kirchhoff, Maximilian Rokuss, Klaus Maier-Hein, Zhikai Yang, Tianyu Fan, Nicolas Boutry, Dmitry Tereshchenko, Arthur Moine, Maximilien Charmetant, Jan Sauer, Hao Du, Xiang-Hui Bai, Vipul Pai Raikar, Ricardo Montoya-del-Angel, Robert Marti , et al. (12 additional authors not shown)

    Abstract: Breast cancer is one of the most common causes of death among women worldwide. Early detection helps in reducing the number of deaths. Automated 3D Breast Ultrasound (ABUS) is a newer approach for breast screening, which has many advantages over handheld mammography such as safety, speed, and higher detection rate of breast cancer. Tumor detection, segmentation, and classification are key componen… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

  16. arXiv:2501.12023  [pdf, other

    cs.LG cs.CV eess.IV

    Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing's Syndrome Diagnosis in Facial Analysis

    Authors: Hongjun Liu, Changwei Song, Jiaqi Qiang, Jianqiang Li, Hui Pan, Lin Lu, Xiao Long, Qing Zhao, Jiuzuo Huang, Shi Chen

    Abstract: Cushing's syndrome is a condition caused by excessive glucocorticoid secretion from the adrenal cortex, often manifesting with moon facies and plethora, making facial data crucial for diagnosis. Previous studies have used pre-trained convolutional neural networks (CNNs) for diagnosing Cushing's syndrome using frontal facial images. However, CNNs are better at capturing local features, while Cushin… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

  17. arXiv:2412.11185  [pdf, other

    eess.AS cs.CL cs.SD

    Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition

    Authors: Han Zhu, Gaofeng Cheng, Qingwei Zhao, Pengyuan Zhang

    Abstract: The performance of automatic speech recognition models often degenerates on domains not covered by the training data. Domain adaptation can address this issue, assuming the availability of the target domain data in the target language. However, such assumption does not stand in many real-world applications. To make domain adaptation more applicable, we address the problem of zero-shot domain adapt… ▽ More

    Submitted 15 December, 2024; originally announced December 2024.

  18. arXiv:2410.00946  [pdf, other

    eess.IV cs.LG

    Spectral Graph Sample Weighting for Interpretable Sub-cohort Analysis in Predictive Models for Neuroimaging

    Authors: Magdalini Paschali, Yu Hang Jiang, Spencer Siegel, Camila Gonzalez, Kilian M. Pohl, Akshay Chaudhari, Qingyu Zhao

    Abstract: Recent advancements in medicine have confirmed that brain disorders often comprise multiple subtypes of mechanisms, developmental trajectories, or severity levels. Such heterogeneity is often associated with demographic aspects (e.g., sex) or disease-related contributors (e.g., genetics). Thus, the predictive power of machine learning models used for symptom prediction varies across subjects based… ▽ More

    Submitted 5 October, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

  19. arXiv:2408.17339  [pdf, other

    cs.CV eess.IV

    Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method

    Authors: Yuji Lin, Xianqiang Lyu, Junhui Hou, Qian Zhao, Deyu Meng

    Abstract: In this paper, we delve into the realm of 4-D light fields (LFs) to enhance underwater imaging plagued by light absorption, scattering, and other challenges. Contrasting with conventional 2-D RGB imaging, 4-D LF imaging excels in capturing scenes from multiple perspectives, thereby indirectly embedding geometric information. This intrinsic property is anticipated to effectively address the challen… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 14 pages, 14 figures

  20. arXiv:2408.05877  [pdf, other

    eess.IV

    Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network

    Authors: Kailai Sun, Xinwei Wang, Shaobo Liu, Qianchuan Zhao, Gao Huang, Chang Liu

    Abstract: Pedestrian detection and tracking in crowded video sequences have a wide range of applications, including autonomous driving, robot navigation and pedestrian flow surveillance. However, detecting and tracking pedestrians in high-density crowds face many challenges, including intra-class occlusions, complex motions, and diverse poses. Although deep learning models have achieved remarkable progress… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  21. arXiv:2408.03979  [pdf, ps, other

    cs.SD eess.AS

    Speaker Adaptation for Quantised End-to-End ASR Models

    Authors: Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng

    Abstract: End-to-end models have shown superior performance for automatic speech recognition (ASR). However, such models are often very large in size and thus challenging to deploy on resource-constrained edge devices. While quantisation can reduce model sizes, it can lead to increased word error rates (WERs). Although improved quantisation methods were proposed to address the issue of performance degradati… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: submitted to ASRU 2023 Workshop

  22. arXiv:2407.09424  [pdf, other

    eess.SP cs.AI

    TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models

    Authors: Hang Zou, Qiyang Zhao, Yu Tian, Lina Bariah, Faouzi Bader, Thierry Lestable, Merouane Debbah

    Abstract: Large Language Models (LLMs) have the potential to revolutionize the Sixth Generation (6G) communication networks. However, current mainstream LLMs generally lack the specialized knowledge in telecom domain. In this paper, for the first time, we propose a pipeline to adapt any general purpose LLMs to a telecom-specific LLMs. We collect and build telecom-specific pre-train dataset, instruction data… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:1303.2654 by other authors

  23. arXiv:2407.07506  [pdf, other

    eess.SP cs.AI

    Generative AI for RF Sensing in IoT systems

    Authors: Li Wang, Chao Zhang, Qiyang Zhao, Hang Zou, Samson Lasaulce, Giuseppe Valenzise, Zhuo He, Merouane Debbah

    Abstract: The development of wireless sensing technologies, using signals such as Wi-Fi, infrared, and RF to gather environmental data, has significantly advanced within Internet of Things (IoT) systems. Among these, Radio Frequency (RF) sensing stands out for its cost-effective and non-intrusive monitoring of human activities and environmental changes. However, traditional RF sensing methods face significa… ▽ More

    Submitted 24 November, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

  24. arXiv:2407.00476  [pdf, other

    cs.CL eess.SY

    Large Language Models for Power Scheduling: A User-Centric Approach

    Authors: Thomas Mongaillard, Samson Lasaulce, Othman Hicheur, Chao Zhang, Lina Bariah, Vineeth S. Varma, Hang Zou, Qiyang Zhao, Merouane Debbah

    Abstract: While traditional optimization and scheduling schemes are designed to meet fixed, predefined system requirements, future systems are moving toward user-driven approaches and personalized services, aiming to achieve high quality-of-experience (QoE) and flexibility. This challenge is particularly pronounced in wireless and digitalized energy networks, where users' requirements have largely not been… ▽ More

    Submitted 14 November, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

  25. arXiv:2406.19706  [pdf, other

    cs.SD eess.AS

    SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR

    Authors: Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng

    Abstract: Mixture-of-experts (MoE) models have achieved excellent results in many tasks. However, conventional MoE models are often very large, making them challenging to deploy on resource-constrained edge devices. In this paper, we propose a novel speaker adaptive mixture of LoRA experts (SAML) approach, which uses low-rank adaptation (LoRA) modules as experts to reduce the number of trainable parameters… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 5 pages, accepted by Interspeech 2024. arXiv admin note: substantial text overlap with arXiv:2309.09136

  26. arXiv:2406.14953  [pdf, other

    cs.CV cs.AI cs.LG eess.SP

    Deep Imbalanced Regression to Estimate Vascular Age from PPG Data: a Novel Digital Biomarker for Cardiovascular Health

    Authors: Guangkun Nie, Qinghao Zhao, Gongzheng Tang, Jun Li, Shenda Hong

    Abstract: Photoplethysmography (PPG) is emerging as a crucial tool for monitoring human hemodynamics, with recent studies highlighting its potential in assessing vascular aging through deep learning. However, real-world age distributions are often imbalanced, posing significant challenges for deep learning models. In this paper, we introduce a novel, simple, and effective loss function named the Dist Loss t… ▽ More

    Submitted 2 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  27. arXiv:2406.10454  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    HumanPlus: Humanoid Shadowing and Imitation from Humans

    Authors: Zipeng Fu, Qingqing Zhao, Qi Wu, Gordon Wetzstein, Chelsea Finn

    Abstract: One of the key arguments for building robots that have similar form factors to human beings is that we can leverage the massive human data for training. Yet, doing so has remained challenging in practice due to the complexities in humanoid perception and control, lingering physical gaps between humanoids and humans in morphologies and actuation, and lack of a data pipeline for humanoids to learn a… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: project website: https://humanoid-ai.github.io/

  28. arXiv:2405.18333  [pdf, other

    eess.SY

    On the analysis of a higher-order Lotka-Volterra model: an application of S-tensors and the polynomial complementarity problem

    Authors: Shaoxuan Cui, Qi Zhao, Guofeng Zhang, Hildeberto Jardón-Kojakhmetov, Ming Cao

    Abstract: It is known that the effect of species' density on species' growth is non-additive in real ecological systems. This challenges the conventional Lotka-Volterra model, where the interactions are always pairwise and their effects are additive. To address this challenge, we introduce HOIs (Higher-Order Interactions) which are able to capture, for example, the indirect effect of one species on a second… ▽ More

    Submitted 8 July, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  29. arXiv:2405.14559  [pdf, other

    eess.IV

    HemSeg-200: A Voxel-Annotated Dataset for Intracerebral Hemorrhages Segmentation in Brain CT Scans

    Authors: Changwei Song, Qing Zhao, Jianqiang Li, Xin Yue, Ruoyun Gao, Zhaoxuan Wang, An Gao, Guanghui Fu

    Abstract: Acute intracerebral hemorrhage is a life-threatening condition that demands immediate medical intervention. Intraparenchymal hemorrhage (IPH) and intraventricular hemorrhage (IVH) are critical subtypes of this condition. Clinically, when such hemorrhages are suspected, immediate CT scanning is essential to assess the extent of the bleeding and to facilitate the formulation of a targeted treatment… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  30. arXiv:2405.11115  [pdf

    eess.IV physics.optics

    Ptychographic non-line-of-sight imaging for depth-resolved visualization of hidden objects

    Authors: Pengming Song, Qianhao Zhao, Ruihai Wang, Ninghe Liu, Yingqi Qiang, Tianbo Wang, Xincheng Zhang, Yi Zhang, Guoan Zheng

    Abstract: Non-line-of-sight (NLOS) imaging enables the visualization of objects hidden from direct view, with applications in surveillance, remote sensing, and light detection and ranging. Here, we introduce a NLOS imaging technique termed ptychographic NLOS (pNLOS), which leverages coded ptychography for depth-resolved imaging of obscured objects. Our approach involves scanning a laser spot on a wall to il… ▽ More

    Submitted 1 September, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  31. arXiv:2405.04128  [pdf, other

    cs.CL cs.SD eess.AS

    Fine-grained Speech Sentiment Analysis in Chinese Psychological Support Hotlines Based on Large-scale Pre-trained Model

    Authors: Zhonglong Chen, Changwei Song, Yining Chen, Jianqiang Li, Guanghui Fu, Yongsheng Tong, Qing Zhao

    Abstract: Suicide and suicidal behaviors remain significant challenges for public policy and healthcare. In response, psychological support hotlines have been established worldwide to provide immediate help to individuals in mental crises. The effectiveness of these hotlines largely depends on accurately identifying callers' emotional states, particularly underlying negative emotions indicative of increased… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  32. arXiv:2404.09149  [pdf, other

    eess.SY cs.NE math.NA

    Heuristic Solution to Joint Deployment and Beamforming Design for STAR-RIS Aided Networks

    Authors: Bai Yan, Qi Zhao, Jin Zhang, J. Andrew Zhang

    Abstract: This paper tackles the deployment challenges of Simultaneous Transmitting and Reflecting Reconfigurable Intelligent Surface (STAR-RIS) in communication systems. Unlike existing works that use fixed deployment setups or solely optimize the location, this paper emphasizes the joint optimization of the location and orientation of STAR-RIS. This enables searching across all user grouping possibilities… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 30 pages

  33. arXiv:2403.17235  [pdf, ps, other

    eess.SY

    A Discrete-Time Least-Squares Adaptive State Tracking Control Scheme with A Mobile-Robot System Study

    Authors: Qianhong Zhao, Gang Tao

    Abstract: This paper develops an adaptive state tracking control scheme for discrete-time systems, using the least-squares algorithm, as the new solution to the long-standing discrete-time adaptive state tracking control problem to which the Lyapunov method (well-developed for the continuous-time adaptive state tracking problem) is not applicable. The new adaptive state tracking scheme is based on a recentl… ▽ More

    Submitted 1 February, 2025; v1 submitted 25 March, 2024; originally announced March 2024.

  34. arXiv:2403.13648  [pdf, other

    eess.SY

    Priority-based Energy Allocation in Buildings through Distributed Model Predictive Control

    Authors: Hongyi Li, Jun Xu, Qianchuan Zhao

    Abstract: Many countries are facing energy shortage today and most of the global energy is consumed by HVAC systems in buildings. For the scenarios where the energy system is not sufficiently supplied to HVAC systems, a priority-based allocation scheme based on distributed model predictive control is proposed in this paper, which distributes the energy rationally based on priority order. According to the sc… ▽ More

    Submitted 22 July, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  35. arXiv:2403.11405  [pdf, other

    eess.SP

    A Deep Learning Method for Beat-Level Risk Analysis and Interpretation of Atrial Fibrillation Patients during Sinus Rhythm

    Authors: Jun Lei, Yuxi Zhou, Xue Tian, Qinghao Zhao, Qi Zhang, Shijia Geng, Qingbo Wu, Shenda Hong

    Abstract: Atrial Fibrillation (AF) is a common cardiac arrhythmia. Many AF patients experience complications such as stroke and other cardiovascular issues. Early detection of AF is crucial. Existing algorithms can only distinguish ``AF rhythm in AF patients'' from ``sinus rhythm in normal individuals'' . However, AF patients do not always exhibit AF rhythm, posing a challenge for diagnosis when the AF rhyt… ▽ More

    Submitted 2 October, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  36. arXiv:2403.10481  [pdf, other

    eess.IV eess.SP

    Tensor Star Tensor Decomposition and Its Applications to Higher-order Compression and Completion

    Authors: Wuyang Zhou, Yu-Bang Zheng, Qibin Zhao, Danilo Mandic

    Abstract: A novel tensor decomposition framework, termed Tensor Star (TS) decomposition, is proposed which represents a new type of tensor network decomposition based on tensor contractions. This is achieved by connecting the core tensors in a ring shape, whereby the core tensors act as skip connections between the factor tensors and allow for direct correlation characterisation between any two arbitrary di… ▽ More

    Submitted 6 September, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  37. arXiv:2403.06942  [pdf, other

    eess.SY cs.LG stat.ML

    Grid Monitoring with Synchro-Waveform and AI Foundation Model Technologies

    Authors: Lang Tong, Xinyi Wang, Qing Zhao

    Abstract: Purpose:This article advocates for the development of a next-generation grid monitoring and control system designed for future grids dominated by inverter-based resources. Leveraging recent progress in generative artificial intelligence (AI), machine learning, and networking technology, we develop a physics-based AI foundation model with high-resolution synchro-waveform measurement technology to e… ▽ More

    Submitted 25 January, 2025; v1 submitted 11 March, 2024; originally announced March 2024.

  38. arXiv:2403.05808  [pdf, other

    cs.CV eess.IV

    Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution

    Authors: Junxiong Lin, Yan Wang, Zeng Tao, Boyang Wang, Qing Zhao, Haorang Wang, Xuan Tong, Xinji Mai, Yuxuan Lin, Wei Song, Jiawen Yu, Shaoqi Yan, Wenqiang Zhang

    Abstract: Pre-trained diffusion models utilized for image generation encapsulate a substantial reservoir of a priori knowledge pertaining to intricate textures. Harnessing the potential of leveraging this a priori knowledge in the context of image super-resolution presents a compelling avenue. Nonetheless, prevailing diffusion-based methodologies presently overlook the constraints imposed by degradation inf… ▽ More

    Submitted 9 July, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  39. arXiv:2403.05743  [pdf, ps, other

    eess.SP cs.LG econ.GN

    Probabilistic Forecasting of Real-Time Electricity Market Signals via Interpretable Generative AI

    Authors: Xinyi Wang, Qing Zhao, Lang Tong

    Abstract: This paper introduces a generative AI approach to probabilistic forecasting of real-time electricity market signals, including locational marginal prices, interregional price spreads, and demand-supply imbalances. We present WIAE-GPF, a Weak Innovation AutoEncoder-based Generative Probabilistic Forecasting architecture that generates future samples of multivariate time series. Unlike traditional b… ▽ More

    Submitted 24 September, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  40. arXiv:2402.16631  [pdf, other

    cs.AI cs.NI eess.SP

    GenAINet: Enabling Wireless Collective Intelligence via Knowledge Transfer and Reasoning

    Authors: Hang Zou, Qiyang Zhao, Samson Lasaulce, Lina Bariah, Mehdi Bennis, Merouane Debbah

    Abstract: Generative Artificial Intelligence (GenAI) and communication networks are expected to have groundbreaking synergies for 6G. Connecting GenAI agents via a wireless network can potentially unleash the power of Collective Intelligence (CI) and pave the way for Artificial General Intelligence (AGI). However, current wireless networks are designed as a "data pipe" and are not suited to accommodate and… ▽ More

    Submitted 4 May, 2025; v1 submitted 26 February, 2024; originally announced February 2024.

  41. arXiv:2402.13870  [pdf, ps, other

    cs.LG eess.SP stat.AP

    Generative Probabilistic Time Series Forecasting and Applications in Grid Operations

    Authors: Xinyi Wang, Lang Tong, Qing Zhao

    Abstract: Generative probabilistic forecasting produces future time series samples according to the conditional probability distribution given past time series observations. Such techniques are essential in risk-based decision-making and planning under uncertainty with broad applications in grid operations, including electricity price forecasting, risk-based economic dispatch, and stochastic optimizations.… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted at CISS 2024. arXiv admin note: text overlap with arXiv:2306.03782

  42. arXiv:2402.09679  [pdf, other

    cs.RO eess.SY

    Design and Visual Servoing Control of a Hybrid Dual-Segment Flexible Neurosurgical Robot for Intraventricular Biopsy

    Authors: Jian Chen, Mingcong Chen, Qingxiang Zhao, Shuai Wang, Yihe Wang, Ying Xiao, Jian Hu, Danny Tat Ming Chan, Kam Tong Leo Yeung, David Yuen Chung Chan, Hongbin Liu

    Abstract: Traditional rigid endoscopes have challenges in flexibly treating tumors located deep in the brain, and low operability and fixed viewing angles limit its development. This study introduces a novel dual-segment flexible robotic endoscope MicroNeuro, designed to perform biopsies with dexterous surgical manipulation deep in the brain. Taking into account the uncertainty of the control model, an imag… ▽ More

    Submitted 23 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE International Conference on Robotics and Automation (ICRA) 2024, 7 pages, 9 figures

    Journal ref: IEEE International Conference on Robotics & Automation, 2024

  43. arXiv:2402.07595  [pdf, other

    eess.IV cs.LG

    Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and DINOv2 in Medical Imaging Classification

    Authors: Yuning Huang, Jingchen Zou, Lanxi Meng, Xin Yue, Qing Zhao, Jianqiang Li, Changwei Song, Gabriel Jimenez, Shaowu Li, Guanghui Fu

    Abstract: Medical image analysis frequently encounters data scarcity challenges. Transfer learning has been effective in addressing this issue while conserving computational resources. The recent advent of foundational models like the DINOv2, which uses the vision transformer architecture, has opened new opportunities in the field and gathered significant interest. However, DINOv2's performance on clinical… ▽ More

    Submitted 13 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  44. arXiv:2401.12783  [pdf, other

    cs.AI cs.LG eess.SP

    A Review of Deep Learning Methods for Photoplethysmography Data

    Authors: Guangkun Nie, Jiabao Zhu, Gongzheng Tang, Deyun Zhang, Shijia Geng, Qinghao Zhao, Shenda Hong

    Abstract: Photoplethysmography (PPG) is a highly promising device due to its advantages in portability, user-friendly operation, and non-invasive capabilities to measure a wide range of physiological information. Recent advancements in deep learning have demonstrated remarkable outcomes by leveraging PPG signals for tasks related to personal health management and other multifaceted applications. In this rev… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  45. arXiv:2310.18732  [pdf, other

    physics.optics eess.IV

    Tracking and fast imaging of a moving object via Fourier modulation

    Authors: Shijian Li, Xu-Ri Yao, Wei Zhang, Yeliang Wang, Qing Zhao

    Abstract: Recently, several single-pixel imaging (SPI) schemes have emerged for imaging fast-moving objects and have shown dramatic results. However, fast image reconstruction of a moving object with high quality is still challenging for SPI, thereby limiting its practical application. In this paper, we present a simultaneous tracking and imaging method that incorporates position encoding and spatial inform… ▽ More

    Submitted 15 September, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 6 figures

  46. arXiv:2310.14965  [pdf, ps, other

    eess.IV physics.optics

    Parallel compressive super-resolution imaging with wide field-of-view based on physics enhanced network

    Authors: Xiao-Peng Jin, An-Dong Xiong, Wei Zhang, Xiao-Qing Wang, Fan Liu, Chang-Heng Li, Xu-Ri Yao, Xue-Feng Liu, Qing Zhao

    Abstract: Achieving both high-performance and wide field-of-view (FOV) super-resolution imaging has been attracting increasing attention in recent years. However, such goal suffers from long reconstruction time and huge storage space. Parallel compressive imaging (PCI) provides an efficient solution, but the super-resolution quality and imaging speed are strongly dependent on precise optical transfer functi… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  47. arXiv:2310.04630  [pdf, other

    eess.IV cs.CV

    Metadata-Conditioned Generative Models to Synthesize Anatomically-Plausible 3D Brain MRIs

    Authors: Wei Peng, Tomas Bosschieter, Jiahong Ouyang, Robert Paul, Ehsan Adeli, Qingyu Zhao, Kilian M. Pohl

    Abstract: Generative AI models hold great potential in creating synthetic brain MRIs that advance neuroimaging studies by, for example, enriching data diversity. However, the mainstay of AI research only focuses on optimizing the visual quality (such as signal-to-noise ratio) of the synthetic MRIs while lacking insights into their relevance to neuroscience. To gain these insights with respect to T1-weighted… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  48. arXiv:2310.02399  [pdf, other

    cs.NI eess.SP

    Can 5G NR Sidelink communications support wireless augmented reality?

    Authors: Ashutosh Srivastava, Qing Zhao, Yi Lu, Ping Wang, Qi Qu, Zhu Ji, Yee Sin Chan, Shivendra S. Panwar

    Abstract: Smart glasses that support augmented reality (AR) have the potential to become the consumer's primary medium of connecting to the future internet. For the best quality of user experience, AR glasses must have a small form factor and long battery life, while satisfying the data rate and latency requirements of AR applications. To extend the AR glasses' battery life, the computation and processing i… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 7 pages, 7 figures, accepted for publication in 2023 IEEE Global Communications Conference: Mobile and Wireless Networks (Globecom 2023 MWN), Kuala Lumpur, Malaysia, Dec. 2023

  49. arXiv:2309.13611  [pdf

    eess.IV cs.IR physics.optics

    Sparsity-regularized coded ptychography for robust and efficient lensless microscopy on a chip

    Authors: Ninghe Liu, Qianhao Zhao, Guoan Zheng

    Abstract: Coded ptychography has emerged as a powerful technique for high-throughput, high-resolution lensless imaging. However, the trade-off between acquisition speed and image quality remains a significant challenge. To address this, we introduce a novel sparsity-regularized approach to coded ptychography that dramatically reduces the number of required measurements while maintaining high reconstruction… ▽ More

    Submitted 1 September, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: 14 pages, 7 figures

    MSC Class: 78-10

  50. arXiv:2309.11811  [pdf, other

    eess.SP cs.AI

    Multimodal Transformers for Wireless Communications: A Case Study in Beam Prediction

    Authors: Yu Tian, Qiyang Zhao, Zine el abidine Kherroubi, Fouzi Boukhalfa, Kebin Wu, Faouzi Bader

    Abstract: Wireless communications at high-frequency bands with large antenna arrays face challenges in beam management, which can potentially be improved by multimodality sensing information from cameras, LiDAR, radar, and GPS. In this paper, we present a multimodal transformer deep learning framework for sensing-assisted beam prediction. We employ a convolutional neural network to extract the features from… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.