Skip to main content

Showing 1–50 of 226 results for author: Hu, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.04100  [pdf, ps, other

    cs.LG cs.AI eess.SY

    Hierarchical Testing with Rabbit Optimization for Industrial Cyber-Physical Systems

    Authors: Jinwei Hu, Zezhi Tang, Xin Jin, Benyuan Zhang, Yi Dong, Xiaowei Huang

    Abstract: This paper presents HERO (Hierarchical Testing with Rabbit Optimization), a novel black-box adversarial testing framework for evaluating the robustness of deep learning-based Prognostics and Health Management systems in Industrial Cyber-Physical Systems. Leveraging Artificial Rabbit Optimization, HERO generates physically constrained adversarial examples that align with real-world data distributio… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: Preprint accepted by IEEE Transactions on Industrial Cyber Physical Systems

  2. arXiv:2507.00755  [pdf

    eess.AS cs.AI cs.SD

    LearnAFE: Circuit-Algorithm Co-design Framework for Learnable Audio Analog Front-End

    Authors: Jinhai Hu, Zhongyi Zhang, Cong Sheng Leow, Wang Ling Goh, Yuan Gao

    Abstract: This paper presents a circuit-algorithm co-design framework for learnable analog front-end (AFE) in audio signal classification. Designing AFE and backend classifiers separately is a common practice but non-ideal, as shown in this paper. Instead, this paper proposes a joint optimization of the backend classifier with the AFE's transfer function to achieve system-level optimum. More specifically, t… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 11 pages, 15 figures, accepted for publication on IEEE Transactions on Circuits and Systems I: Regular Papers

  3. arXiv:2506.19893  [pdf, ps, other

    cs.LG cs.AI cs.IT eess.IV

    Distillation-Enabled Knowledge Alignment for Generative Semantic Communications in AIGC Provisioning Tasks

    Authors: Jingzhi Hu, Geoffrey Ye Li

    Abstract: Due to the surging amount of AI-generated content (AIGC), its provisioning to edges and mobile users from the cloud incurs substantial traffic on networks. Generative semantic communication (GSC) offers a promising solution by transmitting highly compact information, i.e., prompt text and latent representations, instead of high-dimensional AIGC data. However, GSC relies on the alignment between th… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  4. arXiv:2506.12712  [pdf, ps, other

    cs.CV eess.IV

    Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups

    Authors: Zhenghao Xi, Zhengnan Lv, Yang Zheng, Xiang Liu, Zhuang Yu, Junran Chen, Jing Hu, Yaqi Liu

    Abstract: The segmentation of coal maceral groups can be described as a semantic segmentation process of coal maceral group images, which is of great significance for studying the chemical properties of coal. Generally, existing semantic segmentation models of coal maceral groups use the method of stacking parameters to achieve higher accuracy. It leads to increased computational requirements and impacts mo… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  5. arXiv:2506.11496  [pdf, ps, other

    eess.IV cs.CV

    Taming Stable Diffusion for Computed Tomography Blind Super-Resolution

    Authors: Chunlei Li, Yilei Shi, Haoxi Hu, Jingliang Hu, Xiao Xiang Zhu, Lichao Mou

    Abstract: High-resolution computed tomography (CT) imaging is essential for medical diagnosis but requires increased radiation exposure, creating a critical trade-off between image quality and patient safety. While deep learning methods have shown promise in CT super-resolution, they face challenges with complex degradations and limited medical training data. Meanwhile, large-scale pre-trained diffusion mod… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  6. arXiv:2506.11339  [pdf, ps, other

    cs.CY eess.SY

    WIP: Exploring the Value of a Debugging Cheat Sheet and Mini Lecture in Improving Undergraduate Debugging Skills and Mindset

    Authors: Andrew Ash, John Hu

    Abstract: This work-in-progress research paper explores the efficacy of a small-scale microelectronics debugging education intervention utilizing quasi-experimental design in an introductory microelectronics course for third-year electrical and computer engineering (ECE) students. In the first semester of research, the experimental group attended a debugging "mini lecture" covering two common sources of cir… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: This is the accepted version of a paper accepted for presentation at the 2025 IEEE Frontiers in Education Conference (FIE). The final version will be available via IEEE Xplore at: https://ieeexplore.ieee.org

  7. arXiv:2506.00397  [pdf

    eess.SP

    A Family of Robust Generalized Adaptive Filters and Application for Time-series Prediction

    Authors: Yi Peng, Haiquan Zhao, Jinhui Hu

    Abstract: The continuous development of new adaptive filters (AFs) based on novel cost functions (CFs) is driven by the demands of various application scenarios and noise environments. However, these algorithms typically demonstrate optimal performance only in specific conditions. In the event of the noise change, the performance of these AFs often declines, rendering simple parameter adjustments ineffectiv… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  8. arXiv:2505.23743  [pdf, ps, other

    cs.CV eess.IV

    DarkDiff: Advancing Low-Light Raw Enhancement by Retasking Diffusion Models for Camera ISP

    Authors: Amber Yijia Zheng, Yu Zhang, Jun Hu, Raymond A. Yeh, Chen Chen

    Abstract: High-quality photography in extreme low-light conditions is challenging but impactful for digital cameras. With advanced computing hardware, traditional camera image signal processor (ISP) algorithms are gradually being replaced by efficient deep networks that enhance noisy raw images more intelligently. However, existing regression-based models often minimize pixel errors and result in oversmooth… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  9. arXiv:2505.17030  [pdf, ps, other

    eess.IV cs.LG

    Distillation-Enabled Knowledge Alignment Protocol for Semantic Communication in AI Agent Networks

    Authors: Jingzhi Hu, Geoffrey Ye Li

    Abstract: Future networks are envisioned to connect massive artificial intelligence (AI) agents, enabling their extensive collaboration on diverse tasks. Compared to traditional entities, these agents naturally suit the semantic communication (SC), which can significantly enhance the bandwidth efficiency. Nevertheless, SC requires the knowledge among agents to be aligned, while agents have distinct expert k… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  10. arXiv:2505.15868  [pdf

    q-bio.QM cs.AI eess.IV

    An Inclusive Foundation Model for Generalizable Cytogenetics in Precision Oncology

    Authors: Changchun Yang, Weiqian Dai, Yilan Zhang, Siyuan Chen, Jingdong Hu, Junkai Su, Yuxuan Chen, Ao Xu, Na Li, Xin Gao, Yongguo Yu

    Abstract: Chromosome analysis is vital for diagnosing genetic disorders and guiding cancer therapy decisions through the identification of somatic clonal aberrations. However, developing an AI model are hindered by the overwhelming complexity and diversity of chromosomal abnormalities, requiring extensive annotation efforts, while automated methods remain task-specific and lack generalizability due to the s… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: These authors contributed equally to this work: Changchun Yang, Weiqian Dai, Yilan Zhang

  11. arXiv:2505.11793  [pdf, other

    cs.CV cs.AI eess.IV

    CL-CaGAN: Capsule differential adversarial continuous learning for cross-domain hyperspectral anomaly detection

    Authors: Jianing Wang, Siying Guo, Zheng Hua, Runhu Huang, Jinyu Hu, Maoguo Gong

    Abstract: Anomaly detection (AD) has attracted remarkable attention in hyperspectral image (HSI) processing fields, and most existing deep learning (DL)-based algorithms indicate dramatic potential for detecting anomaly samples through specific training process under current scenario. However, the limited prior information and the catastrophic forgetting problem indicate crucial challenges for existing DL s… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1-15,2024

  12. arXiv:2505.00237  [pdf, ps, other

    cs.RO cs.LG eess.SY

    Future-Oriented Navigation: Dynamic Obstacle Avoidance with One-Shot Energy-Based Multimodal Motion Prediction

    Authors: Ze Zhang, Georg Hess, Junjie Hu, Emmanuel Dean, Lennart Svensson, Knut Åkesson

    Abstract: This paper proposes an integrated approach for the safe and efficient control of mobile robots in dynamic and uncertain environments. The approach consists of two key steps: one-shot multimodal motion prediction to anticipate motions of dynamic obstacles and model predictive control to incorporate these predictions into the motion planning process. Motion prediction is driven by an energy-based ne… ▽ More

    Submitted 4 June, 2025; v1 submitted 30 April, 2025; originally announced May 2025.

    Comments: Published in IEEE Robotics and Automation Letters (RA-L)

  13. arXiv:2504.15178  [pdf

    eess.SP

    Time-Series Analysis on Edge-AI Hardware for Healthcare Monitoring

    Authors: Jinhai Hu

    Abstract: This project addresses the need for efficient, real-time analysis of biomedical signals such as electrocardiograms (ECG) and electroencephalograms (EEG) for continuous health monitoring. Traditional methods rely on long-duration data recording followed by offline analysis, which is power-intensive and delays responses to critical symptoms such as arrhythmia. To overcome these limitations, a time-d… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 38 pages, 20 figures, Progress report for qualification cum PhD confirmation exercise

  14. arXiv:2504.14952  [pdf, other

    cs.CV eess.IV

    PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV

    Authors: Qianyu Zhu, Junjie Wang, Jeremiah Hu, Jia Ai, Yong Lee

    Abstract: Deep learning algorithms have significantly reduced the computational time and improved the spatial resolution of particle image velocimetry~(PIV). However, the models trained on synthetic datasets might have a degraded performance on practical particle images due to domain gaps. As a result, special residual patterns are often observed for the vector fields of deep learning-based estimators. To r… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  15. arXiv:2504.12711  [pdf, other

    cs.CV cs.AI eess.IV

    NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

    Authors: Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou , et al. (112 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images. This challenge received a wide range of impressive solutions, which are developed and evaluated using our collected real-world Raindrop Clarity dataset. Unlike existing deraining datasets, our Raindrop Clarity dataset is more diverse and challenging in degradation types and contents, which includ… ▽ More

    Submitted 19 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of CVPR NTIRE 2025; 26 pages; Methods from 32 teams

  16. arXiv:2504.10686  [pdf, other

    cs.CV eess.IV

    The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Hang Guo, Lei Sun, Zongwei Wu, Radu Timofte, Yawei Li, Yao Zhang, Xinning Chai, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Li Song, Hongyuan Yu, Pufan Xu, Cheng Wan, Zhijuan Huang, Peng Guo, Shuyuan Cui, Chenjun Li, Xuehai Hu, Pan Pan, Xin Zhang, Heng Zhang, Qing Luo, Linyan Jiang , et al. (122 additional authors not shown)

    Abstract: This paper presents a comprehensive review of the NTIRE 2025 Challenge on Single-Image Efficient Super-Resolution (ESR). The challenge aimed to advance the development of deep models that optimize key computational metrics, i.e., runtime, parameters, and FLOPs, while achieving a PSNR of at least 26.90 dB on the $\operatorname{DIV2K\_LSDIR\_valid}$ dataset and 26.99 dB on the… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: Accepted by CVPR2025 NTIRE Workshop, Efficient Super-Resolution Challenge Report. 50 pages

  17. arXiv:2504.07731  [pdf

    eess.SP

    Adaptive Robust Unscented Kalman Filter for Dynamic State Estimation of Power System

    Authors: Duc Viet Nguyen, Haiquan Zhao, Jinhui Hu, Le Ngoc Giang

    Abstract: Non-Gaussian noise and the uncertainty of noise distribution are the common factors that reduce accuracy in dynamic state estimation of power systems (PS). In addition, the optimal value of the free coefficients in the unscented Kalman filter (UKF) based on information theoretic criteria is also an urgent problem. In this paper, a robust adaptive UKF (AUKF) under generalized minimum mixture error… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: 11 pages, 10 figures,

    MSC Class: 94-10; 94-05 ACM Class: H.1.1; H.4.3

  18. arXiv:2504.07365  [pdf, ps, other

    eess.SP

    Diffusion Augmented Complex Maximum Total Correntropy Algorithm for Power System Frequency Estimation

    Authors: Haiquan Zhao, Yi Peng, Jinsong Chen, Jinhui Hu

    Abstract: Currently, adaptive filtering algorithms have been widely applied in frequency estimation for power systems. However, research on diffusion tasks remains insufficient. Existing diffusion adaptive frequency estimation algorithms exhibit certain limitations in handling input noise and lack robustness against impulsive noise. Moreover, traditional adaptive filtering algorithms designed based on the s… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

  19. arXiv:2503.23883  [pdf, ps, other

    eess.SP

    Algorithm Design and Prototype Validation for Reconfigurable Intelligent Sensing Surface: Forward-Only Transmission

    Authors: Cheng Luo, Luping Xiang, Jie Hu, Kun Yang

    Abstract: Sensing-assisted communication schemes have recently garnered significant research attention. In this work, we design a dual-function reconfigurable intelligent surface (RIS), integrating both active and passive elements, referred to as the reconfigurable intelligent sensing surface (RISS), to enhance communication. By leveraging sensing results from the active elements, we propose communication e… ▽ More

    Submitted 19 June, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

  20. arXiv:2503.14966  [pdf, other

    cs.CV eess.IV

    Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models

    Authors: Tingxiu Chen, Yilei Shi, Zixuan Zheng, Bingcong Yan, Jingliang Hu, Xiao Xiang Zhu, Lichao Mou

    Abstract: Ultrasound video classification enables automated diagnosis and has emerged as an important research area. However, publicly available ultrasound video datasets remain scarce, hindering progress in developing effective video classification models. We propose addressing this shortage by synthesizing plausible ultrasound videos from readily available, abundant ultrasound images. To this end, we intr… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: MICCAI 2024

  21. arXiv:2503.13987  [pdf, other

    eess.IV cs.CV

    Striving for Simplicity: Simple Yet Effective Prior-Aware Pseudo-Labeling for Semi-Supervised Ultrasound Image Segmentation

    Authors: Yaxiong Chen, Yujie Wang, Zixuan Zheng, Jingliang Hu, Yilei Shi, Shengwu Xiong, Xiao Xiang Zhu, Lichao Mou

    Abstract: Medical ultrasound imaging is ubiquitous, but manual analysis struggles to keep pace. Automated segmentation can help but requires large labeled datasets, which are scarce. Semi-supervised learning leveraging both unlabeled and limited labeled data is a promising approach. State-of-the-art methods use consistency regularization or pseudo-labeling but grow increasingly complex. Without sufficient l… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: MICCAI 2024

  22. arXiv:2503.08220  [pdf, other

    eess.SP

    Bedrock Models in Communication and Sensing: Advancing Generalization, Transferability, and Performance

    Authors: Cheng Luo, Luping Xiang, Jie Hu, Kun Yang

    Abstract: Deep learning (DL) has emerged as a powerful tool for addressing the intricate challenges inherent in communication and sensing systems, significantly enhancing the intelligence of future sixth-generation (6G) networks. A substantial body of research has highlighted the promise of DL-based techniques in these domains. However, in addition to improving accuracy, new challenges must be addressed reg… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  23. arXiv:2503.08202  [pdf, other

    eess.SP

    Low-Complexity Beamforming Design for Null Space-based Simultaneous Wireless Information and Power Transfer Systems

    Authors: Cheng Luo, Jie Hu, Luping Xiang, Kun Yang

    Abstract: Simultaneous wireless information and power transfer (SWIPT) is a promising technology for the upcoming sixth-generation (6G) communication networks, enabling internet of things (IoT) devices and sensors to extend their operational lifetimes. In this paper, we propose a SWIPT scheme by projecting the interference signals from both intra-wireless information transfer (WIT) and inter-wireless energy… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  24. arXiv:2503.08198  [pdf, other

    eess.SP

    Reconfigurable Intelligent Sensing Surface enables Wireless Powered Communication Networks: Interference Suppression and Massive Wireless Energy Transfer

    Authors: Cheng Luo, Jie Hu, Luping Xiang, Kun Yang

    Abstract: Recently, a novel structures of reconfigurable intelligent surface (RIS) integrating both passive and active elements, termed reconfigurable intelligent sensing surface (RISS), efficiently addresses challenges in RIS channel estimation and mitigates issues related to multiplicative path loss by processing the signal at the RISS. In this paper, we propose a sensing-assisted wirelessly powered commu… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  25. arXiv:2503.04147  [pdf, other

    cs.IT eess.SP

    Energy-Efficient Port Selection and Beamforming Design for Integrated Data and Energy Transfer Assisted by Fluid Antennas

    Authors: Long Zhang, Yizhe Zhao, Halvin Yang, Guangming Liang, Jie Hu

    Abstract: Integrated data and energy transfer (IDET) is considered as a key enabler of 6G, as it can provide both wireless energy transfer (WET) and wireless data transfer (WDT) services towards low power devices. Thanks to the extra degree of freedom provided by fluid antenna (FA), incorporating FA into IDET systems presents a promising approach to enhance energy efficiency performance. This paper investig… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: Submitted to an IEEE journal

  26. arXiv:2503.02410  [pdf, ps, other

    eess.IV cs.CV

    Neuroverse3D: Developing In-Context Learning Universal Model for Neuroimaging in 3D

    Authors: Jiesi Hu, Chenfei Ye, Yanwu Yang, Xutao Guo, Yang Shang, Pengcheng Shi, Hanyang Peng, Ting Ma

    Abstract: In-context learning (ICL), a type of universal model, demonstrates exceptional generalization across a wide range of tasks without retraining by leveraging task-specific guidance from context, making it particularly effective for the intricate demands of neuroimaging. However, current ICL models, limited to 2D inputs and thus exhibiting suboptimal performance, struggle to extend to 3D inputs due t… ▽ More

    Submitted 4 July, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

  27. arXiv:2502.20986  [pdf, other

    eess.SP

    Target Tracking using Robust Sensor Motion Control

    Authors: Jingwei Hu, Dave Zachariah, Petre Stoica

    Abstract: We consider the problem of tracking moving targets using mobile wireless sensors (of possibly different types). This is a joint estimation and control problem in which a tracking system must take into account both target and sensor dynamics. We make minimal assumptions about the target dynamics, namely only that their accelerations are bounded. We develop a control law that determines the sensor m… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

  28. arXiv:2502.20941  [pdf, other

    eess.SY

    Adaptive Input Design for Nonlinear System Identification with Operational Constraints

    Authors: Jingwei Hu, Dave Zachariah, Torbjörn Wigren, Petre Stoica

    Abstract: We consider the problem of joint input design and parameter estimation for identifying nonlinear system models through the sequential acquisition of measurements while adhering to system constraints. We utilize a receding horizon approach and propose a new scale-invariant input design criterion, which is tailored to continuously updated parameter estimates, along with a new sequential parameter es… ▽ More

    Submitted 23 May, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

  29. arXiv:2502.18200  [pdf, ps, other

    eess.SP

    Zero-Shot Semantic Communication with Multimodal Foundation Models

    Authors: Jiangjing Hu, Haotian Wu, Wenjing Zhang, Fengyu Wang, Wenjun Xu, Hui Gao, Deniz Gündüz

    Abstract: Most existing semantic communication (SemCom) systems use deep joint source-channel coding (DeepJSCC) to encode task-specific semantics in a goal-oriented manner. However, their reliance on predefined tasks and datasets significantly limits their flexibility and generalizability in practical deployments. Multi-modal foundation models provide a promising solution by generating universal semantic to… ▽ More

    Submitted 29 May, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

  30. arXiv:2502.18008  [pdf, other

    cs.SD cs.AI eess.AS

    NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms

    Authors: Yashan Wang, Shangda Wu, Jianhuai Hu, Xingjian Du, Yueqi Peng, Yongxin Huang, Shuai Fan, Xiaobing Li, Feng Yu, Maosong Sun

    Abstract: We introduce NotaGen, a symbolic music generation model aiming to explore the potential of producing high-quality classical sheet music. Inspired by the success of Large Language Models (LLMs), NotaGen adopts pre-training, fine-tuning, and reinforcement learning paradigms (henceforth referred to as the LLM training paradigms). It is pre-trained on 1.6M pieces of music in ABC notation, and then fin… ▽ More

    Submitted 21 March, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

  31. arXiv:2502.12736  [pdf, other

    eess.SP cs.LG

    Cross-Domain Continual Learning for Edge Intelligence in Wireless ISAC Networks

    Authors: Jingzhi Hu, Xin Li, Zhou Su, Jun Luo

    Abstract: In wireless networks with integrated sensing and communications (ISAC), edge intelligence (EI) is expected to be developed at edge devices (ED) for sensing user activities based on channel state information (CSI). However, due to the CSI being highly specific to users' characteristics, the CSI-activity relationship is notoriously domain dependent, essentially demanding EI to learn sufficient datas… ▽ More

    Submitted 14 April, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

  32. arXiv:2502.10932  [pdf, other

    eess.SY

    PPAC Driven Multi-die and Multi-technology Floorplanning

    Authors: Cristhian Roman-Vicharra, Yiran Chen, Jiang Hu

    Abstract: In heterogeneous integration, where different dies may utilize distinct technologies, floorplanning across multiple dies inherently requires simultaneous technology selection. This work presents the first systematic study of multi-die and multi-technology floorplanning. Unlike many conventional approaches, which are primarily driven by area and wirelength, this study additionally considers perform… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

  33. arXiv:2502.06100  [pdf, other

    cs.CV eess.SP

    Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition

    Authors: Chenyu Liu, Jinshui Hu, Baocai Yin, Jia Pan, Bing Yin, Jun Du, Qingfeng Liu

    Abstract: Online Handwritten Text Recognition (OLHTR) has gained considerable attention for its diverse range of applications. Current approaches usually treat OLHTR as a sequence recognition task, employing either a single trajectory or image encoder, or multi-stream encoders, combined with a CTC or attention-based recognition decoder. However, these approaches face several drawbacks: 1) single encoders ty… ▽ More

    Submitted 9 February, 2025; originally announced February 2025.

    Comments: ICASSP 2025

  34. arXiv:2502.04711  [pdf, other

    cs.SD eess.AS

    Dynamic Frequency-Adaptive Knowledge Distillation for Speech Enhancement

    Authors: Xihao Yuan, Siqi Liu, Hanting Chen, Lu Zhou, Jian Li, Jie Hu

    Abstract: Deep learning-based speech enhancement (SE) models have recently outperformed traditional techniques, yet their deployment on resource-constrained devices remains challenging due to high computational and memory demands. This paper introduces a novel dynamic frequency-adaptive knowledge distillation (DFKD) approach to effectively compress SE models. Our method dynamically assesses the model's outp… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: 5 pages, 2 figures, accepted by ICASSP2025

  35. arXiv:2501.18201  [pdf, other

    cs.AI eess.SY

    Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay

    Authors: Jiaqi Hu, Jie Qi, Jing Zhang

    Abstract: Control of distributed parameter systems affected by delays is a challenging task, particularly when the delays depend on spatial variables. The idea of integrating analytical control theory with learning-based control within a unified control scheme is becoming increasingly promising and advantageous. In this paper, we address the problem of controlling an unstable first-order hyperbolic PDE with… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: 6 Pages, 7 Figures

  36. arXiv:2501.16951  [pdf, other

    eess.SP

    Federated Learning Strategies for Coordinated Beamforming in Multicell ISAC

    Authors: Lai Jiang, Kaitao Meng, Murat Temiz, Jiaming Hu, Christos Masouros

    Abstract: We propose two cooperative beamforming frameworks based on federated learning (FL) for multi-cell integrated sensing and communications (ISAC) systems. Our objective is to address the following dilemma in multicell ISAC: 1) Beamforming strategies that rely solely on local channel information risk generating significant inter-cell interference (ICI), which degrades network performance for both comm… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

  37. arXiv:2501.13405  [pdf, other

    cs.IT eess.SP

    Performance Analysis of Fluid Antenna Multiple Access Assisted Wireless Powered Communication Network

    Authors: Xiao Lin, Yizhe Zhao, Halvin Yang, Jie Hu

    Abstract: This paper investigates a novel fluid antenna multiple access (FAMA)-assisted wireless powered communication network (WPCN), in which a hybrid access point (HAP) equipped with multiple fixed position antennas (FPAs) provides integrated data and energy transfer (IDET) services towards low-power devices that are equipped with a single fluid antenna (FA), while the low-power devices use harvested ene… ▽ More

    Submitted 10 February, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Comments: Submitted to an IEEE journal

  38. arXiv:2501.07329  [pdf, other

    cs.SD cs.CL eess.AS

    Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding

    Authors: Jiliang Hu, Zuchao Li, Mengjia Shen, Haojun Ai, Sheng Li, Jun Zhang

    Abstract: Spoken language understanding (SLU) is a structure prediction task in the field of speech. Recently, many works on SLU that treat it as a sequence-to-sequence task have achieved great success. However, This method is not suitable for simultaneous speech recognition and understanding. In this paper, we propose a joint speech recognition and structure learning framework (JSRSL), an end-to-end SLU mo… ▽ More

    Submitted 17 January, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: 5 pages, 2 figures, accepted by ICASSP 2025

  39. arXiv:2501.04644  [pdf, other

    eess.AS cs.SD

    FleSpeech: Flexibly Controllable Speech Generation with Various Prompts

    Authors: Hanzhao Li, Yuke Li, Xinsheng Wang, Jingbin Hu, Qicong Xie, Shan Yang, Lei Xie

    Abstract: Controllable speech generation methods typically rely on single or fixed prompts, hindering creativity and flexibility. These limitations make it difficult to meet specific user needs in certain scenarios, such as adjusting the style while preserving a selected speaker's timbre, or choosing a style and generating a voice that matches a character's visual appearance. To overcome these challenges, w… ▽ More

    Submitted 30 April, 2025; v1 submitted 8 January, 2025; originally announced January 2025.

    Comments: 14 pages, 3 figures

  40. arXiv:2412.12197  [pdf

    eess.SY cs.RO

    Anti-bullying Adaptive Cruise Control: A proactive right-of-way protection approach

    Authors: Jia Hu, Zhexi Lian, Haoran Wang, Zihan Zhang, Ruoxi Qian, Duo Li, Jaehyun, So, Junnian Zheng

    Abstract: The current Adaptive Cruise Control (ACC) systems are vulnerable to "road bully" such as cut-ins. This paper proposed an Anti-bullying Adaptive Cruise Control (AACC) approach with proactive right-of-way protection ability. It bears the following features: i) with the enhanced capability of preventing bullying from cut-ins; ii) optimal but not unsafe; iii) adaptive to various driving styles of cut-… ▽ More

    Submitted 14 December, 2024; originally announced December 2024.

    Comments: 12 pages, 15 figures

  41. arXiv:2412.12126  [pdf

    cs.DC cs.CV cs.LG eess.IV eess.SP

    Seamless Optical Cloud Computing across Edge-Metro Network for Generative AI

    Authors: Sizhe Xing, Aolong Sun, Chengxi Wang, Yizhi Wang, Boyu Dong, Junhui Hu, Xuyu Deng, An Yan, Yingjun Liu, Fangchen Hu, Zhongya Li, Ouhan Huang, Junhao Zhao, Yingjun Zhou, Ziwei Li, Jianyang Shi, Xi Xiao, Richard Penty, Qixiang Cheng, Nan Chi, Junwen Zhang

    Abstract: The rapid advancement of generative artificial intelligence (AI) in recent years has profoundly reshaped modern lifestyles, necessitating a revolutionary architecture to support the growing demands for computational power. Cloud computing has become the driving force behind this transformation. However, it consumes significant power and faces computation security risks due to the reliance on exten… ▽ More

    Submitted 1 May, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

  42. arXiv:2412.10822  [pdf

    eess.SY

    Automated Driving with Evolution Capability: A Reinforcement Learning Method with Monotonic Performance Enhancement

    Authors: Jia Hu, Xuerun Yan, Tian Xu, Haoran Wang

    Abstract: Reinforcement Learning (RL) offers a promising solution to enable evolutionary automated driving. However, the conventional RL method is always concerned with risk performance. The updated policy may not obtain a performance enhancement, even leading to performance deterioration. To address this challenge, this research proposes a High Confidence Policy Improvement Reinforcement Learning-based (HC… ▽ More

    Submitted 14 December, 2024; originally announced December 2024.

    Comments: 24 pages, 16figures

  43. arXiv:2412.08219  [pdf, other

    eess.SY

    Neural Operator Feedback for a First-Order PIDE with Spatially-Varying State Delay

    Authors: Jie Qi, Jiaqi Hu, Jing Zhang, Miroslav Krstic

    Abstract: A transport PDE with a spatial integral and recirculation with constant delay has been a benchmark for neural operator approximations of PDE backstepping controllers. Introducing a spatially-varying delay into the model gives rise to a gain operator defined through integral equations which the operator's input -- the varying delay function -- enters in previously unencountered manners, including i… ▽ More

    Submitted 14 December, 2024; v1 submitted 11 December, 2024; originally announced December 2024.

    Comments: This 14 page paper contains 1 table and 20 figures

  44. arXiv:2412.06507  [pdf, other

    eess.IV cs.CV cs.LG

    BATseg: Boundary-aware Multiclass Spinal Cord Tumor Segmentation on 3D MRI Scans

    Authors: Hongkang Song, Zihui Zhang, Yanpeng Zhou, Jie Hu, Zishuo Wang, Hou Him Chan, Chon Lok Lei, Chen Xu, Yu Xin, Bo Yang

    Abstract: Spinal cord tumors significantly contribute to neurological morbidity and mortality. Precise morphometric quantification, encompassing the size, location, and type of such tumors, holds promise for optimizing treatment planning strategies. Although recent methods have demonstrated excellent performance in medical image segmentation, they primarily focus on discerning shapes with relatively large m… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: ECCV 2024 Workshop on BioImage Computing. Code and data are available at: https://github.com/vLAR-group/BATseg

  45. arXiv:2411.19385  [pdf, other

    cs.LG cs.AI eess.SP

    Zero-Forget Preservation of Semantic Communication Alignment in Distributed AI Networks

    Authors: Jingzhi Hu, Geoffrey Ye Li

    Abstract: Future communication networks are expected to connect massive distributed artificial intelligence (AI). Exploiting aligned priori knowledge of AI pairs, it is promising to convert high-dimensional data transmission into highly-compressed semantic communications (SC). However, to accommodate the local data distribution and user preferences, AIs generally adapt to different domains, which fundamenta… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

  46. arXiv:2411.17705  [pdf, other

    eess.SP cs.AI cs.LG

    EEG-DCNet: A Fast and Accurate MI-EEG Dilated CNN Classification Method

    Authors: Wei Peng, Kang Liu, Jiaxi Shi, Jianchen Hu

    Abstract: The electroencephalography (EEG)-based motor imagery (MI) classification is a critical and challenging task in brain-computer interface (BCI) technology, which plays a significant role in assisting patients with functional impairments to regain mobility. We present a novel multi-scale atrous convolutional neural network (CNN) model called EEG-dilated convolution network (DCNet) to enhance the accu… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  47. arXiv:2411.15211  [pdf, other

    cs.LG cs.AI cs.CV eess.SP

    LightLLM: A Versatile Large Language Model for Predictive Light Sensing

    Authors: Jiawei Hu, Hong Jia, Mahbub Hassan, Lina Yao, Brano Kusy, Wen Hu

    Abstract: We propose LightLLM, a model that fine tunes pre-trained large language models (LLMs) for light-based sensing tasks. It integrates a sensor data encoder to extract key features, a contextual prompt to provide environmental information, and a fusion layer to combine these inputs into a unified representation. This combined input is then processed by the pre-trained LLM, which remains frozen while b… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: 15 pages, 14 figures, 5 tables

  48. arXiv:2411.14353   

    eess.IV cs.CV cs.LG

    Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models

    Authors: Houze Liu, Tong Zhou, Yanlin Xiang, Aoran Shen, Jiacheng Hu, Junliang Du

    Abstract: Medical image segmentation is crucial for accurate clinical diagnoses, yet it faces challenges such as low contrast between lesions and normal tissues, unclear boundaries, and high variability across patients. Deep learning has improved segmentation accuracy and efficiency, but it still relies heavily on expert annotations and struggles with the complexities of medical images. The small size of me… ▽ More

    Submitted 5 December, 2024; v1 submitted 21 November, 2024; originally announced November 2024.

    Comments: After a peer review process for a journal submission, we have been told the main conclusions presented in this paper have been proven previously by others. I believe the paper should be withdrawn

  49. arXiv:2411.08178  [pdf, other

    eess.IV math.NA

    On Adapting Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction

    Authors: Tao Hong, Zhaoyi Xu, Jason Hu, Jeffrey A. Fessler

    Abstract: Model-based iterative reconstruction plays a key role in solving inverse problems. However, the associated minimization problems are generally large-scale, ill-posed, nonsmooth, and sometimes even nonconvex, which present challenges in designing efficient iterative solvers and often prevent their practical use. Preconditioning methods can significantly accelerate the convergence of iterative metho… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

    Comments: 13 pages, 11 figures, 4 tables

  50. arXiv:2411.08014  [pdf

    cs.CV eess.IV

    Artistic Neural Style Transfer Algorithms with Activation Smoothing

    Authors: Xiangtian Li, Han Cao, Zhaoyang Zhang, Jiacheng Hu, Yuhui Jin, Zihao Zhao

    Abstract: The works of Gatys et al. demonstrated the capability of Convolutional Neural Networks (CNNs) in creating artistic style images. This process of transferring content images in different styles is called Neural Style Transfer (NST). In this paper, we re-implement image-based NST, fast NST, and arbitrary NST. We also explore to utilize ResNet with activation smoothing in NST. Extensive experimental… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

    Comments: 8 pages,7 figures