Skip to main content

Showing 1–50 of 72 results for author: Shen, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.06492  [pdf, ps, other

    eess.SY

    Dual State-space Fidelity Blade (D-STAB): A Novel Stealthy Cyber-physical Attack Paradigm

    Authors: Jiajun Shen, Hao Tu, Fengjun Li, Morteza Hashemi, Di Wu, Huazhen Fang

    Abstract: This paper presents a novel cyber-physical attack paradigm, termed the Dual State-Space Fidelity Blade (D-STAB), which targets the firmware of core cyber-physical components as a new class of attack surfaces. The D-STAB attack exploits the information asymmetry caused by the fidelity gap between high-fidelity and low-fidelity physical models in cyber-physical systems. By designing precise adversar… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

    Comments: accepted by 2025 American Control Conference

  2. arXiv:2505.17528  [pdf, ps, other

    eess.IV cs.CV

    DECT-based Space-Squeeze Method for Multi-Class Classification of Metastatic Lymph Nodes in Breast Cancer

    Authors: Hai Jiang, Chushan Zheng, Jiawei Pan, Yuanpin Zhou, Qiongting Liu, Xiang Zhang, Jun Shen, Yao Lu

    Abstract: Background: Accurate assessment of metastatic burden in axillary lymph nodes is crucial for guiding breast cancer treatment decisions, yet conventional imaging modalities struggle to differentiate metastatic burden levels and capture comprehensive lymph node characteristics. This study leverages dual-energy computed tomography (DECT) to exploit spectral-spatial information for improved multi-class… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  3. arXiv:2505.03380  [pdf, other

    cs.CV cs.AI eess.IV

    Reinforced Correlation Between Vision and Language for Precise Medical AI Assistant

    Authors: Haonan Wang, Jiaji Mao, Lehan Wang, Qixiang Zhang, Marawan Elbatel, Yi Qin, Huijun Hu, Baoxun Li, Wenhui Deng, Weifeng Qin, Hongrui Li, Jialin Liang, Jun Shen, Xiaomeng Li

    Abstract: Medical AI assistants support doctors in disease diagnosis, medical image analysis, and report generation. However, they still face significant challenges in clinical use, including limited accuracy with multimodal content and insufficient validation in real-world settings. We propose RCMed, a full-stack AI assistant that improves multimodal alignment in both input and output, enabling precise ana… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  4. arXiv:2504.09516  [pdf, other

    cs.SD cs.CV eess.AS

    FSSUAVL: A Discriminative Framework using Vision Models for Federated Self-Supervised Audio and Image Understanding

    Authors: Yasar Abbas Ur Rehman, Kin Wai Lau, Yuyang Xie, Ma Lan, JiaJun Shen

    Abstract: Recent studies have demonstrated that vision models can effectively learn multimodal audio-image representations when paired. However, the challenge of enabling deep models to learn representations from unpaired modalities remains unresolved. This issue is especially pertinent in scenarios like Federated Learning (FL), where data is often decentralized, heterogeneous, and lacks a reliable guarante… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: 8 pages

  5. Control Pneumatic Soft Bending Actuator with Online Learning Pneumatic Physical Reservoir Computing

    Authors: Junyi Shen, Tetsuro Miyazaki, Kenji Kawashima

    Abstract: The intrinsic nonlinearities of soft robots present significant control but simultaneously provide them with rich computational potential. Reservoir computing (RC) has shown effectiveness in online learning systems for controlling nonlinear systems such as soft actuators. Conventional RC can be extended into physical reservoir computing (PRC) by leveraging the nonlinear dynamics of soft actuators… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 8 pages, 13 figures, IEEE-RAS International Conference on Soft Robotics (RoboSoft 2025)

    Journal ref: 2025 IEEE 8th International Conference on Soft Robotics (RoboSoft)

  6. arXiv:2502.20224  [pdf

    eess.IV cs.AI cs.CV

    RURANET++: An Unsupervised Learning Method for Diabetic Macular Edema Based on SCSE Attention Mechanisms and Dynamic Multi-Projection Head Clustering

    Authors: Wei Yang, Yiran Zhu, Jiayu Shen, Yuhan Tang, Chengchang Pan, Hui He, Yan Su, Honggang Qi

    Abstract: Diabetic Macular Edema (DME), a prevalent complication among diabetic patients, constitutes a major cause of visual impairment and blindness. Although deep learning has achieved remarkable progress in medical image analysis, traditional DME diagnosis still relies on extensive annotated data and subjective ophthalmologist assessments, limiting practical applications. To address this, we present RUR… ▽ More

    Submitted 7 March, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

    Comments: 10 pages, 2 figures, 5 tables, submitted to The 28th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2025)

  7. arXiv:2501.15820  [pdf, other

    eess.SY cs.AI

    FuzzyLight: A Robust Two-Stage Fuzzy Approach for Traffic Signal Control Works in Real Cities

    Authors: Mingyuan Li, Jiahao Wang, Bo Du, Jun Shen, Qiang Wu

    Abstract: Effective traffic signal control (TSC) is crucial in mitigating urban congestion and reducing emissions. Recently, reinforcement learning (RL) has been the research trend for TSC. However, existing RL algorithms face several real-world challenges that hinder their practical deployment in TSC: (1) Sensor accuracy deteriorates with increased sensor detection range, and data transmission is prone to… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  8. A Novel Generative Multi-Task Representation Learning Approach for Predicting Postoperative Complications in Cardiac Surgery Patients

    Authors: Junbo Shen, Bing Xue, Thomas Kannampallil, Chenyang Lu, Joanna Abraham

    Abstract: Early detection of surgical complications allows for timely therapy and proactive risk mitigation. Machine learning (ML) can be leveraged to identify and predict patient risks for postoperative complications. We developed and validated the effectiveness of predicting postoperative complications using a novel surgical Variational Autoencoder (surgVAE) that uncovers intrinsic patterns via cross-task… ▽ More

    Submitted 18 December, 2024; v1 submitted 2 December, 2024; originally announced December 2024.

    Comments: This article has been accepted for publication in Journal of the American Medical Informatics Association Published by Oxford University Press. Codes are publicly available at: https://github.com/ai4biomedicine/surgVAE

    ACM Class: J.3; I.2.7

    Journal ref: J. Am. Med. Inform. Assoc. (2024) ocae316

  9. arXiv:2411.18853  [pdf

    eess.SY

    Self-Adaptive Active Damping Method for Stability Enhancement of Systems With Black-Box Inverters Considering Operating Points

    Authors: Yang Li, Xiangyang Wu, Zhikang Shuai, Junbin Fang, Lili He, Yi Lei, Z. John Shen

    Abstract: Due to the black-box nature of inverters and the wide variation range of operating points, it is challenging to on-line predict and adaptively enhance the stability of inverter-based systems. To solve this problem, this paper provides a feasible self-adaptive active damping method to eliminate potential small-signal instability of systems with black-box inverters under multiple operating points. F… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  10. arXiv:2410.03811  [pdf

    cs.HC eess.SY

    Enhanced Digital Twin for Human-Centric and Integrated Lighting Asset Management in Public Libraries: From Corrective to Predictive Maintenance

    Authors: Jing Lin, Jingchun Shen

    Abstract: Lighting asset management in public libraries has traditionally been reactive, focusing on corrective maintenance, addressing issues only when failures occur. Although standards now encourage preventive measures, such as incorporating a maintenance factor, the broader goal of human centric, sustainable lighting systems requires a shift toward predictive maintenance strategies. This study introduce… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  11. arXiv:2409.08228  [pdf, other

    eess.SY

    Improving Initial Transients of Online Learning Echo State Network Control System with Feedback Adjustments

    Authors: Junyi Shen

    Abstract: Echo state networks (ESNs) have become increasingly popular in online learning control systems due to their ease of training. However, online learning ESN controllers often suffer from slow convergence during the initial transient phase. Existing solutions, such as prior training, control mode switching, and incorporating plant dynamic approximations, have notable drawbacks, including undermining… ▽ More

    Submitted 16 September, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

    Comments: 6 pages, 11 figures

  12. Control Pneumatic Soft Bending Actuator with Feedforward Hysteresis Compensation by Pneumatic Physical Reservoir Computing

    Authors: Junyi Shen, Tetsuro Miyazaki, Kenji Kawashima

    Abstract: The nonlinearities of soft robots bring control challenges like hysteresis but also provide them with computational capacities. This paper introduces a fuzzy pneumatic physical reservoir computing (FPRC) model for feedforward hysteresis compensation in motion tracking control of soft actuators. Our method utilizes a pneumatic bending actuator as a physical reservoir with nonlinear computing capaci… ▽ More

    Submitted 26 December, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: 8 pages, 17 figures. IEEE Robotics and Automation Letters, doi: 10.1109/LRA.2024.3523229

    Journal ref: IEEE Robotics and Automation Letters, 2025

  13. arXiv:2408.01738  [pdf, other

    eess.SY

    Adaptive Safety with Control Barrier Functions and Triggered Batch Least-Squares Identifier

    Authors: Jiajun Shen, Wei Wang, Jing Zhou, Jinhu Lü

    Abstract: In this paper, a triggered Batch Least-Squares Identifier (BaLSI) based adaptive safety control scheme is proposed for uncertain systems with potentially conflicting control objectives and safety constraints. A relaxation term is added to the Quadratic Programs (QP) combining the transformed Control Lyapunov Functions (CLFs) and Control Barrier Functions (CBFs), to mediate the potential conflict.… ▽ More

    Submitted 24 October, 2024; v1 submitted 3 August, 2024; originally announced August 2024.

    Comments: 11 pages, 10 fidures

  14. arXiv:2408.01731  [pdf, other

    eess.SY

    Composite Learning Adaptive Control without Excitation Condition

    Authors: Jiajun Shen, Wei Wang, Changyun Wen, Jinhu Lu

    Abstract: This paper focuses on excitation collection and composite learning adaptive control design for uncertain nonlinear systems. By adopting the spectral decomposition technique, a linear regression equation is constructed to collect previously appeared excitation information, establishing a relationship between unknown parameters and the system's historical data. A composite learning term, developed u… ▽ More

    Submitted 11 August, 2024; v1 submitted 3 August, 2024; originally announced August 2024.

    Comments: 15 pages, 13 figures

  15. arXiv:2406.18993  [pdf, ps, other

    eess.SP

    Interference Cancellation Based Neural Receiver for Superimposed Pilot in Multi-Layer Transmission

    Authors: Han Xiao, Wenqiang Tian, Shi Jin, Wendong Liu, Jia Shen, Zhihua Shi, Zhi Zhang

    Abstract: In this paper, an interference cancellation based neural receiver for superimposed pilot (SIP) in multi-layer transmission is proposed, where the data and pilot are non-orthogonally superimposed in the same time-frequency resource. Specifically, to deal with the intra-layer and inter-layer interference of SIP under multi-layer transmission, the interference cancellation with superimposed symbol ai… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  16. arXiv:2405.14411  [pdf, other

    cs.AI eess.SY

    Large Language Models for Explainable Decisions in Dynamic Digital Twins

    Authors: Nan Zhang, Christian Vergara-Marcillo, Georgios Diamantopoulos, Jingran Shen, Nikos Tziritas, Rami Bahsoon, Georgios Theodoropoulos

    Abstract: Dynamic data-driven Digital Twins (DDTs) can enable informed decision-making and provide an optimisation platform for the underlying system. By leveraging principles of Dynamic Data-Driven Applications Systems (DDDAS), DDTs can formulate computational modalities for feedback loops, model updates and decision-making, including autonomous ones. However, understanding autonomous decision-making often… ▽ More

    Submitted 4 September, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 9 pages, 3 figures, accepted by DDDAS2024 -- the 5th International Conference on Dynamic Data Driven Applications Systems

  17. arXiv:2404.17554  [pdf

    cs.HC eess.SP eess.SY stat.AP

    A Novel Context driven Critical Integrative Levels (CIL) Approach: Advancing Human-Centric and Integrative Lighting Asset Management in Public Libraries with Practical Thresholds

    Authors: Jing Lin, Nina Mylly, Per Olof Hedekvist, Jingchun Shen

    Abstract: This paper proposes the context driven Critical Integrative Levels (CIL), a novel approach to lighting asset management in public libraries that aligns with the transformative vision of human-centric and integrative lighting. This approach encompasses not only the visual aspects of lighting performance but also prioritizes the physiological and psychological well-being of library users. Incorporat… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  18. arXiv:2404.15312  [pdf, other

    eess.SP cs.CV

    Realtime Person Identification via Gait Analysis

    Authors: Shanmuga Venkatachalam, Harideep Nair, Prabhu Vellaisamy, Yongqi Zhou, Ziad Youssfi, John Paul Shen

    Abstract: Each person has a unique gait, i.e., walking style, that can be used as a biometric for personal identification. Recent works have demonstrated effective gait recognition using deep neural networks, however most of these works predominantly focus on classification accuracy rather than model efficiency. In order to perform gait recognition using wearable devices on the edge, it is imperative to dev… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  19. arXiv:2403.08948  [pdf, ps, other

    eess.SY cs.GT

    Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning

    Authors: Jiajun Shen, Fengjun Li, Morteza Hashemi, Huazhen Fang

    Abstract: In the swift evolution of Cyber-Physical Systems (CPSs) within intelligent environments, especially in the industrial domain shaped by Industry 4.0, the surge in development brings forth unprecedented security challenges. This paper explores the intricate security issues of Industrial CPSs (ICPSs), with a specific focus on the unique threats presented by intelligent attackers capable of directly c… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 8 pages

  20. arXiv:2402.02889  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Exploring Federated Self-Supervised Learning for General Purpose Audio Understanding

    Authors: Yasar Abbas Ur Rehman, Kin Wai Lau, Yuyang Xie, Lan Ma, Jiajun Shen

    Abstract: The integration of Federated Learning (FL) and Self-supervised Learning (SSL) offers a unique and synergetic combination to exploit the audio data for general-purpose audio understanding, without compromising user data privacy. However, rare efforts have been made to investigate the SSL models in the FL regime for general-purpose audio understanding, especially when the training data is generated… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  21. arXiv:2402.02724  [pdf, other

    eess.IV cs.CV cs.LG

    FDNet: Frequency Domain Denoising Network For Cell Segmentation in Astrocytes Derived From Induced Pluripotent Stem Cells

    Authors: Haoran Li, Jiahua Shi, Huaming Chen, Bo Du, Simon Maksour, Gabrielle Phillips, Mirella Dottori, Jun Shen

    Abstract: Artificially generated induced pluripotent stem cells (iPSCs) from somatic cells play an important role for disease modeling and drug screening of neurodegenerative diseases. Astrocytes differentiated from iPSCs are important targets to investigate neuronal metabolism. The astrocyte differentiation progress can be monitored through the variations of morphology observed from microscopy images at di… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by The IEEE International Symposium on Biomedical Imaging (ISBI) 2024

  22. arXiv:2310.15548  [pdf, ps, other

    eess.SP

    Knowledge-driven Meta-learning for CSI Feedback

    Authors: Han Xiao, Wenqiang Tian, Wendong Liu, Jiajia Guo, Zhi Zhang, Shi Jin, Zhihua Shi, Li Guo, Jia Shen

    Abstract: Accurate and effective channel state information (CSI) feedback is a key technology for massive multiple-input and multiple-output systems. Recently, deep learning (DL) has been introduced for CSI feedback enhancement through massive collected training data and lengthy training time, which is quite costly and impractical for realistic deployment. In this article, a knowledge-driven meta-learning a… ▽ More

    Submitted 25 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2301.13475

  23. arXiv:2309.09423  [pdf, other

    cs.RO eess.SY

    Two Degree of Freedom Adaptive Control for Hysteresis Compensation of Pneumatic Continuum Bending Actuator

    Authors: Junyi Shen, Tetsuro Miyazaki, Shingo Ohno, Maina Sogabe, Kenji Kawashima

    Abstract: Soft robotics, with their inherent flexibility and infinite degrees of freedom (DoF), offer promising advancements in human-machine interfaces. Particularly, pneumatic artificial muscles (PAMs) and pneumatic bending actuators have been fundamental in driving this evolution, capitalizing on their mimetic nature to natural muscle movements. However, with the versatility of these actuators comes the… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE Conference on Robotics and Automation (ICRA 2024), Under Review

  24. arXiv:2308.13849  [pdf, other

    cs.LG cs.AI eess.SY

    Effectively Heterogeneous Federated Learning: A Pairing and Split Learning Based Approach

    Authors: Jinglong Shen, Xiucheng Wang, Nan Cheng, Longfei Ma, Conghao Zhou, Yuan Zhang

    Abstract: As a promising paradigm federated Learning (FL) is widely used in privacy-preserving machine learning, which allows distributed devices to collaboratively train a model while avoiding data transmission among clients. Despite its immense potential, the FL suffers from bottlenecks in training speed due to client heterogeneity, leading to escalated training latency and straggling server aggregation.… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

  25. Trajectory Tracking Control of Dual-PAM Soft Actuator with Hysteresis Compensator

    Authors: Junyi Shen, Tetsuro Miyazaki, Shingo Ohno, Maina Sogabe, Kenji Kawashima

    Abstract: Soft robotics is a swiftly evolving field. Pneumatic actuators are suitable for driving soft robots because of their superior performance. However, their control is challenging due to the hysteresis characteristics. In response to this challenge, we propose an adaptive control method to compensate for the hysteresis of soft actuators. Employing a novel dual pneumatic artificial muscle (PAM) bendin… ▽ More

    Submitted 18 November, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: This paper has been published in the IEEE Robotics and Automation Letters ,DOI 10.1109/LRA.2023.3334098, copyright has been transfferd to the IEEE. Final version is available at IEEE Xplore

  26. arXiv:2308.04605  [pdf, other

    eess.IV cs.CV cs.GR cs.LG

    PSRFlow: Probabilistic Super Resolution with Flow-Based Models for Scientific Data

    Authors: Jingyi Shen, Han-Wei Shen

    Abstract: Although many deep-learning-based super-resolution approaches have been proposed in recent years, because no ground truth is available in the inference stage, few can quantify the errors and uncertainties of the super-resolved results. For scientific visualization applications, however, conveying uncertainties of the results to scientists is crucial to avoid generating misleading or incorrect info… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: To be published in Proc. IEEE VIS 2023

  27. arXiv:2307.02002  [pdf, other

    eess.SY

    Interpretable and Secure Trajectory Optimization for UAV-Assisted Communication

    Authors: Yunhao Quan, Nan Cheng, Xiucheng Wang, Jinglong Shen, Longfei Ma, Zhisheng Yin

    Abstract: Unmanned aerial vehicles (UAVs) have gained popularity due to their flexible mobility, on-demand deployment, and the ability to establish high probability line-of-sight wireless communication. As a result, UAVs have been extensively used as aerial base stations (ABSs) to supplement ground-based cellular networks for various applications. However, existing UAV-assisted communication schemes mainly… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  28. arXiv:2306.04970  [pdf, other

    cs.RO eess.SY

    Motion Planning for Aerial Pick-and-Place based on Geometric Feasibility Constraints

    Authors: Huazi Cao, Jiahao Shen, Cunjia Liu, Bo Zhu, Shiyu Zhao

    Abstract: This paper studies the motion planning problem of the pick-and-place of an aerial manipulator that consists of a quadcopter flying base and a Delta arm. We propose a novel partially decoupled motion planning framework to solve this problem. Compared to the state-of-the-art approaches, the proposed one has two novel features. First, it does not suffer from increased computation in high-dimensional… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  29. arXiv:2305.10009  [pdf, other

    eess.SP

    A Modular and High-Resolution Time-Frequency Post-Processing Technique

    Authors: Jinshun Shen, Deyun Wei

    Abstract: In this letter, based on the variational model, we propose a novel time-frequency post-processing technique to approximate the ideal time-frequency representation. Our method has the advantage of modularity, enabling "plug and play", independent of the performance of specific time-frequency analysis tool. Therefore, it can be easily generalized to the fractional Fourier domain and the linear canon… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  30. arXiv:2303.15161  [pdf, other

    cs.SD eess.AS

    Data Augmentation for Environmental Sound Classification Using Diffusion Probabilistic Model with Top-k Selection Discriminator

    Authors: Yunhao Chen, Yunjie Zhu, Zihui Yan, Jianlu Shen, Zhen Ren, Yifan Huang

    Abstract: Despite consistent advancement in powerful deep learning techniques in recent years, large amounts of training data are still necessary for the models to avoid overfitting. Synthetic datasets using generative adversarial networks (GAN) have recently been generated to overcome this problem. Nevertheless, despite advancements, GAN-based methods are usually hard to train or fail to generate high-qual… ▽ More

    Submitted 4 April, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  31. arXiv:2303.12693  [pdf, other

    eess.SY cs.AI

    Resilient Output Containment Control of Heterogeneous Multiagent Systems Against Composite Attacks: A Digital Twin Approach

    Authors: Yukang Cui, Lingbo Cao, Michael V. Basin, Jun Shen, Tingwen Huang, Xin Gong

    Abstract: This paper studies the distributed resilient output containment control of heterogeneous multiagent systems against composite attacks, including denial-of-services (DoS) attacks, false-data injection (FDI) attacks, camouflage attacks, and actuation attacks. Inspired by digital twins, a twin layer (TL) with higher security and privacy is used to decouple the above problem into two tasks: defense pr… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  32. arXiv:2303.08856  [pdf, other

    cs.LG eess.SY

    On the Benefits of Leveraging Structural Information in Planning Over the Learned Model

    Authors: Jiajun Shen, Kananart Kuwaranancharoen, Raid Ayoub, Pietro Mercati, Shreyas Sundaram

    Abstract: Model-based Reinforcement Learning (RL) integrates learning and planning and has received increasing attention in recent years. However, learning the model can incur a significant cost (in terms of sample complexity), due to the need to obtain a sufficient number of samples for each state-action pair. In this paper, we investigate the benefits of leveraging structural information about the system… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 9 pages, 5 figures

  33. arXiv:2301.13475  [pdf, ps, other

    eess.SP

    A Knowledge-Driven Meta-Learning Method for CSI Feedback

    Authors: Han Xiao, Wenqiang Tian, Wendong Liu, Zhi Zhang, Zhihua Shi, Li Guo, Jia Shen

    Abstract: Accurate and effective channel state information (CSI) feedback is a key technology for massive multiple-input and multiple-output (MIMO) systems. Recently, deep learning (DL) has been introduced to enhance CSI feedback in massive MIMO application, where the massive collected training data and lengthy training time are costly and impractical for realistic deployment. In this paper, a knowledge-dri… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  34. arXiv:2301.02243  [pdf, other

    cs.LG eess.SP stat.AP

    Machine Fault Classification using Hamiltonian Neural Networks

    Authors: Jeremy Shen, Jawad Chowdhury, Sourav Banerjee, Gabriel Terejanu

    Abstract: A new approach is introduced to classify faults in rotating machinery based on the total energy signature estimated from sensor measurements. The overall goal is to go beyond using black-box models and incorporate additional physical constraints that govern the behavior of mechanical systems. Observational data is used to train Hamiltonian neural networks that describe the conserved energy of the… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: ICPRAM 2023

  35. arXiv:2211.02940  [pdf, other

    cs.SD cs.AI eess.AS

    Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block

    Authors: Yunhao Chen, Yunjie Zhu, Zihui Yan, Yifan Huang, Zhen Ren, Jianlu Shen, Lifang Chen

    Abstract: Recently, massive architectures based on Convolutional Neural Network (CNN) and self-attention mechanisms have become necessary for audio classification. While these techniques are state-of-the-art, these works' effectiveness can only be guaranteed with huge computational costs and parameters, large amounts of data augmentation, transfer from large datasets and some other tricks. By utilizing the… ▽ More

    Submitted 30 May, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

  36. arXiv:2210.03402  [pdf

    eess.SY cs.LG nlin.AO

    Research on Self-adaptive Online Vehicle Velocity Prediction Strategy Considering Traffic Information Fusion

    Authors: Ziyan Zhang, Junhao Shen, Dongwei Yao, Feng Wu

    Abstract: In order to increase the prediction accuracy of the online vehicle velocity prediction (VVP) strategy, a self-adaptive velocity prediction algorithm fused with traffic information was presented for the multiple scenarios. Initially, traffic scenarios were established inside the co-simulation environment. In addition, the algorithm of a general regressive neural network (GRNN) paired with datasets… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures

  37. arXiv:2209.05482  [pdf, ps, other

    eess.SY

    Improved Fuzzy $H_{\infty}$ Filter Design Method for Nonlinear Systems with Time-Varing Delay

    Authors: Qianqian Ma, Li Li, Junhui Shen, Haowei Guan, Guangcheng Ma, Hongwei Xia

    Abstract: This paper investigates the fuzzy $H_{\infty}$ filter design issue for nonlinear systems with time-varying delay. In order to obtain less conservative fuzzy $H_{\infty}$ filter design method, a novel integral inequality is employed to replace the conventional Lebniz-Newton formula to analyze the stability conditions of the filtering error system. Besides, the information of the membership function… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: This paper was published in 2017 IEEE SMC. arXiv admin note: text overlap with arXiv:2209.04989. text overlap with arXiv:2209.04989

  38. arXiv:2208.13183  [pdf, other

    cs.SD eess.AS

    Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks

    Authors: Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang, Yonghui Wu, Rob Clark

    Abstract: Transfer tasks in text-to-speech (TTS) synthesis - where one or more aspects of the speech of one set of speakers is transferred to another set of speakers that do not feature these aspects originally - remains a challenging task. One of the challenges is that models that have high-quality transfer capabilities can have issues in stability, making them impractical for user-facing critical tasks. T… ▽ More

    Submitted 28 August, 2022; originally announced August 2022.

    Comments: To be published in Interspeech 2022

  39. arXiv:2206.07949  [pdf, other

    eess.SP

    AI Enlightens Wireless Communication: A Transformer Backbone for CSI Feedback

    Authors: Han Xiao, Zhiqin Wang, Dexin Li, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi Jin, Jia Shen, Zhi Zhang, Ning Yang

    Abstract: This paper is based on the background of the 2nd Wireless Communication Artificial Intelligence (AI) Competition (WAIC) which is hosted by IMT-2020(5G) Promotion Group 5G+AIWork Group, where the framework of the eigenvector-based channel state information (CSI) feedback problem is firstly provided. Then a basic Transformer backbone for CSI feedback referred to EVCsiNet-T is proposed. Moreover, a s… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  40. arXiv:2205.08391  [pdf, other

    cs.ET eess.SY

    A High-Voltage Characterisation Platform For Emerging Resistive Switching Technologies

    Authors: Jiawei Shen, Andrea Mifsud, Lijie Xie, Abdulaziz Alshaya, Christos Papavassiliou

    Abstract: Emerging memristor-based array architectures have been effectively employed in non-volatile memories and neuromorphic computing systems due to their density, scalability and capability of storing information. Nonetheless, to demonstrate a practical on-chip memristor-based system, it is essential to have the ability to apply large programming voltage ranges during the characterisation procedures fo… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 5 pages. To be published in ISCAS 2022 and made available on IEEEXplore

  41. arXiv:2205.08381  [pdf, other

    cs.ET eess.SY

    A Wide Dynamic Range Read-out System For Resistive Switching Technology

    Authors: Lijie Xie, Jiawei Shen, Andrea Mifsud, Chaohan Wang, Abdulaziz Alshaya, Christos Papavassiliou

    Abstract: The memristor, because of its controllability over a wide dynamic range of resistance, has emerged as a promising device for data storage and analog computation. A major challenge is the accurate measurement of memristance over a wide dynamic range. In this paper, a novel read-out circuit with feedback adjustment is proposed to measure and digitise input current in the range between 20nA and 2mA.… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 5 pages, To be published in ISCAS 2022 and made available on IEEE Xplore

  42. arXiv:2205.08379  [pdf, other

    cs.ET eess.SY

    A CMOS-based Characterisation Platform for Emerging RRAM Technologies

    Authors: Andrea Mifsud, Jiawei Shen, Peilong Feng, Lijie Xie, Chaohan Wang, Yihan Pan, Sachin Maheshwari, Shady Agwa, Spyros Stathopoulos, Shiwei Wang, Alexander Serb, Christos Papavassiliou, Themis Prodromakis, Timothy G. Constandinou

    Abstract: Mass characterisation of emerging memory devices is an essential step in modelling their behaviour for integration within a standard design flow for existing integrated circuit designers. This work develops a novel characterisation platform for emerging resistive devices with a capacity of up to 1 million devices on-chip. Split into four independent sub-arrays, it contains on-chip column-parallel… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 5 pages. To be published in ISCAS 2022 and made available on IEEE Xplore

  43. arXiv:2203.04042  [pdf, other

    eess.IV cs.CV

    Abandoning the Bayer-Filter to See in the Dark

    Authors: Xingbo Dong, Wanyan Xu, Zhihui Miao, Lan Ma, Chao Zhang, Jiewen Yang, Zhe Jin, Andrew Beng Jin Teoh, Jiajun Shen

    Abstract: Low-light image enhancement - a pervasive but challenging problem, plays a central role in enhancing the visibility of an image captured in a poor illumination environment. Due to the fact that not all photons can pass the Bayer-Filter on the sensor of the color camera, in this work, we first present a De-Bayer-Filter simulator based on deep neural networks to generate a monochrome raw image from… ▽ More

    Submitted 22 March, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

  44. arXiv:2201.01449  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal Metaplasia

    Authors: Jon Braatz, Pranav Rajpurkar, Stephanie Zhang, Andrew Y. Ng, Jeanne Shen

    Abstract: In recent years, deep learning has successfully been applied to automate a wide variety of tasks in diagnostic histopathology. However, fast and reliable localization of small-scale regions-of-interest (ROI) has remained a key challenge, as discriminative morphologic features often occupy only a small fraction of a gigapixel-scale whole-slide image (WSI). In this paper, we propose a sparse WSI ana… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

  45. arXiv:2112.10107  [pdf, other

    cs.AI cs.LG eess.SP

    Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control

    Authors: Liang Zhang, Qiang Wu, Jun Shen, Linyuan Lü, Bo Du, Jianqing Wu

    Abstract: Many studies confirmed that a proper traffic state representation is more important than complex algorithms for the classical traffic signal control (TSC) problem. In this paper, we (1) present a novel, flexible and efficient method, namely advanced max pressure (Advanced-MP), taking both running and queuing vehicles into consideration to decide whether to change current signal phase; (2) inventiv… ▽ More

    Submitted 9 August, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: 10 pages, 5 figures

    ACM Class: J.4; J.6

  46. arXiv:2107.04174  [pdf, other

    cs.SD cs.CV cs.LG eess.AS eess.SP

    EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments

    Authors: Jacob Donley, Vladimir Tourbabin, Jung-Suk Lee, Mark Broyles, Hao Jiang, Jie Shen, Maja Pantic, Vamsi Krishna Ithapu, Ravish Mehra

    Abstract: Augmented Reality (AR) as a platform has the potential to facilitate the reduction of the cocktail party effect. Future AR headsets could potentially leverage information from an array of sensors spanning many different modalities. Training and testing signal processing and machine learning algorithms on tasks such as beam-forming and speech enhancement require high quality representative data. To… ▽ More

    Submitted 18 October, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

    Comments: Dataset is available at: https://github.com/facebookresearch/EasyComDataset

  47. arXiv:2106.06759  [pdf, ps, other

    eess.SP

    AI Enlightens Wireless Communication: Analyses, Solutions and Opportunities on CSI Feedback

    Authors: Han Xiao, Zhiqin Wang, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi Jin, Jia Shen, Zhi Zhang, Ning Yang

    Abstract: In this paper, we give a systematic description of the 1st Wireless Communication Artificial Intelligence (AI) Competition (WAIC) which is hosted by IMT-2020(5G) Promotion Group 5G+AI Work Group. Firstly, the framework of full channel state information (F-CSI) feedback problem and its corresponding channel dataset are provided. Then the enhancing schemes for DL-based F-CSI feedback including i) ch… ▽ More

    Submitted 14 June, 2021; v1 submitted 12 June, 2021; originally announced June 2021.

  48. arXiv:2105.07146  [pdf, other

    eess.IV cs.CV

    GCN-MIF: Graph Convolutional Network with Multi-Information Fusion for Low-dose CT Denoising

    Authors: Kecheng Chen, Jiayu Sun, Jiang Shen, Jixiang Luo, Xinyu Zhang, Xuelin Pan, Dongsheng Wu, Yue Zhao, Miguel Bento, Yazhou Ren, Xiaorong Pu

    Abstract: Being low-level radiation exposure and less harmful to health, low-dose computed tomography (LDCT) has been widely adopted in the early screening of lung cancer and COVID-19. LDCT images inevitably suffer from the degradation problem caused by complex noises. It was reported that deep learning (DL)-based LDCT denoising methods using convolutional neural network (CNN) achieved impressive denoising… ▽ More

    Submitted 16 April, 2022; v1 submitted 15 May, 2021; originally announced May 2021.

    Comments: Submitted to TMI with under review

  49. arXiv:2103.15060  [pdf, other

    cs.CL cs.SD eess.AS

    PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS

    Authors: Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang, Yonghui Wu

    Abstract: This paper introduces PnG BERT, a new encoder model for neural TTS. This model is augmented from the original BERT model, by taking both phoneme and grapheme representations of text as input, as well as the word-level alignment between them. It can be pre-trained on a large text corpus in a self-supervised manner, and fine-tuned in a TTS task. Experimental results show that a neural TTS model usin… ▽ More

    Submitted 7 June, 2021; v1 submitted 28 March, 2021; originally announced March 2021.

    Comments: Accepted to Interspeech 2021

  50. arXiv:2103.14574  [pdf, other

    cs.SD eess.AS

    Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

    Authors: Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, RJ Skerry-Ryan, Yonghui Wu

    Abstract: This paper introduces Parallel Tacotron 2, a non-autoregressive neural text-to-speech model with a fully differentiable duration model which does not require supervised duration signals. The duration model is based on a novel attention mechanism and an iterative reconstruction loss based on Soft Dynamic Time Warping, this model can learn token-frame alignments as well as token durations automatica… ▽ More

    Submitted 29 August, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Submitted to INTERSPEECH 2021