Skip to main content

Showing 1–50 of 79 results for author: Zhu, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.21928  [pdf

    eess.IV cs.AI cs.CV cs.LG

    Subspecialty-Specific Foundation Model for Intelligent Gastrointestinal Pathology

    Authors: Lianghui Zhu, Xitong Ling, Minxi Ouyang, Xiaoping Liu, Tian Guan, Mingxi Fu, Zhiqiang Cheng, Fanglei Fu, Maomao Zeng, Liming Liu, Song Duan, Qiang Huang, Ying Xiao, Jianming Li, Shanming Lu, Zhenghua Piao, Mingxi Zhu, Yibo Jin, Shan Xu, Qiming He, Yizhi Wang, Junru Cheng, Xuanyu Wang, Luxi Xie, Houqiang Li , et al. (2 additional authors not shown)

    Abstract: Gastrointestinal (GI) diseases represent a clinically significant burden, necessitating precise diagnostic approaches to optimize patient outcomes. Conventional histopathological diagnosis suffers from limited reproducibility and diagnostic variability. To overcome these limitations, we develop Digepath, a specialized foundation model for GI pathology. Our framework introduces a dual-phase iterati… ▽ More

    Submitted 6 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2505.15519  [pdf, ps, other

    eess.SP

    Exploiting Age of Information in Network Digital Twins for AI-driven Real-Time Link Blockage Detection

    Authors: Michele Zhu, Francesco Linsalata, Silvia Mura, Lorenzo Cazzella, Damiano Badini, Umberto Spagnolini

    Abstract: The Line-of-Sight (LoS) identification is crucial to ensure reliable high-frequency communication links, especially those vulnerable to blockages. Network Digital Twins and Artificial Intelligence are key technologies enabling blockage detection (LoS identification) for high-frequency wireless systems, e.g., 6>GHz. In this work, we enhance Network Digital Twins by incorporating Age of Information… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  3. arXiv:2505.15478  [pdf, ps, other

    eess.SP

    AI-empowered Real-Time Line-of-Sight Identification via Network Digital Twins

    Authors: Michele Zhu, Silvia Mura, Francesco Linsalata, Lorenzo Cazzella, Damiano Badini, Umberto Spagnolini

    Abstract: The identification of Line-of-Sight (LoS) conditions is critical for ensuring reliable high-frequency communication links, which are particularly vulnerable to blockages and rapid channel variations. Network Digital Twins (NDTs) and Ray-Tracing (RT) techniques can significantly automate the large-scale collection and labeling of channel data, tailored to specific wireless environments. This paper… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  4. arXiv:2505.06241  [pdf, other

    eess.SP cs.AI cs.LG

    Low-Complexity CNN-Based Classification of Electroneurographic Signals

    Authors: Arek Berc Gokdag, Silvia Mura, Antonio Coviello, Michele Zhu, Maurizio Magarini, Umberto Spagnolini

    Abstract: Peripheral nerve interfaces (PNIs) facilitate neural recording and stimulation for treating nerve injuries, but real-time classification of electroneurographic (ENG) signals remains challenging due to constraints on complexity and latency, particularly in implantable devices. This study introduces MobilESCAPE-Net, a lightweight architecture that reduces computational cost while maintaining and sli… ▽ More

    Submitted 27 April, 2025; originally announced May 2025.

  5. arXiv:2504.10923  [pdf, other

    cs.LG eess.SP

    Fast-Powerformer: A Memory-Efficient Transformer for Accurate Mid-Term Wind Power Forecasting

    Authors: Mingyi Zhu, Zhaoxin Li, Qiao Lin, Li Ding

    Abstract: Wind power forecasting (WPF), as a significant research topic within renewable energy, plays a crucial role in enhancing the security, stability, and economic operation of power grids. However, due to the high stochasticity of meteorological factors (e.g., wind speed) and significant fluctuations in wind power output, mid-term wind power forecasting faces a dual challenge of maintaining high accur… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: Mingyi Zhu is the first author. Li Ding is the corresponding author

  6. arXiv:2504.10526  [pdf, other

    eess.IV cs.CV

    PathSeqSAM: Sequential Modeling for Pathology Image Segmentation with SAM2

    Authors: Mingyang Zhu, Yinting Liu, Mingyu Li, Jiacheng Wang

    Abstract: Current methods for pathology image segmentation typically treat 2D slices independently, ignoring valuable cross-slice information. We present PathSeqSAM, a novel approach that treats 2D pathology slices as sequential video frames using SAM2's memory mechanisms. Our method introduces a distance-aware attention mechanism that accounts for variable physical distances between slices and employs LoRA… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  7. arXiv:2504.05640  [pdf, other

    eess.IV cs.CV

    CTI-Unet: Cascaded Threshold Integration for Improved U-Net Segmentation of Pathology Images

    Authors: Mingyang Zhu, Yuqiu Liang, Jiacheng Wang

    Abstract: Chronic kidney disease (CKD) is a growing global health concern, necessitating precise and efficient image analysis to aid diagnosis and treatment planning. Automated segmentation of kidney pathology images plays a central role in facilitating clinical workflows, yet conventional segmentation models often require delicate threshold tuning. This paper proposes a novel \textit{Cascaded Threshold-Int… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  8. arXiv:2504.02184  [pdf, other

    cs.RO eess.SY

    Model Predictive Control with Visibility Graphs for Humanoid Path Planning and Tracking Against Adversarial Opponents

    Authors: Ruochen Hou, Gabriel I. Fernandez, Mingzhang Zhu, Dennis W. Hong

    Abstract: In this paper we detail the methods used for obstacle avoidance, path planning, and trajectory tracking that helped us win the adult-sized, autonomous humanoid soccer league in RoboCup 2024. Our team was undefeated for all seated matches and scored 45 goals over 6 games, winning the championship game 6 to 1. During the competition, a major challenge for collision avoidance was the measurement nois… ▽ More

    Submitted 29 April, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

    Comments: This is a preprint version. This paper has been accepted to IEEE International Conference on Robotics and Automation (ICRA) 2025. The final published version will be available on IEEE Xplore

  9. arXiv:2502.10296  [pdf, other

    eess.IV

    SegX: Improving Interpretability of Clinical Image Diagnosis with Segmentation-based Enhancement

    Authors: Yuhao Zhang, Mingcheng Zhu, Zhiyao Luo

    Abstract: Deep learning-based medical image analysis faces a significant barrier due to the lack of interpretability. Conventional explainable AI (XAI) techniques, such as Grad-CAM and SHAP, often highlight regions outside clinical interests. To address this issue, we propose Segmentation-based Explanation (SegX), a plug-and-play approach that enhances interpretability by aligning the model's explanation ma… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  10. arXiv:2501.04973  [pdf, other

    eess.SP

    Infinite Factorial Linear Dynamical Systems for Transient Signal Detection

    Authors: Jiadi Bao, Yatong Wang, Yunjie Li, Mengtao Zhu, Shafei Wang

    Abstract: Accurately detecting the transient signal of interest from the background signal is one of the fundamental tasks in signal processing. The most recent approaches assume the existence of a single background source and represent the background signal using a linear dynamical system (LDS). This assumption might fail to capture the complexities of modern electromagnetic environments with multiple sour… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: 13 pages, 9 figures, submitting to IEEE transactions on Signal Processing

  11. arXiv:2412.00162  [pdf, other

    cs.RO cs.LG eess.SY

    Dynamic High-Order Control Barrier Functions with Diffuser for Safety-Critical Trajectory Planning at Signal-Free Intersections

    Authors: Di Chen, Ruiguo Zhong, Kehua Chen, Zhiwei Shang, Meixin Zhu, Edward Chung

    Abstract: Planning safe and efficient trajectories through signal-free intersections presents significant challenges for autonomous vehicles (AVs), particularly in dynamic, multi-task environments with unpredictable interactions and an increased possibility of conflicts. This study aims to address these challenges by developing a unified, robust, adaptive framework to ensure safety and efficiency across thr… ▽ More

    Submitted 31 March, 2025; v1 submitted 29 November, 2024; originally announced December 2024.

    Comments: 11 figures, 5 tables, 15 pages

  12. arXiv:2410.10832  [pdf

    cs.RO eess.IV

    Non-Interrupting Rail Track Geometry Measurement System Using UAV and LiDAR

    Authors: Lihao Qiu, Ming Zhu, JeeWoong Park, Yingtao Jiang, Hualiang, Teng

    Abstract: The safety of train operations is largely dependent on the health of rail tracks, necessitating regular and meticulous inspection and maintenance. A significant part of such inspections involves geometric measurements of the tracks to detect any potential problems. Traditional methods for track geometry measurements, while proven to be accurate, require track closures during inspections, and consu… ▽ More

    Submitted 25 October, 2024; v1 submitted 28 September, 2024; originally announced October 2024.

  13. arXiv:2410.01087  [pdf

    eess.SY eess.SP

    Development of a Platform to Enable Real Time, Non-disruptive Testing and Early Fault Detection of Critical High Voltage Transformers and Switchgears in High Speed-rail

    Authors: Jiawei Fan, Ming Zhu, Yingtao Jiang, Hualiang Teng

    Abstract: Partial discharge (PD) incidents can occur in critical components of high-speed rail electric systems, such as transformers and switchgears, due to localized insulation defects that cannot withstand electric stress, leading to potential flashovers. These incidents can escalate over time, resulting in breakdowns, downtime, and safety risks. Fortunately, PD activities emit radio frequency (RF) signa… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  14. arXiv:2409.15816  [pdf, other

    eess.SY

    Diffusion Models for Intelligent Transportation Systems: A Survey

    Authors: Mingxing Peng, Kehua Chen, Xusen Guo, Qiming Zhang, Hui Zhong, Meixin Zhu, Hai Yang

    Abstract: Intelligent Transportation Systems (ITS) are vital in modern traffic management and optimization, significantly enhancing traffic efficiency and safety. Recently, diffusion models have emerged as transformative tools for addressing complex challenges within ITS. In this paper, we present a comprehensive survey of diffusion models for ITS, covering both theoretical and practical aspects. First, we… ▽ More

    Submitted 8 May, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 7 figures

  15. arXiv:2409.07902  [pdf, other

    eess.SP cs.IT cs.LG

    Conformal Distributed Remote Inference in Sensor Networks Under Reliability and Communication Constraints

    Authors: Meiyi Zhu, Matteo Zecchin, Sangwoo Park, Caili Guo, Chunyan Feng, Petar Popovski, Osvaldo Simeone

    Abstract: This paper presents communication-constrained distributed conformal risk control (CD-CRC) framework, a novel decision-making framework for sensor networks under communication constraints. Targeting multi-label classification problems, such as segmentation, CD-CRC dynamically adjusts local and global thresholds used to identify significant labels with the goal of ensuring a target false negative ra… ▽ More

    Submitted 24 February, 2025; v1 submitted 12 September, 2024; originally announced September 2024.

    Comments: 15 pages, 24 figures

  16. arXiv:2408.01553  [pdf, other

    cs.CV eess.IV

    Multi-task SAR Image Processing via GAN-based Unsupervised Manipulation

    Authors: Xuran Hu, Mingzhe Zhu, Ziqiang Xu, Zhenpeng Feng, Ljubisa Stankovic

    Abstract: Generative Adversarial Networks (GANs) have shown tremendous potential in synthesizing a large number of realistic SAR images by learning patterns in the data distribution. Some GANs can achieve image editing by introducing latent codes, demonstrating significant promise in SAR image processing. Compared to traditional SAR image processing methods, editing based on GAN latent space control is enti… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 19 pages, 17 figures, 7 tables

  17. arXiv:2408.00982  [pdf, other

    physics.optics eess.SP

    Adaptive optical signal-to-noise ratio recovery for long-distance optical fiber transmission

    Authors: Mingwen Zhu, Shangsu Ding, Zhixue Li, Song Yu, Jianming Shang, Bin Luo

    Abstract: In long-distance fiber optic transmission, the optic fiber link and erbium-doped fiber amplifiers can introduce excessive noise, which reduces the optical signal-to-noise ratio (OSNR). The narrow-band optical filters can be used to eliminate noise and thereby improve OSNR. However, there is a relative frequency drift between the signal and the narrow-band filter, which leads to filtered signal ins… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  18. arXiv:2407.00579  [pdf, ps, other

    cs.IT eess.SP

    Active-RIS-Aided Covert Communications in NOMA-Inspired ISAC Wireless Systems

    Authors: Miaomiao Zhu, Pengxu Chen, Liang Yang, Alexandros-Apostolos A. Boulogeorgos, Theodoros A. Tsiftsis, Hongwu Liu

    Abstract: Non-orthogonal multiple access (NOMA)-inspired integrated sensing and communication (ISAC) facilitates spectrum sharing for radar sensing and NOMA communications, whereas facing privacy and security challenges due to open wireless propagation. In this paper, active reconfigurable intelligent surface (RIS) is employed to aid covert communications in NOMA-inspired ISAC wireless system with the aim o… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  19. Toward Real-Time Digital Twins of EM Environments: Computational Benchmark of Ray Launching Software

    Authors: Michele Zhu, Lorenzo Cazzella, Francesco Linsalata, Maurizio Magarini, Matteo Matteucci, Umberto Spagnolini

    Abstract: Digital Twin has emerged as a promising paradigm for accurately representing wireless communication electromagnetic environments. The resulting virtual representation of reality facilitates comprehensive insights into the propagation environment, empowering multi-layer decision-making processes at the physical communication level. This paper investigates the impact of ray-based model simulation wi… ▽ More

    Submitted 2 October, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: This work as been published in IEEE Open Journal of Communication Society, we strongly advice to refer to the official version from IEEE. Source code and reference scenarios are available in https://github.com/Michele-Zhu/ray-launching-benchmark

  20. arXiv:2406.00416  [pdf, other

    stat.ML cs.LG eess.SP

    Representation and De-interleaving of Mixtures of Hidden Markov Processes

    Authors: Jiadi Bao, Mengtao Zhu, Yunjie Li, Shafei Wang

    Abstract: De-interleaving of the mixtures of Hidden Markov Processes (HMPs) generally depends on its representation model. Existing representation models consider Markov chain mixtures rather than hidden Markov, resulting in the lack of robustness to non-ideal situations such as observation noise or missing observations. Besides, de-interleaving methods utilize a search-based strategy, which is time-consumi… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 13 pages, 9 figures, submitted to IEEE transactions on Signal Processing

  21. arXiv:2404.16152  [pdf, ps, other

    cs.IT eess.SP

    Rethinking Grant-Free Protocol in mMTC

    Authors: Minhao Zhu, Yifei Sun, Lizhao You, Zhaorui Wang, Ya-Feng Liu, Shuguang Cui

    Abstract: This paper revisits the identity detection problem under the current grant-free protocol in massive machine-type communications (mMTC) by asking the following question: for stable identity detection performance, is it enough to permit active devices to transmit preambles without any handshaking with the base station (BS)? Specifically, in the current grant-free protocol, the BS blindly allocates a… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE for possible publication

  22. arXiv:2403.13245  [pdf, other

    eess.SY cs.AI cs.DC cs.LG cs.RO

    Federated reinforcement learning for robot motion planning with zero-shot generalization

    Authors: Zhenyuan Yuan, Siyuan Xu, Minghui Zhu

    Abstract: This paper considers the problem of learning a control policy for robot motion planning with zero-shot generalization, i.e., no data collection and policy adaptation is needed when the learned policy is deployed in new environments. We develop a federated reinforcement learning framework that enables collaborative learning of multiple learners and a central server, i.e., the Cloud, without sharing… ▽ More

    Submitted 7 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  23. arXiv:2402.10686  [pdf, ps, other

    cs.IT cs.CR cs.LG eess.SP

    On the Impact of Uncertainty and Calibration on Likelihood-Ratio Membership Inference Attacks

    Authors: Meiyi Zhu, Caili Guo, Chunyan Feng, Osvaldo Simeone

    Abstract: In a membership inference attack (MIA), an attacker exploits the overconfidence exhibited by typical machine learning models to determine whether a specific data point was used to train a target model. In this paper, we analyze the performance of the likelihood ratio attack (LiRA) within an information-theoretical framework that allows the investigation of the impact of the aleatoric uncertainty i… ▽ More

    Submitted 8 June, 2025; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 16 pages, 28 figures

  24. arXiv:2402.03397  [pdf

    q-bio.QM eess.IV

    A Comprehensive Approach to Diagnosing Temporomandibular Joint Diseases: AI-driven TMD Diagnostic System

    Authors: Y. Gua, C. T. Kong, D. D Zhangc, Y. J Baid, J. K. H. Tsoia, Hua Huangc, Y. Q. Dengc, Y. M Zhue

    Abstract: AI-driven TMD diagnostic system uses AI segmentation method to diagnose Temporomandibular Joint Disorders (TMD). By using segmentation, three important parts: temporal bone, temporomandibular joint (TMJ) disc and the condyle can be identified. The location and the size of each segment are used as the basic information to determine if the patient has a high chance of having Temporomandibular Joint… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  25. arXiv:2401.03122  [pdf, other

    cs.CV eess.IV

    SAR Despeckling via Regional Denoising Diffusion Probabilistic Model

    Authors: Xuran Hu, Ziqiang Xu, Zhihan Chen, Zhengpeng Feng, Mingzhe Zhu, LJubisa Stankovic

    Abstract: Speckle noise poses a significant challenge in maintaining the quality of synthetic aperture radar (SAR) images, so SAR despeckling techniques have drawn increasing attention. Despite the tremendous advancements of deep learning in fixed-scale SAR image despeckling, these methods still struggle to deal with large-scale SAR images. To address this problem, this paper introduces a novel despeckling… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 5 pages, 5 figures

    ACM Class: I.4.4

  26. arXiv:2401.02883  [pdf, other

    cs.RO eess.SY

    iPolicy: Incremental Policy Algorithms for Feedback Motion Planning

    Authors: Guoxiang Zhao, Devesh K. Jha, Yebin Wang, Minghui Zhu

    Abstract: This paper presents policy-based motion planning for robotic systems. The motion planning literature has been mostly focused on open-loop trajectory planning which is followed by tracking online. In contrast, we solve the problem of path planning and controller synthesis simultaneously by solving the related feedback control problem. We present a novel incremental policy (iPolicy) algorithm for mo… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  27. arXiv:2312.13023  [pdf, other

    eess.SP

    Class Information Guided Reconstruction for Automatic Modulation Open-Set Recognition

    Authors: Ziwei Zhang, Mengtao Zhu, Jiabin Liu, Yunjie Li, Shafei Wang

    Abstract: Automatic Modulation Recognition (AMR) is a crucial technology in the domains of radar and communications. Traditional AMR approaches assume a closed-set scenario, where unknown samples are forcibly misclassified into known classes, leading to serious consequences for situation awareness and threat assessment. To address this issue, Automatic Modulation Open-set Recognition (AMOSR) defines two tas… ▽ More

    Submitted 14 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 14 pages, 11 figures

  28. arXiv:2312.10343  [pdf, other

    eess.SP cs.AR cs.LG cs.NE

    In-Sensor Radio Frequency Computing for Energy-Efficient Intelligent Radar

    Authors: Yang Sui, Minning Zhu, Lingyi Huang, Chung-Tse Michael Wu, Bo Yuan

    Abstract: Radio Frequency Neural Networks (RFNNs) have demonstrated advantages in realizing intelligent applications across various domains. However, as the model size of deep neural networks rapidly increases, implementing large-scale RFNN in practice requires an extensive number of RF interferometers and consumes a substantial amount of energy. To address this challenge, we propose to utilize low-rank dec… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

  29. arXiv:2311.11969  [pdf, other

    eess.IV cs.CV

    SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks

    Authors: Jin Ye, Junlong Cheng, Jianpin Chen, Zhongying Deng, Tianbin Li, Haoyu Wang, Yanzhou Su, Ziyan Huang, Jilong Chen, Lei Jiang, Hui Sun, Min Zhu, Shaoting Zhang, Junjun He, Yu Qiao

    Abstract: Segment Anything Model (SAM) has achieved impressive results for natural image segmentation with input prompts such as points and bounding boxes. Its success largely owes to massive labeled training data. However, directly applying SAM to medical image segmentation cannot perform well because SAM lacks medical knowledge -- it does not use medical images for training. To incorporate medical knowled… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  30. Contrastive Self-Supervised Learning for Spatio-Temporal Analysis of Lung Ultrasound Videos

    Authors: Li Chen, Jonathan Rubin, Jiahong Ouyang, Naveen Balaraju, Shubham Patil, Courosh Mehanian, Sourabh Kulhare, Rachel Millin, Kenton W Gregory, Cynthia R Gregory, Meihua Zhu, David O Kessler, Laurie Malia, Almaz Dessie, Joni Rabiner, Di Coneybeare, Bo Shopsin, Andrew Hersh, Cristian Madar, Jeffrey Shupp, Laura S Johnson, Jacob Avila, Kristin Dwyer, Peter Weimersheimer, Balasundar Raju , et al. (2 additional authors not shown)

    Abstract: Self-supervised learning (SSL) methods have shown promise for medical imaging applications by learning meaningful visual representations, even when the amount of labeled data is limited. Here, we extend state-of-the-art contrastive learning SSL methods to 2D+time medical ultrasound video data by introducing a modified encoder and augmentation method capable of learning meaningful spatio-temporal r… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: ISBI 2023, 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI)

  31. arXiv:2310.08080  [pdf

    eess.IV cs.CV

    RT-SRTS: Angle-Agnostic Real-Time Simultaneous 3D Reconstruction and Tumor Segmentation from Single X-Ray Projection

    Authors: Miao Zhu, Qiming Fu, Bo Liu, Mengxi Zhang, Bojian Li, Xiaoyan Luo, Fugen Zhou

    Abstract: Radiotherapy is one of the primary treatment methods for tumors, but the organ movement caused by respiration limits its accuracy. Recently, 3D imaging from a single X-ray projection has received extensive attention as a promising approach to address this issue. However, current methods can only reconstruct 3D images without directly locating the tumor and are only validated for fixed-angle imagin… ▽ More

    Submitted 28 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  32. arXiv:2308.06874  [pdf, ps, other

    eess.SY

    Joint Data Collection and Sensor Positioning in Multi-UAV-Assisted Wireless Sensor Network

    Authors: Mingyue Zhu, Zhiqing Wei, Chen Qiu, Wangjun Jiang, Huici Wu, Zhiying Feng

    Abstract: Due to the high mobility and easy deployment, unmanned aerial vehicles (UAVs) have attracted much attention in the field of wireless communication and positioning. To meet the challenges of lack of infrastructure coverage, uncertain sensor position and large amount of sensing data collection in wireless sensor network (WSN), this paper presents an efficient joint data collection and sensor positio… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

  33. arXiv:2308.04463  [pdf, other

    eess.IV

    Weakly Semi-Supervised Detection in Lung Ultrasound Videos

    Authors: Jiahong Ouyang, Li Chen, Gary Y. Li, Naveen Balaraju, Shubham Patil, Courosh Mehanian, Sourabh Kulhare, Rachel Millin, Kenton W. Gregory, Cynthia R. Gregory, Meihua Zhu, David O. Kessler, Laurie Malia, Almaz Dessie, Joni Rabiner, Di Coneybeare, Bo Shopsin, Andrew Hersh, Cristian Madar, Jeffrey Shupp, Laura S. Johnson, Jacob Avila, Kristin Dwyer, Peter Weimersheimer, Balasundar Raju , et al. (2 additional authors not shown)

    Abstract: Frame-by-frame annotation of bounding boxes by clinical experts is often required to train fully supervised object detection models on medical video data. We propose a method for improving object detection in medical videos through weak supervision from video-level labels. More concretely, we aggregate individual detection predictions into video-level predictions and extend a teacher-student train… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: IPMI 2023

  34. arXiv:2307.02953  [pdf, other

    eess.IV cs.CV cs.LG

    SegNetr: Rethinking the local-global interactions and skip connections in U-shaped networks

    Authors: Junlong Cheng, Chengrui Gao, Fengjie Wang, Min Zhu

    Abstract: Recently, U-shaped networks have dominated the field of medical image segmentation due to their simple and easily tuned structure. However, existing U-shaped segmentation networks: 1) mostly focus on designing complex self-attention modules to compensate for the lack of long-term dependence based on convolution operation, which increases the overall number of parameters and computational complexit… ▽ More

    Submitted 21 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

  35. arXiv:2302.14752  [pdf, other

    cs.RO eess.SY

    Multi-Robot-Guided Crowd Evacuation: Two-Scale Modeling and Control

    Authors: Tongjia Zheng, Zhenyuan Yuan, Mollik Nayyar, Alan R. Wagner, Minghui Zhu, Hai Lin

    Abstract: Emergency evacuation describes a complex situation involving time-critical decision-making by evacuees. Mobile robots are being actively explored as a potential solution to provide timely guidance. In this work, we study a robot-guided crowd evacuation problem where a small group of robots is used to guide a large human crowd to safe locations. The challenge lies in how to use micro-level human-ro… ▽ More

    Submitted 11 January, 2024; v1 submitted 28 February, 2023; originally announced February 2023.

  36. arXiv:2302.04407  [pdf, other

    eess.SP

    Bayesian Non-parametric Hidden Markov Model for Agile Radar Pulse Sequences Streaming Analysis

    Authors: Jiadi Bao, Yunjie Li, Mengtao Zhu, Shafei Wang

    Abstract: Multi-function radars (MFRs) are sophisticated types of sensors with the capabilities of complex agile inter-pulse modulation implementation and dynamic work mode scheduling. The developments in MFRs pose great challenges to modern electronic reconnaissance systems or radar warning receivers for recognition and inference of MFR work modes. To address this issue, this paper proposes an online proce… ▽ More

    Submitted 22 August, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: 15 pages, 10 figures, submitted to IEEE transactions on signal processing

  37. arXiv:2301.01448  [pdf, other

    eess.IV cs.CV

    A deep local attention network for pre-operative lymph node metastasis prediction in pancreatic cancer via multiphase CT imaging

    Authors: Zhilin Zheng, Xu Fang, Jiawen Yao, Mengmeng Zhu, Le Lu, Lingyun Huang, Jing Xiao, Yu Shi, Hong Lu, Jianping Lu, Ling Zhang, Chengwei Shao, Yun Bian

    Abstract: Lymph node (LN) metastasis status is one of the most critical prognostic and cancer staging factors for patients with resectable pancreatic ductal adenocarcinoma (PDAC), or in general, for any types of solid malignant tumors. Preoperative prediction of LN metastasis from non-invasive CT imaging is highly desired, as it might be straightforwardly used to guide the following neoadjuvant treatment de… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: 14 pages,5 figures

  38. arXiv:2301.01036  [pdf, other

    cs.CV eess.IV

    High-Quality Real-Time Rendering Using Subpixel Sampling Reconstruction

    Authors: Boyu Zhang, Hongliang Yuan, Mingyan Zhu, Ligang Liu, Jue Wang

    Abstract: Generating high-quality, realistic rendering images for real-time applications generally requires tracing a few samples-per-pixel (spp) and using deep learning-based approaches to denoise the resulting low-spp images. Existing denoising methods have yet to achieve real-time performance at high resolutions due to the physically-based sampling and network inference time costs. In this paper, we prop… ▽ More

    Submitted 25 June, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

  39. Information Bottleneck-Inspired Type Based Multiple Access for Remote Estimation in IoT Systems

    Authors: Meiyi Zhu, Chunyan Feng, Caili Guo, Nan Jiang, Osvaldo Simeone

    Abstract: Type-based multiple access (TBMA) is a semantics-aware multiple access protocol for remote inference. In TBMA, codewords are reused across transmitting sensors, with each codeword being assigned to a different observation value. Existing TBMA protocols are based on fixed shared codebooks and on conventional maximum-likelihood or Bayesian decoders, which require knowledge of the distributions of ob… ▽ More

    Submitted 5 April, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 5 pages, 3 figures, accepted by IEEE Signal Processing Letters (SPL)

  40. arXiv:2211.09332  [pdf

    eess.SY cs.RO

    iNavFIter-M: Matrix Formulation of Functional Iteration for Inertial Navigation Computation

    Authors: Hongyan Jiang, Maoran Zhu, Yanyan Fu, Yuanxin Wu

    Abstract: The acquisition of attitude, velocity, and position is an essential task in the field of inertial navigation, achieved by integrating the measurements from inertial sensors. Recently, the ultra-precision inertial navigation computation has been tackled by the functional iteration approach (iNavFIter) that drives the non-commutativity errors almost to the computer truncation error level. This paper… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 30 pages, 7 figures

  41. arXiv:2209.12586  [pdf, other

    eess.SY

    Learning Critical Scenarios in Feedback Control Systems for Automated Driving

    Authors: Mengjia Zhu, Alberto Bemporad, Maximilian Kneissl, Hasan Esen

    Abstract: Testing is essential for verifying and validating control designs, especially in safety-critical applications. In particular, the control system governing an automated driving vehicle must be proven reliable enough for its acceptance on the market. Recently, much research has focused on scenario-based methods. However, the number of possible driving scenarios to test is in principle infinite. In t… ▽ More

    Submitted 8 September, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

  42. Enhanced Effective Aperture Distribution Function for Characterizing Large-Scale Antenna Arrays

    Authors: Xuesong Cai, Meifang Zhu, Aleksei Fedorov, Fredrik Tufvesson

    Abstract: Accurate characterization of large-scale antenna arrays is growing in importance and complexity for the fifth-generation (5G) and beyond systems, as they feature more antenna elements and require increased overall performance. The full 3D patterns of all antenna elements in the array need to be characterized because they are in general different due to construction inaccuracy, coupling, antenna ar… ▽ More

    Submitted 7 June, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 10 pages. To appear in IEEE Transactions on Antennas and Propagation

  43. arXiv:2209.09795  [pdf, other

    cs.RO eess.SY

    Multi-Robot-Assisted Human Crowd Evacuation using Navigation Velocity Fields

    Authors: Tongjia Zheng, Zhenyuan Yuan, Mollik Nayyar, Alan R. Wagner, Minghui Zhu, Hai Lin

    Abstract: This work studies a robot-assisted crowd evacuation problem where we control a small group of robots to guide a large human crowd to safe locations. The challenge lies in how to model human-robot interactions and design robot controls to indirectly control a human population that significantly outnumbers the robots. To address the challenge, we treat the crowd as a continuum and formulate the evac… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  44. arXiv:2207.07824  [pdf, other

    eess.SY

    Distributed Safe Learning and Planning for Multi-robot Systems

    Authors: Zhenyuan Yuan, Minghui Zhu

    Abstract: This paper considers the problem of online multi-robot motion planning with general nonlinear dynamics subject to unknown external disturbances. We propose dSLAP, a distributed safe learning and planning framework that allows the robots to safely navigate through the environments by coupling online learning and motion planning. Gaussian process regression is used to online learn the disturbances w… ▽ More

    Submitted 25 May, 2025; v1 submitted 15 July, 2022; originally announced July 2022.

  45. arXiv:2206.12281  [pdf

    eess.SP

    Real-time Dual-channel 2 * 2 MIMO Fiber-THz-Fiber Seamless Integration System at 385 GHz and 435 GHz

    Authors: Jiao Zhang, Min Zhu, Bingchang Hua, Mingzheng Lei, Yuancheng Cai, Liang Tian, Yucong Zou, Like Ma, Yongming Huang, Jianjun Yu, Xiaohu You

    Abstract: We demonstrate the first practical real-time dual-channel fiber-THz-fiber 2 * 2 MIMO seamless integration system with a record net data rate of 2 * 103.125 Gb/s at 385 GHz and 435 GHz over two spans of 20 km SSMF and 3 m wireless link.

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: This paper has been accepted by ECOC 2022

  46. arXiv:2205.13294  [pdf, other

    cs.CV eess.IV eess.SP

    Analytical Interpretation of Latent Codes in InfoGAN with SAR Images

    Authors: Zhenpeng Feng, Milos Dakovic, Hongbing Ji, Mingzhe Zhu, Ljubisa Stankovic

    Abstract: Generative Adversarial Networks (GANs) can synthesize abundant photo-realistic synthetic aperture radar (SAR) images. Some recent GANs (e.g., InfoGAN), are even able to edit specific properties of the synthesized images by introducing latent codes. It is crucial for SAR image synthesis since the targets in real SAR images are with different properties due to the imaging mechanism. Despite the succ… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: 13 pages, 14 figures

  47. arXiv:2205.01805  [pdf, other

    cs.CV cs.LG eess.IV

    Splicing Detection and Localization In Satellite Imagery Using Conditional GANs

    Authors: Emily R. Bartusiak, Sri Kalyan Yarlagadda, David Güera, Paolo Bestagini, Stefano Tubaro, Fengqing M. Zhu, Edward J. Delp

    Abstract: The widespread availability of image editing tools and improvements in image processing techniques allow image manipulation to be very easy. Oftentimes, easy-to-use yet sophisticated image manipulation tools yields distortions/changes imperceptible to the human observer. Distribution of forged images can have drastic ramifications, especially when coupled with the speed and vastness of the Interne… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: Accepted to the 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)

    Journal ref: IEEE Conference on Multimedia Information Processing and Retrieval, pp. 91-96, March 2019, San Jose, CA

  48. arXiv:2202.08935  [pdf, other

    cs.RO eess.SY

    A Formal Safety Characterization of Advanced Driver Assist Systems in the Car-Following Regime with Scenario-Sampling

    Authors: Bowen Weng, Minghao Zhu, Keith Redmill

    Abstract: The capability to follow a lead-vehicle and avoid rear-end collisions is one of the most important functionalities for human drivers and various Advanced Driver Assist Systems (ADAS). Existing safety performance justification of the car-following systems either relies on simple concrete scenarios with biased surrogate metrics or requires a significantly long driving distance for risk observation a… ▽ More

    Submitted 23 May, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

  49. arXiv:2111.07450  [pdf, other

    eess.SP

    Beamspace Multidimensional ESPRIT Approaches for Simultaneous Localization and Communications

    Authors: Fan Jiang, Fuxi Wen, Yu Ge, Meifang Zhu, Henk Wymeersch, Fredrik Tufvesson

    Abstract: Modern wireless communication systems operating at high carrier frequencies are characterized by a high dimensionality of the underlying parameter space (including channel gains, angles, delays, and possibly Doppler shifts). Estimating these parameters is valuable for communication purposes, but also for localization and sensing, making channel estimation a critical component in any joint communic… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

    Comments: 17 pages, 7 figures

  50. arXiv:2111.05315  [pdf

    q-bio.QM cs.CV eess.IV physics.bio-ph

    Stain-free Detection of Embryo Polarization using Deep Learning

    Authors: Cheng Shen, Adiyant Lamba, Meng Zhu, Ray Zhang, Changhuei Yang, Magdalena Zernicka Goetz

    Abstract: Polarization of the mammalian embryo at the right developmental time is critical for its development to term and would be valuable in assessing the potential of human embryos. However, tracking polarization requires invasive fluorescence staining, impermissible in the in vitro fertilization clinic. Here, we report the use of artificial intelligence to detect polarization from unstained time-lapse… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.