Skip to main content

Showing 1–50 of 55 results for author: Hu, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.12887  [pdf, ps, other

    eess.IV cs.CV

    RetinaLogos: Fine-Grained Synthesis of High-Resolution Retinal Images Through Captions

    Authors: Junzhi Ning, Cheng Tang, Kaijin Zhou, Diping Song, Lihao Liu, Ming Hu, Wei Li, Yanzhou Su, Tianbing Li, Jiyao Liu, Yejin, Sheng Zhang, Yuanfeng Ji, Junjun He

    Abstract: The scarcity of high-quality, labelled retinal imaging data, which presents a significant challenge in the development of machine learning models for ophthalmology, hinders progress in the field. To synthesise Colour Fundus Photographs (CFPs), existing methods primarily relying on predefined disease labels face significant limitations. However, current methods remain limited, thus failing to gener… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2505.07449  [pdf, other

    eess.IV cs.CV

    Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model

    Authors: Wei Li, Ming Hu, Guoan Wang, Lihao Liu, Kaijin Zhou, Junzhi Ning, Xin Guo, Zongyuan Ge, Lixu Gu, Junjun He

    Abstract: In ophthalmic surgery, developing an AI system capable of interpreting surgical videos and predicting subsequent operations requires numerous ophthalmic surgical videos with high-quality annotations, which are difficult to collect due to privacy concerns and labor consumption. Text-guided video generation (T2V) emerges as a promising solution to overcome this issue by generating ophthalmic surgica… ▽ More

    Submitted 16 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

    Comments: Early accepted in MICCAI25

  3. arXiv:2504.13131  [pdf, other

    eess.IV cs.AI cs.CV

    NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results

    Authors: Xin Li, Kun Yuan, Bingchen Li, Fengbin Guan, Yizhen Shao, Zihao Yu, Xijun Wang, Yiting Lu, Wei Luo, Suhang Yao, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Yabin Zhang, Ao-Xiang Zhang, Tianwu Zhi, Jianzhao Liu, Yang Li, Jingwen Xu, Yiting Liao, Yushen Zuo, Mingyang Wu, Renjie Li, Shengyun Zhong , et al. (88 additional authors not shown)

    Abstract: This paper presents a review for the NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement. The challenge comprises two tracks: (i) Efficient Video Quality Assessment (KVQ), and (ii) Diffusion-based Image Super-Resolution (KwaiSR). Track 1 aims to advance the development of lightweight and efficient video quality assessment (VQA) models, with an emphasis on eliminating re… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of NTIRE 2025; Methods from 18 Teams; Accepted by CVPR Workshop; 21 pages

  4. arXiv:2503.14304  [pdf, other

    eess.IV cs.CV

    RoMedFormer: A Rotary-Embedding Transformer Foundation Model for 3D Genito-Pelvic Structure Segmentation in MRI and CT

    Authors: Yuheng Li, Mingzhe Hu, Richard L. J. Qiu, Maria Thor, Andre Williams, Deborah Marshall, Xiaofeng Yang

    Abstract: Deep learning-based segmentation of genito-pelvic structures in MRI and CT is crucial for applications such as radiation therapy, surgical planning, and disease diagnosis. However, existing segmentation models often struggle with generalizability across imaging modalities, and anatomical variations. In this work, we propose RoMedFormer, a rotary-embedding transformer-based foundation model designe… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  5. arXiv:2503.13560  [pdf, other

    eess.IV cs.CV

    MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset

    Authors: Zhaodong Wu, Qiaochu Zhao, Ming Hu, Yulong Li, Haochen Xue, Kang Dang, Zhengyong Jiang, Angelos Stefanidis, Qiufeng Wang, Imran Razzak, Zongyuan Ge, Junjun He, Yu Qiao, Zhong Zheng, Feilong Tang, Jionglong Su

    Abstract: With the significantly increasing incidence and prevalence of abdominal diseases, there is a need to embrace greater use of new innovations and technology for the diagnosis and treatment of patients. Although deep-learning methods have notably been developed to assist radiologists in diagnosing abdominal diseases, existing models have the restricted ability to segment common lesions in the abdomen… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  6. arXiv:2412.09998  [pdf, other

    eess.IV cs.AI cs.CV

    Self-Consistent Nested Diffusion Bridge for Accelerated MRI Reconstruction

    Authors: Tao Song, Yicheng Wu, Minhao Hu, Xiangde Luo, Guoting Luo, Guotai Wang, Yi Guo, Feng Xu, Shaoting Zhang

    Abstract: Accelerated MRI reconstruction plays a vital role in reducing scan time while preserving image quality. While most existing methods rely on complex-valued image-space or k-space data, these formats are often inaccessible in clinical practice due to proprietary reconstruction pipelines, leaving only magnitude images stored in DICOM files. To address this gap, we focus on the underexplored task of m… ▽ More

    Submitted 27 April, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

  7. arXiv:2412.00715  [pdf, other

    eess.IV cs.CV

    A Semi-Supervised Approach with Error Reflection for Echocardiography Segmentation

    Authors: Xiaoxiang Han, Yiman Liu, Jiang Shang, Qingli Li, Jiangang Chen, Menghan Hu, Qi Zhang, Yuqi Zhang, Yan Wang

    Abstract: Segmenting internal structure from echocardiography is essential for the diagnosis and treatment of various heart diseases. Semi-supervised learning shows its ability in alleviating annotations scarcity. While existing semi-supervised methods have been successful in image segmentation across various medical imaging modalities, few have attempted to design methods specifically addressing the challe… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

    Comments: 6 pages, 4 figure, accepted by 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2024)

  8. arXiv:2411.14684  [pdf, other

    eess.IV cs.AI cs.CV

    Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis

    Authors: Tao Song, Yicheng Wu, Minhao Hu, Xiangde Luo, Linda Wei, Guotai Wang, Yi Guo, Feng Xu, Shaoting Zhang

    Abstract: Multimodal MR image synthesis aims to generate missing modality images by effectively fusing and mapping from a subset of available MRI modalities. Most existing methods adopt an image-to-image translation paradigm, treating multiple modalities as input channels. However, these approaches often yield sub-optimal results due to the inherent difficulty in achieving precise feature- or semantic-level… ▽ More

    Submitted 28 April, 2025; v1 submitted 21 November, 2024; originally announced November 2024.

  9. arXiv:2411.09201  [pdf, other

    cs.AR eess.SP

    Noncontact Multi-Point Vital Sign Monitoring with mmWave MIMO Radar

    Authors: Wei Ren, Jiannong Cao, Huansheng Yi, Kaiyue Hou, Miaoyang Hu, Jianqi Wang, Fugui Qi

    Abstract: Multi-point vital sign monitoring is essential for providing detailed insights into physiological changes. Traditional single-sensor approaches are inadequate for capturing multi-point vibrations. Existing contact-based solutions, while addressing this need, can cause discomfort and skin allergies, whereas noncontact optical and acoustic methods are highly susceptible to light interference and env… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

    Comments: 15 pages

    MSC Class: 94C30 ACM Class: C.3.4

  10. arXiv:2411.07069  [pdf, other

    eess.SY

    Two-Stage Stochastic Optimization for Low-Carbon Dispatch in a Combined Energy System

    Authors: Manling Hu, Manqi Xu, Dunnan Liu

    Abstract: While wind and solar power contribute to sustainability, their intermittent nature poses challenges when integrated into the grid. To mitigate these issues, renewable energy can be combined with coal fired power and hydropower sources to stabilize the energy system, with battery storage serving as a backup source to smooth the total output. This study develops a low carbon dispatch model for a com… ▽ More

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: 5 pages, 5 figures, accepted for publication in The 8th IEEE Conference on Energy Internet and Energy System Integration

  11. arXiv:2410.14882  [pdf

    cs.AR eess.SP

    Multi-diseases detection with memristive system on chip

    Authors: Zihan Wang, Daniel W. Yang, Zerui Liu, Evan Yan, Heming Sun, Ning Ge, Miao Hu, Wei Wu

    Abstract: This study presents the first implementation of multilayer neural networks on a memristor/CMOS integrated system on chip (SoC) to simultaneously detect multiple diseases. To overcome limitations in medical data, generative AI techniques are used to enhance the dataset, improving the classifier's robustness and diversity. The system achieves notable performance with low latency, high accuracy (91.8… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 14 pages, 5 figures

    ACM Class: C.1.3; I.2.0

  12. arXiv:2406.15160  [pdf, other

    eess.AS eess.SP

    Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios

    Authors: Ya Jiang, Qing Wang, Jun Du, Maocheng Hu, Pengfei Hu, Zeyan Liu, Shi Cheng, Zhaoxu Nian, Yuxuan Dong, Mingqi Cai, Xin Fang, Chin-Hui Lee

    Abstract: This study presents an audio-visual information fusion approach to sound event localization and detection (SELD) in low-resource scenarios. We aim at utilizing audio and video modality information through cross-modal learning and multi-modal fusion. First, we propose a cross-modal teacher-student learning (TSL) framework to transfer information from an audio-only teacher model, trained on a rich c… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: accepted by icme2024

  13. arXiv:2406.11519  [pdf, other

    cs.CV eess.IV

    HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model

    Authors: Di Wang, Meiqi Hu, Yao Jin, Yuchun Miao, Jiaqi Yang, Yichu Xu, Xiaolei Qin, Jiaqi Ma, Lingyu Sun, Chenxing Li, Chuan Fu, Hongruixuan Chen, Chengxi Han, Naoto Yokoya, Jing Zhang, Minqiang Xu, Lin Liu, Lefei Zhang, Chen Wu, Bo Du, Dacheng Tao, Liangpei Zhang

    Abstract: Accurate hyperspectral image (HSI) interpretation is critical for providing valuable insights into various earth observation-related applications such as urban planning, precision agriculture, and environmental monitoring. However, existing HSI processing methods are predominantly task-specific and scene-dependent, which severely limits their ability to transfer knowledge across tasks and scenes,… ▽ More

    Submitted 1 April, 2025; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE TPAMI. Project website: https://whu-sigma.github.io/HyperSIGMA

  14. arXiv:2405.11289  [pdf, other

    eess.IV cs.CV

    Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification

    Authors: Ming Hu, Siyuan Yan, Peng Xia, Feilong Tang, Wenxue Li, Peibo Duan, Lin Zhang, Zongyuan Ge

    Abstract: Deep learning-based diagnostic systems have demonstrated potential in skin disease diagnosis. However, their performance can easily degrade on test domains due to distribution shifts caused by input-level corruptions, such as imaging equipment variability, brightness changes, and image blur. This will reduce the reliability of model deployment in real-world scenarios. Most existing solutions focus… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  15. arXiv:2404.15946  [pdf

    cs.CV cs.AI eess.IV

    Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP) for Enhanced Breast Cancer Diagnosis with Multi-view Mammography

    Authors: Xuxin Chen, Yuheng Li, Mingzhe Hu, Ella Salari, Xiaoqian Chen, Richard L. J. Qiu, Bin Zheng, Xiaofeng Yang

    Abstract: Although fusion of information from multiple views of mammograms plays an important role to increase accuracy of breast cancer detection, developing multi-view mammograms-based computer-aided diagnosis (CAD) schemes still faces challenges and no such CAD schemes have been used in clinical practice. To overcome the challenges, we investigate a new approach based on Contrastive Language-Image Pre-tr… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  16. Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery

    Authors: Chengxi Han, Chen Wu, Haonan Guo, Meiqi Hu, Jiepan Li, Hongruixuan Chen

    Abstract: The rapid advancement of automated artificial intelligence algorithms and remote sensing instruments has benefited change detection (CD) tasks. However, there is still a lot of space to study for precise detection, especially the edge integrity and internal holes phenomenon of change features. In order to solve these problems, we design the Change Guiding Network (CGNet), to tackle the insufficien… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  17. arXiv:2404.01024  [pdf, other

    cs.CV eess.IV

    AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images

    Authors: Liu Yang, Huiyu Duan, Long Teng, Yucheng Zhu, Xiaohong Liu, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

    Abstract: In recent years, the rapid advancement of Artificial Intelligence Generated Content (AIGC) has attracted widespread attention. Among the AIGC, AI generated omnidirectional images hold significant potential for Virtual Reality (VR) and Augmented Reality (AR) applications, hence omnidirectional AIGC techniques have also been widely studied. AI-generated omnidirectional images exhibit unique distorti… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  18. arXiv:2310.17661  [pdf, other

    eess.SP cs.NI

    An Overview on IEEE 802.11bf: WLAN Sensing

    Authors: Rui Du, Haocheng Hua, Hailiang Xie, Xianxin Song, Zhonghao Lyu, Mengshi Hu, Narengerile, Yan Xin, Stephen McCann, Michael Montemurro, Tony Xiao Han, Jie Xu

    Abstract: With recent advancements, the wireless local area network (WLAN) or wireless fidelity (Wi-Fi) technology has been successfully utilized to realize sensing functionalities such as detection, localization, and recognition. However, the WLANs standards are developed mainly for the purpose of communication, and thus may not be able to meet the stringent requirements for emerging sensing applications.… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 31 pages, 25 figures, this is a significant updated version of arXiv:2207.04859

  19. arXiv:2310.00912  [pdf

    cs.AR eess.SP

    A Resource-efficient FIR Filter Design Based on an RAG Improved Algorithm

    Authors: Mengwei Hu, Zhengxiong Li, Xianyang Jiang

    Abstract: In modern digital filter chip design, efficient resource utilization is a hot topic. Due to the linear phase characteristics of FIR filters, a pulsed fully parallel structure can be applied to address the problem. To further reduce hardware resource consumption, especially related to multiplication functions, an improved RAG algorithm has been proposed. Filters with different orders and for differ… ▽ More

    Submitted 23 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 4 pages, 3 figures, Conference paper for ICCS (International Conference on Circuits and Systems) 2023

  20. arXiv:2309.08906  [pdf, ps, other

    cs.ET eess.SP

    Scalable Multiuser Immersive Communications with Multi-numerology and Mini-slot

    Authors: Ming Hu, Jiazhi Peng, Lifeng Wang, Kai-Kit Wong

    Abstract: This paper studies multiuser immersive communications networks in which different user equipment may demand various extended reality (XR) services. In such heterogeneous networks, time-frequency resource allocation needs to be more adaptive since XR services are usually multi-modal and latency-sensitive. To this end, we develop a scalable time-frequency resource allocation method based on multi-nu… ▽ More

    Submitted 28 November, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

  21. arXiv:2309.04730  [pdf, other

    eess.SP cs.DC eess.SY

    Integrated Robotics Networks with Co-optimization of Drone Placement and Air-Ground Communications

    Authors: Menghao Hu, Tong Zhang, Shuai Wang, Guoliang Li, Yingyang Chen, Qiang Li, Gaojie Chen

    Abstract: Terrestrial robots, i.e., unmanned ground vehicles (UGVs), and aerial robots, i.e., unmanned aerial vehicles (UAVs), operate in separate spaces. To exploit their complementary features (e.g., fields of views, communication links, computing capabilities), a promising paradigm termed integrated robotics network emerges, which provides communications for cooperative UAVs-UGVs applications. However, h… ▽ More

    Submitted 3 December, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted by VTC2023-Fall, 5 pages, 4 figures

  22. arXiv:2306.06669  [pdf, other

    eess.IV cs.CV cs.LG

    TransMRSR: Transformer-based Self-Distilled Generative Prior for Brain MRI Super-Resolution

    Authors: Shan Huang, Xiaohong Liu, Tao Tan, Menghan Hu, Xiaoer Wei, Tingli Chen, Bin Sheng

    Abstract: Magnetic resonance images (MRI) acquired with low through-plane resolution compromise time and cost. The poor resolution in one orientation is insufficient to meet the requirement of high resolution for early diagnosis of brain disease and morphometric study. The common Single image super-resolution (SISR) solutions face two main challenges: (1) local detailed and global anatomical structural info… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: 2023 CGI

  23. arXiv:2305.12447  [pdf

    eess.IV cs.CV

    BreastSAM: A Study of Segment Anything Model for Breast Tumor Detection in Ultrasound Images

    Authors: Mingzhe Hu, Yuheng Li, Xiaofeng Yang

    Abstract: Breast cancer is one of the most common cancers among women worldwide, with early detection significantly increasing survival rates. Ultrasound imaging is a critical diagnostic tool that aids in early detection by providing real-time imaging of the breast tissue. We conducted a thorough investigation of the Segment Anything Model (SAM) for the task of interactive segmentation of breast tumors in u… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

  24. arXiv:2305.00293  [pdf

    eess.IV cs.CV

    Polyp-SAM: Transfer SAM for Polyp Segmentation

    Authors: Yuheng Li, Mingzhe Hu, Xiaofeng Yang

    Abstract: Colon polyps are considered important precursors for colorectal cancer. Automatic segmentation of colon polyps can significantly reduce the misdiagnosis of colon cancer and improve physician annotation efficiency. While many methods have been proposed for polyp segmentation, training large-scale segmentation networks with limited colonoscopy data remains a challenge. Recently, the Segment Anything… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  25. Empirical Exploration of Zone-by-zone Energy Flexibility: a Non-intrusive Load Disaggregation Approach for Commercial Buildings

    Authors: Maomao Hu, Ram Rajagopal, Jacques A. de Chalendar

    Abstract: Building energy flexibility has been increasingly demonstrated as a cost-effective solution to respond to the needs of energy networks, including electric grids and district cooling and heating systems, improving the integration of intermittent renewable energy sources. Adjusting zonal temperature set-points is one of the most promising measures to unlock the energy flexibility potential of centra… ▽ More

    Submitted 14 July, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 33 pages, 18 figures

    Journal ref: Energy and Buildings. Volume 296. 2023. 113339

  26. arXiv:2304.08687  [pdf

    cs.CV eess.IV

    GlobalMind: Global Multi-head Interactive Self-attention Network for Hyperspectral Change Detection

    Authors: Meiqi Hu, Chen Wu, Liangpei Zhang

    Abstract: High spectral resolution imagery of the Earth's surface enables users to monitor changes over time in fine-grained scale, playing an increasingly important role in agriculture, defense, and emergency response. However, most current algorithms are still confined to describing local features and fail to incorporate a global perspective, which limits their ability to capture interactions between glob… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 14 page, 18 figures

  27. arXiv:2303.13753  [pdf

    cs.CV eess.IV

    EMS-Net: Efficient Multi-Temporal Self-Attention For Hyperspectral Change Detection

    Authors: Meiqi Hu, Chen Wu, Bo Du

    Abstract: Hyperspectral change detection plays an essential role of monitoring the dynamic urban development and detecting precise fine object evolution and alteration. In this paper, we have proposed an original Efficient Multi-temporal Self-attention Network (EMS-Net) for hyperspectral change detection. The designed EMS module cuts redundancy of those similar and containing-no-changes feature maps, comput… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 4 pages, 5 figures, submitted to IGARSS2023

  28. EDMAE: An Efficient Decoupled Masked Autoencoder for Standard View Identification in Pediatric Echocardiography

    Authors: Yiman Liu, Xiaoxiang Han, Tongtong Liang, Bin Dong, Jiajun Yuan, Menghan Hu, Qiaohong Liu, Jiangang Chen, Qingli Li, Yuqi Zhang

    Abstract: This paper introduces the Efficient Decoupled Masked Autoencoder (EDMAE), a novel self-supervised method for recognizing standard views in pediatric echocardiography. EDMAE introduces a new proxy task based on the encoder-decoder structure. The EDMAE encoder is composed of a teacher and a student encoder. The teacher encoder extracts the potential representation of the masked image blocks, while t… ▽ More

    Submitted 3 August, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 15 pages, 5 figures, 8 tables, Published in Biomedical Signal Processing and Control

    Journal ref: Biomedical Signal Processing and Control 86 (2023) 105280

  29. arXiv:2302.08549  [pdf, other

    eess.AS cs.SD

    Speaker Change Detection for Transformer Transducer ASR

    Authors: Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li

    Abstract: Speaker change detection (SCD) is an important feature that improves the readability of the recognized words from an automatic speech recognition (ASR) system by breaking the word sequence into paragraphs at speaker change points. Existing SCD solutions either require additional ensemble for the time based decisions and recognized word sequences, or implement a tight integration between ASR and SC… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 5 pages, 1 figure, accepted by ICASSP 2023

  30. Automated Movement Detection with Dirichlet Process Mixture Models and Electromyography

    Authors: Navin Cooray, Zhenglin Li, Jinzhuo Wang, Christine Lo, Mahnaz Arvaneh, Mkael Symmonds, Michele Hu, Maarten De Vos, Lyudmila S Mihaylova

    Abstract: Numerous sleep disorders are characterised by movement during sleep, these include rapid-eye movement sleep behaviour disorder (RBD) and periodic limb movement disorder. The process of diagnosing movement related sleep disorders requires laborious and time-consuming visual analysis of sleep recordings. This process involves sleep clinicians visually inspecting electromyogram (EMG) signals to ident… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Journal ref: 2022 25th International Conference on Information Fusion (FUSION), Linköping, Sweden, 2022, pp. 01-08

  31. arXiv:2211.11557  [pdf

    eess.IV cs.CV cs.LG

    Decomposing 3D Neuroimaging into 2+1D Processing for Schizophrenia Recognition

    Authors: Mengjiao Hu, Xudong Jiang, Kang Sim, Juan Helen Zhou, Cuntai Guan

    Abstract: Deep learning has been successfully applied to recognizing both natural images and medical images. However, there remains a gap in recognizing 3D neuroimaging data, especially for psychiatric diseases such as schizophrenia and depression that have no visible alteration in specific slices. In this study, we propose to process the 3D data by a 2+1D framework so that we can exploit the powerful deep… ▽ More

    Submitted 21 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

  32. arXiv:2211.11271  [pdf, other

    eess.SP

    Energy Efficiency Optimization of Intelligent Reflective Surface-assisted Terahertz-RSMA System

    Authors: Xiaoyu Chen, Feng Yan, Menghan Hu, Zihuai Lin

    Abstract: This paper examines the energy efficiency optimization problem of intelligent reflective surface (IRS)-assisted multi-user rate division multiple access (RSMA) downlink systems under terahertz propagation. The objective function for energy efficiency is optimized using the salp swarm algorithm (SSA) and compared with the successive convex approximation (SCA) technique. SCA technique requires multi… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  33. arXiv:2207.04859  [pdf, ps, other

    cs.NI eess.SP

    An Overview on IEEE 802.11bf: WLAN Sensing

    Authors: Rui Du, Hailiang Xie, Mengshi Hu, Narengerile, Yan Xin, Stephen McCann, Michael Montemurro, Tony Xiao Han, Jie Xu

    Abstract: With recent advancements, the wireless local area network (WLAN) or wireless fidelity (Wi-Fi) technology has been successfully utilized to realize sensing functionalities such as detection, localization, and recognition. However, the WLANs standards are developed mainly for the purpose of communication, and thus may not be able to meet the stringent sensing requirements in emerging applications. T… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  34. arXiv:2207.02250  [pdf, other

    cs.CV eess.IV

    Array Camera Image Fusion using Physics-Aware Transformers

    Authors: Qian Huang, Minghao Hu, David Jones Brady

    Abstract: We demonstrate a physics-aware transformer for feature-based data fusion from cameras with diverse resolution, color spaces, focal planes, focal lengths, and exposure. We also demonstrate a scalable solution for synthetic training data generation for the transformer using open-source computer graphics software. We demonstrate image synthesis on arrays with diverse spectral responses, instantaneous… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  35. arXiv:2203.02106  [pdf, other

    eess.IV cs.CV

    Scribble-Supervised Medical Image Segmentation via Dual-Branch Network and Dynamically Mixed Pseudo Labels Supervision

    Authors: Xiangde Luo, Minhao Hu, Wenjun Liao, Shuwei Zhai, Tao Song, Guotai Wang, Shaoting Zhang

    Abstract: Medical image segmentation plays an irreplaceable role in computer-assisted diagnosis, treatment planning, and following-up. Collecting and annotating a large-scale dataset is crucial to training a powerful segmentation model, but producing high-quality segmentation masks is an expensive and time-consuming procedure. Recently, weakly-supervised learning that uses sparse annotations (points, scribb… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: 11 pages, 4 figures,code is available: https://github.com/HiLab-git/WSL4MIS.This is a comprehensive study about scribble-supervised medical image segmentation based on the ACDC dataset

  36. arXiv:2112.04894  [pdf, other

    eess.IV cs.CV

    Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer

    Authors: Xiangde Luo, Minhao Hu, Tao Song, Guotai Wang, Shaoting Zhang

    Abstract: Recently, deep learning with Convolutional Neural Networks (CNNs) and Transformers has shown encouraging results in fully supervised medical image segmentation. However, it is still challenging for them to achieve good performance with limited annotations for training. In this work, we present a very simple yet efficient framework for semi-supervised medical image segmentation by introducing the c… ▽ More

    Submitted 1 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: accepted to MIDL2022, code in SSL4MIS:https://github.com/HiLab-git/SSL4MIS

  37. arXiv:2112.04493  [pdf

    eess.IV cs.CV

    Binary Change Guided Hyperspectral Multiclass Change Detection

    Authors: Meiqi Hu, Chen Wu, Bo Du, Liangpei Zhang

    Abstract: Characterized by tremendous spectral information, hyperspectral image is able to detect subtle changes and discriminate various change classes for change detection. The recent research works dominated by hyperspectral binary change detection, however, cannot provide fine change classes information. And most methods incorporating spectral unmixing for hyperspectral multiclass change detection (HMCD… ▽ More

    Submitted 10 December, 2021; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: 14 pages,17 figures

  38. arXiv:2111.03517  [pdf, other

    eess.IV physics.optics

    Snapshot Ptychography on Array cameras

    Authors: Chengyu Wang, Minghao Hu, Yuzuru Takashima, Timothy J. Schulz, David J. Brady

    Abstract: We use convolutional neural networks to recover images optically down-sampled by $6.7\times$ using coherent aperture synthesis over a 16 camera array. Where conventional ptychography relies on scanning and oversampling, here we apply decompressive neural estimation to recover full resolution image from a single snapshot, although as shown in simulation multiple snapshots can be used to improve SNR… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

  39. arXiv:2108.09430  [pdf, ps, other

    cs.IT eess.SP

    An Attention-Aided Deep Learning Framework for Massive MIMO Channel Estimation

    Authors: Jiabao Gao, Mu Hu, Caijun Zhong, Geoffrey Ye Li, Zhaoyang Zhang

    Abstract: Channel estimation is one of the key issues in practical massive multiple-input multiple-output (MIMO) systems. Compared with conventional estimation algorithms, deep learning (DL) based ones have exhibited great potential in terms of performance and complexity. In this paper, an attention mechanism, exploiting the channel distribution characteristics, is proposed to improve the estimation accurac… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.

  40. Transportation Density Reduction Caused by City Lockdowns Across the World during the COVID-19 Epidemic: From the View of High-resolution Remote Sensing Imagery

    Authors: Chen Wu, Sihan Zhu, Jiaqi Yang, Meiqi Hu, Bo Du, Liangpei Zhang, Lefei Zhang, Chengxi Han, Meng Lan

    Abstract: As the COVID-19 epidemic began to worsen in the first months of 2020, stringent lockdown policies were implemented in numerous cities throughout the world to control human transmission and mitigate its spread. Although transportation density reduction inside the city was felt subjectively, there has thus far been no objective and quantitative study of its variation to reflect the intracity populat… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: 14 pages, 7 figures, submitted to IEEE JSTARS

  41. arXiv:2010.14119  [pdf

    eess.IV cs.CV

    Hyperspectral Anomaly Change Detection Based on Auto-encoder

    Authors: Meiqi Hu, Chen Wu, Liangpei Zhang, Bo Du

    Abstract: With the hyperspectral imaging technology, hyperspectral data provides abundant spectral information and plays a more important role in geological survey, vegetation analysis and military reconnaissance. Different from normal change detection, hyperspectral anomaly change detection (HACD) helps to find those small but important anomaly changes between multi-temporal hyperspectral images (HSI). In… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: 11 pages,9 figures,3 tables

    MSC Class: 68U10 ACM Class: I.5.4

  42. arXiv:2010.11734  [pdf, other

    cs.CV cs.MM eess.SY

    Identification of deep breath while moving forward based on multiple body regions and graph signal analysis

    Authors: Yunlu Wang, Cheng Yang, Menghan Hu, Jian Zhang, Qingli Li, Guangtao Zhai, Xiao-Ping Zhang

    Abstract: This paper presents an unobtrusive solution that can automatically identify deep breath when a person is walking past the global depth camera. Existing non-contact breath assessments achieve satisfactory results under restricted conditions when human body stays relatively still. When someone moves forward, the breath signals detected by depth camera are hidden within signals of trunk displacement… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: 5 pages, 3 figures

  43. arXiv:2010.10163  [pdf, other

    eess.IV cs.CV cs.LG

    Claw U-Net: A Unet-based Network with Deep Feature Concatenation for Scleral Blood Vessel Segmentation

    Authors: Chang Yao, Jingyu Tang, Menghan Hu, Yue Wu, Wenyi Guo, Qingli Li, Xiao-Ping Zhang

    Abstract: Sturge-Weber syndrome (SWS) is a vascular malformation disease, and it may cause blindness if the patient's condition is severe. Clinical results show that SWS can be divided into two types based on the characteristics of scleral blood vessels. Therefore, how to accurately segment scleral blood vessels has become a significant problem in computer-aided diagnosis. In this research, we propose to co… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: 5 pages,4 figures

  44. arXiv:2009.06889  [pdf

    eess.SY math.OC

    Distributed Model Predicted Control of Multi-agent Systems with Applications to Multi-vehicle Cooperation

    Authors: Yougang Bian, Changkun Du, Manjiang Hu, Haikuo Liu

    Abstract: This paper proposes a distributed model predicted control (DMPC) approach for consensus control of multi-agent systems (MASs) with linear agent dynamics and bounded control input constraints. Within the proposed DMPC framework, each agent exchanges assumed state trajectories with neighbors and solves a local open-loop optimization problem to obtain the optimal control input. In the optimization pr… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  45. arXiv:2005.09808  [pdf, other

    astro-ph.IM eess.IV eess.SP

    Bernoulli generalized likelihood ratio test for signal detection from photon counting images

    Authors: Mengya Hu, He Sun, Anthony Harness, N. Jeremy Kasdin

    Abstract: Because exoplanets are extremely dim, an Electron Multiplying Charged Coupled Device (EMCCD) operating in photon counting (PC) mode is necessary to reduce the detector noise level and enable their detection. Typically, PC images are added together as a co-added image before processing. We present here a signal detection and estimation technique that works directly with individual PC images. The me… ▽ More

    Submitted 16 March, 2021; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: 37 pages, 13 figures, Accepted by JATIS on Feb 2021

  46. arXiv:2005.04132  [pdf, other

    eess.AS cs.SD

    Asteroid: the PyTorch-based audio source separation toolkit for researchers

    Authors: Manuel Pariente, Samuele Cornell, Joris Cosentino, Sunit Sivasankaran, Efthymios Tzinis, Jens Heitkaemper, Michel Olvera, Fabian-Robert Stöter, Mathieu Hu, Juan M. Martín-Doñas, David Ditter, Ariel Frank, Antoine Deleforge, Emmanuel Vincent

    Abstract: This paper describes Asteroid, the PyTorch-based audio source separation toolkit for researchers. Inspired by the most successful neural source separation systems, it provides all neural building blocks required to build such a system. To improve reproducibility, Kaldi-style recipes on common audio source separation datasets are also provided. This paper describes the software architecture of Aste… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020

  47. arXiv:2004.06912  [pdf, other

    cs.CV eess.IV

    Combining Visible Light and Infrared Imaging for Efficient Detection of Respiratory Infections such as COVID-19 on Portable Device

    Authors: Zheng Jiang, Menghan Hu, Lei Fan, Yaling Pan, Wei Tang, Guangtao Zhai, Yong Lu

    Abstract: Coronavirus Disease 2019 (COVID-19) has become a serious global epidemic in the past few months and caused huge loss to human society worldwide. For such a large-scale epidemic, early detection and isolation of potential virus carriers is essential to curb the spread of the epidemic. Recent studies have shown that one important feature of COVID-19 is the abnormal respiratory status caused by viral… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

  48. arXiv:2004.01479  [pdf, other

    cs.CY eess.SP

    Portable Health Screening Device of Respiratory Infections

    Authors: Zheng Jiang, Menghan Hu, Guangtao Zhai

    Abstract: The COVID-19 epidemic was listed as a public health emergency of international concern by the WHO on January 30, 2020. To curb the secondary spread of the epidemic, many public places were equipped with thermal imagers to check the body temperature. However, the COVID-19 pneumonia has concealed symptoms: the first symptom may not be fever, and can be shortness of breath. During epidemic prevention… ▽ More

    Submitted 3 April, 2020; originally announced April 2020.

  49. arXiv:2003.08818  [pdf

    cs.CV cs.LG eess.IV

    Brain MRI-based 3D Convolutional Neural Networks for Classification of Schizophrenia and Controls

    Authors: Mengjiao Hu, Kang Sim, Juan Helen Zhou, Xudong Jiang, Cuntai Guan

    Abstract: Convolutional Neural Network (CNN) has been successfully applied on classification of both natural images and medical images but not yet been applied to differentiating patients with schizophrenia from healthy controls. Given the subtle, mixed, and sparsely distributed brain atrophy patterns of schizophrenia, the capability of automatic feature learning makes CNN a powerful tool for classifying sc… ▽ More

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: 4 PAGES

  50. arXiv:2002.05534  [pdf, other

    cs.LG cs.CV eess.SP

    Abnormal respiratory patterns classifier may contribute to large-scale screening of people infected with COVID-19 in an accurate and unobtrusive manner

    Authors: Yunlu Wang, Menghan Hu, Qingli Li, Xiao-Ping Zhang, Guangtao Zhai, Nan Yao

    Abstract: Research significance: The extended version of this paper has been accepted by IEEE Internet of Things journal (DOI: 10.1109/JIOT.2020.2991456), please cite the journal version. During the epidemic prevention and control period, our study can be helpful in prognosis, diagnosis and screening for the patients infected with COVID-19 (the novel coronavirus) based on breathing characteristics. Accordin… ▽ More

    Submitted 20 December, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: 6 page, 3 figure