Skip to main content

Showing 1–50 of 54 results for author: Zeng, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.05883  [pdf

    eess.IV cs.CV

    A novel framework for fully-automated co-registration of intravascular ultrasound and optical coherence tomography imaging data

    Authors: Xingwei He, Kit Mills Bransby, Ahmet Emir Ulutas, Thamil Kumaran, Nathan Angelo Lecaros Yap, Gonul Zeren, Hesong Zeng, Yaojun Zhang, Andreas Baumbach, James Moon, Anthony Mathur, Jouke Dijkstra, Qianni Zhang, Lorenz Raber, Christos V Bourantas

    Abstract: Aims: To develop a deep-learning (DL) framework that will allow fully automated longitudinal and circumferential co-registration of intravascular ultrasound (IVUS) and optical coherence tomography (OCT) images. Methods and results: Data from 230 patients (714 vessels) with acute coronary syndrome that underwent near-infrared spectroscopy (NIRS)-IVUS and OCT imaging in their non-culprit vessels wer… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

    Comments: Preprint

  2. arXiv:2506.06758  [pdf, ps, other

    eess.SP

    A Novel Spreading-Factor-Index-Aided LoRa Scheme: Design and Performance Analysis

    Authors: Hao Zeng, Huan Ma, Yi Fang, Pingping Chen, Wenkun Wen, Tierui Min

    Abstract: LoRa is a widely recognized modulation technology in the field of low power wide area networks (LPWANs). However, the data rate of LoRa is too low to satisfy the requirements in the context of modern Internet of Things (IoT) applications. To address this issue, we propose a novel high-data-rate LoRa scheme based on the spreading factor index (SFI). In the proposed SFI-LoRa scheme, the starting fre… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

  3. arXiv:2506.01394  [pdf, ps, other

    eess.IV cs.CV

    NTIRE 2025 the 2nd Restore Any Image Model (RAIM) in the Wild Challenge

    Authors: Jie Liang, Radu Timofte, Qiaosi Yi, Zhengqiang Zhang, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang

    Abstract: In this paper, we present a comprehensive overview of the NTIRE 2025 challenge on the 2nd Restore Any Image Model (RAIM) in the Wild. This challenge established a new benchmark for real-world image restoration, featuring diverse scenarios with and without reference ground truth. Participants were tasked with restoring real-captured images suffering from complex and unknown degradations, where both… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  4. arXiv:2505.01951  [pdf

    eess.IV

    UNet-3D with Adaptive TverskyCE Loss for Pancreas Medical Image Segmentation

    Authors: Xubei Zhang, Mikhail Y. Shalaginov, Tingying Helen Zeng

    Abstract: Pancreatic cancer, which has a low survival rate, is the most intractable one among all cancers. Most diagnoses of this cancer heavily depend on abdominal computed tomography (CT) scans. Therefore, pancreas segmentation is crucial but challenging. Because of the obscure position of the pancreas, surrounded by other large organs, and its small area, the pancreas has often been impeded and difficult… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

    Comments: 6 pages and 3 figures

  5. arXiv:2504.13574  [pdf, other

    cs.LG cs.CV eess.IV

    MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework

    Authors: Zhenkai Qin, Feng Zhu, Huan Zeng, Xunyi Nong

    Abstract: The demand for lightweight models in image classification tasks under resource-constrained environments necessitates a balance between computational efficiency and robust feature representation. Traditional attention mechanisms, despite their strong feature modeling capability, often struggle with high computational complexity and structural rigidity, limiting their applicability in scenarios with… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  6. arXiv:2504.13131  [pdf, other

    eess.IV cs.AI cs.CV

    NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results

    Authors: Xin Li, Kun Yuan, Bingchen Li, Fengbin Guan, Yizhen Shao, Zihao Yu, Xijun Wang, Yiting Lu, Wei Luo, Suhang Yao, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Yabin Zhang, Ao-Xiang Zhang, Tianwu Zhi, Jianzhao Liu, Yang Li, Jingwen Xu, Yiting Liao, Yushen Zuo, Mingyang Wu, Renjie Li, Shengyun Zhong , et al. (88 additional authors not shown)

    Abstract: This paper presents a review for the NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement. The challenge comprises two tracks: (i) Efficient Video Quality Assessment (KVQ), and (ii) Diffusion-based Image Super-Resolution (KwaiSR). Track 1 aims to advance the development of lightweight and efficient video quality assessment (VQA) models, with an emphasis on eliminating re… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of NTIRE 2025; Methods from 18 Teams; Accepted by CVPR Workshop; 21 pages

  7. arXiv:2503.02322  [pdf, other

    eess.IV cs.CV

    Generative Model-Assisted Demosaicing for Cross-multispectral Cameras

    Authors: Jiahui Luo, Kai Feng, Haijin Zeng, Yongyong Chen

    Abstract: As a crucial part of the spectral filter array (SFA)-based multispectral imaging process, spectral demosaicing has exploded with the proliferation of deep learning techniques. However, (1) bothering by the difficulty of capturing corresponding labels for real data or simulating the practical spectral imaging process, end-to-end networks trained in a supervised manner using simulated data often per… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  8. arXiv:2502.15174  [pdf, other

    eess.IV cs.CV

    FD-LSCIC: Frequency Decomposition-based Learned Screen Content Image Compression

    Authors: Shiqi Jiang, Hui Yuan, Shuai Li, Huanqiang Zeng, Sam Kwong

    Abstract: The learned image compression (LIC) methods have already surpassed traditional techniques in compressing natural scene (NS) images. However, directly applying these methods to screen content (SC) images, which possess distinct characteristics such as sharp edges, repetitive patterns, embedded text and graphics, yields suboptimal results. This paper addresses three key challenges in SC image compre… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  9. arXiv:2502.05330  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 Challenge

    Authors: Muhammad Imran, Jonathan R. Krebs, Vishal Balaji Sivaraman, Teng Zhang, Amarjeet Kumar, Walker R. Ueland, Michael J. Fassler, Jinlong Huang, Xiao Sun, Lisheng Wang, Pengcheng Shi, Maximilian Rokuss, Michael Baumgartner, Yannick Kirchhof, Klaus H. Maier-Hein, Fabian Isensee, Shuolin Liu, Bing Han, Bong Thanh Nguyen, Dong-jin Shin, Park Ji-Woo, Mathew Choi, Kwang-Hyun Uhm, Sung-Jea Ko, Chanwoong Lee , et al. (38 additional authors not shown)

    Abstract: Multi-class segmentation of the aorta in computed tomography angiography (CTA) scans is essential for diagnosing and planning complex endovascular treatments for patients with aortic dissections. However, existing methods reduce aortic segmentation to a binary problem, limiting their ability to measure diameters across different branches and zones. Furthermore, no open-source dataset is currently… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  10. arXiv:2411.12812  [pdf, other

    eess.SY

    DIETS: Diabetic Insulin Management System in Everyday Life

    Authors: Hanyu Zeng, Hui Ji, Pengfei Zhou

    Abstract: People with diabetes need insulin delivery to effectively manage their blood glucose levels, especially after meals, because their bodies either do not produce enough insulin or cannot fully utilize it. Accurate insulin delivery starts with estimating the nutrients in meals and is followed by developing a detailed, personalized insulin injection strategy. These tasks are particularly challenging i… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

  11. arXiv:2410.14214  [pdf, other

    cs.CV eess.IV

    MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive Imaging

    Authors: Zhenghao Pan, Haijin Zeng, Jiezhang Cao, Yongyong Chen, Kai Zhang, Yong Xu

    Abstract: Color video snapshot compressive imaging (SCI) employs computational imaging techniques to capture multiple sequential video frames in a single Bayer-patterned measurement. With the increasing popularity of quad-Bayer pattern in mainstream smartphone cameras for capturing high-resolution videos, mobile photography has become more accessible to a wider audience. However, existing color video SCI re… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024

  12. arXiv:2410.12811  [pdf, other

    cs.CV cs.SD eess.AS

    Decoding Emotions: Unveiling Facial Expressions through Acoustic Sensing with Contrastive Attention

    Authors: Guangjing Wang, Juexing Wang, Ce Zhou, Weikang Ding, Huacheng Zeng, Tianxing Li, Qiben Yan

    Abstract: Expression recognition holds great promise for applications such as content recommendation and mental healthcare by accurately detecting users' emotional states. Traditional methods often rely on cameras or wearable sensors, which raise privacy concerns and add extra device burdens. In addition, existing acoustic-based methods struggle to maintain satisfactory performance when there is a distribut… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

    Comments: The extended version of the 2023 IEEE INFOCOM conference paper

  13. arXiv:2409.17500  [pdf, other

    cs.AI eess.SY math.OC

    GLinSAT: The General Linear Satisfiability Neural Network Layer By Accelerated Gradient Descent

    Authors: Hongtai Zeng, Chao Yang, Yanzhen Zhou, Cheng Yang, Qinglai Guo

    Abstract: Ensuring that the outputs of neural networks satisfy specific constraints is crucial for applying neural networks to real-life decision-making problems. In this paper, we consider making a batch of neural network outputs satisfy bounded and general linear constraints. We first reformulate the neural network output projection problem as an entropy-regularized linear programming problem. We show tha… ▽ More

    Submitted 11 November, 2024; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: This paper has been accepted by 2024 Advances in Neural Information Processing Systems. The reviews and comments can be found in https://openreview.net/forum?id=m1PVjNHvtP

  14. arXiv:2409.15109  [pdf, other

    cs.IT eess.SP

    End-User-Centric Collaborative MIMO: Performance Analysis and Proof of Concept

    Authors: Chao-Kai Wen, Yen-Cheng Chan, Tzu-Hao Huang, Hao-Jun Zeng, Fu-Kang Wang, Lung-Sheng Tsai, Pei-Kai Liao

    Abstract: The trend toward using increasingly large arrays of antenna elements continues. However, fitting more antennas into the limited space available on user equipment (UE) within the currently popular Frequency Range 1 spectrum presents a significant challenge. This limitation constrains the capacity-scaling gains for end users, even when networks support a higher number of antennas. To address this is… ▽ More

    Submitted 24 December, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: 16 pages, 11 figures, this work has been submitted to IEEE for possible publication

  15. arXiv:2409.07417  [pdf, other

    eess.IV cs.CV

    Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging

    Authors: Yunzhen Wang, Haijin Zeng, Shaoguang Huang, Hongyu Chen, Hongyan Zhang

    Abstract: Coded Aperture Snapshot Spectral Imaging (CASSI) is a crucial technique for capturing three-dimensional multispectral images (MSIs) through the complex inverse task of reconstructing these images from coded two-dimensional measurements. Current state-of-the-art methods, predominantly end-to-end, face limitations in reconstructing high-frequency details and often rely on constrained datasets like K… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

  16. arXiv:2408.00629  [pdf, other

    cs.CV eess.IV

    Cross-Scan Mamba with Masked Training for Robust Spectral Imaging

    Authors: Wenzhe Tian, Haijin Zeng, Yin-Ping Zhao, Yongyong Chen, Zhen Wang, Xuelong Li

    Abstract: Snapshot Compressive Imaging (SCI) enables fast spectral imaging but requires effective decoding algorithms for hyperspectral image (HSI) reconstruction from compressed measurements. Current CNN-based methods are limited in modeling long-range dependencies, while Transformer-based models face high computational complexity. Although recent Mamba models outperform CNNs and Transformers in RGB tasks… ▽ More

    Submitted 6 December, 2024; v1 submitted 1 August, 2024; originally announced August 2024.

    Comments: 11 pages,7 figures

  17. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language Model

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for fundus images were pre-trained with limited disease categories and knowledge base. Here we introduce a knowledge-rich vision-language model (RetiZero) that leverages knowledge from more than 400 fundus diseases. For RetiZero's pretraining, we compiled 341,896 fundus images paired with texts, sourced from public datasets, ophthalmic literature, and online resources, e… ▽ More

    Submitted 10 April, 2025; v1 submitted 13 June, 2024; originally announced June 2024.

  18. arXiv:2405.16102  [pdf, other

    eess.IV cs.CV

    Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation

    Authors: Hongye Zeng, Ke Zou, Zhihao Chen, Rui Zheng, Huazhu Fu

    Abstract: Source-Free Unsupervised Domain Adaptation (SFUDA) has recently become a focus in the medical image domain adaptation, as it only utilizes the source model and does not require annotated target data. However, current SFUDA approaches cannot tackle the complex segmentation task across different MRI sequences, such as the vestibular schwannoma segmentation. To address this problem, we proposed Relia… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Early accepted by MICCAI 2024

  19. arXiv:2405.09923  [pdf, other

    cs.CV eess.IV

    NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge

    Authors: Jie Liang, Radu Timofte, Qiaosi Yi, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang

    Abstract: In this paper, we review the NTIRE 2024 challenge on Restore Any Image Model (RAIM) in the Wild. The RAIM challenge constructed a benchmark for image restoration in the wild, including real-world images with/without reference ground truth in various scenarios from real applications. The participants were required to restore the real-captured images from complex and unknown degradation, where gener… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  20. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  21. arXiv:2404.16920  [pdf, other

    cs.NI cs.IT cs.LG eess.SP

    Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks

    Authors: Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li, Shivendra Panwar

    Abstract: We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i.e., users sending data packet requests to access points (APs) via uplinks and APs transmitting requested data packets to users via downlinks. Our objective is to minimize the average delay in the system due to APs' limited service capacity and unreliable wireless channels between APs and u… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: IEEE Transactions on Wireless Communications

  22. arXiv:2402.11211  [pdf, other

    eess.IV cs.CV

    Training-free image style alignment for self-adapting domain shift on handheld ultrasound devices

    Authors: Hongye Zeng, Ke Zou, Zhihao Chen, Yuchong Gao, Hongbo Chen, Haibin Zhang, Kang Zhou, Meng Wang, Rick Siow Mong Goh, Yong Liu, Chang Jiang, Rui Zheng, Huazhu Fu

    Abstract: Handheld ultrasound devices face usage limitations due to user inexperience and cannot benefit from supervised deep learning without extensive expert annotations. Moreover, the models trained on standard ultrasound device data are constrained by training data distribution and perform poorly when directly applied to handheld device data. In this study, we propose the Training-free Image Style Align… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  23. arXiv:2401.11620  [pdf, other

    eess.SY

    Real-Time Systems Optimization with Black-box Constraints and Hybrid Variables

    Authors: Sen Wang, Dong Li, Shao-Yu Huang, Xuanliang Deng, Ashrarul H. Sifat, Changhee Jung, Ryan Williams, Haibo Zeng

    Abstract: When optimizing real-time systems, designers often face a challenging problem where the schedulability constraints are non-convex, non-continuous, or lack an analytical form to understand their properties. Although the optimization framework NORTH proposed in previous work is general (it works with arbitrary schedulability analysis) and scalable, it can only handle problems with continuous variabl… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: Workshop on OPtimization for Embedded and ReAl-time systems (OPERA 2023) co-located with the 44th IEEE Real-Time Systems Symposium (RTSS)

  24. arXiv:2401.03284  [pdf, other

    eess.SY

    Joint Optimization of Continuous Variables and Priority Assignments for Real-Time Systems with Black-box Schedulability Constraints

    Authors: Sen Wang, Dong Li, Shao-Yu Huang, Xuanliang Deng, Ashrarul H. Sifat, Changhee Jung, Ryan Williams, Haibo Zeng

    Abstract: In real-time systems optimization, designers often face a challenging problem posed by the non-convex and non-continuous schedulability conditions, which may even lack an analytical form to understand their properties. To tackle this challenging problem, we treat the schedulability analysis as a black box that only returns true/false results. We propose a general and scalable framework to optimize… ▽ More

    Submitted 18 March, 2025; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: Extension of a conference paper

  25. arXiv:2310.19699  [pdf, other

    eess.SY cs.OS cs.SC

    Optimizing Logical Execution Time Model for Both Determinism and Low Latency

    Authors: Sen Wang, Dong Li, Ashrarul H. Sifat, Shao-Yu Huang, Xuanliang Deng, Changhee Jung, Ryan Williams, Haibo Zeng

    Abstract: The Logical Execution Time (LET) programming model has recently received considerable attention, particularly because of its timing and dataflow determinism. In LET, task computation appears always to take the same amount of time (called the task's LET interval), and the task reads (resp. writes) at the beginning (resp. end) of the interval. Compared to other communication mechanisms, such as impl… ▽ More

    Submitted 7 March, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: accepted in RTAS'24

  26. arXiv:2307.01990  [pdf

    eess.IV cs.CV

    Unsupervised Spectral Demosaicing with Lightweight Spectral Attention Networks

    Authors: Kai Feng, Yongqiang Zhao, Seong G. Kong, Haijin Zeng

    Abstract: This paper presents a deep learning-based spectral demosaicing technique trained in an unsupervised manner. Many existing deep learning-based techniques relying on supervised learning with synthetic images, often underperform on real-world images especially when the number of spectral bands increases. According to the characteristics of the spectral mosaic image, this paper proposes a mosaic loss… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  27. arXiv:2305.04047  [pdf, other

    eess.IV cs.CV

    Degradation-Noise-Aware Deep Unfolding Transformer for Hyperspectral Image Denoising

    Authors: Haijin Zeng, Jiezhang Cao, Kai Feng, Shaoguang Huang, Hongyan Zhang, Hiep Luong, Wilfried Philips

    Abstract: Hyperspectral imaging (HI) has emerged as a powerful tool in diverse fields such as medical diagnosis, industrial inspection, and agriculture, owing to its ability to detect subtle differences in physical properties through high spectral resolution. However, hyperspectral images (HSIs) are often quite noisy because of narrow band spectral filtering. To reduce the noise in HSI data cubes, both mode… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

  28. arXiv:2303.13571  [pdf, other

    cs.CV eess.IV

    Inheriting Bayer's Legacy-Joint Remosaicing and Denoising for Quad Bayer Image Sensor

    Authors: Haijin Zeng, Kai Feng, Jiezhang Cao, Shaoguang Huang, Yongqiang Zhao, Hiep Luong, Jan Aelterman, Wilfried Philips

    Abstract: Pixel binning based Quad sensors have emerged as a promising solution to overcome the hardware limitations of compact cameras in low-light imaging. However, binning results in lower spatial resolution and non-Bayer CFA artifacts. To address these challenges, we propose a dual-head joint remosaicing and denoising network (DJRD), which enables the conversion of noisy Quad Bayer and standard noise-fr… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  29. arXiv:2303.13404  [pdf, other

    eess.IV cs.CV

    MSFA-Frequency-Aware Transformer for Hyperspectral Images Demosaicing

    Authors: Haijin Zeng, Kai Feng, Shaoguang Huang, Jiezhang Cao, Yongyong Chen, Hongyan Zhang, Hiep Luong, Wilfried Philips

    Abstract: Hyperspectral imaging systems that use multispectral filter arrays (MSFA) capture only one spectral component in each pixel. Hyperspectral demosaicing is used to recover the non-measured components. While deep learning methods have shown promise in this area, they still suffer from several challenges, including limited modeling of non-local dependencies, lack of consideration of the periodic MSFA… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  30. arXiv:2302.03839  [pdf, other

    eess.IV cs.CV cs.LG

    Futuristic Variations and Analysis in Fundus Images Corresponding to Biological Traits

    Authors: Muhammad Hassan, Hao Zhang, Ahmed Fateh Ameen, Home Wu Zeng, Shuye Ma, Wen Liang, Dingqi Shang, Jiaming Ding, Ziheng Zhan, Tsz Kwan Lam, Ming Xu, Qiming Huang, Dongmei Wu, Can Yang Zhang, Zhou You, Awiwu Ain, Pei Wu Qin

    Abstract: Fundus image captures rear of an eye, and which has been studied for the diseases identification, classification, segmentation, generation, and biological traits association using handcrafted, conventional, and deep learning methods. In biological traits estimation, most of the studies have been carried out for the age prediction and gender classification with convincing results. However, the curr… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 10 pages, 4 figures, 3 tables

  31. arXiv:2301.06132  [pdf, other

    cs.CV eess.IV

    Deep Diversity-Enhanced Feature Representation of Hyperspectral Images

    Authors: Jinhui Hou, Zhiyu Zhu, Junhui Hou, Hui Liu, Huanqiang Zeng, Deyu Meng

    Abstract: In this paper, we study the problem of efficiently and effectively embedding the high-dimensional spatio-spectral information of hyperspectral (HS) images, guided by feature diversity. Specifically, based on the theoretical formulation that feature diversity is correlated with the rank of the unfolded kernel matrix, we rectify 3D convolution by modifying its topology to enhance the rank upper-boun… ▽ More

    Submitted 9 May, 2024; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 17 pages, 12 figures. Accepted in TPAMI 2024. arXiv admin note: substantial text overlap with arXiv:2207.04266

  32. arXiv:2301.01420  [pdf

    cs.MM eess.IV

    Improved CNN Prediction Based Reversible Data Hiding

    Authors: Yingqiang Qiu, Wanli Peng, Xiaodan Lin, Huanqiang Zeng, Zhenxing Qian

    Abstract: This letter proposes an improved CNN predictor (ICNNP) for reversible data hiding (RDH) in images, which consists of a feature extraction module, a pixel prediction module, and a complexity prediction module. Due to predicting the complexity of each pixel with the ICNNP during the embedding process, the proposed method can achieve superior performance than the CNN predictor-based method. Specifica… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

  33. arXiv:2212.14747  [pdf, other

    eess.IV cs.CV

    VertMatch: A Semi-supervised Framework for Vertebral Structure Detection in 3D Ultrasound Volume

    Authors: Hongye Zeng, kang Zhou, Songhan Ge, Yuchong Gao, Jianhao Zhao, Shenghua Gao, Rui Zheng

    Abstract: Three-dimensional (3D) ultrasound imaging technique has been applied for scoliosis assessment, but current assessment method only uses coronal projection image and cannot illustrate the 3D deformity and vertebra rotation. The vertebra detection is essential to reveal 3D spine information, but the detection task is challenging due to complex data and limited annotations. We propose VertMatch, a two… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: 15 pages, 8 figures

  34. Deep Posterior Distribution-based Embedding for Hyperspectral Image Super-resolution

    Authors: Jinhui Hou, Zhiyu Zhu, Junhui Hou, Huanqiang Zeng, Jinjian Wu, Jiantao Zhou

    Abstract: In this paper, we investigate the problem of hyperspectral (HS) image spatial super-resolution via deep learning. Particularly, we focus on how to embed the high-dimensional spatial-spectral information of HS images efficiently and effectively. Specifically, in contrast to existing methods adopting empirically-designed network modules, we formulate HS embedding as an approximation of the posterior… ▽ More

    Submitted 23 August, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted by IEEE Transactions on Image Processing

  35. arXiv:2204.12879  [pdf, other

    cs.CV eess.IV

    Low-rank Meets Sparseness: An Integrated Spatial-Spectral Total Variation Approach to Hyperspectral Denoising

    Authors: Haijin Zeng, Shaoguang Huang, Yongyong Chen, Hiep Luong, Wilfried Philips

    Abstract: Spatial-Spectral Total Variation (SSTV) can quantify local smoothness of image structures, so it is widely used in hyperspectral image (HSI) processing tasks. Essentially, SSTV assumes a sparse structure of gradient maps calculated along the spatial and spectral directions. In fact, these gradient tensors are not only sparse, but also (approximately) low-rank under FFT, which we have verified by n… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

  36. arXiv:2204.07228  [pdf

    cs.CL cs.SD eess.AS

    Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech

    Authors: Cong Zhang, Huinan Zeng, Huang Liu, Jiewen Zheng

    Abstract: This study investigates whether the phonological features derived from the Featurally Underspecified Lexicon model can be applied in text-to-speech systems to generate native and non-native speech in English and Mandarin. We present a mapping of ARPABET/pinyin to SAMPA/SAMPA-SC and then to phonological features. This mapping was tested for whether it could lead to the successful generation of nati… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: submitted to Interspeech 2022. arXiv admin note: substantial text overlap with arXiv:2110.03609

  37. arXiv:2203.16537  [pdf, other

    cs.LG cs.AI eess.SP

    Efficient Localness Transformer for Smart Sensor-Based Energy Disaggregation

    Authors: Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang

    Abstract: Modern smart sensor-based energy management systems leverage non-intrusive load monitoring (NILM) to predict and optimize appliance load distribution in real-time. NILM, or energy disaggregation, refers to the decomposition of electricity usage conditioned on the aggregated power signals (i.e., smart sensor on the main channel). Based on real-time appliance power prediction using sensory technolog… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted to DCOSS 2022

  38. arXiv:2203.14216  [pdf, other

    cs.CV eess.IV

    Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution

    Authors: Jie Liang, Hui Zeng, Lei Zhang

    Abstract: Efficient and effective real-world image super-resolution (Real-ISR) is a challenging task due to the unknown complex degradation of real-world images and the limited computation resources in practical applications. Recent research on Real-ISR has achieved significant progress by modeling the image degradation space; however, these methods largely rely on heavy backbone networks and they are infle… ▽ More

    Submitted 27 March, 2022; originally announced March 2022.

  39. arXiv:2203.09195  [pdf, other

    eess.IV cs.CV

    Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution

    Authors: Jie Liang, Hui Zeng, Lei Zhang

    Abstract: Single image super-resolution (SISR) with generative adversarial networks (GAN) has recently attracted increasing attention due to its potentials to generate rich details. However, the training of GAN is unstable, and it often introduces many perceptually unpleasant artifacts along with the generated details. In this paper, we demonstrate that it is possible to train a GAN-based SISR model which c… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: To appear at CVPR 2022

  40. arXiv:2203.07659  [pdf

    eess.IV cs.CV

    Breast Cancer Molecular Subtypes Prediction on Pathological Images with Discriminative Patch Selecting and Multi-Instance Learning

    Authors: Hong Liu, Wen-Dong Xu, Zi-Hao Shang, Xiang-Dong Wang, Hai-Yan Zhou, Ke-Wen Ma, Huan Zhou, Jia-Lin Qi, Jia-Rui Jiang, Li-Lan Tan, Hui-Min Zeng, Hui-Juan Cai, Kuan-Song Wang, Yue-Liang Qian

    Abstract: Molecular subtypes of breast cancer are important references to personalized clinical treatment. For cost and labor savings, only one of the patient's paraffin blocks is usually selected for subsequent immunohistochemistry (IHC) to obtain molecular subtypes. Inevitable sampling error is risky due to tumor heterogeneity and could result in a delay in treatment. Molecular subtype prediction from con… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  41. arXiv:2112.02858  [pdf

    eess.IV cs.CV cs.MM

    A comparison study of CNN denoisers on PRNU extraction

    Authors: Hui Zeng, Morteza Darvish Morshedi Hosseini, Kang Deng, Anjie Peng, Miroslav Goljan

    Abstract: Performance of the sensor-based camera identification (SCI) method heavily relies on the denoising filter in estimating Photo-Response Non-Uniformity (PRNU). Given various attempts on enhancing the quality of the extracted PRNU, it still suffers from unsatisfactory performance in low-resolution images and high computational demand. Leveraging the similarity of PRNU estimation and image denoising,… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 12 pages, 6 figures, 4 tables

  42. arXiv:2111.14474  [pdf, other

    eess.IV cs.CV

    Learning-Based Video Coding with Joint Deep Compression and Enhancement

    Authors: Tiesong Zhao, Weize Feng, Hongji Zeng, Yuzhen Niu, Jiaying Liu

    Abstract: The end-to-end learning-based video compression has attracted substantial attentions by paving another way to compress video signals as stacked visual features. This paper proposes an efficient end-to-end deep video codec with jointly optimized compression and enhancement modules (JCEVC). First, we propose a dual-path generative adversarial network (DPEG) to reconstruct video details after compres… ▽ More

    Submitted 30 April, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: 10 pages, 9 figures

  43. arXiv:2111.08233  [pdf, ps, other

    eess.SP

    Toward UL-DL Rate Balancing: Joint Resource Allocation and Hybrid-Mode Multiple Access for UAV-BS Assisted Communication Systems

    Authors: Haiyong Zeng, Xu Zhu, Yufei Jiang, Zhongxiang Wei, Sumei Sun

    Abstract: In this paper, we investigate unmanned aerial vehicle (UAV) assisted communication systems that require quasi-balanced data rates in uplink (UL) and downlink (DL), as well as users' heterogeneous traffic. To the best of our knowledge, this is the first work to explicitly investigate joint UL-DL optimization for UAV assisted systems under heterogeneous requirements. A hybrid-mode multiple access (H… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 32 pages, 9 figures

  44. arXiv:2110.03609  [pdf

    cs.CL cs.LG cs.SD eess.AS

    Applying Phonological Features in Multilingual Text-To-Speech

    Authors: Cong Zhang, Huinan Zeng, Huang Liu, Jiewen Zheng

    Abstract: This study investigates whether phonological features can be applied in text-to-speech systems to generate native and non-native speech in English and Mandarin. We present a mapping of ARPABET/pinyin to SAMPA/SAMPA-SC and then to phonological features. We tested whether this mapping could lead to the successful generation of native, non-native, and code-switched speech in the two languages. We ran… ▽ More

    Submitted 10 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: demo webpage: https://congzhang365.github.io/feature_tts/

  45. arXiv:2105.07825  [pdf, other

    eess.IV cs.CV cs.LG

    Real-Time Quantized Image Super-Resolution on Mobile NPUs, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Andrew Lek, Mustafa Ayazoglu, Jie Liu, Zongcai Du, Jiaming Guo, Xueyi Zhou, Hao Jia, Youliang Yan, Zexin Zhang, Yixin Chen, Yunbo Peng, Yue Lin, Xindong Zhang, Hui Zeng, Kun Zeng, Peirong Li, Zhihuang Liu, Shiqi Xue, Shengpeng Wang

    Abstract: Image super-resolution is one of the most popular computer vision problems with many important applications to mobile devices. While many solutions have been proposed for this task, they are usually not optimized even for common smartphone AI hardware, not to mention more constrained smart TV platforms that are often supporting INT8 inference only. To address this problem, we introduce the first M… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/

  46. arXiv:2105.03847  [pdf

    eess.IV cs.CV

    Automatic segmentation of vertebral features on ultrasound spine images using Stacked Hourglass Network

    Authors: Hong-Ye Zeng, Song-Han Ge, Yu-Chong Gao, De-Sen Zhou, Kang Zhou, Xu-Ming He, Edmond Lou, Rui Zheng

    Abstract: Objective: The spinous process angle (SPA) is one of the essential parameters to denote three-dimensional (3-D) deformity of spine. We propose an automatic segmentation method based on Stacked Hourglass Network (SHN) to detect the spinous processes (SP) on ultrasound (US) spine images and to measure the SPAs of clinical scoliotic subjects. Methods: The network was trained to detect vertebral SP an… ▽ More

    Submitted 23 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

    Comments: 9 pages,5 figures

  47. arXiv:2104.14655  [pdf

    eess.IV cs.CV

    Lung Cancer Diagnosis Using Deep Attention Based on Multiple Instance Learning and Radiomics

    Authors: Junhua Chen, Haiyan Zeng, Chong Zhang, Zhenwei Shi, Andre Dekker, Leonard Wee, Inigo Bermejo

    Abstract: Early diagnosis of lung cancer is a key intervention for the treatment of lung cancer computer aided diagnosis (CAD) can play a crucial role. However, most published CAD methods treat lung cancer diagnosis as a lung nodule classification problem, which does not reflect clinical practice, where clinicians diagnose a patient based on a set of images of nodules, instead of one specific nodule. Beside… ▽ More

    Submitted 12 February, 2022; v1 submitted 29 April, 2021; originally announced April 2021.

  48. Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time

    Authors: Hui Zeng, Jianrui Cai, Lida Li, Zisheng Cao, Lei Zhang

    Abstract: Recent years have witnessed the increasing popularity of learning based methods to enhance the color and tone of photos. However, many existing photo enhancement methods either deliver unsatisfactory results or consume too much computational and memory resources, hindering their application to high-resolution images (usually with more than 12 megapixels) in practice. In this paper, we learn image-… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

    Comments: High quality adaptive photo enhancement in real-time (<2ms for 4K resolution images)! Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence

  49. Hyperspectral Image Super-resolution via Deep Progressive Zero-centric Residual Learning

    Authors: Zhiyu Zhu, Junhui Hou, Jie Chen, Huanqiang Zeng, Jiantao Zhou

    Abstract: This paper explores the problem of hyperspectral image (HSI) super-resolution that merges a low resolution HSI (LR-HSI) and a high resolution multispectral image (HR-MSI). The cross-modality distribution of the spatial and spectral information makes the problem challenging. Inspired by the classic wavelet decomposition-based image fusion, we propose a novel \textit{lightweight} deep neural network… ▽ More

    Submitted 5 December, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

  50. Hyperspectral Image Denoising via Global Spatial-Spectral Total Variation Regularized Nonconvex Local Low-Rank Tensor Approximation

    Authors: Haijin Zeng, Xiaozhen Xie, Jifeng Ning

    Abstract: Hyperspectral image (HSI) denoising aims to restore clean HSI from the noise-contaminated one. Noise contamination can often be caused during data acquisition and conversion. In this paper, we propose a novel spatial-spectral total variation (SSTV) regularized nonconvex local low-rank (LR) tensor approximation method to remove mixed noise in HSIs. From one aspect, the clean HSI data have its under… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    MSC Class: 94A12

    Journal ref: Signal Processing Volume 178, January 2021, 107805