Skip to main content

Showing 1–31 of 31 results for author: Qi, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.16177  [pdf, ps, other

    eess.IV cs.CV

    Generative Latent Coding for Ultra-Low Bitrate Image and Video Compression

    Authors: Linfeng Qi, Zhaoyang Jia, Jiahao Li, Bin Li, Houqiang Li, Yan Lu

    Abstract: Most existing approaches for image and video compression perform transform coding in the pixel space to reduce redundancy. However, due to the misalignment between the pixel-space distortion and human perception, such schemes often face the difficulties in achieving both high-realism and high-fidelity at ultra-low bitrate. To solve this problem, we propose \textbf{G}enerative \textbf{L}atent \text… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2504.12711  [pdf, other

    cs.CV cs.AI eess.IV

    NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

    Authors: Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou , et al. (112 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images. This challenge received a wide range of impressive solutions, which are developed and evaluated using our collected real-world Raindrop Clarity dataset. Unlike existing deraining datasets, our Raindrop Clarity dataset is more diverse and challenging in degradation types and contents, which includ… ▽ More

    Submitted 19 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of CVPR NTIRE 2025; 26 pages; Methods from 32 teams

  3. arXiv:2504.01577  [pdf, other

    eess.IV cs.CV

    Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology

    Authors: Lirui Qi, Hongliang He, Tong Wang, Siwei Feng, Guohong Fu

    Abstract: Nuclear instance segmentation plays a vital role in disease diagnosis within digital pathology. However, limited labeled data in pathological images restricts the overall performance of nuclear instance segmentation. To tackle this challenge, we propose a novel data augmentation framework Instance Migration Diffusion Model (IM-Diffusion), IM-Diffusion designed to generate more varied pathological… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  4. arXiv:2503.06945  [pdf, other

    eess.IV cs.CV

    Dynamic Cross-Modal Feature Interaction Network for Hyperspectral and LiDAR Data Classification

    Authors: Junyan Lin, Feng Gap, Lin Qi, Junyu Dong, Qian Du, Xinbo Gao

    Abstract: Hyperspectral image (HSI) and LiDAR data joint classification is a challenging task. Existing multi-source remote sensing data classification methods often rely on human-designed frameworks for feature extraction, which heavily depend on expert knowledge. To address these limitations, we propose a novel Dynamic Cross-Modal Feature Interaction Network (DCMNet), the first framework leveraging a dyna… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Accepted by IEEE TGRS 2025

  5. arXiv:2502.20762  [pdf, other

    eess.IV cs.CV

    Towards Practical Real-Time Neural Video Compression

    Authors: Zhaoyang Jia, Bin Li, Jiahao Li, Wenxuan Xie, Linfeng Qi, Houqiang Li, Yan Lu

    Abstract: We introduce a practical real-time neural video codec (NVC) designed to deliver high compression ratio, low latency and broad versatility. In practice, the coding speed of NVCs depends on 1) computational costs, and 2) non-computational operational costs, such as memory I/O and the number of function calls. While most efficient NVCs prioritize reducing computational cost, we identify operational c… ▽ More

    Submitted 18 March, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

    Comments: CVPR 2025. Visit the project page at https://dcvccodec.github.io and access the code at https://github.com/microsoft/DCVC

  6. arXiv:2412.08671  [pdf, other

    cs.CV cs.LG eess.IV

    A Deep Semantic Segmentation Network with Semantic and Contextual Refinements

    Authors: Zhiyan Wang, Deyin Liu, Lin Yuanbo Wu, Song Wang, Xin Guo, Lin Qi

    Abstract: Semantic segmentation is a fundamental task in multimedia processing, which can be used for analyzing, understanding, editing contents of images and videos, among others. To accelerate the analysis of multimedia data, existing segmentation researches tend to extract semantic information by progressively reducing the spatial resolutions of feature maps. However, this approach introduces a misalignm… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Accept by tmm

  7. arXiv:2412.08670  [pdf, other

    cs.CV cs.LG eess.IV

    A feature refinement module for light-weight semantic segmentation network

    Authors: Zhiyan Wang, Xin Guo, Song Wang, Peixiao Zheng, Lin Qi

    Abstract: Low computational complexity and high segmentation accuracy are both essential to the real-world semantic segmentation tasks. However, to speed up the model inference, most existing approaches tend to design light-weight networks with a very limited number of parameters, leading to a considerable degradation in accuracy due to the decrease of the representation ability of the networks. To solve th… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Accept by icip 2023

  8. arXiv:2408.13975  [pdf

    physics.med-ph eess.IV

    Cross-sectional imaging of speed-of-sound distribution using photoacoustic reversal beacons

    Authors: Yang Wang, Danni Wang, Liting Zhong, Yi Zhou, Qing Wang, Wufan Chen, Li Qi

    Abstract: Photoacoustic tomography (PAT) enables non-invasive cross-sectional imaging of biological tissues, but it fails to map the spatial variation of speed-of-sound (SOS) within tissues. While SOS is intimately linked to density and elastic modulus of tissues, the imaging of SOS distri-bution serves as a complementary imaging modality to PAT. Moreover, an accurate SOS map can be leveraged to correct for… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

  9. arXiv:2408.12760  [pdf, other

    eess.IV cs.CV

    Hierarchical Attention and Parallel Filter Fusion Network for Multi-Source Data Classification

    Authors: Han Luo, Feng Gao, Junyu Dong, Lin Qi

    Abstract: Hyperspectral image (HSI) and synthetic aperture radar (SAR) data joint classification is a crucial and yet challenging task in the field of remote sensing image interpretation. However, feature modeling in existing methods is deficient to exploit the abundant global, spectral, and local features simultaneously, leading to sub-optimal classification performance. To solve the problem, we propose a… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted by IEEE GRSL

  10. arXiv:2408.09241  [pdf, other

    cs.CV eess.IV

    Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration

    Authors: Xin Lin, Yuyan Zhou, Jingtong Yue, Chao Ren, Kelvin C. K. Chan, Lu Qi, Ming-Hsuan Yang

    Abstract: Unsupervised restoration approaches based on generative adversarial networks (GANs) offer a promising solution without requiring paired datasets. Yet, these GAN-based approaches struggle to surpass the performance of conventional unsupervised GAN-based frameworks without significantly modifying model structures or increasing the computational complexity. To address these issues, we propose a self-… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: This paper is an extended and revised version of our previous work "Unsupervised Image Denoising in Real-World Scenarios via Self-Collaboration Parallel Generative Adversarial Branches"(https://openaccess.thecvf.com/content/ICCV2023/papers/Lin_Unsupervised_Image_Denoising_in_Real-World_Scenarios_via_Self-Collaboration_Parallel_Generative_ICCV_2023_paper.pdf)

  11. arXiv:2407.19573  [pdf, other

    eess.SY

    Passivity based Stability Assessment for Four types of Droops for DC Microgrids

    Authors: Muhammad Anees, Lisa Qi, Mario Schweizer, Srdjan Lukic

    Abstract: DC microgrids are getting more and more applications due to simple converters, only voltage control and higher efficiencies compared to conventional AC grids. Droop control is a well know decentralized control strategy for power sharing among converter interfaced sources and loads in a DC microgrid. This work compares the stability assessment and control of four types of droops for boost converter… ▽ More

    Submitted 17 September, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

  12. arXiv:2407.19570  [pdf, other

    eess.SY

    A Baseline Approach for Modeling and Characterization of Commercial Off-The-Shelf (COTS) Droop Controlled Converter

    Authors: Muhammad Anees, Lisa Qi, Mehnaz Khan, Srdjan Lukic

    Abstract: Due to advancements in power electronics, new converter topologies are introduced day by day. It's hard to get an equivalent model from any manufacturer of any Commercial Off-The-Shelf (COTS) power electronics converters because of intellectual property (IP) and safety concerns. Most COTS products don't reveal the exact topology of the converter as well as the control architecture and correspondin… ▽ More

    Submitted 17 September, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

  13. arXiv:2407.03992  [pdf, other

    eess.IV

    Medical Image Fusion for High-Level Analysis: A Mutual Enhancement Framework for Unaligned PAT and MRI

    Authors: Yutian Zhong, Jinchuan He, Zhichao Liang, Shuangyang Zhang, Qianjin Feng, Lijun Lu, Li Qi

    Abstract: Photoacoustic tomography (PAT) offers optical contrast, whereas magnetic resonance imaging (MRI) excels in imaging soft tissue and organ anatomy. The fusion of PAT with MRI holds promising application prospects due to their complementary advantages. Existing image fusion have made considerable progress in pre-registered images, yet spatial deformations are difficult to avoid in medical imaging sce… ▽ More

    Submitted 19 March, 2025; v1 submitted 4 July, 2024; originally announced July 2024.

  14. arXiv:2406.01644  [pdf, other

    eess.IV

    Dual-Stream Attention Network for Hyperspectral Image Unmixing

    Authors: Yufang Wang, Wenmin Wu, Lin Qi, Feng Gao

    Abstract: Hyperspectral image (HSI) contains abundant spatial and spectral information, making it highly valuable for unmixing. In this paper, we propose a Dual-Stream Attention Network (DSANet) for HSI unmixing. The endmembers and abundance of a pixel in HSI have high correlations with its adjacent pixels. Therefore, we adopt a "many to one" strategy to estimate the abundance of the central pixel. In addit… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE IGARSS 2024

  15. arXiv:2406.01245  [pdf, other

    eess.IV

    Sparse Focus Network for Multi-Source Remote Sensing Data Classification

    Authors: Xuepeng Jin, Junyan Lin, Feng Gao, Lin Qi, Yang Zhou

    Abstract: Multi-source remote sensing data classification has emerged as a prominent research topic with the advancement of various sensors. Existing multi-source data classification methods are susceptible to irrelevant information interference during multi-source feature extraction and fusion. To solve this issue, we propose a sparse focus network for multi-source data classification. Sparse attention is… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE IGARSS 2024

  16. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  17. arXiv:2404.06695  [pdf, other

    eess.IV physics.med-ph

    Spiral Scanning and Self-Supervised Image Reconstruction Enable Ultra-Sparse Sampling Multispectral Photoacoustic Tomography

    Authors: Yutian Zhong, Xiaoming Zhang, Zongxin Mo, Shuangyang Zhang, Wufan Chen, Li Qi

    Abstract: Multispectral photoacoustic tomography (PAT) is an imaging modality that utilizes the photoacoustic effect to achieve non-invasive and high-contrast imaging of internal tissues. However, the hardware cost and computational demand of a multispectral PAT system consisting of up to thousands of detectors are huge. To address this challenge, we propose an ultra-sparse spiral sampling strategy for mult… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  18. arXiv:2312.01727  [pdf

    eess.IV physics.bio-ph

    Deep learning acceleration of iterative model-based light fluence correction for photoacoustic tomography

    Authors: Zhaoyong Liang, Shuangyang Zhang, Zhichao Liang, Zhongxin Mo, Xiaoming Zhang, Yutian Zhong, Wufan Chen, Li Qi

    Abstract: Photoacoustic tomography (PAT) is a promising imaging technique that can visualize the distribution of chromophores within biological tissue. However, the accuracy of PAT imaging is compromised by light fluence (LF), which hinders the quantification of light absorbers. Currently, model-based iterative methods are used for LF correction, but they require significant computational resources due to r… ▽ More

    Submitted 7 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

  19. arXiv:2305.03997  [pdf, other

    eess.IV cs.CV

    Dual Degradation Representation for Joint Deraining and Low-Light Enhancement in the Dark

    Authors: Xin Lin, Jingtong Yue, Sixian Ding, Chao Ren, Lu Qi, Ming-Hsuan Yang

    Abstract: Rain in the dark poses a significant challenge to deploying real-world applications such as autonomous driving, surveillance systems, and night photography. Existing low-light enhancement or deraining methods struggle to brighten low-light conditions and remove rain simultaneously. Additionally, cascade approaches like ``deraining followed by low-light enhancement'' or the reverse often result in… ▽ More

    Submitted 17 June, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

  20. SAWU-Net: Spatial Attention Weighted Unmixing Network for Hyperspectral Images

    Authors: Lin Qi, Xuewen Qin, Feng Gao, Junyu Dong, Xinbo Gao

    Abstract: Hyperspectral unmixing is a critical yet challenging task in hyperspectral image interpretation. Recently, great efforts have been made to solve the hyperspectral unmixing task via deep autoencoders. However, existing networks mainly focus on extracting spectral features from mixed pixels, and the employment of spatial feature prior knowledge is still insufficient. To this end, we put forward a sp… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: IEEE GRSL 2023

  21. arXiv:2209.05658  [pdf

    eess.SY

    EV Charging Station Wholesale Market Participation: A Strategic Bidding and Pricing Approach

    Authors: Mohammad Mousavi, Li "Lisa" Qi, Alexander Brissette, Meng Wu

    Abstract: This paper presents a framework for simultaneous bidding and pricing strategy for wholesale market participation of electric vehicle (EV) charging stations aggregator. The proposed framework incorporates the EV charging stations' technical constraints as well as EV owners' preferences. A bi-level optimization is adopted to model the problem. In the upper level, the total profit of the EV charging… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  22. arXiv:2205.03380  [pdf, ps, other

    eess.IV cs.CV math.OC

    Multi-mode Tensor Train Factorization with Spatial-spectral Regularization for Remote Sensing Images Recovery

    Authors: Gaohang Yu, Shaochun Wan, Liqun Qi, Yanwei Xu

    Abstract: Tensor train (TT) factorization and corresponding TT rank, which can well express the low-rankness and mode correlations of higher-order tensors, have attracted much attention in recent years. However, TT factorization based methods are generally not sufficient to characterize low-rankness along each mode of third-order tensor. Inspired by this, we generalize the tensor train factorization to the… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 21 pages

  23. SSCU-Net: Spatial-Spectral Collaborative Unmixing Network for Hyperspectral Images

    Authors: Lin Qi, Feng Gao, Junyu Dong, Xinbo Gao, Qian Du

    Abstract: Linear spectral unmixing is an essential technique in hyperspectral image processing and interpretation. In recent years, deep learning-based approaches have shown great promise in hyperspectral unmixing, in particular, unsupervised unmixing methods based on autoencoder networks are a recent trend. The autoencoder model, which automatically learns low-dimensional representations (abundances) and r… ▽ More

    Submitted 8 August, 2022; v1 submitted 12 March, 2022; originally announced March 2022.

    Comments: IEEE TGRS 2022

  24. arXiv:2107.11517  [pdf, other

    eess.IV cs.CV cs.LG

    Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing Vertical and Horizontal Convolutions

    Authors: Qian Yu, Lei Qi, Luping Zhou, Lei Wang, Yilong Yin, Yinghuan Shi, Wuzhang Wang, Yang Gao

    Abstract: Accurate image segmentation plays a crucial role in medical image analysis, yet it faces great challenges of various shapes, diverse sizes, and blurry boundaries. To address these difficulties, square kernel-based encoder-decoder architecture has been proposed and widely used, but its performance remains still unsatisfactory. To further cope with these challenges, we present a novel double-branch… ▽ More

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: 13 pages, 12 figures

    MSC Class: 68T07 ACM Class: I.4.6

  25. arXiv:2103.15295  [pdf, other

    eess.IV cs.CV

    Best-Buddy GANs for Highly Detailed Image Super-Resolution

    Authors: Wenbo Li, Kun Zhou, Lu Qi, Liying Lu, Nianjuan Jiang, Jiangbo Lu, Jiaya Jia

    Abstract: We consider the single image super-resolution (SISR) problem, where a high-resolution (HR) image is generated based on a low-resolution (LR) input. Recently, generative adversarial networks (GANs) become popular to hallucinate details. Most methods along this line rely on a predefined single-LR-single-HR mapping, which is not flexible enough for the SISR task. Also, GAN-generated fake details may… ▽ More

    Submitted 27 December, 2021; v1 submitted 28 March, 2021; originally announced March 2021.

  26. The Property of Frequency Shift in 2D-FRFT Domain with Application to Image Encryption

    Authors: Lei Gao, Lin Qi, Ling Guan

    Abstract: The Fractional Fourier Transform (FRFT) has been playing a unique and increasingly important role in signal and image processing. In this letter, we investigate the property of frequency shift in two-dimensional FRFT (2D-FRFT) domain. It is shown that the magnitude of image reconstruction from phase information is frequency shift-invariant in 2D-FRFT domain, enhancing the robustness of image encry… ▽ More

    Submitted 27 February, 2021; originally announced March 2021.

    Comments: IEEE Signal Processing Letters, 2021

  27. arXiv:2102.03837  [pdf, ps, other

    eess.IV cs.CV

    A novel multiple instance learning framework for COVID-19 severity assessment via data augmentation and self-supervised learning

    Authors: Zekun Li, Wei Zhao, Feng Shi, Lei Qi, Xingzhi Xie, Ying Wei, Zhongxiang Ding, Yang Gao, Shangjie Wu, Jun Liu, Yinghuan Shi, Dinggang Shen

    Abstract: How to fast and accurately assess the severity level of COVID-19 is an essential problem, when millions of people are suffering from the pandemic around the world. Currently, the chest CT is regarded as a popular and informative imaging tool for COVID-19 diagnosis. However, we observe that there are two issues -- weak annotation and insufficient data that may obstruct automatic COVID-19 severity a… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

    Comments: To appear in Medical Image Analysis

  28. arXiv:2101.06853  [pdf, other

    eess.IV cs.CV

    Deep Symmetric Adaptation Network for Cross-modality Medical Image Segmentation

    Authors: Xiaoting Han, Lei Qi, Qian Yu, Ziqi Zhou, Yefeng Zheng, Yinghuan Shi, Yang Gao

    Abstract: Unsupervised domain adaptation (UDA) methods have shown their promising performance in the cross-modality medical image segmentation tasks. These typical methods usually utilize a translation network to transform images from the source domain to target domain or train the pixel-level classifier merely using translated source images and original target images. However, when there exists a large dom… ▽ More

    Submitted 17 January, 2021; originally announced January 2021.

  29. arXiv:1907.03246  [pdf

    eess.IV cs.CV cs.MM

    An Experimental-based Review of Image Enhancement and Image Restoration Methods for Underwater Imaging

    Authors: Yan Wang, Wei Song, Giancarlo Fortino, Lizhe Qi, Wenqiang Zhang, Antonio Liotta

    Abstract: Underwater images play a key role in ocean exploration, but often suffer from severe quality degradation due to light absorption and scattering in water medium. Although major breakthroughs have been made recently in the general area of image enhancement and restoration, the applicability of new methods for improving the quality of underwater images has not specifically been captured. In this pape… ▽ More

    Submitted 7 July, 2019; originally announced July 2019.

    Comments: 19

  30. arXiv:1906.01704  [pdf, other

    q-bio.NC eess.SP

    A Novel Bi-hemispheric Discrepancy Model for EEG Emotion Recognition

    Authors: Yang Li, Wenming Zheng, Lei Wang, Yuan Zong, Lei Qi, Zhen Cui, Tong Zhang, Tengfei Song

    Abstract: The neuroscience study has revealed the discrepancy of emotion expression between left and right hemispheres of human brain. Inspired by this study, in this paper, we propose a novel bi-hemispheric discrepancy model (BiHDM) to learn the asymmetric differences between two hemispheres for electroencephalograph (EEG) emotion recognition. Concretely, we first employ four directed recurrent neural netw… ▽ More

    Submitted 10 May, 2019; originally announced June 2019.

  31. arXiv:1503.03383  [pdf, ps, other

    math.OC eess.SY

    An Explicit SOS Decomposition of A Fourth Order Four Dimensional Hankel Tensor with A Symmetric Generating Vector

    Authors: Yannan Chen, Liqun Qi, Qun Wang

    Abstract: In this note, we construct explicit SOS decomposition of A Fourth Order Four Dimensional Hankel Tensor with A Symmetric Generating Vector, at the critical value. This is a supplementary note to Paper [3].

    Submitted 8 March, 2015; originally announced March 2015.