Skip to main content

Showing 1–39 of 39 results for author: Kwong, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.11823  [pdf, ps, other

    eess.IV cs.CV

    Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution

    Authors: Zhangkai Ni, Yang Zhang, Wenhan Yang, Hanli Wang, Shiqi Wang, Sam Kwong

    Abstract: Major efforts in data-driven image super-resolution (SR) primarily focus on expanding the receptive field of the model to better capture contextual information. However, these methods are typically implemented by stacking deeper networks or leveraging transformer-based attention mechanisms, which consequently increases model complexity. In contrast, model-driven methods based on the unfolding para… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: Accepted to IEEE Transactions on Image Processing

  2. arXiv:2503.00047  [pdf, other

    eess.IV cs.CV eess.SP

    PCE-GAN: A Generative Adversarial Network for Point Cloud Attribute Quality Enhancement based on Optimal Transport

    Authors: Tian Guo, Hui Yuan, Qi Liu, Honglei Su, Raouf Hamzaoui, Sam Kwong

    Abstract: Point cloud compression significantly reduces data volume but sacrifices reconstruction quality, highlighting the need for advanced quality enhancement techniques. Most existing approaches focus primarily on point-to-point fidelity, often neglecting the importance of perceptual quality as interpreted by the human visual system. To address this issue, we propose a generative adversarial network for… ▽ More

    Submitted 26 February, 2025; originally announced March 2025.

  3. arXiv:2502.15174  [pdf, other

    eess.IV cs.CV

    FD-LSCIC: Frequency Decomposition-based Learned Screen Content Image Compression

    Authors: Shiqi Jiang, Hui Yuan, Shuai Li, Huanqiang Zeng, Sam Kwong

    Abstract: The learned image compression (LIC) methods have already surpassed traditional techniques in compressing natural scene (NS) images. However, directly applying these methods to screen content (SC) images, which possess distinct characteristics such as sharp edges, repetitive patterns, embedded text and graphics, yields suboptimal results. This paper addresses three key challenges in SC image compre… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  4. arXiv:2501.01481  [pdf, other

    eess.IV cs.CV

    Unleashing Correlation and Continuity for Hyperspectral Reconstruction from RGB Images

    Authors: Fuxiang Feng, Runmin Cong, Shoushui Wei, Yipeng Zhang, Jun Li, Sam Kwong, Wei Zhang

    Abstract: Reconstructing Hyperspectral Images (HSI) from RGB images can yield high spatial resolution HSI at a lower cost, demonstrating significant application potential. This paper reveals that local correlation and global continuity of the spectral characteristics are crucial for HSI reconstruction tasks. Therefore, we fully explore these inter-spectral relationships and propose a Correlation and Continu… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

  5. arXiv:2412.15847  [pdf, other

    eess.IV cs.CV

    Image Quality Assessment: Enhancing Perceptual Exploration and Interpretation with Collaborative Feature Refinement and Hausdorff distance

    Authors: Xuekai Wei, Junyu Zhang, Qinlin Hu, Mingliang Zhou\\Yong Feng, Weizhi Xian, Huayan Pu, Sam Kwong

    Abstract: Current full-reference image quality assessment (FR-IQA) methods often fuse features from reference and distorted images, overlooking that color and luminance distortions occur mainly at low frequencies, whereas edge and texture distortions occur at high frequencies. This work introduces a pioneering training-free FR-IQA method that accurately predicts image quality in alignment with the human vis… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

  6. arXiv:2411.09308  [pdf, other

    eess.IV cs.CV

    DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines

    Authors: Junqi Liu, Yun Zhang, Xiaoqi Wang, Xu Long, Sam Kwong

    Abstract: Just Recognizable Difference (JRD) represents the minimum visual difference that is detectable by machine vision, which can be exploited to promote machine vision oriented visual signal processing. In this paper, we propose a Deep Transformer based JRD (DT-JRD) prediction model for Video Coding for Machines (VCM), where the accurately predicted JRD can be used reduce the coding bit rate while main… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

    Comments: Submitted to IEEE Transactions on Multimedia

  7. arXiv:2409.11711  [pdf, other

    eess.IV cs.CV

    LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution

    Authors: Shiyu Feng, Yun Zhang, Linwei Zhu, Sam Kwong

    Abstract: Light-Field (LF) image is emerging 4D data of light rays that is capable of realistically presenting spatial and angular information of 3D scene. However, the large data volume of LF images becomes the most challenging issue in real-time processing, transmission, and storage. In this paper, we propose an end-to-end deep LF Image Compression method Using Disentangled Representation and Asymmetrical… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  8. arXiv:2409.10293  [pdf, other

    eess.IV cs.CV

    SPAC: Sampling-based Progressive Attribute Compression for Dense Point Clouds

    Authors: Xiaolong Mao, Hui Yuan, Tian Guo, Shiqi Jiang, Raouf Hamzaoui, Sam Kwong

    Abstract: We propose an end-to-end attribute compression method for dense point clouds. The proposed method combines a frequency sampling module, an adaptive scale feature extraction module with geometry assistance, and a global hyperprior entropy model. The frequency sampling module uses a Hamming window and the Fast Fourier Transform to extract high-frequency components of the point cloud. The difference… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 136pages, 13 figures

  9. arXiv:2409.04123  [pdf, other

    eess.IV

    Feature Compression for Cloud-Edge Multimodal 3D Object Detection

    Authors: Chongzhen Tian, Zhengxin Li, Hui Yuan, Raouf Hamzaoui, Liquan Shen, Sam Kwong

    Abstract: Machine vision systems, which can efficiently manage extensive visual perception tasks, are becoming increasingly popular in industrial production and daily life. Due to the challenge of simultaneously obtaining accurate depth and texture information with a single sensor, multimodal data captured by cameras and LiDAR is commonly used to enhance performance. Additionally, cloud-edge cooperation has… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

  10. arXiv:2308.11627  [pdf, other

    eess.SP cs.AI cs.CV eess.IV eess.SY

    Non-Intrusive Electric Load Monitoring Approach Based on Current Feature Visualization for Smart Energy Management

    Authors: Yiwen Xu, Dengfeng Liu, Liangtao Huang, Zhiquan Lin, Tiesong Zhao, Sam Kwong

    Abstract: The state-of-the-art smart city has been calling for an economic but efficient energy management over large-scale network, especially for the electric power system. It is a critical issue to monitor, analyze and control electric loads of all users in system. In this paper, we employ the popular computer vision techniques of AI to design a non-invasive load monitoring method for smart electric ener… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  11. arXiv:2306.12298  [pdf, other

    cs.CV cs.LG eess.IV

    StarVQA+: Co-training Space-Time Attention for Video Quality Assessment

    Authors: Fengchuang Xing, Yuan-Gen Wang, Weixuan Tang, Guopu Zhu, Sam Kwong

    Abstract: Self-attention based Transformer has achieved great success in many computer vision tasks. However, its application to video quality assessment (VQA) has not been satisfactory so far. Evaluating the quality of in-the-wild videos is challenging due to the unknown of pristine reference and shooting distortion. This paper presents a co-trained Space-Time Attention network for the VQA problem, termed… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  12. arXiv:2306.08918  [pdf, other

    eess.IV cs.CV

    PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators

    Authors: Runmin Cong, Wenyu Yang, Wei Zhang, Chongyi Li, Chun-Le Guo, Qingming Huang, Sam Kwong

    Abstract: Due to the light absorption and scattering induced by the water medium, underwater images usually suffer from some degradation problems, such as low contrast, color distortion, and blurring details, which aggravate the difficulty of downstream underwater understanding tasks. Therefore, how to obtain clear and visually pleasant images has become a common concern of people, and the task of underwate… ▽ More

    Submitted 7 December, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 8 pages, 4 figures, Accepted by IEEE Transactions on Image Processing 2023

  13. Geometric Prior Based Deep Human Point Cloud Geometry Compression

    Authors: Xinju Wu, Pingping Zhang, Meng Wang, Peilin Chen, Shiqi Wang, Sam Kwong

    Abstract: The emergence of digital avatars has raised an exponential increase in the demand for human point clouds with realistic and intricate details. The compression of such data becomes challenging with overwhelming data amounts comprising millions of points. Herein, we leverage the human geometric prior in geometry redundancy removal of point clouds, greatly promoting the compression performance. More… ▽ More

    Submitted 25 March, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: Accepted by TCSVT 2024

  14. arXiv:2210.04158  [pdf, other

    eess.IV cs.CV

    HVS Revisited: A Comprehensive Video Quality Assessment Framework

    Authors: Ao-Xiang Zhang, Yuan-Gen Wang, Weixuan Tang, Leida Li, Sam Kwong

    Abstract: Video quality is a primary concern for video service providers. In recent years, the techniques of video quality assessment (VQA) based on deep convolutional neural networks (CNNs) have been developed rapidly. Although existing works attempt to introduce the knowledge of the human visual system (HVS) into VQA, there still exhibit limitations that prevent the full exploitation of HVS, including an… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 13 pages, 5 figures, Journal paper

  15. arXiv:2209.02934  [pdf, other

    eess.IV cs.CV

    Boundary Guided Semantic Learning for Real-time COVID-19 Lung Infection Segmentation System

    Authors: Runmin Cong, Yumo Zhang, Ning Yang, Haisheng Li, Xueqi Zhang, Ruochen Li, Zewen Chen, Yao Zhao, Sam Kwong

    Abstract: The coronavirus disease 2019 (COVID-19) continues to have a negative impact on healthcare systems around the world, though the vaccines have been developed and national vaccination coverage rate is steadily increasing. At the current stage, automatically segmenting the lung infection area from CT images is essential for the diagnosis and treatment of COVID-19. Thanks to the development of deep lea… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: Accepted by IEEE Transactions on Consumer Electronics 2022

  16. arXiv:2209.02285  [pdf, other

    cs.CV eess.IV

    High Dynamic Range Image Quality Assessment Based on Frequency Disparity

    Authors: Yue Liu, Zhangkai Ni, Shiqi Wang, Hanli Wang, Sam Kwong

    Abstract: In this paper, a novel and effective image quality assessment (IQA) algorithm based on frequency disparity for high dynamic range (HDR) images is proposed, termed as local-global frequency feature-based model (LGFM). Motivated by the assumption that the human visual system is highly adapted for extracting structural information and partial frequencies when perceiving the visual scene, the Gabor an… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  17. DeepWSD: Projecting Degradations in Perceptual Space to Wasserstein Distance in Deep Feature Space

    Authors: Xingran Liao, Baoliang Chen, Hanwei Zhu, Shiqi Wang, Mingliang Zhou, Sam Kwong

    Abstract: Existing deep learning-based full-reference IQA (FR-IQA) models usually predict the image quality in a deterministic way by explicitly comparing the features, gauging how severely distorted an image is by how far the corresponding feature lies from the space of the reference images. Herein, we look at this problem from a different viewpoint and propose to model the quality degradation in perceptua… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: ACM Multimedia 2022 accepted thesis

  18. arXiv:2207.08114  [pdf, other

    eess.IV cs.CV

    BCS-Net: Boundary, Context and Semantic for Automatic COVID-19 Lung Infection Segmentation from CT Images

    Authors: Runmin Cong, Haowei Yang, Qiuping Jiang, Wei Gao, Haisheng Li, Cong Wang, Yao Zhao, Sam Kwong

    Abstract: The spread of COVID-19 has brought a huge disaster to the world, and the automatic segmentation of infection regions can help doctors to make diagnosis quickly and reduce workload. However, there are several challenges for the accurate and complete segmentation, such as the scattered infection area distribution, complex background noises, and blurred segmentation boundaries. To this end, in this p… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE Transactions on Instrumentation and Measurement 2022, Code: https://github.com/rmcong/BCS-Net-TIM22

  19. arXiv:2207.00965  [pdf, other

    cs.CV eess.IV

    Cycle-Interactive Generative Adversarial Network for Robust Unsupervised Low-Light Enhancement

    Authors: Zhangkai Ni, Wenhan Yang, Hanli Wang, Shiqi Wang, Lin Ma, Sam Kwong

    Abstract: Getting rid of the fundamental limitations in fitting to the paired training data, recent unsupervised low-light enhancement methods excel in adjusting illumination and contrast of images. However, for unsupervised low light enhancement, the remaining noise suppression issue due to the lacking of supervision of detailed signal largely impedes the wide deployment of these methods in real-world appl… ▽ More

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: 9 pages, 7 figures, accepted to ACM MM 2022

  20. arXiv:2205.03587  [pdf, other

    eess.IV cs.CV

    Efficient VVC Intra Prediction Based on Deep Feature Fusion and Probability Estimation

    Authors: Tiesong Zhao, Yuhang Huang, Weize Feng, Yiwen Xu, Sam Kwong

    Abstract: The ever-growing multimedia traffic has underscored the importance of effective multimedia codecs. Among them, the up-to-date lossy video coding standard, Versatile Video Coding (VVC), has been attracting attentions of video coding community. However, the gain of VVC is achieved at the cost of significant encoding complexity, which brings the need to realize fast encoder with comparable Rate Disto… ▽ More

    Submitted 7 May, 2022; originally announced May 2022.

    Comments: 10 pages, 10 figures

  21. arXiv:2204.04059  [pdf, other

    eess.IV cs.CV cs.MM

    Deep Learning-Based Intra Mode Derivation for Versatile Video Coding

    Authors: Linwei Zhu, Yun Zhang, Na Li, Gangyi Jiang, Sam Kwong

    Abstract: In intra coding, Rate Distortion Optimization (RDO) is performed to achieve the optimal intra mode from a pre-defined candidate list. The optimal intra mode is also required to be encoded and transmitted to the decoder side besides the residual signal, where lots of coding bits are consumed. To further improve the performance of intra coding in Versatile Video Coding (VVC), an intelligent intra mo… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 19 pages, 7 figures, submitted to ACM TOMM

  22. arXiv:2202.09802  [pdf, other

    cs.CV eess.IV

    Distortion-Aware Loop Filtering of Intra 360^o Video Coding with Equirectangular Projection

    Authors: Pingping Zhang, Xu Wang, Linwei Zhu, Yun Zhang, Shiqi Wang, Sam Kwong

    Abstract: In this paper, we propose a distortion-aware loop filtering model to improve the performance of intra coding for 360$^o$ videos projected via equirectangular projection (ERP) format. To enable the awareness of distortion, our proposed module analyzes content characteristics based on a coding unit (CU) partition mask and processes them through partial convolution to activate the specified area. The… ▽ More

    Submitted 20 February, 2022; originally announced February 2022.

  23. arXiv:2201.11975  [pdf, other

    cs.CV eess.IV

    Generalized Visual Quality Assessment of GAN-Generated Face Images

    Authors: Yu Tian, Zhangkai Ni, Baoliang Chen, Shiqi Wang, Hanli Wang, Sam Kwong

    Abstract: Recent years have witnessed the dramatically increased interest in face generation with generative adversarial networks (GANs). A number of successful GAN algorithms have been developed to produce vivid face images towards different application scenarios. However, little work has been dedicated to automatic quality assessment of such GAN-generated face images (GFIs), even less have been devoted to… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: 12 pages, 8 figures, journal paper

  24. arXiv:2112.15299  [pdf, other

    eess.IV cs.CV

    CSformer: Bridging Convolution and Transformer for Compressive Sensing

    Authors: Dongjie Ye, Zhangkai Ni, Hanli Wang, Jian Zhang, Shiqi Wang, Sam Kwong

    Abstract: Convolution neural networks (CNNs) have succeeded in compressive image sensing. However, due to the inductive bias of locality and weight sharing, the convolution operations demonstrate the intrinsic limitations in modeling the long-range dependency. Transformer, designed initially as a sequence-to-sequence model, excels at capturing global contexts due to the self-attention-based architectures ev… ▽ More

    Submitted 30 December, 2021; originally announced December 2021.

  25. arXiv:2112.12284  [pdf, other

    cs.MM eess.IV

    A Survey on Perceptually Optimized Video Coding

    Authors: Yun Zhang, Linwei Zhu, Gangyi Jiang, Sam Kwong, C. -C. Jay Kuo

    Abstract: To provide users with more realistic visual experiences, videos are developing in the trends of Ultra High Definition (UHD), High Frame Rate (HFR), High Dynamic Range (HDR), Wide Color Gammut (WCG) and high clarity. However, the data amount of videos increases exponentially, which requires high efficiency video compression for storage and network transmission. Perceptually optimized video coding a… ▽ More

    Submitted 15 November, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

    Comments: 36 pages, 12 figures, 6 tables, accepted by ACM Computing Surveys

  26. arXiv:2112.00485  [pdf, other

    cs.CV eess.IV

    Learning Transformer Features for Image Quality Assessment

    Authors: Chao Zeng, Sam Kwong

    Abstract: Objective image quality evaluation is a challenging task, which aims to measure the quality of a given image automatically. According to the availability of the reference images, there are Full-Reference and No-Reference IQA tasks, respectively. Most deep learning approaches use regression from deep features extracted by Convolutional Neural Networks. For the FR task, another option is conducting… ▽ More

    Submitted 23 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  27. arXiv:2012.15052  [pdf, other

    eess.IV cs.CV

    Unpaired Image Enhancement with Quality-Attention Generative Adversarial Network

    Authors: Zhangkai Ni, Wenhan Yang, Shiqi Wang, Lin Ma, Sam Kwong

    Abstract: In this work, we aim to learn an unpaired image enhancement model, which can enrich low-quality images with the characteristics of high-quality images provided by users. We propose a quality attention generative adversarial network (QAGAN) trained on unpaired data based on the bidirectional Generative Adversarial Network (GAN) embedded with a quality attention module (QAM). The key novelty of the… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

  28. arXiv:2009.12537  [pdf, other

    eess.IV cs.CV

    Deep Selective Combinatorial Embedding and Consistency Regularization for Light Field Super-resolution

    Authors: Jing Jin, Junhui Hou, Zhiyu Zhu, Jie Chen, Sam Kwong

    Abstract: Light field (LF) images acquired by hand-held devices usually suffer from low spatial resolution as the limited detector resolution has to be shared with the angular dimension. LF spatial super-resolution (SR) thus becomes an indispensable part of the LF camera processing pipeline. The high-dimensionality characteristic and complex geometrical structure of LF images make the problem more challengi… ▽ More

    Submitted 6 October, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

    Comments: 14 pages, 12 figures. arXiv admin note: substantial text overlap with arXiv:2004.02215

  29. arXiv:2008.05642  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    Towards Modality Transferable Visual Information Representation with Optimal Model Compression

    Authors: Rongqun Lin, Linwei Zhu, Shiqi Wang, Sam Kwong

    Abstract: Compactly representing the visual signals is of fundamental importance in various image/video-centered applications. Although numerous approaches were developed for improving the image and video coding performance by removing the redundancies within visual signals, much less work has been dedicated to the transformation of the visual signals to another well-established modality for better represen… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: Accepted in ACM Multimedia 2020

  30. Subjective Quality Database and Objective Study of Compressed Point Clouds With 6DoF Head-Mounted Display

    Authors: Xinju Wu, Yun Zhang, Chunling Fan, Junhui Hou, Sam Kwong

    Abstract: In this paper, we focus on subjective and objective Point Cloud Quality Assessment (PCQA) in an immersive environment and study the effect of geometry and texture attributes in compression distortion. Using a Head-Mounted Display (HMD) with six degrees of freedom, we establish a subjective PCQA database, named SIAT Point Cloud Quality Database (SIAT-PCQD). Our database consists of 340 distorted po… ▽ More

    Submitted 3 August, 2021; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: This work has been submitted to the IEEE for possible publication

  31. arXiv:2004.02215  [pdf, other

    cs.CV eess.IV

    Light Field Spatial Super-resolution via Deep Combinatorial Geometry Embedding and Structural Consistency Regularization

    Authors: Jing Jin, Junhui Hou, Jie Chen, Sam Kwong

    Abstract: Light field (LF) images acquired by hand-held devices usually suffer from low spatial resolution as the limited sampling resources have to be shared with the angular dimension. LF spatial super-resolution (SR) thus becomes an indispensable part of the LF camera processing pipeline. The high-dimensionality characteristic and complex geometrical structure of LF images make the problem more challengi… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: This paper was accepted by CVPR 2020

  32. arXiv:1912.11954  [pdf, other

    cs.CV cs.MM eess.SY

    Non-Cooperative Game Theory Based Rate Adaptation for Dynamic Video Streaming over HTTP

    Authors: Hui Yuan, Huayong Fu, Ju Liu, Junhui Hou, Sam Kwong

    Abstract: Dynamic Adaptive Streaming over HTTP (DASH) has demonstrated to be an emerging and promising multimedia streaming technique, owing to its capability of dealing with the variability of networks. Rate adaptation mechanism, a challenging and open issue, plays an important role in DASH based systems since it affects Quality of Experience (QoE) of users, network utilization, etc. In this paper, based o… ▽ More

    Submitted 26 December, 2019; originally announced December 2019.

    Comments: This paper has been published on IEEE Transactions on Mobile Computing. H. Yuan, H. Fu, J. Liu, J. Hou, and S. Kwong, "Non-Cooperative Game Theory Based Rate Adaptation for Dynamic Video Streaming over HTTP," IEEE Transactions on Mobile Computing, vol.17, no.10, pp. 2334-2348, Oct. 2018

    Journal ref: IEEE Transactions on Mobile Computing, vol.17, no.10, pp. 2334-2348, Oct. 2018

  33. arXiv:1912.11822  [pdf, other

    cs.CV cs.MM eess.IV eess.SY

    An Ensemble Rate Adaptation Framework for Dynamic Adaptive Streaming Over HTTP

    Authors: Hui Yuan, Xiaoqian Hu, Junhui Hou, Xuekai Wei, Sam Kwong

    Abstract: Rate adaptation is one of the most important issues in dynamic adaptive streaming over HTTP (DASH). Due to the frequent fluctuations of the network bandwidth and complex variations of video content, it is difficult to deal with the varying network conditions and video content perfectly by using a single rate adaptation method. In this paper, we propose an ensemble rate adaptation framework for DAS… ▽ More

    Submitted 26 December, 2019; originally announced December 2019.

    Comments: This article has been accepted by IEEE Transactions on Broadcasting

  34. arXiv:1912.10653  [pdf, ps, other

    eess.IV

    Video Compression Coding via Colorization: A Generative Adversarial Network (GAN)-Based Approach

    Authors: Zhaoqing Pan, Feng Yuan, Jianjun Lei, Sam Kwong

    Abstract: Under the limited storage, computing and network bandwidth resources, the video compression coding technology plays an important role for visual communication. To efficiently compress raw video data, a colorization-based video compression coding method is proposed in this paper. In the proposed encoder, only the video luminance components are encoded and transmitted. To restore the video chrominan… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: This work has been submitted to IEEE TCSVT

  35. arXiv:1912.09675  [pdf, other

    cs.CV cs.MM eess.IV

    Spatial and Temporal Consistency-Aware Dynamic Adaptive Streaming for 360-Degree Videos

    Authors: Hui Yuan, Shiyun Zhao, Junhui Hou, Xuekai Wei, Sam Kwong

    Abstract: The 360-degree video allows users to enjoy the whole scene by interactively switching viewports. However, the huge data volume of the 360-degree video limits its remote applications via network. To provide high quality of experience (QoE) for remote web users, this paper presents a tile-based adaptive streaming method for 360-degree videos. First, we propose a simple yet effective rate adaptation… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 16 pages, This paper has been accepted by the IEEE Journal of Selected Topics in Signal Processing

  36. arXiv:1911.10566  [pdf, other

    eess.IV cs.CV

    Controllable List-wise Ranking for Universal No-reference Image Quality Assessment

    Authors: Fu-Zhao Ou, Yuan-Gen Wang, Jin Li, Guopu Zhu, Sam Kwong

    Abstract: No-reference image quality assessment (NR-IQA) has received increasing attention in the IQA community since reference image is not always available. Real-world images generally suffer from various types of distortion. Unfortunately, existing NR-IQA methods do not work with all types of distortion. It is a challenging task to develop universal NR-IQA that has the ability of evaluating all types of… ▽ More

    Submitted 5 January, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

  37. arXiv:1909.01341  [pdf, other

    eess.IV cs.CV

    Deep Coarse-to-fine Dense Light Field Reconstruction with Flexible Sampling and Geometry-aware Fusion

    Authors: Jing Jin, Junhui Hou, Jie Chen, Huanqiang Zeng, Sam Kwong, Jingyi Yu

    Abstract: A densely-sampled light field (LF) is highly desirable in various applications, such as 3-D reconstruction, post-capture refocusing and virtual reality. However, it is costly to acquire such data. Although many computational methods have been proposed to reconstruct a densely-sampled LF from a sparsely-sampled one, they still suffer from either low reconstruction quality, low computational efficie… ▽ More

    Submitted 26 September, 2020; v1 submitted 31 August, 2019; originally announced September 2019.

    Comments: 17 pages, 11 figures, 10 tables

  38. arXiv:1907.09640  [pdf, other

    cs.CV eess.IV

    Light Field Super-resolution via Attention-Guided Fusion of Hybrid Lenses

    Authors: Jing Jin, Junhui Hou, Jie Chen, Sam Kwong, Jingyi Yu

    Abstract: This paper explores the problem of reconstructing high-resolution light field (LF) images from hybrid lenses, including a high-resolution camera surrounded by multiple low-resolution cameras. To tackle this challenge, we propose a novel end-to-end learning-based approach, which can comprehensively utilize the specific characteristics of the input from two complementary and parallel perspectives. S… ▽ More

    Submitted 31 July, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: This paper was accepted by ACM MM 2020

  39. arXiv:1905.02001  [pdf, other

    eess.IV cs.MM

    Compressed Image Quality Assessment Based on Saak Features

    Authors: Xinfeng Zhang, Sam Kwong, C. -C. Jay Kuo

    Abstract: Compressed image quality assessment plays an important role in image services, especially in image compression applications, which can be utilized as a guidance to optimize image processing algorithms. In this paper, we propose an objective image quality assessment algorithm to measure the quality of compressed images. The proposed method utilizes a data-driven transform, Saak (Subspace approximat… ▽ More

    Submitted 16 May, 2019; v1 submitted 6 May, 2019; originally announced May 2019.