
Showing 1–50 of 68 results for author: Bovik, A

  1. arXiv:2506.22790  [pdf, ps, other]

    eess.IV cs.CV cs.MM

    ICME 2025 Generalizable HDR and SDR Video Quality Measurement Grand Challenge

    Authors: Yixu Chen, Bowen Chen, Hai Wei, Alan C. Bovik, Baojun Li, Wei Sun, Linhan Cao, Kang Fu, Dandan Zhu, Jun Jia, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Dounia Hammou, Fei Yin, Rafal Mantiuk, Amritha Premkumar, Prajit T Rajendran, Vignesh V Menon

    Abstract: This paper reports on the IEEE International Conference on Multimedia & Expo (ICME) 2025 Grand Challenge on Generalizable HDR and SDR Video Quality Measurement. With the rapid development of video technology, especially High Dynamic Range (HDR) and Standard Dynamic Range (SDR) contents, the need for robust and generalizable Video Quality Assessment (VQA) methods has grown. Existin…

    Submitted 28 June, 2025; originally announced June 2025.

    Comments: ICME 2025 Grand Challenges

  2. arXiv:2412.04508  [pdf, other]

    eess.IV cs.CV

    Video Quality Assessment: A Comprehensive Survey

    Authors: Qi Zheng, Yibo Fan, Leilei Huang, Tianyu Zhu, Jiaming Liu, Zhijian Hao, Shuo Xing, Chia-Ju Chen, Xiongkuo Min, Alan C. Bovik, Zhengzhong Tu

    Abstract: Video quality assessment (VQA) is an important processing task, aiming at predicting the quality of videos in a manner highly consistent with human judgments of perceived quality. Traditional VQA models based on natural image and/or video statistics, which are inspired both by models of projected images of the real world and by dual models of the human visual system, deliver only limited predictio…

    Submitted 11 December, 2024; v1 submitted 4 December, 2024; originally announced December 2024.

  3. arXiv:2410.08534  [pdf, other]

    cs.CV eess.IV

    Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities

    Authors: Abhijay Ghildyal, Yuanhan Chen, Saman Zadtootaghaj, Nabajeet Barman, Alan C. Bovik

    Abstract: The advent of AI has influenced many aspects of human life, from self-driving cars and intelligent chatbots to text-based image and video generation models capable of creating realistic images and videos based on user prompts (text-to-image, image-to-image, and image-to-video). AI-based methods for image and video super resolution, video frame interpolation, denoising, and compression have already…

    Submitted 19 October, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: "The abstract field cannot be longer than 1,920 characters", the abstract appearing here is slightly shorter than that in the PDF file

  4. Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality

    Authors: Yu-Chih Chen, Avinab Saha, Alexandre Chapiro, Christian Häne, Jean-Charles Bazin, Bo Qiu, Stefano Zanetti, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: We study the visual quality judgments of human subjects on digital human avatars (sometimes referred to as "holograms" in the parlance of virtual reality [VR] and augmented reality [AR] systems) that have been subjected to distortions. We also study the ability of video quality models to predict human judgments. As streaming human avatar videos in VR or AR become increasingly common, the need for…

    Submitted 2 October, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: Accepted to IEEE Transactions on Image Processing, 2024

  5. arXiv:2408.01932  [pdf, other]

    eess.IV

    Constructing Per-Shot Bitrate Ladders using Visual Information Fidelity

    Authors: Krishna Srikar Durbha, Alan C. Bovik

    Abstract: Adaptive video streaming allows for the construction of bitrate ladders that deliver perceptually optimized visual quality to viewers under bandwidth constraints. Two common approaches to adaptation are per-title encoding and per-shot encoding. The former involves encoding each program, movie, or other content in a manner that is perceptually- and bandwidth-optimized for that content but is otherw…

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: Under Review

  6. arXiv:2404.13484  [pdf, other]

    eess.IV cs.CV

    Joint Quality Assessment and Example-Guided Image Processing by Disentangling Picture Appearance from Content

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik

    Abstract: The deep learning revolution has strongly impacted low-level image processing tasks such as style/domain transfer, enhancement/restoration, and visual quality assessments. Despite often being treated separately, the aforementioned tasks share a common theme of understanding, editing, or enhancing the appearance of input images without modifying the underlying content. We leverage this observation…

    Submitted 20 April, 2024; originally announced April 2024.

  7. arXiv:2404.13452  [pdf, other]

    eess.IV cs.CV

    Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik

    Abstract: High Dynamic Range (HDR) videos have enjoyed a surge in popularity in recent years due to their ability to represent a wider range of contrast and color than Standard Dynamic Range (SDR) videos. Although HDR video capture has seen increasing popularity because of recent flagship mobile phones such as Apple iPhones, Google Pixels, and Samsung Galaxy phones, a broad swath of consumers still utilize…

    Submitted 20 April, 2024; originally announced April 2024.

  8. arXiv:2403.15061  [pdf, other]

    eess.IV cs.CV

    Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos

    Authors: Abhinau K. Venkataramanan, Alan C. Bovik

    Abstract: High Dynamic Range (HDR) videos are able to represent wider ranges of contrasts and colors than Standard Dynamic Range (SDR) videos, giving more vivid experiences. Due to this, HDR videos are expected to grow into the dominant video modality of the future. However, HDR videos are incompatible with existing SDR displays, which form the majority of affordable consumer displays on the market. Because…

    Submitted 22 March, 2024; originally announced March 2024.

  9. arXiv:2401.02794  [pdf, other]

    eess.IV cs.CV

    Subjective and Objective Analysis of Indian Social Media Video Quality

    Authors: Sandeep Mishra, Mukul Jha, Alan C. Bovik

    Abstract: We conducted a large-scale subjective study of the perceptual quality of User-Generated Mobile Video Content on a set of mobile-originated videos obtained from the Indian social media platform ShareChat. The content viewed by volunteer human subjects under controlled laboratory conditions has the benefit of culturally diversifying the existing corpus of User-Generated Content (UGC) video quality d…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Submitted to the IEEE Transactions on Image Processing

  10. arXiv:2312.08524  [pdf, other]

    eess.IV cs.CV

    A FUNQUE Approach to the Quality Assessment of Compressed HDR Videos

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: Recent years have seen steady growth in the popularity and availability of High Dynamic Range (HDR) content, particularly videos, streamed over the internet. As a result, assessing the subjective quality of HDR videos, which are generally subjected to compression, is of increasing importance. In particular, we target the task of full-reference quality assessment of compressed HDR videos. The state…

    Submitted 13 December, 2023; originally announced December 2023.

  11. arXiv:2312.07780  [pdf, other]

    eess.IV

    Bitrate Ladder Construction using Visual Information Fidelity

    Authors: Krishna Srikar Durbha, Hassene Tmar, Cosmin Stejerean, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: Recently proposed perceptually optimized per-title video encoding methods provide better BD-rate savings than fixed bitrate-ladder approaches that have been employed in the past. However, a disadvantage of per-title encoding is that it requires significant time and energy to compute bitrate ladders. Over the past few years, a variety of methods have been proposed to construct optimal bitrate ladde…

    Submitted 28 February, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: PCS 2024 Camera Ready Submission

  12. arXiv:2311.16372  [pdf, other]

    eess.IV

    Joint Deep Image Restoration and Unsupervised Quality Assessment

    Authors: Hakan Emre Gedik, Abhinau K. Venkataramanan, Alan C. Bovik

    Abstract: Deep learning techniques have revolutionized the fields of image restoration and image quality assessment in recent years. While image restoration methods typically utilize synthetically distorted training data for training, deep quality assessment models often require expensive labeled subjective data. However, recent studies have shown that activations of deep neural networks trained for visual…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 4 Pages, 2 figures, 3 tables

  13. arXiv:2311.15437  [pdf, ps, other]

    eess.IV cs.CV math.ST

    Quality Modeling Under A Relaxed Natural Scene Statistics Model

    Authors: Abhinau K. Venkataramanan, Alan C. Bovik

    Abstract: Information-theoretic image quality assessment (IQA) models such as Visual Information Fidelity (VIF) and Spatio-temporal Reduced Reference Entropic Differences (ST-RRED) have enjoyed great success by seamlessly integrating natural scene statistics (NSS) with information theory. The Gaussian Scale Mixture (GSM) model that governs the wavelet subband coefficients of natural images forms the foundat…

    Submitted 26 November, 2023; originally announced November 2023.

  14. arXiv:2311.11059  [pdf, other]

    cs.CV cs.MM eess.IV

    HIDRO-VQA: High Dynamic Range Oracle for Video Quality Assessment

    Authors: Shreshth Saini, Avinab Saha, Alan C. Bovik

    Abstract: We introduce HIDRO-VQA, a no-reference (NR) video quality assessment model designed to provide precise quality evaluations of High Dynamic Range (HDR) videos. HDR videos exhibit a broader spectrum of luminance, detail, and color than Standard Dynamic Range (SDR) videos. As HDR content becomes increasingly popular, there is a growing demand for video quality assessment (VQA) algorithms that effecti…

    Submitted 20 December, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: WACV 2024 Workshop Paper. Shreshth Saini, Avinab Saha contributed equally to this work

  15. Helping Visually Impaired People Take Better Quality Pictures

    Authors: Maniratnam Mandal, Deepti Ghadiyaram, Danna Gurari, Alan C. Bovik

    Abstract: Perception-based image analysis technologies can be used to help visually impaired people take better quality pictures by providing automated guidance, thereby empowering them to interact more confidently on social media. The photographs taken by visually impaired users often suffer from one or both of two kinds of quality issues: technical quality (distortions), and semantic quality, such as fram…

    Submitted 14 May, 2023; originally announced May 2023.

  16. arXiv:2305.02422  [pdf, other]

    eess.IV cs.CV cs.LG cs.MM

    GAMIVAL: Video Quality Prediction on Mobile Cloud Gaming Content

    Authors: Yu-Chih Chen, Avinab Saha, Chase Davis, Bo Qiu, Xiaoming Wang, Rahul Gowda, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: The mobile cloud gaming industry has been rapidly growing over the last decade. When streaming gaming videos are transmitted to customers' client devices from cloud servers, algorithms that can monitor distorted video quality without having any reference video available are desirable tools. However, creating No-Reference Video Quality Assessment (NR VQA) models that can accurately predict the qual…

    Submitted 29 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE SPL 2023. The implementation of GAMIVAL has been made available online: https://github.com/lskdream/GAMIVAL

    MSC Class: 68U10

    Journal ref: IEEE Signal Processing Letters, vol. 30, pp. 324-328, 2023

  17. arXiv:2304.13162  [pdf, other]

    eess.IV cs.CV cs.MM

    HDR or SDR? A Subjective and Objective Study of Scaled and Compressed Videos

    Authors: Joshua P. Ebenezer, Zaixi Shang, Yixu Chen, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik

    Abstract: We conducted a large-scale study of human perceptual quality judgments of High Dynamic Range (HDR) and Standard Dynamic Range (SDR) videos subjected to scaling and compression levels and viewed on three different display devices. HDR videos are able to present wider color gamuts, better contrasts, and brighter whites and darker blacks than SDR videos. While conventional expectations are that HDR q…

    Submitted 25 April, 2023; originally announced April 2023.

  18. arXiv:2304.13156  [pdf, other]

    eess.IV cs.CV

    HDR-ChipQA: No-Reference Quality Assessment on High Dynamic Range Videos

    Authors: Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik

    Abstract: We present a no-reference video quality model and algorithm that delivers standout performance for High Dynamic Range (HDR) videos, which we call HDR-ChipQA. HDR videos represent wider ranges of luminances, details, and colors than Standard Dynamic Range (SDR) videos. The growing adoption of HDR in massively scaled video networks has driven the need for video quality assessment (VQA) algorithms th…

    Submitted 25 April, 2023; originally announced April 2023.

  19. Making Video Quality Assessment Models Robust to Bit Depth

    Authors: Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik

    Abstract: We introduce a novel feature set, which we call HDRMAX features, that when included in Video Quality Assessment (VQA) algorithms designed for Standard Dynamic Range (SDR) videos, sensitizes them to distortions of High Dynamic Range (HDR) videos that are inadequately accounted for by these algorithms. While these features are not specific to HDR, and also augment the quality prediction performan…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Published in IEEE Signal Processing Letters 2023

  20. arXiv:2304.03412  [pdf, other]

    eess.IV

    One Transform To Compute Them All: Efficient Fusion-Based Full-Reference Video Quality Assessment

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: The Visual Multimethod Assessment Fusion (VMAF) algorithm has recently emerged as a state-of-the-art approach to video quality prediction that now pervades the streaming and social media industries. However, since VMAF requires the evaluation of a heterogeneous set of quality models, it is computationally expensive. Given other advances in hardware-accelerated encoding, quality assessment is emergi…

    Submitted 18 November, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Version 2

  21. arXiv:2209.10005  [pdf, other]

    eess.IV cs.CV

    Subjective Assessment of High Dynamic Range Videos Under Different Ambient Conditions

    Authors: Zaixi Shang, Joshua P. Ebenezer, Alan C. Bovik, Yongjun Wu, Hai Wei, Sriram Sethuraman

    Abstract: High Dynamic Range (HDR) videos can represent a much greater range of brightness and color than Standard Dynamic Range (SDR) videos and are rapidly becoming an industry standard. HDR videos have more challenging capture, transmission, and display requirements than legacy SDR videos. With their greater bit depth, advanced electro-optical transfer functions, and wider color gamuts, comes the need fo…

    Submitted 20 September, 2022; originally announced September 2022.

  22. arXiv:2207.09956  [pdf, other]

    cs.CV eess.IV

    Telepresence Video Quality Assessment

    Authors: Zhenqiang Ying, Deepti Ghadiyaram, Alan Bovik

    Abstract: Video conferencing, which includes both video and audio content, has contributed to dramatic increases in Internet traffic, as the COVID-19 pandemic forced millions of people to work and learn from home. Global Internet traffic of video conferencing has dramatically increased. Because of this, efficient and accurate video quality tools are needed to monitor and perceptually optimize telepresence tr…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  23. arXiv:2206.14713  [pdf, other]

    eess.IV cs.CV cs.MM

    CONVIQT: Contrastive Video Quality Estimator

    Authors: Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

    Abstract: Perceptual video quality assessment (VQA) is an integral component of many streaming and video sharing platforms. Here we consider the problem of learning perceptually relevant video quality representations in a self-supervised manner. Distortion type identification and degradation level determination are employed as an auxiliary task to train a deep learning model containing a deep Convolutional N…

    Submitted 29 June, 2022; originally announced June 2022.

  24. arXiv:2206.04877  [pdf, other]

    eess.IV cs.CV cs.LG

    Convex Hull Prediction for Adaptive Video Streaming by Recurrent Learning

    Authors: Somdyuti Paul, Andrey Norkin, Alan C. Bovik

    Abstract: Adaptive video streaming relies on the construction of efficient bitrate ladders to deliver the best possible visual quality to viewers under bandwidth constraints. The traditional method of content dependent bitrate ladder selection requires a video shot to be pre-encoded with multiple encoding parameters to find the optimal operating points given by the convex hull of the resulting rate-quality…

    Submitted 31 August, 2024; v1 submitted 10 June, 2022; originally announced June 2022.

  25. arXiv:2205.10501  [pdf, ps, other]

    eess.IV cs.CV cs.MM

    Making Video Quality Assessment Models Sensitive to Frame Rate Distortions

    Authors: Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

    Abstract: We consider the problem of capturing distortions arising from changes in frame rate as part of Video Quality Assessment (VQA). Variable frame rate (VFR) videos have become much more common, and streamed videos commonly range from 30 frames per second (fps) up to 120 fps. VFR-VQA offers unique challenges in terms of distortion types as well as in making non-uniform comparisons of reference and dist…

    Submitted 21 May, 2022; originally announced May 2022.

    Journal ref: IEEE Signal Processing Letters. 29 (2022) 897-901

  26. arXiv:2204.12022  [pdf, other]

    eess.IV cs.CV

    Estimating the Resize Parameter in End-to-end Learned Image Compression

    Authors: Li-Heng Chen, Christos G. Bampis, Zhi Li, Lukáš Krasula, Alan C. Bovik

    Abstract: We describe a search-free resizing framework that can further improve the rate-distortion tradeoff of recent learned image compression models. Our approach is simple: compose a pair of differentiable downsampling/upsampling layers that sandwich a neural compression model. To determine resize factors for different inputs, we utilize another neural network jointly trained with the compression model,…

    Submitted 25 April, 2022; originally announced April 2022.

  27. arXiv:2204.00128  [pdf, other]

    eess.IV cs.CV

    Perceptual Quality Assessment of UGC Gaming Videos

    Authors: Xiangxu Yu, Zhengzhong Tu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

    Abstract: In recent years, with the vigorous development of the video game industry, the proportion of gaming videos on major video websites like YouTube has dramatically increased. However, relatively little research has been done on the automatic quality prediction of gaming videos, especially on those that fall in the category of "User-Generated-Content" (UGC). Since current leading general-purpose Video…

    Submitted 13 April, 2022; v1 submitted 31 March, 2022; originally announced April 2022.

  28. arXiv:2203.16490  [pdf, other]

    eess.IV cs.CV

    Foveation-based Deep Video Compression without Motion Search

    Authors: Meixu Chen, Richard Webb, Alan C. Bovik

    Abstract: The requirements of much larger file sizes, different storage formats, and immersive viewing conditions of VR pose significant challenges to the goals of acquiring, transmitting, compressing, and displaying high-quality VR content. At the same time, the great potential of deep learning to advance progress on the video compression problem has driven a significant research effort. Because of the hig…

    Submitted 30 March, 2022; originally announced March 2022.

  29. arXiv:2203.12824  [pdf, other]

    cs.CV eess.IV

    Subjective and Objective Analysis of Streamed Gaming Videos

    Authors: Xiangxu Yu, Zhenqiang Ying, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

    Abstract: The rising popularity of online User-Generated-Content (UGC) in the form of streamed and shared videos has hastened the development of perceptual Video Quality Assessment (VQA) models, which can be used to help optimize their delivery. Gaming videos, which are a relatively new type of UGC videos, are created when skilled gamers post videos of their gameplay. These kinds of screenshots of UGC game…

    Submitted 23 March, 2022; originally announced March 2022.

  30. arXiv:2202.11241  [pdf, other]

    cs.CV eess.IV

    FUNQUE: Fusion of Unified Quality Evaluators

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Alan C. Bovik

    Abstract: Fusion-based quality assessment has emerged as a powerful method for developing high-performance quality models from quality models that individually achieve lower performances. A prominent example of such an algorithm is VMAF, which has been widely adopted as an industry standard for video quality prediction along with SSIM. In addition to advancing the state-of-the-art, it is imperative to allev…

    Submitted 6 July, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted at ICIP 2022

  31. arXiv:2201.02973  [pdf, other]

    eess.IV cs.CV

    MAXIM: Multi-Axis MLP for Image Processing

    Authors: Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li

    Abstract: Recent progress on Transformers and multi-layer perceptron (MLP) models provides new network architectural designs for computer vision tasks. Although these models proved to be effective in many vision tasks such as image recognition, there remain challenges in adapting them for low-level vision. The inflexibility to support high-resolution images and limitations of local attention are perhaps the…

    Submitted 1 April, 2022; v1 submitted 9 January, 2022; originally announced January 2022.

    Comments: CVPR 2022 Oral; Code: https://github.com/google-research/maxim

  32. arXiv:2201.01492  [pdf, other]

    eess.IV cs.CV

    FAVER: Blind Quality Prediction of Variable Frame Rate Videos

    Authors: Qi Zheng, Zhengzhong Tu, Pavan C. Madhusudana, Xiaoyang Zeng, Alan C. Bovik, Yibo Fan

    Abstract: Video quality assessment (VQA) remains an important and challenging problem that affects many applications at the widest scales. Recent advances in mobile devices and cloud computing techniques have made it possible to capture, process, and share high resolution, high frame rate (HFR) videos across the Internet nearly instantaneously. Being able to monitor and control the quality of these streamed…

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: 12 pages, 8 figures

  33. arXiv:2110.13266  [pdf, other]

    cs.CV cs.MM eess.IV

    Image Quality Assessment using Contrastive Learning

    Authors: Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

    Abstract: We consider the problem of obtaining image quality representations in a self-supervised manner. We use prediction of distortion type and degree as an auxiliary task to learn features from an unlabeled image dataset containing a mixture of synthetic and realistic distortions. We then train a deep Convolutional Neural Network (CNN) using a contrastive pairwise objective to solve the auxiliary proble…

    Submitted 25 October, 2021; originally announced October 2021.

    Journal ref: IEEE Transactions on Image Processing. 31 (2022) 4149 - 4161

  34. arXiv:2110.01805  [pdf, other]

    eess.IV cs.CV

    Self-Supervised Learning of Perceptually Optimized Block Motion Estimates for Video Compression

    Authors: Somdyuti Paul, Andrey Norkin, Alan C. Bovik

    Abstract: Block based motion estimation is integral to inter prediction processes performed in hybrid video codecs. Prevalent block matching based methods that are used to compute block motion vectors (MVs) rely on computationally intensive search procedures. They also suffer from the aperture problem, which can worsen as the block size is reduced. Moreover, the block matching criteria used in typical codec…

    Submitted 3 December, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    ACM Class: I.4

  35. High Frame Rate Video Quality Assessment using VMAF and Entropic Differences

    Authors: Pavan C Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

    Abstract: The popularity of streaming videos with live, high-action content has led to an increased interest in High Frame Rate (HFR) videos. In this work we address the problem of frame rate dependent Video Quality Assessment (VQA) when the videos to be compared have different frame rates and compression factors. The current VQA models such as VMAF have superior correlation with perceptual judgments when vid…

    Submitted 27 September, 2021; originally announced September 2021.

    Journal ref: 2021 Picture Coding Symposium (PCS)

  36. ChipQA: No-Reference Video Quality Prediction via Space-Time Chips

    Authors: Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik

    Abstract: We propose a new model for no-reference video quality assessment (VQA). Our approach uses a new idea of highly-localized space-time (ST) slices called Space-Time Chips (ST Chips). ST Chips are localized cuts of video data along directions that implicitly capture motion. We use perceptually-motivated bandpass and normalization models to first process the video data, and then select oriente…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: To appear in IEEE Transactions on Image Processing in Sep 2021

  37. FOVQA: Blind Foveated Video Quality Assessment

    Authors: Yize Jin, Anjul Patney, Richard Webb, Alan Bovik

    Abstract: Previous blind or No Reference (NR) video quality assessment (VQA) models largely rely on features drawn from natural scene statistics (NSS), but under the assumption that the image statistics are stationary in the spatial domain. Several of these models are quite successful on standard pictures. However, in Virtual Reality (VR) applications, foveated video compression is regaining attention, and…

    Submitted 24 June, 2021; originally announced June 2021.

  38. arXiv:2106.08431  [pdf, ps, other]

    eess.IV

    Assessment of Subjective and Objective Quality of Live Streaming Sports Videos

    Authors: Zaixi Shang, Joshua P. Ebenezer, Alan C. Bovik, Yongjun Wu, Hai Wei, Sriram Sethuraman

    Abstract: Video live streaming is gaining prevalence among video streaming services, especially for the delivery of popular sporting events. Many objective Video Quality Assessment (VQA) models have been developed to predict the perceptual quality of videos. Appropriate databases that exemplify the distortions encountered in live streaming videos are important to designing and learning objective VQA models.…

    Submitted 15 June, 2021; originally announced June 2021.

  39. arXiv:2106.06817  [pdf, other]

    eess.IV cs.CV

    Evaluating Foveated Video Quality Using Entropic Differencing

    Authors: Yize Jin, Anjul Patney, Alan Bovik

    Abstract: Virtual Reality is regaining attention due to recent advancements in hardware technology. Immersive images / videos are becoming widely adopted to carry omnidirectional visual information. However, due to the requirements for higher spatial and temporal resolution of real video data, immersive videos require significantly larger bandwidth consumption. To reduce stresses on bandwidth, foveated vide…

    Submitted 12 June, 2021; originally announced June 2021.

  40. arXiv:2105.09999  [pdf, other]

    eess.IV cs.MM

    Convolutional Block Design for Learned Fractional Downsampling

    Authors: Li-Heng Chen, Christos G. Bampis, Zhi Li, Chao Chen, Alan C. Bovik

    Abstract: The layers of convolutional neural networks (CNNs) can be used to alter the resolution of their inputs, but the scaling factors are limited to integer values. However, in many image and video processing applications, the ability to resize by a fractional factor would be advantageous. One example is conversion between resolutions standardized for video compression, such as from 1080p to 720p. To so…

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: 4 pages conference paper

  41. arXiv:2103.16771  [pdf]

    eess.IV

    Space-Time Video Regularity and Visual Fidelity: Compression, Resolution and Frame Rate Adaptation

    Authors: Dae Yeol Lee, Hyunsuk Ko, Jongho Kim, Alan C. Bovik

    Abstract: In order to deliver today's voluminous video content through limited bandwidth channels in a perceptually optimal way, it is important to consider perceptual trade-offs of compression and space-time downsampling protocols. In this direction, we have studied and developed new models of natural video statistics (NVS), which are useful because high-quality videos contain statist…

    Submitted 30 March, 2021; originally announced March 2021.

  42. arXiv:2102.00155  [pdf, other]

    cs.CV cs.MM eess.IV

    Regression or Classification? New Methods to Evaluate No-Reference Picture and Video Quality Models

    Authors: Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

    Abstract: Video and image quality assessment has long been projected as a regression problem, which requires predicting a continuous quality score given an input stimulus. However, recent efforts have shown that accurate quality score regression on real-world user-generated content (UGC) is a very challenging task. To make the problem more tractable, we propose two new methods - binary, and ordinal classifi…

    Submitted 30 January, 2021; originally announced February 2021.

    Comments: ICASSP2021

  43. A Subjective and Objective Study of Space-Time Subsampled Video Quality

    Authors: Dae Yeol Lee, Somdyuti Paul, Christos G. Bampis, Hyunsuk Ko, Jongho Kim, Se Yoon Jeong, Blake Homan, Alan C. Bovik

    Abstract: Video dimensions are continuously increasing to provide more realistic and immersive experiences to global streaming and social media viewers. However, increments in video parameters such as spatial resolution and frame rate are inevitably associated with larger data volumes. Transmitting increasingly voluminous videos through limited bandwidth networks in a perceptually optimal way is a current c…

    Submitted 29 January, 2021; originally announced February 2021.

  44. On the Space-Time Statistics of Motion Pictures

    Authors: Dae Yeol Lee, Hyunsuk Ko, Jongho Kim, Alan C. Bovik

    Abstract: It is well-known that natural images possess statistical regularities that can be captured by bandpass decomposition and divisive normalization processes that approximate early neural processing in the human visual system. We expand on these studies and present new findings on the properties of space-time natural statistics that are inherent in motion pictures. Our model relies on the concept of t…

    Submitted 29 January, 2021; originally announced January 2021.

  45. arXiv:2101.10955  [pdf, other]

    cs.CV cs.MM eess.IV

    RAPIQUE: Rapid and Accurate Video Quality Prediction of User Generated Content

    Authors: Zhengzhong Tu, Xiangxu Yu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

    Abstract: Blind or no-reference video quality assessment of user-generated content (UGC) has become a trending, challenging, heretofore unsolved problem. Accurate and efficient video quality predictors suitable for this content are thus in great demand to achieve more intelligent analysis and processing of UGC videos. Previous studies have shown that natural scene statistics and deep learning features are b…

    Submitted 14 November, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: IEEE Open Journal of Signal Processing 2021

  46. arXiv:2101.06354  [pdf, other]

    eess.IV cs.CV cs.MM

    A Hitchhiker's Guide to Structural Similarity

    Authors: Abhinau K. Venkataramanan, Chengyang Wu, Alan C. Bovik, Ioannis Katsavounidis, Zafar Shahid

    Abstract: The Structural Similarity (SSIM) Index is a very widely used image/video quality model that continues to play an important role in the perceptual evaluation of compression algorithms, encoding recipes and numerous other image/video processing algorithms. Several public implementations of the SSIM and Multiscale-SSIM (MS-SSIM) algorithms have been developed, which differ in efficiency and performan…

    Submitted 30 January, 2021; v1 submitted 15 January, 2021; originally announced January 2021.

    Comments: Submitted final version to IEEE Access on January 30, 2021

  47. arXiv:2010.13715  [pdf, other]

    cs.MM cs.CV eess.IV

    ST-GREED: Space-Time Generalized Entropic Differences for Frame Rate Dependent Video Quality Prediction

    Authors: Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

    Abstract: We consider the problem of conducting frame rate dependent video quality assessment (VQA) on videos of diverse frame rates, including high frame rate (HFR) videos. More generally, we study how perceptual quality is affected by frame rate, and how frame rate and compression combine to affect perceived quality. We devise an objective VQA model called Space-Time GeneRalized Entropic Difference (GREED…

    Submitted 26 September, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

    Journal ref: IEEE Transactions on Image Processing. 30 (2021) 7446 - 7457

  48. Learning to Compress Videos without Computing Motion

    Authors: Meixu Chen, Todd Goodall, Anjul Patney, Alan C. Bovik

    Abstract: With the development of higher resolution contents and displays, its significant volume poses significant challenges to the goals of acquiring, transmitting, compressing, and displaying high-quality video content. In this paper, we propose a new deep learning video compression architecture that does not require motion estimation, which is the most expensive element of modern hybrid video compressi…

    Submitted 26 March, 2022; v1 submitted 29 September, 2020; originally announced September 2020.

  49. Perceptual Video Quality Prediction Emphasizing Chroma Distortions

    Authors: Li-Heng Chen, Christos G. Bampis, Zhi Li, Joel Sole, Alan C. Bovik

    Abstract: Measuring the quality of digital videos viewed by human observers has become a common practice in numerous multimedia applications, such as adaptive video streaming, quality monitoring, and other digital TV applications. Here we explore a significant, yet relatively unexplored problem: measuring perceptual quality on videos arising from both luma and chroma distortions from compression. Toward inv…

    Submitted 24 September, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: 14 pages

  50. Adaptive Debanding Filter

    Authors: Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik

    Abstract: Banding artifacts, which manifest as staircase-like color bands on pictures or video frames, is a common distortion caused by compression of low-textured smooth regions. These false contours can be very noticeable even on high-quality videos, especially when displayed on high-definition screens. Yet, relatively little attention has been applied to this problem. Here we consider banding artifact re…

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: 4 pages, 7 figures, 1 table. Accepted to IEEE Signal Processing Letters