Skip to main content

Showing 1–18 of 18 results for author: Barman, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.15952  [pdf, ps, other

    cs.CV cs.AI

    VideoGameQA-Bench: Evaluating Vision-Language Models for Video Game Quality Assurance

    Authors: Mohammad Reza Taesiri, Abhijay Ghildyal, Saman Zadtootaghaj, Nabajeet Barman, Cor-Paul Bezemer

    Abstract: With video games now generating the highest revenues in the entertainment industry, optimizing game development workflows has become essential for the sector's sustained growth. Recent advancements in Vision-Language Models (VLMs) offer considerable potential to automate and enhance various aspects of game development, particularly Quality Assurance (QA), which remains one of the industry's most l… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Project website with code and data: https://asgaardlab.github.io/videogameqa-bench/

  2. arXiv:2503.04685  [pdf, ps, other

    cs.CL

    DIMSUM: Discourse in Mathematical Reasoning as a Supervision Module

    Authors: Krish Sharma, Niyar R Barman, Akshay Chaturvedi, Nicholas Asher

    Abstract: We look at reasoning on GSM8k, a dataset of short texts presenting primary school, math problems. We find, with Mirzadeh et al. (2024), that current LLM progress on the data set may not be explained by better reasoning but by exposure to a broader pretraining data distribution. We then introduce a novel information source for helping models with less data or inferior training reason better: discou… ▽ More

    Submitted 7 March, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

  3. arXiv:2410.08534  [pdf, other

    cs.CV eess.IV

    Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities

    Authors: Abhijay Ghildyal, Yuanhan Chen, Saman Zadtootaghaj, Nabajeet Barman, Alan C. Bovik

    Abstract: The advent of AI has influenced many aspects of human life, from self-driving cars and intelligent chatbots to text-based image and video generation models capable of creating realistic images and videos based on user prompts (text-to-image, image-to-image, and image-to-video). AI-based methods for image and video super resolution, video frame interpolation, denoising, and compression have already… ▽ More

    Submitted 19 October, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: "The abstract field cannot be longer than 1,920 characters", the abstract appearing here is slightly shorter than that in the PDF file

  4. arXiv:2409.16271  [pdf, other

    cs.CV

    AIM 2024 Challenge on UHD Blind Photo Quality Assessment

    Authors: Vlad Hosu, Marcos V. Conde, Lorenzo Agnolucci, Nabajeet Barman, Saman Zadtootaghaj, Radu Timofte

    Abstract: We introduce the AIM 2024 UHD-IQA Challenge, a competition to advance the No-Reference Image Quality Assessment (NR-IQA) task for modern, high-resolution photos. The challenge is based on the recently released UHD-IQA Benchmark Database, which comprises 6,073 UHD-1 (4K) images annotated with perceptual quality ratings from expert raters. Unlike previous NR-IQA datasets, UHD-IQA focuses on highly a… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: ECCV 2024 - Advances in Image Manipulation (AIM). arXiv admin note: text overlap with arXiv:2401.10511 by other authors

  5. arXiv:2409.07650  [pdf, other

    cs.CV

    Foundation Models Boost Low-Level Perceptual Similarity Metrics

    Authors: Abhijay Ghildyal, Nabajeet Barman, Saman Zadtootaghaj

    Abstract: For full-reference image quality assessment (FR-IQA) using deep-learning approaches, the perceptual similarity score between a distorted image and a reference image is typically computed as a distance measure between features extracted from a pretrained CNN or more recently, a Transformer network. Often, these intermediate features require further fine-tuning or processing with additional neural n… ▽ More

    Submitted 12 January, 2025; v1 submitted 11 September, 2024; originally announced September 2024.

    Comments: ICASSP 2025, Code: https://github.com/abhijay9/ZS-IQA

  6. arXiv:2408.17057  [pdf, other

    cs.CV cs.MM eess.IV

    LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model

    Authors: Nasim Jamshidi Avanaki, Abhijay Ghildyal, Nabajeet Barman, Saman Zadtootaghaj

    Abstract: Recent advancements in the field of No-Reference Image Quality Assessment (NR-IQA) using deep learning techniques demonstrate high performance across multiple open-source datasets. However, such models are typically very large and complex making them not so suitable for real-world deployment, especially on resource- and battery-constrained mobile devices. To address this limitation, we propose a c… ▽ More

    Submitted 6 September, 2024; v1 submitted 30 August, 2024; originally announced August 2024.

  7. arXiv:2408.16879  [pdf, other

    cs.CV cs.MM

    MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning

    Authors: Nasim Jamshidi Avanaki, Abhijay Ghildyal, Nabajeet Barman, Saman Zadtootaghaj

    Abstract: No-Reference Image Quality Assessment (NR-IQA) remains a challenging task due to the diversity of distortions and the lack of large annotated datasets. Many studies have attempted to tackle these challenges by developing more accurate NR-IQA models, often employing complex and computationally expensive networks, or by bridging the domain gap between various distortions to enhance performance on te… ▽ More

    Submitted 6 September, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

  8. arXiv:2408.10446  [pdf, other

    cs.CV cs.AI

    The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks

    Authors: Niyar R Barman, Krish Sharma, Ashhar Aziz, Shashwat Bajpai, Shwetangshu Biswas, Vasu Sharma, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: The rapid advancement of text-to-image generation systems, exemplified by models like Stable Diffusion, Midjourney, Imagen, and DALL-E, has heightened concerns about their potential misuse. In response, companies like Meta and Google have intensified their efforts to implement watermarking techniques on AI-generated images to curb the circulation of potentially misleading visuals. However, in this… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 23 pages and 10 figures

  9. arXiv:2404.16205  [pdf, other

    cs.CV cs.MM

    AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results

    Authors: Marcos V. Conde, Saman Zadtootaghaj, Nabajeet Barman, Radu Timofte, Chenlong He, Qi Zheng, Ruoxi Zhu, Zhengzhong Tu, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Zicheng Zhang, Haoning Wu, Yingjie Zhou, Chunyi Li, Xiaohong Liu, Weisi Lin, Guangtao Zhai , et al. (11 additional authors not shown)

    Abstract: This paper reviews the AIS 2024 Video Quality Assessment (VQA) Challenge, focused on User-Generated Content (UGC). The aim of this challenge is to gather deep learning-based methods capable of estimating the perceptual quality of UGC videos. The user-generated videos from the YouTube UGC Dataset include diverse content (sports, games, lyrics, anime, etc.), quality and resolutions. The proposed met… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Workshop -- AI for Streaming (AIS) Video Quality Assessment Challenge

  10. arXiv:2401.04039  [pdf, other

    cs.MM cs.IT eess.IV

    Bjøntegaard Delta (BD): A Tutorial Overview of the Metric, Evolution, Challenges, and Recommendations

    Authors: Nabajeet Barman, Maria G. Martini, Yuriy Reznik

    Abstract: The Bjøntegaard Delta (BD) method proposed in 2001 has become a popular tool for comparing video codec compression efficiency. It was initially proposed to compute bitrate and quality differences between two Rate-Distortion curves using PSNR as a distortion metric. Over the years, many works have calculated and reported BD results using other objective quality metrics such as SSIM, VMAF and, in so… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  11. arXiv:2310.05030  [pdf, other

    cs.CL cs.AI

    Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index

    Authors: Megha Chakraborty, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Krish Sharma, Niyar R Barman, Chandan Gupta, Shreya Gautam, Tanay Kumar, Vinija Jain, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: With the rise of prolific ChatGPT, the risk and consequences of AI-generated text has increased alarmingly. To address the inevitable question of ownership attribution for AI-generated artifacts, the US Copyright Office released a statement stating that 'If a work's traditional elements of authorship were produced by a machine, the work lacks human authorship and the Office will not register it'.… ▽ More

    Submitted 23 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main

  12. arXiv:2305.03138  [pdf, other

    cs.MM

    A Subjective Dataset for Multi-Screen Video Streaming Applications

    Authors: Nabajeet Barman, Yuriy Reznik, Maria G. Martini

    Abstract: In modern-era video streaming systems, videos are streamed and displayed on a wide range of devices. Such devices vary from large-screen UHD and HDTVs to medium-screen Desktop PCs and Laptops to smaller-screen devices such as mobile phones and tablets. It is well known that a video is perceived differently when displayed on different devices. The viewing experience for a particular video on smalle… ▽ More

    Submitted 22 June, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  13. arXiv:2305.02142  [pdf, other

    cs.MM

    Datasheet for Subjective and Objective Quality Assessment Datasets

    Authors: Nabajeet Barman, Yuriy Reznik, Maria Martini

    Abstract: Over the years, many subjective and objective quality assessment datasets have been created and made available to the research community. However, there is no standard process for documenting the various aspects of the dataset, such as details about the source sequences, number of test subjects, test methodology, encoding settings, etc. Such information is often of great importance to the users of… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  14. arXiv:2204.05580  [pdf, other

    cs.MM

    Codec Compression Efficiency Evaluation of MPEG-5 part 2 (LCEVC) using Objective and Subjective Quality Assessment

    Authors: Nabajeet Barman, Steven Schmidt, Saman Zadtootaghaj, Maria G Martini

    Abstract: With the increasing advancements in video compression efficiency achieved by newer codecs such as HEVC, AV1, and VVC, and intelligent encoding strategies, as well as improved bandwidth availability,there has been a proliferation and acceptance of newer services such as Netflix, Twitch, etc. However, such higher compression efficiencies are achieved at the cost of higher complexity and encoding del… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

  15. arXiv:2103.02189  [pdf, other

    cs.MM

    User Generated HDR Gaming Video Streaming: Dataset, Codec Comparison and Challenges

    Authors: Nabajeet Barman, Maria G Martini

    Abstract: Gaming video streaming services have grown tremendously in the past few years, with higher resolutions, higher frame rates and HDR gaming videos getting increasingly adopted among the gaming community. Since gaming content as such is different from non-gaming content, it is imperative to evaluate the performance of the existing encoders to help understand the bandwidth requirements of such service… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 14 pages, 8 figures, submitted to IEEE journal

  16. arXiv:1912.12467  [pdf

    cs.NI cs.MM eess.SP

    QoE Management of Multimedia Streaming Services in Future Networks: A Tutorial and Survey

    Authors: Alcardo Alex Barakabitze, Nabajeet Barman, Arslan Ahmad, Saman Zadtootaghaj, Lingfen Sun, Maria G. Martini, Luigi Atzori

    Abstract: We provide in this paper a tutorial and a comprehensive survey of QoE management solutions in current and future networks. We start with a high level description of QoE management for multimedia services, which integrates QoE modelling, monitoring, and optimization. This followed by a discussion of HTTP Adaptive Streaming (HAS) solutions as the dominant technique for streaming videos over the best… ▽ More

    Submitted 28 December, 2019; originally announced December 2019.

    Comments: 42 pages, 21 figures, 10 tables

  17. arXiv:1209.2903  [pdf

    cs.CV

    A Novel Approach of Harris Corner Detection of Noisy Images using Adaptive Wavelet Thresholding Technique

    Authors: Nilanjan Dey, Pradipti Nandi, Nilanjana Barman

    Abstract: In this paper we propose a method of corner detection for obtaining features which is required to track and recognize objects within a noisy image. Corner detection of noisy images is a challenging task in image processing. Natural images often get corrupted by noise during acquisition and transmission. Though Corner detection of these noisy images does not provide desired results, hence de-noisin… ▽ More

    Submitted 13 September, 2012; originally announced September 2012.

    Comments: 5 pages, 10 figures. arXiv admin note: substantial text overlap with arXiv:1209.1558

    Journal ref: International Journal of Computer Science & Technology(IJCST) Vol. 2, ISSUE 4, OCT. - DEC. 2011

  18. arXiv:1209.1558  [pdf

    cs.CV

    A Comparative Study between Moravec and Harris Corner Detection of Noisy Images Using Adaptive Wavelet Thresholding Technique

    Authors: Nilanjan Dey, Pradipti Nandi, Nilanjana Barman, Debolina Das, Subhabrata Chakraborty

    Abstract: In this paper a comparative study between Moravec and Harris Corner Detection has been done for obtaining features required to track and recognize objects within a noisy image. Corner detection of noisy images is a challenging task in image processing. Natural images often get corrupted by noise during acquisition and transmission. As Corner detection of these noisy images does not provide desired… ▽ More

    Submitted 7 September, 2012; originally announced September 2012.

    Comments: 8 pages, 13 figures

    Journal ref: International Journal of Engineering Research and Applications (IJERA) Vol. 2, Issue 1, Jan-Feb 2012, pp.599-606