UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark

Abdullah, Hasnat Md; Liu, Tian; Wei, Kangda; Kong, Shu; Huang, Ruihong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.01180 (cs)

[Submitted on 2 Oct 2024]

Title:UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark

Authors:Hasnat Md Abdullah, Tian Liu, Kangda Wei, Shu Kong, Ruihong Huang

View PDF HTML (experimental)

Abstract:Localizing unusual activities, such as human errors or surveillance incidents, in videos holds practical significance. However, current video understanding models struggle with localizing these unusual events likely because of their insufficient representation in models' pretraining datasets. To explore foundation models' capability in localizing unusual activity, we introduce UAL-Bench, a comprehensive benchmark for unusual activity localization, featuring three video datasets: UAG-OOPS, UAG-SSBD, UAG-FunQA, and an instruction-tune dataset: OOPS-UAG-Instruct, to improve model capabilities. UAL-Bench evaluates three approaches: Video-Language Models (Vid-LLMs), instruction-tuned Vid-LLMs, and a novel integration of Vision-Language Models and Large Language Models (VLM-LLM). Our results show the VLM-LLM approach excels in localizing short-span unusual events and predicting their onset (start time) more accurately than Vid-LLMs. We also propose a new metric, R@1, TD <= p, to address limitations in existing evaluation methods. Our findings highlight the challenges posed by long-duration videos, particularly in autism diagnosis scenarios, and the need for further advancements in localization techniques. Our work not only provides a benchmark for unusual activity localization but also outlines the key challenges for existing foundation models, suggesting future research directions on this important task.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
Cite as:	arXiv:2410.01180 [cs.CV]
	(or arXiv:2410.01180v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.01180
Journal reference:	wacv(2025) 5801-5811

Submission history

From: Hasnat Md Abdullah [view email]
[v1] Wed, 2 Oct 2024 02:33:09 UTC (536 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators