Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection

Flaborea, Alessandro; Collorone, Luca; D'Amely, Guido; D'Arrigo, Stefano; Prenkaj, Bardh; Galasso, Fabio

arXiv:2307.07205 (cs)

[Submitted on 14 Jul 2023 (v1), last revised 28 Aug 2023 (this version, v3)]

Abstract: Anomalies are rare and anomaly detection is often therefore framed as One-Class Classification (OCC), i.e. trained solely on normalcy. Leading OCC techniques constrain the latent representations of normal motions to limited volumes and detect as abnormal anything outside, which accounts satisfactorily for the openset'ness of anomalies. But normalcy shares the same openset'ness property since humans can perform the same action in several ways, which the leading techniques neglect. We propose a novel generative model for video anomaly detection (VAD), which assumes that both normality and abnormality are multimodal. We consider skeletal representations and leverage state-of-the-art diffusion probabilistic models to generate multimodal future human poses. We contribute a novel conditioning on the past motion of people and exploit the improved mode coverage capabilities of diffusion processes to generate different-but-plausible future motions. Upon the statistical aggregation of future modes, an anomaly is detected when the generated set of motions is not pertinent to the actual future. We validate our model on 4 established benchmarks: UBnormal, HR-UBnormal, HR-STC, and HR-Avenue, with extensive experiments surpassing state-of-the-art results.
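
The scoring idea in the abstract (sample several plausible futures conditioned on the past, aggregate them, and flag a sequence whose actual future does not match) can be illustrated with a minimal sketch. This is not the authors' implementation: `toy_future_sampler` is a hypothetical stand-in that perturbs the past with Gaussian noise instead of running a diffusion model, and the mean-squared-error metric, the median aggregation, and all parameter names are assumptions chosen for illustration.

```python
import random
import statistics
from typing import Callable, List, Sequence

Motion = List[float]  # flattened joint coordinates for a short window of frames


def toy_future_sampler(past: Motion, rng: random.Random, noise: float = 0.05) -> Motion:
    """Hypothetical stand-in for a conditioned generative model.

    It repeats the past window with small Gaussian noise; a real system
    would instead sample from a diffusion model conditioned on `past`.
    """
    return [x + rng.gauss(0.0, noise) for x in past]


def mean_squared_error(a: Sequence[float], b: Sequence[float]) -> float:
    """Average squared difference between two equally long sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def anomaly_score(
    past: Motion,
    actual_future: Motion,
    num_samples: int = 20,
    aggregate: Callable[[List[float]], float] = statistics.median,
    seed: int = 0,
) -> float:
    """Aggregate the errors between sampled futures and the observed future."""
    rng = random.Random(seed)
    errors = [
        mean_squared_error(toy_future_sampler(past, rng), actual_future)
        for _ in range(num_samples)
    ]
    return aggregate(errors)


past = [0.0, 0.1, 0.2, 0.3]
normal_future = [0.0, 0.1, 0.2, 0.3]      # continues the observed pattern
abnormal_future = [1.0, -1.0, 1.5, -0.5]  # departs sharply from it

normal_score = anomaly_score(past, normal_future)
abnormal_score = anomaly_score(past, abnormal_future)
```

A higher score means that none of the sampled futures resembles what actually happened. Passing `aggregate=min` instead would score a sequence by its closest sampled future, which is one of several plausible aggregation choices.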

Comments:	Accepted at ICCV2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2307.07205 [cs.CV]
	(or arXiv:2307.07205v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.07205

Submission history

From: Alessandro Flaborea
[v1] Fri, 14 Jul 2023 07:42:45 UTC (7,104 KB)
[v2] Sat, 19 Aug 2023 16:22:39 UTC (11,354 KB)
[v3] Mon, 28 Aug 2023 10:41:07 UTC (11,354 KB)
