Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition

Gunasekara, Shanaka Ramesh; Li, Wanqing; Ogunbona, Philip; Yang, Jack

doi:10.1109/TBIOM.2025.3566212

Computer Science > Computer Vision and Pattern Recognition

arXiv:2505.23012 (cs)

[Submitted on 29 May 2025]

Title:Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition

Authors:Shanaka Ramesh Gunasekara, Wanqing Li, Philip Ogunbona, Jack Yang

View PDF HTML (experimental)

Abstract:Traditional approaches in unsupervised or self supervised learning for skeleton-based action classification have concentrated predominantly on the dynamic aspects of skeletal sequences. Yet, the intricate interaction between the moving and static elements of the skeleton presents a rarely tapped discriminative potential for action classification. This paper introduces a novel measurement, referred to as spatial-temporal joint density (STJD), to quantify such interaction. Tracking the evolution of this density throughout an action can effectively identify a subset of discriminative moving and/or static joints termed "prime joints" to steer self-supervised learning. A new contrastive learning strategy named STJD-CL is proposed to align the representation of a skeleton sequence with that of its prime joints while simultaneously contrasting the representations of prime and nonprime joints. In addition, a method called STJD-MP is developed by integrating it with a reconstruction-based framework for more effective learning. Experimental evaluations on the NTU RGB+D 60, NTU RGB+D 120, and PKUMMD datasets in various downstream tasks demonstrate that the proposed STJD-CL and STJD-MP improved performance, particularly by 3.5 and 3.6 percentage points over the state-of-the-art contrastive methods on the NTU RGB+D 120 dataset using X-sub and X-set evaluations, respectively.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2505.23012 [cs.CV]
	(or arXiv:2505.23012v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2505.23012
Journal reference:	IEEE Transactions on Biometrics, Behavior, and Identity Science (2025)
Related DOI:	https://doi.org/10.1109/TBIOM.2025.3566212

Submission history

From: Shanaka Ramesh [view email]
[v1] Thu, 29 May 2025 02:40:47 UTC (4,608 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators