TEMP3D: Temporally Continuous 3D Human Pose Estimation Under Occlusions

Lal, Rohit; Garg, Yash; Dutta, Arindam; Ta, Calvin-Khang; Raychaudhuri, Dripta S.; Asif, M. Salman; Roy-Chowdhury, Amit K.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.16221v1 (cs)

[Submitted on 24 Dec 2023 (this version), latest version 4 Dec 2024 (v4)]

Title:TEMP3D: Temporally Continuous 3D Human Pose Estimation Under Occlusions

Authors:Rohit Lal, Yash Garg, Arindam Dutta, Calvin-Khang Ta, Dripta S. Raychaudhuri, M. Salman Asif, Amit K. Roy-Chowdhury

View PDF HTML (experimental)

Abstract:Existing 3D human pose estimation methods perform remarkably well in both monocular and multi-view settings. However, their efficacy diminishes significantly in the presence of heavy occlusions, which limits their practical utility. For video sequences, temporal continuity can help infer accurate poses, especially in heavily occluded frames. In this paper, we aim to leverage this potential of temporal continuity through human motion priors, coupled with large-scale pre-training on 3D poses and self-supervised learning, to enhance 3D pose estimation in a given video sequence. This leads to a temporally continuous 3D pose estimate on unlabelled in-the-wild videos, which may contain occlusions, while exclusively relying on pre-trained 3D pose models. We propose an unsupervised method named TEMP3D that aligns a motion prior model on a given in-the-wild video using existing SOTA single image-based 3D pose estimation methods to give temporally continuous output under occlusions. To evaluate our method, we test it on the Occluded Human3.6M dataset, our custom-built dataset which contains significantly large (up to 100%) human body occlusions incorporated into the Human3.6M dataset. We achieve SOTA results on Occluded Human3.6M and the OcMotion dataset while maintaining competitive performance on non-occluded data. URL: this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.16221 [cs.CV]
	(or arXiv:2312.16221v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.16221

Submission history

From: Rohit Lal [view email]
[v1] Sun, 24 Dec 2023 11:05:10 UTC (12,476 KB)
[v2] Thu, 14 Mar 2024 03:36:00 UTC (19,721 KB)
[v3] Tue, 3 Dec 2024 18:39:15 UTC (15,052 KB)
[v4] Wed, 4 Dec 2024 10:25:18 UTC (15,052 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:TEMP3D: Temporally Continuous 3D Human Pose Estimation Under Occlusions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:TEMP3D: Temporally Continuous 3D Human Pose Estimation Under Occlusions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators