LASOR: Learning Accurate 3D Human Pose and Shape Via Synthetic Occlusion-Aware Data and Neural Mesh Rendering

Yang, Kaibing; Gu, Renshu; Wang, Maoyu; Toyoura, Masahiro; Xu, Gang

doi:10.1109/TIP.2022.3149229

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.00351 (cs)

[Submitted on 1 Aug 2021 (v1), last revised 29 Jan 2022 (this version, v5)]

Title:LASOR: Learning Accurate 3D Human Pose and Shape Via Synthetic Occlusion-Aware Data and Neural Mesh Rendering

Authors:Kaibing Yang, Renshu Gu, Maoyu Wang, Masahiro Toyoura, Gang Xu

View PDF

Abstract:A key challenge in the task of human pose and shape estimation is occlusion, including self-occlusions, object-human occlusions, and inter-person occlusions. The lack of diverse and accurate pose and shape training data becomes a major bottleneck, especially for scenes with occlusions in the wild. In this paper, we focus on the estimation of human pose and shape in the case of inter-person occlusions, while also handling object-human occlusions and self-occlusion. We propose a novel framework that synthesizes occlusion-aware silhouette and 2D keypoints data and directly regress to the SMPL pose and shape parameters. A neural 3D mesh renderer is exploited to enable silhouette supervision on the fly, which contributes to great improvements in shape estimation. In addition, keypoints-and-silhouette-driven training data in panoramic viewpoints are synthesized to compensate for the lack of viewpoint diversity in any existing dataset. Experimental results show that we are among the state-of-the-art on the 3DPW and 3DPW-Crowd datasets in terms of pose estimation accuracy. The proposed method evidently outperforms Mesh Transformer, 3DCrowdNet and ROMP in terms of shape estimation. Top performance is also achieved on SSP-3D in terms of shape prediction accuracy. Demo and code will be available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.00351 [cs.CV]
	(or arXiv:2108.00351v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.00351
Related DOI:	https://doi.org/10.1109/TIP.2022.3149229

Submission history

From: Kaibing Yang [view email]
[v1] Sun, 1 Aug 2021 02:09:16 UTC (27,072 KB)
[v2] Sun, 12 Dec 2021 13:18:59 UTC (42,331 KB)
[v3] Wed, 15 Dec 2021 09:29:57 UTC (42,331 KB)
[v4] Sun, 2 Jan 2022 05:18:50 UTC (42,331 KB)
[v5] Sat, 29 Jan 2022 12:36:13 UTC (41,245 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LASOR: Learning Accurate 3D Human Pose and Shape Via Synthetic Occlusion-Aware Data and Neural Mesh Rendering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LASOR: Learning Accurate 3D Human Pose and Shape Via Synthetic Occlusion-Aware Data and Neural Mesh Rendering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators