LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images

Rogez, Gregory; Weinzaepfel, Philippe; Schmid, Cordelia

doi:10.1109/TPAMI.2019.2892985

Computer Science > Computer Vision and Pattern Recognition

arXiv:1803.00455 (cs)

[Submitted on 1 Mar 2018 (v1), last revised 13 Jan 2019 (this version, v3)]

Title:LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images

Authors:Gregory Rogez, Philippe Weinzaepfel, Cordelia Schmid

View PDF

Abstract:We propose an end-to-end architecture for joint 2D and 3D human pose estimation in natural images. Key to our approach is the generation and scoring of a number of pose proposals per image, which allows us to predict 2D and 3D poses of multiple people simultaneously. Hence, our approach does not require an approximate localization of the humans for initialization. Our Localization-Classification-Regression architecture, named LCR-Net, contains 3 main components: 1) the pose proposal generator that suggests candidate poses at different locations in the image; 2) a classifier that scores the different pose proposals; and 3) a regressor that refines pose proposals both in 2D and 3D. All three stages share the convolutional feature layers and are trained jointly. The final pose estimation is obtained by integrating over neighboring pose hypotheses, which is shown to improve over a standard non maximum suppression algorithm. Our method recovers full-body 2D and 3D poses, hallucinating plausible body parts when the persons are partially occluded or truncated by the image boundary. Our approach significantly outperforms the state of the art in 3D pose estimation on Human3.6M, a controlled environment. Moreover, it shows promising results on real images for both single and multi-person subsets of the MPII 2D pose benchmark and demonstrates satisfying 3D pose results even for multi-person images.

Comments:	journal version of the CVPR 2017 paper, accepted to appear in IEEE Trans. PAMI
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1803.00455 [cs.CV]
	(or arXiv:1803.00455v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1803.00455
Related DOI:	https://doi.org/10.1109/TPAMI.2019.2892985

Submission history

From: Grégory Rogez [view email]
[v1] Thu, 1 Mar 2018 15:39:38 UTC (7,632 KB)
[v2] Sat, 13 Oct 2018 17:31:32 UTC (7,093 KB)
[v3] Sun, 13 Jan 2019 19:33:24 UTC (7,537 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators