ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition

Wan, Jun; Lin, Chi; Wen, Longyin; Li, Yunan; Miao, Qiguang; Escalera, Sergio; Anbarjafari, Gholamreza; Guyon, Isabelle; Guo, Guodong; Li, Stan Z.

doi:10.1109/TCYB.2020.3012092

Computer Science > Computer Vision and Pattern Recognition

arXiv:1907.12193 (cs)

[Submitted on 29 Jul 2019]

Title:ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition

Authors:Jun Wan, Chi Lin, Longyin Wen, Yunan Li, Qiguang Miao, Sergio Escalera, Gholamreza Anbarjafari, Isabelle Guyon, Guodong Guo, Stan Z. Li

View PDF

Abstract:The ChaLearn large-scale gesture recognition challenge has been run twice in two workshops in conjunction with the International Conference on Pattern Recognition (ICPR) 2016 and International Conference on Computer Vision (ICCV) 2017, attracting more than $200$ teams round the world. This challenge has two tracks, focusing on isolated and continuous gesture recognition, respectively. This paper describes the creation of both benchmark datasets and analyzes the advances in large-scale gesture recognition based on these two datasets. We discuss the challenges of collecting large-scale ground-truth annotations of gesture recognition, and provide a detailed analysis of the current state-of-the-art methods for large-scale isolated and continuous gesture recognition based on RGB-D video sequences. In addition to recognition rate and mean jaccard index (MJI) as evaluation metrics used in our previous challenges, we also introduce the corrected segmentation rate (CSR) metric to evaluate the performance of temporal segmentation for continuous gesture recognition. Furthermore, we propose a bidirectional long short-term memory (Bi-LSTM) baseline method, determining the video division points based on the skeleton points extracted by convolutional pose machine (CPM). Experiments demonstrate that the proposed Bi-LSTM outperforms the state-of-the-art methods with an absolute improvement of $8.1\%$ (from $0.8917$ to $0.9639$) of CSR.

Comments:	14 pages, 8 figures, 6 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1907.12193 [cs.CV]
	(or arXiv:1907.12193v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1907.12193
Journal reference:	IEEE Transactions on Cybernetics 2020
Related DOI:	https://doi.org/10.1109/TCYB.2020.3012092

Submission history

From: Jun Wan [view email]
[v1] Mon, 29 Jul 2019 03:09:40 UTC (7,224 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators