Position and Rotation Invariant Sign Language Recognition from 3D Kinect Data with Recurrent Neural Networks

Roy, Prasun; Bhattacharya, Saumik; Roy, Partha Pratim; Pal, Umapada

arXiv:2010.12669 (cs)

[Submitted on 23 Oct 2020 (v1), last revised 18 Feb 2025 (this version, v4)]

Abstract: Sign language is a gesture-based symbolic communication medium among speech- and hearing-impaired people. It also serves as a communication bridge between non-impaired and impaired populations. Unfortunately, in most situations, a non-impaired person is not well versed in such symbolic languages, which restricts the natural flow of information between the two groups. Therefore, an automated mechanism that seamlessly translates sign language into natural language can be highly advantageous. In this paper, we attempt to recognize 30 basic Indian sign gestures. Gestures are represented as temporal sequences of 3D maps (RGB + depth), each consisting of the 3D coordinates of 20 body joints captured by the Kinect sensor. A recurrent neural network (RNN) is employed as the classifier. To improve the classifier's performance, we use a geometric transformation to correct the alignment of depth frames. In our experiments, the model achieves 84.81% accuracy.
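The abstract does not spell out the geometric transformation, so the following is only a minimal sketch of one common way to make skeleton frames position and rotation invariant, not the authors' method. It assumes each frame is a list of (x, y, z) joint coordinates, translates a reference joint to the origin, and rotates about the vertical axis so that the shoulder line is parallel to the x-axis. The joint indices and the function names `normalize_frame` and `normalize_sequence` are hypothetical.

```python
import math

# Hypothetical joint indices; the real Kinect skeleton ordering may differ.
ORIGIN_JOINT = 0      # e.g. a torso joint used as the coordinate origin
LEFT_SHOULDER = 1
RIGHT_SHOULDER = 2


def normalize_frame(joints):
    """Return one frame of (x, y, z) joints in a body-centred coordinate system.

    Assumed scheme (not taken from the paper): move ORIGIN_JOINT to the
    origin, then rotate about the vertical (y) axis so that the vector from
    the left to the right shoulder points along +x.
    """
    ox, oy, oz = joints[ORIGIN_JOINT]
    centered = [(x - ox, y - oy, z - oz) for x, y, z in joints]

    lx, _, lz = centered[LEFT_SHOULDER]
    rx, _, rz = centered[RIGHT_SHOULDER]
    # Angle of the shoulder line within the horizontal x-z plane.
    theta = math.atan2(rz - lz, rx - lx)
    c, s = math.cos(theta), math.sin(theta)

    # Rotate every joint so the shoulder line has no z component.
    return [(c * x + s * z, y, -s * x + c * z) for x, y, z in centered]


def normalize_sequence(frames):
    """Apply the per-frame normalization to a whole gesture sequence."""
    return [normalize_frame(frame) for frame in frames]
```

Under these assumptions, two recordings of the same pose that differ only in where the signer stands and how they are turned about the vertical axis map to the same coordinates, which is the kind of input a sequence classifier such as an RNN could then consume.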

Comments:	10 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2010.12669 [cs.CV]
	(or arXiv:2010.12669v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2010.12669

Submission history

From: Prasun Roy
[v1] Fri, 23 Oct 2020 21:07:40 UTC (361 KB)
[v2] Tue, 14 Mar 2023 15:20:15 UTC (324 KB)
[v3] Sun, 16 Feb 2025 06:24:56 UTC (324 KB)
[v4] Tue, 18 Feb 2025 16:00:45 UTC (324 KB)
