A Classification Model Utilizing Facial Landmark Tracking to Determine Sentence Types for American Sign Language Recognition

Nguyen, Janice; Wang, Y. Curtis

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2211.12723 (eess)

[Submitted on 23 Nov 2022 (v1), last revised 11 May 2023 (this version, v3)]

Title:A Classification Model Utilizing Facial Landmark Tracking to Determine Sentence Types for American Sign Language Recognition

Authors:Janice Nguyen, Y. Curtis Wang

View PDF

Abstract:The deaf and hard of hearing community relies on American Sign Language (ASL) as their primary mode of communication, but communication with others who do not know ASL can be difficult, especially during emergencies where no interpreter is available. As an effort to alleviate this problem, research in computer vision based real time ASL interpreting models is ongoing. However, most of these models are hand shape (gesture) based and lack the integration of facial cues, which are crucial in ASL to convey tone and distinguish sentence types. Thus, the integration of facial cues in computer vision based ASL interpreting models has the potential to improve performance and reliability. In this paper, we introduce a simple, computationally efficient facial expression based classification model that can be used to improve ASL interpreting models. This model utilizes the relative angles of facial landmarks with principal component analysis and a Random Forest Classification tree model to classify frames taken from videos of ASL users signing a complete sentence. The model classifies the frames as statements or assertions. The model was able to achieve an accuracy of 86.5%.

Subjects:	Image and Video Processing (eess.IV)
Cite as:	arXiv:2211.12723 [eess.IV]
	(or arXiv:2211.12723v3 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2211.12723

Submission history

From: Janice Nguyen [view email]
[v1] Wed, 23 Nov 2022 06:09:03 UTC (6,610 KB)
[v2] Tue, 21 Feb 2023 04:15:31 UTC (6,610 KB)
[v3] Thu, 11 May 2023 17:13:10 UTC (1,715 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:A Classification Model Utilizing Facial Landmark Tracking to Determine Sentence Types for American Sign Language Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:A Classification Model Utilizing Facial Landmark Tracking to Determine Sentence Types for American Sign Language Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators