Showing 1–1 of 1 results for author: Kashiwagi, S

  1. Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning

    Authors: Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima

    Abstract: This paper presents a novel metric learning approach to address the performance gap between normal and silent speech in visual speech recognition (VSR). The difference in lip movements between the two poses a challenge for existing VSR models, which exhibit degraded accuracy when applied to silent speech. To solve this issue and tackle the scarcity of training data for silent speech, we propose to…

    Submitted 16 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by INTERSPEECH 2023