Showing 1–1 of 1 results for author: Dinh, Q M

Searching in archive eess.
  1. arXiv:2505.00059

    cs.CL cs.SD eess.AS

    BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech Recognition

    Authors: Paige Tuttösí, Mantaj Dhillon, Luna Sang, Shane Eastwood, Poorvi Bhatia, Quang Minh Dinh, Avni Kapoor, Yewon Jin, Angelica Lim

    Abstract: Some speech recognition tasks, such as automatic speech recognition (ASR), are approaching or have reached human performance in many reported metrics. Yet, they continue to struggle in complex, real-world, situations, such as with distanced speech. Previous challenges have released datasets to address the issue of distanced ASR, however, the focus remains primarily on distance, specifically relyin…

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: Accepted to Computer Speech and Language, Special issue: Multi-Speaker, Multi-Microphone, and Multi-Modal Distant Speech Recognition (September 2025)