Skip to main content

Showing 1–1 of 1 results for author: Leal, S E

Searching in archive eess. Search in all archives.
.
  1. arXiv:2409.15350  [pdf, other

    eess.AS cs.CL

    A Large Dataset of Spontaneous Speech with the Accent Spoken in São Paulo for Automatic Speech Recognition Evaluation

    Authors: Rodrigo Lima, Sidney Evaldo Leal, Arnaldo Candido Junior, Sandra Maria Aluísio

    Abstract: We present a freely available spontaneous speech corpus for the Brazilian Portuguese language and report preliminary automatic speech recognition (ASR) results, using both the Wav2Vec2-XLSR-53 and Distil-Whisper models fine-tuned and trained on our corpus. The NURC-SP Audio Corpus comprises 401 different speakers (204 females, 197 males) with a total of 239.30 hours of transcribed audio recordings… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.