Skip to main content

Showing 1–6 of 6 results for author: de Wet, F

Searching in archive eess. Search in all archives.
.
  1. arXiv:2501.06478  [pdf, other

    eess.AS cs.CL cs.SD

    Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives

    Authors: Christiaan Jacobs, Annelien Smith, Daleen Klop, Ondřej Klejch, Febe de Wet, Herman Kamper

    Abstract: We develop automatic speech recognition (ASR) systems for stories told by Afrikaans and isiXhosa preschool children. Oral narratives provide a way to assess children's language development before they learn to read. We consider a range of prior child-speech ASR strategies to determine which is best suited to this unique setting. Using Whisper and only 5 minutes of transcribed in-domain child speec… ▽ More

    Submitted 11 January, 2025; originally announced January 2025.

    Comments: Accepted to ICASSP 2025

  2. arXiv:2011.03118  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages

    Authors: Trideba Padhi, Astik Biswas, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

    Abstract: In this work, we explore the benefits of using multilingual bottleneck features (mBNF) in acoustic modelling for the automatic speech recognition of code-switched (CS) speech in African languages. The unavailability of annotated corpora in the languages of interest has always been a primary challenge when developing speech recognition systems for this severely under-resourced type of speech. Hence… ▽ More

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: In Proceedings of The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities

    Journal ref: http://festvox.org/cedar/WSTCSMC2020.pdf

  3. arXiv:2004.06480  [pdf, other

    eess.AS cs.LG cs.SD

    Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech

    Authors: N. Wilkinson, A. Biswas, E. Yılmaz, F. de Wet, E. van der Westhuizen, T. R. Niesler

    Abstract: This paper considers the impact of automatic segmentation on the fully-automatic, semi-supervised training of automatic speech recognition (ASR) systems for five-lingual code-switched (CS) speech. Four automatic segmentation techniques were evaluated in terms of the recognition performance of an ASR system trained on the resulting segments in a semi-supervised manner. The system's output was compa… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: SLTU 2020. arXiv admin note: text overlap with arXiv:2003.03135

  4. arXiv:2004.04054  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Semi-supervised acoustic and language model training for English-isiZulu code-switched speech recognition

    Authors: A. Biswas, F. de Wet, E. van der Westhuizen, T. R. Niesler

    Abstract: We present an analysis of semi-supervised acoustic and language model training for English-isiZulu code-switched ASR using soap opera speech. Approximately 11 hours of untranscribed multilingual speech was transcribed automatically using four bilingual code-switching transcription systems operating in English-isiZulu, English-isiXhosa, English-Setswana and English-Sesotho. These transcriptions wer… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: 4th Code-Switch workshop, France

  5. arXiv:2003.03135  [pdf, other

    eess.AS cs.LG cs.SD

    Semi-supervised Development of ASR Systems for Multilingual Code-switched Speech in Under-resourced Languages

    Authors: Astik Biswas, Emre Yılmaz, Febe de Wet, Ewald van der Westhuizen, Thomas Niesler

    Abstract: This paper reports on the semi-supervised development of acoustic and language models for under-resourced, code-switched speech in five South African languages. Two approaches are considered. The first constructs four separate bilingual automatic speech recognisers (ASRs) corresponding to four different language pairs between which speakers switch frequently. The second uses a single, unified, fiv… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: Conference

    ACM Class: F.2.2; I.2.7

  6. arXiv:1906.08647  [pdf, other

    cs.CL cs.SD eess.AS

    Semi-supervised acoustic model training for five-lingual code-switched ASR

    Authors: Astik Biswas, Emre Yılmaz, Febe de Wet, Ewald van der Westhuizen, Thomas Niesler

    Abstract: This paper presents recent progress in the acoustic modelling of under-resourced code-switched (CS) speech in multiple South African languages. We consider two approaches. The first constructs separate bilingual acoustic models corresponding to language pairs (English-isiZulu, English-isiXhosa, English-Setswana and English-Sesotho). The second constructs a single unified five-lingual acoustic mode… ▽ More

    Submitted 15 October, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: Accepted for publication at Interspeech 2019