Showing 1–1 of 1 results for author: Ejigu, Y A

  1. arXiv:2503.18485  [pdf, other]

    cs.CL cs.LG

    Whispering in Amharic: Fine-tuning Whisper for Low-resource Language

    Authors: Dawit Ketema Gete, Bedru Yimam Ahmed, Tadesse Destaw Belay, Yohannes Ayana Ejigu, Sukairaj Hafiz Imam, Alemu Belay Tessema, Mohammed Oumer Adem, Tadesse Amare Belay, Robert Geislinger, Umma Aliyu Musa, Martin Semmann, Shamsuddeen Hassan Muhammad, Henning Schreiber, Seid Muhie Yimam

    Abstract: This work explores fine-tuning OpenAI's Whisper automatic speech recognition (ASR) model for Amharic, a low-resource language, to improve transcription accuracy. While the foundational Whisper model struggles with Amharic due to limited representation in its training data, we fine-tune it using datasets like Mozilla Common Voice, FLEURS, and the BDU-speech dataset. The best-performing model, Whisp…

    Submitted 28 March, 2025; v1 submitted 24 March, 2025; originally announced March 2025.