-
arXiv:2505.02518 [pdf, ps, other]
Bemba Speech Translation: Exploring a Low-Resource African Language
Abstract: This paper describes our system submission to the International Conference on Spoken Language Translation (IWSLT 2025), low-resource languages track, namely for Bemba-to-English speech translation. We built cascaded speech translation systems based on Whisper and NLLB-200, and employed data augmentation techniques, such as back-translation. We investigate the effect of using synthetic data and dis… ▽ More
Submitted 2 June, 2025; v1 submitted 5 May, 2025; originally announced May 2025.
Comments: IWSLT 2025
Journal ref: Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025)
-
arXiv:2502.12050 [pdf, ps, other]
SpeechT: Findings of the First Mentorship in Speech Translation
Abstract: This work presents the details and findings of the first mentorship in speech translation (SpeechT), which took place in December 2024 and January 2025. To fulfil the mentorship requirements, the participants engaged in key activities, including data preparation, modelling, and advanced research. The participants explored data augmentation techniques and compared end-to-end and cascaded speech tra… ▽ More
Submitted 2 June, 2025; v1 submitted 17 February, 2025; originally announced February 2025.
Comments: MT Summit 2025