Skip to main content

Showing 1–1 of 1 results for author: Sekoyan, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.13404  [pdf, other

    cs.CL eess.AS

    Granary: Speech Recognition and Translation Dataset in 25 European Languages

    Authors: Nithin Rao Koluguri, Monica Sekoyan, George Zelenfroynd, Sasha Meister, Shuoyang Ding, Sofia Kostandian, He Huang, Nikolay Karpov, Jagadeesh Balam, Vitaly Lavrukhin, Yifan Peng, Sara Papi, Marco Gaido, Alessio Brutti, Boris Ginsburg

    Abstract: Multi-task and multilingual approaches benefit large models, yet speech processing for low-resource languages remains underexplored due to data scarcity. To address this, we present Granary, a large-scale collection of speech datasets for recognition and translation across 25 European languages. This is the first open-source effort at this scale for both transcription and translation. We enhance d… ▽ More

    Submitted 21 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted at Interspeech 2025 v2: Added links