Showing 1–2 of 2 results for author: Fazel-Zarandi, M

Searching in archive eess.
  1. arXiv:2305.13516

    cs.CL cs.SD eess.AS

    Scaling Speech Technology to 1,000+ Languages

    Authors: Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli

    Abstract: Expanding the language coverage of speech technology has the potential to improve access to information for many more people. However, current speech technology is restricted to about one hundred languages which is a small fraction of the over 7,000 languages spoken around the world. The Massively Multilingual Speech (MMS) project increases the number of supported languages by 10-40x, depending on…

    Submitted 22 May, 2023; originally announced May 2023.

  2. arXiv:2303.11131

    cs.CL cs.LG cs.SD eess.AS

    Cocktail HuBERT: Generalized Self-Supervised Pre-training for Mixture and Single-Source Speech

    Authors: Maryam Fazel-Zarandi, Wei-Ning Hsu

    Abstract: Self-supervised learning leverages unlabeled data effectively, improving label efficiency and generalization to domains without labeled data. While recent work has studied generalization to more acoustic/linguistic domains, languages, and modalities, these investigations are limited to single-source speech with one primary speaker in the recording. This paper presents Cocktail HuBERT, a self-super…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023