Showing 1–3 of 3 results for author: Bondaruk, Ł

  1. arXiv:2502.07562

    cs.SD cs.AI eess.AS

    LoRP-TTS: Low-Rank Personalized Text-To-Speech

    Authors: Łukasz Bondaruk, Jakub Kubiak

    Abstract: Speech synthesis models convert written text into natural-sounding audio. While earlier models were limited to a single speaker, recent advancements have led to the development of zero-shot systems that generate realistic speech from a wide range of speakers using their voices as additional prompts. However, they still struggle with imitating non-studio-quality samples that differ significantly fr…

    Submitted 11 February, 2025; originally announced February 2025.

  2. arXiv:2410.22903

    eess.AS cs.SD

    Augmenting Polish Automatic Speech Recognition System With Synthetic Data

    Authors: Łukasz Bondaruk, Jakub Kubiak, Mateusz Czyżnikiewicz

    Abstract: This paper presents a system developed for submission to PolEval 2024, Task 3: Polish Automatic Speech Recognition Challenge. We describe a Voicebox-based speech synthesis pipeline and use it to augment Conformer and Whisper speech recognition models with synthetic data. We show that adding synthetic speech to training significantly improves the results. We also present final results a…

    Submitted 30 October, 2024; originally announced October 2024.

  3. Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech

    Authors: Mateusz Czyżnikiewicz, Łukasz Bondaruk, Jakub Kubiak, Adam Wiącek, Łukasz Degórski, Marek Kubis, Paweł Skórzewski

    Abstract: In this paper, we study the impact of augmenting spoken language corpora with domain-specific synthetic samples for the purpose of training a speech recognition system. Using both a conventional neural TTS system and a zero-shot one with voice cloning ability, we generate speech corpora that vary in the number of voices. We compare speech recognition models trained with the addition of different amounts…

    Submitted 29 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to FedCSIS 2024