Showing 1–1 of 1 results for author: Maximenko, A

  1. arXiv:2506.01192

    eess.AS cs.SD

    GigaAM: Efficient Self-Supervised Learner for Speech Recognition

    Authors: Aleksandr Kutsakov, Alexandr Maximenko, Georgii Gospodinov, Pavel Bogomolov, Fyodor Minkin

    Abstract: Self-Supervised Learning (SSL) has demonstrated strong performance in speech processing, particularly in automatic speech recognition. In this paper, we explore an SSL pretraining framework that leverages masked language modeling with targets derived from a speech recognition model. We also present chunkwise attention with dynamic chunk size sampling during pretraining to enable both full-context…

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: Accepted to Interspeech 2025