Skip to main content

Showing 1–3 of 3 results for author: Nasretdinov, R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.21198  [pdf, ps, other

    cs.SD eess.AS

    Universal Speech Enhancement with Regression and Generative Mamba

    Authors: Rong Chao, Rauf Nasretdinov, Yu-Chiang Frank Wang, Ante Jukić, Szu-Wei Fu, Yu Tsao

    Abstract: The Interspeech 2025 URGENT Challenge aimed to advance universal, robust, and generalizable speech enhancement by unifying speech enhancement tasks across a wide variety of conditions, including seven different distortion types and five languages. We present Universal Speech Enhancement Mamba (USEMamba), a state-space speech enhancement model designed to handle long-range sequence modeling, time-f… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Accepted to Interspeech 2025

  2. Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement

    Authors: Rauf Nasretdinov, Roman Korostik, Ante Jukić

    Abstract: In this work, we investigate application of generative speech enhancement to improve the robustness of ASR models in noisy and reverberant conditions. We employ a recently-proposed speech enhancement model based on Schrödinger bridge, which has been shown to perform well compared to diffusion-based approaches. We analyze the impact of model scaling and different sampling methods on the ASR perform… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 5 pages. Published in ICASSP 2025

    Journal ref: ICASSP 2025: IEEE International Conference on Acoustics, Speech and Signal Processing, Hyderabad, India, April 2025. ICASSP 2025: IEEE International Conference on Acoustics, Speech and Signal Processing, Hyderabad, India, April 2025

  3. arXiv:2208.07657  [pdf, other

    eess.AS cs.LG cs.SD

    Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition

    Authors: Andrei Andrusenko, Rauf Nasretdinov, Aleksei Romanenko

    Abstract: Optimization of modern ASR architectures is among the highest priority tasks since it saves many computational resources for model training and inference. The work proposes a new Uconv-Conformer architecture based on the standard Conformer model. It consistently reduces the input sequence length by 16 times, which results in speeding up the work of the intermediate layers. To solve the convergence… ▽ More

    Submitted 11 March, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 5 pages, 1 figure, accepted by ICASSP 2023