Skip to main content

Showing 1–2 of 2 results for author: Irie, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2004.00960  [pdf, other

    eess.AS cs.SD

    The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with SpecAugment

    Authors: Wei Zhou, Wilfried Michel, Kazuki Irie, Markus Kitza, Ralf Schlüter, Hermann Ney

    Abstract: We present a complete training pipeline to build a state-of-the-art hybrid HMM-based ASR system on the 2nd release of the TED-LIUM corpus. Data augmentation using SpecAugment is successfully applied to improve performance on top of our best SAT model using i-vectors. By investigating the effect of different maskings, we achieve improvements from SpecAugment on hybrid HMM models without increasing… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: accepted at ICASSP 2020

  2. RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation

    Authors: Christoph Lüscher, Eugen Beck, Kazuki Irie, Markus Kitza, Wilfried Michel, Albert Zeyer, Ralf Schlüter, Hermann Ney

    Abstract: We present state-of-the-art automatic speech recognition (ASR) systems employing a standard hybrid DNN/HMM architecture compared to an attention-based encoder-decoder design for the LibriSpeech task. Detailed descriptions of the system development, including model design, pretraining schemes, training schedules, and optimization approaches are provided for both system architectures. Both hybrid DN… ▽ More

    Submitted 25 July, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: Proceedings of INTERSPEECH 2019