Skip to main content

Showing 1–5 of 5 results for author: Kostek, B

.
  1. Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need

    Authors: Daniel Korzekwa, Jaime Lorenzo-Trueba, Thomas Drugman, Bozena Kostek

    Abstract: The research community has long studied computer-assisted pronunciation training (CAPT) methods in non-native speech. Researchers focused on studying various model architectures, such as Bayesian networks and deep learning methods, as well as on the analysis of different representations of the speech signal. Despite significant progress in recent years, existing CAPT methods are not able to detect… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: Published in Speech Communication Journal

  2. arXiv:2106.03494  [pdf, other

    eess.AS cs.LG

    Weakly-supervised word-level pronunciation error detection in non-native English speech

    Authors: Daniel Korzekwa, Jaime Lorenzo-Trueba, Thomas Drugman, Shira Calamaro, Bozena Kostek

    Abstract: We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted to Interspeech 2021

  3. arXiv:2101.06396  [pdf, other

    eess.AS cs.LG cs.SD

    Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling

    Authors: Daniel Korzekwa, Jaime Lorenzo-Trueba, Szymon Zaporowski, Shira Calamaro, Thomas Drugman, Bozena Kostek

    Abstract: A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do… ▽ More

    Submitted 8 February, 2021; v1 submitted 16 January, 2021; originally announced January 2021.

    Comments: Accepted to ICASSP 2021

  4. arXiv:2012.14788  [pdf, other

    eess.AS cs.SD

    Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention

    Authors: Daniel Korzekwa, Roberto Barra-Chicote, Szymon Zaporowski, Grzegorz Beringer, Jaime Lorenzo-Trueba, Alicja Serafinowicz, Jasha Droppo, Thomas Drugman, Bozena Kostek

    Abstract: This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learni… ▽ More

    Submitted 7 June, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: Accepted to Interspeech 2021

  5. arXiv:1907.04743  [pdf, other

    eess.AS cs.CL cs.SD

    Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

    Authors: Daniel Korzekwa, Roberto Barra-Chicote, Bozena Kostek, Thomas Drugman, Mateusz Lajszczak

    Abstract: This paper proposed a novel approach for the detection and reconstruction of dysarthric speech. The encoder-decoder model factorizes speech into a low-dimensional latent space and encoding of the input text. We showed that the latent space conveys interpretable characteristics of dysarthria, such as intelligibility and fluency of speech. MUSHRA perceptual test demonstrated that the adaptation of t… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: 5 pages, 5 figures, Accepted for Interspeech 2019