Skip to main content

Showing 1–7 of 7 results for author: Glembek, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2010.11718  [pdf, ps, other

    eess.AS cs.SD

    Analysis of the BUT Diarization System for VoxConverse Challenge

    Authors: Federico Landini, Ondřej Glembek, Pavel Matějka, Johan Rohdin, Lukáš Burget, Mireia Diez, Anna Silnova

    Abstract: This paper describes the system developed by the BUT team for the fourth track of the VoxCeleb Speaker Recognition Challenge, focusing on diarization on the VoxConverse dataset. The system consists of signal pre-processing, voice activity detection, speaker embedding extraction, an initial agglomerative hierarchical clustering followed by diarization using a Bayesian hidden Markov model, a reclust… ▽ More

    Submitted 9 February, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted to ICASSP 2021

  2. arXiv:2007.01359  [pdf, ps, other

    cs.CL

    A Bayesian Multilingual Document Model for Zero-shot Topic Identification and Discovery

    Authors: Santosh Kesiraju, Sangeet Sagar, Ondřej Glembek, Lukáš Burget, Ján Černocký, Suryakanth V Gangashetty

    Abstract: In this paper, we present a Bayesian multilingual document model for learning language-independent document embeddings. The model is an extension of BaySMM [Kesiraju et al 2020] to the multilingual scenario. It learns to represent the document embeddings in the form of Gaussian distributions, thereby encoding the uncertainty in its covariance. We propagate the learned uncertainties through linear… ▽ More

    Submitted 23 March, 2024; v1 submitted 2 July, 2020; originally announced July 2020.

  3. arXiv:1907.06112  [pdf, ps, other

    eess.AS cs.CL cs.SD

    BUT VOiCES 2019 System Description

    Authors: Hossein Zeinali, Pavel Matějka, Ladislav Mošner, Oldřich Plchot, Anna Silnova, Ondřej Novotný, Ján Profant, Ondřej Glembek, Lukáš Burget

    Abstract: This is a description of our effort in VOiCES 2019 Speaker Recognition challenge. All systems in the fixed condition are based on the x-vector paradigm with different features and DNN topologies. The single best system reaches 1.2% EER and a fusion of 3 systems yields 1.0% EER, which is 15% relative improvement. The open condition allowed us to use external data which we did for the PLDA adaptatio… ▽ More

    Submitted 13 July, 2019; originally announced July 2019.

  4. arXiv:1904.04235  [pdf, other

    eess.AS cs.SD

    Factorization of Discriminatively Trained i-vector Extractor for Speaker Recognition

    Authors: Ondrej Novotny, Oldrich Plchot, Ondrej Glembek, Lukas Burget

    Abstract: In this work, we continue in our research on i-vector extractor for speaker verification (SV) and we optimize its architecture for fast and effective discriminative training. We were motivated by computational and memory requirements caused by the large number of parameters of the original generative i-vector model. Our aim is to preserve the power of the original generative model, and at the same… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: Submitted to Interspeech 2019, Graz, Austria. arXiv admin note: substantial text overlap with arXiv:1810.13183

  5. arXiv:1811.07629  [pdf, other

    eess.AS cs.SD

    Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition

    Authors: Ondrej Novotny, Oldrich Plchot, Ondrej Glembek, Jan "Honza" Cernocky, Lukas Burget

    Abstract: In this work, we present an analysis of a DNN-based autoencoder for speech enhancement, dereverberation and denoising. The target application is a robust speaker verification (SV) system. We start our approach by carefully designing a data augmentation process to cover wide range of acoustic conditions and obtain rich training data for various components of our SV system. We augment several well-k… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

    Comments: 16 pages, 7 figures, Submission to Computer Speech and Language, special issue on Speaker and language characterization and recognition

  6. arXiv:1811.02938  [pdf, other

    eess.AS cs.SD

    On the use of DNN Autoencoder for Robust Speaker Recognition

    Authors: Ondrej Novotny, Oldrich Plchot, Pavel Matejka, Ondrej Glembek

    Abstract: In this paper, we present an analysis of a DNN-based autoencoder for speech enhancement, dereverberation and denoising. The target application is a robust speaker recognition system. We started with augmenting the Fisher database with artificially noised and reverberated data and we trained the autoencoder to map noisy and reverberated speech to its clean version. We use the autoencoder as a prepr… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

    Comments: 5 pages, 1 figure

  7. arXiv:1810.13183  [pdf, other

    eess.AS cs.SD

    Discriminatively Re-trained i-vector Extractor for Speaker Recognition

    Authors: Ondrej Novotny, Oldrich Plchot, Ondrej Glembek, Lukas Burget, Pavel Matejka

    Abstract: In this work we revisit discriminative training of the i-vector extractor component in the standard speaker verification (SV) system. The motivation of our research lies in the robustness and stability of this large generative model, which we want to preserve, and focus its power towards any intended SV task. We show that after generative initialization of the i-vector extractor, we can further re… ▽ More

    Submitted 31 October, 2018; originally announced October 2018.

    Comments: 5 pages, 1 figure, submitted to ICASSP 2019