Showing 1–1 of 1 results for author: Chowdhury, F A R R

Search v0.5.6 released 2020-02-24

arXiv:1710.10470 [pdf, other]

eess.AS cs.LG cs.SD stat.ML

Attention-Based Models for Text-Dependent Speaker Verification

Authors: F A Rezaur Rahman Chowdhury, Quan Wang, Ignacio Lopez Moreno, Li Wan

Abstract: Attention-based models have recently shown great performance on a range of tasks, such as speech recognition, machine translation, and image captioning due to their ability to summarize relevant information that expands through the entire length of an input sequence. In this paper, we analyze the usage of attention mechanisms to the problem of sequence summarization in our end-to-end text-dependen… ▽ More Attention-based models have recently shown great performance on a range of tasks, such as speech recognition, machine translation, and image captioning due to their ability to summarize relevant information that expands through the entire length of an input sequence. In this paper, we analyze the usage of attention mechanisms to the problem of sequence summarization in our end-to-end text-dependent speaker recognition system. We explore different topologies and their variants of the attention layer, and compare different pooling methods on the attention weights. Ultimately, we show that attention-based models can improves the Equal Error Rate (EER) of our speaker verification system by relatively 14% compared to our non-attention LSTM baseline model. △ Less

Submitted 31 January, 2018; v1 submitted 28 October, 2017; originally announced October 2017.

Comments: Submitted to ICASSP 2018

Search v0.5.6 released 2020-02-24