Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition

Cui, Xiaodong; Zhang, Wei; Finkler, Ulrich; Saon, George; Picheny, Michael; Kung, David

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2002.10502 (cs)

[Submitted on 24 Feb 2020]

Title:Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition

Authors:Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David Kung

View PDF

Abstract:The past decade has witnessed great progress in Automatic Speech Recognition (ASR) due to advances in deep learning. The improvements in performance can be attributed to both improved models and large-scale training data. Key to training such models is the employment of efficient distributed learning techniques. In this article, we provide an overview of distributed training techniques for deep neural network acoustic models for ASR. Starting with the fundamentals of data parallel stochastic gradient descent (SGD) and ASR acoustic modeling, we will investigate various distributed training strategies and their realizations in high performance computing (HPC) environments with an emphasis on striking the balance between communication and computation. Experiments are carried out on a popular public benchmark to study the convergence, speedup and recognition performance of the investigated strategies.

Comments:	Accepted to IEEE Signal Processing Magazine
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2002.10502 [cs.DC]
	(or arXiv:2002.10502v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2002.10502

Submission history

From: Xiaodong Cui [view email]
[v1] Mon, 24 Feb 2020 19:31:50 UTC (453 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators