Accelerator-Aware Training for Transducer-Based Speech Recognition

Shakiah, Suhaila M.; Swaminathan, Rupak Vignesh; Nguyen, Hieu Duy; Chinta, Raviteja; Afzal, Tariq; Susanj, Nathan; Mouchtaris, Athanasios; Strimel, Grant P.; Rastrow, Ariya

doi:10.1109/SLT54892.2023.10022592

Computer Science > Machine Learning

arXiv:2305.07778 (cs)

[Submitted on 12 May 2023]

Title:Accelerator-Aware Training for Transducer-Based Speech Recognition

Authors:Suhaila M. Shakiah, Rupak Vignesh Swaminathan, Hieu Duy Nguyen, Raviteja Chinta, Tariq Afzal, Nathan Susanj, Athanasios Mouchtaris, Grant P. Strimel, Ariya Rastrow

View PDF

Abstract:Machine learning model weights and activations are represented in full-precision during training. This leads to performance degradation in runtime when deployed on neural network accelerator (NNA) chips, which leverage highly parallelized fixed-point arithmetic to improve runtime memory and latency. In this work, we replicate the NNA operators during the training phase, accounting for the degradation due to low-precision inference on the NNA in back-propagation. Our proposed method efficiently emulates NNA operations, thus foregoing the need to transfer quantization error-prone data to the Central Processing Unit (CPU), ultimately reducing the user perceived latency (UPL). We apply our approach to Recurrent Neural Network-Transducer (RNN-T), an attractive architecture for on-device streaming speech recognition tasks. We train and evaluate models on 270K hours of English data and show a 5-7% improvement in engine latency while saving up to 10% relative degradation in WER.

Comments:	Accepted to SLT 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2305.07778 [cs.LG]
	(or arXiv:2305.07778v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.07778
Journal reference:	IEEE Spoken Language Technology Workshop (SLT), Doha, Qatar, 2023, pp. 100-107
Related DOI:	https://doi.org/10.1109/SLT54892.2023.10022592

Submission history

From: Suhaila Shakiah [view email]
[v1] Fri, 12 May 2023 21:49:51 UTC (564 KB)

Computer Science > Machine Learning

Title:Accelerator-Aware Training for Transducer-Based Speech Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Accelerator-Aware Training for Transducer-Based Speech Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators