Phoneme Level Language Models for Sequence Based Low Resource ASR

Dalmia, Siddharth; Li, Xinjian; Black, Alan W; Metze, Florian

Computer Science > Computation and Language

arXiv:1902.07613 (cs)

[Submitted on 20 Feb 2019]

Title:Phoneme Level Language Models for Sequence Based Low Resource ASR

Authors:Siddharth Dalmia, Xinjian Li, Alan W Black, Florian Metze

View PDF

Abstract:Building multilingual and crosslingual models help bring different languages together in a language universal space. It allows models to share parameters and transfer knowledge across languages, enabling faster and better adaptation to a new language. These approaches are particularly useful for low resource languages. In this paper, we propose a phoneme-level language model that can be used multilingually and for crosslingual adaptation to a target language. We show that our model performs almost as well as the monolingual models by using six times fewer parameters, and is capable of better adaptation to languages not seen during training in a low resource scenario. We show that these phoneme-level language models can be used to decode sequence based Connectionist Temporal Classification (CTC) acoustic model outputs to obtain comparable word error rates with Weighted Finite State Transducer (WFST) based decoding in Babel languages. We also show that these phoneme-level language models outperform WFST decoding in various low-resource conditions like adapting to a new language and domain mismatch between training and testing data.

Comments:	To appear in ICASSP 2019
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1902.07613 [cs.CL]
	(or arXiv:1902.07613v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1902.07613

Submission history

From: Siddharth Dalmia [view email]
[v1] Wed, 20 Feb 2019 16:00:12 UTC (175 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2019-02

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Siddharth Dalmia
Xinjian Li
Alan W. Black
Florian Metze

export BibTeX citation

Computer Science > Computation and Language

Title:Phoneme Level Language Models for Sequence Based Low Resource ASR

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Phoneme Level Language Models for Sequence Based Low Resource ASR

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators