Efficient MDI Adaptation for n-gram Language Models

Huang, Ruizhe; Li, Ke; Arora, Ashish; Povey, Dan; Khudanpur, Sanjeev

arXiv:2008.02385 (cs)

[Submitted on 5 Aug 2020]

Abstract: This paper presents an efficient algorithm for n-gram language model adaptation under the minimum discrimination information (MDI) principle, in which an out-of-domain language model is adapted to satisfy marginal probability constraints derived from the in-domain data. The main challenge in MDI language model adaptation is its computational complexity. By exploiting the backoff structure of n-gram models together with the hierarchical training method originally proposed for maximum entropy (ME) language models, we show that each iteration of MDI adaptation can be computed in time linear in the size of the inputs. The complexity is the same as for ME models, even though MDI is more general than ME. This makes MDI adaptation practical for large corpora and vocabularies. Experimental results confirm the scalability of our algorithm on very large datasets. Compared with simple linear interpolation, MDI adaptation yields slightly worse perplexity but better word error rate.
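
As a toy illustration of the MDI principle named in the abstract, the sketch below adapts a small joint distribution over (history, word) pairs so that its word marginal matches an in-domain unigram distribution. It is not the paper's algorithm: it ignores backoff structure and hierarchical training, and it assumes a single set of word-marginal constraints, in which case the minimum-KL solution has a closed form and needs no iteration. The function names, the dictionary layout, and the numbers are hypothetical.

```python
import math


def word_marginal(joint):
    """Sum a joint distribution {(history, word): prob} over histories."""
    marginal = {}
    for (_, word), prob in joint.items():
        marginal[word] = marginal.get(word, 0.0) + prob
    return marginal


def mdi_adapt_joint(q_joint, p_in_unigram):
    """Return the distribution closest to q_joint in KL divergence whose
    word marginal equals p_in_unigram.

    Assumption: every word in q_joint has nonzero out-of-domain marginal
    and appears in p_in_unigram. With only word-marginal constraints the
    solution keeps q(history | word) and replaces q(word) by p_in(word).
    """
    q_w = word_marginal(q_joint)
    return {
        (hist, word): prob * p_in_unigram[word] / q_w[word]
        for (hist, word), prob in q_joint.items()
    }


def kl_divergence(p, q):
    """KL(p || q) for two distributions stored as dicts over the same keys."""
    return sum(pv * math.log(pv / q[key]) for key, pv in p.items() if pv > 0)


# Hypothetical out-of-domain joint model and in-domain unigram target.
q_joint = {("a", "x"): 0.4, ("a", "y"): 0.1, ("b", "x"): 0.2, ("b", "y"): 0.3}
p_in = {"x": 0.3, "y": 0.7}

adapted = mdi_adapt_joint(q_joint, p_in)
adapted_marginal = word_marginal(adapted)
```

After adaptation, `adapted_marginal` agrees with `p_in` up to floating-point error, and `kl_divergence(adapted, q_joint)` equals the KL divergence between the two word marginals, which is the smallest value attainable under the constraint. With overlapping n-gram constraints no such closed form exists, which is why iterative procedures are needed in the general case.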

Comments:	To appear in INTERSPEECH 2020. Appendix A of this full version will be filled soon
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2008.02385 [cs.CL]
	(or arXiv:2008.02385v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2008.02385

Submission history

From: Ruizhe Huang
[v1] Wed, 5 Aug 2020 22:21:03 UTC (61 KB)
