Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation

Doulaty, Mortaza; Saz, Oscar; Ng, Raymond W. M.; Hain, Thomas

doi:10.1109/ASRU.2015.7404785

Computer Science > Computation and Language

arXiv:1511.05076 (cs)

[Submitted on 16 Nov 2015]

Title:Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation

Authors:Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain

View PDF

Abstract:This paper presents a new method for the discovery of latent domains in diverse speech data, for the use of adaptation of Deep Neural Networks (DNNs) for Automatic Speech Recognition. Our work focuses on transcription of multi-genre broadcast media, which is often only categorised broadly in terms of high level genres such as sports, news, documentary, etc. However, in terms of acoustic modelling these categories are coarse. Instead, it is expected that a mixture of latent domains can better represent the complex and diverse behaviours within a TV show, and therefore lead to better and more robust performance. We propose a new method, whereby these latent domains are discovered with Latent Dirichlet Allocation, in an unsupervised manner. These are used to adapt DNNs using the Unique Binary Code (UBIC) representation for the LDA domains. Experiments conducted on a set of BBC TV broadcasts, with more than 2,000 shows for training and 47 shows for testing, show that the use of LDA-UBIC DNNs reduces the error up to 13% relative compared to the baseline hybrid DNN models.

Comments:	IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), 13-17 Dec 2015, Scottsdale, Arizona, USA
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1511.05076 [cs.CL]
	(or arXiv:1511.05076v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1511.05076
Related DOI:	https://doi.org/10.1109/ASRU.2015.7404785

Submission history

From: Mortaza Doulaty [view email]
[v1] Mon, 16 Nov 2015 18:25:33 UTC (454 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2015-11

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Mortaza Doulaty
Oscar Saz
Raymond W. M. Ng
Thomas Hain

export BibTeX citation

Computer Science > Computation and Language

Title:Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators