Landmark-based consonant voicing detection on multilingual corpora

Kong, Xiang; Yang, Xuesong; Hasegawa-Johnson, Mark; Choi, Jeung-Yoon; Shattuck-Hufnagel, Stefanie

doi:10.1121/1.4987203

arXiv:1611.03533 (cs)

[Submitted on 10 Nov 2016]

Abstract: This paper tests the hypothesis that distinctive feature classifiers anchored at phonetic landmarks can be transferred cross-lingually without loss of accuracy. Three consonant voicing classifiers were developed: (1) manually selected acoustic features anchored at a phonetic landmark, (2) MFCCs (either averaged across the segment or anchored at the landmark), and (3) acoustic features computed using a convolutional neural network (CNN). All detectors are trained on English data (TIMIT) and tested on English, Turkish, and Spanish (performance measured using F1 and accuracy). Experiments demonstrate that manual features outperform all MFCC classifiers, while CNN features outperform both. MFCC-based classifiers suffer an F1 reduction of 16% absolute when generalized from English to other languages. Manual features suffer only a 5% F1 reduction, and CNN features actually perform better in Turkish and Spanish than in the training language, demonstrating that features capable of representing long-term spectral dynamics (CNN and landmark-based features) are able to generalize cross-lingually with little or no loss of accuracy.
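
The abstract reports F1 and accuracy, and describes cross-lingual degradation as an absolute F1 reduction. As a minimal sketch of how those quantities relate, the Python below computes both metrics for a binary voiced/unvoiced detector and the absolute drop in percentage points between two test sets. The function names and the toy label lists are hypothetical, invented for illustration only; they are not the authors' code or data.

```python
from typing import Sequence, Tuple


def f1_and_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (F1, accuracy) for binary labels, where 1 = voiced and 0 = unvoiced."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # voiced, detected as voiced
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # unvoiced, detected as voiced
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # voiced, missed
    correct = sum(1 for t, p in pairs if t == p)
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return f1, correct / len(pairs)


def absolute_f1_drop(f1_source: float, f1_target: float) -> float:
    """Absolute F1 reduction, in percentage points, from source to target language."""
    return 100.0 * (f1_source - f1_target)


# Toy labels, invented for illustration only (NOT results from the paper).
english_true = [1, 1, 0, 0, 1, 0, 1, 0]
english_pred = [1, 1, 0, 0, 1, 0, 1, 1]
other_true = [1, 0, 1, 0, 1, 0, 1, 0]
other_pred = [1, 0, 0, 0, 1, 1, 1, 1]

f1_en, acc_en = f1_and_accuracy(english_true, english_pred)
f1_other, acc_other = f1_and_accuracy(other_true, other_pred)
drop = absolute_f1_drop(f1_en, f1_other)

print(f"source language: F1={f1_en:.3f}, accuracy={acc_en:.3f}")
print(f"target language: F1={f1_other:.3f}, accuracy={acc_other:.3f}")
print(f"absolute F1 reduction: {drop:.1f} points")
```

An absolute reduction is a difference of the two scores, not a ratio, so a fall from 0.90 to 0.74 would count as 16 points regardless of the starting value.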

Comments:	ready to submit to JASA-EL
Subjects:	Computation and Language (cs.CL); Sound (cs.SD)
Cite as:	arXiv:1611.03533 [cs.CL]
	(or arXiv:1611.03533v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1611.03533
Related DOI:	https://doi.org/10.1121/1.4987203

Submission history

From: Xiang Kong
[v1] Thu, 10 Nov 2016 22:11:16 UTC (92 KB)
