ASR in German: A Detailed Error Analysis

Wirth, Johannes; Peinl, Rene

Computer Science > Computation and Language

arXiv:2204.05617 (cs)

[Submitted on 12 Apr 2022]

Title:ASR in German: A Detailed Error Analysis

Authors:Johannes Wirth, Rene Peinl

View PDF

Abstract:The amount of freely available systems for automatic speech recognition (ASR) based on neural networks is growing steadily, with equally increasingly reliable predictions. However, the evaluation of trained models is typically exclusively based on statistical metrics such as WER or CER, which do not provide any insight into the nature or impact of the errors produced when predicting transcripts from speech input. This work presents a selection of ASR model architectures that are pretrained on the German language and evaluates them on a benchmark of diverse test datasets. It identifies cross-architectural prediction errors, classifies those into categories and traces the sources of errors per category back into training data as well as other sources. Finally, it discusses solutions in order to create qualitatively better training datasets and more robust ASR systems.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
ACM classes:	C.4; I.2.7
Cite as:	arXiv:2204.05617 [cs.CL]
	(or arXiv:2204.05617v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2204.05617

Submission history

From: René Peinl [view email]
[v1] Tue, 12 Apr 2022 08:25:01 UTC (396 KB)

Computer Science > Computation and Language

Title:ASR in German: A Detailed Error Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ASR in German: A Detailed Error Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators