Advocating Character Error Rate for Multilingual ASR Evaluation

K, Thennal D; James, Jesin; Gopinath, Deepa P; K, Muhammed Ashraf

Computer Science > Computation and Language

arXiv:2410.07400 (cs)

[Submitted on 9 Oct 2024 (v1), last revised 18 Oct 2024 (this version, v2)]

Title:Advocating Character Error Rate for Multilingual ASR Evaluation

Authors:Thennal D K, Jesin James, Deepa P Gopinath, Muhammed Ashraf K

View PDF

Abstract:Automatic speech recognition (ASR) systems have traditionally been evaluated using English datasets, with the word error rate (WER) serving as the predominant metric. WER's simplicity and ease of interpretation have contributed to its widespread adoption, particularly for English. However, as ASR systems expand to multilingual contexts, WER fails in various ways, particularly with morphologically complex languages or those without clear word boundaries. Our work documents the limitations of WER as an evaluation metric and advocates for the character error rate (CER) as the primary metric in multilingual ASR evaluation. We show that CER avoids many of the challenges WER faces and exhibits greater consistency across writing systems. We support our proposition by conducting human evaluations of ASR transcriptions in three languages: Malayalam, English, and Arabic, which exhibit distinct morphological characteristics. We show that CER correlates more closely with human judgments than WER, even for English. To facilitate further research, we release our human evaluation dataset for future benchmarking of ASR metrics. Our findings suggest that CER should be prioritized, or at least supplemented, in multilingual ASR evaluations to account for the varying linguistic characteristics of different languages.

Comments:	4 pages
Subjects:	Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2410.07400 [cs.CL]
	(or arXiv:2410.07400v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.07400

Submission history

From: Thennal D K [view email]
[v1] Wed, 9 Oct 2024 19:57:07 UTC (297 KB)
[v2] Fri, 18 Oct 2024 15:54:56 UTC (293 KB)

Computer Science > Computation and Language

Title:Advocating Character Error Rate for Multilingual ASR Evaluation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Advocating Character Error Rate for Multilingual ASR Evaluation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators