ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems

Yuen, Daniel Hao Xian; Pang, Andrew Yong Chen; Yang, Zhou; Chong, Chun Yong; Lim, Mei Kuan; Lo, David


arXiv:2302.05582 (eess)

[Submitted on 11 Feb 2023]



Abstract: Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes ASDF, an Automated Speech Recognition Differential Testing Framework for testing ASR systems. ASDF extends an existing ASR testing tool, CrossASR++, which synthesizes test cases from a text corpus. However, CrossASR++ does not use the text corpus efficiently and provides limited information on how failed test cases can be used to improve ASR systems. To address these limitations, our tool incorporates two novel features: (1) a text transformation module that boosts the number of generated test cases and uncovers more errors in ASR systems, and (2) a phonetic analysis module that identifies the phonemes on which ASR systems tend to produce errors. ASDF generates more high-quality test cases by applying various text transformation methods (e.g., changing tense) to the texts in failed test cases. By doing so, ASDF can use a small text corpus to generate a large number of audio test cases, which CrossASR++ cannot do. In addition, ASDF implements more metrics to evaluate the performance of ASR systems from multiple perspectives. ASDF performs phonetic analysis on the failed test cases to identify the phonemes that ASR systems tend to transcribe incorrectly, providing useful information for developers to improve ASR systems. A demonstration video of our tool is available online at this https URL. The implementation is available at this https URL.
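The differential testing idea in the abstract can be illustrated with a minimal sketch. It is not ASDF's or CrossASR++'s actual code: the function names, the normalization rule, and the three-way labelling ("success", "failed", "indeterminable") are assumptions chosen for illustration. The sketch takes the source text of a synthesized audio test case and the transcriptions from several ASR systems, and marks a system as failed only when at least one other system transcribed the same audio correctly. Word error rate is included as one example of an evaluation metric; the abstract does not say which metrics ASDF implements.

```python
from typing import Dict


def normalize(text: str) -> str:
    """Lowercase and drop punctuation so formatting differences are not counted as errors.

    Assumption: this simple rule stands in for whatever normalization a real tool applies.
    """
    kept = "".join(ch for ch in text.lower() if ch.isalnum() or ch.isspace() or ch == "'")
    return " ".join(kept.split())


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance between the two texts, divided by the reference length."""
    ref = normalize(reference).split()
    hyp = normalize(hypothesis).split()
    if not ref:
        return 0.0 if not hyp else 1.0
    # Standard Levenshtein dynamic programme, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def classify_test_case(reference: str, transcriptions: Dict[str, str]) -> Dict[str, str]:
    """Label each ASR system's result for one test case (hypothetical labelling scheme).

    If no system reproduces the reference text, the audio itself may be at fault,
    so every system is labelled "indeterminable" rather than "failed".
    """
    target = normalize(reference)
    correct = {name: normalize(text) == target for name, text in transcriptions.items()}
    if not any(correct.values()):
        return {name: "indeterminable" for name in transcriptions}
    return {name: ("success" if ok else "failed") for name, ok in correct.items()}


if __name__ == "__main__":
    # Hypothetical system names and transcriptions, for illustration only.
    outputs = {"asr_a": "the cat sat", "asr_b": "the cat sad"}
    print(classify_test_case("The cat sat.", outputs))
    print(word_error_rate("The cat sat.", outputs["asr_b"]))
```

Under this assumed scheme, the texts of test cases labelled "failed" are the ones that would be passed on to text transformation and phonetic analysis.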

Comments:	Accepted by ICST 2023 Tool Demo Track
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD); Software Engineering (cs.SE)
Cite as:	arXiv:2302.05582 [eess.AS]
	(or arXiv:2302.05582v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2302.05582

Submission history

From: Zhou Yang [view email]
[v1] Sat, 11 Feb 2023 02:53:12 UTC (1,066 KB)
