Comparison and Analysis of New Curriculum Criteria for End-to-End ASR

Karakasidis, Georgios; Grósz, Tamás; Kurimo, Mikko

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2208.05782 (eess)

[Submitted on 10 Aug 2022]

Title:Comparison and Analysis of New Curriculum Criteria for End-to-End ASR

Authors:Georgios Karakasidis, Tamás Grósz, Mikko Kurimo

View PDF

Abstract:It is common knowledge that the quantity and quality of the training data play a significant role in the creation of a good machine learning model. In this paper, we take it one step further and demonstrate that the way the training examples are arranged is also of crucial importance. Curriculum Learning is built on the observation that organized and structured assimilation of knowledge has the ability to enable faster training and better comprehension. When humans learn to speak, they first try to utter basic phones and then gradually move towards more complex structures such as words and sentences. This methodology is known as Curriculum Learning, and we employ it in the context of Automatic Speech Recognition. We hypothesize that end-to-end models can achieve better performance when provided with an organized training set consisting of examples that exhibit an increasing level of difficulty (i.e. a curriculum). To impose structure on the training set and to define the notion of an easy example, we explored multiple scoring functions that either use feedback from an external neural network or incorporate feedback from the model itself. Empirical results show that with different curriculums we can balance the training times and the network's performance.

Comments:	5 pages, 2 figures, in Proceedings Interspeech 2022
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
ACM classes:	I.2.7; I.2.0
Cite as:	arXiv:2208.05782 [eess.AS]
	(or arXiv:2208.05782v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2208.05782

Submission history

From: Georgios Karakasidis [view email]
[v1] Wed, 10 Aug 2022 06:56:58 UTC (143 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Comparison and Analysis of New Curriculum Criteria for End-to-End ASR

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Comparison and Analysis of New Curriculum Criteria for End-to-End ASR

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators