Quality Assurance of Generative Dialog Models in an Evolving Conversational Agent Used for Swedish Language Practice

Borg, Markus; Bengtsson, Johan; Österling, Harald; Hagelborn, Alexander; Gagner, Isabella; Tomaszewski, Piotr

Computer Science > Software Engineering

arXiv:2203.15414 (cs)

[Submitted on 29 Mar 2022]

Title:Quality Assurance of Generative Dialog Models in an Evolving Conversational Agent Used for Swedish Language Practice

Authors:Markus Borg, Johan Bengtsson, Harald Österling, Alexander Hagelborn, Isabella Gagner, Piotr Tomaszewski

View PDF

Abstract:Due to the migration megatrend, efficient and effective second-language acquisition is vital. One proposed solution involves AI-enabled conversational agents for person-centered interactive language practice. We present results from ongoing action research targeting quality assurance of proprietary generative dialog models trained for virtual job interviews. The action team elicited a set of 38 requirements for which we designed corresponding automated test cases for 15 of particular interest to the evolving solution. Our results show that six of the test case designs can detect meaningful differences between candidate models. While quality assurance of natural language processing applications is complex, we provide initial steps toward an automated framework for machine learning model selection in the context of an evolving conversational agent. Future work will focus on model selection in an MLOps setting.

Comments:	Accepted for publication in the Proc. of the 1st International Conference on AI Engineering, 2022
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2203.15414 [cs.SE]
	(or arXiv:2203.15414v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2203.15414

Submission history

From: Markus Borg [view email]
[v1] Tue, 29 Mar 2022 10:25:13 UTC (638 KB)

Computer Science > Software Engineering

Title:Quality Assurance of Generative Dialog Models in an Evolving Conversational Agent Used for Swedish Language Practice

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Quality Assurance of Generative Dialog Models in an Evolving Conversational Agent Used for Swedish Language Practice

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators