The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settings

Kucherenko, Taras; Nagy, Rajmund; Yoon, Youngwoo; Woo, Jieyeon; Nikolov, Teodor; Tsakov, Mihail; Henter, Gustav Eje

arXiv:2308.12646 (cs)

[Submitted on 24 Aug 2023]

Abstract: This paper reports on the GENEA Challenge 2023, in which participating teams built speech-driven gesture-generation systems using the same speech and motion dataset, followed by a joint evaluation. This year's challenge provided data on both sides of a dyadic interaction, allowing teams to generate full-body motion for an agent given its speech (text and audio) and the speech and motion of the interlocutor. We evaluated 12 submissions and 2 baselines together with held-out motion-capture data in several large-scale user studies. The studies focused on three aspects: 1) the human-likeness of the motion, 2) the appropriateness of the motion for the agent's own speech whilst controlling for the human-likeness of the motion, and 3) the appropriateness of the motion for the behaviour of the interlocutor in the interaction, using a setup that controls for both the human-likeness of the motion and the agent's own speech. We found a large span in human-likeness between challenge submissions, with a few systems rated close to human mocap. Appropriateness seems far from being solved, with most submissions performing in a narrow range slightly above chance, far behind natural motion. The effect of the interlocutor is even more subtle, with submitted systems at best performing barely above chance. Interestingly, a dyadic system being highly appropriate for agent speech does not necessarily imply high appropriateness for the interlocutor. Additional material is available via the project website.

Comments:	The first three authors made equal contributions. Accepted for publication at the ACM International Conference on Multimodal Interaction (ICMI)
Subjects:	Human-Computer Interaction (cs.HC); Graphics (cs.GR); Machine Learning (cs.LG)
ACM classes:	I.3; I.2
Cite as:	arXiv:2308.12646 [cs.HC]
	(or arXiv:2308.12646v1 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2308.12646

Submission history

From: Taras Kucherenko
[v1] Thu, 24 Aug 2023 08:42:06 UTC (5,622 KB)
