Toward a Dialogue System Using a Large Language Model to Recognize User Emotions with a Camera

Tanioka, Hiroki; Ueta, Tetsushi; Sano, Masahiko

Computer Science > Human-Computer Interaction

arXiv:2408.07982 (cs)

[Submitted on 15 Aug 2024 (v1), last revised 18 Feb 2025 (this version, v2)]

Title:Toward a Dialogue System Using a Large Language Model to Recognize User Emotions with a Camera

Authors:Hiroki Tanioka, Tetsushi Ueta, Masahiko Sano

View PDF HTML (experimental)

Abstract:The performance of ChatGPT© and other LLMs has improved tremendously, and in online environments, they are increasingly likely to be used in a wide variety of situations, such as ChatBot on web pages, call center operations using voice interaction, and dialogue functions using agents. In the offline environment, multimodal dialogue functions are also being realized, such as guidance by Artificial Intelligence agents (AI agents) using tablet terminals and dialogue systems in the form of LLMs mounted on robots. In this multimodal dialogue, mutual emotion recognition between the AI and the user will become important. So far, there have been methods for expressing emotions on the part of the AI agent or for recognizing them using textual or voice information of the user's utterances, but methods for AI agents to recognize emotions from the user's facial expressions have not been studied. In this study, we examined whether or not LLM-based AI agents can interact with users according to their emotional states by capturing the user in dialogue with a camera, recognizing emotions from facial expressions, and adding such emotion information to prompts. The results confirmed that AI agents can have conversations according to the emotional state for emotional states with relatively high scores, such as Happy and Angry.

Comments:	4 pages, 5 figures, 1 table, The 1st InterAI: Interactive AI for Human-Centered Robotics workshop in conjunction with IEEE Ro-MAN 2024, Pasadona, LA, USA, Aug. 2024
Subjects:	Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Robotics (cs.RO)
MSC classes:	68T40
ACM classes:	I.2.10; I.2.7
Cite as:	arXiv:2408.07982 [cs.HC]
	(or arXiv:2408.07982v2 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2408.07982
Journal reference:	The 1st InterAI Workshop: Interactive AI for Human-centered Robotics, 2024

Submission history

From: Hiroki Tanioka Dr [view email]
[v1] Thu, 15 Aug 2024 07:03:00 UTC (5,723 KB)
[v2] Tue, 18 Feb 2025 12:48:27 UTC (5,724 KB)

Computer Science > Human-Computer Interaction

Title:Toward a Dialogue System Using a Large Language Model to Recognize User Emotions with a Camera

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Human-Computer Interaction

Title:Toward a Dialogue System Using a Large Language Model to Recognize User Emotions with a Camera

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators