Dynamic Graph Neural ODE Network for Multi-modal Emotion Recognition in Conversation

Shou, Yuntao; Meng, Tao; Ai, Wei; Li, Keqin

arXiv:2412.02935 (cs)

[Submitted on 4 Dec 2024 (v1), last revised 27 Jan 2025 (this version, v2)]

Abstract: Multimodal emotion recognition in conversation (MERC) identifies and classifies human emotional states by combining data from multiple modalities (e.g., audio, images, text, and video). Most existing multimodal emotion recognition methods use graph convolutional networks (GCNs) to improve performance, but existing GCN methods are prone to overfitting and cannot capture the temporal dependency of speakers' emotions. To address these problems, we propose a Dynamic Graph Neural Ordinary Differential Equation Network (DGODE) for MERC, which models the dynamic changes of emotions to capture the temporal dependency of speakers' emotions and alleviates the overfitting problem of GCNs. Technically, the key idea of DGODE is to use an adaptive mixhop mechanism to improve the generalization ability of GCNs and a graph ODE evolution network to characterize the continuous dynamics of node representations over time and capture temporal dependencies. Extensive experiments on two publicly available multimodal emotion recognition datasets demonstrate that DGODE outperforms various baselines. Furthermore, DGODE also alleviates the over-smoothing problem, thereby enabling the construction of deep GCNs.
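The abstract names two ingredients, a mixhop mechanism and a graph ODE, without giving their equations. The sketch below is one plausible reading, not the authors' implementation: it assumes the dynamics dH/dt = sum_k w_k * A_hat^k * H - H, where A_hat is the symmetrically normalized adjacency with self-loops, and integrates them with fixed-step Euler. The function names, the fixed `hop_weights` (which an adaptive model would presumably learn), and the Euler solver are all assumptions made for illustration.

```python
# Illustrative sketch only. The abstract does not specify DGODE's equations;
# every name and the dynamics below are assumptions, not the paper's method.

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2} for an undirected graph."""
    n = len(adj)
    a = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    deg = [sum(row) for row in a]
    return [[a[i][j] / (deg[i] * deg[j]) ** 0.5 for j in range(n)]
            for i in range(n)]


def mixhop_derivative(a_hat, h, hop_weights):
    """Assumed dynamics: dH/dt = sum_k w_k * A_hat^k H - H, k = 1..K."""
    n, d = len(h), len(h[0])
    mixed = [[0.0] * d for _ in range(n)]
    power = h
    for w in hop_weights:
        power = matmul(a_hat, power)  # now holds A_hat^k H for the next hop
        for i in range(n):
            for j in range(d):
                mixed[i][j] += w * power[i][j]
    return [[mixed[i][j] - h[i][j] for j in range(d)] for i in range(n)]


def euler_graph_ode(adj, h0, hop_weights, t_end=1.0, steps=10):
    """Integrate the assumed graph ODE from t = 0 to t_end with Euler steps."""
    a_hat = normalize_adjacency(adj)
    dt = t_end / steps
    h = [row[:] for row in h0]
    for _ in range(steps):
        dh = mixhop_derivative(a_hat, h, hop_weights)
        h = [[h[i][j] + dt * dh[i][j] for j in range(len(h[0]))]
             for i in range(len(h))]
    return h


if __name__ == "__main__":
    # Three utterance nodes, fully connected; one feature per node.
    adj = [[0.0, 1.0, 1.0], [1.0, 0.0, 1.0], [1.0, 1.0, 0.0]]
    h0 = [[3.0], [0.0], [0.0]]
    print(euler_graph_ode(adj, h0, hop_weights=[0.5, 0.5]))
```

In this reading the integration time `t_end` plays the role that layer count plays in a discrete GCN: a longer horizon propagates information further across the conversation graph, while the hop weights control how much each neighborhood radius contributes.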

Comments:	13 pages, 6 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.02935 [cs.CL]
	(or arXiv:2412.02935v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.02935

Submission history

From: Yuntao Shou
[v1] Wed, 4 Dec 2024 01:07:59 UTC (2,718 KB)
[v2] Mon, 27 Jan 2025 02:01:59 UTC (2,718 KB)
