On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks

Lakomkin, Egor; Zamani, Mohammad Ali; Weber, Cornelius; Magg, Sven; Wermter, Stefan

arXiv:1804.02173 (cs)

[Submitted on 6 Apr 2018]

Abstract: Speech emotion recognition (SER) is an important aspect of effective human-robot collaboration and has received considerable attention from the research community. For example, many neural network-based architectures have been proposed recently and have pushed performance to a new level. However, the applicability of such neural SER models, trained only on in-domain data, to noisy conditions is currently under-researched. In this work, we evaluate the robustness of state-of-the-art neural acoustic emotion recognition models in human-robot interaction scenarios. We hypothesize that a robot's ego noise, room conditions, and various acoustic events that can occur in a home environment can significantly affect the performance of a model. We conduct several experiments on the iCub robot platform and propose several novel ways to reduce the gap between the model's performance during training and testing in real-world conditions. Furthermore, we observe large improvements in model performance on the robot and demonstrate the necessity of introducing several data augmentation techniques, such as overlaying background noise and loudness variations, to improve the robustness of the neural approaches.
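The two augmentation techniques named in the abstract, overlaying background noise and varying loudness, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the SNR range of 0 to 20 dB, the gain range of -10 to +10 dB, and the assumption that waveforms are floating-point arrays scaled to [-1, 1] are all hypothetical choices made for illustration.

```python
import numpy as np


def overlay_noise(speech, noise, snr_db):
    """Mix a noise clip into speech at a target signal-to-noise ratio (dB)."""
    # Repeat the noise clip as needed, then trim it to the speech length.
    reps = int(np.ceil(len(speech) / len(noise)))
    noise = np.tile(noise, reps)[: len(speech)]
    speech_power = np.mean(speech ** 2)
    # Floor the noise power to avoid dividing by zero on silent noise clips.
    noise_power = max(np.mean(noise ** 2), 1e-12)
    # Choose scale so that speech_power / (scale**2 * noise_power) = 10**(snr_db / 10).
    scale = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return speech + scale * noise


def vary_loudness(speech, gain_db):
    """Apply a gain in decibels to the waveform amplitude."""
    return speech * 10 ** (gain_db / 20)


def augment(speech, noise, rng, snr_range=(0.0, 20.0), gain_range=(-10.0, 10.0)):
    """Apply both augmentations with randomly drawn parameters (assumed ranges)."""
    snr_db = rng.uniform(*snr_range)
    gain_db = rng.uniform(*gain_range)
    mixed = overlay_noise(speech, noise, snr_db)
    # Clip to the assumed [-1, 1] waveform range after applying the gain.
    return np.clip(vary_loudness(mixed, gain_db), -1.0, 1.0)
```

In a training loop, one would typically call `augment` on each utterance with a freshly sampled noise clip, for example recorded robot ego noise or household sounds, so that the model sees a different acoustic condition on every pass over the data.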

Comments:	Submitted to IROS'18, Madrid, Spain
Subjects:	Robotics (cs.RO); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1804.02173 [cs.RO]
	(or arXiv:1804.02173v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1804.02173

Submission history

From: Egor Lakomkin
[v1] Fri, 6 Apr 2018 09:03:29 UTC (5,016 KB)
