EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models

Kim, Soowon; Jo, Ha-Na; Ko, Eunyeong

arXiv:2411.09302 (cs)

[Submitted on 14 Nov 2024]

Abstract: In this study, we propose an ensemble learning framework for electroencephalogram (EEG)-based overt speech classification that leverages denoising diffusion probabilistic models with varying convolutional kernel sizes. The ensemble comprises three models with kernel sizes of 51, 101, and 201, which capture the multi-scale temporal features inherent in EEG signals. This approach improves the robustness and accuracy of speech decoding by accommodating the rich temporal complexity of neural signals. The ensemble models work in conjunction with conditional autoencoders that refine the reconstructed signals and maximize the information available to downstream classification tasks. The results indicate that the proposed ensemble-based approach significantly outperforms both the individual models and existing state-of-the-art techniques. These findings demonstrate the potential of ensemble methods for advancing brain signal decoding and open new possibilities for non-verbal communication, particularly in brain-computer interface systems that aid individuals with speech impairments.
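The multi-kernel idea in the abstract can be illustrated with a minimal sketch. This is not the authors' diffusion model or conditional autoencoder. It only shows, under stated assumptions, how three temporal kernels of sizes 51, 101, and 201 view the same signal at different scales, and how per-model class scores might be combined. The uniform (moving-average) kernel stands in for learned filter weights, and averaging class probabilities (soft voting) is an assumed combination rule, since the abstract does not say how the ensemble members are merged. The names `moving_average`, `multi_scale_features`, and `soft_vote` are hypothetical.

```python
import math

# Kernel sizes named in the abstract.
KERNEL_SIZES = (51, 101, 201)


def moving_average(signal, kernel_size):
    """Zero-padded 'same' 1-D convolution with a uniform kernel.

    Assumption: the uniform kernel is a stand-in for learned weights.
    """
    half = kernel_size // 2
    n = len(signal)
    out = []
    for i in range(n):
        lo, hi = max(0, i - half), min(n, i + half + 1)
        # Dividing by the full kernel size is equivalent to zero padding.
        out.append(sum(signal[lo:hi]) / kernel_size)
    return out


def multi_scale_features(signal, kernel_sizes=KERNEL_SIZES):
    """Filter one signal at every kernel size; returns {size: filtered}."""
    return {k: moving_average(signal, k) for k in kernel_sizes}


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def soft_vote(per_model_logits):
    """Average class probabilities across models (assumed rule).

    Returns (predicted_label, mean_probabilities).
    """
    probs = [softmax(logits) for logits in per_model_logits]
    n_classes = len(probs[0])
    mean = [sum(p[c] for p in probs) / len(probs) for c in range(n_classes)]
    label = max(range(n_classes), key=mean.__getitem__)
    return label, mean


def mean_square(values):
    """Average signal power, used to compare the filtered outputs."""
    return sum(v * v for v in values) / len(values)


if __name__ == "__main__":
    # Synthetic signal: a slow oscillation plus a weaker fast one.
    signal = [
        math.sin(2 * math.pi * t / 400) + 0.5 * math.sin(2 * math.pi * t / 20)
        for t in range(1000)
    ]
    for size, filtered in multi_scale_features(signal).items():
        print(size, round(mean_square(filtered), 4))

    # Hypothetical logits from three ensemble members for two classes.
    print(soft_vote([[2.0, 0.0], [0.0, 1.0], [1.5, 0.0]]))
```

In this toy setting, longer kernels smooth the signal more aggressively, so each ensemble member responds to a different temporal scale. In a real model the kernels would be learned and the combination rule would follow whatever the paper specifies.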

Subjects:	Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
Cite as:	arXiv:2411.09302 [cs.SD]
	(or arXiv:2411.09302v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2411.09302

Submission history

From: Soowon Kim
[v1] Thu, 14 Nov 2024 09:23:58 UTC (224 KB)
