The 2025 PNPL Competition: Speech Detection and Phoneme Classification in the LibriBrain Dataset
Authors:
Gilad Landau,
Miran Özdogan,
Gereon Elvers,
Francesco Mantegna,
Pratik Somaiya,
Dulhan Jayalath,
Luisa Kurth,
Teyun Kwon,
Brendan Shillingford,
Greg Farquhar,
Minqi Jiang,
Karim Jerbi,
Hamza Abdelhedi,
Yorguin Mantilla Ramos,
Caglar Gulcehre,
Mark Woolrich,
Natalie Voets,
Oiwi Parker Jones
Abstract:
The advance of speech decoding from non-invasive brain data holds the potential for profound societal impact. Among its most promising applications is the restoration of communication to paralysed individuals affected by speech deficits such as dysarthria, without the need for high-risk surgical interventions. The ultimate aim of the 2025 PNPL competition is to produce the conditions for an "Image…
▽ More
The advance of speech decoding from non-invasive brain data holds the potential for profound societal impact. Among its most promising applications is the restoration of communication to paralysed individuals affected by speech deficits such as dysarthria, without the need for high-risk surgical interventions. The ultimate aim of the 2025 PNPL competition is to produce the conditions for an "ImageNet moment" or breakthrough in non-invasive neural decoding, by harnessing the collective power of the machine learning community.
To facilitate this vision we present the largest within-subject MEG dataset recorded to date (LibriBrain) together with a user-friendly Python library (pnpl) for easy data access and integration with deep learning frameworks. For the competition we define two foundational tasks (i.e. Speech Detection and Phoneme Classification from brain data), complete with standardised data splits and evaluation metrics, illustrative benchmark models, online tutorial code, a community discussion board, and public leaderboard for submissions. To promote accessibility and participation the competition features a Standard track that emphasises algorithmic innovation, as well as an Extended track that is expected to reward larger-scale computing, accelerating progress toward a non-invasive brain-computer interface for speech.
△ Less
Submitted 11 June, 2025;
originally announced June 2025.
Neuro-GPT: Towards A Foundation Model for EEG
Authors:
Wenhui Cui,
Woojae Jeong,
Philipp Thölke,
Takfarinas Medani,
Karim Jerbi,
Anand A. Joshi,
Richard M. Leahy
Abstract:
To handle the scarcity and heterogeneity of electroencephalography (EEG) data for Brain-Computer Interface (BCI) tasks, and to harness the power of large publicly available data sets, we propose Neuro-GPT, a foundation model consisting of an EEG encoder and a GPT model. The foundation model is pre-trained on a large-scale data set using a self-supervised task that learns how to reconstruct masked…
▽ More
To handle the scarcity and heterogeneity of electroencephalography (EEG) data for Brain-Computer Interface (BCI) tasks, and to harness the power of large publicly available data sets, we propose Neuro-GPT, a foundation model consisting of an EEG encoder and a GPT model. The foundation model is pre-trained on a large-scale data set using a self-supervised task that learns how to reconstruct masked EEG segments. We then fine-tune the model on a Motor Imagery Classification task to validate its performance in a low-data regime (9 subjects). Our experiments demonstrate that applying a foundation model can significantly improve classification performance compared to a model trained from scratch, which provides evidence for the generalizability of the foundation model and its ability to address challenges of data scarcity and heterogeneity in EEG. The code is publicly available at github.com/wenhui0206/NeuroGPT.
△ Less
Submitted 2 March, 2024; v1 submitted 7 November, 2023;
originally announced November 2023.