ECGBERT: Understanding Hidden Language of ECGs with Self-Supervised Representation Learning
Authors:
Seokmin Choi,
Sajad Mousavi,
Phillip Si,
Haben G. Yhdego,
Fatemeh Khadem,
Fatemeh Afghah
Abstract:
In the medical field, current ECG signal analysis approaches rely on supervised deep neural networks trained for specific tasks that require substantial amounts of labeled data. However, our paper introduces ECGBERT, a self-supervised representation learning approach that unlocks the underlying language of ECGs. By unsupervised pre-training of the model, we mitigate challenges posed by the lack of…
▽ More
In the medical field, current ECG signal analysis approaches rely on supervised deep neural networks trained for specific tasks that require substantial amounts of labeled data. However, our paper introduces ECGBERT, a self-supervised representation learning approach that unlocks the underlying language of ECGs. By unsupervised pre-training of the model, we mitigate challenges posed by the lack of well-labeled and curated medical data. ECGBERT, inspired by advances in the area of natural language processing and large language models, can be fine-tuned with minimal additional layers for various ECG-based problems. Through four tasks, including Atrial Fibrillation arrhythmia detection, heartbeat classification, sleep apnea detection, and user authentication, we demonstrate ECGBERT's potential to achieve state-of-the-art results on a wide variety of tasks.
△ Less
Submitted 10 June, 2023;
originally announced June 2023.
ECG Language Processing (ELP): a New Technique to Analyze ECG Signals
Authors:
Sajad Mousavi,
Fatemeh Afghah,
Fatemeh Khadem,
U. Rajendra Acharya
Abstract:
A language is constructed of a finite/infinite set of sentences composing of words. Similar to natural languages, Electrocardiogram (ECG) signal, the most common noninvasive tool to study the functionality of the heart and diagnose several abnormal arrhythmias, is made up of sequences of three or four distinct waves including the P-wave, QRS complex, T-wave and U-wave. An ECG signal may contain se…
▽ More
A language is constructed of a finite/infinite set of sentences composing of words. Similar to natural languages, Electrocardiogram (ECG) signal, the most common noninvasive tool to study the functionality of the heart and diagnose several abnormal arrhythmias, is made up of sequences of three or four distinct waves including the P-wave, QRS complex, T-wave and U-wave. An ECG signal may contain several different varieties of each wave (e.g., the QRS complex can have various appearances). For this reason, the ECG signal is a sequence of heartbeats similar to sentences in natural languages) and each heartbeat is composed of a set of waves (similar to words in a sentence) of different morphologies. Analogous to natural language processing (NLP) which is used to help computers understand and interpret the human's natural language, it is possible to develop methods inspired by NLP to aid computers to gain a deeper understanding of Electrocardiogram signals. In this work, our goal is to propose a novel ECG analysis technique, \textit{ECG language processing (ELP)}, focusing on empowering computers to understand ECG signals in a way physicians do. We evaluated the proposed method on two tasks including the classification of heartbeats and the detection of atrial fibrillation in the ECG signals. Experimental results on three databases (i.e., PhysionNet's MIT-BIH, MIT-BIH AFIB and PhysioNet Challenge 2017 AFIB Dataset databases) reveal that the proposed method is a general idea that can be applied to a variety of biomedical applications and is able to achieve remarkable performance.
△ Less
Submitted 12 June, 2020;
originally announced June 2020.