MedFuse: Multi-modal fusion with clinical time-series data and chest X-ray images

Hayat, Nasir; Geras, Krzysztof J.; Shamout, Farah E.

arXiv:2207.07027 (eess)

[Submitted on 14 Jul 2022 (v1), last revised 2 Mar 2023 (this version, v2)]

Abstract: Multi-modal fusion approaches aim to integrate information from different data sources. Unlike natural datasets, such as in audio-visual applications, where samples consist of "paired" modalities, data in healthcare is often collected asynchronously. Hence, requiring the presence of all modalities for a given sample is not realistic for clinical tasks and significantly limits the size of the dataset during training. In this paper, we propose MedFuse, a conceptually simple yet promising LSTM-based fusion module that can accommodate uni-modal as well as multi-modal input. We evaluate the fusion method and introduce new benchmark results for in-hospital mortality prediction and phenotype classification, using clinical time-series data in the MIMIC-IV dataset and corresponding chest X-ray images in MIMIC-CXR. Compared to more complex multi-modal fusion strategies, MedFuse provides a performance improvement by a large margin on the fully paired test set. It also remains robust across the partially paired test set containing samples with missing chest X-ray images. We release our code for reproducibility and to enable the evaluation of competing models in the future.
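The abstract does not spell out how an LSTM-based module accepts both uni-modal and multi-modal input. One plausible reading, sketched here as an assumption and not as the authors' released implementation, is that each available modality contributes one feature vector to a short sequence, and a missing chest X-ray simply shortens that sequence. The names `TinyLSTMCell` and `fuse`, the feature sizes, and the random weights are all hypothetical, and the sketch assumes both modalities have already been encoded to a common feature dimension.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTMCell:
    """Minimal LSTM cell with random, untrained weights (illustration only)."""

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        n_in = input_size + hidden_size
        # Rows are stacked per gate: input, forget, candidate, output.
        self.W = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
                  for _ in range(4 * hidden_size)]
        self.b = [0.0] * (4 * hidden_size)

    def step(self, x, h, c):
        z = list(x) + list(h)  # concatenate input and previous hidden state
        pre = [sum(w * v for w, v in zip(row, z)) + b
               for row, b in zip(self.W, self.b)]
        H = self.hidden_size
        i = [sigmoid(p) for p in pre[0:H]]
        f = [sigmoid(p) for p in pre[H:2 * H]]
        g = [math.tanh(p) for p in pre[2 * H:3 * H]]
        o = [sigmoid(p) for p in pre[3 * H:4 * H]]
        c_new = [fi * ci + ii * gi for fi, ci, ii, gi in zip(f, c, i, g)]
        h_new = [oi * math.tanh(ci) for oi, ci in zip(o, c_new)]
        return h_new, c_new


def fuse(cell, ehr_features, cxr_features=None):
    """Run the cell over the available modality features.

    Assumption: the time-series features are always present, and the
    image features are appended only when an image exists.
    """
    sequence = [ehr_features]
    if cxr_features is not None:
        sequence.append(cxr_features)
    h = [0.0] * cell.hidden_size
    c = [0.0] * cell.hidden_size
    for x in sequence:
        h, c = cell.step(x, h, c)
    return h  # final hidden state serves as the fused representation


cell = TinyLSTMCell(input_size=4, hidden_size=3)
ehr = [0.2, -0.1, 0.5, 0.0]   # hypothetical encoded time-series features
cxr = [0.3, 0.3, -0.2, 0.1]   # hypothetical encoded image features
paired = fuse(cell, ehr, cxr)  # both modalities present
unpaired = fuse(cell, ehr)     # chest X-ray missing
```

Under this reading, the same module produces a fixed-size representation for paired and unpaired samples, so no sample has to be dropped from training because an image is absent.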

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2207.07027 [eess.IV]
	(or arXiv:2207.07027v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2207.07027

Submission history

From: Farah Shamout
[v1] Thu, 14 Jul 2022 15:59:03 UTC (2,879 KB)
[v2] Thu, 2 Mar 2023 14:49:06 UTC (3,576 KB)
