DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era

Restrepo, David; Wu, Chenwei; Vásquez-Venegas, Constanza; Nakayama, Luis Filipe; Celi, Leo Anthony; López, Diego M

Computer Science > Artificial Intelligence

arXiv:2404.12278 (cs)

[Submitted on 18 Apr 2024 (v1), last revised 2 Jun 2024 (this version, v2)]

Title:DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era

Authors:David Restrepo, Chenwei Wu, Constanza Vásquez-Venegas, Luis Filipe Nakayama, Leo Anthony Celi, Diego M López

View PDF HTML (experimental)

Abstract:In the big data era, integrating diverse data modalities poses significant challenges, particularly in complex fields like healthcare. This paper introduces a new process model for multimodal Data Fusion for Data Mining, integrating embeddings and the Cross-Industry Standard Process for Data Mining with the existing Data Fusion Information Group model. Our model aims to decrease computational costs, complexity, and bias while improving efficiency and reliability. We also propose "disentangled dense fusion", a novel embedding fusion method designed to optimize mutual information and facilitate dense inter-modality feature interaction, thereby minimizing redundant information.
We demonstrate the model's efficacy through three use cases: predicting diabetic retinopathy using retinal images and patient metadata, domestic violence prediction employing satellite imagery, internet, and census data, and identifying clinical and demographic features from radiography images and clinical notes. The model achieved a Macro F1 score of 0.92 in diabetic retinopathy prediction, an R-squared of 0.854 and sMAPE of 24.868 in domestic violence prediction, and a macro AUC of 0.92 and 0.99 for disease prediction and sex classification, respectively, in radiological analysis.
These results underscore the Data Fusion for Data Mining model's potential to significantly impact multimodal data processing, promoting its adoption in diverse, resource-constrained settings.

Comments:	6 figures, 5 tables
Subjects:	Artificial Intelligence (cs.AI)
MSC classes:	68T30
ACM classes:	I.2.0; I.3.6
Cite as:	arXiv:2404.12278 [cs.AI]
	(or arXiv:2404.12278v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2404.12278

Submission history

From: David Restrepo [view email]
[v1] Thu, 18 Apr 2024 15:52:42 UTC (1,392 KB)
[v2] Sun, 2 Jun 2024 16:51:46 UTC (1,393 KB)

Computer Science > Artificial Intelligence

Title:DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators