A Markov Random Field Multi-Modal Variational AutoEncoder

Oubari, Fouad; Baha, Mohamed El; Meunier, Raphael; Décatoire, Rodrigue; Mougeot, Mathilde

arXiv:2408.09576 (cs)

[Submitted on 18 Aug 2024 (v1), last revised 7 Feb 2025 (this version, v2)]

Abstract: Recent advancements in multimodal Variational AutoEncoders (VAEs) have highlighted their potential for modeling complex data from multiple modalities. However, many existing approaches use relatively straightforward aggregating schemes that may not fully capture the complex dynamics present between different modalities. This work introduces a novel multimodal VAE that incorporates a Markov Random Field (MRF) into both the prior and posterior distributions. This integration aims to capture complex intermodal interactions more effectively. Unlike previous models, our approach is specifically designed to model and leverage the intricacies of these relationships, enabling a more faithful representation of multimodal data. Our experiments demonstrate that our model performs competitively on the standard PolyMNIST dataset and shows superior performance in managing complex intermodal dependencies in a specially designed synthetic dataset, intended to test intricate relationships.

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2408.09576 [cs.LG] (or arXiv:2408.09576v2 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2408.09576

Submission history

From: Fouad Oubari
[v1] Sun, 18 Aug 2024 19:27:30 UTC (1,444 KB)
[v2] Fri, 7 Feb 2025 08:19:34 UTC (1,444 KB)
