Showing 1–1 of 1 results for author: Nguyen, D P T

  1. arXiv:2503.06380  [pdf, other]

    cs.CV cs.CL

    TI-JEPA: An Innovative Energy-based Joint Embedding Strategy for Text-Image Multimodal Systems

    Authors: Khang H. N. Vo, Duc P. T. Nguyen, Thong Nguyen, Tho T. Quan

    Abstract: This paper focuses on multimodal alignment within the realm of Artificial Intelligence, particularly in text and image modalities. The semantic gap between the textual and visual modality poses a discrepancy problem towards the effectiveness of multi-modalities fusion. Therefore, we introduce Text-Image Joint Embedding Predictive Architecture (TI-JEPA), an innovative pre-training strategy that lev…

    Submitted 8 March, 2025; originally announced March 2025.