Showing 1–1 of 1 results for author: Gavit, D

  1. arXiv:2505.20937  [pdf, ps, other]

    cs.CL

    On VLMs for Diverse Tasks in Multimodal Meme Classification

    Authors: Deepesh Gavit, Debajyoti Mazumder, Samiran Das, Jasabanta Patro

    Abstract: In this paper, we present a comprehensive and systematic analysis of vision-language models (VLMs) for disparate meme classification tasks. We introduce a novel approach that generates a VLM-based understanding of meme images and fine-tunes LLMs on the textual understanding of the embedded meme text to improve performance. Our contributions are threefold: (1) Benchmarking VLMs with diverse…

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 16 pages