Skip to main content

Showing 1–1 of 1 results for author: Sailaja, K P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.06809  [pdf, other

    cs.CV

    DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks

    Authors: Amin Karimi Monsefi, Kishore Prakash Sailaja, Ali Alilooee, Ser-Nam Lim, Rajiv Ramnath

    Abstract: In this paper, we introduce DetailCLIP: A Detail-Oriented CLIP to address the limitations of contrastive learning-based vision-language models, particularly CLIP, in handling detail-oriented and fine-grained tasks like segmentation. While CLIP and its variants excel in the global alignment of image and text representations, they often struggle to capture the fine-grained details necessary for prec… ▽ More

    Submitted 31 March, 2025; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: Accepted in SSI-FM Workshop of ICLR 2025