Skip to main content

Showing 1–3 of 3 results for author: Kuchibhotla, H C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.01064  [pdf, ps, other

    cs.CV cs.LG

    Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs

    Authors: Hari Chandana Kuchibhotla, Sai Srinivas Kancheti, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian

    Abstract: Fine-grained Visual Recognition (FGVR) involves distinguishing between visually similar categories, which is inherently challenging due to subtle inter-class differences and the need for large, expert-annotated datasets. In domains like medical imaging, such curated datasets are unavailable due to issues like privacy concerns and high annotation costs. In such scenarios lacking labeled data, an FG… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: preprint; earlier version accepted at NeurIPS 2024 Workshop on Adaptive Foundation Models

  2. arXiv:2405.07921  [pdf, other

    cs.CV

    Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?

    Authors: Hari Chandana Kuchibhotla, Sai Srinivas Kancheti, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian

    Abstract: Going beyond mere fine-tuning of vision-language models (VLMs), learnable prompt tuning has emerged as a promising, resource-efficient alternative. Despite their potential, effectively learning prompts faces the following challenges: (i) training in a low-shot scenario results in overfitting, limiting adaptability, and yielding weaker performance on newer classes or datasets; (ii) prompt-tuning's… ▽ More

    Submitted 20 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2203.16517  [pdf, other

    cs.CV

    Unseen Classes at a Later Time? No Problem

    Authors: Hari Chandana Kuchibhotla, Sumitra S Malagi, Shivam Chandhok, Vineeth N Balasubramanian

    Abstract: Recent progress towards learning from limited supervision has encouraged efforts towards designing models that can recognize novel classes at test time (generalized zero-shot learning or GZSL). GZSL approaches assume knowledge of all classes, with or without labeled data, beforehand. However, practical scenarios demand models that are adaptable and can handle dynamic addition of new seen and unsee… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: To appear in CVPR 2022. Code is available @ (https://github.com/sumitramalagi/Unseen-classes-at-a-later-time)