Skip to main content

Showing 1–3 of 3 results for author: Gandhamal, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03709  [pdf, other

    cs.CV

    AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives

    Authors: Aniruddh Sikdar, Aditya Gandhamal, Suresh Sundaram

    Abstract: Open-vocabulary semantic segmentation (OVSS) involves assigning labels to each pixel in an image based on textual descriptions, leveraging world models like CLIP. However, they encounter significant challenges in cross-domain generalization, hindering their practical efficacy in real-world applications. Embodied AI systems are transforming autonomous navigation for ground vehicles and drones by en… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted at Workshop on Foundation Models Meet Embodied Agents at CVPR 2025 (Non-archival Track)

  2. arXiv:2506.03706  [pdf, other

    cs.CV

    OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation

    Authors: Aditya Gandhamal, Aniruddh Sikdar, Suresh Sundaram

    Abstract: Open-vocabulary semantic segmentation (OVSS) entails assigning semantic labels to each pixel in an image using textual descriptions, typically leveraging world models such as CLIP. To enhance out-of-domain generalization, we propose Cost Aggregation with Optimal Transport (OV-COAST) for open-vocabulary semantic segmentation. To align visual-language features within the framework of optimal transpo… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted at CVPR 2025 Workshop on Transformers for Vision (Non-archival track)

  3. arXiv:2410.20953  [pdf, other

    cs.CV

    IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks

    Authors: Manjunath D, Prajwal Gurunath, Sumanth Udupa, Aditya Gandhamal, Shrikar Madhu, Aniruddh Sikdar, Suresh Sundaram

    Abstract: Deep neural networks (DNNs) have shown exceptional performance when trained on well-illuminated images captured by Electro-Optical (EO) cameras, which provide rich texture details. However, in critical applications like aerial perception, it is essential for DNNs to maintain consistent reliability across all conditions, including low-light scenarios where EO cameras often struggle to capture suffi… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures