Showing 1–6 of 6 results for author: Gangisetty, S

  1. arXiv:2506.22111

    cs.CV cs.HC

    Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD

    Authors: Ruthvik Bokkasam, Shankar Gangisetty, A. H. Abdul Hafez, C. V. Jawahar

    Abstract: With the rapid advancements in autonomous driving, accurately predicting pedestrian behavior has become essential for ensuring safety in complex and unpredictable traffic conditions. The growing interest in this challenge highlights the need for comprehensive datasets that capture unstructured environments, enabling the development of more robust prediction models to enhance pedestrian safety and…

    Submitted 27 June, 2025; originally announced June 2025.

  2. arXiv:2503.08437

    cs.CV cs.AI cs.HC cs.RO

    ICPR 2024 Competition on Rider Intention Prediction

    Authors: Shankar Gangisetty, Abdul Wasi, Shyam Nandan Rai, C. V. Jawahar, Sajay Raj, Manish Prajapati, Ayesha Choudhary, Aaryadev Chandra, Dev Chandan, Shireen Chand, Suvaditya Mukherjee

    Abstract: The recent surge in the vehicle market has led to an alarming increase in road accidents. This underscores the critical importance of enhancing road safety measures, particularly for vulnerable road users like motorcyclists. Hence, we introduce the rider intention prediction (RIP) competition that aims to address challenges in rider safety by proactively predicting maneuvers before they occur, the…

    Submitted 11 March, 2025; originally announced March 2025.

  3. arXiv:2308.00295

    cs.CV cs.AI cs.CL

    Making the V in Text-VQA Matter

    Authors: Shamanthak Hegde, Soumya Jahagirdar, Shankar Gangisetty

    Abstract: Text-based VQA aims at answering questions by reading the text present in the images. It requires a large amount of scene-text relationship understanding compared to the VQA task. Recent studies have shown that the question-answer pairs in the dataset are more focused on the text present in the image but less importance is given to visual features and some questions do not require understanding th…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted for the CVPR 2023 Workshop on Open-Domain Reasoning Under Multi-Modal Settings

  4. arXiv:2306.06622

    cs.CV

    Weakly Supervised Visual Question Answer Generation

    Authors: Charani Alampalle, Shamanthak Hegde, Soumya Jahagirdar, Shankar Gangisetty

    Abstract: Growing interest in conversational agents promotes two-way human-computer communication, and asking and answering visual questions has become an active area of research in AI. Thus, generation of visual question-answer pair(s) becomes an important and challenging task. To address this issue, we propose a weakly-supervised visual question answer generation method that generates a relevant quest…

    Submitted 11 September, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Pages: 5588-5596, 2023

  5. arXiv:2211.12950

    cs.CV

    Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

    Authors: Soumya Jahagirdar, Shankar Gangisetty, Anand Mishra

    Abstract: We present a novel problem of text-based visual question generation or TextVQG in short. Given the recent growing interest of the document image analysis community in combining text understanding with conversational artificial intelligence, e.g., text-based visual question answering, TextVQG becomes an important task. TextVQG aims to generate a natural language question for a given input image and…

    Submitted 23 November, 2022; originally announced November 2022.

  6. PIG-Net: Inception based Deep Learning Architecture for 3D Point Cloud Segmentation

    Authors: Sindhu Hegde, Shankar Gangisetty

    Abstract: Point clouds, being a simple and compact representation of the surface geometry of 3D objects, have gained increasing popularity with the evolution of deep learning networks for classification and segmentation tasks. Unlike for humans, teaching a machine to analyze the segments of an object is a challenging task and quite essential in various machine vision applications. In this paper, we address the p…

    Submitted 28 January, 2021; originally announced January 2021.

    Comments: 11 pages, 5 Figures, 6 Tables, Accepted in Computers & Graphics Journal 2021