Skip to main content

Showing 1–12 of 12 results for author: Goswami, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.20425  [pdf, other

    cs.RO

    OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation

    Authors: Raktim Gautam Goswami, Prashanth Krishnamurthy, Yann LeCun, Farshad Khorrami

    Abstract: Visual imitation learning enables robotic agents to acquire skills by observing expert demonstration videos. In the one-shot setting, the agent generates a policy after observing a single expert demonstration without additional fine-tuning. Existing approaches typically train and evaluate on the same set of tasks, varying only object configurations, and struggle to generalize to unseen tasks with… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2503.14255  [pdf, other

    cs.RO

    A Chain-Driven, Sandwich-Legged Quadruped Robot: Design and Experimental Analysis

    Authors: Aman Singh, Bhavya Giri Goswami, Ketan Nehete, Shishir N. Y. Kolathaya

    Abstract: This paper introduces a chain-driven, sandwich-legged, mid-size quadruped robot designed as an accessible research platform. The design prioritizes enhanced locomotion capabilities, improved reliability and safety of the actuation system, and simplified, cost-effective manufacturing processes. Locomotion performance is optimized through a sandwiched leg design and a dual-motor configuration, reduc… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: 6 pages, 9 figures

  3. arXiv:2411.17662  [pdf, other

    cs.RO cs.CV

    RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training

    Authors: Raktim Gautam Goswami, Prashanth Krishnamurthy, Yann LeCun, Farshad Khorrami

    Abstract: Vision-based pose estimation of articulated robots with unknown joint angles has applications in collaborative robotics and human-robot interaction tasks. Current frameworks use neural network encoders to extract image features and downstream layers to predict joint angles and robot pose. While images of robots inherently contain rich information about the robot's physical structures, existing met… ▽ More

    Submitted 2 May, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

  4. arXiv:2410.06239  [pdf, other

    cs.RO

    OrionNav: Online Planning for Robot Autonomy with Context-Aware LLM and Open-Vocabulary Semantic Scene Graphs

    Authors: Venkata Naren Devarakonda, Raktim Gautam Goswami, Ali Umut Kaypak, Naman Patel, Rooholla Khorrambakht, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Enabling robots to autonomously navigate unknown, complex, dynamic environments and perform diverse tasks remains a fundamental challenge in developing robust autonomous physical agents. These agents must effectively perceive their surroundings while leveraging world knowledge for decision-making. Although recent approaches utilize vision-language and large language models for scene understanding… ▽ More

    Submitted 22 October, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

  5. arXiv:2410.00702  [pdf, other

    cs.CV

    FlashMix: Fast Map-Free LiDAR Localization via Feature Mixing and Contrastive-Constrained Accelerated Training

    Authors: Raktim Gautam Goswami, Naman Patel, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Map-free LiDAR localization systems accurately localize within known environments by predicting sensor position and orientation directly from raw point clouds, eliminating the need for large maps and descriptors. However, their long training times hinder rapid adaptation to new environments. To address this, we propose FlashMix, which uses a frozen, scene-agnostic backbone to extract local point d… ▽ More

    Submitted 27 September, 2024; originally announced October 2024.

  6. arXiv:2407.08260  [pdf, other

    cs.CV cs.RO

    SALSA: Swift Adaptive Lightweight Self-Attention for Enhanced LiDAR Place Recognition

    Authors: Raktim Gautam Goswami, Naman Patel, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Large-scale LiDAR mappings and localization leverage place recognition techniques to mitigate odometry drifts, ensuring accurate mapping. These techniques utilize scene representations from LiDAR point clouds to identify previously visited sites within a database. Local descriptors, assigned to each point within a point cloud, are aggregated to form a scene representation for the point cloud. Thes… ▽ More

    Submitted 30 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  7. arXiv:2403.07043  [pdf, other

    cs.RO

    A Collision Cone Approach for Control Barrier Functions

    Authors: Manan Tayal, Bhavya Giri Goswami, Karthik Rajgopal, Rajpal Singh, Tejas Rao, Jishnu Keshavan, Pushpak Jagtap, Shishir Kolathaya

    Abstract: This work presents a unified approach for collision avoidance using Collision-Cone Control Barrier Functions (CBFs) in both ground (UGV) and aerial (UAV) unmanned vehicles. We propose a novel CBF formulation inspired by collision cones, to ensure safety by constraining the relative velocity between the vehicle and the obstacle to always point away from each other. The efficacy of this approach is… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 13 pages, 16 pages. arXiv admin note: substantial text overlap with arXiv:2209.11524, arXiv:2303.15871, arXiv:2310.10839

  8. arXiv:2310.10839  [pdf, other

    cs.RO eess.SY math.OC

    Collision Cone Control Barrier Functions: Experimental Validation on UGVs for Kinematic Obstacle Avoidance

    Authors: Bhavya Giri Goswami, Manan Tayal, Karthik Rajgopal, Pushpak Jagtap, Shishir Kolathaya

    Abstract: Autonomy advances have enabled robots in diverse environments and close human interaction, necessitating controllers with formal safety guarantees. This paper introduces an experimental platform designed for the validation and demonstration of a novel class of Control Barrier Functions (CBFs) tailored for Unmanned Ground Vehicles (UGVs) to proactively prevent collisions with kinematic obstacles by… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 8 pages, 11 figures, Submitted at American Control Conference (ACC), 2024. arXiv admin note: substantial text overlap with arXiv:2209.11524

  9. arXiv:2209.11524  [pdf, other

    cs.RO math.OC

    Control Barrier Functions in UGVs for Kinematic Obstacle Avoidance: A Collision Cone Approach

    Authors: Phani Thontepu, Bhavya Giri Goswami, Manan Tayal, Neelaksh Singh, Shyamsundar P I, Shyam Sundar M G, Suresh Sundaram, Vaibhav Katewa, Shishir Kolathaya

    Abstract: In this paper, we propose a new class of Control Barrier Functions (CBFs) for Unmanned Ground Vehicles (UGVs) that help avoid collisions with kinematic (non-zero velocity) obstacles. While the current forms of CBFs have been successful in guaranteeing safety/collision avoidance with static obstacles, extensions for the dynamic case have seen limited success. Moreover, with the UGV models like the… ▽ More

    Submitted 16 October, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 6 pages, 4 figures, For supplement video follow https://youtu.be/Dme7Wm9y6es. *The first and second authors have contributed equally

    ACM Class: I.2.9; G.1.6; J.2

  10. arXiv:2112.13511  [pdf

    cs.RO

    Design, Manufacturing, and Controls of a Prismatic Quadruped Robot: PRISMA

    Authors: Team Robocon, IIT Roorkee, :, Bhavya Giri Goswami, Aman Verma, Gautam Jha, Vandan Gajjar, Vedant Neekhra, Utkarsh Deepak, Aayush Singh Chauhan

    Abstract: Most of the quadrupeds developed are highly actuated, and their control is hence quite cumbersome. They need advanced electronics equipment to solve convoluted inverse kinematic equations continuously. In addition, they demand special and costly sensors to autonomously navigate through the environment as traditional distance sensors usually fail because of the continuous perturbation due to the mo… ▽ More

    Submitted 26 December, 2021; originally announced December 2021.

    Comments: 14 pages, 16 figures, 4 tables

  11. arXiv:2010.14122  [pdf, other

    eess.AS cs.AI eess.SP

    Phase Aware Speech Enhancement using Realisation of Complex-valued LSTM

    Authors: Raktim Gautam Goswami, Sivaganesh Andhavarapu, K Sri Rama Murty

    Abstract: Most of the deep learning based speech enhancement (SE) methods rely on estimating the magnitude spectrum of the clean speech signal from the observed noisy speech signal, either by magnitude spectral masking or regression. These methods reuse the noisy phase while synthesizing the time-domain waveform from the estimated magnitude spectrum. However, there have been recent works highlighting the im… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

  12. arXiv:1803.00401  [pdf, other

    cs.CV

    Unravelling Robustness of Deep Learning based Face Recognition Against Adversarial Attacks

    Authors: Gaurav Goswami, Nalini Ratha, Akshay Agarwal, Richa Singh, Mayank Vatsa

    Abstract: Deep neural network (DNN) architecture based models have high expressive power and learning capacity. However, they are essentially a black box method since it is not easy to mathematically formulate the functions that are learned within its many layers of representation. Realizing this, many researchers have started to design methods to exploit the drawbacks of deep learning based algorithms ques… ▽ More

    Submitted 22 February, 2018; originally announced March 2018.

    Comments: Accepted in AAAI 2018 (8 pages, 5 figures, 5 tables)