Showing 1–50 of 120 results for author: Krishna, K M

Searching in archive cs.
  1. arXiv:2509.21145  [pdf, ps, other]

    cs.RO

    DAGDiff: Guiding Dual-Arm Grasp Diffusion to Stable and Collision-Free Grasps

    Authors: Md Faizal Karim, Vignesh Vembar, Keshab Patra, Gaurav Singh, K Madhava Krishna

    Abstract: Reliable dual-arm grasping is essential for manipulating large and complex objects but remains a challenging problem due to stability, collision, and generalization requirements. Prior methods typically decompose the task into two independent grasp proposals, relying on region priors or heuristics that limit generalization and provide no principled guarantee of stability. We propose DAGDiff, an en…

    Submitted 29 September, 2025; v1 submitted 25 September, 2025; originally announced September 2025.

  2. arXiv:2509.00001  [pdf, ps, other]

    math.OC cs.IT math.FA math.PR

    Continuous Donoho-Elad Spark Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Donoho and Elad \textit{[Proc. Natl. Acad. Sci. USA, 2003]} introduced the important notion of the spark of a frame, using which they derived a fundamental uncertainty principle. Based on spark, they also provided a necessary and sufficient condition for the uniqueness of sparse solutions to the NP-hard $\ell_0$-minimization problem. In this nano note, we show that the notion of spark can be exten…

    Submitted 1 August, 2025; originally announced September 2025.

    Comments: 4 Pages, 0 Figures

    MSC Class: 94A12; 42C15; 94A08; 28A05

  3. arXiv:2508.07387  [pdf, ps, other]

    cs.RO

    MonoMPC: Monocular Vision Based Navigation with Learned Collision Model and Risk-Aware Model Predictive Control

    Authors: Basant Sharma, Prajyot Jadhav, Pranjal Paul, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Navigating unknown environments with a single RGB camera is challenging, as the lack of depth information prevents reliable collision-checking. While some methods use estimated depth to build collision maps, we found that depth estimates from vision foundation models are too noisy for zero-shot navigation in cluttered environments. We propose an alternative approach: instead of using noisy estim…

    Submitted 10 August, 2025; originally announced August 2025.

  4. arXiv:2507.18763  [pdf, ps, other]

    cs.CV cs.RO

    Diffusion-FS: Multimodal Free-Space Prediction via Diffusion for Autonomous Driving

    Authors: Keshav Gupta, Tejas S. Stanley, Pranjal Paul, Arun K. Singh, K. Madhava Krishna

    Abstract: Drivable free-space prediction is a fundamental and crucial problem in autonomous driving. Recent works have addressed the problem by representing the entire non-obstacle road regions as the free-space. In contrast, our aim is to estimate the driving corridors that are a navigable subset of the entire road region. Unfortunately, existing corridor estimation methods directly assume a BEV-centric rep…

    Submitted 24 July, 2025; originally announced July 2025.

    Comments: 8 pages, 7 figures, IROS 2025

  5. arXiv:2506.18913  [pdf, ps, other]

    math.FA cs.IT math-ph math.NT math.OC

    p-adic Ghobber-Jaming Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $\{τ_j\}_{j=1}^n$ and $\{ω_k\}_{k=1}^n$ be two orthonormal bases for a finite dimensional p-adic Hilbert space $\mathcal{X}$. Let $M,N\subseteq \{1, \dots, n\}$ be such that \begin{align*} \displaystyle \max_{j \in M, k \in N}|\langle τ_j, ω_k \rangle|<1, \end{align*} where $o(M)$ is the cardinality of $M$. Then for all $x \in \mathcal{X}$, we show that \begin{align} (1) \quad \quad \quad \qua…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 11 Pages, 0 Figures

    MSC Class: 12J25; 46S10; 47S10; 11D88

  6. arXiv:2503.23465  [pdf, ps, other]

    cs.RO

    SparseLoc: Sparse Open-Set Landmark-based Global Localization for Autonomous Navigation

    Authors: Pranjal Paul, Vineeth Bhat, Tejas Salian, Mohammad Omama, Krishna Murthy Jatavallabhula, Naveen Arulselvan, K. Madhava Krishna

    Abstract: Global localization is a critical problem in autonomous navigation, enabling precise positioning without reliance on GPS. Modern global localization techniques often depend on dense LiDAR maps, which, while precise, require extensive storage and computational resources. Recent approaches have explored alternative methods, such as sparse maps and learned features, but they suffer from poor robustne…

    Submitted 28 July, 2025; v1 submitted 30 March, 2025; originally announced March 2025.

  7. arXiv:2503.08358  [pdf, ps, other]

    cs.RO

    DG16M: A Large-Scale Dataset for Dual-Arm Grasping with Force-Optimized Grasps

    Authors: Md Faizal Karim, Mohammed Saad Hashmi, Shreya Bollimuntha, Mahesh Reddy Tapeti, Gaurav Singh, Nagamanikandan Govindan, K Madhava Krishna

    Abstract: Dual-arm robotic grasping is crucial for handling large objects that require stable and coordinated manipulation. While single-arm grasping has been extensively studied, datasets tailored for dual-arm settings remain scarce. We introduce a large-scale dataset of 16 million dual-arm grasps, evaluated under improved force-closure constraints. Additionally, we develop a benchmark dataset containing 3…

    Submitted 27 July, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

  8. arXiv:2501.19042  [pdf, other]

    cs.RO cs.AI

    Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors

    Authors: Simon Idoko, B. Bhanu Teja, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Coordination behavior in robot swarms is inherently multi-modal in nature. That is, there are numerous ways in which a swarm of robots can avoid inter-agent collisions and reach their respective goals. However, the problem of generating diverse and feasible swarm behaviors in a scalable manner remains largely unaddressed. In this paper, we fill this gap by combining generative models with a safety…

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: Submitted to RAL

  9. arXiv:2412.10396  [pdf, ps, other]

    math.FA cs.IT math-ph

    3-Heisenberg-Robertson-Schrodinger Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $\mathcal{X}$ be a 3-product space. Let $A: \mathcal{D}(A)\subseteq \mathcal{X}\to \mathcal{X}$, $B: \mathcal{D}(B)\subseteq \mathcal{X}\to \mathcal{X}$ and $C: \mathcal{D}(C)\subseteq \mathcal{X}\to \mathcal{X}$ be possibly unbounded 3-self-adjoint operators. Then for all \begin{align*} x \in \mathcal{D}(ABC)\cap\mathcal{D}(ACB) \cap \mathcal{D}(BAC)\cap\mathcal{D}(BCA) \cap \mathcal{D}(CAB…

    Submitted 1 December, 2024; originally announced December 2024.

    Comments: 4 Pages, 0 Figures

    MSC Class: 46C50; 46B99

  10. arXiv:2411.10886  [pdf, other]

    cs.CV cs.AI cs.GR cs.RO

    MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation

    Authors: Ansh Shah, K Madhava Krishna

    Abstract: Recovering metric depth from a single image remains a fundamental challenge in computer vision, requiring both scene understanding and accurate scaling. While deep learning has advanced monocular depth estimation, current models often struggle with unfamiliar scenes and layouts, particularly in zero-shot scenarios and when predicting scale-ergodic metric depth. We present MetricGold, a novel appro…

    Submitted 5 December, 2024; v1 submitted 16 November, 2024; originally announced November 2024.

  11. arXiv:2411.10171  [pdf, other]

    cs.RO cs.AI

    Imagine-2-Drive: Leveraging High-Fidelity World Models via Multi-Modal Diffusion Policies

    Authors: Anant Garg, K Madhava Krishna

    Abstract: World Model-based Reinforcement Learning (WMRL) enables sample efficient policy learning by reducing the need for online interactions which can potentially be costly and unsafe, especially for autonomous driving. However, existing world models often suffer from low prediction fidelity and compounding one-step errors, leading to policy degradation over long horizons. Additionally, traditional RL po…

    Submitted 9 March, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

    Comments: Submitted to IROS 2025

  12. arXiv:2411.00790  [pdf, ps, other]

    math.FA cs.IT

    Product Entropic Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Motivated by the Deutsch entropic uncertainty principle and several product uncertainty principles, we derive an uncertainty principle for the product of entropies using functions.

    Submitted 17 October, 2024; originally announced November 2024.

    Comments: 5 Pages, 0 Figures

    MSC Class: 42C15

  13. arXiv:2410.19712  [pdf, other]

    cs.RO

    DA-VIL: Adaptive Dual-Arm Manipulation with Reinforcement Learning and Variable Impedance Control

    Authors: Md Faizal Karim, Shreya Bollimuntha, Mohammed Saad Hashmi, Autrio Das, Gaurav Singh, Srinath Sridhar, Arun Kumar Singh, Nagamanikandan Govindan, K Madhava Krishna

    Abstract: Dual-arm manipulation is an area of growing interest in the robotics community. Enabling robots to perform tasks that require the coordinated use of two arms is essential for complex manipulation tasks such as handling large objects, assembling components, and performing human-like interactions. However, achieving effective dual-arm manipulation is challenging due to the need for precise coordina…

    Submitted 25 October, 2024; originally announced October 2024.

  14. arXiv:2410.12432  [pdf, other]

    cs.RO

    Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks

    Authors: Pranjali Pathre, Gunjan Gupta, M. Nomaan Qureshi, Mandyam Brunda, Samarth Brahmbhatt, K. Madhava Krishna

    Abstract: Visual servoing, the method of controlling robot motion through feedback from visual sensors, has seen significant advancements with the integration of optical flow-based methods. However, its application remains limited by inherent challenges, such as the necessity for a target image at test time, the requirement of substantial overlap between initial and target images, and the reliance on feedba…

    Submitted 7 December, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Published at 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  15. arXiv:2409.16011  [pdf, other]

    cs.RO math.OC

    CrowdSurfer: Sampling Optimization Augmented with Vector-Quantized Variational AutoEncoder for Dense Crowd Navigation

    Authors: Naman Kumar, Antareep Singha, Laksh Nanwani, Dhruv Potdar, Tarun R, Fatemeh Rastgar, Simon Idoko, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Navigation amongst densely packed crowds remains a challenge for mobile robots. The complexity increases further if the environment layout changes, making the prior computed global plan infeasible. In this paper, we show that it is possible to dramatically enhance crowd navigation by just improving the local planner. Our approach combines generative modelling with inference time optimization to ge…

    Submitted 7 March, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: Accepted at IEEE ICRA 2025

  16. arXiv:2409.12002  [pdf, other]

    cs.RO cs.CV

    Towards Global Localization using Multi-Modal Object-Instance Re-Identification

    Authors: Aneesh Chavan, Vaibhav Agrawal, Vineeth Bhat, Sarthak Chittawar, Siddharth Srivastava, Chetan Arora, K Madhava Krishna

    Abstract: Re-identification (ReID) is a critical challenge in computer vision, predominantly studied in the context of pedestrians and vehicles. However, robust object-instance ReID, which has significant implications for tasks such as autonomous exploration, long-term perception, and scene understanding, remains underexplored. In this work, we address this gap by proposing a novel dual-path object-instance…

    Submitted 1 May, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: 8 pages, 5 figures, 3 tables. Accepted at Advances in Robotics, AIR 2025 (Oral)

    MSC Class: 68T40 ACM Class: I.2.9; I.2.10

  17. arXiv:2409.09060  [pdf, ps, other]

    math.FA cs.IT math.OA math.OC math.ST

    Noncommutative Donoho-Elad-Gribonval-Nielsen-Fuchs Sparsity Theorem

    Authors: K. Mahesh Krishna

    Abstract: Breakthrough Sparsity Theorem, derived independently by Donoho and Elad \textit{[Proc. Natl. Acad. Sci. USA, 2003]}, Gribonval and Nielsen \textit{[IEEE Trans. Inform. Theory, 2003]} and Fuchs \textit{[IEEE Trans. Inform. Theory, 2004]} says that unique sparse solution to NP-Hard $\ell_0$-minimization problem can be obtained using unique solution of P-Type $\ell_1$-minimization problem. In this pa…

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: 7 Pages, 0 Figures

    MSC Class: 42C15; 46L08

    Journal ref: Mathematical Inequalities and Applications, Volume 28, Number 3 (2025), 531-539

  18. arXiv:2407.14513  [pdf, ps, other]

    math.OA cs.IT math.FA

    Modular Deutsch Entropic Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Khosravi, Drnovšek and Moslehian [\textit{Filomat, 2012}] derived the Buzano inequality for Hilbert C*-modules. Using this inequality, we derive the Deutsch entropic uncertainty principle for Hilbert C*-modules over commutative unital C*-algebras.

    Submitted 8 August, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 4 Pages, 0 Figures

    MSC Class: 46L08; 42C15; 46L05

  19. arXiv:2405.08003  [pdf, ps, other]

    math.FA cs.IT math.OA math.QA

    Continuous Krishna-Parthasarathy Entropic Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: In 2002, Krishna and Parthasarathy [\textit{Sankhyā Ser. A}] derived a discrete quantum version of the Maassen-Uffink [\textit{Phys. Rev. Lett., 1988}] entropic uncertainty principle. In this paper, using the notion of continuous operator-valued frames, we derive an entropic uncertainty principle for an arbitrary family of operators indexed by measure spaces having finite measure. We give an application to…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 7 pages, 0 Figures

    MSC Class: 81P15; 94A17; 42C15

    Journal ref: Special issue of Infinite Dimensional Analysis, Quantum Probability and Related Topics in honour of Prof. K. R. Parthasarathy, 18 March 2024

  20. Open-Set 3D Semantic Instance Maps for Vision Language Navigation -- O3D-SIM

    Authors: Laksh Nanwani, Kumaraditya Gupta, Aditya Mathur, Swayam Agrawal, A. H. Abdul Hafez, K. Madhava Krishna

    Abstract: Humans excel at forming mental maps of their surroundings, equipping them to understand object relationships and navigate based on language queries. Our previous work SI Maps [1] showed that having instance-level information and the semantic understanding of an environment helps significantly improve performance for language-guided tasks. We extend this instance-level approach to 3D while increasi…

    Submitted 27 April, 2024; originally announced April 2024.

    Journal ref: Advanced Robotics - Taylor and Francis - 2024

  21. arXiv:2404.04643  [pdf, other]

    cs.RO cs.CV

    Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation

    Authors: Gaurav Singh, Sanket Kalwar, Md Faizal Karim, Bipasha Sen, Nagamanikandan Govindan, Srinath Sridhar, K Madhava Krishna

    Abstract: Efficiently generating grasp poses tailored to specific regions of an object is vital for various robotic manipulation tasks, especially in a dual-arm setup. This scenario presents a significant challenge due to the complex geometries involved, requiring a deep understanding of the local geometry to generate grasps efficiently on the specified constrained regions. Existing methods only explore set…

    Submitted 15 July, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Project Page: https://constrained-grasp-diffusion.github.io/

  22. arXiv:2404.03307  [pdf, other]

    cs.RO eess.SY

    Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model

    Authors: Amith Manoharan, Aditya Sharma, Himani Belsare, Kaustab Pal, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Navigation of wheeled vehicles on uneven terrain necessitates going beyond the 2D approaches for trajectory planning. Specifically, it is essential to incorporate the full 6dof variation of vehicle pose and its associated stability cost in the planning process. To this end, most recent works aim to learn a neural network model to predict the vehicle evolution. However, such approaches are data-int…

    Submitted 22 November, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 8 pages, 7 figures, submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  23. arXiv:2404.00910  [pdf, ps, other]

    math.FA cs.IT math-ph

    Unexpected Uncertainty Principle for Disc Banach Spaces

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_n\}_{n=1}^\infty, \{τ_n\}_{n=1}^\infty)$ and $(\{g_n\}_{n=1}^\infty, \{ω_n\}_{n=1}^\infty)$ be unbounded continuous p-Schauder frames ($0<p<1$) for a disc Banach space $\mathcal{X}$. Then for every $x \in ( \mathcal{D}(θ_f) \cap\mathcal{D}(θ_g))\setminus\{0\}$, we show that \begin{align}\label{UB} (1) \quad \quad \quad \quad \|θ_f x\|_0\|θ_g x\|_0 \geq \frac{1}{\left(\displaystyle\sup_{n…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 6 Pages, 0 Figures

    MSC Class: 42C15

  24. arXiv:2403.20116  [pdf, other]

    cs.RO

    LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving

    Authors: Pranjal Paul, Anant Garg, Tushar Choudhary, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Existing Vision-Language models (VLMs) estimate either long-term trajectory waypoints or a set of control actions as a reactive solution for closed-loop planning based on their rich scene comprehension. However, these estimations are coarse and are subjective to their "world understanding" which may generate sub-optimal decisions due to perception errors. In this paper, we introduce LeGo-Drive, wh…

    Submitted 29 March, 2024; originally announced March 2024.

  25. arXiv:2403.17946  [pdf, ps, other]

    math.FA cs.IT math-ph

    Nonlinear Heisenberg-Robertson-Schrodinger Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: We derive an uncertainty principle for Lipschitz maps acting on subsets of Banach spaces. We show that this nonlinear uncertainty principle reduces to the Heisenberg-Robertson-Schrodinger uncertainty principle for linear operators acting on Hilbert spaces.

    Submitted 8 August, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 4 Pages, 0 Figures

    MSC Class: 26A16; 46B99

  26. arXiv:2402.08591  [pdf, ps, other]

    math.FA cs.IT math-ph

    Nonlinear Maccone-Pati Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: We show that one of the two important uncertainty principles derived by Maccone and Pati \textit{[Phys. Rev. Lett., 2014]} can be derived for arbitrary maps defined on subsets of $\mathcal{L}^p$ spaces for $1< p<\infty$. Our main tool is the Clarkson inequalities. We also derive a nonlinear uncertainty principle for weak parallelogram spaces and Type-p Banach spaces.

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 6 pages, 0 figures

    MSC Class: 46B20; 46E30

  27. arXiv:2402.04255  [pdf, ps, other]

    math.FA cs.IT

    Functional Kuppinger-Durisi-Bölcskei Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $\mathcal{X}$ be a Banach space. Let $\{τ_j\}_{j=1}^n, \{ω_k\}_{k=1}^m\subseteq \mathcal{X}$ and $\{f_j\}_{j=1}^n$, $\{g_k\}_{k=1}^m\subseteq \mathcal{X}^*$ satisfy $ |f_j(τ_j)|\geq 1$ for all $ 1\leq j \leq n$, $|g_k(ω_k)|\geq 1 $ for all $1\leq k \leq m$. If $x \in \mathcal{X}\setminus \{0\}$ is such that $x=θ_τθ_f x=θ_ωθ_g x$, then we show that \begin{align}\label{FKDB} (1) \quad\quad\quad\…

    Submitted 1 January, 2024; originally announced February 2024.

    Comments: 9 Pages, 0 Figures

    MSC Class: 46A45; 46B45; 42C15

  28. arXiv:2401.17399  [pdf, other]

    cs.RO

    ATPPNet: Attention based Temporal Point cloud Prediction Network

    Authors: Kaustab Pal, Aditya Sharma, Avinash Sharma, K. Madhava Krishna

    Abstract: Point cloud prediction is an important yet challenging task in the field of autonomous driving. The goal is to predict future point cloud sequences that maintain object structures while accurately representing their temporal motion. These predicted point clouds help in other subsequent tasks like object trajectory estimation for collision avoidance or estimating locations with the least odometry d…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted for presentation at the 2024 IEEE International Conference on Robotics and Automation (ICRA)

  29. arXiv:2312.00366  [pdf, ps, other]

    math.FA cs.IT math-ph

    Unbounded Donoho-Stark-Elad-Bruckstein-Ricaud-Torrésani Uncertainty Principles

    Authors: K. Mahesh Krishna

    Abstract: Let $(Ω, μ)$, $(Δ, ν)$ be measure spaces and $p=1$ or $p=\infty$. Let $(\{f_α\}_{α\in Ω}, \{τ_α\}_{α\in Ω})$ and $(\{g_β\}_{β\in Δ}, \{ω_β\}_{β\in Δ})$ be unbounded continuous p-Schauder frames for a Banach space $\mathcal{X}$. Then for every $x \in ( \mathcal{D}(θ_f) \cap\mathcal{D}(θ_g))\setminus\{0\}$, we show that \begin{align}\label{UB} (1) \quad \quad \quad \quad μ(\operatorname{supp}(θ_f…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 Figures, 0 Figures

    MSC Class: 42C15

  30. arXiv:2311.14635  [pdf]

    cs.CV cs.RO

    Automated Detection and Counting of Windows using UAV Imagery based Remote Sensing

    Authors: Dhruv Patel, Shivani Chepuri, Sarvesh Thakur, K. Harikumar, Ravi Kiran S., K. Madhava Krishna

    Abstract: Despite the technological advancements in the construction and surveying sector, the inspection of salient features like windows in an under-construction or existing building is predominantly a manual process. Moreover, the number of windows present in a building is directly related to the magnitude of deformation it suffers under earthquakes. In this research, a method to accurately detect and co…

    Submitted 24 November, 2023; originally announced November 2023.

  31. NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving

    Authors: Kaustab Pal, Aditya Sharma, Mohd Omama, Parth N. Shah, K. Madhava Krishna

    Abstract: In this paper we show an effective means of integrating data driven frameworks to sampling based optimal control to vastly reduce the compute time for easy adoption and adaptation to real time applications such as on-road autonomous driving in the presence of dynamic actors. Presented with training examples, a spatio-temporal CNN learns to predict the optimal mean control over a finite horizon tha…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Published in 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE)

  32. arXiv:2310.08270  [pdf, other]

    cs.RO

    Hilbert Space Embedding-based Trajectory Optimization for Multi-Modal Uncertain Obstacle Trajectory Prediction

    Authors: Basant Sharma, Aditya Sharma, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Safe autonomous driving critically depends on how well the ego-vehicle can predict the trajectories of neighboring vehicles. To this end, several trajectory prediction algorithms have been presented in the existing literature. Many of these approaches output a multi-modal distribution of obstacle trajectories instead of a single deterministic prediction to account for the underlying uncertainty. H…

    Submitted 12 October, 2023; originally announced October 2023.

  33. arXiv:2310.04181  [pdf, other]

    cs.CV cs.RO

    DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions

    Authors: Sanket Kalwar, Mihir Ungarala, Shruti Jain, Aaron Monis, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

    Abstract: Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in founda…

    Submitted 26 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  34. arXiv:2310.02251  [pdf, other]

    cs.CV cs.RO

    Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving

    Authors: Tushar Choudhary, Vikrant Dewangan, Shivam Chandhok, Shubham Priyadarshan, Anushka Jain, Arun K. Singh, Siddharth Srivastava, Krishna Murthy Jatavallabhula, K. Madhava Krishna

    Abstract: Talk2BEV is a large vision-language model (LVLM) interface for bird's-eye view (BEV) maps in autonomous driving contexts. While existing perception systems for autonomous driving scenarios have largely focused on a pre-defined (closed) set of object categories and driving scenarios, Talk2BEV blends recent advances in general-purpose language and vision models with BEV-structured map representation…

    Submitted 14 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Project page at https://llmbev.github.io/talk2bev/

  35. arXiv:2307.01215  [pdf, ps, other]

    math.FA cs.IT

    Functional Donoho-Stark Approximate Support Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^n, \{ω_k\}_{k=1}^n)$ be two p-orthonormal bases for a finite dimensional Banach space $\mathcal{X}$. If $ x \in \mathcal{X}\setminus\{0\}$ is such that $θ_fx$ is $\varepsilon$-supported on $M\subseteq \{1,\dots, n\}$ w.r.t. p-norm and $θ_gx$ is $δ$-supported on $N\subseteq \{1,\dots, n\}$ w.r.t. p-norm, then we show that \begin{align}\la…

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 7 Pages, 0 Figures

    MSC Class: 42C15; 46B03; 46B04

  36. arXiv:2306.06093  [pdf, other]

    cs.CV

    HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork

    Authors: Bipasha Sen, Gaurav Singh, Aditya Agarwal, Rohith Agaram, K Madhava Krishna, Srinath Sridhar

    Abstract: Neural Radiance Fields (NeRF) have become an increasingly popular representation to capture high-quality appearance and shape of scenes and objects. However, learning generalizable NeRF priors over categories of scenes or objects has been challenging due to the high dimensionality of network weight space. To address the limitations of existing work on generalization, multi-view consistency and to…

    Submitted 23 December, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Project Page: https://hyp-nerf.github.io

  37. arXiv:2306.04939  [pdf, other]

    cs.RO

    UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images

    Authors: Vikrant Dewangan, Basant Sharma, Tushar Choudhary, Sarthak Sharma, Aakash Aanegola, Arun K. Singh, K. Madhava Krishna

    Abstract: Autonomous driving requires accurate reasoning of the location of objects from raw sensor data. Recent end-to-end learning methods go from raw sensor data to a trajectory output via Bird's Eye View (BEV) segmentation as an interpretable intermediate representation. Motion planning over cost maps generated via Bird's Eye View (BEV) segmentation has emerged as a prominent approach in autonomous drivin…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted to CASE 2023. Project video available at https://vikr-182.github.io/UAP-BEV

  38. Functional Ghobber-Jaming Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^n, \{ω_k\}_{k=1}^n)$ be two p-orthonormal bases for a finite dimensional Banach space $\mathcal{X}$. Let $M,N\subseteq \{1, \dots, n\}$ be such that \begin{align*} o(M)^\frac{1}{q}o(N)^\frac{1}{p}< \frac{1}{\displaystyle \max_{1\leq j,k\leq n}|g_k(τ_j) |}, \end{align*} where $q$ is the conjugate index of $p$. Then for all…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 7 Pages, 0 Figures

    MSC Class: 42C15; 46B03; 46B04

    Journal ref: Math Notes, 116, 1064-1071 (2024)

  39. Instance-Level Semantic Maps for Vision Language Navigation

    Authors: Laksh Nanwani, Anmol Agarwal, Kanishk Jain, Raghav Prabhakar, Aaron Monis, Aditya Mathur, Krishna Murthy, Abdul Hafez, Vineet Gandhi, K. Madhava Krishna

    Abstract: Humans have a natural ability to perform semantic associations with the surrounding objects in the environment. This allows them to create a mental map of the environment, allowing them to navigate on-demand when given linguistic instructions. A natural goal in Vision Language Navigation (VLN) research is to impart autonomous agents with similar capabilities. Recent works take a step towards this…

    Submitted 1 July, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Journal ref: IEEE RO-MAN 2023

  40. arXiv:2304.03324  [pdf, ps, other]

    math.FA cs.IT

    Functional Donoho-Stark-Elad-Bruckstein-Ricaud-Torrésani Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^m, \{ω_k\}_{k=1}^m)$ be p-Schauder frames for a finite dimensional Banach space $\mathcal{X}$. Then for every $x \in \mathcal{X}\setminus\{0\}$, we show that \begin{align} (1) \quad \|θ_f x\|_0^\frac{1}{p}\|θ_g x\|_0^\frac{1}{q} \geq \frac{1}{\displaystyle\max_{1\leq j\leq n, 1\leq k\leq m}|f_j(ω_k)|}\quad \text{and} \quad \|θ_g x\|_0^\f…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 5 Pages, 0 Figures

    MSC Class: 42C15

  41. arXiv:2304.01074  [pdf, other]

    cs.RO

    FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation

    Authors: Sudarshan S Harithas, Gurkirat Singh, Aneesh Chavan, Sarthak Sharma, Suraj Patni, Chetan Arora, K. Madhava Krishna

    Abstract: We focus on the problem of LiDAR point cloud based loop detection (or Finding) and closure (LDC) in a multi-agent setting. State-of-the-art (SOTA) techniques directly generate learned embeddings of a given point cloud, require large data transfers, and are not robust to wide variations in 6 Degrees-of-Freedom (DOF) viewpoint. Moreover, absence of strong priors in an unstructured point cloud leads…

    Submitted 3 April, 2023; originally announced April 2023.

  42. arXiv:2211.16882  [pdf, other]

    cs.CV cs.RO

    MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves

    Authors: Pranjali Pathre, Anurag Sahu, Ashwin Rao, Avinash Prabhu, Meher Shashwat Nigam, Tanvi Karandikar, Harit Pandya, K. Madhava Krishna

    Abstract: In this paper, we propose and showcase, for the first time, monocular multi-view layout estimation for warehouse racks and shelves. Unlike typical layout estimation methods, MVRackLay estimates multi-layered layouts, wherein each layer corresponds to the layout of a shelf within a rack. Given a sequence of images of a warehouse scene, a dual-headed Convolutional-LSTM architecture outputs segmented…

    Submitted 30 November, 2022; originally announced November 2022.

    Journal ref: IEEE International Conference on Robotics and Biomimetics (ROBIO) 2022

  43. arXiv:2210.07062  [pdf, ps, other]

    cs.IT math.FA math.NT

    Non-Archimedean Welch Bounds and Non-Archimedean Zauner Conjecture

    Authors: K. Mahesh Krishna

    Abstract: Let $\mathbb{K}$ be a non-Archimedean (complete) valued field satisfying \begin{align*} \left|\sum_{j=1}^{n}λ_j^2\right|=\max_{1\leq j \leq n}|λ_j|^2, \quad \forall λ_j \in \mathbb{K}, 1\leq j \leq n, \forall n \in \mathbb{N}. \end{align*} For $d\in \mathbb{N}$, let $\mathbb{K}^d$ be the standard $d$-dimensional non-Archimedean Hilbert space. Let $m \in \mathbb{N}$ and…

    Submitted 28 August, 2022; originally announced October 2022.

    Comments: 9 Pages, 0 Figures

    MSC Class: 12J25; 46S10; 47S10

  44. arXiv:2209.14922  [pdf, other]

    cs.CV cs.RO

    GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions

    Authors: Sanket Kalwar, Dhruv Patel, Aakash Aanegola, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

    Abstract: Detecting objects under adverse weather and lighting conditions is crucial for the safe and continuous operation of an autonomous vehicle, and remains an unsolved problem. We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture, which can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition ima…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Submitted to ICRA2023. More information at https://gatedip.github.io

  45. arXiv:2209.13418  [pdf, other]

    cs.CV cs.RO

    UAV-based Visual Remote Sensing for Automated Building Inspection

    Authors: Kushagra Srivastava, Dhruv Patel, Aditya Kumar Jha, Mohhit Kumar Jha, Jaskirat Singh, Ravi Kiran Sarvadevabhatla, Pradeep Kumar Ramancharla, Harikumar Kandath, K. Madhava Krishna

    Abstract: Unmanned Aerial Vehicle (UAV) based remote sensing system incorporated with computer vision has demonstrated potential for assisting building construction and in disaster management like damage assessment during earthquakes. The vulnerability of a building to earthquake can be assessed through inspection that takes into account the expected damage progression of the associated component and the co…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Paper accepted at CVCIE Workshop at ECCV, 2022 and the project page is https://uvrsabi.github.io/

  46. arXiv:2209.11972  [pdf, other]

    cs.CV

    Ground then Navigate: Language-guided Navigation in Dynamic Scenes

    Authors: Kanishk Jain, Varun Chhangani, Amogh Tiwari, K. Madhava Krishna, Vineet Gandhi

    Abstract: We investigate the Vision-and-Language Navigation (VLN) problem in the context of autonomous driving in outdoor settings. We solve the problem by explicitly grounding the navigable regions corresponding to the textual command. At each timestamp, the model predicts a segmentation mask corresponding to the intermediate or the final navigable region. Our work contrasts with existing efforts in VLN, w…

    Submitted 24 September, 2022; originally announced September 2022.

  47. arXiv:2209.04805  [pdf, other]

    cs.RO

    Real-Time Heuristic Framework for Safe Landing of UAVs in Dynamic Scenarios

    Authors: Jaskirat Singh, Neel Adwani, Harikumar Kandath, K. Madhava Krishna

    Abstract: The world we live in is full of technology and with each passing day the advancement and usage of UAVs increases efficiently. As a result of the many application scenarios, there are some missions where the UAVs are vulnerable to external disruptions, such as a ground station's loss of connectivity, security missions, safety concerns, and delivery-related missions. Therefore, depending on the scen…

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: 8 pages, 6 figures, 36 references

  48. arXiv:2208.03038  [pdf, other]

    cs.RO math.OC

    Leveraging Distributional Bias for Reactive Collision Avoidance under Uncertainty: A Kernel Embedding Approach

    Authors: Anish Gupta, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Many commodity sensors that measure the robot and dynamic obstacle's state have non-Gaussian noise characteristics. Yet, many current approaches treat the underlying-uncertainty in motion and perception as Gaussian, primarily to ensure computational tractability. On the other hand, existing planners working with non-Gaussian uncertainty do not shed light on leveraging distributional characteristic…

    Submitted 22 September, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

  49. arXiv:2207.03557  [pdf, other]

    cs.RO

    Flow Synthesis Based Visual Servoing Frameworks for Monocular Obstacle Avoidance Amidst High-Rises

    Authors: Harshit K. Sankhla, M. Nomaan Qureshi, Shankara Narayanan V., Vedansh Mittal, Gunjan Gupta, Harit Pandya, K. Madhava Krishna

    Abstract: We propose a novel flow synthesis based visual servoing framework enabling long-range obstacle avoidance for Micro Air Vehicles (MAV) flying amongst tall skyscrapers. Recent deep learning based frameworks use optical flow to do high-precision visual servoing. In this paper, we explore the question: can we design a surrogate flow for these high-precision visual-servoing methods, which leads to obst…

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted to IEEE International Conference on Automation Science and Engineering (CASE), 2022

  50. arXiv:2205.04090  [pdf, other]

    cs.RO

    Approaches and Challenges in Robotic Perception for Table-top Rearrangement and Planning

    Authors: Aditya Agarwal, Bipasha Sen, Shankara Narayanan V, Vishal Reddy Mandadi, Brojeshwar Bhowmick, K Madhava Krishna

    Abstract: Table-top Rearrangement and Planning is a challenging problem that relies heavily on an excellent perception stack. The perception stack involves observing and registering the 3D scene on the table, detecting what objects are on the table, and how to manipulate them. Consequently, it greatly influences the system's task-planning and motion-planning stacks that follow. We present a comprehensive ov…

    Submitted 3 June, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 5 pages including references, 3 figures