Showing 1–27 of 27 results for author: Bhatt, N

  1. arXiv:2509.19379  [pdf, ps, other]

    cs.LG cs.AI cs.RO stat.ML

    Learning from Observation: A Survey of Recent Advances

    Authors: Returaj Burnwal, Hriday Mehta, Nirav Pravinbhai Bhatt, Balaraman Ravindran

    Abstract: Imitation Learning (IL) algorithms offer an efficient way to train an agent by mimicking an expert's behavior without requiring a reward function. IL algorithms often necessitate access to state and action information from expert demonstrations. Although expert actions can provide detailed guidance, requiring such action information may prove impractical for real-world applications where expert ac…

    Submitted 20 September, 2025; originally announced September 2025.

  2. arXiv:2509.18592  [pdf, ps, other]

    cs.RO cs.AI cs.CV cs.LG eess.SY

    VLN-Zero: Rapid Exploration and Cache-Enabled Neurosymbolic Vision-Language Planning for Zero-Shot Transfer in Robot Navigation

    Authors: Neel P. Bhatt, Yunhao Yang, Rohan Siva, Pranay Samineni, Daniel Milan, Zhangyang Wang, Ufuk Topcu

    Abstract: Rapid adaptation in unseen environments is essential for scalable real-world autonomy, yet existing approaches rely on exhaustive exploration or rigid navigation policies that fail to generalize. We present VLN-Zero, a two-phase vision-language navigation framework that leverages vision-language models to efficiently construct symbolic scene graphs and enable zero-shot neurosymbolic navigation. In…

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: Codebase, datasets, and videos for VLN-Zero are available at: https://vln-zero.github.io/

  3. arXiv:2509.09619  [pdf, ps, other]

    cs.LG

    Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction

    Authors: Roshan Balaji, Joe Bobby, Nirav Pravinbhai Bhatt

    Abstract: Molecular property prediction using deep learning (DL) models has accelerated drug and materials discovery, but the resulting DL models often lack interpretability, hindering their adoption by chemists. This work proposes developing molecule representations using the concept of Functional Groups (FG) in chemistry. We introduce the Functional Group Representation (FGR) framework, a novel approach t…

    Submitted 11 September, 2025; originally announced September 2025.

  4. arXiv:2508.13408  [pdf, ps, other]

    cs.LG

    NovoMolGen: Rethinking Molecular Language Model Pretraining

    Authors: Kamran Chitsaz, Roshan Balaji, Quentin Fournier, Nirav Pravinbhai Bhatt, Sarath Chandar

    Abstract: Designing de-novo molecules with desired property profiles requires efficient exploration of the vast chemical space ranging from $10^{23}$ to $10^{60}$ possible synthesizable candidates. While various deep generative models have been developed to design small molecules using diverse input representations, Molecular Large Language Models (Mol-LLMs) based on string representations have emerged as a…

    Submitted 22 August, 2025; v1 submitted 18 August, 2025; originally announced August 2025.

  5. arXiv:2507.11352  [pdf, ps, other]

    cs.AI cs.FL

    Foundation Models for Logistics: Toward Certifiable, Conversational Planning Interfaces

    Authors: Yunhao Yang, Neel P. Bhatt, Christian Ellis, Alvaro Velasquez, Zhangyang Wang, Ufuk Topcu

    Abstract: Logistics operators, from battlefield coordinators rerouting airlifts ahead of a storm to warehouse managers juggling late trucks, often face life-critical decisions that demand both domain expertise and rapid and continuous replanning. While popular methods like integer programming yield logistics plans that satisfy user-defined logical constraints, they are slow and assume an idealized mathemati…

    Submitted 15 July, 2025; originally announced July 2025.

  6. arXiv:2505.05519  [pdf, other]

    cs.CV

    Real-Time Privacy Preservation for Robot Visual Perception

    Authors: Minkyu Choi, Yunhao Yang, Neel P. Bhatt, Kushagra Gupta, Sahil Shah, Aditya Rai, David Fridovich-Keil, Ufuk Topcu, Sandeep P. Chinchali

    Abstract: Many robots (e.g., iRobot's Roomba) operate based on visual observations from live video streams, and such observations may inadvertently include privacy-sensitive objects, such as personal identifiers. Existing approaches for preserving privacy rely on deep learning models, differential privacy, or cryptography. They lack guarantees for the complete concealment of all sensitive objects. Guarantee…

    Submitted 7 May, 2025; originally announced May 2025.

  7. arXiv:2503.14281  [pdf, other]

    cs.CR cs.LG cs.SE

    XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants

    Authors: Adam Štorek, Mukur Gupta, Noopur Bhatt, Aditya Gupta, Janie Kim, Prashast Srivastava, Suman Jana

    Abstract: AI coding assistants are widely used for tasks like code generation. These tools now require large and complex contexts, automatically sourced from various origins -- across files, projects, and contributors -- forming part of the prompt fed to underlying LLMs. This automatic context-gathering introduces new vulnerabilities, allowing attackers to subtly poison input to co…

    Submitted 20 May, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  8. arXiv:2502.05150  [pdf, other]

    cs.CL

    CodeSCM: Causal Analysis for Multi-Modal Code Generation

    Authors: Mukur Gupta, Noopur Bhatt, Suman Jana

    Abstract: In this paper, we propose CodeSCM, a Structural Causal Model (SCM) for analyzing multi-modal code generation using large language models (LLMs). By applying interventions to CodeSCM, we measure the causal effects of different prompt modalities, such as natural language, code, and input-output examples, on the model. CodeSCM introduces latent mediator variables to separate the code and natural lang…

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted to NAACL 2025

  9. arXiv:2411.01639  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Know Where You're Uncertain When Planning with Multimodal Foundation Models: A Formal Framework

    Authors: Neel P. Bhatt, Yunhao Yang, Rohan Siva, Daniel Milan, Ufuk Topcu, Zhangyang Wang

    Abstract: Multimodal foundation models offer a promising framework for robotic perception and planning by processing sensory inputs to generate actionable plans. However, addressing uncertainty in both perception (sensory interpretation) and decision-making (plan generation) remains a critical challenge for ensuring task reliability. We present a comprehensive framework to disentangle, quantify, and mitigat…

    Submitted 16 April, 2025; v1 submitted 3 November, 2024; originally announced November 2024.

    Comments: Fine-tuned models, code, and datasets are available at https://uncertainty-in-planning.github.io/

  10. arXiv:2409.19924  [pdf, other]

    cs.AI cs.LG cs.RO

    On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability

    Authors: Kevin Wang, Junbo Li, Neel P. Bhatt, Yihan Xi, Qiang Liu, Ufuk Topcu, Zhangyang Wang

    Abstract: Recent advancements in Large Language Models (LLMs) have showcased their ability to perform complex reasoning tasks, but their effectiveness in planning remains underexplored. In this study, we evaluate the planning capabilities of OpenAI's o1 models across a variety of benchmark tasks, focusing on three key aspects: feasibility, optimality, and generalizability. Through empirical evaluations on c…

    Submitted 13 October, 2024; v1 submitted 29 September, 2024; originally announced September 2024.

    Comments: Code available at https://github.com/VITA-Group/o1-planning

  11. arXiv:2409.00639  [pdf, other]

    cs.CV cs.AI

    Artificial Intelligence in Gastrointestinal Bleeding Analysis for Video Capsule Endoscopy: Insights, Innovations, and Prospects (2008-2023)

    Authors: Tanisha Singh, Shreshtha Jha, Nidhi Bhatt, Palak Handa, Nidhi Goel, Sreedevi Indu

    Abstract: The escalating global mortality and morbidity rates associated with gastrointestinal (GI) bleeding, compounded by the complexities and limitations of traditional endoscopic methods, underscore the urgent need for a critical review of current methodologies used for addressing this condition. With an estimated 300,000 annual deaths worldwide, the demand for innovative diagnostic and therapeutic stra…

    Submitted 1 September, 2024; originally announced September 2024.

  12. arXiv:2404.00923  [pdf, other]

    cs.CV cs.AI cs.RO

    MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements

    Authors: Lisong C. Sun, Neel P. Bhatt, Jonathan C. Liu, Zhiwen Fan, Zhangyang Wang, Todd E. Humphreys, Ufuk Topcu

    Abstract: Simultaneous localization and mapping is essential for position tracking and scene understanding. 3D Gaussian-based map representations enable photorealistic reconstruction and real-time rendering of scenes using multiple posed cameras. We show for the first time that using 3D Gaussians for map representation with unposed camera images and inertial measurements can enable accurate SLAM. Our method…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Project Webpage: https://vita-group.github.io/MM3DGS-SLAM

  13. arXiv:2403.16993  [pdf, other]

    cs.CV

    Comp4D: LLM-Guided Compositional 4D Scene Generation

    Authors: Dejia Xu, Hanwen Liang, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Plataniotis, Zhangyang Wang

    Abstract: Recent advancements in diffusion models for 2D and 3D content creation have sparked a surge of interest in generating 4D content. However, the scarcity of 3D scene datasets constrains current methodologies to primarily object-centric generation. To overcome this limitation, we present Comp4D, a novel framework for Compositional 4D Generation. Unlike conventional methods that generate a singular 4D…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project page: https://vita-group.github.io/Comp4D/

  14. arXiv:2312.00938  [pdf, other]

    cs.RO cs.AI cs.CV

    WATonoBus: Field-Tested All-Weather Autonomous Shuttle Technology

    Authors: Neel P. Bhatt, Ruihe Zhang, Minghao Ning, Ahmad Reza Alghooneh, Joseph Sun, Pouya Panahandeh, Ehsan Mohammadbagher, Ted Ecclestone, Ben MacCallum, Ehsan Hashemi, Amir Khajepour

    Abstract: All-weather autonomous vehicle operation poses significant challenges, encompassing modules from perception and decision-making to path planning and control. The complexity arises from the need to address adverse weather conditions such as rain, snow, and fog across the autonomy stack. Conventional model-based single-module approaches often lack holistic integration with upstream or downstream tas…

    Submitted 14 August, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 8 pages, 10 figures. This work has been submitted to the ITSC for possible publication

  15. arXiv:2310.18239  [pdf, other]

    cs.AI cs.CL cs.FL cs.RO

    Fine-Tuning Language Models Using Formal Methods Feedback

    Authors: Yunhao Yang, Neel P. Bhatt, Tyler Ingebrand, William Ward, Steven Carr, Zhangyang Wang, Ufuk Topcu

    Abstract: Although pre-trained language models encode generic knowledge beneficial for planning and control, they may fail to generate appropriate control policies for domain-specific tasks. Existing fine-tuning methods use human feedback to address this limitation, however, sourcing human feedback is labor intensive and costly. We present a fully automated approach to fine-tune pre-trained language models…

    Submitted 27 October, 2023; originally announced October 2023.

  16. arXiv:2305.19111  [pdf, other]

    cs.RO cs.AI cs.LG

    GAN-MPC: Training Model Predictive Controllers with Parameterized Cost Functions using Demonstrations from Non-identical Experts

    Authors: Returaj Burnwal, Anirban Santara, Nirav P. Bhatt, Balaraman Ravindran, Gaurav Aggarwal

    Abstract: Model predictive control (MPC) is a popular approach for trajectory optimization in practical robotics applications. MPC policies can optimize trajectory parameters under kinodynamic and safety constraints and provide guarantees on safety, optimality, generalizability, interpretability, and explainability. However, some behaviors are complex and it is difficult to hand-craft an MPC objective funct…

    Submitted 7 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Recipient of the best paper award at RBCDSAI-DAI 2023, IIT Madras (https://rbcdsai.iitm.ac.in/DAI-2023/)

  17. arXiv:2212.03420  [pdf, other]

    cs.RO eess.SY

    What Happens When Pneu-Net Soft Robotic Actuators Get Fatigued?

    Authors: Jacqueline Libby, Aniket A. Somwanshi, Federico Stancati, Gayatri Tyagi, Aadit Patel, Naigam Bhatt, JohnRoss Rizzo, S. Farokh Atashzar

    Abstract: Soft actuators have attracted a great deal of interest in the context of rehabilitative and assistive robots for increasing safety and lowering costs as compared to rigid-body robotic systems. During actuation, soft actuators experience high levels of deformation, which can lead to microscale fractures in their elastomeric structure, which fatigues the system over time and eventually leads to macr…

    Submitted 6 December, 2022; originally announced December 2022.

  18. arXiv:2205.08109  [pdf]

    cs.LG cs.AI eess.SP

    Forecasting Solar Power Generation on the basis of Predictive and Corrective Maintenance Activities

    Authors: Soham Vyas, Yuvraj Goyal, Neel Bhatt, Sanskar Bhuwania, Hardik Patel, Shakti Mishra, Brijesh Tripathi

    Abstract: Solar energy forecasting has seen tremendous growth in the last decade using historical time series collected from a weather station, such as weather variables wind speed and direction, solar radiance, and temperature. It helps in the overall management of solar power plants. However, the solar power plant regularly requires preventive and corrective maintenance activities that further impact ener…

    Submitted 17 May, 2022; originally announced May 2022.

  19. arXiv:2108.02233  [pdf, other]

    eess.IV cs.CV cs.LG

    Unsupervised Detection of Lung Nodules in Chest Radiography Using Generative Adversarial Networks

    Authors: Nitish Bhatt, David Ramon Prados, Nedim Hodzic, Christos Karanassios, H. R. Tizhoosh

    Abstract: Lung nodules are commonly missed in chest radiographs. We propose and evaluate P-AnoGAN, an unsupervised anomaly detection approach for lung nodules in radiographs. P-AnoGAN modifies the fast anomaly detection generative adversarial network (f-AnoGAN) by utilizing a progressive GAN and a convolutional encoder-decoder-encoder pipeline. Model training uses only unlabelled healthy lung patches extrac…

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: Accepted in EMBC 2021: 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society

  20. arXiv:2106.06801   

    cs.CV

    Contrastive Semi-Supervised Learning for 2D Medical Image Segmentation

    Authors: Prashant Pandey, Ajey Pai, Nisarg Bhatt, Prasenjit Das, Govind Makharia, Prathosh AP, Mausam

    Abstract: Contrastive Learning (CL) is a recent representation learning approach, which encourages inter-class separability and intra-class compactness in learned image representations. Since medical images often contain multiple semantic classes in an image, using CL to learn representations of local features (as opposed to global) is important. In this work, we present a novel semi-supervised 2D medical s…

    Submitted 6 August, 2021; v1 submitted 12 June, 2021; originally announced June 2021.

    Comments: The paper is withdrawn due to a bug in experimental protocol that renders its experimental results and observations invalid. All expts were conducted by the student authors. The roles of senior authors (Prasenjit Das, Govind Makharia, Prathosh, and Mausam) were in defining the problem statement, discussions of potential solutions and framing of the paper and not in performing experiments

  21. arXiv:2101.06901  [pdf, other]

    cs.RO eess.SY

    Soft Constrained Autonomous Vehicle Navigation using Gaussian Processes and Instance Segmentation

    Authors: Bruno H. Groenner Barbosa, Neel P. Bhatt, Amir Khajepour, Ehsan Hashemi

    Abstract: This paper presents a generic feature-based navigation framework for autonomous vehicles using a soft constrained Particle Filter. Selected map features, such as road and landmark locations, and vehicle states are used for designing soft constraints. After obtaining features of mapped landmarks in instance-based segmented images acquired from a monocular camera, vehicle-to-landmark distances are p…

    Submitted 18 January, 2021; originally announced January 2021.

  22. Automated Repair of Resource Leaks in Android Applications

    Authors: Bhargav Nagaraja Bhatt, Carlo A. Furia

    Abstract: Resource leaks -- a program does not release resources it previously acquired -- are a common kind of bug in Android applications. Even with the help of existing techniques to automatically detect leaks, writing a leak-free program remains tricky. One of the reasons is Android's event-driven programming model, which complicates the understanding of an application's overall control flow. In this…

    Submitted 28 June, 2022; v1 submitted 6 March, 2020; originally announced March 2020.

  23. arXiv:1905.08716  [pdf, other]

    cs.LG cs.SI stat.ML

    Learning Conserved Networks from Flows

    Authors: Satya Jayadev P., Shankar Narasimhan, Nirav Bhatt

    Abstract: A challenging problem in complex networks is the network reconstruction problem from data. This work deals with a class of networks denoted as conserved networks, in which a flow associated with every edge and the flows are conserved at all non-source and non-sink nodes. We propose a novel polynomial time algorithm to reconstruct conserved networks from flow data by exploiting graph theoretic prop…

    Submitted 12 April, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

  24. arXiv:1609.02678  [pdf, ps, other]

    eess.SY cs.LG

    Identifying Topology of Power Distribution Networks Based on Smart Meter Data

    Authors: Jayadev P Satya, Nirav Bhatt, Ramkrishna Pasumarthy, Aravind Rajeswaran

    Abstract: In a power distribution network, the network topology information is essential for an efficient operation of the network. This information of network connectivity is not accurately available, at the low voltage level, due to uninformed changes that happen from time to time. In this paper, we propose a novel data-driven approach to identify the underlying network topology including the load phase…

    Submitted 9 September, 2016; originally announced September 2016.

    Comments: Submitted to IEEE Transactions on Smart Grid

  25. arXiv:1511.06063  [pdf, ps, other]

    cs.LG stat.AP stat.ML

    A Novel Approach for Phase Identification in Smart Grids Using Graph Theory and Principal Component Analysis

    Authors: P Satya Jayadev, Aravind Rajeswaran, Nirav P Bhatt, Ramkrishna Pasumarthy

    Abstract: Consumers with low demand, like households, are generally supplied single-phase power by connecting their service mains to one of the phases of a distribution transformer. The distribution companies face the problem of keeping a record of consumer connectivity to a phase due to uninformed changes that happen. The exact phase connectivity information is important for the efficient operation and con…

    Submitted 7 June, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Accepted for presentation at ACC 16

  26. Deconstructing Principal Component Analysis Using a Data Reconciliation Perspective

    Authors: Shankar Narasimhan, Nirav Bhatt

    Abstract: Data reconciliation (DR) and Principal Component Analysis (PCA) are two popular data analysis techniques in process industries. Data reconciliation is used to obtain accurate and consistent estimates of variables and parameters from erroneous measurements. PCA is primarily used as a method for reducing the dimensionality of high dimensional data and as a preprocessing technique for denoising measu…

    Submitted 2 May, 2015; originally announced May 2015.

    ACM Class: I.2

    Journal ref: Computers and Chemical Engineering 77 (2015) 74-84

  27. Monotonous (Semi-)Nonnegative Matrix Factorization

    Authors: Nirav Bhatt, Arun Ayyar

    Abstract: Nonnegative matrix factorization (NMF) factorizes a non-negative matrix into product of two non-negative matrices, namely a signal matrix and a mixing matrix. NMF suffers from the scale and ordering ambiguities. Often, the source signals can be monotonous in nature. For example, in source separation problem, the source signals can be monotonously increasing or decreasing while the mixing matrix ca…

    Submitted 1 May, 2015; originally announced May 2015.

    ACM Class: I.2