Showing 1–4 of 4 results for author: Vattikonda, D

  1. arXiv:2507.04103

    cs.AI cs.LG stat.ML

    How to Train Your LLM Web Agent: A Statistical Diagnosis

    Authors: Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza, Hadi Nekoei, Megh Thakkar, Thibault Le Sellier de Chezelles, Nicolas Gontier, Miguel Muñoz-Mármol, Sahar Omidi Shayegan, Stefania Raimondo, Xue Liu, Alexandre Drouin, Laurent Charlin, Alexandre Piché, Alexandre Lacoste, Massimo Caccia

    Abstract: LLM-based web agents have recently made significant progress, but much of it has occurred in closed-source systems, widening the gap with open-source alternatives. Progress has been held back by two key challenges: first, a narrow focus on single-step tasks that overlooks the complexity of multi-step web interactions; and second, the high compute costs required to post-train LLM-based web agents.…

    Submitted 5 July, 2025; originally announced July 2025.

  2. arXiv:2504.03089

    cs.CV

    SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections

    Authors: Prashant Kumar, Dheeraj Vattikonda, Kshitij Madhav Bhat, Kunal Dargan, Prem Kalra

    Abstract: The widespread adoption of learning-based methods for the LiDAR makes autonomous vehicles vulnerable to adversarial attacks through adversarial point injections (PiJ). It poses serious security challenges for navigation and map generation. Despite its critical nature, no major work exists that studies learning-based attacks on LiDAR-based SLAM. Our work proposes SLACK, an end-to-end deep…

    Submitted 3 April, 2025; originally announced April 2025.

  3. arXiv:2407.03471

    cs.CV

    Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

    Authors: Benno Krojer, Dheeraj Vattikonda, Luis Lara, Varun Jampani, Eva Portelance, Christopher Pal, Siva Reddy

    Abstract: An image editing model should be able to perform diverse edits, ranging from object replacement, changing attributes or style, to performing actions or movement, which require many forms of reasoning. Current general instruction-guided editing models have significant shortcomings with action and reasoning-centric edits. Object, attribute or stylistic changes can be learned from visually static dat…

    Submitted 17 October, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: NeurIPS 2024 (Dataset & Benchmarks)

  4. arXiv:2309.09206

    cs.RO cs.CV cs.LG

    Differentiable SLAM Helps Deep Learning-based LiDAR Perception Tasks

    Authors: Prashant Kumar, Dheeraj Vattikonda, Vedang Bhupesh Shenvi Nadkarni, Erqun Dong, Sabyasachi Sahoo

    Abstract: We investigate a new paradigm that uses differentiable SLAM architectures in a self-supervised manner to train end-to-end deep learning models in various LiDAR based applications. To the best of our knowledge there does not exist any work that leverages SLAM as a training signal for deep learning based models. We explore new ways to improve the efficiency, robustness, and adaptability of LiDAR sys…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 15 pages, 6 tables, 3 figures. Accepted at BMVC 2023