Showing 1–50 of 164 results for author: Deshpande, A

Searching in archive cs.
  1. arXiv:2505.03536  [pdf, other]

    cs.DB

    Beyond Relations: A Case for Elevating to the Entity-Relationship Abstraction

    Authors: Amol Deshpande

    Abstract: Spurred by a number of recent trends, we make the case that the relational database systems should urgently move beyond supporting the basic object-relational model and instead embrace a more abstract data model, specifically, the entity-relationship model. We argue that the current RDBMSs don't inherently support sufficient "logical" data independence, and that is relegating the database systems…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 14th Annual Conference on Innovative Data Systems Research (CIDR'25), January 19-22, 2025, Amsterdam, The Netherlands

  2. arXiv:2504.08706  [pdf, other]

    cs.RO

    BiFlex: A Passive Bimodal Stiffness Flexible Wrist for Manipulation in Unstructured Environments

    Authors: Gu-Cheol Jeong, Stefano Dalla Gasperina, Ashish D. Deshpande, Lillian Chin, Roberto Martín-Martín

    Abstract: Robotic manipulation in unstructured, human-centric environments poses a dual challenge: achieving the precision needed for delicate free-space operation while ensuring safety during unexpected contact events. Traditional wrists struggle to balance these demands, often relying on complex control schemes or complicated mechanical designs to mitigate potential damage from force overload. In response, w…

    Submitted 14 May, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: 8 pages, 10 figures

  3. arXiv:2503.19115  [pdf, other]

    q-bio.MN cs.NE

    Implementation of Support Vector Machines using Reaction Networks

    Authors: Amey Choudhary, Jiaxin Jin, Abhishek Deshpande

    Abstract: Can machine learning algorithms be implemented using chemical reaction networks? We demonstrate that this is possible in the case of support vector machines (SVMs). SVMs are powerful tools for data classification, leveraging VC theory to handle high-dimensional data and small datasets effectively. In this work, we propose a reaction network scheme for implementing SVMs, utilizing the steady-state…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 26 pages, 4 figures

  4. Enforcing Cybersecurity Constraints for LLM-driven Robot Agents for Online Transactions

    Authors: Shraddha Pradipbhai Shah, Aditya Vilas Deshpande

    Abstract: The integration of Large Language Models (LLMs) into autonomous robotic agents for conducting online transactions poses significant cybersecurity challenges. This study aims to enforce robust cybersecurity constraints to mitigate the risks associated with data breaches, transaction fraud, and system manipulation. The background focuses on the rise of LLM-driven robotic systems in e-commerce, finan…

    Submitted 16 March, 2025; originally announced March 2025.

  5. arXiv:2503.15228  [pdf, other]

    cs.NI eess.SP

    Sensing-Based Beamformed Resource Allocation in Standalone Millimeter-Wave Vehicular Networks

    Authors: Alessandro Traspadini, Anay Ajit Deshpande, Marco Giordani, Chinmay Mahabal, Takayuki Shimizu, Michele Zorzi

    Abstract: In 3GPP New Radio (NR) Vehicle-to-Everything (V2X), the new standard for next-generation vehicular networks, vehicles can autonomously select sidelink resources for data transmission, which permits network operations without cellular coverage. However, standalone resource allocation is uncoordinated, and is complicated by the high mobility of the nodes that may introduce unforeseen channel collisi…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 7 pages, 8 figures, 3 tables. Accepted for publication in the 2025 IEEE International Conference on Communications (ICC). © 2025 IEEE. A. Traspadini, A. A. Deshpande, M. Giordani, C. Mahabal, T. Shimizu, and M. Zorzi, "Sensing-Based Beamformed Resource Allocation in Standalone Millimeter-Wave Vehicular Networks," in Proc. IEEE International Conference on Communications (ICC), 2025

  6. arXiv:2503.02956  [pdf, other]

    cs.DB

    TreeCat: Standalone Catalog Engine for Large Data Systems

    Authors: Keonwoo Oh, Pooja Nilangekar, Amol Deshpande

    Abstract: With ever increasing volume and heterogeneity of data, advent of new specialized compute engines, and demand for complex use cases, large scale data systems require a performant catalog system that can satisfy diverse needs. We argue that existing solutions, including recent lakehouse storage formats, have fundamental limitations and that there is a strong motivation for a specialized database eng…

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: submitted to PVLDB Volume 18 (for VLDB 2025)

  7. arXiv:2502.02711  [pdf, other]

    cs.CE cs.PL

    Tensor Network Structure Search Using Program Synthesis

    Authors: Zheng Guo, Aditya Deshpande, Brian Kiedrowski, Xinyu Wang, Alex Gorodetsky

    Abstract: Tensor networks provide a powerful framework for compressing multi-dimensional data. The optimal tensor network structure for a given data tensor depends on both the inherent data properties and the specific optimality criteria, making tensor network structure search a crucial research problem. Existing solutions typically involve sampling and validating numerous candidate structures; this is comp…

    Submitted 22 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

  8. arXiv:2412.16323  [pdf, other]

    cs.DB

    Optimizing Queries with Many-to-Many Joins

    Authors: Hasara Kalumin, Amol Deshpande

    Abstract: As database query processing techniques are being used to handle diverse workloads, a key emerging challenge is how to efficiently handle multi-way join queries containing multiple many-to-many joins. While uncommon in traditional enterprise settings that have been the focus of much of the query optimization work to date, such queries are seen frequently in other contexts such as graph workloads.…

    Submitted 24 April, 2025; v1 submitted 20 December, 2024; originally announced December 2024.

  9. arXiv:2412.03084  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.QM

    Hybrid deep learning-based strategy for the hepatocellular carcinoma cancer grade classification of H&E stained liver histopathology images

    Authors: Ajinkya Deshpande, Deep Gupta, Ankit Bhurane, Nisha Meshram, Sneha Singh, Petia Radeva

    Abstract: Hepatocellular carcinoma (HCC) is a common type of liver cancer whose early-stage diagnosis is a common challenge, mainly due to the manual assessment of hematoxylin and eosin-stained whole slide images, which is a time-consuming process and may lead to variability in decision-making. For accurate detection of HCC, we propose a hybrid deep learning-based architecture that uses transfer learning to…

    Submitted 28 February, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

    Comments: 14 figures, 9 tables

  10. arXiv:2412.00451  [pdf, other]

    cs.CV

    A conditional Generative Adversarial network model for the Weather4Cast 2024 Challenge

    Authors: Atharva Deshpande, Kaushik Gopalan, Jeet Shah, Hrishikesh Simu

    Abstract: This study explores the application of deep learning for rainfall prediction, leveraging the Spinning Enhanced Visible and Infrared Imager (SEVIRI) High rate information transmission (HRIT) data as input and the Operational Program on the Exchange of weather RAdar information (OPERA) ground-radar reflectivity data as ground truth. We use the mean of 4 InfraRed frequency channels as the input. The…

    Submitted 30 November, 2024; originally announced December 2024.

  11. arXiv:2410.02172  [pdf, other]

    cs.LG cs.AI stat.ML

    Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation

    Authors: Shreyas Chaudhari, Ameet Deshpande, Bruno Castro da Silva, Philip S. Thomas

    Abstract: Evaluating policies using off-policy data is crucial for applying reinforcement learning to real-world problems such as healthcare and autonomous driving. Previous methods for off-policy evaluation (OPE) generally suffer from high variance or irreducible bias, leading to unacceptably high prediction errors. In this work, we introduce STAR, a framework for OPE that encompasses a broad range of esti…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted at the Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

  12. arXiv:2407.18416  [pdf, other]

    cs.CL cs.AI cs.LG

    PersonaGym: Evaluating Persona Agents and LLMs

    Authors: Vinay Samuel, Henry Peng Zou, Yue Zhou, Shreyas Chaudhari, Ashwin Kalyan, Tanmay Rajpurohit, Ameet Deshpande, Karthik Narasimhan, Vishvak Murahari

    Abstract: Persona agents, which are LLM agents that act according to an assigned persona, have demonstrated impressive contextual response capabilities across various applications. These persona agents offer significant enhancements across diverse sectors, such as education, healthcare, and entertainment, where model developers can align agent responses to different user requirements thereby broadening the…

    Submitted 18 December, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 21 pages, 5 figures

  13. arXiv:2406.19150  [pdf, other]

    cs.CV cs.AI cs.IR

    RAVEN: Multitask Retrieval Augmented Vision-Language Learning

    Authors: Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar Satzoda, Srikar Appalaraju

    Abstract: The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to vision-language models (VLMs) is under explored. Existing methods focus on models designed for single tasks. Furthermore, they're limited by the need for resour…

    Submitted 27 June, 2024; originally announced June 2024.

  14. arXiv:2406.03142  [pdf, ps, other]

    cs.LG

    On the Power of Randomization in Fair Classification and Representation

    Authors: Sushant Agarwal, Amit Deshpande

    Abstract: Fair classification and fair representation learning are two important problems in supervised and unsupervised fair machine learning, respectively. Fair classification asks for a classifier that maximizes accuracy on a given data distribution subject to fairness constraints. Fair representation maps a given data distribution over the original feature space to a distribution over a new representati…

    Submitted 7 October, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Appeared in ACM FAccT 2022

  15. arXiv:2405.19307  [pdf, other]

    cs.RO

    Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels

    Authors: Abhay Deshpande, Liyiming Ke, Quinn Pfeifer, Abhishek Gupta, Siddhartha S. Srinivasa

    Abstract: We consider imitation learning with access only to expert demonstrations, whose real-world application is often limited by covariate shift due to compounding errors during execution. We investigate the effectiveness of the Continuity-based Corrective Labels for Imitation Learning (CCIL) framework in mitigating this issue for real-world fine manipulation tasks. CCIL generates corrective labels by l…

    Submitted 21 October, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Presented at IROS 2024

  16. arXiv:2405.04325  [pdf, other]

    cs.CL

    Deception in Reinforced Autonomous Agents

    Authors: Atharvan Dogra, Krishna Pillutla, Ameet Deshpande, Ananya B Sai, John Nay, Tanmay Rajpurohit, Ashwin Kalyan, Balaraman Ravindran

    Abstract: We explore the ability of large language model (LLM)-based agents to engage in subtle deception such as strategically phrasing and intentionally manipulating information to misguide and deceive other agents. This harmful behavior can be hard to detect, unlike blatant lying or unintentional hallucination. We build an adversarial testbed mimicking a legislative environment where two LLMs play opposi…

    Submitted 4 October, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  17. arXiv:2405.01573  [pdf, other]

    cs.SE cs.AI

    Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

    Authors: Ajinkya Deshpande, Anmol Agarwal, Shashank Shet, Arun Iyer, Aditya Kanade, Ramakrishna Bairi, Suresh Parthasarathy

    Abstract: LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, ne…

    Submitted 5 June, 2024; v1 submitted 21 April, 2024; originally announced May 2024.

    Comments: Preprint with additional experiments

  18. arXiv:2404.08555  [pdf, other]

    cs.LG cs.AI cs.CL

    RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

    Authors: Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

    Abstract: State-of-the-art large language models (LLMs) have become indispensable tools for various tasks. However, training LLMs to serve as effective assistants for humans requires careful consideration. A promising approach is reinforcement learning from human feedback (RLHF), which leverages human feedback to update the model in accordance with human preferences and mitigate issues like toxicity and hal…

    Submitted 15 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  19. arXiv:2403.05749  [pdf, other]

    eess.SY cs.DM

    Characterizing Flow Complexity in Transportation Networks using Graph Homology

    Authors: Shashank A Deshpande, Hamsa Balakrishnan

    Abstract: Series-parallel network topologies generally exhibit simplified dynamical behavior and avoid high combinatorial complexity. A comprehensive analysis of how flow complexity emerges with a graph's deviation from series-parallel topology is therefore of fundamental interest. We introduce the notion of a robust $k$-path on a directed acyclic graph, with increasing values of the length $k$ reflecting…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 3 figures, letter

  20. arXiv:2402.11741  [pdf, other]

    cs.DS cs.CC cs.DB cs.DC

    To Store or Not to Store: a graph theoretical approach for Dataset Versioning

    Authors: Anxin Guo, Jingwei Li, Pattara Sukprasert, Samir Khuller, Amol Deshpande, Koyel Mukherjee

    Abstract: In this work, we study the cost efficient data versioning problem, where the goal is to optimize the storage and reconstruction (retrieval) costs of data versions, given a graph of datasets as nodes and edges capturing edit/delta information. One central variant we study is MinSum Retrieval (MSR) where the goal is to minimize the total retrieval costs, while keeping the storage costs bounded. This…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted by IPDPS 2024

  21. arXiv:2402.06733  [pdf, other]

    cs.CL cs.AI cs.LG

    NICE: To Optimize In-Context Examples or Not?

    Authors: Pragya Srivastava, Satvik Golechha, Amit Deshpande, Amit Sharma

    Abstract: Recent work shows that in-context learning and optimization of in-context examples (ICE) can significantly improve the accuracy of large language models (LLMs) on a wide range of tasks, leading to an apparent consensus that ICE optimization is crucial for better performance. However, most of these studies assume a fixed or no instruction provided in the prompt. We challenge this consensus by inves…

    Submitted 6 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted as a full paper (9 pages) at ACL 2024 (Main)

    Journal ref: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics 2024 (Volume 1: Long Papers)

  22. arXiv:2312.16735  [pdf, other]

    cs.DB cs.DC

    Flock: A Low-Cost Streaming Query Engine on FaaS Platforms

    Authors: Gang Liao, Amol Deshpande, Daniel J. Abadi

    Abstract: Existing serverless data analytics systems rely on external storage services like S3 for data shuffling and communication between cloud functions. While this approach provides the elasticity benefits of serverless computing, it incurs additional latency and cost overheads. We present Flock, a novel cloud-native streaming query engine that leverages the on-demand scalability of FaaS platforms for r…

    Submitted 21 April, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

  23. arXiv:2312.10534  [pdf, other]

    cs.LG cs.CR cs.CV

    Rethinking Robustness of Model Attributions

    Authors: Sandesh Kamath, Sankalp Mittal, Amit Deshpande, Vineeth N Balasubramanian

    Abstract: For machine learning models to be reliable and trustworthy, their decisions must be interpretable. As these models find increasing use in safety-critical applications, it is important that not just the model predictions but also their explanations (as feature attributions) be robust to small human-imperceptible input perturbations. Recent works have shown that many attribution methods are fragile…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted AAAI 2024

  24. arXiv:2312.10396  [pdf, ps, other]

    cs.LG cs.AI

    How Far Can Fairness Constraints Help Recover From Biased Data?

    Authors: Mohit Sharma, Amit Deshpande

    Abstract: A general belief in fair classification is that fairness constraints incur a trade-off with accuracy, which biased data may worsen. Contrary to this belief, Blum & Stangl (2019) show that fair classification with equal opportunity constraints even on extremely biased data can recover optimally accurate and fair classifiers on the original data distribution. Their result is interesting because it d…

    Submitted 1 June, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted for publication at ICML 2024

  25. arXiv:2312.05323  [pdf, other]

    cs.RO

    BaRiFlex: A Robotic Gripper with Versatility and Collision Robustness for Robot Learning

    Authors: Gu-Cheol Jeong, Arpit Bahety, Gabriel Pedraza, Ashish D. Deshpande, Roberto Martín-Martín

    Abstract: We present a new approach to robot hand design specifically suited for successfully implementing robot learning methods to accomplish tasks in daily human environments. We introduce BaRiFlex, an innovative gripper design that alleviates the issues caused by unexpected contact and collisions during robot learning, offering robustness, grasping versatility, task versatility, and simplicity to the le…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 8 pages, 6 figures, project website: https://robin-lab.cs.utexas.edu/bariflex/

  26. arXiv:2312.04294  [pdf, ps, other]

    cs.NI

    Energy-Efficient Internet of Things Monitoring with Content-Based Wake-Up Radio

    Authors: Anay Ajit Deshpande, Federico Chiariotti, Andrea Zanella

    Abstract: The use of Wake-Up Radio (WUR) in Internet of Things (IoT) networks can significantly improve their energy efficiency: battery-powered sensors can remain in a low-power (sleep) mode while listening for wake-up messages using their WUR and reactivate only when polled. However, polling-based WUR may still lead to wasted energy if values sensed by the polled sensors provide no new information to the…

    Submitted 7 December, 2023; originally announced December 2023.

  27. arXiv:2312.00348  [pdf, other]

    cs.CV

    Student Activity Recognition in Classroom Environments using Transfer Learning

    Authors: Anagha Deshpande, Vedant Deshpande

    Abstract: The recent advances in artificial intelligence and deep learning facilitate automation in various applications including home automation, smart surveillance systems, and healthcare among others. Human Activity Recognition is one of its emerging applications, which can be implemented in a classroom environment to enhance safety, efficiency, and overall educational quality. This paper proposes a sys…

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 6 pages, 12 figures, accepted at the IEEE International Conference on Computational Intelligence, Networks and Security (ICCINS) 2023

  28. arXiv:2311.09735  [pdf, other]

    cs.LG cs.IR

    GEO: Generative Engine Optimization

    Authors: Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande

    Abstract: The advent of large language models (LLMs) has ushered in a new paradigm of search engines that use generative models to gather and summarize information to answer user queries. This emerging technology, which we formalize under the unified framework of generative engines (GEs), can generate accurate and personalized responses, rapidly replacing traditional search engines like Google and Bing. Gen…

    Submitted 28 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to KDD 2024

  29. arXiv:2311.04892  [pdf, other]

    cs.CL

    Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

    Authors: Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot

    Abstract: Recent works have showcased the ability of LLMs to embody diverse personas in their responses, exemplified by prompts like 'You are Yoda. Explain the Theory of Relativity.' While this ability allows personalization of LLMs and enables human behavior simulation, its effect on LLMs' capabilities remains unclear. To fill this gap, we present the first extensive study of the unintended side-effects of…

    Submitted 27 January, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Project page: https://allenai.github.io/persona-bias. Paper to appear at ICLR 2024. Added results for other LLMs in v2 (similar findings)

  30. arXiv:2311.02807  [pdf, other]

    cs.LG cs.AI cs.CL

    QualEval: Qualitative Evaluation for Model Improvement

    Authors: Vishvak Murahari, Ameet Deshpande, Peter Clark, Tanmay Rajpurohit, Ashish Sabharwal, Karthik Narasimhan, Ashwin Kalyan

    Abstract: Quantitative evaluation metrics have traditionally been pivotal in gauging the advancements of artificial intelligence systems, including large language models (LLMs). However, these metrics have inherent limitations. Given the intricate nature of real-world tasks, a single scalar to quantify and compare is insufficient to capture the fine-grained nuances of model behavior. Metrics serve only as a…

    Submitted 5 May, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  31. arXiv:2310.12972  [pdf, other]

    cs.RO

    CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning

    Authors: Liyiming Ke, Yunchu Zhang, Abhay Deshpande, Siddhartha Srinivasa, Abhishek Gupta

    Abstract: We present a new technique to enhance the robustness of imitation learning methods by generating corrective data to account for compounding errors and disturbances. While existing methods rely on interactive expert labeling, additional offline datasets, or domain-specific invariances, our approach requires minimal additional assumptions beyond access to expert data. The key insight is to leverage…

    Submitted 3 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  32. arXiv:2310.10294  [pdf, other]

    cs.CL cs.AI

    Key-phrase boosted unsupervised summary generation for FinTech organization

    Authors: Aadit Deshpande, Shreya Goyal, Prateek Nagwanshi, Avinash Tripathy

    Abstract: With the recent advances in social media, the use of NLP techniques in social media data analysis has become an emerging research direction. Business organizations can particularly benefit from such an analysis of social media discourse, providing an external perspective on consumer behavior. Some of the NLP applications such as intent detection, sentiment classification, text summarization can he…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 8 pages, 4 figures

  33. arXiv:2310.01892  [pdf, ps, other]

    cs.LG cs.AI

    FiGURe: Simple and Efficient Unsupervised Node Representations with Filter Augmentations

    Authors: Chanakya Ekbote, Ajinkya Pankaj Deshpande, Arun Iyer, Ramakrishna Bairi, Sundararajan Sellamanickam

    Abstract: Unsupervised node representations learnt using contrastive learning-based methods have shown good performance on downstream tasks. However, these methods rely on augmentations that mimic low-pass filters, limiting their performance on tasks requiring different eigen-spectrum parts. This paper presents a simple filter-based augmentation method to capture different parts of the eigen-spectrum. We sh…

    Submitted 4 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  34. arXiv:2309.03750  [pdf, other]

    cs.CV

    PBP: Path-based Trajectory Prediction for Autonomous Driving

    Authors: Sepideh Afshar, Nachiket Deo, Akshay Bhagat, Titas Chakraborty, Yunming Shao, Balarama Raju Buddharaju, Adwait Deshpande, Henggang Cui

    Abstract: Trajectory prediction plays a crucial role in the autonomous driving stack by enabling autonomous vehicles to anticipate the motion of surrounding agents. Goal-based prediction models have gained traction in recent years for addressing the multimodal nature of future trajectories. Goal-based prediction models simplify multimodal prediction by first predicting 2D goal locations of agents and then p…

    Submitted 2 March, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Published at ICRA 2024; Sepideh Afshar and Nachiket Deo contributed equally

  35. arXiv:2309.02710  [pdf, ps, other]

    cs.LG cs.CG cs.DS

    Improved Outlier Robust Seeding for k-means

    Authors: Amit Deshpande, Rameshwar Pratap

    Abstract: The $k$-means is a popular clustering objective, although it is inherently non-robust and sensitive to outliers. Its popular seeding or initialization called $k$-means++ uses $D^{2}$ sampling and comes with a provable $O(\log k)$ approximation guarantee \cite{AV2007}. However, in the presence of adversarial noise or outliers, $D^{2}$ sampling is more likely to pick centers from distant outliers in…

    Submitted 6 September, 2023; originally announced September 2023.

  36. arXiv:2309.00133  [pdf, other]

    cs.CV

    Distraction-free Embeddings for Robust VQA

    Authors: Atharvan Dogra, Deeksha Varshney, Ashwin Kalyan, Ameet Deshpande, Neeraj Kumar

    Abstract: The generation of effective latent representations and their subsequent refinement to incorporate precise information is an essential prerequisite for Vision-Language Understanding (VLU) tasks such as Video Question Answering (VQA). However, most existing methods for VLU focus on sparsely sampling or fine-graining the input information (e.g., sampling a sparse set of frames or text tokens), or add…

    Submitted 31 August, 2023; originally announced September 2023.

  37. arXiv:2308.13242  [pdf, other]

    cs.LG cs.CY cs.IR

    Optimizing Group-Fair Plackett-Luce Ranking Models for Relevance and Ex-Post Fairness

    Authors: Sruthi Gorantla, Eshaan Bhansali, Amit Deshpande, Anand Louis

    Abstract: In learning-to-rank (LTR), optimizing only the relevance (or the expected ranking utility) can cause representational harm to certain categories of items. Moreover, if there is implicit bias in the relevance scores, LTR models may fail to optimize for true relevance. Previous works have proposed efficient algorithms to train stochastic ranking models that achieve fairness of exposure to the groups…

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 20 pages

  38. Low-Latency Massive Access with Multicast Wake Up Radio

    Authors: Anay Ajit Deshpande, Federico Chiariotti, Andrea Zanella

    Abstract: The use of Wake-Up Radio (WUR) in Internet of Things (IoT) networks can significantly improve their energy efficiency: battery-powered sensors can remain in a low-power (sleep) mode while listening for wake-up messages using their WUR and reactivate only when polled, saving energy. However, polling-based Time Division Multiple Access (TDMA) may significantly increase data transmission delay if pac…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 2023 21st Mediterranean Communication and Computer Networking Conference (MedComNet)

  39. arXiv:2307.08593  [pdf, other]

    physics.acc-ph cs.LG hep-ex nucl-ex nucl-th

    Artificial Intelligence for the Electron Ion Collider (AI4EIC)

    Authors: C. Allaire, R. Ammendola, E. -C. Aschenauer, M. Balandat, M. Battaglieri, J. Bernauer, M. Bondì, N. Branson, T. Britton, A. Butter, I. Chahrour, P. Chatagnon, E. Cisbani, E. W. Cline, S. Dash, C. Dean, W. Deconinck, A. Deshpande, M. Diefenthaler, R. Ent, C. Fanelli, M. Finger, M. Finger, Jr., E. Fol, S. Furletov, et al. (70 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 27 pages, 11 figures, AI4EIC workshop, tutorials and hackathon

  40. arXiv:2307.00259  [pdf, other]

    cs.CL cs.AI

    InstructEval: Systematic Evaluation of Instruction Selection Methods

    Authors: Anirudh Ajith, Chris Pan, Mengzhou Xia, Ameet Deshpande, Karthik Narasimhan

    Abstract: In-context learning (ICL) performs tasks by prompting a large language model (LLM) using an instruction and a small set of annotated examples called demonstrations. Recent work has shown that precise details of the inputs used in the ICL prompt significantly impact performance, which has incentivized instruction selection algorithms. The effect of instruction-choice however is severely underexplor…

    Submitted 16 July, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: 8 content pages + 3 pages of supplementary material, 3 figures, 10 tables

  41. arXiv:2306.11964  [pdf, other]

    cs.CY cs.DS cs.IR cs.LG stat.ML

    Sampling Individually-Fair Rankings that are Always Group Fair

    Authors: Sruthi Gorantla, Anay Mehrotra, Amit Deshpande, Anand Louis

    Abstract: Rankings on online platforms help their end-users find the relevant information -- people, news, media, and products -- quickly. Fair ranking tasks, which ask to rank a set of items to maximize utility subject to satisfying group-fairness constraints, have gained significant interest in the Algorithmic Fairness, Information Retrieval, and Machine Learning literature. Recent works, however, identif…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Full version of a paper accepted for presentation in ACM AIES 2023

  42. arXiv:2306.11072  [pdf, other]

    cs.LG

    Causal Effect Regularization: Automated Detection and Removal of Spurious Attributes

    Authors: Abhinav Kumar, Amit Deshpande, Amit Sharma

    Abstract: In many classification datasets, the task labels are spuriously correlated with some input attributes. Classifiers trained on such datasets often rely on these attributes for prediction, especially when the spurious correlation is high, and thus fail to generalize whenever there is a shift in the attributes' correlation at deployment. If we assume that the spurious attributes are known a priori, s…

    Submitted 7 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  43. arXiv:2305.15093  [pdf, other]

    cs.CL cs.AI cs.LG

    C-STS: Conditional Semantic Textual Similarity

    Authors: Ameet Deshpande, Carlos E. Jimenez, Howard Chen, Vishvak Murahari, Victoria Graf, Tanmay Rajpurohit, Ashwin Kalyan, Danqi Chen, Karthik Narasimhan

    Abstract: Semantic textual similarity (STS), a cornerstone task in NLP, measures the degree of similarity between a pair of sentences, and has broad application in fields such as information retrieval and natural language understanding. However, sentence similarity can be inherently ambiguous, depending on the specific aspect of interest. We resolve this ambiguity by proposing a novel task called Conditiona…

    Submitted 6 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Published in EMNLP 2023

  44. arXiv:2305.14784  [pdf, other]

    cs.AI cs.CL cs.CY cs.LG

    Anthropomorphization of AI: Opportunities and Risks

    Authors: Ameet Deshpande, Tanmay Rajpurohit, Karthik Narasimhan, Ashwin Kalyan

    Abstract: Anthropomorphization is the tendency to attribute human-like traits to non-human entities. It is prevalent in many social contexts -- children anthropomorphize toys, adults do so with brands, and it is a literary device. It is also a versatile tool in science, with behavioral psychology and evolutionary biology meticulously documenting its consequences. With widespread adoption of AI systems, and…

    Submitted 24 May, 2023; originally announced May 2023.

  45. arXiv:2304.05335  [pdf, other]

    cs.CL cs.AI cs.LG

    Toxicity in ChatGPT: Analyzing Persona-assigned Language Models

    Authors: Ameet Deshpande, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan

    Abstract: Large language models (LLMs) have shown incredible capabilities and transcended the natural language processing (NLP) community, with adoption throughout many services like healthcare, therapy, education, and customer service. Since users include people with critical information needs like students or patients engaging with chatbots, the safety of these systems is of prime importance. Therefore, a…

    Submitted 11 April, 2023; originally announced April 2023.

  46. arXiv:2303.05508  [pdf, other]

    cs.RO

    Cherry-Picking with Reinforcement Learning : Robust Dynamic Grasping in Unstable Conditions

    Authors: Yunchu Zhang, Liyiming Ke, Abhay Deshpande, Abhishek Gupta, Siddhartha Srinivasa

    Abstract: Grasping small objects surrounded by unstable or non-rigid material plays a crucial role in applications such as surgery, harvesting, construction, disaster recovery, and assisted feeding. This task is especially difficult when fine manipulation is required in the presence of sensor noise and perception errors; errors inevitably trigger dynamic motion, which is challenging to model precisely. Circ…

    Submitted 28 June, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  47. arXiv:2302.13191  [pdf]

    cs.RO cs.AI cs.LG cs.NE eess.SY

    DeepCPG Policies for Robot Locomotion

    Authors: Aditya M. Deshpande, Eric Hurd, Ali A. Minai, Manish Kumar

    Abstract: Central Pattern Generators (CPGs) form the neural basis of the observed rhythmic behaviors for locomotion in legged animals. The CPG dynamics organized into networks allow the emergence of complex locomotor behaviors. In this work, we take this inspiration for developing walking behaviors in multi-legged robots. We present novel DeepCPG policies that embed CPGs as a layer in a larger neural networ…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Preprint of paper accepted for publication in IEEE Transactions on Cognitive and Developmental Systems

  48. arXiv:2302.12441  [pdf, other]

    cs.LG cs.CL

    MUX-PLMs: Data Multiplexing for High-throughput Language Models

    Authors: Vishvak Murahari, Ameet Deshpande, Carlos E. Jimenez, Izhak Shafran, Mingqiu Wang, Yuan Cao, Karthik Narasimhan

    Abstract: The widespread adoption of large language models such as ChatGPT and Bard has led to unprecedented demand for these technologies. The burgeoning cost of inference for ever-increasing model sizes coupled with hardware shortages has limited affordable access and poses a pressing need for efficiency approaches geared towards high throughput and performance. Multi-input multi-output (MIMO) algorithms…

    Submitted 22 May, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  49. arXiv:2302.05906  [pdf, other]

    cs.LG cs.AI

    On Comparing Fair Classifiers under Data Bias

    Authors: Mohit Sharma, Amit Deshpande, Rajiv Ratn Shah

    Abstract: In this paper, we consider a theoretical model for injecting data bias, namely, under-representation and label bias (Blum & Stangl, 2019). We empirically study the effect of varying data biases on the accuracy and fairness of fair classifiers. Through extensive experiments on both synthetic and real-world datasets (e.g., Adult, German Credit, Bank Marketing, COMPAS), we empirically audit pre-, in-…

    Submitted 10 December, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: Accepted as a Spotlight Presentation at Algorithmic Fairness through the Lens of Time, NeurIPS 2023 Workshop

  50. arXiv:2301.11309  [pdf, other]

    cs.CL

    SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification

    Authors: Pranjal Aggarwal, Ameet Deshpande, Karthik Narasimhan

    Abstract: Extreme classification (XC) involves predicting over large numbers of classes (thousands to millions), with real-world applications like news article classification and e-commerce product tagging. The zero-shot version of this task requires generalization to novel classes without additional supervision. In this paper, we develop SemSup-XC, a model that achieves state-of-the-art zero-shot and few-s…

    Submitted 22 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Published at ICML 2023. V2: camera ready version at ICML 2023