Skip to main content

Showing 1–50 of 158 results for author: Dheeraj

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.07614  [pdf, ps, other

    math.PR cs.LG math.ST

    Poisson Midpoint Method for Log Concave Sampling: Beyond the Strong Error Lower Bounds

    Authors: Rishikesh Srinivasan, Dheeraj Nagaraj

    Abstract: We study the problem of sampling from strongly log-concave distributions over $\mathbb{R}^d$ using the Poisson midpoint discretization (a variant of the randomized midpoint method) for overdamped/underdamped Langevin dynamics. We prove its convergence in the 2-Wasserstein distance ($W_2$), achieving a cubic speedup in dependence on the target accuracy ($ε$) over the Euler-Maruyama discretization,… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2506.03335  [pdf, ps, other

    cs.CV

    SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports

    Authors: Dheeraj Khanna, Jerrin Bright, Yuhao Chen, John S. Zelek

    Abstract: Multi-object tracking (MOT) in team sports is particularly challenging due to the fast-paced motion and frequent occlusions resulting in motion blur and identity switches, respectively. Predicting player positions in such scenarios is particularly difficult due to the observed highly non-linear motion patterns. Current methods are heavily reliant on object detection and appearance-based tracking,… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: Paper accepted at CVSports IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW'25). The paper has 8 pages, including 6 Figures and 5 Tables

  3. arXiv:2506.02931  [pdf, ps, other

    cs.MA cs.AI cs.LG

    ThinkTank: A Framework for Generalizing Domain-Specific AI Agent Systems into Universal Collaborative Intelligence Platforms

    Authors: Praneet Sai Madhu Surabhi, Dheeraj Reddy Mudireddy, Jian Tao

    Abstract: This paper presents ThinkTank, a comprehensive and scalable framework designed to transform specialized AI agent systems into versatile collaborative intelligence platforms capable of supporting complex problem-solving across diverse domains. ThinkTank systematically generalizes agent roles, meeting structures, and knowledge integration mechanisms by adapting proven scientific collaboration method… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  4. arXiv:2505.21596  [pdf

    q-bio.QM cs.AI cs.LG

    Learning optimal treatment strategies for intraoperative hypotension using deep reinforcement learning

    Authors: Esra Adiyeke, Tianqi Liu, Venkata Sai Dheeraj Naganaboina, Han Li, Tyler J. Loftus, Yuanfang Ren, Benjamin Shickel, Matthew M. Ruppert, Karandeep Singh, Ruogu Fang, Parisa Rashidi, Azra Bihorac, Tezcan Ozrazgat-Baslanti

    Abstract: Traditional methods of surgical decision making heavily rely on human experience and prompt actions, which are variable. A data-driven system generating treatment recommendations based on patient states can be a substantial asset in perioperative decision-making, as in cases of intraoperative hypotension, for which suboptimal management is associated with acute kidney injury (AKI), a common and mo… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 41 pages, 1 table, 5 figures, 5 supplemental tables, 6 supplemental figures

  5. arXiv:2505.15803  [pdf, other

    cs.LG

    Adaptive Estimation and Learning under Temporal Distribution Shift

    Authors: Dheeraj Baby, Yifei Tang, Hieu Duy Nguyen, Yu-Xiang Wang, Rohit Pyati

    Abstract: In this paper, we study the problem of estimation and learning under temporal distribution shift. Consider an observation sequence of length $n$, which is a noisy realization of a time-varying groundtruth sequence. Our focus is to develop methods to estimate the groundtruth at the final time-step while providing sharp point-wise estimation error rates. We show that, without prior knowledge on the… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Accepted at ICML 2025

  6. arXiv:2504.12276  [pdf, other

    cs.CV

    The Tenth NTIRE 2025 Image Denoising Challenge Report

    Authors: Lei Sun, Hang Guo, Bin Ren, Luc Van Gool, Radu Timofte, Yawei Li, Xiangyu Kong, Hyunhee Park, Xiaoxuan Yu, Suejin Han, Hakjae Jeon, Jia Li, Hyung-Ju Chun, Donghun Ryou, Inju Ha, Bohyung Han, Jingyu Ma, Zhijuan Huang, Huiyuan Fu, Hongyuan Yu, Boqi Zhang, Jiawei Shi, Heng Zhang, Huadong Ma, Deepak Kumar Tyagi , et al. (69 additional authors not shown)

    Abstract: This paper presents an overview of the NTIRE 2025 Image Denoising Challenge (σ = 50), highlighting the proposed methodologies and corresponding results. The primary objective is to develop a network architecture capable of achieving high-quality denoising performance, quantitatively evaluated using PSNR, without constraints on computational complexity or model size. The task assumes independent ad… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

  7. arXiv:2504.07261  [pdf, other

    cs.LG

    Adapting to Online Distribution Shifts in Deep Learning: A Black-Box Approach

    Authors: Dheeraj Baby, Boran Han, Shuai Zhang, Cuixiong Hu, Yuyang Wang, Yu-Xiang Wang

    Abstract: We study the well-motivated problem of online distribution shift in which the data arrive in batches and the distribution of each batch can change arbitrarily over time. Since the shifts can be large or small, abrupt or gradual, the length of the relevant historical data to learn from may vary over time, which poses a major challenge in designing algorithms that can automatically adapt to the best… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: To appear at AISTATS 2025

  8. arXiv:2504.04635  [pdf, other

    cs.CL

    Steering off Course: Reliability Challenges in Steering Language Models

    Authors: Patrick Queiroz Da Silva, Hari Sethuraman, Dheeraj Rajagopal, Hannaneh Hajishirzi, Sachin Kumar

    Abstract: Steering methods for language models (LMs) have gained traction as lightweight alternatives to fine-tuning, enabling targeted modifications to model activations. However, prior studies primarily report results on a few models, leaving critical gaps in understanding the robustness of these methods. In this work, we systematically examine three prominent steering methods -- DoLa, function vectors, a… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

  9. arXiv:2504.03089  [pdf, other

    cs.CV

    SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections

    Authors: Prashant Kumar, Dheeraj Vattikonda, Kshitij Madhav Bhat, Kunal Dargan, Prem Kalra

    Abstract: The widespread adoption of learning-based methods for the LiDAR makes autonomous vehicles vulnerable to adversarial attacks through adversarial \textit{point injections (PiJ)}. It poses serious security challenges for navigation and map generation. Despite its critical nature, no major work exists that studies learning-based attacks on LiDAR-based SLAM. Our work proposes SLACK, an end-to-end deep… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

  10. arXiv:2503.13115  [pdf, ps, other

    cs.LG cs.AI math.PR stat.ML

    Beyond Propagation of Chaos: A Stochastic Algorithm for Mean Field Optimization

    Authors: Chandan Tankala, Dheeraj M. Nagaraj, Anant Raj

    Abstract: Gradient flow in the 2-Wasserstein space is widely used to optimize functionals over probability distributions and is typically implemented using an interacting particle system with $n$ particles. Analyzing these algorithms requires showing (a) that the finite-particle system converges and/or (b) that the resultant empirical distribution of the particles closely approximates the optimal distributi… ▽ More

    Submitted 17 June, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

  11. arXiv:2503.04184  [pdf

    cs.NI cs.AI cs.CL

    Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

    Authors: Adnan Shahid, Adrian Kliks, Ahmed Al-Tahmeesschi, Ahmed Elbakary, Alexandros Nikou, Ali Maatouk, Ali Mokh, Amirreza Kazemi, Antonio De Domenico, Athanasios Karapantelakis, Bo Cheng, Bo Yang, Bohao Wang, Carlo Fischione, Chao Zhang, Chaouki Ben Issaid, Chau Yuen, Chenghui Peng, Chongwen Huang, Christina Chaccour, Christo Kurisummoottil Thomas, Dheeraj Sharma, Dimitris Kalogiros, Dusit Niyato, Eli De Poorter , et al. (110 additional authors not shown)

    Abstract: This white paper discusses the role of large-scale AI in the telecommunications industry, with a specific focus on the potential of generative AI to revolutionize network functions and user experiences, especially in the context of 6G systems. It highlights the development and deployment of Large Telecom Models (LTMs), which are tailored AI models designed to address the complex challenges faced b… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  12. arXiv:2502.13450  [pdf, other

    cs.LG cs.AI

    Interleaved Gibbs Diffusion for Constrained Generation

    Authors: Gautham Govind Anil, Sachin Yadav, Dheeraj Nagaraj, Karthikeyan Shanmugam, Prateek Jain

    Abstract: We introduce Interleaved Gibbs Diffusion (IGD), a novel generative modeling framework for mixed continuous-discrete data, focusing on constrained generation problems. Prior works on discrete and continuous-discrete diffusion models assume factorized denoising distribution for fast generation, which can hinder the modeling of strong dependencies between random variables encountered in constrained g… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  13. arXiv:2502.10354  [pdf, other

    cs.LG math.ST stat.ML

    Dimension-free Score Matching and Time Bootstrapping for Diffusion Models

    Authors: Syamantak Kumar, Dheeraj Nagaraj, Purnamrita Sarkar

    Abstract: Diffusion models generate samples by estimating the score function of the target distribution at various noise levels. The model is trained using samples drawn from the target distribution, progressively adding noise. In this work, we establish the first (nearly) dimension-free sample complexity bounds for learning these score functions, achieving a double exponential improvement in dimension over… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  14. arXiv:2411.17580  [pdf, other

    cs.CV

    Revisiting Point Cloud Completion: Are We Ready For The Real-World?

    Authors: Stuti Pathak, Prashant Kumar, Dheeraj Baiju, Nicholus Mboga, Gunther Steenackers, Rudi Penne

    Abstract: Point clouds acquired in constrained, challenging, uncontrolled, and multi-sensor real-world settings are noisy, incomplete, and non-uniformly sparse. This presents acute challenges for the vital task of point cloud completion. Using tools from Algebraic Topology and Persistent Homology (PH), we demonstrate that current benchmark object point clouds lack rich topological features that are integral… ▽ More

    Submitted 11 March, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

  15. arXiv:2410.20135  [pdf, ps, other

    stat.ML cs.LG

    Near-Optimal Streaming Heavy-Tailed Statistical Estimation with Clipped SGD

    Authors: Aniket Das, Dheeraj Nagaraj, Soumyabrata Pal, Arun Suggala, Prateek Varshney

    Abstract: We consider the problem of high-dimensional heavy-tailed statistical estimation in the streaming setting, which is much harder than the traditional batch setting due to memory constraints. We cast this problem as stochastic convex optimization with heavy tailed stochastic gradients, and prove that the widely used Clipped-SGD algorithm attains near-optimal sub-Gaussian statistical rates whenever th… ▽ More

    Submitted 26 October, 2024; originally announced October 2024.

    Comments: Accepted at NeurIPS 2024

  16. arXiv:2410.17413  [pdf, other

    cs.CL

    Scalable Influence and Fact Tracing for Large Language Model Pretraining

    Authors: Tyler A. Chang, Dheeraj Rajagopal, Tolga Bolukbasi, Lucas Dixon, Ian Tenney

    Abstract: Training data attribution (TDA) methods aim to attribute model outputs back to specific training examples, and the application of these methods to large language model (LLM) outputs could significantly advance model transparency and data curation. However, it has been challenging to date to apply these methods to the full scale of LLM pretraining. In this paper, we refine existing gradient-based m… ▽ More

    Submitted 20 December, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

  17. arXiv:2410.06307  [pdf, ps, other

    math.OC cs.LG math.PR stat.ML

    Model Predictive Control is Almost Optimal for Restless Bandit

    Authors: Nicolas Gast, Dheeraj Narasimha

    Abstract: We consider the discrete time infinite horizon average reward restless markovian bandit (RMAB) problem. We propose a \emph{model predictive control} based non-stationary policy with a rolling computational horizon $τ$. At each time-slot, this policy solves a $τ$ horizon linear program whose first control value is kept as a control for the RMAB. Our solution requires minimal assumptions and quantif… ▽ More

    Submitted 5 June, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

    Comments: Reviewed and accepted to COLT 2025

  18. arXiv:2408.05843  [pdf, other

    cs.LG cs.IR stat.ML

    Online Matrix Completion: A Collaborative Approach with Hott Items

    Authors: Dheeraj Baby, Soumyabrata Pal

    Abstract: We investigate the low rank matrix completion problem in an online setting with ${M}$ users, ${N}$ items, ${T}$ rounds, and an unknown rank-$r$ reward matrix ${R}\in \mathbb{R}^{{M}\times {N}}$. This problem has been well-studied in the literature and has several applications in practice. In each round, we recommend ${S}$ carefully chosen distinct items to every user and observe noisy rewards. In… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: Appeared at the Forty-first International Conference on Machine Learning, 2024

  19. arXiv:2408.05686  [pdf, other

    cs.LG cs.MA

    The Bandit Whisperer: Communication Learning for Restless Bandits

    Authors: Yunfan Zhao, Tonghan Wang, Dheeraj Nagaraj, Aparna Taneja, Milind Tambe

    Abstract: Applying Reinforcement Learning (RL) to Restless Multi-Arm Bandits (RMABs) offers a promising avenue for addressing allocation problems with resource constraints and temporal dynamics. However, classic RMAB models largely overlook the challenges of (systematic) data errors - a common occurrence in real-world scenarios due to factors like varying data collection protocols and intentional noise for… ▽ More

    Submitted 19 March, 2025; v1 submitted 10 August, 2024; originally announced August 2024.

  20. arXiv:2407.06325  [pdf, other

    cs.LG cs.DC math.OC

    CONGO: Compressive Online Gradient Optimization

    Authors: Jeremy Carleton, Prathik Vijaykumar, Divyanshu Saxena, Dheeraj Narasimha, Srinivas Shakkottai, Aditya Akella

    Abstract: We address the challenge of zeroth-order online convex optimization where the objective function's gradient exhibits sparsity, indicating that only a small number of dimensions possess non-zero gradients. Our aim is to leverage this sparsity to obtain useful estimates of the objective function's gradient even when the only information available is a limited number of function samples. Our motivati… ▽ More

    Submitted 16 May, 2025; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted at ICLR 2025; 34 pages, 12 figures

  21. arXiv:2407.05778  [pdf, other

    cs.CL cs.AI

    When is the consistent prediction likely to be a correct prediction?

    Authors: Alex Nguyen, Dheeraj Mekala, Chengyu Dong, Jingbo Shang

    Abstract: Self-consistency (Wang et al., 2023) suggests that the most consistent answer obtained through large language models (LLMs) is more likely to be correct. In this paper, we challenge this argument and propose a nuanced correction. Our observations indicate that consistent answers derived through more computation i.e. longer reasoning texts, rather than simply the most consistent answer across all o… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  22. arXiv:2407.03471  [pdf, other

    cs.CV

    Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

    Authors: Benno Krojer, Dheeraj Vattikonda, Luis Lara, Varun Jampani, Eva Portelance, Christopher Pal, Siva Reddy

    Abstract: An image editing model should be able to perform diverse edits, ranging from object replacement, changing attributes or style, to performing actions or movement, which require many forms of reasoning. Current general instruction-guided editing models have significant shortcomings with action and reasoning-centric edits. Object, attribute or stylistic changes can be learned from visually static dat… ▽ More

    Submitted 17 October, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: NeurIPS 2024 (Dataset & Benchmarks)

  23. arXiv:2407.00121  [pdf, other

    cs.LG cs.AI cs.CL

    Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

    Authors: Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal, Sadhana Kumaravel, Matthew Stallone, Rameswar Panda, Yara Rizk, GP Bhargav, Maxwell Crouse, Chulaka Gunasekara, Shajith Ikbal, Sachin Joshi, Hima Karanam, Vineet Kumar, Asim Munawar, Sumit Neelam, Dinesh Raghu, Udit Sharma, Adriana Meza Soria, Dheeraj Sreedhar, Praveen Venkateswaran, Merve Unuvar, David Cox, Salim Roukos, Luis Lastras , et al. (1 additional authors not shown)

    Abstract: Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (AP… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  24. arXiv:2406.17591  [pdf, other

    cs.CV

    DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation

    Authors: Ahmad Mohammadshirazi, Ali Nosrati Firoozsalari, Mengxi Zhou, Dheeraj Kulshrestha, Rajiv Ramnath

    Abstract: Automating the annotation of scanned documents is challenging, requiring a balance between computational efficiency and accuracy. DocParseNet addresses this by combining deep learning and multi-modal learning to process both text and visual data. This model goes beyond traditional OCR and semantic segmentation, capturing the interplay between text and images to preserve contextual nuances in compl… ▽ More

    Submitted 21 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  25. POPCat: Propagation of particles for complex annotation tasks

    Authors: Adam Srebrnjak Yang, Dheeraj Khanna, John S. Zelek

    Abstract: Novel dataset creation for all multi-object tracking, crowd-counting, and industrial-based videos is arduous and time-consuming when faced with a unique class that densely populates a video sequence. We propose a time efficient method called POPCat that exploits the multi-target and temporal features of video data to produce a semi-supervised pipeline for segmentation or box-based video annotation… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures, Accepted in "Conference on Robots and Vision 2024"

  26. arXiv:2406.08848  [pdf, other

    cs.CL cs.AI

    An Approach to Build Zero-Shot Slot-Filling System for Industry-Grade Conversational Assistants

    Authors: G P Shrivatsa Bhargav, Sumit Neelam, Udit Sharma, Shajith Ikbal, Dheeraj Sreedhar, Hima Karanam, Sachindra Joshi, Pankaj Dhoolia, Dinesh Garg, Kyle Croutwater, Haode Qi, Eric Wayne, J William Murdock

    Abstract: We present an approach to build Large Language Model (LLM) based slot-filling system to perform Dialogue State Tracking in conversational assistants serving across a wide variety of industry-grade applications. Key requirements of this system include: 1) usage of smaller-sized models to meet low latency requirements and to enable convenient and cost-effective cloud and customer premise deployments… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  27. arXiv:2405.17068  [pdf, other

    cs.LG math.NA stat.ML

    The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models

    Authors: Saravanan Kandasamy, Dheeraj Nagaraj

    Abstract: Langevin Dynamics is a Stochastic Differential Equation (SDE) central to sampling and generative modeling and is implemented via time discretization. Langevin Monte Carlo (LMC), based on the Euler-Maruyama discretization, is the simplest and most studied algorithm. LMC can suffer from slow convergence - requiring a large number of steps of small step-size to obtain good quality samples. This becom… ▽ More

    Submitted 29 October, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: "One often meets his destiny on the road he takes to avoid it" - Master Oogway. My destiny seems to be to write triangle inequalities for the rest of my life

  28. arXiv:2405.17035  [pdf, other

    cs.LG

    Glauber Generative Model: Discrete Diffusion Models via Binary Classification

    Authors: Harshit Varma, Dheeraj Nagaraj, Karthikeyan Shanmugam

    Abstract: We introduce the Glauber Generative Model (GGM), a new class of discrete diffusion models, to obtain new samples from a distribution given samples from a discrete space. GGM deploys a discrete Markov chain called the heat bath dynamics (or the Glauber dynamics) to denoise a sequence of noisy tokens to a sample from a joint distribution of discrete tokens. Our novel conceptual framework provides an… ▽ More

    Submitted 16 March, 2025; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: ICLR 2025

  29. arXiv:2405.07698  [pdf, other

    cs.CV

    oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving

    Authors: Abdul Hannan Khan, Syed Tahseen Raza Rizvi, Dheeraj Varma Chittari Macharavtu, Andreas Dengel

    Abstract: Autonomous driving systems require a quick and robust perception of the nearby environment to carry out their routines effectively. With the aim to avoid collisions and drive safely, autonomous driving systems rely heavily on object detection. However, 2D object detections alone are insufficient; more information, such as relative velocity and distance, is required for safer planning. Monocular 3D… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  30. arXiv:2404.09127  [pdf, other

    cs.CL

    Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation

    Authors: Ruixin Yang, Dheeraj Rajagopal, Shirley Anugrah Hayati, Bin Hu, Dongyeop Kang

    Abstract: Uncertainty estimation is a significant issue for current large language models (LLMs) that are generally poorly calibrated and over-confident, especially with reinforcement learning from human feedback (RLHF). Unlike humans, whose decisions and confidences not only stem from intrinsic beliefs but can also be adjusted through daily observations, existing calibration methods for LLMs focus on estim… ▽ More

    Submitted 10 May, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

    Comments: Accepted at ICLR 2024 Workshop on Reliable and Responsible Foundation Models

  31. arXiv:2404.00439  [pdf, other

    cs.CL

    DOCMASTER: A Unified Platform for Annotation, Training, & Inference in Document Question-Answering

    Authors: Alex Nguyen, Zilong Wang, Jingbo Shang, Dheeraj Mekala

    Abstract: The application of natural language processing models to PDF documents is pivotal for various business applications yet the challenge of training models for this purpose persists in businesses due to specific hurdles. These include the complexity of working with PDF formats that necessitate parsing text and layout information for curating training data and the lack of privacy-preserving annotation… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  32. arXiv:2402.14807  [pdf, other

    cs.MA cs.AI cs.LG

    A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health

    Authors: Nikhil Behari, Edwin Zhang, Yunfan Zhao, Aparna Taneja, Dheeraj Nagaraj, Milind Tambe

    Abstract: Restless multi-armed bandits (RMAB) have demonstrated success in optimizing resource allocation for large beneficiary populations in public health settings. Unfortunately, RMAB models lack flexibility to adapt to evolving public health policy priorities. Concurrently, Large Language Models (LLMs) have emerged as adept automated planners across domains of robotic control and navigation. In this pap… ▽ More

    Submitted 25 October, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Journal ref: Advances in Neural Information Processing Systems 37 (NeurIPS 2024)

  33. arXiv:2402.14158  [pdf, other

    cs.CL

    TOOLVERIFIER: Generalization to New Tools via Self-Verification

    Authors: Dheeraj Mekala, Jason Weston, Jack Lanchantin, Roberta Raileanu, Maria Lomeli, Jingbo Shang, Jane Dwivedi-Yu

    Abstract: Teaching language models to use tools is an important milestone towards building general assistants, but remains an open problem. While there has been significant progress on learning to use specific tools via fine-tuning, language models still struggle with learning how to robustly use new tools from only a few demonstrations. In this work we introduce a self-verification method which distinguish… ▽ More

    Submitted 13 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  34. arXiv:2402.11728  [pdf, other

    cs.CL cs.LG q-fin.CP

    Numerical Claim Detection in Finance: A New Financial Dataset, Weak-Supervision Model, and Market Analysis

    Authors: Agam Shah, Arnav Hiray, Pratvi Shah, Arkaprabha Banerjee, Anushka Singh, Dheeraj Eidnani, Sahasra Chava, Bhaskar Chaudhury, Sudheer Chava

    Abstract: In this paper, we investigate the influence of claims in analyst reports and earnings calls on financial market returns, considering them as significant quarterly events for publicly traded companies. To facilitate a comprehensive analysis, we construct a new financial dataset for the claim detection task in the financial domain. We benchmark various language models on this dataset and propose a n… ▽ More

    Submitted 4 October, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted at The Seventh FEVER Workshop EMNLP 2024

  35. MORL-Prompt: An Empirical Analysis of Multi-Objective Reinforcement Learning for Discrete Prompt Optimization

    Authors: Yasaman Jafari, Dheeraj Mekala, Rose Yu, Taylor Berg-Kirkpatrick

    Abstract: RL-based techniques can be employed to search for prompts that, when fed into a target language model, maximize a set of user-specified reward functions. However, in many target applications, the natural reward functions are in tension with one another -- for example, content preservation vs. style matching in style transfer tasks. Current techniques focus on maximizing the average of reward funct… ▽ More

    Submitted 16 October, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  36. arXiv:2402.10430  [pdf, other

    cs.CL

    Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models

    Authors: Dheeraj Mekala, Alex Nguyen, Jingbo Shang

    Abstract: Instruction-tuning language models has become a crucial step in aligning them for general use. Typically, this process involves extensive training on large datasets, incurring high training costs. In this paper, we introduce a novel training data selection based on the learning percentage of the samples. We assert that current language models possess the capability to autonomously select high-qual… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  37. arXiv:2402.04400  [pdf, other

    cs.LG cs.AI cs.CY

    CEHR-GPT: Generating Electronic Health Records with Chronological Patient Timelines

    Authors: Chao Pang, Xinzhuo Jiang, Nishanth Parameshwar Pavinkurve, Krishna S. Kalluri, Elise L. Minto, Jason Patterson, Linying Zhang, George Hripcsak, Gamze Gürsoy, Noémie Elhadad, Karthik Natarajan

    Abstract: Synthetic Electronic Health Records (EHR) have emerged as a pivotal tool in advancing healthcare applications and machine learning models, particularly for researchers without direct access to healthcare data. Although existing methods, like rule-based approaches and generative adversarial networks (GANs), generate synthetic data that resembles real-world EHR data, these methods often use a tabula… ▽ More

    Submitted 5 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  38. arXiv:2402.03545  [pdf, other

    cs.LG

    Online Feature Updates Improve Online (Generalized) Label Shift Adaptation

    Authors: Ruihan Wu, Siddhartha Datta, Yi Su, Dheeraj Baby, Yu-Xiang Wang, Kilian Q. Weinberger

    Abstract: This paper addresses the prevalent issue of label shift in an online setting with missing labels, where data distributions change over time and obtaining timely labels is challenging. While existing methods primarily focus on adjusting or updating the final layer of a pre-trained classifier, we explore the untapped potential of enhancing feature representations using unlabeled data at test-time. O… ▽ More

    Submitted 31 October, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  39. arXiv:2401.03340  [pdf, other

    cs.CV

    Classifying cow stall numbers using YOLO

    Authors: Dheeraj Vajjarapu

    Abstract: This paper introduces the CowStallNumbers dataset, a collection of images extracted from videos focusing on cow teats, designed to advance the field of cow stall number detection. The dataset comprises 1042 training images and 261 test images, featuring stall numbers ranging from 0 to 60. To enhance the dataset, we performed fine-tuning on a YOLO model and applied data augmentation techniques, inc… ▽ More

    Submitted 23 November, 2023; originally announced January 2024.

  40. arXiv:2311.09799  [pdf, other

    cs.CL

    How Far Can We Extract Diverse Perspectives from Large Language Models?

    Authors: Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang

    Abstract: Collecting diverse human opinions is costly and challenging. This leads to a recent trend in exploiting large language models (LLMs) for generating diverse data for potential scalable and efficient solutions. However, the extent to which LLMs can generate diverse perspectives on subjective topics is still unclear. In this study, we explore LLMs' capacity of generating diverse perspectives and rati… ▽ More

    Submitted 13 October, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2024 Main Conference

  41. ezBIDS: Guided standardization of neuroimaging data interoperable with major data archives and platforms

    Authors: Daniel Levitas, Soichi Hayashi, Sophia Vinci-Booher, Anibal Heinsfeld, Dheeraj Bhatia, Nicholas Lee, Anthony Galassi, Guiomar Niso, Franco Pestilli

    Abstract: Data standardization has become one of the leading methods neuroimaging researchers rely on for data sharing and reproducibility. Data standardization promotes a common framework through which researchers can utilize others' data. Yet, as of today, formatting datasets that adhere to community best practices requires technical expertise involving coding and considerable knowledge of file formats an… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  42. arXiv:2311.03319  [pdf, other

    cs.CL cs.AI

    DAIL: Data Augmentation for In-Context Learning via Self-Paraphrase

    Authors: Dawei Li, Yaxuan Li, Dheeraj Mekala, Shuyao Li, Yulin wang, Xueqi Wang, William Hogan, Jingbo Shang

    Abstract: In-Context Learning (ICL) combined with pre-trained large language models has achieved promising results on various NLP tasks. However, ICL requires high-quality annotated demonstrations which might not be available in real-world scenarios. To overcome this limitation, we propose \textbf{D}ata \textbf{A}ugmentation for \textbf{I}n-Context \textbf{L}earning (\textbf{DAIL}). DAIL leverages the intui… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Course project for DSC 253 (Advanced Data-Driven Text Mining) at UCSD

  43. arXiv:2310.16132  [pdf, other

    cs.SE

    Diversity in Software Engineering Conferences and Journals

    Authors: Aditya Shankar Narayanan, Dheeraj Vagavolu, Nancy A Day, Meiyappan Nagappan

    Abstract: Diversity with respect to ethnicity and gender has been studied in open-source and industrial settings for software development. Publication avenues such as academic conferences and journals contribute to the growing technology industry. However, there have been very few diversity-related studies conducted in the context of academia. In this paper, we study the ethnic, gender, and geographical div… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 13 pages, 10 figures, 4 tables

  44. arXiv:2310.14526  [pdf, other

    cs.LG cs.AI

    Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization

    Authors: Yunfan Zhao, Nikhil Behari, Edward Hughes, Edwin Zhang, Dheeraj Nagaraj, Karl Tuyls, Aparna Taneja, Milind Tambe

    Abstract: Restless multi-arm bandits (RMABs), a class of resource allocation problems with broad application in areas such as healthcare, online advertising, and anti-poaching, have recently been studied from a multi-agent reinforcement learning perspective. Prior RMAB research suffers from several limitations, e.g., it fails to adequately address continuous states, and requires retraining from scratch when… ▽ More

    Submitted 29 January, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

  45. arXiv:2310.12963  [pdf, other

    cs.CL cs.AI

    AutoMix: Automatically Mixing Language Models

    Authors: Pranjal Aggarwal, Aman Madaan, Ankit Anand, Srividya Pranavi Potharaju, Swaroop Mishra, Pei Zhou, Aditya Gupta, Dheeraj Rajagopal, Karthik Kappaganthu, Yiming Yang, Shyam Upadhyay, Manaal Faruqui, Mausam

    Abstract: Large language models (LLMs) are now available from cloud API providers in various sizes and configurations. While this diversity offers a broad spectrum of choices, effectively leveraging the options to optimize computational cost and performance remains challenging. In this work, we present Automix, an approach that strategically routes queries to larger LMs, based on the approximate correctness… ▽ More

    Submitted 19 January, 2025; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 38th Conference on Neural Information Processing Systems (NeurIPS 2024). The first two authors contributed equally. Work started and partly done during Aman's internship at Google. This version adds results on additional models and datasets

  46. PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna Gutiérrez, Vineet Gundecha, Dejan Markovikj, Lekhapriya Dheeraj Kashyap, Lorenz Krause, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar

    Abstract: The increasing global emphasis on sustainability and reducing carbon emissions is pushing governments and corporations to rethink their approach to data center design and operation. Given their high energy consumption and exponentially large computational workloads, data centers are prime candidates for optimizing power consumption, especially in areas such as cooling and IT energy usage. A signif… ▽ More

    Submitted 26 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: The 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys '23), November 15-16, 2023, Istanbul, Turkey

    Journal ref: 2023 BuildSys '23: Proceedings of the 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation

  47. arXiv:2310.01515  [pdf, other

    quant-ph cs.LG

    Tensor Ring Optimized Quantum-Enhanced Tensor Neural Networks

    Authors: Debanjan Konar, Dheeraj Peddireddy, Vaneet Aggarwal, Bijaya K. Panigrahi

    Abstract: Quantum machine learning researchers often rely on incorporating Tensor Networks (TN) into Deep Neural Networks (DNN) and variational optimization. However, the standard optimization techniques used for training the contracted trainable weights of each model layer suffer from the correlations and entanglement structure between the model parameters on classical implementations. To address this issu… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  48. arXiv:2309.09206  [pdf, other

    cs.RO cs.CV cs.LG

    Differentiable SLAM Helps Deep Learning-based LiDAR Perception Tasks

    Authors: Prashant Kumar, Dheeraj Vattikonda, Vedang Bhupesh Shenvi Nadkarni, Erqun Dong, Sabyasachi Sahoo

    Abstract: We investigate a new paradigm that uses differentiable SLAM architectures in a self-supervised manner to train end-to-end deep learning models in various LiDAR based applications. To the best of our knowledge there does not exist any work that leverages SLAM as a training signal for deep learning based models. We explore new ways to improve the efficiency, robustness, and adaptability of LiDAR sys… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 15 pages,6 Tables, 3 figures. Accepted at BMVC 2023

  49. arXiv:2308.16041  [pdf, other

    cs.CV

    From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications

    Authors: Shreyank N Gowda, Dheeraj Pandey, Shashank Narayana Gowda

    Abstract: Recent advancements in deep learning and computer vision have led to a surge of interest in generating realistic talking heads. This paper presents a comprehensive survey of state-of-the-art methods for talking head generation. We systematically categorises them into four main approaches: image-driven, audio-driven, video-driven and others (including neural radiance fields (NeRF), and 3D-based met… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  50. arXiv:2307.03884  [pdf, other

    quant-ph cs.LG

    Noisy Tensor Ring approximation for computing gradients of Variational Quantum Eigensolver for Combinatorial Optimization

    Authors: Dheeraj Peddireddy, Utkarsh Priyam, Vaneet Aggarwal

    Abstract: Variational Quantum algorithms, especially Quantum Approximate Optimization and Variational Quantum Eigensolver (VQE) have established their potential to provide computational advantage in the realm of combinatorial optimization. However, these algorithms suffer from classically intractable gradients limiting the scalability. This work addresses the scalability challenge for VQE by proposing a cla… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 12 pages, 13 figures, preprint