Skip to main content

Showing 1–21 of 21 results for author: Khandelwal, D

.
  1. arXiv:2506.04019  [pdf, ps, other

    cs.SE cs.CL cs.LG cs.PL

    CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking

    Authors: Neeva Oza, Ishaan Govil, Parul Gupta, Dinesh Khandelwal, Dinesh Garg, Parag Singla

    Abstract: LLMs have been extensively used for the task of automated code generation. In this work, we examine the applicability of LLMs for the related but relatively unexplored task of code-equivalence checking, i.e., given two programs, whether they are functionally equivalent or not. This is an important problem since benchmarking code equivalence can play a critical role in evaluating LLM capabilities f… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    MSC Class: 68-02 (Primary) 68T50; 68T07; 68N19; 68N30 (Secondary) ACM Class: I.2.7; I.2.6; I.2.5; D.3.0; D.3.3; D.3.1; F.3.2; F.3.1; F.3.3; D.2.3; D.2.5

  2. arXiv:2502.08356  [pdf, other

    cs.CL

    Systematic Knowledge Injection into Large Language Models via Diverse Augmentation for Domain-Specific RAG

    Authors: Kushagra Bhushan, Yatin Nandwani, Dinesh Khandelwal, Sonam Gupta, Gaurav Pandey, Dinesh Raghu, Sachindra Joshi

    Abstract: Retrieval-Augmented Generation (RAG) has emerged as a prominent method for incorporating domain knowledge into Large Language Models (LLMs). While RAG enhances response relevance by incorporating retrieved domain knowledge in the context, retrieval errors can still lead to hallucinations and incorrect answers. To recover from retriever failures, domain knowledge is injected by fine-tuning the mode… ▽ More

    Submitted 27 March, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

    Comments: 22 pages, 14 tables, to be published in NAACL 2025

  3. arXiv:2502.08130  [pdf, other

    cs.CL

    Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models

    Authors: Sonam Gupta, Yatin Nandwani, Asaf Yehudai, Dinesh Khandelwal, Dinesh Raghu, Sachindra Joshi

    Abstract: Fine-tuning Large Language Models (LLMs) on specific datasets is a common practice to improve performance on target tasks. However, this performance gain often leads to overfitting, where the model becomes too specialized in either the task or the characteristics of the training data, resulting in a loss of generalization. This paper introduces Selective Self-to-Supervised Fine-Tuning (S3FT), a fi… ▽ More

    Submitted 20 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

    Comments: 10 pages, Accepted to NAACL Findings 2025. arXiv admin note: text overlap with arXiv:2409.04787

  4. arXiv:2310.01991  [pdf, other

    cs.CL cs.AI cs.LG

    Fill in the Blank: Exploring and Enhancing LLM Capabilities for Backward Reasoning in Math Word Problems

    Authors: Aniruddha Deb, Neeva Oza, Sarthak Singla, Dinesh Khandelwal, Dinesh Garg, Parag Singla

    Abstract: While forward reasoning (i.e., find the answer given the question) has been explored extensively in recent literature, backward reasoning is relatively unexplored. We examine the backward reasoning capabilities of LLMs on Math Word Problems (MWPs): given a mathematical question and its answer, with some details omitted from the question, can LLMs effectively retrieve the missing information? On mo… ▽ More

    Submitted 7 July, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 10 pages, 4 figures

    ACM Class: I.2.3

  5. arXiv:2305.14410  [pdf, other

    cs.CV cs.AI cs.CL

    Image Manipulation via Multi-Hop Instructions -- A New Dataset and Weakly-Supervised Neuro-Symbolic Approach

    Authors: Harman Singh, Poorva Garg, Mohit Gupta, Kevin Shah, Ashish Goswami, Satyam Modi, Arnab Kumar Mondal, Dinesh Khandelwal, Dinesh Garg, Parag Singla

    Abstract: We are interested in image manipulation via natural language text -- a task that is useful for multiple AI applications but requires complex reasoning over multi-modal spaces. We extend recently proposed Neuro Symbolic Concept Learning (NSCL), which has been quite effective for the task of Visual Question Answering (VQA), for the task of image manipulation. Our system referred to as NeuroSIM can p… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (long paper, main conference)

  6. arXiv:2304.07851  [pdf

    econ.GN

    Study on the tea market in India

    Authors: Adit Vinod Nair, Adarsh Damani, Devansh Khandelwal, Harshita Sachdev, Sreayans Jain

    Abstract: India's tea business has a long history and plays a significant role in the economics of the nation. India is the world's second-largest producer of tea, with Assam and Darjeeling being the most well-known tea-growing regions. Since the British introduced tea cultivation to India in the 1820s, the nation has produced tea. Millions of people are employed in the tea sector today, and it contributes… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: 14 pages

  7. arXiv:2207.13843  [pdf, other

    cs.SD cs.LG eess.AS

    Deep Learning-Based Acoustic Mosquito Detection in Noisy Conditions Using Trainable Kernels and Augmentations

    Authors: Devesh Khandelwal, Sean Campos, Shwetha Nagaraj, Fred Nugen, Alberto Todeschini

    Abstract: In this paper, we demonstrate a unique recipe to enhance the effectiveness of audio machine learning approaches by fusing pre-processing techniques into a deep learning model. Our solution accelerates training and inference performance by optimizing hyper-parameters through training instead of costly random searches to build a reliable mosquito detector from audio signals. The experiments and the… ▽ More

    Submitted 18 August, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

    ACM Class: I.2.m; I.2.1

  8. arXiv:2203.11054  [pdf, other

    cs.CL cs.AI

    Targeted Extraction of Temporal Facts from Textual Resources for Improved Temporal Question Answering over Knowledge Bases

    Authors: Nithish Kannen, Udit Sharma, Sumit Neelam, Dinesh Khandelwal, Shajith Ikbal, Hima Karanam, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) systems have the goal of answering complex natural language questions by reasoning over relevant facts retrieved from Knowledge Bases (KB). One of the major challenges faced by these systems is their inability to retrieve all relevant facts due to factors such as incomplete KB and entity/relation linking errors. In this paper, we address this particular cha… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    ACM Class: I.2.7; I.2.4

  9. arXiv:2201.05793  [pdf, other

    cs.CL cs.AI

    A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

    Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) tasks that involve complex reasoning are emerging as an important research direction. However, most existing KBQA datasets focus primarily on generic multi-hop reasoning over explicit facts, largely ignoring other reasoning types such as temporal, spatial, and taxonomic reasoning. In this paper, we present a benchmark dataset for temporal reasoning, TempQA-… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

    Comments: 7 pages, 2 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2109.13430

  10. arXiv:2109.13430  [pdf, other

    cs.CL cs.AI

    SYGMA: System for Generalizable Modular Question Answering OverKnowledge Bases

    Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme LimaRyan Riegel, Francois Luus, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) tasks that in-volve complex reasoning are emerging as an important re-search direction. However, most KBQA systems struggle withgeneralizability, particularly on two dimensions: (a) acrossmultiple reasoning types where both datasets and systems haveprimarily focused on multi-hop reasoning, and (b) across mul-tiple knowledge bases, where KBQA approaches are… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  11. arXiv:2109.09475  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Knowledge Graph Question Answering via SPARQL Silhouette Generation

    Authors: Sukannya Purkayastha, Saswati Dana, Dinesh Garg, Dinesh Khandelwal, G P Shrivatsa Bhargav

    Abstract: Knowledge Graph Question Answering (KGQA) has become a prominent area in natural language processing due to the emergence of large-scale Knowledge Graphs (KGs). Recently Neural Machine Translation based approaches are gaining momentum that translates natural language queries to structured query languages thereby solving the KGQA task. However, most of these methods struggle with out-of-vocabulary… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 7 + 6 pages, 10 figures

  12. arXiv:2012.08834  [pdf, other

    eess.SY

    Toolbox for Discovering Dynamic System Relations via TAG Guided Genetic Programming

    Authors: Stefan-Cristian Nechita, Roland Toth, Dhruv Khandelwal, Maarten Schoukens

    Abstract: Data-driven modeling of nonlinear dynamical systems often require an expert user to take critical decisions a priori to the identification procedure. Recently an automated strategy for data driven modeling of \textit{single-input single-output} (SISO) nonlinear dynamical systems based on \textit{Genetic Programming} (GP) and \textit{Tree Adjoining Grammars} (TAG) has been introduced. The current p… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

  13. arXiv:2012.01707  [pdf, other

    cs.CL cs.AI

    Leveraging Abstract Meaning Representation for Knowledge Base Question Answering

    Authors: Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramon Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-Suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Reddy , et al. (5 additional authors not shown)

    Abstract: Knowledge base question answering (KBQA)is an important task in Natural Language Processing. Existing approaches face significant challenges including complex question understanding, necessity for reasoning, and lack of large end-to-end training datasets. In this work, we propose Neuro-Symbolic Question Answering (NSQA), a modular KBQA system, that leverages (1) Abstract Meaning Representation (AM… ▽ More

    Submitted 2 June, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted to Findings of ACL

  14. A Tree Adjoining Grammar Representation for Models Of Stochastic Dynamical Systems

    Authors: Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

    Abstract: Model structure and complexity selection remains a challenging problem in system identification, especially for parametric non-linear models. Many Evolutionary Algorithm (EA) based methods have been proposed in the literature for estimating model structure and complexity. In most cases, the proposed methods are devised for estimating structure and complexity within a specified model class and henc… ▽ More

    Submitted 25 May, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: Accepted as brief paper by Automatica

  15. arXiv:1911.02984  [pdf, other

    cs.CL cs.IR

    The TechQA Dataset

    Authors: Vittorio Castelli, Rishav Chakravarti, Saswati Dana, Anthony Ferritto, Radu Florian, Martin Franz, Dinesh Garg, Dinesh Khandelwal, Scott McCarley, Mike McCawley, Mohamed Nasr, Lin Pan, Cezar Pendus, John Pitrelli, Saurabh Pujar, Salim Roukos, Andrzej Sakrajda, Avirup Sil, Rosario Uceda-Sosa, Todd Ward, Rong Zhang

    Abstract: We introduce TechQA, a domain-adaptation question answering dataset for the technical support domain. The TechQA corpus highlights two real-world issues from the automated customer support domain. First, it contains actual questions posed by users on a technical forum, rather than questions generated specifically for a competition or a task. Second, it has a real-world size -- 600 training, 310 de… ▽ More

    Submitted 7 November, 2019; originally announced November 2019.

    Comments: Long version of conference paper to be submitted

  16. arXiv:1904.03152  [pdf, other

    eess.SY cs.CL cs.NE

    Data-driven Modelling of Dynamical Systems Using Tree Adjoining Grammar and Genetic Programming

    Authors: Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

    Abstract: State-of-the-art methods for data-driven modelling of non-linear dynamical systems typically involve interactions with an expert user. In order to partially automate the process of modelling physical systems from data, many EA-based approaches have been proposed for model-structure selection, with special focus on non-linear systems. Recently, an approach for data-driven modelling of non-linear dy… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: Paper accepted at IEEE CEC 2019

  17. Grammar-based Representation and Identification of Dynamical Systems

    Authors: Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

    Abstract: In this paper we propose a novel approach to identify dynamical systems. The method estimates the model structure and the parameters of the model simultaneously, automating the critical decisions involved in identification such as model structure and complexity selection. In order to solve the combined model structure and model parameter estimation problem, a new representation of dynamical system… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Submitted to European Control Conference (ECC) 2019

  18. arXiv:1811.09796  [pdf, other

    cs.CV

    A Novel Technique for Evidence based Conditional Inference in Deep Neural Networks via Latent Feature Perturbation

    Authors: Dinesh Khandelwal, Suyash Agrawal, Parag Singla, Chetan Arora

    Abstract: Auxiliary information can be exploited in machine learning models using the paradigm of evidence based conditional inference. Multi-modal techniques in Deep Neural Networks (DNNs) can be seen as perturbing the latent feature representation for incorporating evidence from the auxiliary modality. However, they require training a specialized network which can map sparse evidence to a high dimensional… ▽ More

    Submitted 6 December, 2019; v1 submitted 24 November, 2018; originally announced November 2018.

  19. On the Simulation of Polynomial NARMAX Models

    Authors: Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

    Abstract: In this paper, we show that the common approach for simulation non-linear stochastic models, commonly used in system identification, via setting the noise contributions to zero results in a biased response. We also demonstrate that to achieve unbiased simulation of finite order NARMAX models, in general, we require infinite order simulation models. The main contributions of the paper are two-fold.… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: Accepted in IEEE CDC 2018

  20. Robust Fault Diagnosis by Optimal Input Design for Self-sensing Systems

    Authors: Dhruv Khandelwal, Siep Weiland, Amol Khalate

    Abstract: This paper presents a methodology for model based robust fault diagnosis and a methodology for input design to obtain optimal diagnosis of faults. The proposed algorithm is suitable for real time implementation. Issues of robustness are addressed for the input design and fault diagnosis methodologies. The proposed technique allows robust fault diagnosis under suitable conditions on the system unce… ▽ More

    Submitted 21 March, 2017; originally announced March 2017.

    Comments: Accepted in IFAC World Congress 2017

  21. arXiv:1606.04506  [pdf, ps, other

    cs.LG cs.CV

    Max-Margin Feature Selection

    Authors: Yamuna Prasad, Dinesh Khandelwal, K. K. Biswas

    Abstract: Many machine learning applications such as in vision, biology and social networking deal with data in high dimensions. Feature selection is typically employed to select a subset of features which im- proves generalization accuracy as well as reduces the computational cost of learning the model. One of the criteria used for feature selection is to jointly minimize the redundancy and maximize the re… ▽ More

    Submitted 14 June, 2016; originally announced June 2016.

    Comments: submitted to PR Letters