Showing 1–7 of 7 results for author: Srivatsa, K A

  1. arXiv:2506.04920  [pdf, ps, other]

    cs.CL cs.AI

    Simulating LLM-to-LLM Tutoring for Multilingual Math Feedback

    Authors: Junior Cedric Tonga, KV Aditya Srivatsa, Kaushal Kumar Maurya, Fajri Koto, Ekaterina Kochmar

    Abstract: Large language models (LLMs) have demonstrated the ability to generate formative feedback and instructional hints in English, making them increasingly relevant for AI-assisted education. However, their ability to provide effective instructional support across different languages, especially for mathematically grounded reasoning tasks, remains largely unexamined. In this work, we present the first…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Preprint, in submission

  2. arXiv:2412.09416  [pdf, other]

    cs.CL

    Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors

    Authors: Kaushal Kumar Maurya, KV Aditya Srivatsa, Kseniia Petukhova, Ekaterina Kochmar

    Abstract: In this paper, we investigate whether current state-of-the-art large language models (LLMs) are effective as AI tutors and whether they demonstrate pedagogical abilities necessary for good AI tutoring in educational dialogues. Previous efforts towards evaluation have been limited to subjective protocols and benchmarks. To bridge this gap, we propose a unified evaluation taxonomy with eight pedagog…

    Submitted 8 February, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

    Comments: 9 pages

    Journal ref: NAACL 2025

  3. arXiv:2408.08545  [pdf, ps, other]

    cs.CL

    SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models

    Authors: Kaushal Kumar Maurya, KV Aditya Srivatsa, Ekaterina Kochmar

    Abstract: Large language models (LLMs) have been widely adopted due to their remarkable performance across various applications, driving the accelerated development of a large number of diverse models. However, these individual LLMs show limitations in generalization and performance on complex tasks due to inherent training biases, model size constraints, and the quality or diversity of pre-training dataset…

    Submitted 12 June, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: 9 pages

    Journal ref: ACL 2025 (Findings)

  4. arXiv:2405.00467  [pdf, other]

    cs.CL

    Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing

    Authors: KV Aditya Srivatsa, Kaushal Kumar Maurya, Ekaterina Kochmar

    Abstract: With the rapid development of LLMs, it is natural to ask how to harness their capabilities efficiently. In this paper, we explore whether it is feasible to direct each input query to a single most suitable LLM. To this end, we propose LLM routing for challenging reasoning tasks. Our extensive experiments suggest that such routing shows promise but is not feasible in all scenarios, so more robust a…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted to Workshop on Insights from Negative Results in NLP 2024 (co-located with NAACL 2024)

  5. arXiv:2403.11369  [pdf, other]

    cs.CL

    What Makes Math Word Problems Challenging for LLMs?

    Authors: KV Aditya Srivatsa, Ekaterina Kochmar

    Abstract: This paper investigates the question of what makes math word problems (MWPs) in English challenging for large language models (LLMs). We conduct an in-depth analysis of the key linguistic and mathematical characteristics of MWPs. In addition, we train feature-based classifiers to better understand the impact of each feature on the overall difficulty of MWPs for prominent LLMs and investigate wheth…

    Submitted 1 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL Findings 2024

  6. arXiv:2306.08872  [pdf, other]

    cs.CL cs.AI

    Neural models for Factual Inconsistency Classification with Explanations

    Authors: Tathagata Raha, Mukund Choudhary, Abhinav Menon, Harshit Gupta, KV Aditya Srivatsa, Manish Gupta, Vasudeva Varma

    Abstract: Factual consistency is one of the most important requirements when editing high quality documents. It is extremely important for automatic text generation systems like summarization, question answering, dialog modeling, and language modeling. Still, automated factual inconsistency detection is rather under-studied. Existing work has focused on (a) finding fake news keeping a knowledge base in cont…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: ECML-PKDD 2023

  7. arXiv:2112.02721  [pdf, other]

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo, et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split…

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter