Showing 1–4 of 4 results for author: Kennedy, S J J

  1. arXiv:2505.20707  [pdf, ps, other]

    cs.CL cs.AI physics.ed-ph

    Dissecting Physics Reasoning in Small Language Models: A Multi-Dimensional Analysis from an Educational Perspective

    Authors: Nicy Scaria, Silvester John Joseph Kennedy, Diksha Seth, Deepak Subramani

    Abstract: Small Language Models (SLMs) offer computational efficiency and accessibility, making them promising for educational applications. However, their capacity for complex reasoning, particularly in domains such as physics, remains underexplored. This study investigates the high school physics reasoning capabilities of state-of-the-art SLMs (under 4 billion parameters), including instruct versions of L…

    Submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2505.02850  [pdf, other]

    cs.CL cs.AI cs.CY cs.DB

    Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors

    Authors: Nicy Scaria, Silvester John Joseph Kennedy, Diksha Seth, Ananya Thakur, Deepak Subramani

    Abstract: Generating high-quality MCQs, especially those targeting diverse cognitive levels and incorporating common misconceptions into distractor design, is time-consuming and expertise-intensive, making manual creation impractical at scale. Current automated approaches typically generate questions at lower cognitive levels and fail to incorporate domain-specific misconceptions. This paper presents a hier…

    Submitted 2 May, 2025; originally announced May 2025.

  3. arXiv:2408.12226  [pdf, ps, other]

    cs.CL cs.AI

    EvalYaks: Instruction Tuning Datasets and LoRA Fine-tuned Models for Automated Scoring of CEFR B2 Speaking Assessment Transcripts

    Authors: Nicy Scaria, Silvester John Joseph Kennedy, Thomas Latinovich, Deepak Subramani

    Abstract: Relying on human experts to evaluate CEFR speaking assessments in an e-learning environment creates scalability challenges, as it limits how quickly and widely assessments can be conducted. We aim to automate the evaluation of CEFR B2 English speaking assessments in e-learning environments from conversation transcripts. First, we evaluate the capability of leading open source and commercial Large…

    Submitted 30 May, 2025; v1 submitted 22 August, 2024; originally announced August 2024.

  4. arXiv:2407.00996  [pdf, other]

    cs.CL cs.LG

    Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?

    Authors: Nicy Scaria, Silvester John Joseph Kennedy, Deepak Subramani

    Abstract: With the growing need for efficient language models in resource-constrained environments, Small Language Models (SLMs) have emerged as compact and practical alternatives to Large Language Models (LLMs). While studies have explored noise handling in LLMs, little is known about how SLMs handle noise, a critical factor for their reliable real-world deployment. This study investigates the ability of S…

    Submitted 27 May, 2025; v1 submitted 1 July, 2024; originally announced July 2024.