Showing 1–2 of 2 results for author: Kathala, K C R

  1. arXiv:2411.15821  [pdf]

    cs.CL cs.AI cs.LG

    Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?

    Authors: Aryan Sajith, Krishna Chaitanya Rao Kathala

    Abstract: This study investigates the relative impact of training data quality versus quantity on the performance of small language models (SLMs), utilizing the TinyStories dataset for empirical analysis. Analysis of dataset variations with respect to size (25% and 50% of the original size) and duplication (controlled rates of 25%, 50%, 75%, and 100%) were performed. Model performance was evaluated based on…

    Submitted 23 May, 2025; v1 submitted 24 November, 2024; originally announced November 2024.

    Comments: 15 pages, 4 figures, 4 tables | Conference: International Conference on Neural Computing for Advanced Applications 2025, Conference info: https://aaci.org.hk/ncaa2025

  2. arXiv:2402.18139  [pdf, other]

    cs.CL cs.AI

    Cause and Effect: Can Large Language Models Truly Understand Causality?

    Authors: Swagata Ashwani, Kshiteesh Hegde, Nishith Reddy Mannuru, Mayank Jindal, Dushyant Singh Sengar, Krishna Chaitanya Rao Kathala, Dishant Banga, Vinija Jain, Aman Chadha

    Abstract: With the rise of Large Language Models (LLMs), it has become crucial to understand their capabilities and limitations in deciphering and explaining the complex web of causal relationships that language entails. Current methods use either explicit or implicit causal reasoning, yet there is a strong need for a unified approach combining both to tackle a wide array of causal relationships more effecti…

    Submitted 29 September, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: AI Trustworthiness and Risk Assessment for Challenged Contexts (ATRACC) AAAI 2024 Fall Symposium