Skip to main content

Showing 1–3 of 3 results for author: Kapu, N J

.
  1. arXiv:2502.15734  [pdf, other

    cs.DC cs.AI cs.CL cs.LG cs.OS

    Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation

    Authors: Shubham Agarwal, Sai Sundaresan, Subrata Mitra, Debabrata Mahapatra, Archit Gupta, Rounak Sharma, Nirmal Joshua Kapu, Tong Yu, Shiv Saini

    Abstract: Retrieval-Augmented Generation (RAG) is often used with Large Language Models (LLMs) to infuse domain knowledge or user-specific information. In RAG, given a user query, a retriever extracts chunks of relevant text from a knowledge base. These chunks are sent to an LLM as part of the input prompt. Typically, any given chunk is repeatedly retrieved across user questions. However, currently, for eve… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: Accepted at SIGMOD 2025

  2. arXiv:2411.18636  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications

    Authors: Nirmal Joshua Kapu, Raghav Karan

    Abstract: This article surveys convolution-based models including convolutional neural networks (CNNs), Conformers, ResNets, and CRNNs-as speech signal processing models and provide their statistical backgrounds and speech recognition, speaker identification, emotion recognition, and speech enhancement applications. Through comparative training cost assessment, model size, accuracy and speed assessment, we… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

  3. Demo-Craft: Using In-Context Learning to Improve Code Generation in Large Language Models

    Authors: Nirmal Joshua Kapu, Mihit Sreejith

    Abstract: Generating executable code from natural language instructions using Large Language Models (LLMs) poses challenges such as semantic ambiguity and understanding taskspecific contexts. To address these issues, we propose a system called DemoCraft, which enhances code generation by leveraging in-context learning and demonstration selection, combined with latent concept learning. Latent concept learnin… ▽ More

    Submitted 22 March, 2025; v1 submitted 30 October, 2024; originally announced November 2024.

    Comments: Accepted at IEEE ICIITCEE 2025. Presented on 16th January 2025 in Bengaluru, India