Skip to main content

Showing 1–8 of 8 results for author: Bairi, R

.
  1. arXiv:2506.11060  [pdf, other

    cs.SE cs.AI

    Code Researcher: Deep Research Agent for Large Systems Code and Commit History

    Authors: Ramneet Singh, Sathvik Joel, Abhav Mehrotra, Nalin Wadhwa, Ramakrishna B Bairi, Aditya Kanade, Nagarajan Natarajan

    Abstract: Large Language Model (LLM)-based coding agents have shown promising results on coding benchmarks, but their effectiveness on systems code remains underexplored. Due to the size and complexities of systems code, making changes to a systems codebase is a daunting task, even for humans. It requires researching about many pieces of context, derived from the large codebase and its massive commit histor… ▽ More

    Submitted 27 May, 2025; originally announced June 2025.

  2. arXiv:2406.11638  [pdf, other

    cs.AI cs.SE

    MASAI: Modular Architecture for Software-engineering AI Agents

    Authors: Daman Arora, Atharv Sonwane, Nalin Wadhwa, Abhav Mehrotra, Saiteja Utpala, Ramakrishna Bairi, Aditya Kanade, Nagarajan Natarajan

    Abstract: A common method to solve complex problems in software engineering, is to divide the problem into multiple sub-problems. Inspired by this, we propose a Modular Architecture for Software-engineering AI (MASAI) agents, where different LLM-powered sub-agents are instantiated with well-defined objectives and strategies tuned to achieve those objectives. Our modular architecture offers several advantage… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2405.01573  [pdf, other

    cs.SE cs.AI

    Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

    Authors: Ajinkya Deshpande, Anmol Agarwal, Shashank Shet, Arun Iyer, Aditya Kanade, Ramakrishna Bairi, Suresh Parthasarathy

    Abstract: LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, ne… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 April, 2024; originally announced May 2024.

    Comments: Preprint with additional experiments

  4. arXiv:2310.01892  [pdf, ps, other

    cs.LG cs.AI

    FiGURe: Simple and Efficient Unsupervised Node Representations with Filter Augmentations

    Authors: Chanakya Ekbote, Ajinkya Pankaj Deshpande, Arun Iyer, Ramakrishna Bairi, Sundararajan Sellamanickam

    Abstract: Unsupervised node representations learnt using contrastive learning-based methods have shown good performance on downstream tasks. However, these methods rely on augmentations that mimic low-pass filters, limiting their performance on tasks requiring different eigen-spectrum parts. This paper presents a simple filter-based augmentation method to capture different parts of the eigen-spectrum. We sh… ▽ More

    Submitted 4 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  5. arXiv:2309.12499  [pdf, other

    cs.SE

    CodePlan: Repository-level Coding using LLMs and Planning

    Authors: Ramakrishna Bairi, Atharv Sonwane, Aditya Kanade, Vageesh D C, Arun Iyer, Suresh Parthasarathy, Sriram Rajamani, B. Ashok, Shashank Shet

    Abstract: Software engineering activities such as package migration, fixing errors reports from static analysis or testing, and adding type annotations or other specifications to a codebase, involve pervasively editing the entire repository of code. We formulate these activities as repository-level coding tasks. Recent tools like GitHub Copilot, which are powered by Large Language Models (LLMs), have succ… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  6. arXiv:2202.02385  [pdf, other

    cs.SE cs.AI

    Using Large-scale Heterogeneous Graph Representation Learning for Code Review Recommendations at Microsoft

    Authors: Jiyang Zhang, Chandra Maddila, Ram Bairi, Christian Bird, Ujjwal Raizada, Apoorva Agrawal, Yamini Jhawar, Kim Herzig, Arie van Deursen

    Abstract: Code review is an integral part of any mature software development process, and identifying the best reviewer for a code change is a well-accepted problem within the software engineering community. Selecting a reviewer who lacks expertise and understanding can slow development or result in more defects. To date, most reviewer recommendation systems rely primarily on historical file change and revi… ▽ More

    Submitted 2 February, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: ICSE 2023 Software Engineering in Practice (camera ready)

  7. arXiv:2102.07575  [pdf, other

    cs.IR cs.LG

    User Embedding based Neighborhood Aggregation Method for Inductive Recommendation

    Authors: Rahul Ragesh, Sundararajan Sellamanickam, Vijay Lingam, Arun Iyer, Ramakrishna Bairi

    Abstract: We consider the problem of learning latent features (aka embedding) for users and items in a recommendation setting. Given only a user-item interaction graph, the goal is to recommend items for each user. Traditional approaches employ matrix factorization-based collaborative filtering methods. Recent methods using graph convolutional networks (e.g., LightGCN) achieve state-of-the-art performance.… ▽ More

    Submitted 16 February, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  8. arXiv:2008.12842  [pdf, other

    cs.CL cs.LG stat.ML

    HeteGCN: Heterogeneous Graph Convolutional Networks for Text Classification

    Authors: Rahul Ragesh, Sundararajan Sellamanickam, Arun Iyer, Ram Bairi, Vijay Lingam

    Abstract: We consider the problem of learning efficient and inductive graph convolutional networks for text classification with a large number of examples and features. Existing state-of-the-art graph embedding based methods such as predictive text embedding (PTE) and TextGCN have shortcomings in terms of predictive performance, scalability and inductive capability. To address these limitations, we propose… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.