Skip to main content

Showing 1–10 of 10 results for author: Jabbarvand, R

.
  1. arXiv:2501.18482  [pdf, other

    cs.SE

    A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models

    Authors: Changshu Liu, Reyhaneh Jabbarvand

    Abstract: Code Executing Reasoning is becoming a new non-functional metric that assesses the ability of large language models (LLMs) in programming tasks. State-of-the-art frameworks (CodeMind or REval) and benchmarks (CruxEval) usually focus on LLM's prediction of a given code's input/output or intermediate variable states/values on limited programs. However, there is no tool for more in-depth analysis of… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: 5 pages

  2. arXiv:2411.13239  [pdf

    cs.DC cs.AI cs.AR cs.ET cs.MA

    Transforming the Hybrid Cloud for Emerging AI Workloads

    Authors: Deming Chen, Alaa Youssef, Ruchi Pendse, André Schleife, Bryan K. Clark, Hendrik Hamann, Jingrui He, Teodoro Laino, Lav Varshney, Yuxiong Wang, Avirup Sil, Reyhaneh Jabbarvand, Tianyin Xu, Volodymyr Kindratenko, Carlos Costa, Sarita Adve, Charith Mendis, Minjia Zhang, Santiago Núñez-Corrales, Raghu Ganti, Mudhakar Srivatsa, Nam Sung Kim, Josep Torrellas, Jian Huang, Seetharami Seelam , et al. (20 additional authors not shown)

    Abstract: This white paper, developed through close collaboration between IBM Research and UIUC researchers within the IIDAI Institute, envisions transforming hybrid cloud systems to meet the growing complexity of AI workloads through innovative, full-stack co-design approaches, emphasizing usability, manageability, affordability, adaptability, efficiency, and scalability. By integrating cutting-edge techno… ▽ More

    Submitted 21 May, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: 70 pages, 27 figures

  3. arXiv:2410.24117  [pdf, other

    cs.SE cs.LG

    AlphaTrans: A Neuro-Symbolic Compositional Approach for Repository-Level Code Translation and Validation

    Authors: Ali Reza Ibrahimzada, Kaiyao Ke, Mrigank Pawagi, Muhammad Salman Abid, Rangeet Pan, Saurabh Sinha, Reyhaneh Jabbarvand

    Abstract: Code translation transforms programs from one programming language (PL) to another. Several rule-based transpilers have been designed to automate code translation between different pairs of PLs. However, the rules can become obsolete as the PLs evolve and cannot generalize to other PLs. Recent studies have explored the automation of code translation using Large Language Models (LLMs). One key obse… ▽ More

    Submitted 24 April, 2025; v1 submitted 31 October, 2024; originally announced October 2024.

    Comments: Published in FSE 2025

  4. arXiv:2404.09398  [pdf, other

    cs.SE

    A Generic Approach to Fix Test Flakiness in Real-World Projects

    Authors: Yang Chen, Reyhaneh Jabbarvand

    Abstract: Test flakiness, a non-deterministic behavior of builds irrelevant to code changes, is a major and continuing impediment to delivering reliable software. The very few techniques for the automated repair of test flakiness are specifically crafted to repair either Order-Dependent (OD) or Implementation-Dependent (ID) flakiness. They are also all symbolic approaches, i.e., leverage program analysis to… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  5. arXiv:2402.09664  [pdf, other

    cs.SE cs.AI cs.CL cs.PL

    CodeMind: Evaluating Large Language Models for Code Reasoning

    Authors: Changshu Liu, Yang Chen, Reyhaneh Jabbarvand

    Abstract: Large Language Models (LLMs) have been widely used to automate programming tasks. Their capabilities have been evaluated by assessing the quality of generated code through tests or proofs. The extent to which they can reason about code is a critical question revealing important insights about their true capabilities. This paper introduces CodeMind, a framework designed to gauge the code reasoning… ▽ More

    Submitted 22 May, 2025; v1 submitted 14 February, 2024; originally announced February 2024.

  6. arXiv:2310.15991  [pdf, other

    cs.SE cs.LG cs.PL

    WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models

    Authors: Chenyuan Yang, Yinlin Deng, Runyu Lu, Jiayi Yao, Jiawei Liu, Reyhaneh Jabbarvand, Lingming Zhang

    Abstract: Compiler correctness is crucial, as miscompilation can falsify program behaviors, leading to serious consequences. Fuzzing has been studied to uncover compiler defects. However, compiler fuzzing remains challenging: Existing arts focus on black- and grey-box fuzzing, which generates tests without sufficient understanding of internal compiler behaviors. Meanwhile, traditional white-box techniques,… ▽ More

    Submitted 4 September, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Published in OOPSLA 2024

    Journal ref: Proc. ACM Program. Lang., Vol. 8, No. OOPSLA2, Article 296. Publication date: October 2024

  7. arXiv:2310.02407  [pdf, other

    cs.SE cs.LG

    Automated Bug Generation in the era of Large Language Models

    Authors: Ali Reza Ibrahimzada, Yang Chen, Ryan Rong, Reyhaneh Jabbarvand

    Abstract: Bugs are essential in software engineering; many research studies in the past decades have been proposed to detect, localize, and repair bugs in software systems. Effectiveness evaluation of such techniques requires complex bugs, i.e., those that are hard to detect through testing and hard to repair through debugging. From the classic software engineering point of view, a hard-to-repair bug differ… ▽ More

    Submitted 3 October, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

  8. Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code

    Authors: Rangeet Pan, Ali Reza Ibrahimzada, Rahul Krishna, Divya Sankar, Lambert Pouguem Wassi, Michele Merler, Boris Sobolev, Raju Pavuluri, Saurabh Sinha, Reyhaneh Jabbarvand

    Abstract: Code translation aims to convert source code from one programming language (PL) to another. Given the promising abilities of large language models (LLMs) in code synthesis, researchers are exploring their potential to automate code translation. The prerequisite for advancing the state of LLM-based code translation is to understand their promises and limitations over existing techniques. To that en… ▽ More

    Submitted 16 January, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: Published in ICSE 2024

  9. arXiv:2305.10314  [pdf, other

    cs.CL cs.AI cs.SE

    LeTI: Learning to Generate from Textual Interactions

    Authors: Xingyao Wang, Hao Peng, Reyhaneh Jabbarvand, Heng Ji

    Abstract: Fine-tuning pre-trained language models (LMs) is essential for enhancing their capabilities. Existing techniques commonly fine-tune on input-output pairs (e.g., instruction tuning) or with numerical rewards that gauge the output quality (e.g., RLHF). We explore LMs' potential to learn from textual interactions (LETI) that not only check their correctness with binary labels but also pinpoint and ex… ▽ More

    Submitted 19 March, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NAACL 2024 Findings

  10. arXiv:2302.01488  [pdf

    cs.SE cs.AI cs.LG

    Perfect is the enemy of test oracle

    Authors: Ali Reza Ibrahimzada, Yigit Varli, Dilara Tekinoglu, Reyhaneh Jabbarvand

    Abstract: Automation of test oracles is one of the most challenging facets of software testing, but remains comparatively less addressed compared to automated test input generation. Test oracles rely on a ground-truth that can distinguish between the correct and buggy behavior to determine whether a test fails (detects a bug) or passes. What makes the oracle problem challenging and undecidable is the assump… ▽ More

    Submitted 5 April, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Published in ESEC/FSE 2022