Showing 1–6 of 6 results for author: Ouédraogo, W C

  1. arXiv:2506.06767  [pdf, ps, other]

    cs.SE

    Beyond Surface Similarity: Evaluating LLM-Based Test Refactorings with Structural and Semantic Awareness

    Authors: Wendkûuni C. Ouédraogo, Yinghua Li, Xueqi Dang, Xin Zhou, Anil Koyuncu, Jacques Klein, David Lo, Tegawendé F. Bissyandé

    Abstract: Large Language Models (LLMs) are increasingly employed to automatically refactor unit tests, aiming to enhance readability, naming, and structural clarity while preserving functional behavior. However, evaluating such refactorings remains challenging: traditional metrics like CodeBLEU are overly sensitive to renaming and structural edits, whereas embedding-based similarities capture semantics but…

    Submitted 7 June, 2025; originally announced June 2025.

  2. arXiv:2506.06764  [pdf, ps, other]

    cs.SE

    Mind the Gap: A Readability-Aware Metric for Test Code Complexity

    Authors: Wendkûuni C. Ouédraogo, Yinghua Li, Xueqi Dang, Xin Zhou, Anil Koyuncu, Jacques Klein, David Lo, Tegawendé F. Bissyandé

    Abstract: Automatically generated unit tests, from search-based tools like EvoSuite or LLMs, vary significantly in structure and readability. Yet most evaluations rely on metrics like Cyclomatic Complexity and Cognitive Complexity, designed for functional code rather than test code. Recent studies have shown that SonarSource's Cognitive Complexity metric assigns near-zero scores to LLM-generated tests, yet it…

    Submitted 7 June, 2025; originally announced June 2025.

  3. arXiv:2410.10628  [pdf, other]

    cs.SE

    Test smells in LLM-Generated Unit Tests

    Authors: Wendkûuni C. Ouédraogo, Yinghua Li, Kader Kaboré, Xunzhu Tang, Anil Koyuncu, Jacques Klein, David Lo, Tegawendé F. Bissyandé

    Abstract: The use of Large Language Models (LLMs) in automated test generation is gaining popularity, with much of the research focusing on metrics like compilability rate, code coverage and bug detection. However, an equally important quality metric is the presence of test smells: design flaws or anti-patterns in test code that hinder maintainability and readability. In this study, we explore the diffusion…

    Submitted 14 October, 2024; originally announced October 2024.

  4. arXiv:2407.00225  [pdf, ps, other]

    cs.SE

    Large-scale, Independent and Comprehensive study of the power of LLMs for test case generation

    Authors: Wendkûuni C. Ouédraogo, Kader Kaboré, Yinghua Li, Haoye Tian, Anil Koyuncu, Jacques Klein, David Lo, Tegawendé F. Bissyandé

    Abstract: Unit testing is essential for software reliability, yet manual test creation is time-consuming and often neglected. Although search-based software testing improves efficiency, it produces tests with poor readability and maintainability. Although LLMs show promise for test generation, existing research lacks comprehensive evaluation across execution-driven assessment, reasoning-based prompting, and…

    Submitted 5 July, 2025; v1 submitted 28 June, 2024; originally announced July 2024.

  5. arXiv:2312.14898  [pdf, other]

    cs.SE

    Enriching Automatic Test Case Generation by Extracting Relevant Test Inputs from Bug Reports

    Authors: Wendkûuni C. Ouédraogo, Laura Plein, Kader Kaboré, Andrew Habib, Jacques Klein, David Lo, Tegawendé F. Bissyandé

    Abstract: The quality of software is closely tied to the effectiveness of the tests it undergoes. Manual test writing, though crucial for bug detection, is time-consuming, which has driven significant research into automated test case generation. However, current methods often struggle to generate relevant inputs, limiting the effectiveness of the tests produced. To address this, we introduce BRMiner, a nov…

    Submitted 19 March, 2025; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted at Empirical Software Engineering (EMSE) journal on 4 March 2025

  6. arXiv:2310.06320  [pdf, other]

    cs.SE

    Automatic Generation of Test Cases based on Bug Reports: a Feasibility Study with Large Language Models

    Authors: Laura Plein, Wendkûuni C. Ouédraogo, Jacques Klein, Tegawendé F. Bissyandé

    Abstract: Software testing is a core discipline in software engineering where a large array of research results has been produced, notably in the area of automatic test generation. Because existing approaches produce test cases that either can be qualified as simple (e.g. unit tests) or that require precise specifications, most testing procedures still rely on test cases written by humans to form test suite…

    Submitted 10 October, 2023; originally announced October 2023.