Showing 1–1 of 1 results for author: Dristi, S

  1. arXiv:2404.10155  [pdf, other]

    cs.SE cs.LG

    The Fault in our Stars: Quality Assessment of Code Generation Benchmarks

    Authors: Mohammed Latif Siddiq, Simantika Dristi, Joy Saha, Joanna C. S. Santos

    Abstract: Large Language Models (LLMs) are gaining popularity among software engineers. A crucial aspect of developing effective code generation LLMs is to evaluate these models using a robust benchmark. Evaluation benchmarks with quality issues can provide a false sense of performance. In this work, we conduct the first-of-its-kind study of the quality of prompts within benchmarks used to compare the perfo…

    Submitted 4 September, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted at the 24th IEEE International Conference on Source Code Analysis and Manipulation (SCAM 2024) Research Track