Skip to main content

Showing 1–2 of 2 results for author: Chau, T P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.01999  [pdf, other

    cs.SE

    CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding & Reasoning Capabilities of CodeLLMs

    Authors: Dung Nguyen Manh, Thang Phan Chau, Nam Le Hai, Thong T. Doan, Nam V. Nguyen, Quang Pham, Nghi D. Q. Bui

    Abstract: Recent advances in Code Large Language Models (CodeLLMs) have primarily focused on open-ended code generation, often overlooking the crucial aspect of code understanding and reasoning. To bridge this gap, we introduce CodeMMLU, a comprehensive multiple-choice benchmark designed to evaluate the depth of software and code comprehension in LLMs. CodeMMLU includes nearly 20,000 questions spanning dive… ▽ More

    Submitted 9 April, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  2. arXiv:2406.11912  [pdf, other

    cs.SE cs.AI

    AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology

    Authors: Minh Huynh Nguyen, Thang Phan Chau, Phong X. Nguyen, Nghi D. Q. Bui

    Abstract: Software agents have emerged as promising tools for addressing complex software engineering tasks. Existing works, on the other hand, frequently oversimplify software development workflows, despite the fact that such workflows are typically more complex in the real world. Thus, we propose AgileCoder, a multi agent system that integrates Agile Methodology (AM) into the framework. This system assign… ▽ More

    Submitted 14 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: Work in progress