Showing 1–7 of 7 results for author: Mohajer, M M

  1. arXiv:2410.06992

    cs.SE

    SWE-Bench+: Enhanced Coding Benchmark for LLMs

    Authors: Reem Aleithan, Haoran Xue, Mohammad Mahdi Mohajer, Elijah Nnorom, Gias Uddin, Song Wang

    Abstract: Large Language Models (LLMs) in Software Engineering (SE) can offer assistance for coding. To facilitate a rigorous evaluation of LLMs in practical coding contexts, Carlos et al. introduced the SWE-bench dataset, which comprises 2,294 real-world GitHub issues and their corresponding pull requests, collected from 12 widely used Python repositories. Several impressive LLM-based toolkits recently are…

    Submitted 10 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  2. arXiv:2410.06440

    cs.SE

    Checker Bug Detection and Repair in Deep Learning Libraries

    Authors: Nima Shiri Harzevili, Mohammad Mahdi Mohajer, Jiho Shin, Moshi Wei, Gias Uddin, Jinqiu Yang, Junjie Wang, Song Wang, Zhen Ming Jiang, Nachiappan Nagappan

    Abstract: Checker bugs in Deep Learning (DL) libraries are critical yet not well-explored. These bugs are often concealed in the input validation and error-checking code of DL libraries and can lead to silent failures, incorrect results, or unexpected program behavior in DL applications. Despite their potential to significantly impact the reliability and performance of DL-enabled systems built with these li…

    Submitted 8 October, 2024; originally announced October 2024.

  3. arXiv:2409.12369

    cs.SE

    Program Slicing in the Era of Large Language Models

    Authors: Kimya Khakzad Shahandashti, Mohammad Mahdi Mohajer, Alvine Boaye Belle, Song Wang, Hadi Hemmati

    Abstract: Program slicing is a critical technique in software engineering, enabling developers to isolate relevant portions of code for tasks such as bug detection, code comprehension, and debugging. In this study, we investigate the application of large language models (LLMs) to both static and dynamic program slicing, with a focus on Java programs. We evaluate the performance of four state-of-the-art LLMs…

    Submitted 18 September, 2024; originally announced September 2024.

  4. arXiv:2401.17991

    cs.SE

    Evaluating the Effectiveness of GPT-4 Turbo in Creating Defeaters for Assurance Cases

    Authors: Kimya Khakzad Shahandashti, Mithila Sivakumar, Mohammad Mahdi Mohajer, Alvine B. Belle, Song Wang, Timothy C. Lethbridge

    Abstract: Assurance cases (ACs) are structured arguments that support the verification of the correct implementation of systems' non-functional requirements, such as safety and security, thereby preventing system failures which could lead to catastrophic outcomes, including loss of lives. ACs facilitate the certification of systems in accordance with industrial standards, for example, DO-178C and ISO 26262.…

    Submitted 31 January, 2024; originally announced January 2024.

  5. arXiv:2310.18532

    cs.SE

    SkipAnalyzer: A Tool for Static Code Analysis with Large Language Models

    Authors: Mohammad Mahdi Mohajer, Reem Aleithan, Nima Shiri Harzevili, Moshi Wei, Alvine Boaye Belle, Hung Viet Pham, Song Wang

    Abstract: We introduce SkipAnalyzer, a large language model (LLM)-powered tool for static code analysis. SkipAnalyzer has three components: 1) an LLM-based static bug detector that scans source code and reports specific types of bugs, 2) an LLM-based false-positive filter that can identify false-positive bugs in the results of static bug detectors (e.g., the result of step 1) to improve detection accuracy,…

    Submitted 17 December, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

  6. arXiv:2307.11298

    cs.SE

    A First Look at Fairness of Machine Learning Based Code Reviewer Recommendation

    Authors: Mohammad Mahdi Mohajer, Alvine Boaye Belle, Nima Shiri Harzevili, Junjie Wang, Hadi Hemmati, Song Wang, Zhen Ming Jiang

    Abstract: The fairness of machine learning (ML) approaches is critical to the reliability of modern artificial intelligence systems. Despite extensive study on this topic, the fairness of ML models in the software engineering (SE) domain has not been well explored yet. As a result, many ML-powered software systems, particularly those utilized in the software engineering community, continue to be prone to fa…

    Submitted 20 July, 2023; originally announced July 2023.

  7. arXiv:2306.03269

    cs.CR cs.SE

    Security Knowledge-Guided Fuzzing of Deep Learning Libraries

    Authors: Nima Shiri Harzevili, Mohammad Mahdi Mohajer, Moshi Wei, Hung Viet Pham, Song Wang

    Abstract: Recently, many Deep Learning fuzzers have been proposed for testing of DL libraries. However, they either perform unguided input generation (e.g., not considering the relationship between API arguments when generating inputs) or only support a limited set of corner case test inputs. Furthermore, a substantial number of developer APIs crucial for library development remain untested, as they are typ…

    Submitted 24 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.