PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Li, Hongwei; Tang, Yuheng; Wang, Shiqi; Guo, Wenbo

Computer Science > Robotics

arXiv:2502.02747 (cs)

[Submitted on 4 Feb 2025 (v1), last revised 10 Jun 2025 (this version, v2)]

Title:PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Authors:Hongwei Li, Yuheng Tang, Shiqi Wang, Wenbo Guo

View PDF HTML (experimental)

Abstract:Recent research builds various patching agents that combine large language models (LLMs) with non-ML tools and achieve promising results on the state-of-the-art (SOTA) software patching benchmark, SWE-bench. Based on how to determine the patching workflows, existing patching agents can be categorized as agent-based planning methods, which rely on LLMs for planning, and rule-based planning methods, which follow a pre-defined workflow. At a high level, agent-based planning methods achieve high patching performance but with a high cost and limited stability. Rule-based planning methods, on the other hand, are more stable and efficient but have key workflow limitations that compromise their patching performance. In this paper, we propose PatchPilot, an agentic patcher that strikes a balance between patching efficacy, stability, and cost-efficiency. PatchPilot proposes a novel rule-based planning workflow with five components: reproduction, localization, generation, validation, and refinement (where refinement is unique to PatchPilot). We introduce novel and customized designs to each component to optimize their effectiveness and efficiency. Through extensive experiments on the SWE-bench benchmarks, PatchPilot shows a superior performance than existing open-source methods while maintaining low cost (less than 1$ per instance) and ensuring higher stability. We also conduct a detailed ablation study to validate the key designs in each component. Our code is available at this https URL.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
Cite as:	arXiv:2502.02747 [cs.RO]
	(or arXiv:2502.02747v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2502.02747

Submission history

From: Yuheng Tang [view email]
[v1] Tue, 4 Feb 2025 22:30:02 UTC (625 KB)
[v2] Tue, 10 Jun 2025 18:19:40 UTC (622 KB)

Computer Science > Robotics

Title:PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators