Fine-Tuning Language Models Using Formal Methods Feedback

Yang, Yunhao; Bhatt, Neel P.; Ingebrand, Tyler; Ward, William; Carr, Steven; Wang, Zhangyang; Topcu, Ufuk

Computer Science > Artificial Intelligence

arXiv:2310.18239 (cs)

[Submitted on 27 Oct 2023]

Title:Fine-Tuning Language Models Using Formal Methods Feedback

Authors:Yunhao Yang, Neel P. Bhatt, Tyler Ingebrand, William Ward, Steven Carr, Zhangyang Wang, Ufuk Topcu

View PDF

Abstract:Although pre-trained language models encode generic knowledge beneficial for planning and control, they may fail to generate appropriate control policies for domain-specific tasks. Existing fine-tuning methods use human feedback to address this limitation, however, sourcing human feedback is labor intensive and costly. We present a fully automated approach to fine-tune pre-trained language models for applications in autonomous systems, bridging the gap between generic knowledge and domain-specific requirements while reducing cost. The method synthesizes automaton-based controllers from pre-trained models guided by natural language task descriptions. These controllers are verifiable against independently provided specifications within a world model, which can be abstract or obtained from a high-fidelity simulator. Controllers with high compliance with the desired specifications receive higher ranks, guiding the iterative fine-tuning process. We provide quantitative evidences, primarily in autonomous driving, to demonstrate the method's effectiveness across multiple tasks. The results indicate an improvement in percentage of specifications satisfied by the controller from 60% to 90%.

Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL); Robotics (cs.RO)
Cite as:	arXiv:2310.18239 [cs.AI]
	(or arXiv:2310.18239v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2310.18239

Submission history

From: Yunhao Yang [view email]
[v1] Fri, 27 Oct 2023 16:24:24 UTC (4,635 KB)

Computer Science > Artificial Intelligence

Title:Fine-Tuning Language Models Using Formal Methods Feedback

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Fine-Tuning Language Models Using Formal Methods Feedback

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators