Performance-Aligned LLMs for Generating Fast Code

Nichols, Daniel; Polasam, Pranav; Menon, Harshitha; Marathe, Aniruddha; Gamblin, Todd; Bhatele, Abhinav

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2404.18864 (cs)

[Submitted on 29 Apr 2024]

Title:Performance-Aligned LLMs for Generating Fast Code

Authors:Daniel Nichols, Pranav Polasam, Harshitha Menon, Aniruddha Marathe, Todd Gamblin, Abhinav Bhatele

View PDF

Abstract:Optimizing scientific software is a difficult task because codebases are often large and complex, and performance can depend upon several factors including the algorithm, its implementation, and hardware among others. Causes of poor performance can originate from disparate sources and be difficult to diagnose. Recent years have seen a multitude of work that use large language models (LLMs) to assist in software development tasks. However, these tools are trained to model the distribution of code as text, and are not specifically designed to understand performance aspects of code. In this work, we introduce a reinforcement learning based methodology to align the outputs of code LLMs with performance. This allows us to build upon the current code modeling capabilities of LLMs and extend them to generate better performing code. We demonstrate that our fine-tuned model improves the expected speedup of generated code over base models for a set of benchmark tasks from 0.9 to 1.6 for serial code and 1.9 to 4.5 for OpenMP code.

Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
Cite as:	arXiv:2404.18864 [cs.DC]
	(or arXiv:2404.18864v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2404.18864

Submission history

From: Daniel Nichols [view email]
[v1] Mon, 29 Apr 2024 16:52:38 UTC (457 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Performance-Aligned LLMs for Generating Fast Code

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Performance-Aligned LLMs for Generating Fast Code

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators