Harnessing the Power of LLMs in Source Code Vulnerability Detection

Mahyari, Andrew A

Computer Science > Software Engineering

arXiv:2408.03489 (cs)

[Submitted on 7 Aug 2024]

Title:Harnessing the Power of LLMs in Source Code Vulnerability Detection

Authors:Andrew A Mahyari

View PDF HTML (experimental)

Abstract:Software vulnerabilities, caused by unintentional flaws in source code, are a primary root cause of cyberattacks. Static analysis of source code has been widely used to detect these unintentional defects introduced by software developers. Large Language Models (LLMs) have demonstrated human-like conversational abilities due to their capacity to capture complex patterns in sequential data, such as natural languages. In this paper, we harness LLMs' capabilities to analyze source code and detect known vulnerabilities. To ensure the proposed vulnerability detection method is universal across multiple programming languages, we convert source code to LLVM IR and train LLMs on these intermediate representations. We conduct extensive experiments on various LLM architectures and compare their accuracy. Our comprehensive experiments on real-world and synthetic codes from NVD and SARD demonstrate high accuracy in identifying source code vulnerabilities.

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
Cite as:	arXiv:2408.03489 [cs.SE]
	(or arXiv:2408.03489v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2408.03489

Submission history

From: Arash Mahyari [view email]
[v1] Wed, 7 Aug 2024 00:48:49 UTC (760 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SE

< prev | next >

new | recent | 2024-08

Change to browse by:

cs
cs.AI
cs.CR

References & Citations

export BibTeX citation

Computer Science > Software Engineering

Title:Harnessing the Power of LLMs in Source Code Vulnerability Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Harnessing the Power of LLMs in Source Code Vulnerability Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators