Boosting the Capability of Intelligent Vulnerability Detection by Training in a Human-Learning Manner

Dou, Shihan; Wu, Yueming; Li, Wenxuan; Cheng, Feng; Yang, Wei; Liu, Yang

Abstract:Due to its powerful automatic feature extraction, deep learning (DL) has been widely used in source code vulnerability detection. However, although it performs well on artificial datasets, its performance is not satisfactory when detecting real-world vulnerabilities due to the high complexity of real-world samples. In this paper, we propose to train DL-based vulnerability detection models in a human-learning manner, that is, start with the simplest samples and then gradually transition to difficult knowledge. Specifically, we design a novel framework (Humer) that can enhance the detection ability of DL-based vulnerability detectors. To validate the effectiveness of Humer, we select five state-of-the-art DL-based vulnerability detection models (TokenCNN, VulDeePecker, StatementGRU, ASTGRU, and Devign) to complete our evaluations. Through the results, we find that the use of Humer can increase the F1 of these models by an average of 10.5%. Moreover, Humer can make the model detect up to 16.7% more real-world vulnerabilities. Meanwhile, we also conduct a case study to uncover vulnerabilities from real-world open source products by using these enhanced DL-based vulnerability detectors. Through the results, we finally discover 281 unreported vulnerabilities in NVD, of which 98 have been silently patched by vendors in the latest version of corresponding products, but 159 still exist in the products.

Subjects:	Cryptography and Security (cs.CR)
Cite as:	arXiv:2112.06250 [cs.CR]
	(or arXiv:2112.06250v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2112.06250

Computer Science > Cryptography and Security

Title:Boosting the Capability of Intelligent Vulnerability Detection by Training in a Human-Learning Manner

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators