Computer Science > Software Engineering

arXiv:1908.00614 (cs)

[Submitted on 1 Aug 2019 (v1), last revised 5 Aug 2019 (this version, v2)]

Title:Learning to Identify Security-Related Issues Using Convolutional Neural Networks

Authors:David N. Palacio, Daniel McCrystal, Kevin Moran, Carlos Bernal-Cárdenas, Denys Poshyvanyk, Chris Shenefiel

View PDF

Abstract:Software security is becoming a high priority for both large companies and start-ups alike due to the increasing potential for harm that vulnerabilities and breaches carry with them. However, attaining robust security assurance while delivering features requires a precarious balancing act in the context of agile development practices. One path forward to help aid development teams in securing their software products is through the design and development of security-focused automation. Ergo, we present a novel approach, called SecureReqNet, for automatically identifying whether issues in software issue tracking systems describe security-related content. Our approach consists of a two-phase neural net architecture that operates purely on the natural language descriptions of issues. The first phase of our approach learns high dimensional word embeddings from hundreds of thousands of vulnerability descriptions listed in the CVE database and issue descriptions extracted from open source projects. The second phase then utilizes the semantic ontology represented by these embeddings to train a convolutional neural network capable of predicting whether a given issue is security-related. We evaluated SecureReqNet by applying it to identify security-related issues from a dataset of thousands of issues mined from popular projects on GitLab and GitHub. In addition, we also applied our approach to identify security-related requirements from a commercial software project developed by a major telecommunication company. Our preliminary results are encouraging, with SecureReqNet achieving an accuracy of 96% on open source issues and 71.6% on industrial requirements.

Comments:	5 pages, 3 Figures, ICSME 2019 conference
Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:1908.00614 [cs.SE]
	(or arXiv:1908.00614v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.1908.00614

Submission history

From: David N. Palacio [view email]
[v1] Thu, 1 Aug 2019 20:33:56 UTC (294 KB)
[v2] Mon, 5 Aug 2019 13:26:14 UTC (294 KB)

Computer Science > Software Engineering

Title:Learning to Identify Security-Related Issues Using Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Learning to Identify Security-Related Issues Using Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators