SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub

Carter, Benjamin C.; Contreras, Jonathan Rivas; Villegas, Carlos A. Llanes; Acharya, Pawan; Utzerath, Jack; Farner, Adonijah O.; Jenkins, Hunter; Johnson, Dylan; Penney, Jacob; Steinmacher, Igor; Gerosa, Marco A.; Santos, Fabio

Computer Science > Software Engineering

arXiv:2501.15922 (cs)

[Submitted on 27 Jan 2025]

Title:SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub

Authors:Benjamin C. Carter, Jonathan Rivas Contreras, Carlos A. Llanes Villegas, Pawan Acharya, Jack Utzerath, Adonijah O. Farner, Hunter Jenkins, Dylan Johnson, Jacob Penney, Igor Steinmacher, Marco A. Gerosa, Fabio Santos

View PDF HTML (experimental)

Abstract:New contributors often struggle to find tasks that they can tackle when onboarding onto a new Open Source Software (OSS) project. One reason for this difficulty is that issue trackers lack explanations about the knowledge or skills needed to complete a given task successfully. These explanations can be complex and time-consuming to produce. Past research has partially addressed this problem by labeling issues with issue types, issue difficulty level, and issue skills. However, current approaches are limited to a small set of labels and lack in-depth details about their semantics, which may not sufficiently help contributors identify suitable issues. To surmount this limitation, this paper explores large language models (LLMs) and Random Forest (RF) to predict the multilevel skills required to solve the open issues. We introduce a novel tool, SkillScope, which retrieves current issues from Java projects hosted on GitHub and predicts the multilevel programming skills required to resolve these issues. In a case study, we demonstrate that SkillScope could predict 217 multilevel skills for tasks with 91% precision, 88% recall, and 89% F-measure on average. Practitioners can use this tool to better delegate or choose tasks to solve in OSS projects.

Subjects:	Software Engineering (cs.SE); Machine Learning (cs.LG)
Cite as:	arXiv:2501.15922 [cs.SE]
	(or arXiv:2501.15922v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2501.15922

Submission history

From: Fabio Marcos De Abreu Santos [view email]
[v1] Mon, 27 Jan 2025 10:17:38 UTC (1,704 KB)

Computer Science > Software Engineering

Title:SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators