Rethinking Skill Extraction in the Job Market Domain using Large Language Models

Nguyen, Khanh Cao; Zhang, Mike; Montariol, Syrielle; Bosselut, Antoine

Computer Science > Computation and Language

arXiv:2402.03832 (cs)

[Submitted on 6 Feb 2024]

Title:Rethinking Skill Extraction in the Job Market Domain using Large Language Models

Authors:Khanh Cao Nguyen, Mike Zhang, Syrielle Montariol, Antoine Bosselut

View PDF

Abstract:Skill Extraction involves identifying skills and qualifications mentioned in documents such as job postings and resumes. The task is commonly tackled by training supervised models using a sequence labeling approach with BIO tags. However, the reliance on manually annotated data limits the generalizability of such approaches. Moreover, the common BIO setting limits the ability of the models to capture complex skill patterns and handle ambiguous mentions. In this paper, we explore the use of in-context learning to overcome these challenges, on a benchmark of 6 uniformized skill extraction datasets. Our approach leverages the few-shot learning capabilities of large language models (LLMs) to identify and extract skills from sentences. We show that LLMs, despite not being on par with traditional supervised models in terms of performance, can better handle syntactically complex skill mentions in skill extraction tasks.

Comments:	Published at NLP4HR 2024 (EACL Workshop)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2402.03832 [cs.CL]
	(or arXiv:2402.03832v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.03832

Submission history

From: Syrielle Montariol [view email]
[v1] Tue, 6 Feb 2024 09:23:26 UTC (8,063 KB)

Computer Science > Computation and Language

Title:Rethinking Skill Extraction in the Job Market Domain using Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Rethinking Skill Extraction in the Job Market Domain using Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators