ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios

Bayer, Markus; Lutz, Justin; Reuter, Christian

Computer Science > Computation and Language

arXiv:2405.10808 (cs)

[Submitted on 17 May 2024 (v1), last revised 23 May 2025 (this version, v2)]

Title:ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios

Authors:Markus Bayer, Justin Lutz, Christian Reuter

View PDF HTML (experimental)

Abstract:Active learning is designed to minimize annotation efforts by prioritizing instances that most enhance learning. However, many active learning strategies struggle with a `cold-start' problem, needing substantial initial data to be effective. This limitation reduces their utility in the increasingly relevant few-shot scenarios, where the instance selection has a substantial impact. To address this, we introduce ActiveLLM, a novel active learning approach that leverages Large Language Models such as GPT-4, o1, Llama 3, or Mistral Large for selecting instances. We demonstrate that ActiveLLM significantly enhances the classification performance of BERT classifiers in few-shot scenarios, outperforming traditional active learning methods as well as improving the few-shot learning methods ADAPET, PERFECT, and SetFit. Additionally, ActiveLLM can be extended to non-few-shot scenarios, allowing for iterative selections. In this way, ActiveLLM can even help other active learning strategies to overcome their cold-start problem. Our results suggest that ActiveLLM offers a promising solution for improving model performance across various learning setups.

Comments:	20 pages, 10 figures, 7 tables
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2405.10808 [cs.CL]
	(or arXiv:2405.10808v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.10808

Submission history

From: Markus Bayer [view email]
[v1] Fri, 17 May 2024 14:23:54 UTC (525 KB)
[v2] Fri, 23 May 2025 13:27:21 UTC (3,015 KB)

Computer Science > Computation and Language

Title:ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators