Multi-label Sequential Sentence Classification via Large Language Model

Lan, Mengfei; Zheng, Lecheng; Ming, Shufan; Kilicoglu, Halil

Computer Science > Computation and Language

arXiv:2411.15623 (cs)

[Submitted on 23 Nov 2024 (v1), last revised 29 Nov 2024 (this version, v2)]

Title:Multi-label Sequential Sentence Classification via Large Language Model

Authors:Mengfei Lan, Lecheng Zheng, Shufan Ming, Halil Kilicoglu

View PDF HTML (experimental)

Abstract:Sequential sentence classification (SSC) in scientific publications is crucial for supporting downstream tasks such as fine-grained information retrieval and extractive summarization. However, current SSC methods are constrained by model size, sequence length, and single-label setting. To address these limitations, this paper proposes LLM-SSC, a large language model (LLM)-based framework for both single- and multi-label SSC tasks. Unlike previous approaches that employ small- or medium-sized language models, the proposed framework utilizes LLMs to generate SSC labels through designed prompts, which enhance task understanding by incorporating demonstrations and a query to describe the prediction target. We also present a multi-label contrastive learning loss with auto-weighting scheme, enabling the multi-label classification task. To support our multi-label SSC analysis, we introduce and release a new dataset, biorc800, which mainly contains unstructured abstracts in the biomedical domain with manual annotations. Experiments demonstrate LLM-SSC's strong performance in SSC under both in-context learning and task-specific tuning settings. We release biorc800 and our code at: this https URL.

Comments:	Accepted by EMNLP 2024 Findings
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2411.15623 [cs.CL]
	(or arXiv:2411.15623v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2411.15623

Submission history

From: Lecheng Zheng [view email]
[v1] Sat, 23 Nov 2024 18:27:35 UTC (2,090 KB)
[v2] Fri, 29 Nov 2024 17:18:49 UTC (2,090 KB)

Computer Science > Computation and Language

Title:Multi-label Sequential Sentence Classification via Large Language Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multi-label Sequential Sentence Classification via Large Language Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators