Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks

Taylor, Niall; Ghose, Upamanyu; Rohanian, Omid; Nouriborji, Mohammadmahdi; Kormilitzin, Andrey; Clifton, David; Nevado-Holgado, Alejo

Computer Science > Computation and Language

arXiv:2402.10597 (cs)

[Submitted on 16 Feb 2024]

Title:Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks

Authors:Niall Taylor, Upamanyu Ghose, Omid Rohanian, Mohammadmahdi Nouriborji, Andrey Kormilitzin, David Clifton, Alejo Nevado-Holgado

View PDF

Abstract:The entry of large language models (LLMs) into research and commercial spaces has led to a trend of ever-larger models, with initial promises of generalisability, followed by a widespread desire to downsize and create specialised models without the need for complete fine-tuning, using Parameter Efficient Fine-tuning (PEFT) methods. We present an investigation into the suitability of different PEFT methods to clinical decision-making tasks, across a range of model sizes, including extremely small models with as few as $25$ million parameters.
Our analysis shows that the performance of most PEFT approaches varies significantly from one task to another, with the exception of LoRA, which maintains relatively high performance across all model sizes and tasks, typically approaching or matching full fine-tuned performance. The effectiveness of PEFT methods in the clinical domain is evident, particularly for specialised models which can operate on low-cost, in-house computing infrastructure. The advantages of these models, in terms of speed and reduced training costs, dramatically outweighs any performance gain from large foundation LLMs. Furthermore, we highlight how domain-specific pre-training interacts with PEFT methods and model size, and discuss how these factors interplay to provide the best efficiency-performance trade-off. Full code available at: tbd.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2402.10597 [cs.CL]
	(or arXiv:2402.10597v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.10597

Submission history

From: Niall Taylor [view email]
[v1] Fri, 16 Feb 2024 11:30:11 UTC (3,051 KB)

Computer Science > Computation and Language

Title:Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators