Low-rank finetuning for LLMs: A fairness perspective

Das, Saswat; Romanelli, Marco; Tran, Cuong; Reza, Zarreen; Kailkhura, Bhavya; Fioretto, Ferdinando

Computer Science > Machine Learning

arXiv:2405.18572 (cs)

[Submitted on 28 May 2024]

Title:Low-rank finetuning for LLMs: A fairness perspective

Authors:Saswat Das, Marco Romanelli, Cuong Tran, Zarreen Reza, Bhavya Kailkhura, Ferdinando Fioretto

View PDF HTML (experimental)

Abstract:Low-rank approximation techniques have become the de facto standard for fine-tuning Large Language Models (LLMs) due to their reduced computational and memory requirements. This paper investigates the effectiveness of these methods in capturing the shift of fine-tuning datasets from the initial pre-trained data distribution. Our findings reveal that there are cases in which low-rank fine-tuning falls short in learning such shifts. This, in turn, produces non-negligible side effects, especially when fine-tuning is adopted for toxicity mitigation in pre-trained models, or in scenarios where it is important to provide fair models. Through comprehensive empirical evidence on several models, datasets, and tasks, we show that low-rank fine-tuning inadvertently preserves undesirable biases and toxic behaviors. We also show that this extends to sequential decision-making tasks, emphasizing the need for careful evaluation to promote responsible LLMs development.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2405.18572 [cs.LG]
	(or arXiv:2405.18572v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.18572

Submission history

From: Saswat Das [view email]
[v1] Tue, 28 May 2024 20:43:53 UTC (4,464 KB)

Computer Science > Machine Learning

Title:Low-rank finetuning for LLMs: A fairness perspective

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Low-rank finetuning for LLMs: A fairness perspective

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators