DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

Kwon, Yongchan; Wu, Eric; Wu, Kevin; Zou, James

Computer Science > Machine Learning

arXiv:2310.00902 (cs)

[Submitted on 2 Oct 2023 (v1), last revised 13 Mar 2024 (this version, v3)]

Title:DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

Authors:Yongchan Kwon, Eric Wu, Kevin Wu, James Zou

View PDF HTML (experimental)

Abstract:Quantifying the impact of training data points is crucial for understanding the outputs of machine learning models and for improving the transparency of the AI pipeline. The influence function is a principled and popular data attribution method, but its computational cost often makes it challenging to use. This issue becomes more pronounced in the setting of large language models and text-to-image models. In this work, we propose DataInf, an efficient influence approximation method that is practical for large-scale generative AI models. Leveraging an easy-to-compute closed-form expression, DataInf outperforms existing influence computation algorithms in terms of computational and memory efficiency. Our theoretical analysis shows that DataInf is particularly well-suited for parameter-efficient fine-tuning techniques such as LoRA. Through systematic empirical evaluations, we show that DataInf accurately approximates influence scores and is orders of magnitude faster than existing methods. In applications to RoBERTa-large, Llama-2-13B-chat, and stable-diffusion-v1.5 models, DataInf effectively identifies the most influential fine-tuning examples better than other approximate influence scores. Moreover, it can help to identify which data points are mislabeled.

Comments:	ICLR 2024
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2310.00902 [cs.LG]
	(or arXiv:2310.00902v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.00902

Submission history

From: Yongchan Kwon [view email]
[v1] Mon, 2 Oct 2023 04:59:19 UTC (4,691 KB)
[v2] Fri, 23 Feb 2024 05:34:07 UTC (6,231 KB)
[v3] Wed, 13 Mar 2024 14:27:46 UTC (6,231 KB)

Computer Science > Machine Learning

Title:DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators