BYOM: Building Your Own Multi-Task Model For Free

Jiang, Weisen; Lin, Baijiong; Shi, Han; Zhang, Yu; Li, Zhenguo; Kwok, James T.

Computer Science > Machine Learning

arXiv:2310.01886 (cs)

[Submitted on 3 Oct 2023 (v1), last revised 3 Feb 2024 (this version, v3)]

Title:BYOM: Building Your Own Multi-Task Model For Free

Authors:Weisen Jiang, Baijiong Lin, Han Shi, Yu Zhang, Zhenguo Li, James T. Kwok

View PDF

Abstract:Recently, various merging methods have been proposed to build a multi-task model from task-specific finetuned models without retraining. However, existing methods suffer from a large performance deterioration compared to using multiple task-specific models. In this paper, we propose to inject task-specific knowledge into the merged model and design two parameter-efficient approaches (BYOM-FFT and BYOM-LoRA) to Build Your Own Multi-task model. BYOM-FFT is for merging fully finetuned models, while BYOM-LoRA is for LoRA-finetuned models. Both methods are data-free and computation-efficient. Extensive experiments on computer vision and natural language processing tasks show that the proposed BYOM methods outperform existing merging methods by a large margin. Moreover, BYOM-FFT is general and can be integrated into existing merging methods to further boost performance.

Comments:	Technical Report
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2310.01886 [cs.LG]
	(or arXiv:2310.01886v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.01886

Submission history

From: Weisen Jiang [view email]
[v1] Tue, 3 Oct 2023 08:39:33 UTC (246 KB)
[v2] Wed, 4 Oct 2023 02:30:27 UTC (246 KB)
[v3] Sat, 3 Feb 2024 15:22:33 UTC (249 KB)

Computer Science > Machine Learning

Title:BYOM: Building Your Own Multi-Task Model For Free

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:BYOM: Building Your Own Multi-Task Model For Free

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators