Skip to main content

Showing 1–1 of 1 results for author: Deep, P T

.
  1. arXiv:2406.11617  [pdf, other

    cs.CL

    DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling

    Authors: Pala Tej Deep, Rishabh Bhardwaj, Soujanya Poria

    Abstract: With the proliferation of domain-specific models, model merging has emerged as a set of techniques that combine the capabilities of multiple models into one that can multitask without the cost of additional training. In this paper, we propose a new model merging technique, Drop and rEscaLe via sampLing with mAgnitude (DELLA-Merging), that employs a novel pruning technique, MAGPRUNE, which shows si… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.