Skip to main content

Showing 1–2 of 2 results for author: Minut, A R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.11427  [pdf, other

    cs.LG cs.AI cs.NE

    Mergenetic: a Simple Evolutionary Model Merging Library

    Authors: Adrian Robert Minut, Tommaso Mencattini, Andrea Santilli, Donato Crisostomi, Emanuele RodolĂ 

    Abstract: Model merging allows combining the capabilities of existing models into a new one - post hoc, without additional training. This has made it increasingly popular thanks to its low cost and the availability of libraries that support merging on consumer GPUs. Recent work shows that pairing merging with evolutionary algorithms can boost performance, but no framework currently supports flexible experim… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Link: https://github.com/tommasomncttn/mergenetic

  2. arXiv:2502.10436  [pdf, other

    cs.NE cs.AI cs.LG

    MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs

    Authors: Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi, Andrea Santilli, Emanuele RodolĂ 

    Abstract: Evolutionary model merging enables the creation of high-performing multi-task models but remains computationally prohibitive for consumer hardware. We introduce MERGE$^3$, an efficient framework that makes evolutionary merging feasible on a single GPU by reducing fitness computation costs 50$\times$ while preserving performance. MERGE$^3$ achieves this by Extracting a reduced dataset for evaluatio… ▽ More

    Submitted 9 May, 2025; v1 submitted 9 February, 2025; originally announced February 2025.

    Comments: In Proceedings of The Forty-Second International Conference on Machine Learning (ICML 2025)