DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance

Cohen, Seffi; Goldshlager, Niv; Cohen-Inger, Nurit; Shapira, Bracha; Rokach, Lior

Computer Science > Machine Learning

arXiv:2501.17479 (cs)

[Submitted on 29 Jan 2025 (v1), last revised 6 Feb 2025 (this version, v2)]

Title:DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance

Authors:Seffi Cohen, Niv Goldshlager, Nurit Cohen-Inger, Bracha Shapira, Lior Rokach

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have shown remarkable capabilities across various natural language processing tasks but often struggle to excel uniformly in diverse or complex domains. We propose a novel ensemble method - Diverse Fingerprint Ensemble (DFPE), which leverages the complementary strengths of multiple LLMs to achieve more robust performance. Our approach involves: (1) clustering models based on response "fingerprints" patterns, (2) applying a quantile-based filtering mechanism to remove underperforming models at a per-subject level, and (3) assigning adaptive weights to remaining models based on their subject-wise validation accuracy. In experiments on the Massive Multitask Language Understanding (MMLU) benchmark, DFPE outperforms the best single model by 3% overall accuracy and 5% in discipline-level accuracy. This method increases the robustness and generalization of LLMs and underscores how model selection, diversity preservation, and performance-driven weighting can effectively address challenging, multi-faceted language understanding tasks.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2501.17479 [cs.LG]
	(or arXiv:2501.17479v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.17479

Submission history

From: Seffi Cohen [view email]
[v1] Wed, 29 Jan 2025 08:44:45 UTC (405 KB)
[v2] Thu, 6 Feb 2025 21:47:55 UTC (405 KB)

Computer Science > Machine Learning

Title:DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators