Mind the Gap: Conformative Decoding to Improve Output Diversity of Instruction-Tuned Large Language Models

Peeperkorn, Max; Kouwenhoven, Tom; Brown, Dan; Jordanous, Anna

Computer Science > Computation and Language

arXiv:2507.20956 (cs)

[Submitted on 28 Jul 2025]

Title:Mind the Gap: Conformative Decoding to Improve Output Diversity of Instruction-Tuned Large Language Models

Authors:Max Peeperkorn, Tom Kouwenhoven, Dan Brown, Anna Jordanous

View PDF HTML (experimental)

Abstract:Instruction-tuning large language models (LLMs) reduces the diversity of their outputs, which has implications for many tasks, particularly for creative tasks. This paper investigates the ``diversity gap'' for a writing prompt narrative generation task. This gap emerges as measured by current diversity metrics for various open-weight and open-source LLMs. The results show significant decreases in diversity due to instruction-tuning. We explore the diversity loss at each fine-tuning stage for the OLMo and OLMo 2 models to further understand how output diversity is affected. The results indicate that DPO has the most substantial impact on diversity. Motivated by these findings, we present a new decoding strategy, conformative decoding, which guides an instruct model using its more diverse base model to reintroduce output diversity. We show that conformative decoding typically increases diversity and even maintains or improves quality.

Comments:	9 pages, 3 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2507.20956 [cs.CL]
	(or arXiv:2507.20956v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2507.20956

Submission history

From: Max Peeperkorn [view email]
[v1] Mon, 28 Jul 2025 16:04:25 UTC (584 KB)

Computer Science > Computation and Language

Title:Mind the Gap: Conformative Decoding to Improve Output Diversity of Instruction-Tuned Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Mind the Gap: Conformative Decoding to Improve Output Diversity of Instruction-Tuned Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators