A Teacher Is Worth A Million Instructions

Kothari, Nikhil; Nayak, Ravindra; Shetty, Shreyas; Patil, Amey; Garera, Nikesh

Computer Science > Machine Learning

arXiv:2406.19112 (cs)

[Submitted on 27 Jun 2024]

Title:A Teacher Is Worth A Million Instructions

Authors:Nikhil Kothari, Ravindra Nayak, Shreyas Shetty, Amey Patil, Nikesh Garera

View PDF HTML (experimental)

Abstract:Large Language Models(LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent limitations in training methods create substantial difficulties to train relatively smaller models with 7B and 13B parameters. In our research, we suggest an improved training method for these models by utilising knowledge from larger models, such as a mixture of experts (8x7B) architectures. The scale of these larger models allows them to capture a wide range of variations from data alone, making them effective teachers for smaller models. Moreover, we implement a novel post-training domain alignment phase that employs domain-specific expert models to boost domain-specific knowledge during training while preserving the model's ability to generalise. Fine-tuning Mistral 7B and 2x7B with our method surpasses the performance of state-of-the-art language models with more than 7B and 13B parameters: achieving up to $7.9$ in MT-Bench and $93.04\%$ on AlpacaEval.

Comments:	7 pages, 4 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2406.19112 [cs.LG]
	(or arXiv:2406.19112v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.19112

Submission history

From: Nikhil Kothari [view email]
[v1] Thu, 27 Jun 2024 11:48:25 UTC (1,392 KB)

Computer Science > Machine Learning

Title:A Teacher Is Worth A Million Instructions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Teacher Is Worth A Million Instructions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators