Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Wassie, Aman Kassahun; Molaei, Mahdi; Moslem, Yasmin

Computer Science > Computation and Language

arXiv:2412.05862 (cs)

[Submitted on 8 Dec 2024 (v1), last revised 30 May 2025 (this version, v4)]

Title:Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Authors:Aman Kassahun Wassie, Mahdi Molaei, Yasmin Moslem

View PDF HTML (experimental)

Abstract:In this work, we compare the domain-specific translation performance of open-source autoregressive decoder-only large language models (LLMs) with task-oriented machine translation (MT) models. Our experiments focus on the medical domain and cover four language directions with varied resource availability: English-to-French, English-to-Portuguese, English-to-Swahili, and Swahili-to-English. Despite recent advancements, LLMs demonstrate a significant quality gap in specialized translation compared to multilingual encoder-decoder MT models such as NLLB-200. Our results indicate that NLLB-200 3.3B outperforms all evaluated LLMs in the 7-8B parameter range across three out of the four language directions. While fine-tuning improves the performance of LLMs such as Mistral and Llama, these models still underperform compared to fine-tuned NLLB-200 3.3B models. Our findings highlight the ongoing need for specialized MT models to achieve high-quality domain-specific translation, especially in medium-resource and low-resource settings. Moreover, the superior performance of larger LLMs over their 8B variants suggests potential value in pre-training domain-specific medium-sized language models, employing targeted data selection and knowledge distillation approaches to enhance both quality and efficiency in specialized translation tasks.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.05862 [cs.CL]
	(or arXiv:2412.05862v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.05862

Submission history

From: Yasmin Moslem [view email]
[v1] Sun, 8 Dec 2024 08:54:13 UTC (283 KB)
[v2] Tue, 25 Feb 2025 18:59:04 UTC (301 KB)
[v3] Thu, 1 May 2025 07:36:13 UTC (297 KB)
[v4] Fri, 30 May 2025 22:50:03 UTC (303 KB)

Computer Science > Computation and Language

Title:Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators