Skip to main content

Showing 1–7 of 7 results for author: Exel, M

.
  1. arXiv:2502.08561  [pdf, ps, other

    cs.CL

    Quality-Aware Decoding: Unifying Quality Estimation and Decoding

    Authors: Sai Koneru, Matthias Huck, Miriam Exel, Jan Niehues

    Abstract: Quality Estimation (QE) models for Neural Machine Translation (NMT) predict the quality of the hypothesis without having access to the reference. An emerging research direction in NMT involves the use of QE models, which have demonstrated high correlations with human judgment and can enhance translations through Quality-Aware Decoding. Although several approaches have been proposed based on sampli… ▽ More

    Submitted 1 June, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

    Comments: IWSLT 2025

  2. arXiv:2410.02320  [pdf, other

    cs.CL cs.AI cs.LG

    Post-edits Are Preferences Too

    Authors: Nathaniel Berger, Miriam Exel, Matthias Huck, Stefan Riezler

    Abstract: Preference Optimization (PO) techniques are currently one of the state of the art techniques for fine-tuning large language models (LLMs) on pairwise preference feedback from human annotators. However, in machine translation, this sort of feedback can be difficult to solicit. Additionally, Kreutzer et al. (2018) have shown that, for machine translation, pairwise preferences are less reliable than… ▽ More

    Submitted 21 February, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: To appear at the Ninth Conference on Machine Translation (WMT24)

  3. arXiv:2408.11327  [pdf, other

    cs.CL cs.AI

    Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking Across Diverse Vocabularies

    Authors: Sai Koneru, Matthias Huck, Miriam Exel, Jan Niehues

    Abstract: Recent advancements in NLP have resulted in models with specialized strengths, such as processing multimodal inputs or excelling in specific domains. However, real-world tasks, like multimodal translation, often require a combination of these strengths, such as handling both translation and image processing. While individual translation and vision models are powerful, they typically lack the abili… ▽ More

    Submitted 4 November, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: WMT 2024

  4. arXiv:2406.02267  [pdf, ps, other

    cs.CL

    Prompting Large Language Models with Human Error Markings for Self-Correcting Machine Translation

    Authors: Nathaniel Berger, Stefan Riezler, Miriam Exel, Matthias Huck

    Abstract: While large language models (LLMs) pre-trained on massive amounts of unpaired language data have reached the state-of-the-art in machine translation (MT) of general domain texts, post-editing (PE) is still required to correct errors and to enhance term translation quality in specialized domains. In this paper we present a pilot study of enhancing translation memories (TM) produced by PE (source se… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: To appear at The 25th Annual Conference of the European Association for Machine Translation (EAMT 2024)

  5. arXiv:2310.14855  [pdf, other

    cs.CL cs.AI

    Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing

    Authors: Sai Koneru, Miriam Exel, Matthias Huck, Jan Niehues

    Abstract: Large Language Models (LLM's) have demonstrated considerable success in various Natural Language Processing tasks, but they have yet to attain state-of-the-art performance in Neural Machine Translation (NMT). Nevertheless, their significant performance in tasks demanding a broad understanding and contextual processing shows their potential for translation. To exploit these abilities, we investigat… ▽ More

    Submitted 18 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: NAACL 2024

  6. arXiv:2307.08416  [pdf, other

    cs.CL

    Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training

    Authors: Nathaniel Berger, Miriam Exel, Matthias Huck, Stefan Riezler

    Abstract: Supervised learning in Neural Machine Translation (NMT) typically follows a teacher forcing paradigm where reference tokens constitute the conditioning context in the model's prediction, instead of its own previous predictions. In order to alleviate this lack of exploration in the space of translations, we present a simple extension of standard maximum likelihood estimation by a contrastive markin… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: Proceedings of the 24th Annual Conference of the European Association for Machine Translation, p. 69-78 Tampere, Finland, June 2023

  7. arXiv:2008.04550  [pdf, other

    cs.CL

    A Parallel Evaluation Data Set of Software Documentation with Document Structure Annotation

    Authors: Bianka Buschbeck, Miriam Exel

    Abstract: This paper accompanies the software documentation data set for machine translation, a parallel evaluation data set of data originating from the SAP Help Portal, that we released to the machine translation community for research purposes. It offers the possibility to tune and evaluate machine translation systems in the domain of corporate software documentation and contributes to the availability o… ▽ More

    Submitted 12 November, 2020; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: Accepted for publication at WAT 2020; update to camera-ready version