Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task

Wan, Yu; Bao, Keqin; Liu, Dayiheng; Yang, Baosong; Wong, Derek F.; Chao, Lidia S.; Lei, Wenqiang; Xie, Jun

Computer Science > Computation and Language

arXiv:2210.09683 (cs)

[Submitted on 18 Oct 2022 (v1), last revised 17 Feb 2023 (this version, v2)]

Title:Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task

Authors:Yu Wan, Keqin Bao, Dayiheng Liu, Baosong Yang, Derek F. Wong, Lidia S. Chao, Wenqiang Lei, Jun Xie

View PDF

Abstract:In this report, we present our submission to the WMT 2022 Metrics Shared Task. We build our system based on the core idea of UNITE (Unified Translation Evaluation), which unifies source-only, reference-only, and source-reference-combined evaluation scenarios into one single model. Specifically, during the model pre-training phase, we first apply the pseudo-labeled data examples to continuously pre-train UNITE. Notably, to reduce the gap between pre-training and fine-tuning, we use data cropping and a ranking-based score normalization strategy. During the fine-tuning phase, we use both Direct Assessment (DA) and Multidimensional Quality Metrics (MQM) data from past years' WMT competitions. Specially, we collect the results from models with different pre-trained language model backbones, and use different ensembling strategies for involved translation directions.

Comments:	WMT 2022 Metrics Shared Task
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.09683 [cs.CL]
	(or arXiv:2210.09683v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.09683

Submission history

From: Yu Wan [view email]
[v1] Tue, 18 Oct 2022 08:51:25 UTC (46 KB)
[v2] Fri, 17 Feb 2023 15:56:56 UTC (93 KB)

Computer Science > Computation and Language

Title:Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators