WMT22通用机器翻译任务的自动评估

论文标题

WMT22通用机器翻译任务的自动评估

An Automatic Evaluation of the WMT22 General Machine Translation Task

论文作者

Marie, Benjamin

论文摘要

本报告介绍了第七次机器翻译会议（WMT22）的通用机器翻译任务的自动评估。它总共评估了21个翻译方向的185个系统，包括高资源至低资源语言对以及与遥远语言密切相关的系统。这种大规模的自动评估突出了最新机器翻译系统的一些当前限制。它还显示了自动指标，即CHRF，BLEU和COMET，如何在解释性和准确性方面进行补充以减轻自己的限制。

This report presents an automatic evaluation of the general machine translation task of the Seventh Conference on Machine Translation (WMT22). It evaluates a total of 185 systems for 21 translation directions including high-resource to low-resource language pairs and from closely related to distant languages. This large-scale automatic evaluation highlights some of the current limits of state-of-the-art machine translation systems. It also shows how automatic metrics, namely chrF, BLEU, and COMET, can complement themselves to mitigate their own limits in terms of interpretability and accuracy.

下载PDF全文

下载文献需遵守相关版权规定

论文标题