论文标题
WMT22通用机器翻译任务的自动评估
An Automatic Evaluation of the WMT22 General Machine Translation Task
论文作者
论文摘要
本报告介绍了第七次机器翻译会议(WMT22)的通用机器翻译任务的自动评估。它总共评估了21个翻译方向的185个系统,包括高资源至低资源语言对以及与遥远语言密切相关的系统。这种大规模的自动评估突出了最新机器翻译系统的一些当前限制。它还显示了自动指标,即CHRF,BLEU和COMET,如何在解释性和准确性方面进行补充以减轻自己的限制。
This report presents an automatic evaluation of the general machine translation task of the Seventh Conference on Machine Translation (WMT22). It evaluates a total of 185 systems for 21 translation directions including high-resource to low-resource language pairs and from closely related to distant languages. This large-scale automatic evaluation highlights some of the current limits of state-of-the-art machine translation systems. It also shows how automatic metrics, namely chrF, BLEU, and COMET, can complement themselves to mitigate their own limits in terms of interpretability and accuracy.