论文标题

翻译单词级自动完成:我们可以开箱即用什么?

Translation Word-Level Auto-Completion: What can we achieve out of the box?

论文作者

Moslem, Yasmin, Haque, Rejwanul, Way, Andy

论文摘要

机器翻译(MT)的研究在多个领域取得了重要的突破。尽管为了建立这一成功,还有很多事情要做,但我们认为语言行业需要更好的方法来充分利用当前成就。由于包括时间,资源和技能在内的因素组合,企业倾向于将实用主义应用于其AI工作流程。因此,它们更多地集中在结果上,例如在可能的情况下,交付,运输,发布和功能,并采用高级工作生产解决方案。在被认为对翻译人员有用的功能中,有句子级别和单词级翻译自动探索和自动完成。提出替代方案可以激发翻译人员,并限制他们参考外部资源的需求,这希望提高其生产力。这项工作描述了我们对WMT关于单词级自动完成的共同任务的意见,该任务是对中文到英语,英语对英语,德语到英语和英语对德语的指示的提交。我们研究了使用可用库的预训练模型和开箱即用的功能的可能性。我们采用随机抽样来生成各种替代方案,从而揭示了良好的结果。此外,我们基于Ctranslate2介绍了我们的开源API,以提供翻译,自动烟和自动完成。

Research on Machine Translation (MT) has achieved important breakthroughs in several areas. While there is much more to be done in order to build on this success, we believe that the language industry needs better ways to take full advantage of current achievements. Due to a combination of factors, including time, resources, and skills, businesses tend to apply pragmatism into their AI workflows. Hence, they concentrate more on outcomes, e.g. delivery, shipping, releases, and features, and adopt high-level working production solutions, where possible. Among the features thought to be helpful for translators are sentence-level and word-level translation auto-suggestion and auto-completion. Suggesting alternatives can inspire translators and limit their need to refer to external resources, which hopefully boosts their productivity. This work describes our submissions to WMT's shared task on word-level auto-completion, for the Chinese-to-English, English-to-Chinese, German-to-English, and English-to-German language directions. We investigate the possibility of using pre-trained models and out-of-the-box features from available libraries. We employ random sampling to generate diverse alternatives, which reveals good results. Furthermore, we introduce our open-source API, based on CTranslate2, to serve translations, auto-suggestions, and auto-completions.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源