作物：用多语言标记序列翻译的零射击跨语义命名实体识别

论文标题

作物：用多语言标记序列翻译的零射击跨语义命名实体识别

CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation

论文作者

Yang, Jian, Huang, Shaohan, Ma, Shuming, Yin, Yuwei, Dong, Li, Zhang, Dongdong, Guo, Hongcheng, Li, Zhoujun, Wei, Furu

论文摘要

指定的实体识别（NER）遭受了带注释的培训数据的稀缺性，尤其是对于没有标记数据的低资源语言。已经提出了跨语性的NER，以通过对齐的跨语言表示或机器翻译结果将知识从高资源语言转移到低资源语言来减轻此问题。但是，跨语言NER方法的性能受到翻译或标签投影质量不令人满意的严重影响。为了解决这些问题，我们提出了一个跨语性实体投影框架（农作物），以借助多语言标记的序列翻译模型，使零射击交叉语言ner。具体而言，目标序列首先转换为源语言，然后由源NER模型标记。我们进一步采用标记的序列翻译模型将标记的序列投射回目标语言并标记目标原始句子。最终，通过自我训练的方式将整个管道整合到端到端模型中。两个基准的实验结果表明，我们的方法基本上优于先前的强基线，其大幅度 +3〜7 F1得分并实现了最先进的性能。

Named entity recognition (NER) suffers from the scarcity of annotated training data, especially for low-resource languages without labeled data. Cross-lingual NER has been proposed to alleviate this issue by transferring knowledge from high-resource languages to low-resource languages via aligned cross-lingual representations or machine translation results. However, the performance of cross-lingual NER methods is severely affected by the unsatisfactory quality of translation or label projection. To address these problems, we propose a Cross-lingual Entity Projection framework (CROP) to enable zero-shot cross-lingual NER with the help of a multilingual labeled sequence translation model. Specifically, the target sequence is first translated into the source language and then tagged by a source NER model. We further adopt a labeled sequence translation model to project the tagged sequence back to the target language and label the target raw sentence. Ultimately, the whole pipeline is integrated into an end-to-end model by the way of self-training. Experimental results on two benchmarks demonstrate that our method substantially outperforms the previous strong baseline by a large margin of +3~7 F1 scores and achieves state-of-the-art performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题