细粒度的对比度学习以提取关系

论文标题

细粒度的对比度学习以提取关系

Fine-grained Contrastive Learning for Relation Extraction

论文作者

Hogan, William, Li, Jiacheng, Shang, Jingbo

论文摘要

最近的关系提取（RE）工作表明，通过对远处监督产生的银标签进行对比度学习，然后在金标签上进行微调，从而令人鼓舞。现有方法通常假定所有这些银标签都是准确的，并平等地对待它们。但是，遥远的监督不可避免地嘈杂 - 有些银牌比其他标签更可靠。在本文中，我们建议对RE提出细粒度的对比度学习（FineCl），该学习利用了哪些银标签是哪些银标的，也不是嘈杂的信息来提高RE学习的关系表示的质量。我们首先通过一种简单而自动的方法评估银牌的质量，我们称为“学习顺序DeNoising”，在这里我们训练一种语言模型来学习这些关系并记录学习培训实例的顺序。我们表明，学习顺序在很大程度上对应于标签精度 - 早期学习的银标签平均比后来学习的银标签具有更准确的标签。然后，在预训练期间，我们在新颖的对比度学习目标中增加了准确标签的权重。几个基准测试的实验表明，FineCl对最先进的方法具有一致且显着的性能增长。

Recent relation extraction (RE) works have shown encouraging improvements by conducting contrastive learning on silver labels generated by distant supervision before fine-tuning on gold labels. Existing methods typically assume all these silver labels are accurate and treat them equally; however, distant supervision is inevitably noisy -- some silver labels are more reliable than others. In this paper, we propose fine-grained contrastive learning (FineCL) for RE, which leverages fine-grained information about which silver labels are and are not noisy to improve the quality of learned relationship representations for RE. We first assess the quality of silver labels via a simple and automatic approach we call "learning order denoising," where we train a language model to learn these relations and record the order of learned training instances. We show that learning order largely corresponds to label accuracy -- early-learned silver labels have, on average, more accurate labels than later-learned silver labels. Then, during pre-training, we increase the weights of accurate labels within a novel contrastive learning objective. Experiments on several RE benchmarks show that FineCL makes consistent and significant performance gains over state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题