通过上下文定义对准有效的零射击事件提取

论文标题

通过上下文定义对准有效的零射击事件提取

Efficient Zero-shot Event Extraction with Context-Definition Alignment

论文作者

Zhang, Hongming, Yao, Wenlin, Yu, Dong

论文摘要

事件提取（EE）是从文本中识别感兴趣的事件提及的任务。传统努力主要集中在监督环境上。但是，这些监督模型无法从预定的本体论中推广到事件类型。为了填补这一空白，许多努力已致力于零射EE问题。本文遵循建模事件类型语义的趋势，但进一步移动了一步。我们认为，使用事件类型名称的静态嵌入可能还不够，因为单个单词可能是模棱两可的，我们需要一个句子来准确定义类型语义。为了建模定义语义，我们使用两个独立的变压器模型将上下文化事件提及和相应的定义投射到相同的嵌入空间中，然后通过对比度学习最小化其嵌入距离。最重要的是，我们还提出了一个变暖阶段，以帮助模型学习相似定义之间的较小差异。我们将方法命名为“零射击事件提取”（ZED）。 MAVEN数据集的实验表明，由于不相交设计，我们的模型以快速推理速度的所有先前所有零击EE方法都显着胜过。进一步的实验还表明，ZED可以轻松地应用于注释时，并持续优于基线监督方法。

Event extraction (EE) is the task of identifying interested event mentions from text. Conventional efforts mainly focus on the supervised setting. However, these supervised models cannot generalize to event types out of the pre-defined ontology. To fill this gap, many efforts have been devoted to the zero-shot EE problem. This paper follows the trend of modeling event-type semantics but moves one step further. We argue that using the static embedding of the event type name might not be enough because a single word could be ambiguous, and we need a sentence to define the type semantics accurately. To model the definition semantics, we use two separate transformer models to project the contextualized event mentions and corresponding definitions into the same embedding space and then minimize their embedding distance via contrastive learning. On top of that, we also propose a warming phase to help the model learn the minor difference between similar definitions. We name our approach Zero-shot Event extraction with Definition (ZED). Experiments on the MAVEN dataset show that our model significantly outperforms all previous zero-shot EE methods with fast inference speed due to the disjoint design. Further experiments also show that ZED can be easily applied to the few-shot setting when the annotation is available and consistently outperforms baseline supervised methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题