基于及时学习各种信息提取任务的统一生成框架

论文标题

基于及时学习各种信息提取任务的统一生成框架

A Unified Generative Framework based on Prompt Learning for Various Information Extraction Tasks

论文作者

Kan, Zhigang, Feng, Linhui, Yin, Zhangyue, Qiao, Linbo, Qiu, Xipeng, Li, Dongsheng

论文摘要

及时学习是一个有效的范式，它在训练前任务和相应的下游应用程序之间弥合了差距。基于此范式的方法在各种应用中取得了巨大的超越结果。但是，仍然需要回答如何根据及时的学习范式设计统一的框架，以实现各种信息提取任务。在本文中，我们提出了一种新型的基于及时的及时生成框架，可以应用于信息提取领域的各种任务。具体而言，我们将信息提取任务重新调整为预先设计的特定类型提示中填充插槽的形式，该提示由一个或多个子奖励组成。提出了一种构建合并提示的策略，以增强在数据筛选方案中提取事件的概括能力。此外，为了符合此框架，我们将关系提取转换为确定提示语义一致性的任务。实验结果表明，我们的方法超过了数据丰富和数据筛查方案中现实世界数据集上的基准。提出了对所提出的框架的进一步分析，以及进行数值实验，以研究绩效对各种任务的影响因素。

Prompt learning is an effective paradigm that bridges gaps between the pre-training tasks and the corresponding downstream applications. Approaches based on this paradigm have achieved great transcendent results in various applications. However, it still needs to be answered how to design a unified framework based on the prompt learning paradigm for various information extraction tasks. In this paper, we propose a novel composable prompt-based generative framework, which could be applied to a wide range of tasks in the field of Information Extraction. Specifically, we reformulate information extraction tasks into the form of filling slots in pre-designed type-specific prompts, which consist of one or multiple sub-prompts. A strategy of constructing composable prompts is proposed to enhance the generalization ability to extract events in data-scarce scenarios. Furthermore, to fit this framework, we transform Relation Extraction into the task of determining semantic consistency in prompts. The experimental results demonstrate that our approach surpasses compared baselines on real-world datasets in data-abundant and data-scarce scenarios. Further analysis of the proposed framework is presented, as well as numerical experiments conducted to investigate impact factors of performance on various tasks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题