论文标题
从医学文本中嵌套命名的实体识别:具有细心CRF的自适应共享网络体系结构
Nested Named Entity Recognition from Medical Texts: An Adaptive Shared Network Architecture with Attentive CRF
论文作者
论文摘要
识别有用的命名实体在医疗信息处理中起着至关重要的作用,这有助于推动医疗领域研究的发展。深度学习方法在医学名称实体识别(NER)中取得了良好的成果。但是,我们发现现有方法在处理嵌套命名实体时面临巨大的挑战。在这项工作中,我们提出了一种称为ASAC的新方法,以解决由嵌套现象引起的困境,在该难题中,核心思想是对实体识别的不同类别识别之间的依赖性进行建模。提出的方法包含两个关键模块:自适应共享(AS)部分和细心的条件随机场(ACRF)模块。前一部分会自动在每个任务上分配自适应权重,以在多层网络中实现最佳识别精度。后一个模块采用注意操作来建模不同实体之间的依赖关系。这样,我们的模型可以通过捕获不同类别的实体之间的隐性区别和关系来学习更好的实体表示。公共数据集上的大量实验验证了我们方法的有效性。此外,我们还进行消融分析以深入了解我们的方法。
Recognizing useful named entities plays a vital role in medical information processing, which helps drive the development of medical area research. Deep learning methods have achieved good results in medical named entity recognition (NER). However, we find that existing methods face great challenges when dealing with the nested named entities. In this work, we propose a novel method, referred to as ASAC, to solve the dilemma caused by the nested phenomenon, in which the core idea is to model the dependency between different categories of entity recognition. The proposed method contains two key modules: the adaptive shared (AS) part and the attentive conditional random field (ACRF) module. The former part automatically assigns adaptive weights across each task to achieve optimal recognition accuracy in the multi-layer network. The latter module employs the attention operation to model the dependency between different entities. In this way, our model could learn better entity representations by capturing the implicit distinctions and relationships between different categories of entities. Extensive experiments on public datasets verify the effectiveness of our method. Besides, we also perform ablation analyses to deeply understand our methods.