论文标题
序列标签的不确定性吸引标签的细化
Uncertainty-Aware Label Refinement for Sequence Labeling
论文作者
论文摘要
标签解码的条件随机字段(CRF)在序列标记任务中已无处不在。但是,局部标签依赖性和效率低下的Viterbi解码一直是要解决的问题。在这项工作中,我们介绍了一种新型的两阶段标签解码框架,以建模长期标签依赖性,同时更加有效。基本模型首先预测标签,然后基于远程标签依赖项对这些草稿预测进行了改进,这可以实现并行解码以进行更快的预测。此外,为了减轻不正确的草稿标签的副作用,贝叶斯神经网络用于指示具有很高错误可能性的标签,这可以极大地有助于防止错误传播。三个序列标记基准的实验结果表明,该提出的方法不仅表现优于基于CRF的方法,而且大大加速了推理过程。
Conditional random fields (CRF) for label decoding has become ubiquitous in sequence labeling tasks. However, the local label dependencies and inefficient Viterbi decoding have always been a problem to be solved. In this work, we introduce a novel two-stage label decoding framework to model long-term label dependencies, while being much more computationally efficient. A base model first predicts draft labels, and then a novel two-stream self-attention model makes refinements on these draft predictions based on long-range label dependencies, which can achieve parallel decoding for a faster prediction. In addition, in order to mitigate the side effects of incorrect draft labels, Bayesian neural networks are used to indicate the labels with a high probability of being wrong, which can greatly assist in preventing error propagation. The experimental results on three sequence labeling benchmarks demonstrated that the proposed method not only outperformed the CRF-based methods but also greatly accelerated the inference process.