论文标题

联合中国单词分割和基于跨度的选区解析

Joint Chinese Word Segmentation and Span-based Constituency Parsing

论文作者

Wang, Zhicheng, Shi, Tianyu, Liu, Cong

论文摘要

在选区解析中,基于跨度的解码是一个重要方向。但是,对于中国句子,由于其语言特征,有必要首先使用其他模型来执行单词分割,这引入了一系列不确定性,并且通常会导致选区树的计算中的错误。这项工作提出了一种通过在解析树上的单个汉字中添加额外的标签,用于中文单词分割和基于跨度的选区解析方法。通过实验,提出的算法的表现优于CTB 5.1上的联合分割和选区解析的最新模型。

In constituency parsing, span-based decoding is an important direction. However, for Chinese sentences, because of their linguistic characteristics, it is necessary to utilize other models to perform word segmentation first, which introduces a series of uncertainties and generally leads to errors in the computation of the constituency tree afterward. This work proposes a method for joint Chinese word segmentation and Span-based Constituency Parsing by adding extra labels to individual Chinese characters on the parse trees. Through experiments, the proposed algorithm outperforms the recent models for joint segmentation and constituency parsing on CTB 5.1.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源