CDSM：在文本图上级联的深层语义匹配，利用临时邻居选择

论文标题

CDSM：在文本图上级联的深层语义匹配，利用临时邻居选择

CDSM: Cascaded Deep Semantic Matching on Textual Graphs Leveraging Ad-hoc Neighbor Selection

论文作者

Yao, Jing, Liu, Zheng, Yang, Junhan, Dou, Zhicheng, Xie, Xing, Wen, Ji-Rong

论文摘要

深层语义匹配旨在区分基于深神网络的文档之间的关系。近年来，用图结构组织文档，然后利用内在文档特征和外部邻居特征来获得歧视变得越来越流行。现有的大多数作品主要关心如何利用介绍的邻居，而有限的努力是为了过滤适当的邻居。我们认为，邻居功能可能很嘈杂，并且部分有用。因此，缺乏有效的邻居选择不仅会产生大量不必要的计算成本，而且会严重限制匹配的准确性。在这项工作中，我们提出了一个新颖的框架，级联的深层语义匹配（CDSM），以在文本图上进行准确有效的语义匹配。 CDSM以其两阶段的工作流程而突出显示。在第一阶段，将部署一个基于CNN的轻巧的AD-HOD邻居选择器，以通过小型计算成本过滤有用的邻居来进行匹配任务。我们设计一步和多步选择方法。在第二阶段，采用了基于图形的高能力匹配网络来根据良好的邻居计算细粒度的相关性分数。值得注意的是，CDSM是一个通用框架，可容纳大多数基于图形的主流语义匹配网络。主要的挑战是选择者如何学会区分没有明确标签的邻居有用性。为了解决这个问题，我们设计了一种弱监视策略以进行优化，在该策略中，我们首先在基于图的匹配网络训练基于图的匹配网络，然后在匹配网络的注释之上学习了临时邻居选择器。

Deep semantic matching aims to discriminate the relationship between documents based on deep neural networks. In recent years, it becomes increasingly popular to organize documents with a graph structure, then leverage both the intrinsic document features and the extrinsic neighbor features to derive discrimination. Most of the existing works mainly care about how to utilize the presented neighbors, whereas limited effort is made to filter appropriate neighbors. We argue that the neighbor features could be highly noisy and partially useful. Thus, a lack of effective neighbor selection will not only incur a great deal of unnecessary computation cost, but also restrict the matching accuracy severely. In this work, we propose a novel framework, Cascaded Deep Semantic Matching (CDSM), for accurate and efficient semantic matching on textual graphs. CDSM is highlighted for its two-stage workflow. In the first stage, a lightweight CNN-based ad-hod neighbor selector is deployed to filter useful neighbors for the matching task with a small computation cost. We design both one-step and multi-step selection methods. In the second stage, a high-capacity graph-based matching network is employed to compute fine-grained relevance scores based on the well-selected neighbors. It is worth noting that CDSM is a generic framework which accommodates most of the mainstream graph-based semantic matching networks. The major challenge is how the selector can learn to discriminate the neighbors usefulness which has no explicit labels. To cope with this problem, we design a weak-supervision strategy for optimization, where we train the graph-based matching network at first and then the ad-hoc neighbor selector is learned on top of the annotations from the matching network.

下载PDF全文

下载文献需遵守相关版权规定

论文标题