论文标题
知识图简单的问题回答看不见的域
Knowledge Graph Simple Question Answering for Unseen Domains
论文作者
论文摘要
知识图以其标准形式的知识简单问答(KGSQA)并未考虑到人类策划的问题回答培训数据仅涵盖知识图(kg)中存在的一小部分关系,甚至更糟糕的是,涵盖了看不见的新领域,与现有领域关系相同的新领域被添加到kg中。在这项工作中,我们在以前未研究的环境中研究KGSQA,在测试时间内添加了新的,看不见的域。在这种情况下,在培训期间没有出现问答的新领域,因此使任务更具挑战性。我们提出了一个以数据为中心的域自适应框架,该框架由适用于新域的KGSQA系统组成,以及一个序列问题生成方法的序列,该方法会自动为新域生成问答对。由于问题生成的有效性可以受到生成的问题的有限词汇变化的限制,因此我们使用远处的监督来提取一组关键字,这些关键字表达了看不见的域的每个关系,并将这些关键词纳入问题生成方法中。实验结果表明,我们的框架在零发基线上显着改善,并且在整个域之间都有坚固的框架。
Knowledge graph simple question answering (KGSQA), in its standard form, does not take into account that human-curated question answering training data only cover a small subset of the relations that exist in a Knowledge Graph (KG), or even worse, that new domains covering unseen and rather different to existing domains relations are added to the KG. In this work, we study KGSQA in a previously unstudied setting where new, unseen domains are added during test time. In this setting, question-answer pairs of the new domain do not appear during training, thus making the task more challenging. We propose a data-centric domain adaptation framework that consists of a KGSQA system that is applicable to new domains, and a sequence to sequence question generation method that automatically generates question-answer pairs for the new domain. Since the effectiveness of question generation for KGSQA can be restricted by the limited lexical variety of the generated questions, we use distant supervision to extract a set of keywords that express each relation of the unseen domain and incorporate those in the question generation method. Experimental results demonstrate that our framework significantly improves over zero-shot baselines and is robust across domains.