论文标题
从语义潜在空间产生对话响应
Generating Dialogue Responses from a Semantic Latent Space
论文作者
论文摘要
现有的开放域对话生成模型通常经过训练,以模仿词汇上的跨膜片损失的训练集中的黄金响应。但是,良好的响应不需要类似于黄金响应,因为对给定提示有多种可能的响应。在这项工作中,我们假设当前的模型无法集成来自提示的多个语义相似的有效响应的信息,从而产生了通用和非信息响应。为了解决这个问题,我们提出了词汇端到端分类的替代方案。我们将提示和响应之间的夫妻关系作为潜在空间上的回归任务。在我们新颖的对话生成模型中,语义相关句子的表示在潜在空间上彼此近乎近距离。人类评估表明,在连续空间上学习任务可以产生相关且内容丰富的响应。
Existing open-domain dialogue generation models are usually trained to mimic the gold response in the training set using cross-entropy loss on the vocabulary. However, a good response does not need to resemble the gold response, since there are multiple possible responses to a given prompt. In this work, we hypothesize that the current models are unable to integrate information from multiple semantically similar valid responses of a prompt, resulting in the generation of generic and uninformative responses. To address this issue, we propose an alternative to the end-to-end classification on vocabulary. We learn the pair relationship between the prompts and responses as a regression task on a latent space instead. In our novel dialog generation model, the representations of semantically related sentences are close to each other on the latent space. Human evaluation showed that learning the task on a continuous space can generate responses that are both relevant and informative.