论文标题

Salesbot:从chit聊天过渡到面向任务的对话

SalesBot: Transitioning from Chit-Chat to Task-Oriented Dialogues

论文作者

Chiu, Ssu, Li, Maolin, Lin, Yen-Ting, Chen, Yun-Nung

论文摘要

对话系统通常分为两种类型,开放域和面向任务。第一个专注于与用户聊天并使他们参与对话,其中选择适合对话上下文的适当主题对于成功的对话至关重要。另一个专注于特定的任务,而不是随意演讲,例如,在星期五晚上找到电影或播放歌曲。由于它们的不同目的,已经分别研究了这两个方向。但是,从社交聊天到以任务为导向的对话的顺利过渡对于触发商机很重要,而且没有关注这种情况的公共数据。因此,本文着重研究从开放域的社交聊天开始的对话,然后逐渐过渡到以任务为导向的目的,并发布了一个大规模数据集,并提供了详细的注释,以鼓励这一研究方向。为了实现这一目标,本文提出了一个框架来自动生成许多对话,而无需人类参与,其中任何强大的开放域对话生成模型都可以轻松利用。人类评估表明,我们生成的对话数据具有合理质量的自然流程,这表明我们发布的数据具有指导未来的研究方向和商业活动的巨大潜力。此外,发布的模型允许研究人员在目标场景中自动生成无限的对话,这可以极大地使半监督和无监督的方法受益。

Dialogue systems are usually categorized into two types, open-domain and task-oriented. The first one focuses on chatting with users and making them engage in the conversations, where selecting a proper topic to fit the dialogue context is essential for a successful dialogue. The other one focuses on a specific task instead of casual talks, e.g., finding a movie on Friday night, or playing a song. These two directions have been studied separately due to their different purposes. However, how smoothly transitioning from social chatting to task-oriented dialogues is important for triggering business opportunities, and there is no public data focusing on such scenarios. Hence, this paper focuses on investigating the conversations starting from open-domain social chatting and then gradually transitioning to task-oriented purposes, and releases a large-scale dataset with detailed annotations for encouraging this research direction. To achieve this goal, this paper proposes a framework to automatically generate many dialogues without human involvement, in which any powerful open-domain dialogue generation model can be easily leveraged. The human evaluation shows that our generated dialogue data has a natural flow at a reasonable quality, showing that our released data has a great potential of guiding future research directions and commercial activities. Furthermore, the released models allow researchers to automatically generate unlimited dialogues in the target scenarios, which can greatly benefit semi-supervised and unsupervised approaches.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源