论文标题
高质量的对话系统
High-quality Conversational Systems
论文作者
论文摘要
会话系统或聊天机器人是AI注入应用程序(AIIA)的示例。聊天机器人尤其重要,因为它们通常是客户与企业的第一次互动,并且是AI(人工智能)世界的业务的切入点。因此,聊天机器人的质量是关键。但是,就像AIIAS的一般情况一样,评估和控制聊天机器人系统质量尤其具有挑战性。除了这些系统固有的统计性质之外,偶尔失败是可以接受的,我们还确定了两个主要挑战。首先是释放具有足够质量的初始系统,以便人类与之相互作用。第二个是维持质量,增强其功能,改善它并根据不断变化的用户要求或漂移进行必要的调整。之所以存在这些挑战,是因为不可能预测用户请求的真实分布以及他们将用来表达这些请求的自然语言。此外,请求的任何经验分布可能会随着时间而变化。这可能是由于周期性,更改使用和主题的漂移。 我们提供了一组方法和一组技术来解决这些挑战,并通过人类的方法提供自动帮助。我们注意到,在聊天机器人开发的生命周期中的不同阶段之间建立联系至关重要,并确保它提供了预期的业务价值。例如,它可以释放人类代理人处理以外的其他任务。我们的方法和技术在聊天机器人培训期间适用于预生产阶段的聊天机器人培训,以实现后期制作阶段的聊天机器人使用。他们通过协助敏捷设计实现“测试第一”范式,并通过可行的见解来支持持续的集成。
Conversational systems or chatbots are an example of AI-Infused Applications (AIIA). Chatbots are especially important as they are often the first interaction of clients with a business and are the entry point of a business into the AI (Artificial Intelligence) world. The quality of the chatbot is, therefore, key. However, as is the case in general with AIIAs, it is especially challenging to assess and control the quality of chatbot systems. Beyond the inherent statistical nature of these systems, where occasional failure is acceptable, we identify two major challenges. The first is to release an initial system that is of sufficient quality such that humans will interact with it. The second is to maintain the quality, enhance its capabilities, improve it and make necessary adjustments based on changing user requests or drift. These challenges exist because it is impossible to predict the real distribution of user requests and the natural language they will use to express these requests. Moreover, any empirical distribution of requests is likely to change over time. This may be due to periodicity, changing usage, and drift of topics. We provide a methodology and set of technologies to address these challenges and to provide automated assistance through a human-in-the-loop approach. We notice that it is crucial to connect between the different phases in the lifecycle of the chatbot development and to make sure it provides its expected business value. For example, that it frees human agents to deal with tasks other than answering human users. Our methodology and technologies apply during chatbot training in the pre-production phase, through to chatbot usage in the field in the post-production phase. They implement the `test first' paradigm by assisting in agile design, and support continuous integration through actionable insights.