多跳问题回答

论文标题

多跳问题回答

Multi-hop Question Answering

论文作者

Mavi, Vaibhav, Jangra, Anubhav, Jatowt, Adam

论文摘要

回答的任务（QA）长期引起了重大的研究兴趣。它与语言理解和知识检索任务的相关性以及简单的设置使QA的任务对强大的AI系统至关重要。在简单的质量检查任务上的最新成功将重点转移到了更复杂的设置上。其中，多跳质量质量检查（MHQA）是近年来研究最多的任务之一。从广义上讲，MHQA是回答自然语言问题的任务，该问题涉及提取和结合多个信息并执行多个推理步骤。一个多跳的问题的一个例子是“阿根廷PGA冠军唱片持有者赢得了全球有多少锦标赛？”。回答这个问题可能需要两个信息：“阿根廷PGA冠军赛的记录持有人是谁？”和“第1季度的答案赢得了多少锦标赛？”。回答多跳问题和执行多步推理的能力可以显着改善NLP系统的实用性。因此，该领域看到了具有高质量数据集，模型和评估策略的激增。 “多个啤酒花”的概念有些抽象，这导致了需要多跳推理的各种任务。这导致了不同的数据集和模型，这些数据集和模型彼此之间有很大差异，并使该领域具有挑战性地概括和调查。我们旨在提供对MHQA任务的一般和正式定义，并组织和总结现有的MHQA框架。我们还概述了构建MHQA数据集的一些最佳实践。这本书提供了系统的详尽介绍，以及对这项非常有趣但又具有挑战性的任务的现有尝试的结构。

The task of Question Answering (QA) has attracted significant research interest for long. Its relevance to language understanding and knowledge retrieval tasks, along with the simple setting makes the task of QA crucial for strong AI systems. Recent success on simple QA tasks has shifted the focus to more complex settings. Among these, Multi-Hop QA (MHQA) is one of the most researched tasks over the recent years. In broad terms, MHQA is the task of answering natural language questions that involve extracting and combining multiple pieces of information and doing multiple steps of reasoning. An example of a multi-hop question would be "The Argentine PGA Championship record holder has won how many tournaments worldwide?". Answering the question would need two pieces of information: "Who is the record holder for Argentine PGA Championship tournaments?" and "How many tournaments did [Answer of Sub Q1] win?". The ability to answer multi-hop questions and perform multi step reasoning can significantly improve the utility of NLP systems. Consequently, the field has seen a surge with high quality datasets, models and evaluation strategies. The notion of 'multiple hops' is somewhat abstract which results in a large variety of tasks that require multi-hop reasoning. This leads to different datasets and models that differ significantly from each other and makes the field challenging to generalize and survey. We aim to provide a general and formal definition of the MHQA task, and organize and summarize existing MHQA frameworks. We also outline some best practices for building MHQA datasets. This book provides a systematic and thorough introduction as well as the structuring of the existing attempts to this highly interesting, yet quite challenging task.

下载PDF全文

下载文献需遵守相关版权规定

论文标题