论文标题
学习通过随机散步来浏览维基百科
Learning to Navigate Wikipedia by Taking Random Walks
论文作者
论文摘要
智能网络代理的基本能力是寻求和获取新信息。互联网搜索引擎可靠地找到了正确的附近,但最高的结果可能是与所需目标相距的几个链接。一种互补的方法是通过超链接进行导航,采用理解本地内容的策略,并选择一个将其更接近目标的链接。在本文中,我们表明,随机采样轨迹的行为克隆足以学习有效的链接选择策略。我们在Wikipedia的图形版本上演示了3800万节点和387m边缘的方法。该模型能够分别在相距96%和92%的时间之间有效地导航。然后,我们在下游事实验证中使用所得的嵌入式和策略,并提出回答任务,与基本的TF-IDF搜索和排名方法结合使用,它们是最先进方法的竞争结果。
A fundamental ability of an intelligent web-based agent is seeking out and acquiring new information. Internet search engines reliably find the correct vicinity but the top results may be a few links away from the desired target. A complementary approach is navigation via hyperlinks, employing a policy that comprehends local content and selects a link that moves it closer to the target. In this paper, we show that behavioral cloning of randomly sampled trajectories is sufficient to learn an effective link selection policy. We demonstrate the approach on a graph version of Wikipedia with 38M nodes and 387M edges. The model is able to efficiently navigate between nodes 5 and 20 steps apart 96% and 92% of the time, respectively. We then use the resulting embeddings and policy in downstream fact verification and question answering tasks where, in combination with basic TF-IDF search and ranking methods, they are competitive results to the state-of-the-art methods.