Paper Title

QuoteR: A Benchmark of Quote Recommendation for Writing

Authors

Fanchao Qi, Yanhui Yang, Jing Yi, Zhili Cheng, Zhiyuan Liu, Maosong Sun

Abstract

It is very common to use quotations (quotes) to make our writings more elegant or convincing. To help people find appropriate quotes efficiently, the task of quote recommendation is presented, aiming to recommend quotes that fit the current context of writing. There have been various quote recommendation approaches, but they are evaluated on different unpublished datasets. To facilitate the research on this task, we build a large and fully open quote recommendation dataset called QuoteR, which comprises three parts including English, standard Chinese and classical Chinese. Any part of it is larger than previous unpublished counterparts. We conduct an extensive evaluation of existing quote recommendation methods on QuoteR. Furthermore, we propose a new quote recommendation model that significantly outperforms previous methods on all three parts of QuoteR. All the code and data of this paper are available at https://github.com/thunlp/QuoteR.
