U级：以公用事业为导向的学习，以隐式反馈排名

论文标题

U级：以公用事业为导向的学习，以隐式反馈排名

U-rank: Utility-oriented Learning to Rank with Implicit Feedback

论文作者

Dai, Xinyi, Hou, Jiawei, Liu, Qing, Xi, Yunjia, Tang, Ruiming, Zhang, Weinan, He, Xiuqiang, Wang, Jun, Yu, Yong

论文摘要

在许多实际信息系统中，学习与隐式反馈进行排名是目标是某些特定实用程序的最重要任务之一，例如点击和收入。但是，我们指出的是，基于概率排名原则的现有方法并不一定能达到最高的效用。为此，我们提出了一个名为U级的新型排名框架，该框架直接优化了排名列表的预期效用。通过具有位置意识的深点击率预测模型，我们考虑了查询级别和项目级特征的注意力偏差。由于特定于项目的注意力偏置建模，对预期实用程序的优化对应于项目位置双方图上的最大重量匹配。我们将该目标的优化基于有效的Lambdaloss框架，该框架得到了理论和经验分析的支持。我们在三个基准数据集和两个专有数据集上对Web搜索和推荐系统进行了广泛的实验，其中展示了U级的性能比最先进的。此外，我们提出的U级已在大规模的商业推荐人中部署，并且在在线A/B测试中观察到了对生产基线的大量改进。

Learning to rank with implicit feedback is one of the most important tasks in many real-world information systems where the objective is some specific utility, e.g., clicks and revenue. However, we point out that existing methods based on probabilistic ranking principle do not necessarily achieve the highest utility. To this end, we propose a novel ranking framework called U-rank that directly optimizes the expected utility of the ranking list. With a position-aware deep click-through rate prediction model, we address the attention bias considering both query-level and item-level features. Due to the item-specific attention bias modeling, the optimization for expected utility corresponds to a maximum weight matching on the item-position bipartite graph. We base the optimization of this objective in an efficient Lambdaloss framework, which is supported by both theoretical and empirical analysis. We conduct extensive experiments for both web search and recommender systems over three benchmark datasets and two proprietary datasets, where the performance gain of U-rank over state-of-the-arts is demonstrated. Moreover, our proposed U-rank has been deployed on a large-scale commercial recommender and a large improvement over the production baseline has been observed in an online A/B testing.

下载PDF全文

下载文献需遵守相关版权规定

论文标题