论文标题
从用户与排名的互动中学习:该领域的统一
Learning from User Interactions with Rankings: A Unification of the Field
论文作者
论文摘要
排名系统构成了在线搜索引擎和推荐服务的基础。他们处理大量项目,例如网页或电子商务产品,并向用户提供少量订购的选择。排名系统的目的是帮助用户以最少的精力找到所需的项目。因此,他们产生的排名应将最相关或最喜欢的项目放在排名的顶部。学习排名是机器学习中的一个领域,涵盖了优化排名系统W.R.T.的方法。这个目标。传统的监督学习对方法进行排名的方法利用专家判断来评估和学习,但是,在许多情况下,这种判断是不可能或不可行的。作为解决方案,已经引入了通过用户点击进行进行学习以进行排名的方法。点击的困难在于,它们不仅受用户偏好的影响,而且还受到显示哪些排名的影响。因此,这些方法必须防止以外的其他因素偏向用户偏好。该论文涉及学习基于用户点击的方法,并特别旨在统一这些方法的不同家庭。 总体而言,本文的第二部分提出了一个框架,该框架弥合了在线,反事实和监督学习排名之间的许多差距。它采用了以前被认为是独立的方法,并将它们统一为一种方法,以广泛适用有效的学习来从用户点击中排名。
Ranking systems form the basis for online search engines and recommendation services. They process large collections of items, for instance web pages or e-commerce products, and present the user with a small ordered selection. The goal of a ranking system is to help a user find the items they are looking for with the least amount of effort. Thus the rankings they produce should place the most relevant or preferred items at the top of the ranking. Learning to rank is a field within machine learning that covers methods which optimize ranking systems w.r.t. this goal. Traditional supervised learning to rank methods utilize expert-judgements to evaluate and learn, however, in many situations such judgements are impossible or infeasible to obtain. As a solution, methods have been introduced that perform learning to rank based on user clicks instead. The difficulty with clicks is that they are not only affected by user preferences, but also by what rankings were displayed. Therefore, these methods have to prevent being biased by other factors than user preference. This thesis concerns learning to rank methods based on user clicks and specifically aims to unify the different families of these methods. As a whole, the second part of this thesis proposes a framework that bridges many gaps between areas of online, counterfactual, and supervised learning to rank. It has taken approaches, previously considered independent, and unified them into a single methodology for widely applicable and effective learning to rank from user clicks.