论文标题
显示广告中的印象分配和政策搜索
Impression Allocation and Policy Search in Display Advertising
论文作者
论文摘要
在在线展示广告中,保证合同和实时投标(RTB)是出版商出售印象的两种主要方法。对于大型出版商而言,同时通过保证合同和内部RTB出售印象已成为一个流行的选择。一般而言,出版商需要得出保证合同和RTB之间的印象分配策略,以最大程度地提高其整体结果(例如收入和/或印象质量)。但是,得出最佳策略并不是一项琐碎的任务,例如,该策略应鼓励RTB中的激励兼容性,并应对现实世界中的不稳定交通模式(例如,印象量和竞标景观的改变)的共同挑战。在本文中,我们将印象分配作为拍卖问题,每个保证合同都会提交个人印象的虚拟出价。通过此公式,我们得出了保证合同的最佳招标函数,从而导致最佳印象分配。为了应对不稳定的交通模式挑战并实现最佳的总体结果,我们提出了一种多代理增强学习方法,以调整每个保证合同的投标,这是一个简单的,有效且可扩展的。在现实世界数据集上进行的实验证明了我们方法的有效性。
In online display advertising, guaranteed contracts and real-time bidding (RTB) are two major ways to sell impressions for a publisher. For large publishers, simultaneously selling impressions through both guaranteed contracts and in-house RTB has become a popular choice. Generally speaking, a publisher needs to derive an impression allocation strategy between guaranteed contracts and RTB to maximize its overall outcome (e.g., revenue and/or impression quality). However, deriving the optimal strategy is not a trivial task, e.g., the strategy should encourage incentive compatibility in RTB and tackle common challenges in real-world applications such as unstable traffic patterns (e.g., impression volume and bid landscape changing). In this paper, we formulate impression allocation as an auction problem where each guaranteed contract submits virtual bids for individual impressions. With this formulation, we derive the optimal bidding functions for the guaranteed contracts, which result in the optimal impression allocation. In order to address the unstable traffic pattern challenge and achieve the optimal overall outcome, we propose a multi-agent reinforcement learning method to adjust the bids from each guaranteed contract, which is simple, converging efficiently and scalable. The experiments conducted on real-world datasets demonstrate the effectiveness of our method.