论文标题
多域推荐的自适应域兴趣网络
Adaptive Domain Interest Network for Multi-domain Recommendation
论文作者
论文摘要
工业推荐系统通常从多种业务方案中保存数据,并有望同时为这些方案提供建议服务。在检索步骤中,从大量语料库中选择的TOPK高质量项目通常需要在多种情况下都是各种各样的。以阿里巴巴展示广告系统为例,不仅是因为淘宝用户的行为模式是多样的,而且广告客户分配的方案的出价价格也有很大差异。传统方法要么分别针对每种情况进行训练模型,因此忽略了用户组和项目的跨域重叠,或者简单地将所有样本混合并维护共享模型,这使得很难在场景之间捕获重要的多样性。在本文中,我们提出了自适应领域兴趣网络,该网络可以自适应地处理各个方案的共同点和多样性,从而在培训期间充分利用多阶段数据。然后,提出的方法能够通过在线推断期间为不同方案提供各种TOPK候选人来提高每个业务领域的性能。具体而言,我们提出的ADI分别通过共享网络和特定领域的网络对不同领域的共同点和多样性进行了建模。此外,我们将特定于域的批处理归一化应用于特征级域的适应性域兴趣适应层。还制定了一种自训练策略,以捕获跨域的标签级连接。ADI已在阿里巴巴的展示广告系统中部署,并获得了1.8%的广告收入。
Industrial recommender systems usually hold data from multiple business scenarios and are expected to provide recommendation services for these scenarios simultaneously. In the retrieval step, the topK high-quality items selected from a large number of corpus usually need to be various for multiple scenarios. Take Alibaba display advertising system for example, not only because the behavior patterns of Taobao users are diverse, but also differentiated scenarios' bid prices assigned by advertisers vary significantly. Traditional methods either train models for each scenario separately, ignoring the cross-domain overlapping of user groups and items, or simply mix all samples and maintain a shared model which makes it difficult to capture significant diversities between scenarios. In this paper, we present Adaptive Domain Interest network that adaptively handles the commonalities and diversities across scenarios, making full use of multi-scenarios data during training. Then the proposed method is able to improve the performance of each business domain by giving various topK candidates for different scenarios during online inference. Specifically, our proposed ADI models the commonalities and diversities for different domains by shared networks and domain-specific networks, respectively. In addition, we apply the domain-specific batch normalization and design the domain interest adaptation layer for feature-level domain adaptation. A self training strategy is also incorporated to capture label-level connections across domains.ADI has been deployed in the display advertising system of Alibaba, and obtains 1.8% improvement on advertising revenue.