论文标题
垂直半效率学习以进行有效的在线广告
Vertical Semi-Federated Learning for Efficient Online Advertising
论文作者
论文摘要
传统的垂直联合学习模式遇到了两个主要问题:1)将适用的范围限制为重叠样本和2)实时联合服务的高系统挑战,这限制了其在广告系统中的应用。为此,我们主张一种新的学习设置半VFL(垂直半赋予学习),以应对这些挑战。建议通过学习一种比单方型模型更好的联邦宣传本地模型,并维持本地服务的便利性,从而实现了VFL实用的行业应用方式。为此,我们向i)提出精心设计的联合特权学习框架(JPL),以减轻被动方的缺乏,ii)适应整个样本空间。具体而言,我们构建了适用于整个样本空间的推理单方学生模型,同时保持联合特征扩展的优势。新的表示蒸馏方法旨在提取重叠和非重叠数据的交叉派对特征相关性。我们对现实世界广告数据集进行了广泛的实验。结果表明,我们的方法在基线方法上实现了最佳性能,并在半VFL设置中验证其优越性。
The traditional vertical federated learning schema suffers from two main issues: 1) restricted applicable scope to overlapped samples and 2) high system challenge of real-time federated serving, which limits its application to advertising systems. To this end, we advocate a new learning setting Semi-VFL (Vertical Semi-Federated Learning) to tackle these challenge. Semi-VFL is proposed to achieve a practical industry application fashion for VFL, by learning a federation-aware local model which performs better than single-party models and meanwhile maintain the convenience of local-serving. For this purpose, we propose the carefully designed Joint Privileged Learning framework (JPL) to i) alleviate the absence of the passive party's feature and ii) adapt to the whole sample space. Specifically, we build an inference-efficient single-party student model applicable to the whole sample space and meanwhile maintain the advantage of the federated feature extension. New representation distillation methods are designed to extract cross-party feature correlations for both the overlapped and non-overlapped data. We conducted extensive experiments on real-world advertising datasets. The results show that our method achieves the best performance over baseline methods and validate its superiority in the Semi-VFL setting.