论文标题
gatenet:单击速率预测的登陆式增强深网络
GateNet: Gating-Enhanced Deep Network for Click-Through Rate Prediction
论文作者
论文摘要
广告和饲料排名对于许多互联网公司(例如Facebook)至关重要。在许多现实世界的广告和饲料排名系统中,单击率(CTR)预测起着核心作用。近年来,已经提出并取得了许多基于神经网络的CTR模型,例如分解计算机支持的神经网络,DEEPFM和XDEEPFM。其中许多包含两个常用的组件:嵌入层和MLP隐藏层。另一方面,门控机制也广泛应用于许多研究领域,例如计算机视觉(CV)和自然语言处理(NLP)。一些研究证明,门控机制可以提高非凸深深度神经网络的训练性。受这些观察的启发,我们提出了一个名为Gatenet的新型模型,该模型分别引入特征嵌入门或隐藏的门分别为嵌入层或DNN CTR模型的隐藏层。功能嵌入门提供了一个可学习的功能门控模块,可从功能级别中选择明显的潜在信息。隐藏的门可帮助模型更有效地隐式捕获高阶相互作用。在三个现实世界数据集上进行的广泛实验证明了其有效性,可以提高所有数据集上各种最新模型(例如FM,DEEPFM和XDEEPFM)的性能。
Advertising and feed ranking are essential to many Internet companies such as Facebook. Among many real-world advertising and feed ranking systems, click through rate (CTR) prediction plays a central role. In recent years, many neural network based CTR models have been proposed and achieved success such as Factorization-Machine Supported Neural Networks, DeepFM and xDeepFM. Many of them contain two commonly used components: embedding layer and MLP hidden layers. On the other side, gating mechanism is also widely applied in many research fields such as computer vision(CV) and natural language processing(NLP). Some research has proved that gating mechanism improves the trainability of non-convex deep neural networks. Inspired by these observations, we propose a novel model named GateNet which introduces either the feature embedding gate or the hidden gate to the embedding layer or hidden layers of DNN CTR models, respectively. The feature embedding gate provides a learnable feature gating module to select salient latent information from the feature-level. The hidden gate helps the model to implicitly capture the high-order interaction more effectively. Extensive experiments conducted on three real-world datasets demonstrate its effectiveness to boost the performance of various state-of-the-art models such as FM, DeepFM and xDeepFM on all datasets.