为通用用户表示剥削行为一致性

论文标题

为通用用户表示剥削行为一致性

Exploiting Behavioral Consistence for Universal User Representation

论文作者

Gu, Jie, Wang, Feng, Sun, Qinghui, Ye, Zhiquan, Xu, Xiaoxiao, Chen, Jingmin, Zhang, Jun

论文摘要

用户建模对于开发行业个性化服务至关重要。用户建模的一种常见方法是学习可以通过其兴趣或偏好区分的用户表示形式。在这项工作中，我们专注于开发通用用户表示模型。所获得的通用表示形式有望包含丰富的信息，并适用于各种下游应用程序，而无需进行进一步的修改（例如，用户偏好预测和用户分析）。因此，像以前的工作一样，我们可以摆脱针对每个下游任务的培训特定任务模型的大量工作。具体而言，我们建议自我监督的用户建模网络（SUMN）将行为数据编码为通用表示形式。它包括两个关键组件。第一个是一个新的学习目标，它指导模型在自我监督的学习框架下完全识别和保留有价值的用户信息。另一个是一个多跳聚合层，它使模型能力在汇总各种行为方面受益。基准数据集的广泛实验表明，我们的方法可以胜过最先进的无监督表示方法，甚至可以与受监督的方法竞争。

User modeling is critical for developing personalized services in industry. A common way for user modeling is to learn user representations that can be distinguished by their interests or preferences. In this work, we focus on developing universal user representation model. The obtained universal representations are expected to contain rich information, and be applicable to various downstream applications without further modifications (e.g., user preference prediction and user profiling). Accordingly, we can be free from the heavy work of training task-specific models for every downstream task as in previous works. In specific, we propose Self-supervised User Modeling Network (SUMN) to encode behavior data into the universal representation. It includes two key components. The first one is a new learning objective, which guides the model to fully identify and preserve valuable user information under a self-supervised learning framework. The other one is a multi-hop aggregation layer, which benefits the model capacity in aggregating diverse behaviors. Extensive experiments on benchmark datasets show that our approach can outperform state-of-the-art unsupervised representation methods, and even compete with supervised ones.

下载PDF全文

下载文献需遵守相关版权规定

论文标题