论文标题

社会科学指导特征工程:一种新颖的签名链接分析方法

Social Science Guided Feature Engineering: A Novel Approach to Signed Link Analysis

论文作者

Beigi, Ghazaleh, Tang, Jiliang, Liu, Huan

论文摘要

许多现实世界的关系可以通过具有积极联系(例如友谊和信任)和负面联系(例如敌人和不信任)的签名网络来表示。链接预测有助于社交网络分析(例如推荐系统)中的进步任务。链接分析上的大多数现有工作都集中在未签名的社交网络上。负面链接的存在激发了研究兴趣在研究签名网络的财产和原理是否与未签名网络的财产和原则不同,并要求为签名的社交网络进行链接分析的专门努力。最近的发现表明,签名网络的属性与未签名网络的属性有很大差异,而负面链接可能会在签名的链接分析中以互补的方式具有重要帮助。在本文中,我们将讨论集中在签名链接分析的具有挑战性的问题上。签名的链接分析面临数据稀疏问题,即仅给出了少数签名的链接。当负面链接比积极的链接稀疏时,这个问题甚至会变得更糟,因为用户更倾向于积极的处置而不是负面。我们研究如何利用其他信息来源进行签名链接分析。这项研究主要以三种社会科学理论,情感信息,创新的扩散和个人个性为指导。在这些指导下,我们提取三类相关功能,并利用它们进行签名链接分析。实验显示了从社会理论中收集的特征的重要性,这些特征对于签名的链接预测并解决了数据稀疏挑战。

Many real-world relations can be represented by signed networks with positive links (e.g., friendships and trust) and negative links (e.g., foes and distrust). Link prediction helps advance tasks in social network analysis such as recommendation systems. Most existing work on link analysis focuses on unsigned social networks. The existence of negative links piques research interests in investigating whether properties and principles of signed networks differ from those of unsigned networks, and mandates dedicated efforts on link analysis for signed social networks. Recent findings suggest that properties of signed networks substantially differ from those of unsigned networks and negative links can be of significant help in signed link analysis in complementary ways. In this article, we center our discussion on a challenging problem of signed link analysis. Signed link analysis faces the problem of data sparsity, i.e. only a small percentage of signed links are given. This problem can even get worse when negative links are much sparser than positive ones as users are inclined more towards positive disposition rather than negative. We investigate how we can take advantage of other sources of information for signed link analysis. This research is mainly guided by three social science theories, Emotional Information, Diffusion of Innovations, and Individual Personality. Guided by these, we extract three categories of related features and leverage them for signed link analysis. Experiments show the significance of the features gleaned from social theories for signed link prediction and addressing the data sparsity challenge.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源