论文标题

签名网络中的平衡集合计算:概念和算法

Balanced Clique Computation in Signed Networks: Concepts and Algorithms

论文作者

Chen, Zi, Yuan, Long, Lin, Xuemin, Qin, Lu, Zhang, Wenjie

论文摘要

集团是网络分析中凝聚性子图挖掘的最基本模型之一。现有的集团模型主要关注未签名的网络。但是,在现实世界中,许多应用程序被建模为具有正边和负边​​缘的签名网络。由于签名的网络拥有自己的属性与未签名的网络不同,因此现有的集团模型不适用于签名的网络。在此激励的情况下,我们提出了平衡的集团模型,该模型认为签名网络是最基本和最主要的理论,结构平衡理论。遵循平衡的集团模型,我们研究了最大平衡集团枚举问题(MBCE),该问题计算给定符号网络中的所有最大平衡集团以及最大平衡的集体搜索问题(MBC),该问题计算以最大大小计算平衡的集团。我们表明MBCE问题和MBCS问题都是NP-HARD。对于MBCE问题,一个直接的解决方案是将签名的网络视为两个未签名的网络,并利用未签名网络的现成技术。但是,对于大型签名网络,这种解决方案效率低下。为了解决这个问题,在本文中,我们首先提出了一种新的最大平衡集团枚举算法,通过利用签名网络的独特属性。基于新提出的算法,我们制定了两种优化策略,以进一步提高枚举的效率。对于MBCS问题,我们根据搜索空间分区提出了一个新的搜索框架。为了进一步提高新框架的效率,我们提出了有关冗余搜索分支和无效候选者的多种优化策略。我们在大型真实数据集上进行了广泛的实验。实验结果证明了我们提出的针对MBCE问题和MBCS问题的算法的效率,有效性和可扩展性。

Clique is one of the most fundamental models for cohesive subgraph mining in network analysis. Existing clique model mainly focuses on unsigned networks. However, in real world, many applications are modeled as signed networks with positive and negative edges. As the signed networks hold their own properties different from the unsigned networks, the existing clique model is inapplicable for the signed networks. Motivated by this, we propose the balanced clique model that considers the most fundamental and dominant theory, structural balance theory, for signed networks. Following the balanced clique model, we study the maximal balanced clique enumeration problem (MBCE) which computes all the maximal balanced cliques in a given signed network and the maximum balanced clique search problem (MBCS) which computes the balanced clique with maximum size. We show that MBCE problem and MBCS problem are both NP-Hard. For the MBCE problem, a straightforward solution is to treat the signed network as two unsigned networks and leverage the off-the-shelf techniques for unsigned networks. However, such a solution is inefficient for large signed networks. To address this problem, in this paper, we first propose a new maximal balanced clique enumeration algorithm by exploiting the unique properties of signed networks. Based on the new proposed algorithm, we devise two optimization strategies to further improve the efficiency of the enumeration. For the MBCS problem, we propose a new search framework based on search space partition. To further improve the efficiency of the new framework, we propose multiple optimization strategies regarding to redundant search branches and invalid candidates. We conduct extensive experiments on large real datasets. The experimental results demonstrate the efficiency, effectiveness and scalability of our proposed algorithms for MBCE problem and MBCS problem.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源