论文标题

KCOREMOTIF:通过利用K核分解和基序来用于大型网络的有效图聚类算法

KCoreMotif: An Efficient Graph Clustering Algorithm for Large Networks by Exploiting k-core Decomposition and Motifs

论文作者

Mei, Gang, Tu, Jingzhi, Xiao, Lei, Piccialli, Francesco

论文摘要

聚类分析已广泛用于在各种复杂网络(例如无线传感器网络和在线社交网络)上的信任评估中。光谱群集是用于图形结构数据(网络)的最常用算法之一。但是,由于需要计算昂贵的矩阵操作,因此与大规模网络的常规光谱群集本质上很难使用。为了处理大型网络,在本文中,我们提出了一种有效的图形聚类算法,KCOREMOTIF,特别是通过利用K核心分解和图案的大型网络。所提出的聚类算法背后的基本思想是在K核子图上执行基于基序的光谱聚类算法,而不是在整个图上执行。更具体地说,(1)我们首先进行大型输入网络的K核分解; (2)然后,我们对顶部K核子图执行基于基序的光谱聚类; (3)我们将其余的顶点(K-1)核子图分组为先前发现的簇;并最终获得大型输入网络的所需簇。为了评估所提出的图形聚类算法KcoreMotif的性能,我们同时使用常规和基于基序的光谱群集算法作为基础线,并将我们的算法与它们的算法与18组现实世界数据集进行比较。比较结果表明,所提出的图形聚类算法对于大型网络而言是准确但有效的,这也意味着它可以进一步用于评估大型网络上的群集内和集群间信任。

Clustering analysis has been widely used in trust evaluation on various complex networks such as wireless sensors networks and online social networks. Spectral clustering is one of the most commonly used algorithms for graph-structured data (networks). However, the conventional spectral clustering is inherently difficult to work with large-scale networks due to the fact that it needs computationally expensive matrix manipulations. To deal with large networks, in this paper, we propose an efficient graph clustering algorithm, KCoreMotif, specifically for large networks by exploiting k-core decomposition and motifs. The essential idea behind the proposed clustering algorithm is to perform the efficient motif-based spectral clustering algorithm on k-core subgraphs, rather than on the entire graph. More specifically, (1) we first conduct the k-core decomposition of the large input network; (2) we then perform the motif-based spectral clustering for the top k-core subgraphs; (3) we group the remaining vertices in the rest (k-1)-core subgraphs into previously found clusters; and finally obtain the desired clusters of the large input network. To evaluate the performance of the proposed graph clustering algorithm KCoreMotif, we use both the conventional and the motif-based spectral clustering algorithms as the baselines and compare our algorithm with them for 18 groups of real-world datasets. Comparative results demonstrate that the proposed graph clustering algorithm is accurate yet efficient for large networks, which also means that it can be further used to evaluate the intra-cluster and inter-cluster trusts on large networks.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源