论文标题

在找到具有最大持久概率的社区

On Finding the Community with Maximum Persistence Probability

论文作者

Avellone, Alessandro, Benati, Stefano, Grassi, Rosanna, Rizzini, Giorgio

论文摘要

持久性概率是一个统计指数,已提出,该指数检测一个或多个嵌入网络中的社区。即使其定义很简单,例如,随机助行器保留在一组节点中的可能性,很少使用它来开发有效的算法来计算它的困难。在这里,我们提出了一个新的数学编程模型,以找到最大持久性概率的社区。该模型是整数分数编程,但可以将其简化为具有适当变量替换的混合构成线性编程。然而,该问题只能在合理的时间内解决小规模的网络,因此我们开发了一些启发式程序来近似最佳解决方案。首先,我们利用特殊的数据结构详细阐述了一种随机的贪婪方法,以快速生成可行的解决方案。在分析了贪婪的输出并确定最终解决方案最终位于何处之后,我们基于本地交易所实施了改进程序,但要应用不同的长期多元化原则,这些原则基于可变的邻域搜索和随机重新启动。接下来,我们在模拟图上应用了算法,这些算法准确地重现了实际网络中发现的聚类特征,以确定我们方法的可靠性和有效性。最后,我们将方法应用于两个真实的网络,将我们的发现与两个众所周知的替代社区检测程序发现的结果进行了比较。

The persistence probability is a statistical index that has been proposed to detect one or more communities embedded in a network. Even though its definition is straightforward, e.g, the probability that a random walker remains in a group of nodes, it has been seldom applied possibly for the difficulty of developing an efficient algorithm to calculate it. Here, we propose a new mathematical programming model to find the community with the largest persistence probability. The model is integer fractional programming, but it can be reduced to mixed-integer linear programming with an appropriate variable substitution. Nevertheless, the problem can be solved in a reasonable time for networks of small size only, therefore we developed some heuristic procedures to approximate the optimal solution. First, we elaborated a randomized greedy-ascent method, taking advantage of a peculiar data structure to generate feasible solutions fast. After analyzing the greedy output and determining where the optimal solution is eventually located, we implemented improving procedures based on a local exchange, but applying different long term diversification principles, that are based on variable neighborhood search and random restart. Next, we applied the algorithms on simulated graphs that reproduce accurately the clustering characteristics found in real networks to determine the reliability and the effectiveness of our methodology. Finally, we applied our method to two real networks, comparing our findings to what found by two well-known alternative community detection procedures.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源