论文标题
评论:随机步行图抽样
A Review: Random Walk in Graph Sampling
论文作者
论文摘要
图形采样是一种从原始图中挑选顶点和/或边缘子集的技术。在各种图形采样方法中,基于遍历的采样(TBS)被广泛使用,因为许多情况下的成本低和可行性,其中简单的随机步行(SRW)及其变体在TBS中具有很大比例的TBS。我们说明了SRW的基础,并提出了SRW的问题。基于这些问题,我们提供了基于不同随机步行(RW)的图形采样方法的分类法,并深入了解它们的原因以及如何修改SRW。我们的摘要包括经典方法和最先进的RW方法。有3种方法可以基于SRW提出新算法,包括SRW及其组合,修改的选择机制以及图形拓扑修改。我们解释了这些算法背后的想法,并提供了详细的伪代码。此外,我们添加了随机步行背后的数学以及随机步行变体的本质,这在许多研究论文和文献评论中均未详细提及。除基于RW的方法外,SRW还与非RW和非TBS方法有关,我们讨论了SRW和非RW方法之间的关系,以及SRW和非TBS方法之间的关系。这些方法之间的关系是正式的,并提供了桥接理论分析和实际实施的一般框架。
Graph sampling is a technique to pick a subset of vertices and/ or edges from original graph. Among various graph sampling approaches, Traversal Based Sampling (TBS) are widely used due to low cost and feasibility for many cases, in which Simple Random Walk (SRW) and its variants share a large proportion in TBS. We illustrate the foundation SRW and presents the problems of SRW. Based on the problems, we provide a taxonomy of different Random Walk (RW) based graph sampling methods and give an insight to the reason why and how they revise SRW. our summary includes classical methods and state-of-art RW-based methods. There are 3 ways to propose new algorithms based on SRW, including SRW and its combinations, modified selection mechanisms, and the graph topology modification. We explained the ideas behind those algorithms, and present detailed pseudo codes. In addition, we add the mathematics behind random walk, and the essence of random walk variants, which is not mentioned in detail in many research papers and literature reviews. Apart from RW-based methods, SRW also has related with the non-RW and non-TBS methods, we discuss the relationships between SRW and non-RW methods, and the relationships between SRW and non-TBS methods. The relations between these approaches are formally argued and a general framework to bridge theoretical analysis and practical implementation is provided.