通过光谱嵌入和二次编程之间的顶点提名

论文标题

通过光谱嵌入和二次编程之间的顶点提名

Vertex nomination between graphs via spectral embedding and quadratic programming

论文作者

Zheng, Runbing, Lyzinski, Vince, Priebe, Carey E., Tang, Minh

论文摘要

给定一个网络和一系列有趣的顶点，其身份仅是部分知道的，顶点提名问题旨在将其剩余顶点排名，以使有趣的顶点排在列表的顶部。该问题的一个重要变体是在多画像设置中的顶点提名。给定两个图形$ g_1，g_2 $，带有常见顶点和一个兴趣的顶点$ x \在g_1 $中，我们希望对$ g_2 $的顶点进行排名，以使与$ x $最相似的顶点排名在列表的顶部。当前的论文解决了此问题，并提出了一种方法，该方法首先应用邻接光谱图将图嵌入到一个共同的欧几里得空间中，然后解决惩罚的线性分配问题以获取提名列表。由于图形的光谱嵌入仅是唯一的到正交变换，因此我们提出了两种消除这种潜在的非识别性的方法。一种方法是基于正交式的，并且当两个图之间有已知对应关系时，适用于正交倾向。另一种方法使用自适应点集注册，并且当很少或没有具有已知对应的顶点或没有的顶点时适用。我们表明，我们的提名方案导致在生成模型下的准确提名，这些模型对成对的随机图对，这些图形大约是低级且可能具有成对边缘相关性的。我们通过模拟合成数据的模拟研究来说明算法的性能，并分析了Bing搜索引擎上网页之间的过渡速率的分析。

Given a network and a subset of interesting vertices whose identities are only partially known, the vertex nomination problem seeks to rank the remaining vertices in such a way that the interesting vertices are ranked at the top of the list. An important variant of this problem is vertex nomination in the multi-graphs setting. Given two graphs $G_1, G_2$ with common vertices and a vertex of interest $x \in G_1$, we wish to rank the vertices of $G_2$ such that the vertices most similar to $x$ are ranked at the top of the list. The current paper addresses this problem and proposes a method that first applies adjacency spectral graph embedding to embed the graphs into a common Euclidean space, and then solves a penalized linear assignment problem to obtain the nomination lists. Since the spectral embedding of the graphs are only unique up to orthogonal transformations, we present two approaches to eliminate this potential non-identifiability. One approach is based on orthogonal Procrustes and is applicable when there are enough vertices with known correspondence between the two graphs. Another approach uses adaptive point set registration and is applicable when there are few or no vertices with known correspondence. We show that our nomination scheme leads to accurate nomination under a generative model for pairs of random graphs that are approximately low-rank and possibly with pairwise edge correlations. We illustrate our algorithm's performance through simulation studies on synthetic data as well as analysis of a high-school friendship network and analysis of transition rates between web pages on the Bing search engine.

下载PDF全文

下载文献需遵守相关版权规定

论文标题