论文标题
在随机区块模型中对外源分区结构的贝叶斯测试
Bayesian Testing for Exogenous Partition Structures in Stochastic Block Models
论文作者
论文摘要
网络数据通常显示出具有相似边缘形成模式的节点簇的块结构。当此类关系数据得到有关外源节点分区的其他信息补充时,这些知识来源通常包括在模型中,以监督群集分配机制或改善对边缘概率的推断。尽管这些解决方案是常规实施的,但缺乏正式方法来测试给定的外部节点分区是否与网络节点之间编码随机等价模式的内源性聚类结构一致。为了填补这一空白,我们开发了一种正式的贝叶斯测试程序,该程序依赖于贝叶斯因子在随机块模型之间的计算,其由外源性节点分区定义的已知分组结构和无限关系模型定义,该模型允许内源性聚类构型在网络中未知,随机且完全揭示。提出了一种简单的马尔可夫链蒙特卡洛方法,用于计算贝叶斯因子和量化内源性组的不确定性。在模拟和应用阿尔茨海默氏症患者脑网络中的外源性等效结构的应用中评估了此例程。
Network data often exhibit block structures characterized by clusters of nodes with similar patterns of edge formation. When such relational data are complemented by additional information on exogenous node partitions, these sources of knowledge are typically included in the model to supervise the cluster assignment mechanism or to improve inference on edge probabilities. Although these solutions are routinely implemented, there is a lack of formal approaches to test if a given external node partition is in line with the endogenous clustering structure encoding stochastic equivalence patterns among the nodes in the network. To fill this gap, we develop a formal Bayesian testing procedure which relies on the calculation of the Bayes factor between a stochastic block model with known grouping structure defined by the exogenous node partition and an infinite relational model that allows the endogenous clustering configurations to be unknown, random and fully revealed by the block-connectivity patterns in the network. A simple Markov chain Monte Carlo method for computing the Bayes factor and quantifying uncertainty in the endogenous groups is proposed. This routine is evaluated in simulations and in an application to study exogenous equivalence structures in brain networks of Alzheimer's patients.