论文标题

通过邻里选择同时估算图形模型

Simultaneous Estimation of Graphical Models by Neighborhood Selection

论文作者

Moysidis, Ilias, Li, Bing

论文摘要

在许多有关统计图形模型的应用中,数据源自几个具有相似之处但也有显着差异的亚群。这就提出了一个问题,即如何同时估算几个图形模型。将所有数据汇编在一起估算单个图将忽略亚种群之间的差异。另一方面,分别从每个亚群中估算图形并不能有效利用数据中的共同结构。我们通过在稀疏的诱导惩罚下估算所涉及变量的拓扑邻域来开发一种与多个图形模型同时估算的新方法,该社区考虑了子群中的共同结构。与现有的联合图形模型的方法不同,我们的方法不依赖大型矩阵的光谱分解,因此在计算上对估计大型网络更具吸引力。此外,我们开发了方法的渐近特性,证明其数值复杂性,并通过模拟将其与几种现有方法进行比较。最后,我们将方法应用于肺癌数据集的基因组网络的估计,该数据集由几个亚群组成。

In many applications concerning statistical graphical models the data originate from several subpopulations that share similarities but have also significant differences. This raises the question of how to estimate several graphical models simultaneously. Compiling all the data together to estimate a single graph would ignore the differences among subpopulations. On the other hand, estimating a graph from each subpopulation separately does not make efficient use of the common structure in the data. We develop a new method for simultaneous estimation of multiple graphical models by estimating the topological neighborhoods of the involved variables under a sparse inducing penalty that takes into account the common structure in the subpopulations. Unlike the existing methods for joint graphical models, our method does not rely on spectral decomposition of large matrices, and is therefore more computationally attractive for estimating large networks. In addition, we develop the asymptotic properties of our method, demonstrate its the numerical complexity, and compare it with several existing methods by simulation. Finally, we apply our method to the estimation of genomic networks for a lung cancer dataset which consists of several subpopulations.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源