论文标题

标记与未标记网络的熵

Entropy of labeled versus unlabeled networks

论文作者

Paton, Jeremy, Hartle, Harrison, Stepanyants, Huck, van der Hoorn, Pim, Krioukov, Dmitri

论文摘要

网络的结构是一个未标记的图形,但是在大多数复杂网络模型中的图都被毫无意义的随机整数标记。关联的标记噪声是否总是可以忽略不计,还是可以压倒网络结构信号?为了解决这个问题,我们介绍并考虑了流行网络模型的稀疏未标记版本,并将其熵与原始标记版本进行比较。我们表明,即使其度分布却大不相同,标记和未标记的Erdos-Renyi图在熵上是等效的。配置模型的标记和未标记的版本在其前熵项中可能具有不同的预成分,尽管这仍然是猜想的。我们的主要结果是标记和未标记的一维随机几何图的熵的上限和下限。我们表明,与标记的熵相比,它们的未标记熵可以忽略不计。这意味着在稀疏网络中,无意义的标签的熵可能主导网络结构的熵。该结果的主要含义是,使用可交换模型来推理具有可区分节点的现实世界网络的共同实践可能会将不受控制的畸变引入对这些网络的结论中,这表明需要对网络科学的统计基础进行彻底重新检查。

The structure of a network is an unlabeled graph, yet graphs in most models of complex networks are labeled by meaningless random integers. Is the associated labeling noise always negligible, or can it overpower the network-structural signal? To address this question, we introduce and consider the sparse unlabeled versions of popular network models, and compare their entropy against the original labeled versions. We show that labeled and unlabeled Erdos-Renyi graphs are entropically equivalent, even though their degree distributions are very different. The labeled and unlabeled versions of the configuration model may have different prefactors in their leading entropy terms, although this remains conjectural. Our main results are upper and lower bounds for the entropy of labeled and unlabeled one-dimensional random geometric graphs. We show that their unlabeled entropy is negligible in comparison with the labeled entropy. This means that in sparse networks the entropy of meaningless labeling may dominate the entropy of the network structure. The main implication of this result is that the common practice of using exchangeable models to reason about real-world networks with distinguishable nodes may introduce uncontrolled aberrations into conclusions made about these networks, suggesting a need for a thorough reexamination of the statistical foundations and key results of network science.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源