论文标题
气候网络建设的陷阱:统计观点
Pitfalls of Climate Network Construction: A Statistical Perspective
论文作者
论文摘要
基于网络的动态系统分析在气候科学中变得越来越流行。在这里,我们从统计的角度解决了网络构建,并突出了通常被忽略的事实,即计算出的相关值只是经验估计。为了衡量偏离地面真理网络的偏差,我们模拟了球体上的时间依赖性各向同性随机场,并应用了通用的网络构建技术。我们找到了几种方法,这些方法来自估计程序对网络特征产生重大影响。当数据具有局部连贯的相关结构时,必须期望伪造的链接束链连接和虚假的高度集群。各向异性估计方差也会引起严重的偏见为经验网络。我们通过ERA5重新分析数据来验证我们的发现。此外,我们解释了为什么常用的重采样程序不适合显着性评估,并提出了一个统计上更有意义的整体构造框架。通过传达从稀缺数据的估计中出现的困难以及通过提出哪些设计决策会提高鲁棒性,我们希望将来有助于更可靠的气候网络构建。
Network-based analyses of dynamical systems have become increasingly popular in climate science. Here we address network construction from a statistical perspective and highlight the often ignored fact that the calculated correlation values are only empirical estimates. To measure spurious behaviour as deviation from a ground truth network, we simulate time-dependent isotropic random fields on the sphere and apply common network construction techniques. We find several ways in which the uncertainty stemming from the estimation procedure has major impact on network characteristics. When the data has locally coherent correlation structure, spurious link bundle teleconnections and spurious high-degree clusters have to be expected. Anisotropic estimation variance can also induce severe biases into empirical networks. We validate our findings with ERA5 reanalysis data. Moreover we explain why commonly applied resampling procedures are inappropriate for significance evaluation and propose a statistically more meaningful ensemble construction framework. By communicating which difficulties arise in estimation from scarce data and by presenting which design decisions increase robustness, we hope to contribute to more reliable climate network construction in the future.