Paper Title
The Power of Tests for Detecting $p$-Hacking
Paper Authors
Paper Abstract
A flourishing empirical literature investigates the prevalence of $p$-hacking based on the distribution of $p$-values across studies. Interpreting results in this literature requires a careful understanding of the power of methods for detecting $p$-hacking. We theoretically study the implications of likely forms of $p$-hacking on the distribution of $p$-values to understand the power of tests for detecting it. Power can be low and depends crucially on the $p$-hacking strategy and the distribution of true effects. Combined tests for upper bounds and monotonicity and tests for continuity of the $p$-curve tend to have the highest power for detecting $p$-hacking.