论文标题

在用二进制端点的案例研究中,评估多个插定以解决预期和意外丢失的数据

Evaluation of multiple imputation to address intended and unintended missing data in case-cohort studies with a binary endpoint

论文作者

Middleton, Melissa, Nguyen, Cattram, Carlin, John B., Moreno-Betancur, Margarita, Lee, Katherine J.

论文摘要

在队列研究中进行了病例研究研究,其中的暴露数据收集仅限于同类的一部分,从而导致大量删除数据划分的数据。标准分析使用逆概率加权(IPW)来解决此预期缺失的数据,但是在也意外丢失时,几乎没有研究如何最好地进行分析。多个插补(MI)已成为处理意外失踪性的默认标准,但是当与IPW结合使用时,归合模型需要考虑加权以确保与分析模型的兼容性。另外,MI可用于处理预期和意外的失踪性。虽然已经在案例研究的情况下研究了唯一的MI方法的性能,但尚不清楚该方法如何以二元结果进行。我们进行了一项模拟研究,以评估和比较仅使用MI,仅IPW和MI和IPW组合的方法的性能,以处理这种情况下的预期和意外失踪。我们还将这些方法应用于案例研究。我们的结果表明,当样本量较大时,合并的方法大致无偏见,以估算暴露效果,并且在两个样本大小设置中均具有较小的样本量的偏差,而仅MI-limy或仅仅IPW均显示出较大的偏见。这些发现表明,在具有二元结果的情况下,MI是处理预期和意外丢失数据的首选方法。

Case-cohort studies are conducted within cohort studies, wherein collection of exposure data is limited to a subset of the cohort, leading to a large proportion of missing data by design. Standard analysis uses inverse probability weighting (IPW) to address this intended missing data, but little research has been conducted into how best to perform analysis when there is also unintended missingness. Multiple imputation (MI) has become a default standard for handling unintended missingness, but when used in combination with IPW, the imputation model needs to take account of the weighting to ensure compatibility with the analysis model. Alternatively, MI could be used to handle both the intended and unintended missingness. While the performance of a solely MI approach has been investigated in the context of a case-cohort study with a time-to-event outcome, it is unclear how this approach performs with binary outcomes. We conducted a simulation study to assess and compare the performance of approaches using only MI, only IPW, and a combination of MI and IPW, for handling intended and unintended missingness in this setting. We also applied the approaches to a case study. Our results show that the combined approach is approximately unbiased for estimation of the exposure effect when the sample size is large, and was the least biased with small sample sizes, while MI-only or IPW-only exhibited larger biases in both sample size settings. These findings suggest that MI is the preferred approach to handle intended and unintended missing data in case-cohort studies with binary outcomes.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源