数据集特征如何影响倾向得分方法的性能和控制观察性研究混淆的回归？模拟研究

论文标题

数据集特征如何影响倾向得分方法的性能和控制观察性研究混淆的回归？模拟研究

How do dataset characteristics affect the performance of propensity score methods and regression for controlling confounding in observational studies? A simulation study

论文作者

Wilkinson, J., Mamas, M. A., Kontopantelis, E.

论文摘要

在观察性研究中，研究人员必须选择一种控制混淆的方法。选项包括倾向得分方法和回归。目前尚不清楚数据集特性（大小，倾向分数重叠，暴露率的重叠）如何影响方法的相对性能，从而难以为特定数据集选择最佳方法。与逻辑回归相比，一项用于评估数据集特征对倾向得分方法性能的作用的仿真研究，以估计存在混杂的情况下的边际优势比。从逻辑和互补日志模型模拟结果，大小，倾向得分重叠，并且暴露率的流行率变化了。回归显示小样本量的覆盖范围较差，但是在样本量较大的情况下，与倾向分数方法相比，倾向得分的不平衡和低暴露率更高。倾向得分方法经常显示出次优覆盖范围，尤其是随着倾向分数的重叠率下降。这些问题在较大的样本量中加剧了。匹配方法的功率尤其受到缺乏重叠，暴露率低和样本量较小的影响。处理加权的反比概率的性能在很大程度上取决于数据集特性，覆盖率和偏见较差，重叠率较低。在灵敏度分析中，回归对大数据大小的优势尚不清楚，具有互补的日志结果产生机制和无法测量的混杂，具有优越的偏见和误差，但覆盖率较低，但覆盖率低于最近的邻居和卡尺匹配。

In observational studies, researchers must select a method to control for confounding. Options include propensity score methods and regression. It remains unclear how dataset characteristics (size, overlap in propensity scores, exposure prevalence) influence the relative performance of the methods, making it difficult to select the best method for a particular dataset. A simulation study to evaluate the role of dataset characteristics on the performance of propensity score methods, compared to logistic regression, for estimating a marginal odds ratio in the presence of confounding was conducted. Outcomes were simulated from logistic and complementary log-log models, and size, overlap in propensity scores, and prevalence of the exposure were varied. Regression showed poor coverage for small sample sizes, but with large sample sizes it was more robust to imbalance in propensity scores and low exposure prevalence than were propensity score methods. Propensity score methods frequently displayed suboptimal coverage, particularly as overlap in propensity scores decreased. These problems were exacerbated at larger sample sizes. Power of matching methods was particularly affected by lack of overlap, low prevalence of exposure, and small sample size. Performance of inverse probability of treatment weighting depended heavily on dataset characteristics, with poor coverage and bias with low overlap. The advantage of regression for large data size was less clear in sensitivity analysis with a complementary log-log outcome generation mechanism and unmeasured confounding, with superior bias and error but lower coverage than nearest neighbour and caliper matching.

下载PDF全文

下载文献需遵守相关版权规定

论文标题