论文标题
对机器学习的信息审查,对危险比的强劲推断
Doubly Robust Inference for Hazard Ratio under Informative Censoring with Machine Learning
论文作者
论文摘要
传统上使用对数秩检验的随机临床试验,随后是COX比例危害(PH)模型来估计治疗组之间的危害比。在假设右审查机制是非信息的,即独立于每个治疗组中的活动时间。更一般而言,检查时间可能取决于其他协变量,并且可以使用审查权重(IPCW)的逆概率来纠正信息范围内的审查造成的偏差。 IPCW需要正确指定的审查时间模型在治疗和协变量上。由于COX模型的非碰撞性,在这种情况下在这种情况下的双重鲁棒推断是不合理的。但是,随着数据自适应机器学习方法的最新开发,我们得出了增强的IPCW(AIPCW)估计器,该估计器具有以下双重稳健(DR)的属性:它是型号的型号,因为它是一致且渐近的正常(CAN),只要有两种模型,即用于失败的时间和一个用于陈述的时间,用于CESCENSING CESSINGINDECESSING,是正确的;它也是双重稳定的,因为只要这两个模型下的估计错误率的乘积比根$ n $快。我们使用有限样品中的大量模拟研究了AIPCW估计器。
Randomized clinical trials with time-to-event outcomes have traditionally used the log-rank test followed by the Cox proportional hazards (PH) model to estimate the hazard ratio between the treatment groups. These are valid under the assumption that the right-censoring mechanism is non-informative, i.e. independent of the time-to-event of interest within each treatment group. More generally, the censoring time might depend on additional covariates, and inverse probability of censoring weighting (IPCW) can be used to correct for the bias resulting from the informative censoring. IPCW requires a correctly specified censoring time model conditional on the treatment and the covariates. Doubly robust inference in this setting has not been plausible previously due to the non-collapsibility of the Cox model. However, with the recent development of data-adaptive machine learning methods we derive an augmented IPCW (AIPCW) estimator that has the following doubly robust (DR) properties: it is model doubly robust, in that it is consistent and asymptotic normal (CAN), as long as one of the two models, one for the failure time and one for the censoring time, is correctly specified; it is also rate doubly robust, in that it is CAN as long as the product of the estimation error rates under these two models is faster than root-$n$. We investigate the AIPCW estimator using extensive simulation in finite samples.