论文标题
临床时间序列的不确定性感知变量归因网络
Uncertainty-Aware Variational-Recurrent Imputation Network for Clinical Time Series
论文作者
论文摘要
电子健康记录(EHR)包括纵向临床观察,这些纵向临床观察表现出稀疏性,不规则性和高维度,这成为绘制可靠下游临床结果的主要障碍。尽管存在许多解决这些问题的插补方法,但其中大多数忽略了相关的特征,时间动态,并且完全搁置了不确定性。由于缺少的价值估计值涉及不准确的风险,因此适用于与可靠数据不同某些信息较小的方法的方法。在这方面,我们可以利用不确定性来估计缺失值作为进一步利用的保真度得分来减轻偏见丢失价值估计的风险。在这项工作中,我们提出了一个新颖的变分归因网络,该网络通过考虑相关特征,时间动力学以及不确定性来统一插补和预测网络。具体而言,我们利用了插补的深层生成模型,该模型基于变量之间的分布以及一个经常出现的插定网络来利用时间关系,并结合利用不确定性。我们验证了我们提出的模型对两个公开可用的现实EHR数据集的有效性:2012 Physionet Challenge和Mimic-III,并将结果与文献中其他竞争的最先进方法进行了比较。
Electronic health records (EHR) consist of longitudinal clinical observations portrayed with sparsity, irregularity, and high-dimensionality, which become major obstacles in drawing reliable downstream clinical outcomes. Although there exist great numbers of imputation methods to tackle these issues, most of them ignore correlated features, temporal dynamics and entirely set aside the uncertainty. Since the missing value estimates involve the risk of being inaccurate, it is appropriate for the method to handle the less certain information differently than the reliable data. In that regard, we can use the uncertainties in estimating the missing values as the fidelity score to be further utilized to alleviate the risk of biased missing value estimates. In this work, we propose a novel variational-recurrent imputation network, which unifies an imputation and a prediction network by taking into account the correlated features, temporal dynamics, as well as the uncertainty. Specifically, we leverage the deep generative model in the imputation, which is based on the distribution among variables, and a recurrent imputation network to exploit the temporal relations, in conjunction with utilization of the uncertainty. We validated the effectiveness of our proposed model on two publicly available real-world EHR datasets: PhysioNet Challenge 2012 and MIMIC-III, and compared the results with other competing state-of-the-art methods in the literature.