论文标题
克服与医学图像生成的数据共享的障碍:全面评估
Overcoming Barriers to Data Sharing with Medical Image Generation: A Comprehensive Evaluation
论文作者
论文摘要
共享个人身份信息的隐私问题是医学研究中数据共享的主要实用障碍。但是,在许多情况下,研究人员对特定个人的信息没有兴趣,而是旨在获得对同伙水平的见解。在这里,我们利用生成的对抗网络(GAN)创建完全由合成患者数据组成的衍生医学成像数据集。合成图像理想情况下,总体上具有与源数据集相似的统计属性,但不包含敏感的个人信息。我们评估了两个GAN模型生成的合成数据的质量,用于具有14个不同放射学发现和脑计算机断层扫描(CT)扫描的胸部X光片,并具有六种类型的颅内出血。我们通过在合成数据集或实际数据集训练的预测模型的性能差异来测量合成图像质量。我们发现,合成数据性能从减少的唯一标签组合数量减少中受益不成比例。我们的开源基准还表明,在每个类别的样本数量较低时,标签效果开始主导GAN训练。我们还进行了一项读者研究,其中训练有素的放射科医生的表现不如在区分中间分辨率的合成和真实医学图像方面的随机性更好。根据我们的基准结果,放射科医生的分类准确性在较高的空间分辨率水平下提高。我们的研究提供了有价值的指南和概述实用条件,在这些指南中,从合成医学图像中获得的见解与本来可以从真实成像数据中得出的见解相似。我们的结果表明,合成数据共享可能是在正确的设置中共享实际患者级数据的有吸引力且具有隐私性的替代方法。
Privacy concerns around sharing personally identifiable information are a major practical barrier to data sharing in medical research. However, in many cases, researchers have no interest in a particular individual's information but rather aim to derive insights at the level of cohorts. Here, we utilize Generative Adversarial Networks (GANs) to create derived medical imaging datasets consisting entirely of synthetic patient data. The synthetic images ideally have, in aggregate, similar statistical properties to those of a source dataset but do not contain sensitive personal information. We assess the quality of synthetic data generated by two GAN models for chest radiographs with 14 different radiology findings and brain computed tomography (CT) scans with six types of intracranial hemorrhages. We measure the synthetic image quality by the performance difference of predictive models trained on either the synthetic or the real dataset. We find that synthetic data performance disproportionately benefits from a reduced number of unique label combinations. Our open-source benchmark also indicates that at low number of samples per class, label overfitting effects start to dominate GAN training. We additionally conducted a reader study in which trained radiologists do not perform better than random on discriminating between synthetic and real medical images for intermediate levels of resolutions. In accordance with our benchmark results, the classification accuracy of radiologists increases at higher spatial resolution levels. Our study offers valuable guidelines and outlines practical conditions under which insights derived from synthetic medical images are similar to those that would have been derived from real imaging data. Our results indicate that synthetic data sharing may be an attractive and privacy-preserving alternative to sharing real patient-level data in the right settings.