论文标题
您需要重新连接吗?建模用于检测眼底图像中引用的糖尿病性视网膜病变的强基线
A ResNet is All You Need? Modeling A Strong Baseline for Detecting Referable Diabetic Retinopathy in Fundus Images
论文作者
论文摘要
目前,深度学习是对彩色眼底照片(CFP)自动检测自动检测的最新检测。虽然普遍的兴趣是通过方法学创新提高结果,但与经过适当设置训练的标准深层分类模型相比,这些方法的表现尚不清楚。在本文中,我们建议基于简单且标准的Resnet-18体系结构为该任务建模强大的基线。为此,我们通过使用标准的预处理策略训练模型,但使用来自多个公共资源的图像和经验校准的数据增强设置来建立在先前的艺术之上。为了评估其表现,我们涵盖了多种临床相关的观点,包括图像和患者水平DR筛查,通过输入质量和DR等级来区分反应,评估模型不确定性并以定性的方式分析其结果。除了经过精心设计的培训外,我们的Resnet模型没有其他方法论创新,因此在组合的61007个测试图像组合的61007个测试图像中获得了AUC = 0.955(0.953-0.956),该测试图是来自不同公共数据集的,这是排队甚至比其他更复杂的深度学习模型所报道的其他更复杂的深度学习模型。从两个专门为本研究准备的独立内部数据库中获得的480张图像中获得了类似的AUC值,这强调了其泛化能力。这证实,如果经过适当的培训,标准网络仍然可以成为此任务的强大基线。
Deep learning is currently the state-of-the-art for automated detection of referable diabetic retinopathy (DR) from color fundus photographs (CFP). While the general interest is put on improving results through methodological innovations, it is not clear how good these approaches perform compared to standard deep classification models trained with the appropriate settings. In this paper we propose to model a strong baseline for this task based on a simple and standard ResNet-18 architecture. To this end, we built on top of prior art by training the model with a standard preprocessing strategy but using images from several public sources and an empirically calibrated data augmentation setting. To evaluate its performance, we covered multiple clinically relevant perspectives, including image and patient level DR screening, discriminating responses by input quality and DR grade, assessing model uncertainties and analyzing its results in a qualitative manner. With no other methodological innovation than a carefully designed training, our ResNet model achieved an AUC = 0.955 (0.953 - 0.956) on a combined test set of 61007 test images from different public datasets, which is in line or even better than what other more complex deep learning models reported in the literature. Similar AUC values were obtained in 480 images from two separate in-house databases specially prepared for this study, which emphasize its generalization ability. This confirms that standard networks can still be strong baselines for this task if properly trained.