论文标题
在前列腺组织学图像中自动检测丝布状生长模式
Automated Detection of Cribriform Growth Patterns in Prostate Histology Images
论文作者
论文摘要
前列腺癌的筛状生长模式与预后不良有关。我们旨在引入一种深入学习方法来自动检测此类模式。为此,对卷积神经网络进行了训练,以检测128个前列腺活检的曲折生长模式。考虑到训练期间其他肿瘤生长模式的合奏学习用于应对异质性和有限的肿瘤组织的发生。对ROC和FROC分析进行了用于评估有关具有丝布状生长模式的活检的网络性能。 ROC分析在曲线下产生了一个平均面积,最高为0.81。 FROC分析表明,对于大于0.0150 mm2的区域的灵敏度为0.9,平均为7.5个假阳性。对于观察者内注释变异性的基准方法性能,病理学家重新评估了假阳性和阴性检测。病理学家将9%的假阳性区域视为筛状区域,而11%可能是筛状的。 44%的假阴性区域没有注释为筛状。作为最终实验,该网络还应用于由23位病理学家注释的60个活检区域的数据集。随着截止的最高灵敏度,所有将至少7/23的病理学家注释为筛状的图像均被网络检测为纤维状,并将9/60的图像检测为筛状的,而没有病理学家将其标记为这样。总之,提出的深度学习方法具有高灵敏度,可用于以有限数量的假阳性为代价来检测筛状的生长模式。它可以检测到至少少数病理学家标记的丝布状区域。因此,它可以通过建议可疑区域来帮助临床决策。
Cribriform growth patterns in prostate carcinoma are associated with poor prognosis. We aimed to introduce a deep learning method to detect such patterns automatically. To do so, convolutional neural network was trained to detect cribriform growth patterns on 128 prostate needle biopsies. Ensemble learning taking into account other tumor growth patterns during training was used to cope with heterogeneous and limited tumor tissue occurrences. ROC and FROC analyses were applied to assess network performance regarding detection of biopsies harboring cribriform growth pattern. The ROC analysis yielded a mean area under the curve up to 0.81. FROC analysis demonstrated a sensitivity of 0.9 for regions larger than 0.0150 mm2 with on average 7.5 false positives. To benchmark method performance for intra-observer annotation variability, false positive and negative detections were re-evaluated by the pathologists. Pathologists considered 9% of the false positive regions as cribriform, and 11% as possibly cribriform; 44% of the false negative regions were not annotated as cribriform. As a final experiment, the network was also applied on a dataset of 60 biopsy regions annotated by 23 pathologists. With the cut-off reaching highest sensitivity, all images annotated as cribriform by at least 7/23 of the pathologists, were all detected as cribriform by the network and 9/60 of the images were detected as cribriform whereas no pathologist labelled them as such. In conclusion, the proposed deep learning method has high sensitivity for detecting cribriform growth patterns at the expense of a limited number of false positives. It can detect cribriform regions that are labelled as such by at least a minority of pathologists. Therefore, it could assist clinical decision making by suggesting suspicious regions.