克服基于斑块的学习的局限性，以检测整个幻灯片图像中的癌症

论文标题

克服基于斑块的学习的局限性，以检测整个幻灯片图像中的癌症

Overcoming the limitations of patch-based learning to detect cancer in whole slide images

论文作者

Ciga, Ozan, Xu, Tony, Nofech-Mozes, Sharon, Noy, Shawna, Lu, Fang-I, Martel, Anne L.

论文摘要

训练深度学习模型时，整个幻灯片图像（WSIS）构成了独特的挑战。它们非常大，因此有必要将每个图像分解为较小的补丁以进行分析，必须在多个尺度上提取图像特征才能捕获细节和上下文，并且可能存在极端的类不平衡。在对这些图像的分析中，取得了重大进展，这在很大程度上要归功于公共注释的数据集的可用性。但是，我们假设，即使方法在挑战任务上得分良好，这种成功也可能无法转化为更临床相关的工作流程中的良好表现。许多数据集由可能遭受数据策划偏差的图像贴片组成；其他数据集仅在整个幻灯片级别上标记，只要最终决定正确，图像上缺乏注释就可能掩盖了错误的本地预测。在本文中，我们概述了需要在整个幻灯片上准确定位或分割癌症的贴片或幻灯片级分类与方法之间的差异，并且我们通过实验验证了两种情况下的最佳实践是否有所不同。我们在新辅助治疗后乳腺癌WSI上应用二元癌症检测网络，以找到肿瘤床，概述了癌症的程度，这项任务需要在整个滑梯上灵敏度和精确性。我们广泛研究了多种设计选择及其对结果的影响，包括架构和增强。此外，我们提出了一种负数据采样策略，该策略大大降低了假阳性率（在幻灯片水平上为7％），并改善了与我们的问题相关的每个度量，肿瘤程度的误差降低了15％。

Whole slide images (WSIs) pose unique challenges when training deep learning models. They are very large which makes it necessary to break each image down into smaller patches for analysis, image features have to be extracted at multiple scales in order to capture both detail and context, and extreme class imbalances may exist. Significant progress has been made in the analysis of these images, thanks largely due to the availability of public annotated datasets. We postulate, however, that even if a method scores well on a challenge task, this success may not translate to good performance in a more clinically relevant workflow. Many datasets consist of image patches which may suffer from data curation bias; other datasets are only labelled at the whole slide level and the lack of annotations across an image may mask erroneous local predictions so long as the final decision is correct. In this paper, we outline the differences between patch or slide-level classification versus methods that need to localize or segment cancer accurately across the whole slide, and we experimentally verify that best practices differ in both cases. We apply a binary cancer detection network on post neoadjuvant therapy breast cancer WSIs to find the tumor bed outlining the extent of cancer, a task which requires sensitivity and precision across the whole slide. We extensively study multiple design choices and their effects on the outcome, including architectures and augmentations. Furthermore, we propose a negative data sampling strategy, which drastically reduces the false positive rate (7% on slide level) and improves each metric pertinent to our problem, with a 15% reduction in the error of tumor extent.

下载PDF全文

下载文献需遵守相关版权规定

论文标题