Saunet：适合可解释的医学图像分割的Actentive U-NET

论文标题

Saunet：适合可解释的医学图像分割的Actentive U-NET

SAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation

论文作者

Sun, Jesse, Darbehani, Fatemeh, Zaidi, Mark, Wang, Bo

论文摘要

对于许多临床操作（例如心脏双重室量估计），医疗图像分割是一项困难但重要的任务。最近，已经转移了使用深度学习和完全卷积神经网络（CNN）进行图像分割的转变，这在许多公共基准数据集中产生了最先进的结果。尽管在医学图像细分中进行了深入学习的进展，但标准CNN在临床环境中仍未完全采用，因为它们缺乏可靠性和可解释性。形状通常比仅仅是图像的纹理更有意义，这些特征是常规CNN学习的特征，从而导致缺乏健壮性。同样，围绕模型可解释性的以前的作品集中在基于事后梯度后的显着性方法上。但是，基于梯度的显着性方法通常需要事后发生的其他计算，并且已证明对可解释性是不可靠的。因此，我们提出了一个名为Shape Actentive U-Net（Saunet）的新体系结构，该体系结构着重于模型的解释性和鲁棒性。所提出的体系结构试图通过使用辅助形状流来解决这些局限性，该流与常规纹理流并行捕获丰富的形状相关信息。此外，我们建议可以使用我们的双重注意解码器模块来学习多分辨率显着图，该模块允许多级别的可解释性，并减轻事后事后进行其他计算的需求。我们的方法还可以在SUN09和AC17的两个大型公共心脏MRI图像分割数据集上实现最新的结果。

Medical image segmentation is a difficult but important task for many clinical operations such as cardiac bi-ventricular volume estimation. More recently, there has been a shift to utilizing deep learning and fully convolutional neural networks (CNNs) to perform image segmentation that has yielded state-of-the-art results in many public benchmark datasets. Despite the progress of deep learning in medical image segmentation, standard CNNs are still not fully adopted in clinical settings as they lack robustness and interpretability. Shapes are generally more meaningful features than solely textures of images, which are features regular CNNs learn, causing a lack of robustness. Likewise, previous works surrounding model interpretability have been focused on post hoc gradient-based saliency methods. However, gradient-based saliency methods typically require additional computations post hoc and have been shown to be unreliable for interpretability. Thus, we present a new architecture called Shape Attentive U-Net (SAUNet) which focuses on model interpretability and robustness. The proposed architecture attempts to address these limitations by the use of a secondary shape stream that captures rich shape-dependent information in parallel with the regular texture stream. Furthermore, we suggest multi-resolution saliency maps can be learned using our dual-attention decoder module which allows for multi-level interpretability and mitigates the need for additional computations post hoc. Our method also achieves state-of-the-art results on the two large public cardiac MRI image segmentation datasets of SUN09 and AC17.

下载PDF全文

下载文献需遵守相关版权规定

论文标题