与深度卷积神经网络的合奏学习优化医学图像分类的分析

论文标题

与深度卷积神经网络的合奏学习优化医学图像分类的分析

An Analysis on Ensemble Learning optimized Medical Image Classification with Deep Convolutional Neural Networks

论文作者

Müller, Dominik, Soto-Rey, Iñaki, Kramer, Frank

论文摘要

新颖和高性能的医学图像分类管道正在大力利用集合学习策略。合奏学习的想法是组装不同的模型或多个预测，从而提高预测性能。但是，这仍然是一个开放的问题，在多大程度上以及哪种集成学习策略对基于深度学习的医学图像分类管道有益。在这项工作中，我们提出了一条可重现的医学图像分类管道，以分析以下集合学习技术的性能影响：增强，堆叠和包装。该管道包括最新的预处理和图像增强方法以及9个深卷积神经网络体系结构。它应用于四个流行的医学成像数据集，具有不同的复杂性。此外，分析了12个用于组合多个预测的合并函数，从简单的统计功能（如未加权平均）到更复杂的基于学习的功能，例如支持向量机。我们的结果表明，堆叠实现了最大的F1得分增长性能增长。增强表现出一致的提高功能高达4％，并且也适用于基于单个模型的管道。基于交叉验证的装袋显示出巨大的性能增益接近堆积，这导致F1分数增加到 +11％。此外，我们证明了简单的统计池函数比更复杂的池函数相等甚至更好。我们得出的结论是，集成学习技术的整合是任何医学图像分类管道的有力方法，可以提高稳健性和提高性能。

Novel and high-performance medical image classification pipelines are heavily utilizing ensemble learning strategies. The idea of ensemble learning is to assemble diverse models or multiple predictions and, thus, boost prediction performance. However, it is still an open question to what extent as well as which ensemble learning strategies are beneficial in deep learning based medical image classification pipelines. In this work, we proposed a reproducible medical image classification pipeline for analyzing the performance impact of the following ensemble learning techniques: Augmenting, Stacking, and Bagging. The pipeline consists of state-of-the-art preprocessing and image augmentation methods as well as 9 deep convolution neural network architectures. It was applied on four popular medical imaging datasets with varying complexity. Furthermore, 12 pooling functions for combining multiple predictions were analyzed, ranging from simple statistical functions like unweighted averaging up to more complex learning-based functions like support vector machines. Our results revealed that Stacking achieved the largest performance gain of up to 13% F1-score increase. Augmenting showed consistent improvement capabilities by up to 4% and is also applicable to single model based pipelines. Cross-validation based Bagging demonstrated significant performance gain close to Stacking, which resulted in an F1-score increase up to +11%. Furthermore, we demonstrated that simple statistical pooling functions are equal or often even better than more complex pooling functions. We concluded that the integration of ensemble learning techniques is a powerful method for any medical image classification pipeline to improve robustness and boost performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题