深度功能扩展用于遮挡图像分类

论文标题

深度功能扩展用于遮挡图像分类

Deep Feature Augmentation for Occluded Image Classification

论文作者

Cen, Feng, Zhao, Xiaoyu, Li, Wuzhuang, Wang, Guanghui

论文摘要

由于难以获取大规模的特定任务封闭图像，因此具有深卷积神经网络（CNN）的遮挡图像的分类仍然是高度挑战。为了减轻对大规模遮挡图像数据集的依赖性，我们提出了一种新的方法，通过用一组增强的深度特征向量（DFVS）微调预训练的模型，以提高遮挡图像的分类精度。增强DFV集由原始DFV和伪DFV组成。伪DFV是通过随机添加差异向量（DVS）生成的，从一小部分清洁和遮挡的图像对提取到真实的DFV。在微调中，在DFV数据流程上进行了后传播，以更新网络参数。各种数据集和网络结构上的实验表明，深度功能增强显着提高了遮挡图像的分类精度，而不会对清洁图像的性能产生明显影响。具体而言，在ILSVRC2012数据集中，带有合成的遮挡图像的数据集，所提出的方法分别在咬合闭合训练和包括咬合的训练集中对Resnet50网络的分类准确性的平均分类精度达到11.21％和9.14％。

Due to the difficulty in acquiring massive task-specific occluded images, the classification of occluded images with deep convolutional neural networks (CNNs) remains highly challenging. To alleviate the dependency on large-scale occluded image datasets, we propose a novel approach to improve the classification accuracy of occluded images by fine-tuning the pre-trained models with a set of augmented deep feature vectors (DFVs). The set of augmented DFVs is composed of original DFVs and pseudo-DFVs. The pseudo-DFVs are generated by randomly adding difference vectors (DVs), extracted from a small set of clean and occluded image pairs, to the real DFVs. In the fine-tuning, the back-propagation is conducted on the DFV data flow to update the network parameters. The experiments on various datasets and network structures show that the deep feature augmentation significantly improves the classification accuracy of occluded images without a noticeable influence on the performance of clean images. Specifically, on the ILSVRC2012 dataset with synthetic occluded images, the proposed approach achieves 11.21% and 9.14% average increases in classification accuracy for the ResNet50 networks fine-tuned on the occlusion-exclusive and occlusion-inclusive training sets, respectively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题