通过动态特征聚合的强大表示

论文标题

通过动态特征聚合的强大表示

Robust Representation via Dynamic Feature Aggregation

论文作者

Liu, Haozhe, Ji, Haoqin, Li, Yuexiang, He, Nanjun, Wu, Haoqian, Liu, Feng, Shen, Linlin, Zheng, Yefeng

论文摘要

基于深度卷积神经网络（CNN）模型容易受到对抗性攻击的影响。可能的原因之一是，基于CNN的模型的嵌入空间稀疏，为生成对抗样品的生成很大。在这项研究中，我们提出了一种被称为动态特征聚集的方法，以通过新颖的正则化来压缩嵌入空间。特别是，两个样品之间的凸组合被认为是聚集的枢轴。在嵌入空间中，所选样品被指导为类似于枢轴的表示。另一方面，为了减轻这种正则化的琐碎解决方案，模型的最后一个完全连接的层被正交分类器取代，其中不同类别的嵌入式代码是正交和分别处理的。借助正则化和正交分类器，可以获得更紧凑的嵌入空间，因此可以提高模型对对抗性攻击的鲁棒性。我们的CIFAR-10方法对各种攻击方法实现了56.91％的平均精度，该方法可显着超过固体基线（混合）37.31％。更令人惊讶的是，经验结果表明，由于学到的紧凑特征空间，该提出的方法还可以实现无法分布（OOD）检测的最新性能。当采用CIFAR-10作为分布（ID）数据集和LSUN作为OOD数据集时，通过提出的方法实现了0.937的F1分数。代码可在https://github.com/haozheliu-st/dynamicfeatureaggregregation上找到。

Deep convolutional neural network (CNN) based models are vulnerable to the adversarial attacks. One of the possible reasons is that the embedding space of CNN based model is sparse, resulting in a large space for the generation of adversarial samples. In this study, we propose a method, denoted as Dynamic Feature Aggregation, to compress the embedding space with a novel regularization. Particularly, the convex combination between two samples are regarded as the pivot for aggregation. In the embedding space, the selected samples are guided to be similar to the representation of the pivot. On the other side, to mitigate the trivial solution of such regularization, the last fully-connected layer of the model is replaced by an orthogonal classifier, in which the embedding codes for different classes are processed orthogonally and separately. With the regularization and orthogonal classifier, a more compact embedding space can be obtained, which accordingly improves the model robustness against adversarial attacks. An averaging accuracy of 56.91% is achieved by our method on CIFAR-10 against various attack methods, which significantly surpasses a solid baseline (Mixup) by a margin of 37.31%. More surprisingly, empirical results show that, the proposed method can also achieve the state-of-the-art performance for out-of-distribution (OOD) detection, due to the learned compact feature space. An F1 score of 0.937 is achieved by the proposed method, when adopting CIFAR-10 as in-distribution (ID) dataset and LSUN as OOD dataset. Code is available at https://github.com/HaozheLiu-ST/DynamicFeatureAggregation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题