通用对抗扰动：小图像数据集的效率

论文标题

通用对抗扰动：小图像数据集的效率

Universal Adversarial Perturbations: Efficiency on a small image dataset

论文作者

Radji, Waris

论文摘要

尽管神经网络在图像分类任务上表现良好，但它们仍然容易受到对抗性扰动的影响，这些扰动可以欺骗神经网络而无需明显地更改输入图像。一篇论文表明了普遍的对抗性扰动的存在，这些扰动将其添加到任何图像中，将以很高的可能性欺骗神经网络。在本文中，我们将尝试重现通用对抗扰动论文的经验，但是在较小的神经网络架构和训练集中，以便能够研究计算出的扰动的效率。

Although neural networks perform very well on the image classification task, they are still vulnerable to adversarial perturbations that can fool a neural network without visibly changing an input image. A paper has shown the existence of Universal Adversarial Perturbations which when added to any image will fool the neural network with a very high probability. In this paper we will try to reproduce the experience of the Universal Adversarial Perturbations paper, but on a smaller neural network architecture and training set, in order to be able to study the efficiency of the computed perturbation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题