论文标题

通用对抗扰动:小图像数据集的效率

Universal Adversarial Perturbations: Efficiency on a small image dataset

论文作者

Radji, Waris

论文摘要

尽管神经网络在图像分类任务上表现良好,但它们仍然容易受到对抗性扰动的影响,这些扰动可以欺骗神经网络而无需明显地更改输入图像。一篇论文表明了普遍的对抗性扰动的存在,这些扰动将其添加到任何图像中,将以很高的可能性欺骗神经网络。在本文中,我们将尝试重现通用对抗扰动论文的经验,但是在较小的神经网络架构和训练集中,以便能够研究计算出的扰动的效率。

Although neural networks perform very well on the image classification task, they are still vulnerable to adversarial perturbations that can fool a neural network without visibly changing an input image. A paper has shown the existence of Universal Adversarial Perturbations which when added to any image will fool the neural network with a very high probability. In this paper we will try to reproduce the experience of the Universal Adversarial Perturbations paper, but on a smaller neural network architecture and training set, in order to be able to study the efficiency of the computed perturbation.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源