论文标题
对深神经网络的不可察觉和多通道后门攻击
Imperceptible and Multi-channel Backdoor Attack against Deep Neural Networks
论文作者
论文摘要
最近的研究表明,深神经网络(DNN)模型容易受到后门攻击的影响。当包含后门触发器的图像到达时,后排DNN模型将恶意行为。迄今为止,现有的后门攻击是单触发器和单目标攻击,大多数现有后门攻击的触发器很明显,因此很容易被检测到或注意到。在本文中,我们通过利用离散的余弦变换(DCT)隐志提出了一种新颖的不察觉和多通道后门攻击,对深神经网络。根据提出的后门攻击方法,我们实施了两个后门攻击的变体,即N到N后门攻击和N到一对后门攻击。具体而言,对于彩色图像,我们利用DCT隐志在图像的不同通道上构造触发器。结果,扳机是隐形和自然的。基于提出的方法,我们实施了多目标和多触发后门攻击。实验结果表明,CIFAR-10数据集的N到N后门攻击的平均攻击成功率分别为93.95%,Tinyimagenet数据集的平均攻击率分别为91.55%。 CIFAR-10和Tinyimagenet数据集的N至一对攻击的平均攻击成功率分别为90.22%和89.53%。同时,提出的后门攻击不影响DNN模型的分类精度。此外,拟议的袭击事实证明对最先进的后门防御(神经清洁)是强大的。
Recent researches demonstrate that Deep Neural Networks (DNN) models are vulnerable to backdoor attacks. The backdoored DNN model will behave maliciously when images containing backdoor triggers arrive. To date, existing backdoor attacks are single-trigger and single-target attacks, and the triggers of most existing backdoor attacks are obvious thus are easy to be detected or noticed. In this paper, we propose a novel imperceptible and multi-channel backdoor attack against Deep Neural Networks by exploiting Discrete Cosine Transform (DCT) steganography. Based on the proposed backdoor attack method, we implement two variants of backdoor attacks, i.e., N-to-N backdoor attack and N-to-One backdoor attack. Specifically, for a colored image, we utilize DCT steganography to construct the trigger on different channels of the image. As a result, the trigger is stealthy and natural. Based on the proposed method, we implement multi-target and multi-trigger backdoor attacks. Experimental results demonstrate that the average attack success rate of the N-to-N backdoor attack is 93.95% on CIFAR-10 dataset and 91.55% on TinyImageNet dataset, respectively. The average attack success rate of N-to-One attack is 90.22% and 89.53% on CIFAR-10 and TinyImageNet datasets, respectively. Meanwhile, the proposed backdoor attack does not affect the classification accuracy of the DNN model. Moreover, the proposed attack is demonstrated to be robust to the state-of-the-art backdoor defense (Neural Cleanse).