论文标题
通过内部过度激活分析来防御可见的对抗性攻击
Defending From Physically-Realizable Adversarial Attacks Through Internal Over-Activation Analysis
论文作者
论文摘要
这项工作提出了Z-Mask,这是一种强大而有效的策略,旨在改善卷积网络的对抗性鲁棒性,以防止具有物理变化的对抗性攻击。提出的防御依赖于对内部网络特征进行的特定Z分析分析,以检测和掩盖与输入图像中对抗对象相对应的像素。为此,在浅层和深层中检查了空间连续的激活,以暗示潜在的对抗区域。然后,通过多个售出机制来汇总此类建议。通过在语义分割和对象检测的模型上进行了一组广泛的实验,评估了Z掩码的有效性。评估均使用两个数字补丁添加到现实世界中的输入图像和印刷贴片中。获得的结果证实,从检测准确性和在攻击中的网络的总体性能方面,Z-Mask优于最先进的方法。其他实验表明,Z面具对可能的防御感知攻击也是强大的。
This work presents Z-Mask, a robust and effective strategy to improve the adversarial robustness of convolutional networks against physically-realizable adversarial attacks. The presented defense relies on specific Z-score analysis performed on the internal network features to detect and mask the pixels corresponding to adversarial objects in the input image. To this end, spatially contiguous activations are examined in shallow and deep layers to suggest potential adversarial regions. Such proposals are then aggregated through a multi-thresholding mechanism. The effectiveness of Z-Mask is evaluated with an extensive set of experiments carried out on models for both semantic segmentation and object detection. The evaluation is performed with both digital patches added to the input images and printed patches positioned in the real world. The obtained results confirm that Z-Mask outperforms the state-of-the-art methods in terms of both detection accuracy and overall performance of the networks under attack. Additional experiments showed that Z-Mask is also robust against possible defense-aware attacks.