论文标题
用最小的,有针对性的图像操纵欺骗灵长类动物的大脑
Fooling the primate brain with minimal, targeted image manipulation
论文作者
论文摘要
人工神经网络(ANN)被认为是当前的生物视觉最佳模型。 ANN是腹中神经活动的最佳预测指标。此外,最近的工作表明,适合神经元活动的ANN模型可以指导驱动小神经元种群中预先指定响应模式的图像的综合。尽管在预测和转向发射活动方面取得了成功,但这些结果尚未与知觉或行为变化有关。在这里,我们提出了一系列方法,用于创建最小,有针对性的图像扰动,从而导致神经元活动和感知的变化,如行为所反映的。我们生成了人脸,猴子面孔和噪声模式的“欺骗性图像”,以便将它们视为不同的,预先指定的目标类别,并测量了对这些图像的猴子神经元反应和人类行为。我们发现了几种改变灵长类动物视觉分类的有效方法,这些方法需要更小的图像变化,而不是固定的噪声。我们的工作与对抗性攻击共享相同的目标,即用最小的,有针对性的噪声操纵图像,导致Ann模型错误地分类图像。我们的结果代表了量化和表征生物和人造视力扰动鲁棒性差异的宝贵步骤。
Artificial neural networks (ANNs) are considered the current best models of biological vision. ANNs are the best predictors of neural activity in the ventral stream; moreover, recent work has demonstrated that ANN models fitted to neuronal activity can guide the synthesis of images that drive pre-specified response patterns in small neuronal populations. Despite the success in predicting and steering firing activity, these results have not been connected with perceptual or behavioral changes. Here we propose an array of methods for creating minimal, targeted image perturbations that lead to changes in both neuronal activity and perception as reflected in behavior. We generated 'deceptive images' of human faces, monkey faces, and noise patterns so that they are perceived as a different, pre-specified target category, and measured both monkey neuronal responses and human behavior to these images. We found several effective methods for changing primate visual categorization that required much smaller image change compared to untargeted noise. Our work shares the same goal with adversarial attack, namely the manipulation of images with minimal, targeted noise that leads ANN models to misclassify the images. Our results represent a valuable step in quantifying and characterizing the differences in perturbation robustness of biological and artificial vision.