学习构图表示有效的低弹性概括

论文标题

学习构图表示有效的低弹性概括

Learning Compositional Representations for Effective Low-Shot Generalization

论文作者

Mishra, Samarth, Zhu, Pengkai, Saligrama, Venkatesh

论文摘要

我们建议将识别作为部分组成（RPC），这是一种受人类认知启发的图像编码方法。它基于认知理论，即人类通过组件识别复杂的对象，并且它们建立了一个小的紧凑型概念词汇，以代表每个实例。 RPC首先将图像分解为显着部分来编码图像，然后将每个部分编码为少数原型的混合物，每个原型都代表某个概念。我们发现，受人类认知启发的这种学习可以克服低调泛化任务中深度卷积网络所面临的障碍，例如零射击学习，很少的学习学习和无监督的领域适应。此外，我们发现使用RPC图像编码器的分类器对于对抗性攻击非常健壮，已知深神经网络容易易受。鉴于我们的形象编码原理是基于人类认知的，因此人们希望这些编码可以被人类解释，我们发现通过众包实验发现了这种编码。最后，我们建议以生成合成属性注释的形式应用这些可解释的编码，以评估新数据集上的零摄像学习方法。

We propose Recognition as Part Composition (RPC), an image encoding approach inspired by human cognition. It is based on the cognitive theory that humans recognize complex objects by components, and that they build a small compact vocabulary of concepts to represent each instance with. RPC encodes images by first decomposing them into salient parts, and then encoding each part as a mixture of a small number of prototypes, each representing a certain concept. We find that this type of learning inspired by human cognition can overcome hurdles faced by deep convolutional networks in low-shot generalization tasks, like zero-shot learning, few-shot learning and unsupervised domain adaptation. Furthermore, we find a classifier using an RPC image encoder is fairly robust to adversarial attacks, that deep neural networks are known to be prone to. Given that our image encoding principle is based on human cognition, one would expect the encodings to be interpretable by humans, which we find to be the case via crowd-sourcing experiments. Finally, we propose an application of these interpretable encodings in the form of generating synthetic attribute annotations for evaluating zero-shot learning methods on new datasets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题