学习比较关系：几次学习的语义一致性

论文标题

学习比较关系：几次学习的语义一致性

Learning to Compare Relation: Semantic Alignment for Few-Shot Learning

论文作者

Cao, Congqi, Zhang, Yanning

论文摘要

很少有学习是一个基本且具有挑战性的问题，因为它只需要从几个示例中识别出新的类别。识别对象具有多个变体，可以在图像中的任何位置定位。直接比较查询图像与示例图像无法处理内容未对准。比较的表示和指标至关重要，但由于几次学习中样本的稀缺性和广泛的差异，学习挑战。在本文中，我们提出了一种比较关系的新颖的语义一致性模型，这对内容不一致是可靠的。我们建议将两种关键成分添加到现有的少量学习框架中，以获得更好的功能和度量学习能力。首先，我们引入了语义一致性损失，以使属于同一类别的样本的特征的关系统计数据对齐。其次，引入了局部和全局互信息最大化，从而允许在图像中跨结构位置跨越本地和类内共享信息的表示形式。第三，我们通过考虑每个流的均质不确定性来引入一种原则性的方法来权衡多个损失函数。我们在几个几次学习数据集上进行了广泛的实验。实验结果表明，所提出的方法能够比较与语义一致性策略的关系，并实现最先进的性能。

Few-shot learning is a fundamental and challenging problem since it requires recognizing novel categories from only a few examples. The objects for recognition have multiple variants and can locate anywhere in images. Directly comparing query images with example images can not handle content misalignment. The representation and metric for comparison are critical but challenging to learn due to the scarcity and wide variation of the samples in few-shot learning. In this paper, we present a novel semantic alignment model to compare relations, which is robust to content misalignment. We propose to add two key ingredients to existing few-shot learning frameworks for better feature and metric learning ability. First, we introduce a semantic alignment loss to align the relation statistics of the features from samples that belong to the same category. And second, local and global mutual information maximization is introduced, allowing for representations that contain locally-consistent and intra-class shared information across structural locations in an image. Thirdly, we introduce a principled approach to weigh multiple loss functions by considering the homoscedastic uncertainty of each stream. We conduct extensive experiments on several few-shot learning datasets. Experimental results show that the proposed method is capable of comparing relations with semantic alignment strategies, and achieves state-of-the-art performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题