与少数拍摄对象检测的顶级元学习方法

论文标题

与少数拍摄对象检测的顶级元学习方法

Top-Related Meta-Learning Method for Few-Shot Object Detection

论文作者

Li, Qian, Guo, Nan, Ye, Xiaochun, Wang, Duo, Fan, Dongrui, Tang, Zhimin

论文摘要

提出了许多用于几次检测的元学习方法。但是，以前的大多数方法都有两个主要问题，检测AP不良，并且由于数据集不平衡而存在强大的偏见。先前的工作主要通过其他数据集，多关系注意机制和子模块来减轻这些问题。但是，它们需要更多的成本。在这项工作中，对于元学习，我们发现主要挑战集中在类别之间相关或无关的语义特征上。因此，基于语义特征，我们提出了用于分类任务的顶级C分类损失（即TCL-C），以及由Meta模型获得的基于类别的元元功能的基于类别的分组机制。 TCL-C利用了真正标签的预测和最可能的C-1错误分类预测，以提高几个射击类别的检测性能。根据相似的外观（即视觉外观，形状和四肢等）以及对象经常出现的环境，基于类别的分组机制将类别分为脱节组，以使组中类别之间的类似语义特征更加紧凑，并在组之间获得更重要的差异，减轻强偏置问题，从而进一步改善检测APS。整个培训由基本模型和微调阶段组成。根据分组机制，我们将元模型获得的元功能向量分组，因此组之间的分布差很明显，并且每个组中的分布差异较小。 Pascal VOC数据集的广泛实验表明，将TCL-C与基于类别的分组相结合的我们的分组大大优于先前的最新方法，用于几次检测。与以前的竞争基线相比，我们的检测AP将近4％提高了4％。

Many meta-learning methods are proposed for few-shot detection. However, previous most methods have two main problems, poor detection APs, and strong bias because of imbalance and insufficient datasets. Previous works mainly alleviate these issues by additional datasets, multi-relation attention mechanisms and sub-modules. However, they require more cost. In this work, for meta-learning, we find that the main challenges focus on related or irrelevant semantic features between categories. Therefore, based on semantic features, we propose a Top-C classification loss (i.e., TCL-C) for classification task and a category-based grouping mechanism for category-based meta-features obtained by the meta-model. The TCL-C exploits the true-label prediction and the most likely C-1 false classification predictions to improve detection performance on few-shot classes. According to similar appearance (i.e., visual appearance, shape, and limbs etc.) and environment in which objects often appear, the category-based grouping mechanism splits categories into disjoint groups to make similar semantic features more compact between categories within a group and obtain more significant difference between groups, alleviating the strong bias problem and further improving detection APs. The whole training consists of the base model and the fine-tuning phases. According to grouping mechanism, we group the meta-features vectors obtained by meta-model, so that the distribution difference between groups is obvious, and the one within each group is less. Extensive experiments on Pascal VOC dataset demonstrate that ours which combines the TCL-C with category-based grouping significantly outperforms previous state-of-the-art methods for few-shot detection. Compared with previous competitive baseline, ours improves detection APs by almost 4% for few-shot detection.

下载PDF全文

下载文献需遵守相关版权规定

论文标题