在状态对象组成部分中相互平衡的组成零照片学习

论文标题

在状态对象组成部分中相互平衡的组成零照片学习

Mutual Balancing in State-Object Components for Compositional Zero-Shot Learning

论文作者

Jiang, Chenyi, Chen, Dubing, Wang, Shidong, Shen, Yuming, Zhang, Haofeng, Shao, Ling

论文摘要

组成零射击学习（CZSL）旨在识别所见状态和对象的看不见的构图。手动标记的语义信息及其实际的视觉特征之间的差异导致各种对象类和状态类的分布的视觉偏差显着失衡，这被现有方法忽略了。为了改善这些问题，我们将CZSL任务视为一项不平衡的多标签分类任务，并提出了一种新的方法，称为CZSL的状态对象组件（必须），该方法为模型提供了平衡的电感偏见。特别是，我们将组成类别的分类分为两个连续的过程，以分析两个组件的纠缠以提前获得其他知识，这反映了两个组件之间的视觉偏差程度。我们使用所获得的知识来修改模型的训练过程，以便为具有重大视觉偏差的类生成更不同的类边界。广泛的实验表明，当与基本CZSL框架结合使用时，我们的方法明显优于MIT态，UT-ZAPPOS和C-GQA的最新方法，并且可以改善各种CZSL框架。我们的代码可在https://anonymon.4open.science/r/must_cge/上找到。

Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions from seen states and objects. The disparity between the manually labeled semantic information and its actual visual features causes a significant imbalance of visual deviation in the distribution of various object classes and state classes, which is ignored by existing methods. To ameliorate these issues, we consider the CZSL task as an unbalanced multi-label classification task and propose a novel method called MUtual balancing in STate-object components (MUST) for CZSL, which provides a balancing inductive bias for the model. In particular, we split the classification of the composition classes into two consecutive processes to analyze the entanglement of the two components to get additional knowledge in advance, which reflects the degree of visual deviation between the two components. We use the knowledge gained to modify the model's training process in order to generate more distinct class borders for classes with significant visual deviations. Extensive experiments demonstrate that our approach significantly outperforms the state-of-the-art on MIT-States, UT-Zappos, and C-GQA when combined with the basic CZSL frameworks, and it can improve various CZSL frameworks. Our codes are available on https://anonymous.4open.science/r/MUST_CGE/.

下载PDF全文

下载文献需遵守相关版权规定

论文标题