论文标题
通过基于内部分歧的OOD检测器,用于广义零摄取学习的语义多样性转移网络
Semantic-diversity transfer network for generalized zero-shot learning via inner disagreement based OOD detector
论文作者
论文摘要
零拍学习(ZSL)旨在识别看不见的类别的对象,其中内核问题是通过在视觉和语义特征之间建立适当的映射来将知识从可见的类转移到看不见的类。许多现有作品中的知识转移主要是由于1)广泛使用的视觉特征是全局的,但与语义属性不完全一致; 2)在现有作品中只学会了一个映射,这无法有效地模拟各种视觉语义关系; 3)无法有效处理广义ZSL(GZSL)中的偏差问题。在本文中,我们提出了两种减轻这些局限性的技术。首先,我们提出了一个针对前两个限制的语义多样性传输网络(SETNET),其中1)提议进行多个注意事项结构和多样性正常化程序,以学习多个与语义属性更一致的局部视觉特征,并且2)投影仪的合奏,该集合具有多种多样的局部特征,以使局部具有多样性的局部关系,从而模拟了各种视觉范围的关系。其次,我们提出了一个基于内部分歧的域检测模块(ID3M),以减轻第三个限制,该限制在类级分类之前挑选了看不见的类数据。由于在训练阶段没有看不见的级别数据,ID3M采用了一种新颖的独立训练方案,并根据设计的内部分歧标准检测出不看见的级别数据。三个公共数据集的实验结果表明,拟议的SETNET与探索的ID3M相比,与30美元的最新方法相比,ID3M的拟议SETNET取得了重大改进。
Zero-shot learning (ZSL) aims to recognize objects from unseen classes, where the kernel problem is to transfer knowledge from seen classes to unseen classes by establishing appropriate mappings between visual and semantic features. The knowledge transfer in many existing works is limited mainly due to the facts that 1) the widely used visual features are global ones but not totally consistent with semantic attributes; 2) only one mapping is learned in existing works, which is not able to effectively model diverse visual-semantic relations; 3) the bias problem in the generalized ZSL (GZSL) could not be effectively handled. In this paper, we propose two techniques to alleviate these limitations. Firstly, we propose a Semantic-diversity transfer Network (SetNet) addressing the first two limitations, where 1) a multiple-attention architecture and a diversity regularizer are proposed to learn multiple local visual features that are more consistent with semantic attributes and 2) a projector ensemble that geometrically takes diverse local features as inputs is proposed to model visual-semantic relations from diverse local perspectives. Secondly, we propose an inner disagreement based domain detection module (ID3M) for GZSL to alleviate the third limitation, which picks out unseen-class data before class-level classification. Due to the absence of unseen-class data in training stage, ID3M employs a novel self-contained training scheme and detects out unseen-class data based on a designed inner disagreement criterion. Experimental results on three public datasets demonstrate that the proposed SetNet with the explored ID3M achieves a significant improvement against $30$ state-of-the-art methods.