论文标题
通过学习对比和得分来解决场景图中的类不平衡
Addressing Class Imbalance in Scene Graph Parsing by Learning to Contrast and Score
论文作者
论文摘要
场景图解析旨在检测图像场景中的对象并识别其关系。最近的方法在一些流行的基准测试中取得了很高的平均得分,但由于数据的长期分布偏向于频繁的标签,但无法检测到罕见关系。本文在现实世界应用中检测到这些罕见关系可能至关重要的事实是至关重要的,因此引入了一个新颖的分类和排名综合框架,以解决场景图解析中的类不平衡问题。具体而言,我们设计了一种新的对比横向渗透损失,该损失通过抑制不正确的频繁促进稀有关系来促进罕见关系的检测。此外,我们提出了一个新颖的评分模块,称为得分手,该模块学会根据图像特征和关系特征对关系进行排名,以改善预测的回忆。我们的框架简单有效,可以将其纳入当前场景图模型中。实验结果表明,所提出的方法改善了当前的最新方法,具有检测到罕见关系的明显优势。
Scene graph parsing aims to detect objects in an image scene and recognize their relations. Recent approaches have achieved high average scores on some popular benchmarks, but fail in detecting rare relations, as the highly long-tailed distribution of data biases the learning towards frequent labels. Motivated by the fact that detecting these rare relations can be critical in real-world applications, this paper introduces a novel integrated framework of classification and ranking to resolve the class imbalance problem in scene graph parsing. Specifically, we design a new Contrasting Cross-Entropy loss, which promotes the detection of rare relations by suppressing incorrect frequent ones. Furthermore, we propose a novel scoring module, termed as Scorer, which learns to rank the relations based on the image features and relation features to improve the recall of predictions. Our framework is simple and effective, and can be incorporated into current scene graph models. Experimental results show that the proposed approach improves the current state-of-the-art methods, with a clear advantage of detecting rare relations.