关系事项：域自适应对象检测的基于图形的基于图形的关系推理

论文标题

关系事项：域自适应对象检测的基于图形的基于图形的关系推理

Relation Matters: Foreground-aware Graph-based Relational Reasoning for Domain Adaptive Object Detection

论文作者

Chen, Chaoqi, Li, Jiongcheng, Zhou, Hong-Yu, Han, Xiaoguang, Huang, Yue, Ding, Xinghao, Yu, Yizhou

论文摘要

域自适应对象检测（DAOD）着重于通过知识转移提高对象检测器的概括能力。 Daod的最新进展努力将适应过程的重点从全球转变为本地，以精细的特征比对方法。但是，由于忽略了域之间和域之间的明确依赖关系和相互作用，全球和局部对齐方法都无法捕获不同前景对象之间的拓扑关系。在这种情况下，仅寻求一vs-One对准并不一定要确保确切的知识转移。此外，由于不准确的定位导致目标域导致了基于不太转移的区域（例如背景），因此基于常规的一致性方法可能容易遭受灾难性的过度拟合（例如背景）。为了解决这些问题，我们首先将daod作为开放式域适应问题，其中前景和背景分别被视为``已知类别''和``未知类别''。 Accordingly, we propose a new and general framework for DAOD, named Foreground-aware Graph-based Relational Reasoning (FGRR), which incorporates graph structures into the detection pipeline to explicitly model the intra- and inter-domain foreground object relations on both pixel and semantic spaces, thereby endowing the DAOD model with the capability of relational reasoning beyond the popular alignment-based paradigm.域间的视觉和语义相关性是通过两分图结构进行层次建模的，并且域内关系通过图形注意机制进行编码。经验结果表明，提出的FGRR超过了四个daod基准的最先进性能。

Domain Adaptive Object Detection (DAOD) focuses on improving the generalization ability of object detectors via knowledge transfer. Recent advances in DAOD strive to change the emphasis of the adaptation process from global to local in virtue of fine-grained feature alignment methods. However, both the global and local alignment approaches fail to capture the topological relations among different foreground objects as the explicit dependencies and interactions between and within domains are neglected. In this case, only seeking one-vs-one alignment does not necessarily ensure the precise knowledge transfer. Moreover, conventional alignment-based approaches may be vulnerable to catastrophic overfitting regarding those less transferable regions (e.g. backgrounds) due to the accumulation of inaccurate localization results in the target domain. To remedy these issues, we first formulate DAOD as an open-set domain adaptation problem, in which the foregrounds and backgrounds are seen as the ``known classes'' and ``unknown class'' respectively. Accordingly, we propose a new and general framework for DAOD, named Foreground-aware Graph-based Relational Reasoning (FGRR), which incorporates graph structures into the detection pipeline to explicitly model the intra- and inter-domain foreground object relations on both pixel and semantic spaces, thereby endowing the DAOD model with the capability of relational reasoning beyond the popular alignment-based paradigm. The inter-domain visual and semantic correlations are hierarchically modeled via bipartite graph structures, and the intra-domain relations are encoded via graph attention mechanisms. Empirical results demonstrate that the proposed FGRR exceeds the state-of-the-art performance on four DAOD benchmarks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题