论文标题
在空中图像中进行任意面向对象检测的任务采样卷积
Task-wise Sampling Convolutions for Arbitrary-Oriented Object Detection in Aerial Images
论文作者
论文摘要
任意为导向的对象检测(AOOD)已被广泛应用于在遥感图像中以各种方向进行定位和分类对象。但是,AOOD模型中的本地化和分类任务的不一致特征可能会导致歧义和低质量的对象预测,从而限制了检测性能。在本文中,提出了一种称为任务采样卷积(TS-CONV)的Aood方法。 TS-CONV适应从各个敏感区域进行任务特征,并将这些特征映射为对齐方式,以指导动态标签分配以获得更好的预测。具体而言,TS-CONV中本地化卷积的采样位置由与空间坐标相关的定向边界框(OBB)预测进行监督,而分类卷积的采样位置和卷积核的设计旨在根据不同方向进行适应性调节,以改善功能的方向鲁棒性。此外,制定了动态任务符合的标签分配(DTLA)策略,以选择最佳候选位置,并根据从TS-CONV获得的排名的任务吸引分数动态分配标签。在涵盖多个场景,多模式图像和多个对象的几个公共数据集上进行了广泛的实验,证明了所提出的TS-CONV的有效性,可扩展性和出色的性能。
Arbitrary-oriented object detection (AOOD) has been widely applied to locate and classify objects with diverse orientations in remote sensing images. However, the inconsistent features for the localization and classification tasks in AOOD models may lead to ambiguity and low-quality object predictions, which constrains the detection performance. In this article, an AOOD method called task-wise sampling convolutions (TS-Conv) is proposed. TS-Conv adaptively samples task-wise features from respective sensitive regions and maps these features together in alignment to guide a dynamic label assignment for better predictions. Specifically, sampling positions of the localization convolution in TS-Conv are supervised by the oriented bounding box (OBB) prediction associated with spatial coordinates, while sampling positions and convolutional kernel of the classification convolution are designed to be adaptively adjusted according to different orientations for improving the orientation robustness of features. Furthermore, a dynamic task-consistent-aware label assignment (DTLA) strategy is developed to select optimal candidate positions and assign labels dynamically according to ranked task-aware scores obtained from TS-Conv. Extensive experiments on several public datasets covering multiple scenes, multimodal images, and multiple categories of objects demonstrate the effectiveness, scalability, and superior performance of the proposed TS-Conv.