论文标题

单发两管孔检测器,具有纠正的损失

Single-Shot Two-Pronged Detector with Rectified IoU Loss

论文作者

Wang, Keyang, Zhang, Lei

论文摘要

在基于CNN的对象检测器中,特征金字塔被广泛利用以减轻对象实例的规模变化问题。这些对象检测器通过自上而下的路径和横向连接增强功能,主要是为了丰富低级特征的语义信息,但忽略了高级功能的增强。这可能会导致不同级别的功能之间的失衡,特别是严重缺乏高级功能中的详细信息,这使得很难获得准确的边界框。在本文中,我们介绍了一种新颖的两管齐下的托管想法,以探索向后和向前方向的不同层之间的关系,这可以同时丰富低级特征的语义信息和高级特征的详细信息。在两个普遍的想法的指导下,我们提出了一个两管齐的网络(TPNET),以实现高级特征和低级特征之间的双向传递,这对于在不同尺度上准确检测对象很有用。此外,由于单阶段探测器中的硬样品和简易样品之间的分布不平衡,定位损失的梯度始终以较差的定位准确性的硬示例主导。这将使模型能够偏向硬样品。因此,在我们的TPNET中,提出了一种基于自适应的本地化损失,称为Rectified IOU(RIOU)损失,以纠正每种样本的梯度。纠正的IOU损失增加了IOU较高的示例的梯度,同时抑制了低IOU的示例梯度,从而可以提高模型的整体定位精度。广泛的实验证明了我们的TPNET和RIOU损失的优势。

In the CNN based object detectors, feature pyramids are widely exploited to alleviate the problem of scale variation across object instances. These object detectors, which strengthen features via a top-down pathway and lateral connections, are mainly to enrich the semantic information of low-level features, but ignore the enhancement of high-level features. This can lead to an imbalance between different levels of features, in particular a serious lack of detailed information in the high-level features, which makes it difficult to get accurate bounding boxes. In this paper, we introduce a novel two-pronged transductive idea to explore the relationship among different layers in both backward and forward directions, which can enrich the semantic information of low-level features and detailed information of high-level features at the same time. Under the guidance of the two-pronged idea, we propose a Two-Pronged Network (TPNet) to achieve bidirectional transfer between high-level features and low-level features, which is useful for accurately detecting object at different scales. Furthermore, due to the distribution imbalance between the hard and easy samples in single-stage detectors, the gradient of localization loss is always dominated by the hard examples that have poor localization accuracy. This will enable the model to be biased toward the hard samples. So in our TPNet, an adaptive IoU based localization loss, named Rectified IoU (RIoU) loss, is proposed to rectify the gradients of each kind of samples. The Rectified IoU loss increases the gradients of examples with high IoU while suppressing the gradients of examples with low IoU, which can improve the overall localization accuracy of model. Extensive experiments demonstrate the superiority of our TPNet and RIoU loss.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源