论文标题
魔鬼正在分类中:长尾对象检测和实例分割的简单框架
The Devil is in Classification: A Simple Framework for Long-tail Object Detection and Instance Segmentation
论文作者
论文摘要
大多数现有的对象实例检测和分割模型仅在平衡的基准测试中效果很好,在该基准测试中,每个类别培训样本数字是可比的,例如可可。他们倾向于在通常长尾巴的现实数据集上遭受性能下降。这项工作旨在研究和应对此类公开挑战。具体而言,我们系统地研究了最新的两阶段实例分割模型掩盖R-CNN的性能下降,并在最近的长尾LVIS数据集上揭示了主要原因是对象建议的不准确分类。基于这样的观察,我们首先考虑了改善长尾分类性能的各种技术,这确实增强了实例分割结果。然后,我们提出了一个简单的校准框架,以更有效地减轻分类头偏置,使用双层级平衡采样方法。没有铃铛和口哨声,它会显着提高最近LVIS数据集和我们采样的可可LT数据集的尾部类别的实例分割的性能。我们的分析提供了解决长尾实例检测和分割问题的有用见解,而直接的\ emph {simcal}方法可以用作简单但强大的基线。通过方法,我们赢得了2019年LVIS挑战。代码和模型可在https://github.com/twangnh/simcal上找到。
Most existing object instance detection and segmentation models only work well on fairly balanced benchmarks where per-category training sample numbers are comparable, such as COCO. They tend to suffer performance drop on realistic datasets that are usually long-tailed. This work aims to study and address such open challenges. Specifically, we systematically investigate performance drop of the state-of-the-art two-stage instance segmentation model Mask R-CNN on the recent long-tail LVIS dataset, and unveil that a major cause is the inaccurate classification of object proposals. Based on such an observation, we first consider various techniques for improving long-tail classification performance which indeed enhance instance segmentation results. We then propose a simple calibration framework to more effectively alleviate classification head bias with a bi-level class balanced sampling approach. Without bells and whistles, it significantly boosts the performance of instance segmentation for tail classes on the recent LVIS dataset and our sampled COCO-LT dataset. Our analysis provides useful insights for solving long-tail instance detection and segmentation problems, and the straightforward \emph{SimCal} method can serve as a simple but strong baseline. With the method we have won the 2019 LVIS challenge. Codes and models are available at https://github.com/twangnh/SimCal.