论文标题
使用基于内核的方法进行机器人技术的快速对象细分学习
Fast Object Segmentation Learning with Kernel-based Methods for Robotics
论文作者
论文摘要
对象分割是机器人视觉系统中的一个关键组件,该组件执行诸如抓握和对象操纵之类的任务,尤其是在遮挡的情况下。像许多其他计算机视觉任务一样,采用深度体系结构也使算法具有出色的性能执行此任务。但是,培训需要大量的计算时间,并且不能在线执行机器人技术中采用这种算法。在这项工作中,我们为对象分割提出了一种新颖的体系结构,该架构克服了这个问题,并在最先进的方法所需的时间内提供了可比的性能。我们的方法基于预先训练的面具R-CNN,其中各种层已被一组分类器和回归器所取代,这些分类器和回归器已重新训练为新任务。我们采用了一种有效的基于内核的方法,可以在大规模问题上进行快速培训。我们的方法在YCB-Video数据集上得到了验证,该数据集在计算机视觉和机器人界广泛采用,证明我们可以实现甚至超过最先进的性能,并大幅减少培训时间($ {\ sim} 6 \ times $)。复制实验的代码在GitHub上公开可用。
Object segmentation is a key component in the visual system of a robot that performs tasks like grasping and object manipulation, especially in presence of occlusions. Like many other computer vision tasks, the adoption of deep architectures has made available algorithms that perform this task with remarkable performance. However, adoption of such algorithms in robotics is hampered by the fact that training requires large amount of computing time and it cannot be performed on-line. In this work, we propose a novel architecture for object segmentation, that overcomes this problem and provides comparable performance in a fraction of the time required by the state-of-the-art methods. Our approach is based on a pre-trained Mask R-CNN, in which various layers have been replaced with a set of classifiers and regressors that are re-trained for a new task. We employ an efficient Kernel-based method that allows for fast training on large scale problems. Our approach is validated on the YCB-Video dataset which is widely adopted in the computer vision and robotics community, demonstrating that we can achieve and even surpass performance of the state-of-the-art, with a significant reduction (${\sim}6\times$) of the training time. The code to reproduce the experiments is publicly available on GitHub.