论文标题
因果场景BERT:通过搜索具有挑战性的数据组来改善对象检测
Causal Scene BERT: Improving object detection by searching for challenging groups of data
论文作者
论文摘要
现代计算机视觉应用程序依赖于基于学习的感知模块,该模块用神经网络参数为对象检测等任务。这些模块通常总体上的预期误差较低,但由于训练过程中固有的偏见,对非典型数据组的误差很高。在构建自动驾驶汽车(AV)时,此问题是一个特别重要的挑战,因为它们的感知模块对于整体系统性能至关重要。在确定了AV中的失败之后,人类团队将通过相关数据梳理共同原因的组感知失败。然后,在重新录制模型以解决问题之前,收集和注释来自这些组的更多数据。换句话说,在事后发现错误组并解决了错误组。我们的主要贡献是一种伪自动方法,可以通过在模拟场景上执行因果干预措施来发现此类群体。为了保持我们对数据歧管的干预措施,我们使用蒙版语言模型。我们验证是否通过干预发现的优先组对对象检测器有挑战性,并表明从这些组中收集的数据进行重新培训有助于与添加更多的IID数据相比。我们还计划发布软件以在模拟场景中进行干预措施,我们希望这将使因果关系社区受益。
Modern computer vision applications rely on learning-based perception modules parameterized with neural networks for tasks like object detection. These modules frequently have low expected error overall but high error on atypical groups of data due to biases inherent in the training process. In building autonomous vehicles (AV), this problem is an especially important challenge because their perception modules are crucial to the overall system performance. After identifying failures in AV, a human team will comb through the associated data to group perception failures that share common causes. More data from these groups is then collected and annotated before retraining the model to fix the issue. In other words, error groups are found and addressed in hindsight. Our main contribution is a pseudo-automatic method to discover such groups in foresight by performing causal interventions on simulated scenes. To keep our interventions on the data manifold, we utilize masked language models. We verify that the prioritized groups found via intervention are challenging for the object detector and show that retraining with data collected from these groups helps inordinately compared to adding more IID data. We also plan to release software to run interventions in simulated scenes, which we hope will benefit the causality community.