自动驾驶汽车可以识别，恢复和适应分配变化吗？

论文标题

自动驾驶汽车可以识别，恢复和适应分配变化吗？

Can Autonomous Vehicles Identify, Recover From, and Adapt to Distribution Shifts?

论文作者

Filos, Angelos, Tigas, Panagiotis, McAllister, Rowan, Rhinehart, Nicholas, Levine, Sergey, Gal, Yarin

论文摘要

培训外分布（OOD）方案是部署时学习代理的普遍挑战，通常导致任意扣除和不富有明智的决定。原则上，检测和适应OOD场景可以减轻其不良影响。在本文中，我们强调了当前新型驾驶场景方法的局限性，并提出了一种认知不确定性的计划方法，称为\ emph {可靠的模仿计划}（RIP）。我们的方法可以从某些分布变化中检测和恢复，从而减少了OOD场景中的过度自信和灾难性的外推。如果模型的不确定性太大，无法提出安全的行动方案，则该模型可以向专家驱动程序查询反馈，从而使样本有效的在线适应能力，这是我们方法的一种变体，我们称为\ emph {自适应强大的模仿计划}（Adarip）。 Our methods outperform current state-of-the-art approaches in the nuScenes \emph{prediction} challenge, but since no benchmark evaluating OOD detection and adaption currently exists to assess \emph{control}, we introduce an autonomous car novel-scene benchmark, \texttt{CARNOVEL}, to evaluate the robustness of driving agents to a suite of tasks with distribution shifts.

Out-of-training-distribution (OOD) scenarios are a common challenge of learning agents at deployment, typically leading to arbitrary deductions and poorly-informed decisions. In principle, detection of and adaptation to OOD scenes can mitigate their adverse effects. In this paper, we highlight the limitations of current approaches to novel driving scenes and propose an epistemic uncertainty-aware planning method, called \emph{robust imitative planning} (RIP). Our method can detect and recover from some distribution shifts, reducing the overconfident and catastrophic extrapolations in OOD scenes. If the model's uncertainty is too great to suggest a safe course of action, the model can instead query the expert driver for feedback, enabling sample-efficient online adaptation, a variant of our method we term \emph{adaptive robust imitative planning} (AdaRIP). Our methods outperform current state-of-the-art approaches in the nuScenes \emph{prediction} challenge, but since no benchmark evaluating OOD detection and adaption currently exists to assess \emph{control}, we introduce an autonomous car novel-scene benchmark, \texttt{CARNOVEL}, to evaluate the robustness of driving agents to a suite of tasks with distribution shifts.

下载PDF全文

下载文献需遵守相关版权规定

论文标题