使用图神经网络自动驾驶的合作行为计划

论文标题

使用图神经网络自动驾驶的合作行为计划

Cooperative Behavior Planning for Automated Driving using Graph Neural Networks

论文作者

Klimke, Marvin, Völz, Benjamin, Buchholz, Michael

论文摘要

由于静态优先规则和遮挡限制了对优先流量的观点，城市交叉点很容易延迟和效率低下。改善流量流的现有方法，广泛称为自动交叉管理系统，主要基于非学习预订方案或优化算法。基于机器学习的技术在计划单个自我车辆方面显示出令人鼓舞的结果。这项工作建议通过共同计划多辆车来利用机器学习算法来优化城市交叉点的交通流量。基于学习的行为计划提出了几个挑战，要求适合的输入和输出表示以及大量的基础数据。我们通过使用基于图形的输入表示并伴随图神经网络来解决以前的问题。这允许有效地编码场景，并固有地为所有相关车辆提供单独的输出。为了学习明智的政策，而无需依靠专家示范的模仿，合作计划任务被视为强化学习问题。我们在开源模拟环境中训练并评估所提出的方法，以进行自动驾驶的决策。与由静态优先级规则控制的第一届第一方案和流量相比，学识渊博的计划者表现出显着的流速增长，同时减少了诱导停止的数量。除了合成模拟外，还基于从公开可用的IND数据集中获取的现实世界流量数据进行评估。

Urban intersections are prone to delays and inefficiencies due to static precedence rules and occlusions limiting the view on prioritized traffic. Existing approaches to improve traffic flow, widely known as automatic intersection management systems, are mostly based on non-learning reservation schemes or optimization algorithms. Machine learning-based techniques show promising results in planning for a single ego vehicle. This work proposes to leverage machine learning algorithms to optimize traffic flow at urban intersections by jointly planning for multiple vehicles. Learning-based behavior planning poses several challenges, demanding for a suited input and output representation as well as large amounts of ground-truth data. We address the former issue by using a flexible graph-based input representation accompanied by a graph neural network. This allows to efficiently encode the scene and inherently provide individual outputs for all involved vehicles. To learn a sensible policy, without relying on the imitation of expert demonstrations, the cooperative planning task is considered as a reinforcement learning problem. We train and evaluate the proposed method in an open-source simulation environment for decision making in automated driving. Compared to a first-in-first-out scheme and traffic governed by static priority rules, the learned planner shows a significant gain in flow rate, while reducing the number of induced stops. In addition to synthetic simulations, the approach is also evaluated based on real-world traffic data taken from the publicly available inD dataset.

下载PDF全文

下载文献需遵守相关版权规定

论文标题