使用深厚的增强学习来应对现实世界的自动驾驶

论文标题

使用深厚的增强学习来应对现实世界的自动驾驶

Tackling Real-World Autonomous Driving using Deep Reinforcement Learning

论文作者

Maramotti, Paolo, Capasso, Alessandro Paolo, Bacchiani, Giulio, Broggi, Alberto

论文摘要

在典型的自动驾驶堆栈中，计划和控制系统代表了两个最关键的组件，其中传感器检索并通过感知算法处理的数据用于实施安全舒适的自动驾驶行为。特别是，计划模块可以预测自动驾驶汽车应遵循正确的高级操纵的路径，而控制系统则执行一系列低级动作，控制转向角度，油门和制动器。在这项工作中，我们提出了一个无模型的深钢筋学习计划者培训一个可以预测加速和转向角度的神经网络，从而获得了一个单个模块，可以使用自动驾驶汽车的本地化和感知算法处理的数据来驾驶车辆。特别是，在模拟中进行了充分训练的系统能够在模拟和帕尔马市现实世界中的无障碍环境中平稳驱动，并证明该系统具有良好的概括能力，也可以在训练场景之外的那些部分中驱动。此外，为了将系统部署在真正的自动驾驶汽车的船上并减少模拟和现实世界中的表演之间的差距，我们还开发了一个模块，该模块由一个微小的神经网络表示，能够在模拟中训练期间重现真正的车辆动态行为。

In the typical autonomous driving stack, planning and control systems represent two of the most crucial components in which data retrieved by sensors and processed by perception algorithms are used to implement a safe and comfortable self-driving behavior. In particular, the planning module predicts the path the autonomous car should follow taking the correct high-level maneuver, while control systems perform a sequence of low-level actions, controlling steering angle, throttle and brake. In this work, we propose a model-free Deep Reinforcement Learning Planner training a neural network that predicts both acceleration and steering angle, thus obtaining a single module able to drive the vehicle using the data processed by localization and perception algorithms on board of the self-driving car. In particular, the system that was fully trained in simulation is able to drive smoothly and safely in obstacle-free environments both in simulation and in a real-world urban area of the city of Parma, proving that the system features good generalization capabilities also driving in those parts outside the training scenarios. Moreover, in order to deploy the system on board of the real self-driving car and to reduce the gap between simulated and real-world performances, we also develop a module represented by a tiny neural network able to reproduce the real vehicle dynamic behavior during the training in simulation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题