基于射线的分布式自动驾驶汽车研究平台

论文标题

基于射线的分布式自动驾驶汽车研究平台

Ray Based Distributed Autonomous Vehicle Research Platform

论文作者

Xu, Derek

论文摘要

我的项目解决了是否可以使用模拟器（CARLA）快速训练自动驾驶汽车的问题，以及是否可以围绕它构建足够强大的平台。 Ray是一个开源框架，可实现分布式机器学习应用程序。分布式计算是一种在许多机器中平行于训练模型等计算任务的技术。雷摘除了这些机器的复杂协调，使其迅速可扩展。 Carla是一种车辆模拟器，生成用于训练模型的数据。该项目的大部分是编写Ray将用来训练我的分布式模型的培训逻辑。模仿学习最适合自动驾驶汽车。模仿学习是强化学习的替代方法，它通过试图通过模仿专家（通常是人）来学习最佳政策，从而有效。该项目的一个关键可交付是在一些基准测试中展示了我训练有素的代理商，例如导航复杂的交通转折。除此之外，更广泛的野心是开发一个研究平台，在该平台上，其他人可以快速训练并进行大量Carla车辆数据的实验。因此，我的最终产品不是单个模型，而是一个供自动驾驶汽车研究人员使用的大规模开源研究平台（Raycarla）。

My project tackles the question of whether Ray can be used to quickly train autonomous vehicles using a simulator (Carla), and whether a platform robust enough for further research purposes can be built around it. Ray is an open-source framework that enables distributed machine learning applications. Distributed computing is a technique which parallelizes computational tasks, such as training a model, among many machines. Ray abstracts away the complex coordination of these machines, making it rapidly scalable. Carla is a vehicle simulator that generates data used to train a model. The bulk of the project was writing the training logic that Ray would use to train my distributed model. Imitation learning is the best fit for autonomous vehicles. Imitation learning is an alternative to reinforcement learning and it works by trying to learn the optimal policy by imitating an expert (usually a human) given a set of demonstrations. A key deliverable for the project was showcasing my trained agent in a few benchmark tests, such as navigating a complex turn through traffic. Beyond that, the broader ambition was to develop a research platform where others could quickly train and run experiments on huge amounts of Carla vehicle data. Thus, my end product is not a single model, but a large-scale, open-source research platform (RayCarla) for autonomous vehicle researchers to utilize.

下载PDF全文

下载文献需遵守相关版权规定

论文标题