论文标题
分析点云的深度学习表示形式,以实时车载激光雷达感知
Analyzing Deep Learning Representations of Point Clouds for Real-Time In-Vehicle LiDAR Perception
论文作者
论文摘要
激光雷达传感器是现代自动驾驶汽车不可或缺的一部分,因为它们为车辆周围环境提供了准确的高分辨率3D表示。但是,在计算上很难利用来自多个高分辨率激光雷达传感器的不断增加的数据。随着帧速率,点云的大小和传感器分辨率的增加,这些点云的实时处理仍必须从越来越精确的车辆环境图片中提取语义。在这些点云上运行的深神经网络的运行时间性能和准确性的决定性因素是基础数据表示及其计算方式。在这项工作中,我们研究了神经网络中使用的计算表示及其性能特征之间的关系。为此,我们提出了一种新颖的计算分类法对LIDAR点云表示,用于3D点云处理中。使用这种分类法,我们对不同的方法家族进行结构化分析。因此,我们在计算效率,内存需求和代表性能力方面发现了共同的优势和局限性,这些能力是通过语义分割性能衡量的。最后,我们为神经点云处理方法的未来发展提供了一些见解和指导。
LiDAR sensors are an integral part of modern autonomous vehicles as they provide an accurate, high-resolution 3D representation of the vehicle's surroundings. However, it is computationally difficult to make use of the ever-increasing amounts of data from multiple high-resolution LiDAR sensors. As frame-rates, point cloud sizes and sensor resolutions increase, real-time processing of these point clouds must still extract semantics from this increasingly precise picture of the vehicle's environment. One deciding factor of the run-time performance and accuracy of deep neural networks operating on these point clouds is the underlying data representation and the way it is computed. In this work, we examine the relationship between the computational representations used in neural networks and their performance characteristics. To this end, we propose a novel computational taxonomy of LiDAR point cloud representations used in modern deep neural networks for 3D point cloud processing. Using this taxonomy, we perform a structured analysis of different families of approaches. Thereby, we uncover common advantages and limitations in terms of computational efficiency, memory requirements, and representational capacity as measured by semantic segmentation performance. Finally, we provide some insights and guidance for future developments in neural point cloud processing methods.