论文标题
多模式相机本地化的不确定性意识DNN
Uncertainty-Aware DNN for Multi-Modal Camera Localization
论文作者
论文摘要
相机定位,即相机姿势回归,代表了计算机视觉中的一项重要任务,因为它具有许多实际应用,例如在智能车辆及其本地化的背景下。对回归不确定性的可靠估计也很重要,因为这将使我们能够捕获危险的本地化故障。在文献中,经常通过采样方法(例如蒙特卡洛辍学(MCD)和深层集合(DE))进行深度神经网络(DNN)的不确定性估计,而以不良的执行时间或硬件资源的增加为代价。在这项工作中,我们考虑了一种不确定性估计方法,称为“深证回归”(DER),该方法避免了任何采样技术,提供了直接的不确定性估计。我们的目标是通过分析生成的不确定性来提供基于DNNS体系结构的相机本地化系统的拦截定位方法的系统方法。我们建议通过修改其内部配置以允许在KITTI数据集上进行广泛的实验活动来利用CMRNET,这是一种多模式图像对LIDAR MAP注册的DNN方法。 “实验部分”强调了CMRNET的主要缺陷,并证明我们的建议不会损害原始定位性能,而是同时提供了允许最终用户采取相应行动的必要内省措施。
Camera localization, i.e., camera pose regression, represents an important task in computer vision since it has many practical applications such as in the context of intelligent vehicles and their localization. Having reliable estimates of the regression uncertainties is also important, as it would allow us to catch dangerous localization failures. In the literature, uncertainty estimation in Deep Neural Networks (DNNs) is often performed through sampling methods, such as Monte Carlo Dropout (MCD) and Deep Ensemble (DE), at the expense of undesirable execution time or an increase in hardware resources. In this work, we considered an uncertainty estimation approach named Deep Evidential Regression (DER) that avoids any sampling technique, providing direct uncertainty estimates. Our goal is to provide a systematic approach to intercept localization failures of camera localization systems based on DNNs architectures, by analyzing the generated uncertainties. We propose to exploit CMRNet, a DNN approach for multi-modal image to LiDAR map registration, by modifying its internal configuration to allow for extensive experimental activity on the KITTI dataset. The experimental section highlights CMRNet's major flaws and proves that our proposal does not compromise the original localization performances but also provides, at the same time, the necessary introspection measures that would allow end-users to act accordingly.