类似人类导航的闭环感知，决策和推理机制

论文标题

类似人类导航的闭环感知，决策和推理机制

A Closed-Loop Perception, Decision-Making and Reasoning Mechanism for Human-Like Navigation

论文作者

Zhang, Wenqi, Zhao, Kai, Li, Peng, Zhu, Xiao, Shen, Yongliang, Ma, Yanna, Chen, Yingfeng, Lu, Weiming

论文摘要

可靠的导航系统在机器人技术和自动驾驶中具有广泛的应用。当前方法采用开环过程，将传感器输入直接转换为动作。但是，这些开环方案由于概括不佳而在处理复杂而动态的现实情况方面具有挑战性。在模仿人类导航的情况下，我们添加了一个推理过程，将动作转换回内部潜在状态，形成一个两阶段的感知，决策和推理的封闭环路。首先，VAE增强的演示学习赋予了对基本导航规则的理解。然后，在RL增强的相互作用学习中的两个双重过程彼此产生奖励反馈，并共同增强了避免障碍能力。推理模型可以实质上促进概括和鲁棒性，并促进算法将算法的部署到现实世界的机器人而无需精心转移的情况下。实验表明，与最先进的方法相比，我们的方法更适合新的情况。

Reliable navigation systems have a wide range of applications in robotics and autonomous driving. Current approaches employ an open-loop process that converts sensor inputs directly into actions. However, these open-loop schemes are challenging to handle complex and dynamic real-world scenarios due to their poor generalization. Imitating human navigation, we add a reasoning process to convert actions back to internal latent states, forming a two-stage closed loop of perception, decision-making, and reasoning. Firstly, VAE-Enhanced Demonstration Learning endows the model with the understanding of basic navigation rules. Then, two dual processes in RL-Enhanced Interaction Learning generate reward feedback for each other and collectively enhance obstacle avoidance capability. The reasoning model can substantially promote generalization and robustness, and facilitate the deployment of the algorithm to real-world robots without elaborate transfers. Experiments show our method is more adaptable to novel scenarios compared with state-of-the-art approaches.

下载PDF全文

下载文献需遵守相关版权规定

论文标题