通过优先交流学习的多代理路径查找

论文标题

通过优先交流学习的多代理路径查找

Multi-Agent Path Finding with Prioritized Communication Learning

论文作者

Li, Wenhao, Chen, Hongjun, Jin, Bo, Tan, Wenzhe, Zha, Hongyuan, Wang, Xiangfeng

论文摘要

多代理探路（MAPF）已被广泛用于解决大型现实世界中的问题，例如自动化仓库。已经引入了基于学习的，完全分散的框架，以减轻实时问题并同时追求最佳计划政策。但是，现有方法可能会产生更多的顶点冲突（或碰撞），从而导致成功率较低或更高。在本文中，我们提出了一种优先的交流学习方法（PICO），该方法将\ textIt {intimit}计划优先级纳入了分散的多代理增强学习框架内的通信拓扑。与经典的耦合计划者组装，可以利用隐性优先学习模块形成动态通信拓扑，这也构建了有效的避免碰撞的机制。 PICO在成功率和碰撞率的大规模MAPF任务中的表现要比基于最先进的学习计划者要好得多。

Multi-agent pathfinding (MAPF) has been widely used to solve large-scale real-world problems, e.g., automation warehouses. The learning-based, fully decentralized framework has been introduced to alleviate real-time problems and simultaneously pursue optimal planning policy. However, existing methods might generate significantly more vertex conflicts (or collisions), which lead to a low success rate or more makespan. In this paper, we propose a PrIoritized COmmunication learning method (PICO), which incorporates the \textit{implicit} planning priorities into the communication topology within the decentralized multi-agent reinforcement learning framework. Assembling with the classic coupled planners, the implicit priority learning module can be utilized to form the dynamic communication topology, which also builds an effective collision-avoiding mechanism. PICO performs significantly better in large-scale MAPF tasks in success rates and collision rates than state-of-the-art learning-based planners.

下载PDF全文

下载文献需遵守相关版权规定

论文标题