论文标题

通过优先交流学习的多代理路径查找

Multi-Agent Path Finding with Prioritized Communication Learning

论文作者

Li, Wenhao, Chen, Hongjun, Jin, Bo, Tan, Wenzhe, Zha, Hongyuan, Wang, Xiangfeng

论文摘要

多代理探路(MAPF)已被广泛用于解决大型现实世界中的问题,例如自动化仓库。已经引入了基于学习的,完全分散的框架,以减轻实时问题并同时追求最佳计划政策。但是,现有方法可能会产生更多的顶点冲突(或碰撞),从而导致成功率较低或更高。在本文中,我们提出了一种优先的交流学习方法(PICO),该方法将\ textIt {intimit}计划优先级纳入了分散的多代理增强学习框架内的通信拓扑。与经典的耦合计划者组装,可以利用隐性优先学习模块形成动态通信拓扑,这也构建了有效的避免碰撞的机制。 PICO在成功率和碰撞率的大规模MAPF任务中的表现要比基于最先进的学习计划者要好得多。

Multi-agent pathfinding (MAPF) has been widely used to solve large-scale real-world problems, e.g., automation warehouses. The learning-based, fully decentralized framework has been introduced to alleviate real-time problems and simultaneously pursue optimal planning policy. However, existing methods might generate significantly more vertex conflicts (or collisions), which lead to a low success rate or more makespan. In this paper, we propose a PrIoritized COmmunication learning method (PICO), which incorporates the \textit{implicit} planning priorities into the communication topology within the decentralized multi-agent reinforcement learning framework. Assembling with the classic coupled planners, the implicit priority learning module can be utilized to form the dynamic communication topology, which also builds an effective collision-avoiding mechanism. PICO performs significantly better in large-scale MAPF tasks in success rates and collision rates than state-of-the-art learning-based planners.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源