深层学习持续区域

论文标题

深层学习持续区域

Deep R-Learning for Continual Area Sweeping

论文作者

Shah, Rishi, Jiang, Yuqian, Hart, Justin, Stone, Peter

论文摘要

覆盖路径计划是机器人技术中一个充分研究的问题，在该机器人技术中，机器人必须计划一条经过给定区域中每个点的路径，通常频率均匀。为了解决需要比其他要点更频繁地访问某些要点的情况，此问题已扩展到不均匀的覆盖范围计划。本文考虑了非均匀覆盖范围的变体，其中机器人不知道事先知道相关事件的分布，因此必须学会最大程度地提高检测感兴趣的事件的速度。以前，这种持续的范围清扫问题已经正式化，以对环境做出强烈的假设，迄今为止，只提出了一种贪婪的方法。我们将持续的范围扫描公式概括为包括更少的环境限制，并在半马尔可夫决策过程中提出了一种基于强化学习的新方法。在抽象模拟和高保真凉亭模拟中评估了这种方法。这些评估在一般环境中的现有方法上显示出显着改善，这在不断增长的服务机器人技术领域尤其重要。

Coverage path planning is a well-studied problem in robotics in which a robot must plan a path that passes through every point in a given area repeatedly, usually with a uniform frequency. To address the scenario in which some points need to be visited more frequently than others, this problem has been extended to non-uniform coverage planning. This paper considers the variant of non-uniform coverage in which the robot does not know the distribution of relevant events beforehand and must nevertheless learn to maximize the rate of detecting events of interest. This continual area sweeping problem has been previously formalized in a way that makes strong assumptions about the environment, and to date only a greedy approach has been proposed. We generalize the continual area sweeping formulation to include fewer environmental constraints, and propose a novel approach based on reinforcement learning in a Semi-Markov Decision Process. This approach is evaluated in an abstract simulation and in a high fidelity Gazebo simulation. These evaluations show significant improvement upon the existing approach in general settings, which is especially relevant in the growing area of service robotics.

下载PDF全文

下载文献需遵守相关版权规定

论文标题