论文标题
用于机器人顺序操作的学习有效约束图表
Learning Efficient Constraint Graph Sampling for Robotic Sequential Manipulation
论文作者
论文摘要
从约束歧管中进行有效的采样,从而为可行性问题产生各种解决方案,这是一个基本挑战。我们考虑存在问题的情况,即,基本的非线性程序被分解为可区别的平等和不平等约束,每个程序仅取决于某些变量。此类问题是高效且健壮的连续机器人操纵计划的核心。单个变量的幼稚顺序条件采样,以及所有变量的完全关节采样(例如,利用优化方法)可能高效且不舒适。我们提出了一个新颖的框架,以学习如何将整个问题分解为较小的顺序抽样问题。具体而言,我们利用蒙特卡洛树搜索来学习可变材料的分配顺序,以最大程度地减少计算时间以生成可行的完整样本。此策略使我们能够有效地计算一组顺序操作任务中模式切换的不同有效的机器人配置,这是随后的轨迹优化或基于采样的运动计划算法的路点。我们表明,学习方法迅速融合到给定问题的最佳采样策略,并优于用户定义的订单或完全关节优化,同时提供了更高的样本多样性。
Efficient sampling from constraint manifolds, and thereby generating a diverse set of solutions for feasibility problems, is a fundamental challenge. We consider the case where a problem is factored, that is, the underlying nonlinear program is decomposed into differentiable equality and inequality constraints, each of which depends only on some variables. Such problems are at the core of efficient and robust sequential robot manipulation planning. Naive sequential conditional sampling of individual variables, as well as fully joint sampling of all variables at once (e.g., leveraging optimization methods), can be highly inefficient and non-robust. We propose a novel framework to learn how to break the overall problem into smaller sequential sampling problems. Specifically, we leverage Monte-Carlo Tree Search to learn assignment orders for the variable-subsets, in order to minimize the computation time to generate feasible full samples. This strategy allows us to efficiently compute a set of diverse valid robot configurations for mode-switches within sequential manipulation tasks, which are waypoints for subsequent trajectory optimization or sampling-based motion planning algorithms. We show that the learning method quickly converges to the best sampling strategy for a given problem, and outperforms user-defined orderings or fully joint optimization, while providing a higher sample diversity.