CAZSL：通过上下文概括来推动模型的零射击回归

论文标题

CAZSL：通过上下文概括来推动模型的零射击回归

CAZSL: Zero-Shot Regression for Pushing Models by Generalizing Through Context

论文作者

Zhang, Wenyu, Seto, Skyler, Jha, Devesh K.

论文摘要

许多机器人操作任务需要学习物理世界的准确模型。但是，在操纵过程中，预计机器人将与未知工件相互作用，因此建立可以推广到许多这些对象的预测模型是非常可取的。在本文中，我们研究了设计深度学习代理的问题，这些问题可以通过构建背景感知的学习模型来概括其物理世界的模型。这些代理的目的是根据某些特征，以某些功能对相互作用的对象进行一些特征，以快速调整和/或推广其在现实世界中的物理概念，这些对象为预测模型提供了不同的上下文。通过这种动机，我们提出了上下文感知的零射击学习（CAZSL，发音为休闲）模型，一种使用暹罗网络体系结构，基于上下文变量嵌入空间掩盖和正则化的方法，该变量使我们能够学习一个模型，该模型可以推广到相互作用对象的不同参数或功能。我们在最近发布的OmniPush数据集中测试了我们提出的学习算法，该算法允许使用低维数据测试元学习功能。 CAZSL的代码可在https://www.merl.com/research/license/cazsl上找到。

Learning accurate models of the physical world is required for a lot of robotic manipulation tasks. However, during manipulation, robots are expected to interact with unknown workpieces so that building predictive models which can generalize over a number of these objects is highly desirable. In this paper, we study the problem of designing deep learning agents which can generalize their models of the physical world by building context-aware learning models. The purpose of these agents is to quickly adapt and/or generalize their notion of physics of interaction in the real world based on certain features about the interacting objects that provide different contexts to the predictive models. With this motivation, we present context-aware zero shot learning (CAZSL, pronounced as casual) models, an approach utilizing a Siamese network architecture, embedding space masking and regularization based on context variables which allows us to learn a model that can generalize to different parameters or features of the interacting objects. We test our proposed learning algorithm on the recently released Omnipush datatset that allows testing of meta-learning capabilities using low-dimensional data. Codes for CAZSL are available at https://www.merl.com/research/license/CAZSL.

下载PDF全文

下载文献需遵守相关版权规定

论文标题