基于语法的基础词典学习

论文标题

基于语法的基础词典学习

Grammar-Based Grounded Lexicon Learning

论文作者

Mao, Jiayuan, Shi, Haoyue, Wu, Jiajun, Levy, Roger P., Tenenbaum, Joshua B.

论文摘要

我们提出了基于语法的基础词典学习（G2L2），这是一种词汇主义方法，用于学习从接地数据（例如成对的图像和文本）中学习语言的组成和基础含义。 G2L2的核心是词典条目的集合，它们将每个单词映射到句法类型的元组和神经符号符号语义程序。例如，Shiny一词具有句法类型的形容词。它的神经符号语义程序具有符号形式λx。滤波器（x，闪亮），其中概念光泽与神经网络嵌入相关联，该神经网络将用于对光泽对象进行分类。给定输入句子，G2L2首先查找与每个令牌相关的词典条目。然后，它通过基于语法构成词汇含义，将句子的含义作为可执行的神经符号程序。恢复的含义程序可以在接地输入上执行。为了促进在成倍增长的组成空间中学习，我们引入了一种联合解析和预期的执行算法，该算法对派生进行了局部边缘化，以减少训练时间。我们在两个域上评估G2L2：视觉推理和语言驱动导航。结果表明，G2L2可以从少量数据推广到单词的新颖组成。

We present Grammar-Based Grounded Lexicon Learning (G2L2), a lexicalist approach toward learning a compositional and grounded meaning representation of language from grounded data, such as paired images and texts. At the core of G2L2 is a collection of lexicon entries, which map each word to a tuple of a syntactic type and a neuro-symbolic semantic program. For example, the word shiny has a syntactic type of adjective; its neuro-symbolic semantic program has the symbolic form λx. filter(x, SHINY), where the concept SHINY is associated with a neural network embedding, which will be used to classify shiny objects. Given an input sentence, G2L2 first looks up the lexicon entries associated with each token. It then derives the meaning of the sentence as an executable neuro-symbolic program by composing lexical meanings based on syntax. The recovered meaning programs can be executed on grounded inputs. To facilitate learning in an exponentially-growing compositional space, we introduce a joint parsing and expected execution algorithm, which does local marginalization over derivations to reduce the training time. We evaluate G2L2 on two domains: visual reasoning and language-driven navigation. Results show that G2L2 can generalize from small amounts of data to novel compositions of words.

下载PDF全文

下载文献需遵守相关版权规定

论文标题