基于双重关节分析的无监督的多模式词发现，并共同出现线索

论文标题

基于双重关节分析的无监督的多模式词发现，并共同出现线索

Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues

论文作者

Taniguchi, Akira, Murakami, Hiroaki, Ozaki, Ryo, Taniguchi, Tadahiro

论文摘要

根据语音分布的统计特性和其他感觉刺激的同时存在，人类婴儿以最少的语言知识获得了口头词典。这项研究提出了一种新颖的无监督学习方法，用于使用语音信息作为分布提示和对象信息作为共同出现提示发现语音单元。所提出的方法可以使用无监督的学习从语音信号中获取单词和音素，并基于多种模态 - 视觉，触觉和听觉来利用对象信息。所提出的方法基于非参数贝叶斯双重关节分析仪（NPB-DAA），从语音特征发现音素和单词，以及对从对象获得的多模式信息进行分类的多模式潜在迪里奇分配（MLDA）。在实验中，提出的方法显示出比基线方法更高的单词发现性能。准确地分割了表达对象特征（即与名词和形容词相对应的单词）的单词。此外，我们研究了学习表现如何受到语言信息重要性差异的影响。相对于固定条件，增加单词模式的重量进一步提高了性能。

Human infants acquire their verbal lexicon with minimal prior knowledge of language based on the statistical properties of phonological distributions and the co-occurrence of other sensory stimuli. This study proposes a novel fully unsupervised learning method for discovering speech units using phonological information as a distributional cue and object information as a co-occurrence cue. The proposed method can acquire words and phonemes from speech signals using unsupervised learning and utilize object information based on multiple modalities-vision, tactile, and auditory-simultaneously. The proposed method is based on the nonparametric Bayesian double articulation analyzer (NPB-DAA) discovering phonemes and words from phonological features, and multimodal latent Dirichlet allocation (MLDA) categorizing multimodal information obtained from objects. In an experiment, the proposed method showed higher word discovery performance than baseline methods. Words that expressed the characteristics of objects (i.e., words corresponding to nouns and adjectives) were segmented accurately. Furthermore, we examined how learning performance is affected by differences in the importance of linguistic information. Increasing the weight of the word modality further improved performance relative to that of the fixed condition.

下载PDF全文

下载文献需遵守相关版权规定

论文标题