论文标题
言语保留什么公司?重新访问J.R. Firth和Zellig Harris的发行语义
What company do words keep? Revisiting the distributional semantics of J.R. Firth & Zellig Harris
论文作者
论文摘要
单词嵌入的力量归因于语言理论,即相似的单词将出现在相似的上下文中。英国语言学家J.R. Firth的一句话特别引用了这个想法,并指出“您将遵守公司的一句话”。尽管Firth和Harris在所有主要的NLP教科书和许多基础论文中都被引用,但很少讨论其理论之间的内容和差异。仔细阅读他们的作品,我们发现了两个不同的意义理论。一个人专注于语言形式的内部运作,而另一个人则邀请我们考虑在新公司中的单词 - 不仅与其他语言要素有关,而且在更广泛的文化和情境背景下。从NLP当前的辩论的角度来看,我们在Firth中发现了这些理论,在Firth中发现了一个人物,他可以指导该领域朝着更具文化的语义概念构想。我们考虑如何通过两种不同的策略在实践中建立扩展的“上下文”概念:比较分层和义面扩展
The power of word embeddings is attributed to the linguistic theory that similar words will appear in similar contexts. This idea is specifically invoked by noting that "you shall know a word by the company it keeps," a quote from British linguist J.R. Firth who, along with his American colleague Zellig Harris, is often credited with the invention of "distributional semantics." While both Firth and Harris are cited in all major NLP textbooks and many foundational papers, the content and differences between their theories is seldom discussed. Engaging in a close reading of their work, we discover two distinct and in many ways divergent theories of meaning. One focuses exclusively on the internal workings of linguistic forms, while the other invites us to consider words in new company - not just with other linguistic elements, but also in a broader cultural and situational context. Contrasting these theories from the perspective of current debates in NLP, we discover in Firth a figure who could guide the field towards a more culturally grounded notion of semantics. We consider how an expanded notion of "context" might be modeled in practice through two different strategies: comparative stratification and syntagmatic extension