论文标题
ualberta在Semeval 2022任务2:利用光泽和翻译进行多语种惯用性检测
UAlberta at SemEval 2022 Task 2: Leveraging Glosses and Translations for Multilingual Idiomaticity Detection
论文作者
论文摘要
我们描述了艾伯塔大学的Semeval-2022任务2艾伯塔大学系统的多语种惯用性检测。在假设惯用表达是非序列的假设下,我们的第一个方法将有关表达式单词含义的信息集成到二进制分类器中。进一步假设字面和惯用表达式的翻译方式有所不同,我们的第二种方法在上下文中翻译了一种表达,并使用词汇知识库来确定翻译是否是字面的。我们的方法基于语言现象,并利用现有的词汇知识来源。我们的结果为这两种方法提供了支持,尤其是前者。
We describe the University of Alberta systems for the SemEval-2022 Task 2 on multilingual idiomaticity detection. Working under the assumption that idiomatic expressions are noncompositional, our first method integrates information on the meanings of the individual words of an expression into a binary classifier. Further hypothesizing that literal and idiomatic expressions translate differently, our second method translates an expression in context, and uses a lexical knowledge base to determine if the translation is literal. Our approaches are grounded in linguistic phenomena, and leverage existing sources of lexical knowledge. Our results offer support for both approaches, particularly the former.