OA-Mine：电子商务产品的开放世界属性采矿，有弱的监督

论文标题

OA-Mine：电子商务产品的开放世界属性采矿，有弱的监督

OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision

论文作者

Zhang, Xinyang, Zhang, Chenwei, Li, Xian, Dong, Xin Luna, Shang, Jingbo, Faloutsos, Christos, Han, Jiawei

论文摘要

从其文本描述中自动提取产品属性对于在线购物者体验至关重要。这项任务的一个固有挑战是电子商务产品的新兴本质 - 我们看到新型产品不断具有独特的新属性。大多数先前在此问题上都可以为一组已知属性的新值进行工作，但无法处理不断变化的数据引起的新属性。在这项工作中，我们研究了开放世界中的属性采矿问题，以提取新颖的属性及其价值。用户不需要提供一些已知属性类型的示例，而不是提供全面的培训数据。我们提出了一个原则性的框架，该框架首先生成属性值候选者，然后将其分组为属性群。候选生成步骤探究了预先训练的语言模型，以从产品标题中提取短语。然后，属性感知的微调方法优化了多任务目标，并塑造语言模型表示为属性 - 歧义性。最后，我们通过我们的框架的自我汇总发现了新的属性和价值，该框架处理了开放世界的挑战。我们在大型注释的开发集和我们收集的金标准人类注销的测试集上进行了广泛的实验。我们的模型极大地胜过强大的基线，并且可以推广到看不见的属性和产品类型。

Automatic extraction of product attributes from their textual descriptions is essential for online shopper experience. One inherent challenge of this task is the emerging nature of e-commerce products -- we see new types of products with their unique set of new attributes constantly. Most prior works on this matter mine new values for a set of known attributes but cannot handle new attributes that arose from constantly changing data. In this work, we study the attribute mining problem in an open-world setting to extract novel attributes and their values. Instead of providing comprehensive training data, the user only needs to provide a few examples for a few known attribute types as weak supervision. We propose a principled framework that first generates attribute value candidates and then groups them into clusters of attributes. The candidate generation step probes a pre-trained language model to extract phrases from product titles. Then, an attribute-aware fine-tuning method optimizes a multitask objective and shapes the language model representation to be attribute-discriminative. Finally, we discover new attributes and values through the self-ensemble of our framework, which handles the open-world challenge. We run extensive experiments on a large distantly annotated development set and a gold standard human-annotated test set that we collected. Our model significantly outperforms strong baselines and can generalize to unseen attributes and product types.

下载PDF全文

下载文献需遵守相关版权规定

论文标题