Paper Title
Construction and Applications of Billion-Scale Pre-Trained Multimodal Business Knowledge Graph
Paper Authors
Paper Abstract
Business knowledge graphs (KGs) are important to many enterprises today, providing factual knowledge and structured data that steer many products and make them more intelligent. Despite their promising benefits, building a business KG requires solving the prohibitive issues of deficient structure and multiple modalities. In this paper, we advance the understanding of the practical challenges of building KGs in non-trivial real-world systems. We introduce the process of building an open business knowledge graph (OpenBG) derived from a well-known enterprise, Alibaba Group. Specifically, we define a core ontology to cover various abstract products and consumption demands, with fine-grained taxonomy and multimodal facts in deployed applications. OpenBG is an open business KG of unprecedented scale: 2.6 billion triples with more than 88 million entities, covering over 1 million core classes/concepts and 2,681 types of relations. We release all the open resources derived from it (the OpenBG benchmarks) for the community and report experimental results on KG-centric tasks. We also ran an online competition based on the OpenBG benchmarks, which attracted thousands of teams. We further pre-train OpenBG and apply it to many KG-enhanced downstream tasks in business scenarios, demonstrating the effectiveness of billion-scale multimodal knowledge for e-commerce. All resources and code have been released at https://github.com/OpenBGBenchmark/OpenBG.
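The scale figures in the abstract (triples, entities, relation types) can be made concrete with a minimal sketch of how such counts might be computed over a set of (head, relation, tail) triples. Everything in this sketch is an assumption made for illustration: the `Triple` type, the `kg_stats` helper, and the toy identifiers are hypothetical and are not taken from the OpenBG release or its file format.

```python
from collections import Counter
from typing import NamedTuple


class Triple(NamedTuple):
    """One knowledge-graph fact: (head entity, relation, tail entity)."""
    head: str
    relation: str
    tail: str


# Hypothetical toy triples; the identifiers are invented for illustration
# and do not come from the OpenBG data.
TRIPLES = [
    Triple("product:1", "rdf:type", "category:Snacks"),
    Triple("product:1", "brandIs", "brand:A"),
    Triple("product:2", "rdf:type", "category:Snacks"),
    Triple("product:2", "brandIs", "brand:B"),
    Triple("category:Snacks", "rdfs:subClassOf", "category:Food"),
]


def kg_stats(triples):
    """Return (number of triples, distinct entities, distinct relation types)."""
    # An entity is anything that appears as a head or a tail.
    entities = {t.head for t in triples} | {t.tail for t in triples}
    # Counting relations also gives per-relation frequencies if needed.
    relations = Counter(t.relation for t in triples)
    return len(triples), len(entities), len(relations)


if __name__ == "__main__":
    n_triples, n_entities, n_relations = kg_stats(TRIPLES)
    print(n_triples, n_entities, n_relations)
```

At billion scale the same counts would presumably be computed with streaming or distributed tooling rather than in-memory sets; the sketch only fixes the terminology.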