Paper title

Challenges of Linking Organizational Information in Open Government Data to Knowledge Graphs

Authors

Jan Portisch, Omaima Fallatah, Sebastian Neumaier, Mohamad Yaser Jaradeh, Axel Polleres

Abstract

Open Government Data (OGD) is being published by various public administration organizations around the globe. Within the metadata of OGD data catalogs, the publishing organizations (1) are not uniquely and unambiguously identifiable and, even worse, (2) change over time, by public administration units being merged or restructured. In order to enable fine-grained analyses or searches on Open Government Data on the level of publishing organizations, linking those from OGD portals to publicly available knowledge graphs (KGs) such as Wikidata and DBpedia seems like an obvious solution. Still, as we show in this position paper, organization linking faces significant challenges, both in terms of available (portal) metadata and KGs in terms of data quality and completeness. We herein specifically highlight five main challenges, namely regarding (1) temporal changes in organizations and in the portal metadata, (2) lack of a base ontology for describing organizational structures and changes in public knowledge graphs, (3) metadata and KG data quality, (4) multilinguality, and (5) disambiguating public sector organizations. Based on available OGD portal metadata from the Open Data Portal Watch, we provide an in-depth analysis of these issues, make suggestions for concrete starting points on how to tackle them along with a call to the community to jointly work on these open challenges.
