论文标题
档案数据中的关系发现的探索方法
Exploratory Methods for Relation Discovery in Archival Data
论文作者
论文摘要
在本文中,我们提出了一种整体方法,以发现艺术历史社区中的关系,并丰富了历史学家的传记和档案描述,并具有与艺术史学研究相关的图形模式。我们使用探索性数据分析来检测模式,选择特征,然后使用它们来评估分类模型以预测新关系,并建议在分类阶段向档案管理员使用。结果表明,基于研究主题或机构关系的关系,基于传记信息的关系的精度可以更高。确定性和先验规则比概率方法提出了更好的结果。
In this article we propose a holistic approach to discover relations in art historical communities and enrich historians' biographies and archival descriptions with graph patterns relevant to art historiographic enquiry. We use exploratory data analysis to detect patterns, we select features, and we use them to evaluate classification models to predict new relations, to be recommended to archivists during the cataloguing phase. Results show that relations based on biographical information can be addressed with higher precision than relations based on research topics or institutional relations. Deterministic and a priori rules present better results than probabilistic methods.