论文标题
来自具有域知识的关系数据库的因果过程挖掘
Causal Process Mining from Relational Databases with Domain Knowledge
论文作者
论文摘要
流程挖掘研究领域的大量算法建立在直接关注关系的基础上。即使在过去十年中进行了各种改进,这些关系存在严重的弱点。一旦与不同对象相关的事件与1:n和n:m相互关联,基于直接与关系的技术会产生虚假的关系,自环和后跳。这是由于经典事件日志中所述的事件序列与事件因果关系有所不同。在本文中,我们解决了代表过程相关事件数据的因果结构的研究问题。为此,我们开发了一种称为因果过程挖掘的新方法。这种方法放弃了平面事件日志的使用,并将事件数据的关系数据库视为输入。更具体地说,我们根据因果过程模板将关系数据结构转换为我们称为因果事件图的内容。我们在与欧洲食品生产公司的案例研究中基于直接遵循关系的技术进行了评估,并将其输出与基于直接遵循关系的技术进行了比较。我们的结果证明了丰富过程开采的好处,并具有来自域中的其他知识。
The plethora of algorithms in the research field of process mining builds on directly-follows relations. Even though various improvements have been made in the last decade, there are serious weaknesses of these relationships. Once events associated with different objects that relate with a cardinality of 1:N and N:M to each other, techniques based on directly-follows relations produce spurious relations, self-loops, and back-jumps. This is due to the fact that event sequence as described in classical event logs differs from event causation. In this paper, we address the research problem of representing the causal structure of process-related event data. To this end, we develop a new approach called Causal Process Mining. This approach renounces the use of flat event logs and considers relational databases of event data as an input. More specifically, we transform the relational data structures based on the Causal Process Template into what we call Causal Event Graph. We evaluate our approach and compare its outputs with techniques based on directly-follows relations in a case study with an European food production company. Our results demonstrate the benefits for enriching process mining with additional knowledge from the domain.