论文标题
通过因果发现和生存分析来揭示生活课程模式
Uncovering life-course patterns with causal discovery and survival analysis
论文作者
论文摘要
我们提供了一种新颖的方法和一项探索性研究,用于通过因果发现和生存分析从概率的角度从概率角度进行建模和发生。我们的方法被称为双层问题。在高层,我们使用因果发现工具来构建生命事件图。在较低级别的情况下,对于生命事件的成对,通过生存分析进行事件的建模应用于模型依赖时间依赖的过渡概率。分析了几项生活事件,例如结婚,购买新车,儿童出生,搬迁和离婚,以及生存模型的社会人口统计学特征,其中一些是年龄,国籍,儿童数量,汽车数量和房屋所有权。该数据源自德国多特蒙德进行的一项调查,其中包含有关住宅和就业传记,旅行行为和节日旅行以及社会经济特征的一系列回顾性问题。尽管过去已使用生存分析来分析生命过程,但这是第一次制定双层模型。在高层中包含因果发现算法,使我们能够首先确定生命过程事件之间的因果关系,然后了解可能影响事件之间过渡速率的因素。这与更经典的选择模型大不相同,在该模型中,因果关系受到基于模型结果的专家解释。
We provide a novel approach and an exploratory study for modelling life event choices and occurrence from a probabilistic perspective through causal discovery and survival analysis. Our approach is formulated as a bi-level problem. In the upper level, we build the life events graph, using causal discovery tools. In the lower level, for the pairs of life events, time-to-event modelling through survival analysis is applied to model time-dependent transition probabilities. Several life events were analysed, such as getting married, buying a new car, child birth, home relocation and divorce, together with the socio-demographic attributes for survival modelling, some of which are age, nationality, number of children, number of cars and home ownership. The data originates from a survey conducted in Dortmund, Germany, with the questionnaire containing a series of retrospective questions about residential and employment biography, travel behaviour and holiday trips, as well as socio-economic characteristic. Although survival analysis has been used in the past to analyse life-course data, this is the first time that a bi-level model has been formulated. The inclusion of a causal discovery algorithm in the upper-level allows us to first identify causal relationships between life-course events and then understand the factors that might influence transition rates between events. This is very different from more classic choice models where causal relationships are subject to expert interpretations based on model results.