论文标题
基于图形因果建模的数据驱动因果效应估计:调查
Data-Driven Causal Effect Estimation Based on Graphical Causal Modelling: A Survey
论文作者
论文摘要
在科学研究和现实应用程序的许多领域中,非实验数据的因果效应的无偏估计对于理解数据的基础机制以及对有效响应或干预措施的决策至关重要。已经进行了大量研究,以从不同的角度解决这个具有挑战性的问题。为了估计观测数据中的因果效应,总是会做出诸如马尔可夫条件,忠诚和因果关系之类的假设。在假设下,通常需要一组协变量或基本因果图等全面知识。一个实用的挑战是,在许多应用中,没有这样的全部知识或只有某些部分知识。近年来,研究已经出现了使用基于图形因果模型的搜索策略,以从数据中发现有用的知识,以进行因果效应估计,并具有一些轻微的假设,并在应对实际挑战方面已显示出希望。在这项调查中,我们回顾了这些数据驱动的方法,这些方法对单一治疗的因果效应估计进行了单一的兴趣结果,并专注于数据驱动的因果效应估计所面临的挑战。我们简单地总结了使用图形因果建模对数据驱动的因果效应估计至关重要的基本概念和理论,但散布在文献周围。我们确定并讨论数据驱动的因果效应估计所面临的挑战,并通过其假设以及应对挑战的方法来表征现有方法。我们分析了不同类型方法的优势和局限性,并提出了支持讨论的经验评估。我们希望这篇综述将激励更多的研究人员根据图形因果建模设计更好的数据驱动方法,以解决因果效应估计的具有挑战性的问题。
In many fields of scientific research and real-world applications, unbiased estimation of causal effects from non-experimental data is crucial for understanding the mechanism underlying the data and for decision-making on effective responses or interventions. A great deal of research has been conducted to address this challenging problem from different angles. For estimating causal effect in observational data, assumptions such as Markov condition, faithfulness and causal sufficiency are always made. Under the assumptions, full knowledge such as, a set of covariates or an underlying causal graph, is typically required. A practical challenge is that in many applications, no such full knowledge or only some partial knowledge is available. In recent years, research has emerged to use search strategies based on graphical causal modelling to discover useful knowledge from data for causal effect estimation, with some mild assumptions, and has shown promise in tackling the practical challenge. In this survey, we review these data-driven methods on causal effect estimation for a single treatment with a single outcome of interest and focus on the challenges faced by data-driven causal effect estimation. We concisely summarise the basic concepts and theories that are essential for data-driven causal effect estimation using graphical causal modelling but are scattered around the literature. We identify and discuss the challenges faced by data-driven causal effect estimation and characterise the existing methods by their assumptions and the approaches to tackling the challenges. We analyse the strengths and limitations of the different types of methods and present an empirical evaluation to support the discussions. We hope this review will motivate more researchers to design better data-driven methods based on graphical causal modelling for the challenging problem of causal effect estimation.