论文标题
摊销因果结构学习的推断
Amortized Inference for Causal Structure Learning
论文作者
论文摘要
推断因果结构提出了一个组合搜索问题,该问题通常涉及评估分数或独立性测试的结构。最终的搜索是昂贵的,并且很难设计合适的分数或测试以捕获先验知识。在这项工作中,我们建议摊销因果结构学习。我们没有搜索结构,而是训练一个变异推理模型,直接从观察性或介入数据中预测因果结构。这使我们的推理模型仅从模拟器生成的数据中获取特定于域特异性的归纳偏见,绕过适当分数功能的手工工程和图形搜索。我们的推论模型的结构模拟了对结构学习中统计效率至关重要的置换不变,这促进了比训练期间所见的明显更大的问题实例的概括。在综合数据和半合成基因表达数据上,当经过实质性分布变化并显着胜过现有算法时,我们的模型表现出强大的概括能力,尤其是在具有挑战性的基因组学领域。我们的代码和模型可在以下网址公开获取:https://github.com/larslorch/avici。
Inferring causal structure poses a combinatorial search problem that typically involves evaluating structures with a score or independence test. The resulting search is costly, and designing suitable scores or tests that capture prior knowledge is difficult. In this work, we propose to amortize causal structure learning. Rather than searching over structures, we train a variational inference model to directly predict the causal structure from observational or interventional data. This allows our inference model to acquire domain-specific inductive biases for causal discovery solely from data generated by a simulator, bypassing both the hand-engineering of suitable score functions and the search over graphs. The architecture of our inference model emulates permutation invariances that are crucial for statistical efficiency in structure learning, which facilitates generalization to significantly larger problem instances than seen during training. On synthetic data and semisynthetic gene expression data, our models exhibit robust generalization capabilities when subject to substantial distribution shifts and significantly outperform existing algorithms, especially in the challenging genomics domain. Our code and models are publicly available at: https://github.com/larslorch/avici.