诗人：自适应决策树可解释的政策学习

论文标题

诗人：自适应决策树可解释的政策学习

POETREE: Interpretable Policy Learning with Adaptive Decision Trees

论文作者

Pace, Alizée, Chan, Alex J., van der Schaar, Mihaela

论文摘要

从观察到的行为中建立人类决策的模型对于更好地理解，诊断和支持临床护理等现实世界政策至关重要。随着既定的政策学习方法仍然专注于模仿绩效，他们无法解释所证明的决策过程。通过决策树（Poetree）提取政策是一种可解释的政策学习的新框架，与完全秘密和部分观察到的临床决策环境兼容，并构建了基于患者的观察和病史的医师行动来决定医师行动的概率树政策。在优化期间，完全不同的树架构会逐渐生长，以使其复杂性适应建模任务，并通过复发来了解患者病史的表示，从而产生了随着时间的推移而与患者信息相适应的决策树策略。在理解，量化和评估观察到的行为以及准确复制它方面，这种政策学习方法在真实和综合医学数据集上的最新方法都优于最先进的方法，并具有改善未来决策支持系统的潜力。

Building models of human decision-making from observed behaviour is critical to better understand, diagnose and support real-world policies such as clinical care. As established policy learning approaches remain focused on imitation performance, they fall short of explaining the demonstrated decision-making process. Policy Extraction through decision Trees (POETREE) is a novel framework for interpretable policy learning, compatible with fully-offline and partially-observable clinical decision environments -- and builds probabilistic tree policies determining physician actions based on patients' observations and medical history. Fully-differentiable tree architectures are grown incrementally during optimization to adapt their complexity to the modelling task, and learn a representation of patient history through recurrence, resulting in decision tree policies that adapt over time with patient information. This policy learning method outperforms the state-of-the-art on real and synthetic medical datasets, both in terms of understanding, quantifying and evaluating observed behaviour as well as in accurately replicating it -- with potential to improve future decision support systems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题