论文标题
有效计算对比解释
Efficient computation of contrastive explanations
论文作者
论文摘要
随着机器学习系统在实践中的部署的增加,透明度和解释性已成为严重的问题。对比解释被认为是有用的和直观的,特别是在解释躺在外行的决定时,因为它们模仿了人类解释的方式。然而,到目前为止,很少有研究解决了计算可行的技术,这些技术可以保证解释的独特性和最佳性,并可以轻松纳入其他约束。在这里,我们将专注于特定类型的模型,而不是黑框技术。我们研究对比度和反事实解释的关系,并提出数学形式化以及有效计算(合理的)相关阳性的2期算法,这些算法是许多标准机器学习模型的相关阳性。
With the increasing deployment of machine learning systems in practice, transparency and explainability have become serious issues. Contrastive explanations are considered to be useful and intuitive, in particular when it comes to explaining decisions to lay people, since they mimic the way in which humans explain. Yet, so far, comparably little research has addressed computationally feasible technologies, which allow guarantees on uniqueness and optimality of the explanation and which enable an easy incorporation of additional constraints. Here, we will focus on specific types of models rather than black-box technologies. We study the relation of contrastive and counterfactual explanations and propose mathematical formalizations as well as a 2-phase algorithm for efficiently computing (plausible) pertinent positives of many standard machine learning models.