比较传统机器学习模型的解释方法第2部分：量化模型的解释性忠诚和改进，并降低维度

论文标题

比较传统机器学习模型的解释方法第2部分：量化模型的解释性忠诚和改进，并降低维度

Comparing Explanation Methods for Traditional Machine Learning Models Part 2: Quantifying Model Explainability Faithfulness and Improvements with Dimensionality Reduction

论文作者

Flora, Montgomery, Potvin, Corey, McGovern, Amy, Handler, Shawn

论文摘要

机器学习（ML）模型在具有广泛应用的大气科学界变得越来越普遍。为了使用户了解ML模型所学的知识，ML解释性已成为一个积极研究的领域。在这项两部分研究的第一部分中，我们描述了几种解释性方法，并证明了来自不同方法的特征排名可能会彼此不同意。但是，目前尚不清楚分歧是否过度膨胀，因为某些方法不太忠于分配重要性。在此，“忠诚”或“忠诚”是指指定的特征重要性与该功能对模型性能的贡献之间的对应关系。在本研究中，我们使用多种方法评估了特征排名方法的忠诚。鉴于解释方法具有相关性的敏感性，我们还量化了相关特征的限制后的解释性忠诚度有多改善。在降低维度之前，特征相关方法[例如，由于相关特征的负面影响，由于置换率，石灰，啤酒方差和逻辑回归（LR）系数]通常比排列重要性方法更为忠诚。一旦相关特征减少了，传统置换的重要性就成为最忠实的方法。此外，排名不确定性（即，通过不同排名方法分配给特征的等级中的传播）减少了2-10倍，并且排除了较少的忠实特征排名方法将其进一步降低。这项研究是量化限制相关特征并了解不同解释性方法的相对保真度的解释性提高的研究之一。

Machine learning (ML) models are becoming increasingly common in the atmospheric science community with a wide range of applications. To enable users to understand what an ML model has learned, ML explainability has become a field of active research. In Part I of this two-part study, we described several explainability methods and demonstrated that feature rankings from different methods can substantially disagree with each other. It is unclear, though, whether the disagreement is overinflated due to some methods being less faithful in assigning importance. Herein, "faithfulness" or "fidelity" refer to the correspondence between the assigned feature importance and the contribution of the feature to model performance. In the present study, we evaluate the faithfulness of feature ranking methods using multiple methods. Given the sensitivity of explanation methods to feature correlations, we also quantify how much explainability faithfulness improves after correlated features are limited. Before dimensionality reduction, the feature relevance methods [e.g., SHAP, LIME, ALE variance, and logistic regression (LR) coefficients] were generally more faithful than the permutation importance methods due to the negative impact of correlated features. Once correlated features were reduced, traditional permutation importance became the most faithful method. In addition, the ranking uncertainty (i.e., the spread in rank assigned to a feature by the different ranking methods) was reduced by a factor of 2-10, and excluding less faithful feature ranking methods reduces it further. This study is one of the first to quantify the improvement in explainability from limiting correlated features and knowing the relative fidelity of different explainability methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题