多维不确定性感知证据神经网络

论文标题

多维不确定性感知证据神经网络

Multidimensional Uncertainty-Aware Evidential Neural Networks

论文作者

Hu, Yibo, Ou, Yuzhe, Zhao, Xujiang, Cho, Jin-Hee, Chen, Feng

论文摘要

传统的深度神经网络（NNS）在各个应用领域下的分类任务中为最先进的表现做出了重大贡献。但是，NN并未考虑与班级概率相关的数据固有的不确定性，因为不确定性下的错误分类很可能很容易在现实世界中的决策中引起高风险（例如，道路上对物体的错误分类导致严重的事故）。与贝叶斯NN间接通过体重不确定性来推断不确定性不同，最近有人提出了证据NNS（ENN），以明确对类别概率的不确定性进行建模，并将其用于分类任务。 AN ENN提供了NNS作为主观意见的预测的表述，并通过收集可以从数据中确定性NN形成主观意见的证据来了解该功能。但是，ENN被训练为黑匣子，而没有明确考虑具有不同根本原因的数据固有的不确定性，例如空缺（即由于缺乏证据而导致的不确定性）或不和谐（即由于证据冲突而导致的不确定性）。通过考虑多维不确定性，我们提出了一种新型的不确定性意识到的证据NN，称为WGAN-enn（WENN），用于解决未分布（OOD）检测问题。我们采用了一种混合方法，将Wasserstein生成对抗网络（WGAN）与ENN相结合，以共同训练具有某个类别的先验知识的模型，该模型具有很高的OOD样品。通过基于合成和现实世界数据集的广泛的经验实验，我们证明了WENN对不确定性的估计可以显着帮助将OOD样品与边界样本区分开。与其他竞争性同行相比，Wenn在OOD检测中的表现优于检测。

Traditional deep neural networks (NNs) have significantly contributed to the state-of-the-art performance in the task of classification under various application domains. However, NNs have not considered inherent uncertainty in data associated with the class probabilities where misclassification under uncertainty may easily introduce high risk in decision making in real-world contexts (e.g., misclassification of objects in roads leads to serious accidents). Unlike Bayesian NN that indirectly infer uncertainty through weight uncertainties, evidential NNs (ENNs) have been recently proposed to explicitly model the uncertainty of class probabilities and use them for classification tasks. An ENN offers the formulation of the predictions of NNs as subjective opinions and learns the function by collecting an amount of evidence that can form the subjective opinions by a deterministic NN from data. However, the ENN is trained as a black box without explicitly considering inherent uncertainty in data with their different root causes, such as vacuity (i.e., uncertainty due to a lack of evidence) or dissonance (i.e., uncertainty due to conflicting evidence). By considering the multidimensional uncertainty, we proposed a novel uncertainty-aware evidential NN called WGAN-ENN (WENN) for solving an out-of-distribution (OOD) detection problem. We took a hybrid approach that combines Wasserstein Generative Adversarial Network (WGAN) with ENNs to jointly train a model with prior knowledge of a certain class, which has high vacuity for OOD samples. Via extensive empirical experiments based on both synthetic and real-world datasets, we demonstrated that the estimation of uncertainty by WENN can significantly help distinguish OOD samples from boundary samples. WENN outperformed in OOD detection when compared with other competitive counterparts.

下载PDF全文

下载文献需遵守相关版权规定

论文标题